Guide to the Human Genome
Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help
Name: C20orf201 Sequence: fasta or formatted (240aa) NCBI GI: 55741630
Description:

hypothetical protein LOC198437

Not currently referenced in the text

Composition:

amino acid map
 Amino acid        Percentage    Count  Longest homopolymer
 A alanine            14.6         35           3
 C cysteine            0.4          1           1
 D aspartate           2.5          6           1
 E glutamate           7.5         18           2
 F phenylalanine       1.2          3           1
 G glycine            10.4         25           2
 H histidine           2.9          7           2
 I isoleucine          2.1          5           1
 K lysine              3.8          9           1
 L leucine            10.4         25           2
 M methionine          1.7          4           1
 N asparagine          0.4          1           1
 P proline            12.5         30           3
 Q glutamine           4.6         11           1
 R arginine           11.7         28           2
 S serine              4.6         11           1
 T threonine           3.8          9           1
 V valine              2.9          7           1
 W tryptophan          1.2          3           1
 Y tyrosine            0.8          2           1
Comparative genomics:

Search single species RefSeq proteins at NCBI
   H. sapiens
   M. musculus
   D. rerio
   C. intestinalis
   S. purpuratus
   D. melanogaster
   C. elegans
   A. thaliana
   S. cerevisiae
   E. coli W3110
   A. pernix K1

Search summary

comparative genomics plot

   Figure data

Additional searches of
RefSeq proteins at NCBI

   All
   Eukaryotes
   Bacteria
   Archaea
   Viruses
   Primates
   Mammals
   Vertebrates

Related human proteins:
Protein          Relative score         Description

Self-match            1.000   hypothetical protein LOC198437 
MECP2                 0.026   methyl CpG binding protein 2 isoform 2 
MECP2                 0.026   methyl CpG binding protein 2 isoform 1 
GDF6                  0.024   growth differentiation factor 6 precursor 
TBX1                  0.024   T-box 1 isoform C 
BASP1                 0.019   brain abundant, membrane attached signal protein 1 [...
GNAS                  0.019   GNAS complex locus alex 
PRB1                  0.019   proline-rich protein BstNI subfamily 1 isoform 2 pre...
PRB4                  0.019   proline-rich protein BstNI subfamily 4 precursor [Ho...
LOC100289467          0.019   PREDICTED: hypothetical protein 
LOC100289467          0.019   PREDICTED: hypothetical protein XP_002342401 
SF3B4                 0.019   splicing factor 3b, subunit 4 
CACNA1H               0.017   calcium channel, voltage-dependent, T type, alpha 1H...
CACNA1H               0.017   calcium channel, voltage-dependent, T type, alpha 1H...
DMRT3                 0.017   doublesex and mab-3 related transcription factor 3 [...
PRB2                  0.017   proline-rich protein BstNI subfamily 2 
PRB1                  0.017   proline-rich protein BstNI subfamily 1 isoform 1 pre...
COL1A1                0.017   alpha 1 type I collagen preproprotein 
COL3A1                0.017   collagen type III alpha 1 preproprotein 
LOC100293088          0.017   PREDICTED: hypothetical protein 
LOC644246             0.017   PREDICTED: hypothetical protein LOC644246 
MAGED4B               0.017   melanoma antigen family D, 4B isoform 2 
MAGED4B               0.017   melanoma antigen family D, 4B isoform 1 
MAGED4B               0.017   melanoma antigen family D, 4B isoform 1 
ZSWIM5                0.017   zinc finger, SWIM domain containing 5 
MAGED4                0.017   melanoma antigen family D, 4 
LOC100129571          0.017   PREDICTED: similar to hCG1646049 
LOC100129571          0.017   PREDICTED: similar to hCG1646049 
LOC100129571          0.017   PREDICTED: similar to hCG1646049 
LOC643355             0.015   PREDICTED: hypothetical protein 
Human BLASTP results (used to prepare the table)

Gene descriptions are from NCBI RefSeq. Search results were obtained with NCBI BLAST and RefSeq entries. When identical proteins are present, the self-match may not be listed first in BLASTP output. In such cases, the table above has been reordered to place it first.

See About the Figures for the scoring system used in the figure above right. The same scoring system was used in the table of BLASTP results.


Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help

Guide to the Human Genome
Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press