Guide to the Human Genome
Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help
Name: C20orf166 Sequence: fasta or formatted (117aa) NCBI GI: 30425386
Description:

hypothetical protein LOC128826

Not currently referenced in the text

Composition:

amino acid map
 Amino acid        Percentage    Count  Longest homopolymer
 A alanine            12.0         14           2
 C cysteine            6.0          7           1
 D aspartate           0.9          1           1
 E glutamate           7.7          9           1
 F phenylalanine       2.6          3           1
 G glycine             8.5         10           2
 H histidine           3.4          4           1
 I isoleucine          1.7          2           1
 K lysine              3.4          4           1
 L leucine             7.7          9           2
 M methionine          2.6          3           1
 N asparagine          1.7          2           1
 P proline             8.5         10           1
 Q glutamine           6.8          8           2
 R arginine            4.3          5           1
 S serine             10.3         12           1
 T threonine           4.3          5           1
 V valine              5.1          6           1
 W tryptophan          1.7          2           1
 Y tyrosine            0.9          1           1
Comparative genomics:

Search single species RefSeq proteins at NCBI
   H. sapiens
   M. musculus
   D. rerio
   C. intestinalis
   S. purpuratus
   D. melanogaster
   C. elegans
   A. thaliana
   S. cerevisiae
   E. coli W3110
   A. pernix K1

Search summary

comparative genomics plot

   Figure data

Additional searches of
RefSeq proteins at NCBI

   All
   Eukaryotes
   Bacteria
   Archaea
   Viruses
   Primates
   Mammals
   Vertebrates

Related human proteins:
Protein          Relative score         Description

Self-match            1.000   hypothetical protein LOC128826 
CDK5R1                0.040   cyclin-dependent kinase 5, regulatory subunit 1 [Homo...
RUNDC1                0.031   RUN domain containing 1 
XIRP1                 0.022   xin actin-binding repeat containing 1 
SETD1A                0.018   SET domain containing 1A 
LOC100287812          0.013   PREDICTED: hypothetical protein XP_002343253 
EMILIN1               0.013   elastin microfibril interfacer 1 
ATXN2                 0.013   ataxin 2 
BSN                   0.013   bassoon protein 
MTF1                  0.013   metal-regulatory transcription factor 1 
ERC2                  0.013   cytomatrix protein p110 
ZNF408                0.013   zinc finger protein 408 
IGFBP6                0.009   insulin-like growth factor binding protein 6 
FAM53B                0.009   hypothetical protein LOC9679 
C17orf85              0.009   ELG protein isoform a 
MAP7D1                0.009   MAP7 domain containing 1 
LOC729175             0.009   PREDICTED: hypothetical protein 
LOC728965             0.009   PREDICTED: hypothetical protein 
LOC729175             0.009   PREDICTED: hypothetical protein 
LOC729175             0.009   PREDICTED: hypothetical protein 
LOC100294344          0.009   PREDICTED: similar to mitogen-activated protein kin...
KIAA2018              0.004   hypothetical protein LOC205717 
MLL                   0.004   myeloid/lymphoid or mixed-lineage leukemia protein [...
MARCKS                0.004   myristoylated alanine-rich protein kinase C substra...
YAP1                  0.004   Yes-associated protein 1, 65kDa isoform 1 
CLMN                  0.004   calmin 
LOC100293205          0.004   PREDICTED: hypothetical protein 
LOC100289336          0.004   PREDICTED: hypothetical protein 
LOC100289336          0.004   PREDICTED: hypothetical protein XP_002342623 
CENPJ                 0.004   centromere protein J 
Human BLASTP results (used to prepare the table)

Gene descriptions are from NCBI RefSeq. Search results were obtained with NCBI BLAST and RefSeq entries. When identical proteins are present, the self-match may not be listed first in BLASTP output. In such cases, the table above has been reordered to place it first.

See About the Figures for the scoring system used in the figure above right. The same scoring system was used in the table of BLASTP results.


Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help

Guide to the Human Genome
Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press