Guide to the Human Genome
Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help
Name: C20orf94 Sequence: fasta or formatted (408aa) NCBI GI: 61102723
Description:

hypothetical protein LOC128710

Not currently referenced in the text

Composition:

amino acid map
 Amino acid        Percentage    Count  Longest homopolymer
 A alanine             6.4         26           2
 C cysteine            2.5         10           1
 D aspartate           4.2         17           1
 E glutamate           7.6         31           2
 F phenylalanine       2.7         11           1
 G glycine             4.4         18           1
 H histidine           2.5         10           2
 I isoleucine          2.9         12           2
 K lysine              8.3         34           2
 L leucine             9.6         39           3
 M methionine          0.7          3           1
 N asparagine          3.4         14           2
 P proline             5.1         21           2
 Q glutamine           5.9         24           2
 R arginine            7.8         32           3
 S serine             12.3         50           3
 T threonine           5.6         23           2
 V valine              6.4         26           2
 W tryptophan          0.5          2           1
 Y tyrosine            1.2          5           1
Comparative genomics:

Search single species RefSeq proteins at NCBI
   H. sapiens
   M. musculus
   D. rerio
   C. intestinalis
   S. purpuratus
   D. melanogaster
   C. elegans
   A. thaliana
   S. cerevisiae
   E. coli W3110
   A. pernix K1

Search summary

comparative genomics plot

   Figure data

Additional searches of
RefSeq proteins at NCBI

   All
   Eukaryotes
   Bacteria
   Archaea
   Viruses
   Primates
   Mammals
   Vertebrates

Related human proteins:
Protein          Relative score         Description

Self-match            1.000   hypothetical protein LOC128710 
MTDH                  0.015   metadherin 
ANK2                  0.015   ankyrin 2 isoform 1 
GAS2L3                0.014   growth arrest-specific 2 like 3 
RSF1                  0.014   remodeling and spacing factor 1 
EHMT1                 0.014   euchromatic histone-lysine N-methyltransferase 1 is...
EHMT1                 0.014   euchromatic histone-lysine N-methyltransferase 1 is...
LYST                  0.013   lysosomal trafficking regulator 
SMC4                  0.013   SMC4 structural maintenance of chromosomes 4-like 1 ...
SMC4                  0.013   SMC4 structural maintenance of chromosomes 4-like 1 ...
NFAT5                 0.013   nuclear factor of activated T-cells 5 isoform a [Hom...
NFAT5                 0.013   nuclear factor of activated T-cells 5 isoform c [Homo...
NFAT5                 0.013   nuclear factor of activated T-cells 5 isoform d [Ho...
NFAT5                 0.013   nuclear factor of activated T-cells 5 isoform b [Ho...
NFAT5                 0.013   nuclear factor of activated T-cells 5 isoform a [Hom...
MLL                   0.012   myeloid/lymphoid or mixed-lineage leukemia protein [...
PDZD2                 0.012   PDZ domain containing 2 
PRG4                  0.010   proteoglycan 4 isoform B 
PRG4                  0.010   proteoglycan 4 isoform A 
PPRC1                 0.010   peroxisome proliferator-activated receptor gamma, co...
MUC6                  0.010   mucin 6, gastric 
HNF1B                 0.010   transcription factor 2 
SRrp35                0.010   serine-arginine repressor protein 
RAB11FIP1             0.010   RAB11 family interacting protein 1 isoform 3 
MKI67                 0.010   antigen identified by monoclonal antibody Ki-67 iso...
MKI67                 0.010   antigen identified by monoclonal antibody Ki-67 iso...
RTF1                  0.010   Paf1/RNA polymerase II complex component 
ATXN2                 0.009   ataxin 2 
EVC                   0.009   Ellis van Creveld syndrome protein 
LOC100290023          0.009   PREDICTED: hypothetical protein XP_002348147 
Human BLASTP results (used to prepare the table)

Gene descriptions are from NCBI RefSeq. Search results were obtained with NCBI BLAST and RefSeq entries. When identical proteins are present, the self-match may not be listed first in BLASTP output. In such cases, the table above has been reordered to place it first.

See About the Figures for the scoring system used in the figure above right. The same scoring system was used in the table of BLASTP results.


Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help

Guide to the Human Genome
Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press