Guide to the Human Genome
Name: C20orf72
Sequence: fasta or formatted (344 aa)
NCBI GI: 16506297
Description:

hypothetical protein LOC92667

Not currently referenced in the text

Composition:

 Amino acid        Percentage    Count  Longest homopolymer
 A alanine             4.4         15           2
 C cysteine            2.0          7           1
 D aspartate           4.1         14           1
 E glutamate           8.1         28           2
 F phenylalanine       3.8         13           1
 G glycine             4.4         15           1
 H histidine           2.0          7           1
 I isoleucine          3.8         13           1
 K lysine              8.1         28           3
 L leucine             9.6         33           2
 M methionine          2.0          7           1
 N asparagine          4.4         15           1
 P proline             5.5         19           1
 Q glutamine           7.3         25           2
 R arginine            4.4         15           2
 S serine              9.0         31           2
 T threonine           4.1         14           1
 V valine              7.8         27           2
 W tryptophan          1.5          5           1
 Y tyrosine            3.8         13           1
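
The three columns above (percentage, count, longest homopolymer) can be derived from the raw sequence. The following is a minimal sketch of one way to do that, not the method used to build this page. The function name `composition` and the toy input are hypothetical, and the rounding behaviour is an assumption.

```python
from collections import Counter
from itertools import groupby


def composition(sequence):
    """Map each residue to (percentage, count, longest homopolymer run)."""
    sequence = sequence.upper()
    total = len(sequence)
    if total == 0:
        return {}
    counts = Counter(sequence)
    longest = {}
    # groupby yields consecutive runs of identical letters, e.g. "KKK".
    for residue, run in groupby(sequence):
        run_length = sum(1 for _ in run)
        longest[residue] = max(longest.get(residue, 0), run_length)
    return {
        residue: (round(100.0 * counts[residue] / total, 1),
                  counts[residue],
                  longest[residue])
        for residue in sorted(counts)
    }


# Toy input for illustration only; this is not the C20orf72 sequence.
for residue, (percentage, count, run) in composition("MKKKLLAEE").items():
    print(f"{residue}  {percentage:5.1f}  {count:3d}  {run:3d}")
```

Applied to the full 344-residue sequence, the counts should sum to 344, as they do in the table above.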

Comparative genomics:

Search single species RefSeq proteins at NCBI
   H. sapiens
   M. musculus
   D. rerio
   C. intestinalis
   S. purpuratus
   D. melanogaster
   C. elegans
   A. thaliana
   S. cerevisiae
   E. coli W3110
   A. pernix K1

Search summary

[Figure: comparative genomics plot]

Additional searches of RefSeq proteins at NCBI

   All
   Eukaryotes
   Bacteria
   Archaea
   Viruses
   Primates
   Mammals
   Vertebrates

Related human proteins:
Protein          Relative score         Description

Self-match            1.000   hypothetical protein LOC92667 
GBP4                  0.021   guanylate binding protein 4 
TTN                   0.012   titin isoform N2-A 
LOC728047             0.010   PREDICTED: similar to Golgin subfamily A member 8-l...
CLUAP1                0.009   clusterin associated protein 1 isoform 2 
CLUAP1                0.009   clusterin associated protein 1 isoform 1 
LOC728080             0.009   PREDICTED: similar to Golgin subfamily A member 8-l...
LOC727909             0.009   PREDICTED: similar to Golgin subfamily A member 8-l...
LOC643699             0.007   PREDICTED: similar to golgi autoantigen, golgin sub...
FAM78B                0.007   hypothetical protein LOC149297 
C12orf30              0.007   mitochondrial distribution and morphology 20 
CDKL5                 0.006   cyclin-dependent kinase-like 5 
CDKL5                 0.006   cyclin-dependent kinase-like 5 
HES6                  0.006   hairy and enhancer of split 6 isoform b 
GPR107                0.006   G protein-coupled receptor 107 isoform 2 
GPR107                0.006   G protein-coupled receptor 107 isoform 1 
GPR107                0.006   G protein-coupled receptor 107 isoform 3 
SPTBN1                0.004   spectrin, beta, non-erythrocytic 1 isoform 2 
SPTBN1                0.004   spectrin, beta, non-erythrocytic 1 isoform 1 
MXD3                  0.004   MAX dimerization protein 3 isoform b 
MXD3                  0.004   MAX dimerization protein 3 isoform a 
Human BLASTP results (used to prepare the table)

Gene descriptions are from NCBI RefSeq. Search results were obtained with NCBI BLAST against RefSeq entries. When identical proteins are present, the self-match may not be listed first in the BLASTP output. In such cases, the table above has been reordered to place the self-match first.
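
The reordering step described above can be sketched as follows. This is an illustration under stated assumptions, not the code used for this page: the function name `place_self_match_first`, the dictionary keys, and the placeholder identifiers are all hypothetical, and the hits are assumed to arrive already sorted by score.

```python
def place_self_match_first(hits, query_id):
    """Return a new list with the query's own hit moved to the front.

    The relative order of all other hits is preserved. If the query is
    not among the hits, the list is returned unchanged (as a copy).
    """
    for index, hit in enumerate(hits):
        if hit["id"] == query_id:
            return [hit] + hits[:index] + hits[index + 1:]
    return list(hits)


# Placeholder identifiers, not real accessions: "copy" stands for an
# identical protein that BLASTP happened to list ahead of the query.
hits = [
    {"id": "copy", "score": 1.000},
    {"id": "query", "score": 1.000},
    {"id": "other", "score": 0.021},
]
reordered = place_self_match_first(hits, "query")
print([hit["id"] for hit in reordered])
```

Returning a new list rather than sorting in place leaves the original BLASTP order available for comparison.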

See About the Figures for the scoring system used in the comparative genomics plot above. The same scoring system was used in the table of BLASTP results.



Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press