Guide to the Human Genome
Name: C10orf107
Sequence: 208 aa (available as FASTA or formatted)
NCBI GI: 27734885
Description:

hypothetical protein LOC219621

Not currently referenced in the text

Composition:

 Amino acid        Percentage    Count  Longest homopolymer
 A alanine             4.3          9           1
 C cysteine            0.5          1           1
 D aspartate           6.7         14           2
 E glutamate          11.5         24           2
 F phenylalanine       5.3         11           1
 G glycine             3.8          8           1
 H histidine           1.4          3           1
 I isoleucine         10.6         22           2
 K lysine              6.7         14           2
 L leucine             8.7         18           2
 M methionine          3.8          8           1
 N asparagine          2.9          6           1
 P proline             4.3          9           2
 Q glutamine           6.2         13           1
 R arginine            1.4          3           1
 S serine              7.2         15           1
 T threonine           5.8         12           1
 V valine              4.8         10           2
 W tryptophan          1.0          2           1
 Y tyrosine            2.9          6           1
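
The three columns of the composition table can be recomputed from any protein sequence. The sketch below is a minimal illustration, not the method used to build this page: the function name `composition` and the toy sequence are hypothetical, and rounding percentages to one decimal place is an assumption based on how the table is displayed.

```python
from collections import Counter
from itertools import groupby


def composition(seq):
    """Return {residue: (percentage, count, longest_homopolymer)}.

    Hypothetical helper: percentages are rounded to one decimal place,
    which is an assumption about how the table above was prepared.
    """
    seq = seq.upper()
    counts = Counter(seq)

    # groupby yields maximal runs of identical consecutive residues,
    # so the longest run per residue is its longest homopolymer.
    longest = {}
    for residue, run in groupby(seq):
        run_length = len(list(run))
        longest[residue] = max(longest.get(residue, 0), run_length)

    total = len(seq)
    return {
        residue: (round(100 * counts[residue] / total, 1),
                  counts[residue],
                  longest[residue])
        for residue in sorted(counts)
    }


# Toy sequence for illustration only; it is not the C10orf107 sequence.
table = composition("MKKLAAAG")
print(table["A"])  # (37.5, 3, 3)
```

Applied to the full 208-residue sequence, the counts returned this way should sum to 208, as the counts in the table above do.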

Comparative genomics:

Search single species RefSeq proteins at NCBI
   H. sapiens
   M. musculus
   D. rerio
   C. intestinalis
   S. purpuratus
   D. melanogaster
   C. elegans
   A. thaliana
   S. cerevisiae
   E. coli W3110
   A. pernix K1

Search summary

[Figure: comparative genomics plot]

Additional searches of RefSeq proteins at NCBI

   All
   Eukaryotes
   Bacteria
   Archaea
   Viruses
   Primates
   Mammals
   Vertebrates

Related human proteins:
Protein          Relative score         Description

Self-match            1.000   hypothetical protein LOC219621 
PCF11                 0.028   pre-mRNA cleavage complex II protein Pcf11 
C8orf74               0.028   hypothetical protein LOC203076 
TANC2                 0.018   tetratricopeptide repeat, ankyrin repeat and coiled...
C16orf93              0.016   hypothetical protein LOC90835 
VGF                   0.013   VGF nerve growth factor inducible precursor 
PRR22                 0.010   proline rich 22 isoform 1 
PRR22                 0.010   proline rich 22 isoform 2 
KIF5B                 0.010   kinesin family member 5B 
VCAN                  0.008   versican isoform 1 
TERF2IP               0.008   telomeric repeat binding factor 2, interacting prote...
KIF5C                 0.008   kinesin family member 5C 
KTN1                  0.008   kinectin 1 isoform b 
KTN1                  0.008   kinectin 1 isoform a 
KTN1                  0.008   kinectin 1 isoform c 
KTN1                  0.008   kinectin 1 isoform a 
CC2D1A                0.008   coiled-coil and C2 domain containing 1A 
GFPT2                 0.008   glutamine-fructose-6-phosphate transaminase 2 
AGAP1                 0.008   centaurin, gamma 2 isoform 2 
IRF2                  0.005   interferon regulatory factor 2 
NPAS1                 0.005   neuronal PAS domain protein 1 
ZNF207                0.005   zinc finger protein 207 isoform c 
ZNF207                0.005   zinc finger protein 207 isoform a 
ZNF207                0.005   zinc finger protein 207 isoform b 

Human BLASTP results (used to prepare the table)

Gene descriptions are from NCBI RefSeq. Search results were obtained with NCBI BLAST and RefSeq entries. When identical proteins are present, the self-match may not be listed first in BLASTP output. In such cases, the table above has been reordered to place it first.
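
The reordering described above can be sketched as a stable sort that moves the query's own entry to the front and leaves every other hit in its BLASTP order. This is only an illustration under stated assumptions: the record layout (`id`, `score`), the identifiers, and the function name `self_match_first` are hypothetical and do not come from the page or from BLAST itself.

```python
def self_match_first(hits, query_id):
    """Return hits with the query's own entry first.

    sorted() is stable, and False sorts before True, so entries whose
    id equals query_id move to the front while all other hits keep
    their original relative order.
    """
    return sorted(hits, key=lambda hit: hit["id"] != query_id)


# Hypothetical hits: an identical protein was listed ahead of the query.
hits = [
    {"id": "identical_protein", "score": 1.000},
    {"id": "query_protein", "score": 1.000},
    {"id": "other_protein", "score": 0.028},
]

reordered = self_match_first(hits, "query_protein")
print([hit["id"] for hit in reordered])
# ['query_protein', 'identical_protein', 'other_protein']
```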

See About the Figures for the scoring system used in the comparative genomics plot above. The same scoring system was used in the table of BLASTP results.


Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press