Guide to the Human Genome
Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help
Name: C5orf47 Sequence: fasta or formatted (176aa) NCBI GI: 222352135
Description:

hypothetical protein LOC133491

Not currently referenced in the text

Composition:

amino acid map
 Amino acid        Percentage    Count  Longest homopolymer
 A alanine            10.8         19           3
 C cysteine            1.7          3           1
 D aspartate           1.7          3           1
 E glutamate           5.7         10           2
 F phenylalanine       1.7          3           1
 G glycine            10.8         19           2
 H histidine           1.1          2           1
 I isoleucine          2.3          4           1
 K lysine              7.4         13           5
 L leucine             7.4         13           1
 M methionine          2.3          4           1
 N asparagine          2.3          4           1
 P proline             4.0          7           1
 Q glutamine           7.4         13           1
 R arginine           10.2         18           1
 S serine              9.7         17           2
 T threonine           2.8          5           2
 V valine              6.8         12           1
 W tryptophan          1.1          2           1
 Y tyrosine            2.8          5           1
Comparative genomics:

Search single species RefSeq proteins at NCBI
   H. sapiens
   M. musculus
   D. rerio
   C. intestinalis
   S. purpuratus
   D. melanogaster
   C. elegans
   A. thaliana
   S. cerevisiae
   E. coli W3110
   A. pernix K1

Search summary

comparative genomics plot

   Figure data

Additional searches of
RefSeq proteins at NCBI

   All
   Eukaryotes
   Bacteria
   Archaea
   Viruses
   Primates
   Mammals
   Vertebrates

Related human proteins:
Protein          Relative score         Description

Self-match            1.000   hypothetical protein LOC133491 
E2F3                  0.037   E2F transcription factor 3 
AMMECR1               0.037   AMMECR1 protein isoform 2 
AMMECR1               0.037   AMMECR1 protein isoform 1 
GATA4                 0.028   GATA binding protein 4 
LOC100292370          0.025   PREDICTED: hypothetical protein 
NECAP1                0.022   NECAP endocytosis associated 1 
CLIC6                 0.022   chloride intracellular channel 6 
LRRC68                0.018   PREDICTED: leucine rich repeat containing 68 
LRRC68                0.018   PREDICTED: leucine rich repeat containing 68 
LOC730456             0.018   PREDICTED: hypothetical protein 
LRRC68                0.018   PREDICTED: leucine rich repeat containing 68 
TSPYL4                0.018   TSPY-like 4 
IRS4                  0.018   insulin receptor substrate 4 
CLEC11A               0.015   stem cell growth factor precursor 
RPAP2                 0.015   RNA polymerase II associated protein 2 
LOC728650             0.015   PREDICTED: hypothetical protein 
LOC100288668          0.012   PREDICTED: hypothetical protein XP_002343144 
LOC100291673          0.012   PREDICTED: hypothetical protein XP_002344645 
LOC100289881          0.012   PREDICTED: hypothetical protein XP_002347282 
BSN                   0.012   bassoon protein 
KIAA1462              0.012   hypothetical protein LOC57608 
LOC728650             0.009   PREDICTED: hypothetical protein 
LOC100294236          0.009   PREDICTED: similar to diffuse panbronchiolitis crit...
TAL1                  0.009   T-cell acute lymphocytic leukemia 1 
HMX1                  0.009   homeo box (H6 family) 1 
LOC100292655          0.009   PREDICTED: similar to armadillo repeat containing, ...
GIN1                  0.009   zinc finger, H2C2 domain containing 
LOC100290504          0.009   PREDICTED: hypothetical protein XP_002346416 
LOC100287141          0.009   PREDICTED: hypothetical protein XP_002343631 
Human BLASTP results (used to prepare the table)

Gene descriptions are from NCBI RefSeq. Search results were obtained with NCBI BLAST and RefSeq entries. When identical proteins are present, the self-match may not be listed first in BLASTP output. In such cases, the table above has been reordered to place it first.

See About the Figures for the scoring system used in the figure above right. The same scoring system was used in the table of BLASTP results.


Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help

Guide to the Human Genome
Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press