Guide to the Human Genome
Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help
Name: C3orf71
Sequence: fasta or formatted (290 aa)
NCBI GI: 176866318
Description:

hypothetical protein LOC646450

Not currently referenced in the text

Composition:

amino acid map
 Amino acid        Percentage    Count  Longest homopolymer
 A alanine            12.8         37           3
 C cysteine            1.4          4           1
 D aspartate           2.8          8           1
 E glutamate           2.8          8           1
 F phenylalanine       2.8          8           1
 G glycine            11.0         32           2
 H histidine           1.4          4           1
 I isoleucine          3.1          9           1
 K lysine              2.1          6           1
 L leucine             9.0         26           2
 M methionine          1.0          3           1
 N asparagine          1.0          3           1
 P proline             9.0         26           2
 Q glutamine           3.8         11           1
 R arginine           12.8         37           4
 S serine              8.3         24           2
 T threonine           6.2         18           2
 V valine              7.9         23           2
 W tryptophan          0.0          0           0
 Y tyrosine            1.0          3           1
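
The three columns above (percentage, count, longest homopolymer) can be derived directly from a protein sequence. The following is a minimal sketch, not the method used to build this page: the function name `composition` and the toy sequence are hypothetical, and the real C3orf71 sequence is not reproduced here.

```python
from collections import Counter
from itertools import groupby


def composition(seq):
    """Return {residue: (percentage, count, longest_run)} for a protein sequence.

    percentage is rounded to one decimal place, as in the table above.
    longest_run is the length of the longest uninterrupted stretch of that residue.
    """
    seq = seq.upper()
    counts = Counter(seq)

    # groupby yields one group per run of identical consecutive residues.
    longest = {}
    for residue, run in groupby(seq):
        length = sum(1 for _ in run)
        longest[residue] = max(longest.get(residue, 0), length)

    total = len(seq)
    return {
        residue: (round(100.0 * counts[residue] / total, 1), counts[residue], longest[residue])
        for residue in counts
    }


# Toy input (hypothetical, not the C3orf71 sequence).
table = composition("MAAARRRRGG")
for residue in sorted(table):
    percentage, count, run = table[residue]
    print(f"{residue}  {percentage:5.1f}  {count:3d}  {run:3d}")
```

Residues that never occur, such as tryptophan in this protein, are simply absent from the returned dictionary; a caller that wants a full 20-row table would fill those rows with zeros.
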
Comparative genomics:

Search single species RefSeq proteins at NCBI
   H. sapiens
   M. musculus
   D. rerio
   C. intestinalis
   S. purpuratus
   D. melanogaster
   C. elegans
   A. thaliana
   S. cerevisiae
   E. coli W3110
   A. pernix K1

Search summary

comparative genomics plot

   Figure data

Additional searches of
RefSeq proteins at NCBI

   All
   Eukaryotes
   Bacteria
   Archaea
   Viruses
   Primates
   Mammals
   Vertebrates

Related human proteins:
Protein          Relative score         Description

Self-match            1.000   hypothetical protein LOC646450 
ANKRD56               0.031   ankyrin repeat domain 56 
FLJ37078              0.026   hypothetical protein LOC222183 
PRR12                 0.026   proline rich 12 
LOC339742             0.022   PREDICTED: hypothetical protein 
LOC100290023          0.022   PREDICTED: hypothetical protein XP_002348147 
COL4A3                0.022   alpha 3 type IV collagen isoform 4 precursor 
LOC100293055          0.020   PREDICTED: similar to COL22A1 protein 
COL4A3                0.020   alpha 3 type IV collagen isoform 2 precursor 
COL4A3                0.020   alpha 3 type IV collagen isoform 1 precursor 
COL9A3                0.020   alpha 3 type IX collagen 
COL22A1               0.020   collagen, type XXII, alpha 1 
BSN                   0.020   bassoon protein 
SH3D19                0.018   SH3 domain containing 19 isoform b 
SH3D19                0.018   SH3 domain containing 19 isoform a 
LOC100287250          0.018   PREDICTED: hypothetical protein XP_002344493 
COL9A1                0.018   alpha 1 type IX collagen isoform 1 precursor 
COL9A1                0.018   alpha 1 type IX collagen isoform 2 precursor 
COL1A1                0.017   alpha 1 type I collagen preproprotein 
GNAS                  0.017   GNAS complex locus XLas 
LOC100292122          0.017   PREDICTED: hypothetical protein XP_002345138 
LOC100286986          0.017   PREDICTED: hypothetical protein XP_002344191 
SIN3A                 0.017   transcriptional co-repressor Sin3A 
SIN3A                 0.017   transcriptional co-repressor Sin3A 
SIN3A                 0.017   transcriptional co-repressor Sin3A 
NFKBIE                0.017   nuclear factor of kappa light polypeptide gene enhan...
LOC100289721          0.017   PREDICTED: hypothetical protein XP_002347280 
GLT8D4                0.017   glycosyltransferase 8 domain containing 4 
COL5A2                0.015   alpha 2 type V collagen preproprotein 
AHCYL2                0.015   S-adenosylhomocysteine hydrolase-like 2 isoform a [H...
Human BLASTP results (used to prepare the table)

Gene descriptions are from NCBI RefSeq. Search results were obtained with NCBI BLAST and RefSeq entries. When identical proteins are present, the self-match may not be listed first in BLASTP output. In such cases, the table above has been reordered to place it first.
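
The reordering described above can be sketched as follows. This is an illustration under stated assumptions, not the code used for this page: hits are assumed to be `(name, relative_score)` tuples in BLASTP output order, the helper name `self_match_first` is hypothetical, and `IDENTICAL_COPY` is a made-up entry standing in for a protein identical to the query.

```python
def self_match_first(hits, query_name):
    """Return a new list with the query's own hit moved to the front.

    hits is a list of (name, relative_score) tuples in BLASTP output order.
    The relative order of all other hits is preserved. If the query is not
    found, an unchanged copy of the list is returned.
    """
    for index, hit in enumerate(hits):
        if hit[0] == query_name:
            return [hit] + hits[:index] + hits[index + 1:]
    return list(hits)


# Hypothetical output in which an identical protein is listed before the query.
hits = [
    ("IDENTICAL_COPY", 1.000),  # made-up name, for illustration only
    ("C3orf71", 1.000),
    ("ANKRD56", 0.031),
]
reordered = self_match_first(hits, "C3orf71")
print(reordered)
```

Matching on the name is the simplest choice for a sketch; a real pipeline would more likely match on a unique identifier such as the NCBI GI, since several table rows can share a gene name.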

See About the Figures for the scoring system used in the comparative genomics plot above. The same scoring system was used in the table of BLASTP results.


Guide to the Human Genome
Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press