Guide to the Human Genome
Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help
Name: C6orf225 Sequence: fasta or formatted (80aa) NCBI GI: 75677559
Description:

hypothetical protein LOC619208

Not currently referenced in the text

Composition:

amino acid map
 Amino acid        Percentage    Count  Longest homopolymer
 A alanine             5.0          4           2
 C cysteine            3.8          3           1
 D aspartate           2.5          2           1
 E glutamate           6.2          5           1
 F phenylalanine       3.8          3           1
 G glycine             7.5          6           2
 H histidine           2.5          2           1
 I isoleucine          2.5          2           1
 K lysine              5.0          4           1
 L leucine             5.0          4           1
 M methionine          3.8          3           1
 N asparagine          1.2          1           1
 P proline            12.5         10           2
 Q glutamine           5.0          4           1
 R arginine            7.5          6           2
 S serine             11.2          9           3
 T threonine           8.8          7           2
 V valine              5.0          4           1
 W tryptophan          0.0          0           0
 Y tyrosine            1.2          1           1
Comparative genomics:

Search single species RefSeq proteins at NCBI
   H. sapiens
   M. musculus
   D. rerio
   C. intestinalis
   S. purpuratus
   D. melanogaster
   C. elegans
   A. thaliana
   S. cerevisiae
   E. coli W3110
   A. pernix K1

Search summary

comparative genomics plot

   Figure data

Additional searches of
RefSeq proteins at NCBI

   All
   Eukaryotes
   Bacteria
   Archaea
   Viruses
   Primates
   Mammals
   Vertebrates

Related human proteins:
Protein          Relative score         Description

Self-match            1.000   hypothetical protein LOC619208 
LOC100128071          0.229   PREDICTED: hypothetical protein LOC100128071 
LOC100128071          0.229   PREDICTED: hypothetical protein LOC100128071 
LOC100128071          0.229   PREDICTED: hypothetical protein LOC100128071 
TCF20                 0.042   transcription factor 20 isoform 2 
TCF20                 0.042   transcription factor 20 isoform 1 
LOC100129906          0.035   PREDICTED: hypothetical protein 
LOC643376             0.021   hypothetical protein LOC643376 
LOC100129906          0.014   PREDICTED: hypothetical protein 
LOC100129906          0.014   PREDICTED: hypothetical protein 
CMYA5                 0.014   cardiomyopathy associated 5 
SSFA2                 0.014   sperm specific antigen 2 isoform 1 
SSFA2                 0.014   sperm specific antigen 2 isoform 2 
MUC12                 0.014   PREDICTED: mucin 12 
MUC12                 0.014   PREDICTED: mucin 12, cell surface associated 
ZBTB20                0.014   zinc finger and BTB domain containing 20 
PRG4                  0.014   proteoglycan 4 isoform D 
PRG4                  0.014   proteoglycan 4 isoform C 
PRG4                  0.014   proteoglycan 4 isoform B 
PRG4                  0.014   proteoglycan 4 isoform A 
LOC100292157          0.014   PREDICTED: hypothetical protein 
LOC100290926          0.014   PREDICTED: hypothetical protein XP_002347568 
LOC100288995          0.014   PREDICTED: hypothetical protein XP_002343379 
DKK1                  0.007   dickkopf homolog 1 precursor 
IGDCC4                0.007   immunoglobulin superfamily, DCC subclass, member 4 [...
C12orf52              0.007   hypothetical protein LOC84934 
Human BLASTP results (used to prepare the table)

Gene descriptions are from NCBI RefSeq. Search results were obtained with NCBI BLAST and RefSeq entries. When identical proteins are present, the self-match may not be listed first in BLASTP output. In such cases, the table above has been reordered to place it first.

See About the Figures for the scoring system used in the figure above right. The same scoring system was used in the table of BLASTP results.


Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help

Guide to the Human Genome
Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press