Guide to the Human Genome
Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help
Name: C9orf21 Sequence: fasta or formatted (226aa) NCBI GI: 63054819
Description:

hypothetical protein LOC195827

Not currently referenced in the text

Composition:

amino acid map
 Amino acid        Percentage    Count  Longest homopolymer
 A alanine             8.4         19           3
 C cysteine            1.3          3           1
 D aspartate           3.5          8           1
 E glutamate           4.9         11           2
 F phenylalanine       4.4         10           1
 G glycine             8.0         18           2
 H histidine           4.4         10           2
 I isoleucine          6.2         14           1
 K lysine              3.1          7           1
 L leucine             9.3         21           2
 M methionine          0.9          2           1
 N asparagine          3.5          8           2
 P proline             7.5         17           1
 Q glutamine           5.3         12           2
 R arginine            7.1         16           2
 S serine              7.1         16           2
 T threonine           2.7          6           1
 V valine              9.3         21           3
 W tryptophan          0.4          1           1
 Y tyrosine            2.7          6           1
Comparative genomics:

Search single species RefSeq proteins at NCBI
   H. sapiens
   M. musculus
   D. rerio
   C. intestinalis
   S. purpuratus
   D. melanogaster
   C. elegans
   A. thaliana
   S. cerevisiae
   E. coli W3110
   A. pernix K1

Search summary

comparative genomics plot

   Figure data

Additional searches of
RefSeq proteins at NCBI

   All
   Eukaryotes
   Bacteria
   Archaea
   Viruses
   Primates
   Mammals
   Vertebrates

Related human proteins:
Protein          Relative score         Description

Self-match            1.000   hypothetical protein LOC195827 
C1orf93               0.088   hypothetical protein LOC127281 
C10orf58              0.025   hypothetical protein LOC84293 
E4F1                  0.019   p120E4F 
LOC100292541          0.014   PREDICTED: hypothetical protein 
MAP3K6                0.012   mitogen-activated protein kinase kinase kinase 6 [Ho...
FLJ37078              0.012   hypothetical protein LOC222183 
POGZ                  0.009   pogo transposable element with ZNF domain isoform 1 ...
POGZ                  0.009   pogo transposable element with ZNF domain isoform 2 ...
CTNND2                0.009   catenin (cadherin-associated protein), delta 2 (neur...
MYO16                 0.009   myosin heavy chain Myr 8 
C15orf39              0.007   hypothetical protein LOC56905 
LOC100291544          0.007   PREDICTED: hypothetical protein XP_002347761 
TRIM16                0.007   tripartite motif-containing 16 
ADAM19                0.007   ADAM metallopeptidase domain 19 preproprotein 
SLC38A10              0.007   solute carrier family 38, member 10 isoform b 
TAF4                  0.007   TBP-associated factor 4 
RBM47                 0.007   RNA binding motif protein 47 isoform a 
GLTSCR1               0.007   glioma tumor suppressor candidate region gene 1 [Ho...
Human BLASTP results (used to prepare the table)

Gene descriptions are from NCBI RefSeq. Search results were obtained with NCBI BLAST and RefSeq entries. When identical proteins are present, the self-match may not be listed first in BLASTP output. In such cases, the table above has been reordered to place it first.

See About the Figures for the scoring system used in the figure above right. The same scoring system was used in the table of BLASTP results.


Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help

Guide to the Human Genome
Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press