Guide to the Human Genome
Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help
Name: LOC100289939 Sequence: fasta or formatted (140aa) NCBI GI: 239751371
Description:

PREDICTED: hypothetical protein XP_002347821

Not currently referenced in the text

Composition:

amino acid map
 Amino acid        Percentage    Count  Longest homopolymer
 A alanine             6.4          9           2
 C cysteine            3.6          5           1
 D aspartate           3.6          5           1
 E glutamate           1.4          2           1
 F phenylalanine       2.1          3           1
 G glycine             5.7          8           1
 H histidine           2.1          3           1
 I isoleucine          1.4          2           1
 K lysine              0.7          1           1
 L leucine             7.9         11           2
 M methionine          0.7          1           1
 N asparagine          0.0          0           0
 P proline            13.6         19           2
 Q glutamine           2.9          4           1
 R arginine           21.4         30           4
 S serine             13.6         19           2
 T threonine          10.7         15           1
 V valine              0.7          1           1
 W tryptophan          1.4          2           1
 Y tyrosine            0.0          0           0
Comparative genomics:

Search single species RefSeq proteins at NCBI
   H. sapiens
   M. musculus
   D. rerio
   C. intestinalis
   S. purpuratus
   D. melanogaster
   C. elegans
   A. thaliana
   S. cerevisiae
   E. coli W3110
   A. pernix K1

Search summary

comparative genomics plot

   Figure data

Additional searches of
RefSeq proteins at NCBI

   All
   Eukaryotes
   Bacteria
   Archaea
   Viruses
   Primates
   Mammals
   Vertebrates

Related human proteins:
Protein          Relative score         Description

Self-match            1.000   PREDICTED: hypothetical protein XP_002347821 
LOC100292592          1.000   PREDICTED: hypothetical protein 
LOC100288866          1.000   PREDICTED: hypothetical protein XP_002343571 
SRRM2                 0.097   splicing coactivator subunit SRm300 
SRRM1                 0.071   serine/arginine repetitive matrix 1 
LOC284297             0.059   hypothetical protein LOC284297 
PRPF4B                0.052   serine/threonine-protein kinase PRP4K 
LOC100130360          0.048   PREDICTED: hypothetical protein 
C2orf16               0.048   hypothetical protein LOC84226 
SON                   0.045   SON DNA-binding protein isoform F 
SON                   0.045   SON DNA-binding protein isoform B 
LOC100291176          0.045   PREDICTED: hypothetical protein XP_002346954 
LOC100287322          0.045   PREDICTED: hypothetical protein XP_002342818 
CHERP                 0.041   calcium homeostasis endoplasmic reticulum protein [...
SFRS2                 0.041   splicing factor, arginine/serine-rich 2 
LOC100293375          0.037   PREDICTED: hypothetical protein 
SCAF1                 0.037   SR-related CTD-associated factor 1 
ARL6IP4               0.037   SRp25 nuclear protein isoform 1 
LOC729417             0.037   PREDICTED: hypothetical protein 
LOC729417             0.037   PREDICTED: hypothetical protein 
LOC729417             0.037   PREDICTED: hypothetical protein 
BRSK1                 0.037   BR serine/threonine kinase 1 
RBMXL2                0.037   testes-specific heterogenous nuclear ribonucleoprot...
SHANK1                0.037   SH3 and multiple ankyrin repeat domains 1 
LOC100128556          0.033   PREDICTED: hypothetical protein 
LOC100293495          0.033   PREDICTED: hypothetical protein 
FUSIP1                0.033   FUS interacting protein (serine-arginine rich) 1 iso...
ARHGEF5               0.033   rho guanine nucleotide exchange factor 5 
YLPM1                 0.033   YLP motif containing 1 
LOC283999             0.030   hypothetical protein LOC283999 
Human BLASTP results (used to prepare the table)

Gene descriptions are from NCBI RefSeq. Search results were obtained with NCBI BLAST and RefSeq entries. When identical proteins are present, the self-match may not be listed first in BLASTP output. In such cases, the table above has been reordered to place it first.

See About the Figures for the scoring system used in the figure above right. The same scoring system was used in the table of BLASTP results.


Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help

Guide to the Human Genome
Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press