Guide to the Human Genome
Name: LOC100170229
Sequence length: 715 aa
NCBI GI: 224548936
Description:

hypothetical protein LOC100170229

Not currently referenced in the text

Composition:

 Amino acid        Percentage    Count  Longest homopolymer
 A alanine             3.9         28           1
 C cysteine            0.6          4           1
 D aspartate           3.4         24           2
 E glutamate           3.6         26           2
 F phenylalanine       0.1          1           1
 G glycine             4.3         31           2
 H histidine           3.2         23           1
 I isoleucine          0.8          6           2
 K lysine              7.8         56           2
 L leucine             1.5         11           1
 M methionine          1.1          8           1
 N asparagine          2.8         20           1
 P proline             8.5         61           2
 Q glutamine           4.3         31           2
 R arginine           18.6        133           3
 S serine             26.4        189           6
 T threonine           5.6         40           2
 V valine              1.5         11           1
 W tryptophan          0.6          4           1
 Y tyrosine            1.1          8           1
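The columns in the composition table can be recomputed from the protein sequence itself. The following is a minimal sketch, assuming the sequence is available as a plain one-letter string; the function name `composition` is hypothetical and is not part of the site. Percentages are counts divided by the sequence length (for example, 189 serines out of 715 residues gives 26.4), and the longest homopolymer is the longest run of one residue repeated consecutively.

```python
from collections import Counter
from itertools import groupby


def composition(seq):
    """Return {residue: (percentage, count, longest_homopolymer)}.

    `seq` is a protein sequence in one-letter code (hypothetical input;
    the page itself only links to the sequence).
    """
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")

    counts = Counter(seq)

    # groupby yields maximal runs of identical consecutive residues.
    longest = {}
    for residue, run in groupby(seq):
        run_len = sum(1 for _ in run)
        longest[residue] = max(longest.get(residue, 0), run_len)

    n = len(seq)
    return {
        r: (round(100.0 * counts[r] / n, 1), counts[r], longest[r])
        for r in sorted(counts)
    }


if __name__ == "__main__":
    # Toy sequence, not the real LOC100170229 sequence.
    for residue, row in composition("SSSRRSAK").items():
        print(residue, *row)
```

Run on the full 715-residue sequence, a function like this should give the same counts as the table above, provided the table's percentages are rounded to one decimal place as assumed here.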

Comparative genomics:

Search single species RefSeq proteins at NCBI
   H. sapiens
   M. musculus
   D. rerio
   C. intestinalis
   S. purpuratus
   D. melanogaster
   C. elegans
   A. thaliana
   S. cerevisiae
   E. coli W3110
   A. pernix K1

Search summary

[Figure: comparative genomics plot]

Additional searches of RefSeq proteins at NCBI

   All
   Eukaryotes
   Bacteria
   Archaea
   Viruses
   Primates
   Mammals
   Vertebrates

Related human proteins:
Protein          Relative score         Description

Self-match            1.000   hypothetical protein LOC100170229 
SRRM2                 0.140   splicing coactivator subunit SRm300 
SFRS4                 0.108   splicing factor, arginine/serine-rich 4 
PPIG                  0.100   peptidylprolyl isomerase G 
C2orf16               0.096   hypothetical protein LOC84226 
SRRM1                 0.095   serine/arginine repetitive matrix 1 
TRIOBP                0.094   TRIO and F-actin binding protein isoform 6 
ZC3H13                0.094   zinc finger CCCH-type containing 13 
SFRS12                0.091   splicing factor, arginine/serine-rich 12 isoform a ...
SFRS12                0.091   splicing factor, arginine/serine-rich 12 isoform b [...
LOC100286959          0.085   PREDICTED: hypothetical protein XP_002343921 
PRPF38B               0.084   PRP38 pre-mRNA processing factor 38 (yeast) domain ...
DSPP                  0.081   dentin sialophosphoprotein preproprotein 
KIAA1853              0.072   KIAA1853 protein 
SFRS18                0.071   splicing factor, arginine/serine-rich 130 
SFRS18                0.071   splicing factor, arginine/serine-rich 130 
FLJ37078              0.069   hypothetical protein LOC222183 
NEFH                  0.067   neurofilament, heavy polypeptide 200kDa 
NKTR                  0.066   natural killer-tumor recognition sequence 
SFRS16                0.066   splicing factor, arginine/serine-rich 16 
RSRC2                 0.061   arginine/serine-rich coiled-coil 2 isoform a 
RBM25                 0.060   RNA binding motif protein 25 
SON                   0.060   SON DNA-binding protein isoform F 
SON                   0.060   SON DNA-binding protein isoform B 
SFRS11                0.060   splicing factor, arginine/serine-rich 11 
PRPF4B                0.058   serine/threonine-protein kinase PRP4K 
CDC2L5                0.057   cell division cycle 2-like 5 isoform 1 
CDC2L5                0.057   cell division cycle 2-like 5 isoform 2 
LOC100131202          0.056   PREDICTED: hypothetical protein 
LOC100288837          0.054   PREDICTED: hypothetical protein XP_002343931, parti...

Human BLASTP results (used to prepare the table)

Gene descriptions are from NCBI RefSeq. Search results were obtained with NCBI BLAST and RefSeq entries. When identical proteins are present, the self-match may not be listed first in BLASTP output. In such cases, the table above has been reordered to place it first.
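
The reordering described above can be sketched as a stable sort that moves the query's own entry to the front and leaves the BLASTP order of all other hits unchanged. This is only an illustration: the tuple layout and the function name `self_match_first` are assumptions, and the site's actual procedure is not documented on this page.

```python
def self_match_first(hits, query_name):
    """Place the query's own hit first, keeping the other hits in order.

    `hits` is a list of (name, relative_score, description) tuples in
    BLASTP output order (hypothetical layout).
    """
    # sorted() is stable, and False sorts before True, so rows whose
    # name equals the query move to the front without disturbing the rest.
    return sorted(hits, key=lambda hit: hit[0] != query_name)


if __name__ == "__main__":
    # Invented rows for illustration only.
    hits = [
        ("IDENTICAL_COPY", 1.000, "identical protein listed first by BLASTP"),
        ("LOC100170229", 1.000, "hypothetical protein LOC100170229"),
        ("SRRM2", 0.140, "splicing coactivator subunit SRm300"),
    ]
    for row in self_match_first(hits, "LOC100170229"):
        print(row)
```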

See About the Figures for the scoring system used in the comparative genomics figure above. The same scoring system was used for the relative scores in the table of BLASTP results.

Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press