Guide to the Human Genome
Name: LOC100289573
Sequence: 240 aa (fasta or formatted)
NCBI GI: 239744661
Description:

PREDICTED: hypothetical protein XP_002343218

Not currently referenced in the text

Composition:

 Amino acid        Percentage    Count  Longest homopolymer
 A alanine             8.3         20           2
 C cysteine            2.9          7           1
 D aspartate           3.3          8           1
 E glutamate           5.8         14           2
 F phenylalanine       2.9          7           1
 G glycine            11.2         27           3
 H histidine           2.9          7           1
 I isoleucine          1.7          4           1
 K lysine              2.1          5           1
 L leucine            11.2         27           2
 M methionine          0.4          1           1
 N asparagine          0.8          2           1
 P proline             6.2         15           1
 Q glutamine           7.1         17           2
 R arginine           13.8         33           2
 S serine              9.2         22           2
 T threonine           2.1          5           1
 V valine              6.7         16           2
 W tryptophan          0.4          1           1
 Y tyrosine            0.8          2           1
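The three columns above can be recomputed from any protein sequence. The following is a minimal sketch, not the site's own code: the function name `composition` and the short example sequence are hypothetical, and rounding percentages to one decimal place is an assumption based on the values shown in the table.

```python
from collections import Counter
from itertools import groupby


def composition(seq):
    """Return {residue: (percentage, count, longest_homopolymer)} for a protein sequence.

    Hypothetical helper for illustration; percentage is count / length * 100,
    rounded to one decimal place (an assumption about the table's rounding).
    """
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence is empty")
    counts = Counter(seq)
    longest = {}
    # groupby yields maximal runs of identical consecutive residues
    for residue, run in groupby(seq):
        run_length = sum(1 for _ in run)
        longest[residue] = max(longest.get(residue, 0), run_length)
    total = len(seq)
    return {
        residue: (round(100.0 * counts[residue] / total, 1), counts[residue], longest[residue])
        for residue in sorted(counts)
    }


# Toy sequence for illustration only; it is NOT the LOC100289573 sequence.
example = composition("MGGGRRLLAS")
for residue, (pct, count, run) in example.items():
    print(f"{residue}  {pct:5.1f}  {count:3d}  {run:3d}")
```

Applied to the full 240 aa sequence, the counts should sum to 240, as they do in the table above.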
Comparative genomics:

Search single species RefSeq proteins at NCBI
   H. sapiens
   M. musculus
   D. rerio
   C. intestinalis
   S. purpuratus
   D. melanogaster
   C. elegans
   A. thaliana
   S. cerevisiae
   E. coli W3110
   A. pernix K1

Search summary

comparative genomics plot


Additional searches of RefSeq proteins at NCBI

   All
   Eukaryotes
   Bacteria
   Archaea
   Viruses
   Primates
   Mammals
   Vertebrates

Related human proteins:
Protein          Relative score         Description

Self-match            1.000   PREDICTED: hypothetical protein XP_002343218 
LOC100292703          1.000   PREDICTED: hypothetical protein 
LOC100290100          1.000   PREDICTED: hypothetical protein XP_002347380 
LOC100289978          0.013   PREDICTED: hypothetical protein XP_002348035 
LOC100291579          0.011   PREDICTED: hypothetical protein XP_002345911 
LOC100291870          0.011   PREDICTED: hypothetical protein XP_002345664 
ASXL2                 0.009   additional sex combs like 2 
LOC100289408          0.009   PREDICTED: hypothetical protein 
LOC100289408          0.009   PREDICTED: hypothetical protein XP_002344217 
PCDHGB1               0.009   protocadherin gamma subfamily B, 1 isoform 2 precurs...
PCDHGB6               0.009   protocadherin gamma subfamily B, 6 isoform 1 precurs...
PCDHGB6               0.009   protocadherin gamma subfamily B, 6 isoform 2 precurs...
PCDHGB1               0.009   protocadherin gamma subfamily B, 1 isoform 1 precurs...
MUC5AC                0.007   mucin 5AC 
CSF2RB                0.007   colony stimulating factor 2 receptor, beta precursor ...
OGT                   0.007   O-linked GlcNAc transferase isoform 2 
OGT                   0.007   O-linked GlcNAc transferase isoform 1 
MUC5B                 0.007   mucin 5, subtype B, tracheobronchial 
SYN2                  0.007   synapsin II isoform IIa 
SYN2                  0.007   synapsin II isoform IIb 
TESK2                 0.007   testis-specific protein kinase 2 
FLNA                  0.007   filamin A, alpha isoform 2 
FLNA                  0.007   filamin A, alpha isoform 1 
CTCF                  0.007   CCCTC-binding factor 
Human BLASTP results (used to prepare the table)

Gene descriptions are from NCBI RefSeq. Search results were obtained with NCBI BLAST and RefSeq entries. When identical proteins are present, the self-match may not be listed first in BLASTP output. In such cases, the table above has been reordered to place it first.
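The reordering described above can be sketched as follows. This is an illustrative assumption about how such a step might look, not the procedure actually used to prepare the table: the function name `self_match_first`, the tuple layout, and the raw hit order in the example are all hypothetical.

```python
def self_match_first(hits, query_name):
    """Move the query's own hit to the front, keeping all other hits in order.

    hits: list of (protein, relative_score, description) tuples in BLASTP
    output order (hypothetical layout for illustration).
    """
    for i, hit in enumerate(hits):
        if hit[0] == query_name:
            return [hit] + hits[:i] + hits[i + 1:]
    # No self-match found: return an unchanged copy
    return list(hits)


# Assumed raw order in which an identical protein precedes the self-match.
raw_hits = [
    ("LOC100292703", 1.000, "PREDICTED: hypothetical protein"),
    ("LOC100289573", 1.000, "PREDICTED: hypothetical protein XP_002343218"),
    ("LOC100289978", 0.013, "PREDICTED: hypothetical protein XP_002348035"),
]
ordered = self_match_first(raw_hits, "LOC100289573")
```

Because only one entry moves, ties among identical proteins keep the relative order BLASTP reported.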

See "About the Figures" for the scoring system used in the comparative genomics figure. The same scoring system was used in the table of BLASTP results.


Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press