Guide to the Human Genome
Name: SEMG1
Sequence: 462 aa (fasta or formatted)
NCBI GI: 4506883
Description:

semenogelin I isoform a preproprotein

Referenced in:

Testes and Sperm

Other entries for this name:
Alternative protein (402 aa): semenogelin I isoform b preproprotein
Composition:

[Figure: amino acid map]
 Amino acid        Percentage    Count  Longest homopolymer
 A alanine             3.2         15           2
 C cysteine            0.2          1           1
 D aspartate           4.1         19           2
 E glutamate           6.9         32           2
 F phenylalanine       1.5          7           1
 G glycine             8.7         40           2
 H histidine           6.7         31           2
 I isoleucine          3.7         17           2
 K lysine              9.1         42           1
 L leucine             6.3         29           3
 M methionine          0.4          2           1
 N asparagine          5.4         25           2
 P proline             2.4         11           1
 Q glutamine          13.2         61           2
 R arginine            4.3         20           2
 S serine             11.9         55           3
 T threonine           4.1         19           2
 V valine              4.3         20           2
 W tryptophan          0.4          2           1
 Y tyrosine            3.0         14           1
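
The three columns in the composition table (percentage, count, longest homopolymer) can be derived directly from a protein sequence. The following is a minimal sketch, not the method used to build this page: the function name `composition` and the toy peptide are hypothetical, and the rounding to one decimal place is an assumption based on the values shown above.

```python
from collections import Counter
from itertools import groupby


def composition(seq):
    """Return {residue: (percentage, count, longest_homopolymer)}.

    `seq` is a protein sequence in one-letter code. Percentages are
    rounded to one decimal place (an assumption matching the table).
    """
    seq = seq.upper()
    counts = Counter(seq)

    # groupby yields each run of identical adjacent residues in turn;
    # keep the longest run seen for every residue.
    longest = {}
    for residue, run in groupby(seq):
        run_length = sum(1 for _ in run)
        longest[residue] = max(longest.get(residue, 0), run_length)

    total = len(seq)
    return {
        residue: (round(100.0 * counts[residue] / total, 1),
                  counts[residue],
                  longest[residue])
        for residue in sorted(counts)
    }


# Toy peptide for illustration only; this is NOT the SEMG1 sequence.
toy = "MKQQSSSLLK"
for residue, (pct, count, run) in composition(toy).items():
    print(f"{residue}  {pct:5.1f}  {count:3d}  {run:3d}")
```

Run on the full 462-residue sequence, a routine like this should produce counts that sum to the sequence length, as the counts in the table above do.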
Comparative genomics:

Search single species RefSeq proteins at NCBI
   H. sapiens
   M. musculus
   D. rerio
   C. intestinalis
   S. purpuratus
   D. melanogaster
   C. elegans
   A. thaliana
   S. cerevisiae
   E. coli W3110
   A. pernix K1

Search summary

[Figure: comparative genomics plot]

Additional searches of
RefSeq proteins at NCBI

   All
   Eukaryotes
   Bacteria
   Archaea
   Viruses
   Primates
   Mammals
   Vertebrates

Related human proteins:
Protein          Relative score         Description

Self-match            1.000   semenogelin I isoform a preproprotein 
SEMG1                 0.719   semenogelin I isoform b preproprotein 
SEMG2                 0.681   semenogelin II precursor 
FLG                   0.037   filaggrin 
RPTN                  0.030   repetin 
FLG2                  0.028   filaggrin family member 2 
HRNR                  0.025   hornerin 
CWC22                 0.023   CWC22 spliceosome-associated protein homolog 
LOC100133758          0.022   PREDICTED: hypothetical protein, partial 
FAM133A               0.022   hypothetical protein LOC286499 
LOC440243             0.021   PREDICTED: Putative golgin subfamily A member 6-lik...
ATRX                  0.021   transcriptional regulator ATRX isoform 2 
ATRX                  0.021   transcriptional regulator ATRX isoform 1 
LOC727832             0.020   golgi autoantigen, golgin subfamily a-like 
HRC                   0.020   histidine rich calcium binding protein precursor [Hom...
LOC100170229          0.020   hypothetical protein LOC100170229 
CALD1                 0.020   caldesmon 1 isoform 1 
EIF5B                 0.020   eukaryotic translation initiation factor 5B 
GOLIM4                0.019   golgi integral membrane protein 4 
C2orf16               0.019   hypothetical protein LOC84226 
LOC645202             0.019   PREDICTED: hypothetical protein LOC645202 
EEA1                  0.018   early endosome antigen 1, 162kD 
RBBP6                 0.016   retinoblastoma-binding protein 6 isoform 2 
RBBP6                 0.016   retinoblastoma-binding protein 6 isoform 1 
CDC2L2                0.016   cell division cycle 2-like 2 isoform 1 
TCHH                  0.016   trichohyalin 
LOC283767             0.016   golgi autoantigen, golgin subfamily a-like 
BOD1L                 0.015   biorientation of chromosomes in cell division 1-like...
SLTM                  0.015   modulator of estrogen induced transcription isoform ...
SLTM                  0.015   modulator of estrogen induced transcription isoform ...
Human BLASTP results (used to prepare the table)

Gene descriptions are from NCBI RefSeq. Search results were obtained with NCBI BLAST and RefSeq entries. When identical proteins are present, the self-match may not be listed first in BLASTP output. In such cases, the table above has been reordered to place it first.
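
The reordering described above can be sketched as follows. This is an illustrative assumption about how such a step might look, not the code used to prepare the table: the function name `place_self_match_first`, the dictionary keys, and the identifiers in the example are all hypothetical.

```python
def place_self_match_first(hits, query_id):
    """Return a copy of `hits` with the query's own entry moved to the front.

    `hits` is a list of dicts in BLASTP output order, each with an "id" key
    (hypothetical layout). All other hits keep their original relative order.
    If the query is not among the hits, the list is returned unchanged.
    """
    for index, hit in enumerate(hits):
        if hit["id"] == query_id:
            return [hit] + hits[:index] + hits[index + 1:]
    return list(hits)


# Hypothetical example: an identical protein is listed ahead of the query.
hits = [
    {"id": "identical_copy", "score": 1.000},
    {"id": "query_protein", "score": 1.000},
    {"id": "distant_relative", "score": 0.037},
]
for hit in place_self_match_first(hits, "query_protein"):
    print(f'{hit["id"]:<20} {hit["score"]:.3f}')
```

Matching on an identifier rather than on the score matters here, because an identical protein ties with the self-match on score.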

See About the Figures for the scoring system used in the comparative genomics plot above. The same scoring system was used in the table of BLASTP results.


Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press