Guide to the Human Genome
Name: MSLN
Sequence: 630 aa (available as fasta or formatted)
NCBI GI: 53988380
Description:

mesothelin isoform 2 preproprotein

Not currently referenced in the text

Other entries for this name:
alt prot {622aa} mesothelin isoform 1 preproprotein
Composition:

amino acid map
 Amino acid        Percentage    Count  Longest homopolymer
 A alanine             8.9         56           2
 C cysteine            1.9         12           1
 D aspartate           5.4         34           2
 E glutamate           5.7         36           2
 F phenylalanine       2.9         18           2
 G glycine             7.0         44           3
 H histidine           1.0          6           1
 I isoleucine          2.7         17           2
 K lysine              3.5         22           2
 L leucine            16.3        103           3
 M methionine          1.4          9           1
 N asparagine          1.9         12           1
 P proline             8.7         55           2
 Q glutamine           5.2         33           2
 R arginine            7.0         44           2
 S serine              6.8         43           2
 T threonine           4.6         29           1
 V valine              5.9         37           1
 W tryptophan          1.4          9           1
 Y tyrosine            1.7         11           1
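
The three columns above (percentage, count, longest homopolymer) can be derived directly from a protein sequence. The following is a minimal sketch, assuming the sequence is available as a string of one-letter amino acid codes. The function name `composition` and the short example sequence are hypothetical; the example is not the MSLN sequence, and this is not necessarily how the table was generated.

```python
from collections import Counter
from itertools import groupby


def composition(seq):
    """Return {residue: (percentage, count, longest_homopolymer)} for a protein string."""
    seq = seq.upper()
    counts = Counter(seq)
    # groupby yields maximal runs of identical letters; keep the longest run per residue.
    longest = {}
    for residue, run in groupby(seq):
        length = sum(1 for _ in run)
        longest[residue] = max(longest.get(residue, 0), length)
    total = len(seq)
    return {
        residue: (round(100.0 * n / total, 1), n, longest[residue])
        for residue, n in counts.items()
    }


# Hypothetical toy sequence for illustration only (not the MSLN sequence).
toy = "MALLLPGGA"
for residue, (pct, count, run) in sorted(composition(toy).items()):
    print(f"{residue}  {pct:5.1f}  {count:3d}  {run:2d}")
```

The percentages in the table are consistent with this calculation: the counts sum to 630, and leucine at 103 of 630 residues rounds to 16.3.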
Comparative genomics:

Search single species RefSeq proteins at NCBI
   H. sapiens
   M. musculus
   D. rerio
   C. intestinalis
   S. purpuratus
   D. melanogaster
   C. elegans
   A. thaliana
   S. cerevisiae
   E. coli W3110
   A. pernix K1

Search summary

comparative genomics plot

   Figure data

Additional searches of RefSeq proteins at NCBI

   All
   Eukaryotes
   Bacteria
   Archaea
   Viruses
   Primates
   Mammals
   Vertebrates

Related human proteins:
Protein          Relative score         Description

Self-match            1.000   mesothelin isoform 2 preproprotein 
MSLN                  0.980   mesothelin isoform 1 preproprotein 
MSLNL                 0.108   mesothelin-like 
CHADL                 0.014   chondroadherin-like 
EDEM2                 0.006   ER degradation enhancer, mannosidase alpha-like 2 i...
EDEM2                 0.006   ER degradation enhancer, mannosidase alpha-like 2 i...
KDM5C                 0.005   jumonji, AT rich interactive domain 1C isoform 1 [H...
LOC100293168          0.005   PREDICTED: hypothetical protein 
LOC100287177          0.005   PREDICTED: hypothetical protein 
LOC100287177          0.005   PREDICTED: hypothetical protein XP_002343692 
SARDH                 0.005   sarcosine dehydrogenase precursor 
SARDH                 0.005   sarcosine dehydrogenase precursor 
EPN1                  0.004   epsin 1 isoform c 
EPN1                  0.004   epsin 1 isoform b 
EPN1                  0.004   epsin 1 isoform a 
RGS20                 0.004   regulator of G-protein signaling 20 isoform a 
NAT14                 0.004   N-acetyltransferase 14 
RSC1A1                0.004   regulatory solute carrier protein, family 1, member 1...
PUM1                  0.004   pumilio 1 isoform 2 
PUM1                  0.004   pumilio 1 isoform 1 
LOC400558             0.004   PREDICTED: hypothetical protein LOC400558 
LOC400558             0.004   PREDICTED: hypothetical protein LOC400558 
LOC400558             0.004   PREDICTED: hypothetical protein LOC400558 
GPSM2                 0.004   LGN protein 
Human BLASTP results (used to prepare the table)

Gene descriptions are from NCBI RefSeq. Search results were obtained with NCBI BLAST and RefSeq entries. When identical proteins are present, the self-match may not be listed first in BLASTP output. In such cases, the table above has been reordered to place it first.
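
The reordering described above can be sketched as a stable sort that moves the query's own entry to the front and leaves every other hit in its BLASTP order. This is only an illustration under stated assumptions: the record fields (`id`, `score`), the function name `place_self_match_first`, and the example hits are hypothetical, and the actual procedure used to prepare the table is not documented here.

```python
def place_self_match_first(hits, query_id):
    """Return a new list with the query's own entry first.

    Python's sort is stable, so all other hits keep their original
    relative order (False sorts before True).
    """
    return sorted(hits, key=lambda hit: hit["id"] != query_id)


# Hypothetical hit list in which an identical protein is reported before the query.
hits = [
    {"id": "hypothetical-identical-entry", "score": 1.0},
    {"id": "53988380", "score": 1.0},  # the query's NCBI GI, taken from this page
    {"id": "hypothetical-lower-hit", "score": 0.1},
]

for hit in place_self_match_first(hits, "53988380"):
    print(hit["id"], hit["score"])
```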

See About the Figures for the scoring system used in the comparative genomics plot above. The same scoring system was used in the table of BLASTP results.


Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press