Guide to the Human Genome
Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help
Name: SETD1A Sequence: fasta or formatted (1707aa) NCBI GI: 55741677
Description:

SET domain containing 1A

Referenced in:

Protein Composition and Structure
Histones, Related Proteins, and Modifying Enzymes
PHD Finger Proteins

Composition:

amino acid map
 Amino acid        Percentage    Count  Longest homopolymer
 A alanine             8.3        141           2
 C cysteine            0.8         14           2
 D aspartate           5.0         85           3
 E glutamate          10.3        176           6
 F phenylalanine       2.6         44           1
 G glycine             6.7        114           8
 H histidine           1.5         26           2
 I isoleucine          2.5         42           2
 K lysine              4.5         76           3
 L leucine             6.3        108           2
 M methionine          1.5         25           1
 N asparagine          2.1         36           1
 P proline            12.3        210          13
 Q glutamine           3.9         67           3
 R arginine            7.0        120           4
 S serine             12.0        205          24
 T threonine           5.3         91           2
 V valine              4.0         68           3
 W tryptophan          0.6         11           2
 Y tyrosine            2.8         48           2
Comparative genomics:

Search single species RefSeq proteins at NCBI
   H. sapiens
   M. musculus
   D. rerio
   C. intestinalis
   S. purpuratus
   D. melanogaster
   C. elegans
   A. thaliana
   S. cerevisiae
   E. coli W3110
   A. pernix K1

Search summary

comparative genomics plot

   Figure data

Additional searches of
RefSeq proteins at NCBI

   All
   Eukaryotes
   Bacteria
   Archaea
   Viruses
   Primates
   Mammals
   Vertebrates

Related human proteins:
Protein          Relative score         Description

Self-match            1.000   SET domain containing 1A 
SETD1B                0.294   SET domain containing 1B 
MLL4                  0.038   myeloid/lymphoid or mixed-lineage leukemia 4 
MLL                   0.036   myeloid/lymphoid or mixed-lineage leukemia protein [...
MLL2                  0.030   myeloid/lymphoid or mixed-lineage leukemia 2 
MLL3                  0.030   myeloid/lymphoid or mixed-lineage leukemia 3 
NOLC1                 0.023   nucleolar and coiled-body phosphoprotein 1 
MICAL3                0.023   microtubule associated monoxygenase, calponin and L...
PRR12                 0.022   proline rich 12 
TTN                   0.022   titin isoform N2-A 
EZH2                  0.021   enhancer of zeste 2 isoform b 
EZH2                  0.021   enhancer of zeste 2 isoform a 
EZH1                  0.021   enhancer of zeste homolog 1 
TNRC18                0.021   trinucleotide repeat containing 18 
NEFH                  0.019   neurofilament, heavy polypeptide 200kDa 
BRD4                  0.019   bromodomain-containing protein 4 isoform long 
KIAA0754              0.019   hypothetical protein LOC643314 
TCOF1                 0.019   Treacher Collins-Franceschetti syndrome 1 isoform d...
SRRM2                 0.019   splicing coactivator subunit SRm300 
KIAA1522              0.018   hypothetical protein LOC57648 
PRB2                  0.018   proline-rich protein BstNI subfamily 2 
SETD2                 0.018   SET domain containing 2 
PELP1                 0.017   proline, glutamic acid and leucine rich protein 1 [...
POLR2A                0.017   DNA-directed RNA polymerase II A 
NSD1                  0.017   nuclear receptor binding SET domain protein 1 isofor...
NSD1                  0.017   nuclear receptor binding SET domain protein 1 isofor...
TCOF1                 0.017   Treacher Collins-Franceschetti syndrome 1 isoform e...
SRRM1                 0.017   serine/arginine repetitive matrix 1 
WHSC1                 0.017   Wolf-Hirschhorn syndrome candidate 1 protein isoform...
WHSC1                 0.017   Wolf-Hirschhorn syndrome candidate 1 protein isoform...
Human BLASTP results (used to prepare the table)

Gene descriptions are from NCBI RefSeq. Search results were obtained with NCBI BLAST and RefSeq entries. When identical proteins are present, the self-match may not be listed first in BLASTP output. In such cases, the table above has been reordered to place it first.

See About the Figures for the scoring system used in the figure above right. The same scoring system was used in the table of BLASTP results.


Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help

Guide to the Human Genome
Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press