Guide to the Human Genome
Name: CPSF1
Sequence: fasta or formatted (1443 aa)
NCBI GI: 56676371
Description:

cleavage and polyadenylation specific factor 1, 160kDa

Referenced in:

Polyadenylation
Capping and Splicing

Composition:

[Figure: amino acid map]
 Amino acid        Percentage    Count  Longest homopolymer
 A alanine             7.1        103           3
 C cysteine            1.5         22           1
 D aspartate           4.6         67           3
 E glutamate           7.5        108           3
 F phenylalanine       4.1         59           2
 G glycine             7.3        105           3
 H histidine           2.7         39           2
 I isoleucine          4.2         61           2
 K lysine              4.3         62           3
 L leucine            10.7        155           3
 M methionine          2.4         34           1
 N asparagine          3.1         45           2
 P proline             5.8         84           2
 Q glutamine           3.7         54           2
 R arginine            6.2         90           2
 S serine              6.7         97           2
 T threonine           5.8         83           2
 V valine              7.8        113           3
 W tryptophan          1.0         14           1
 Y tyrosine            3.3         48           2
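
The three columns above can be recomputed from any protein sequence. The following is a minimal sketch, not the method used to build this page: it assumes the percentage is the residue count divided by the sequence length (1443 for CPSF1) and rounded to one decimal place, and that "longest homopolymer" means the longest uninterrupted run of a single residue. The function name `composition` and the toy sequence are hypothetical.

```python
from collections import Counter
from itertools import groupby


def composition(seq):
    """Return {residue: (percentage, count, longest_run)} for a protein sequence.

    Assumption: percentage = 100 * count / len(seq), rounded to one decimal.
    """
    seq = seq.upper()
    counts = Counter(seq)

    # groupby yields consecutive runs of identical residues; keep the longest per residue.
    longest = {}
    for residue, run in groupby(seq):
        run_length = sum(1 for _ in run)
        longest[residue] = max(longest.get(residue, 0), run_length)

    total = len(seq)
    return {
        residue: (round(100 * counts[residue] / total, 1), counts[residue], longest[residue])
        for residue in sorted(counts)
    }


# Toy sequence for illustration only (not the CPSF1 sequence).
table = composition("MAAALLKA")
for residue, (pct, count, run) in table.items():
    print(f"{residue}  {pct:5.1f}  {count:5d}  {run:3d}")
```

Under the same assumption, the alanine row above is consistent: 103 / 1443 is about 7.1%.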

Comparative genomics:

Search single species RefSeq proteins at NCBI
   H. sapiens
   M. musculus
   D. rerio
   C. intestinalis
   S. purpuratus
   D. melanogaster
   C. elegans
   A. thaliana
   S. cerevisiae
   E. coli W3110
   A. pernix K1

Search summary

[Figure: comparative genomics plot]

Additional searches of
RefSeq proteins at NCBI

   All
   Eukaryotes
   Bacteria
   Archaea
   Viruses
   Primates
   Mammals
   Vertebrates

Related human proteins:
Protein          Relative score         Description

Self-match            1.000   cleavage and polyadenylation specific factor 1, 160k... 
LOC100290337          0.021   PREDICTED: hypothetical protein XP_002347285 
DDB1                  0.021   damage-specific DNA binding protein 1 
SF3B3                 0.018   splicing factor 3b, subunit 3 
HOXC11                0.004   homeobox C11 
NFKBIL1               0.003   nuclear factor of kappa light polypeptide gene enha...
NFKBIL1               0.003   nuclear factor of kappa light polypeptide gene enha...
NFKBIL1               0.003   nuclear factor of kappa light polypeptide gene enha...
NFKBIL1               0.003   nuclear factor of kappa light polypeptide gene enha...
FMOD                  0.003   fibromodulin precursor 
FIP1L1                0.002   FIP1 like 1 isoform 3 
FIP1L1                0.002   FIP1 like 1 isoform 2 
GAB3                  0.002   Gab3 protein isoform 1 
GAB3                  0.002   Gab3 protein isoform 2 
SLC9A5                0.002   solute carrier family 9 (sodium/hydrogen exchanger), ...
TPR                   0.002   nuclear pore complex-associated protein TPR 
NIN                   0.002   ninein isoform 4 
NIN                   0.002   ninein isoform 5 
NIN                   0.002   ninein isoform 1 
NIN                   0.002   ninein isoform 2 
NACAD                 0.002   NAC alpha domain containing 
NACAD                 0.002   PREDICTED: NAC alpha domain containing 
NACAD                 0.002   PREDICTED: NAC alpha domain containing 
NACAD                 0.002   PREDICTED: NAC alpha domain containing 
NACAD                 0.002   PREDICTED: NAC alpha domain containing 
Human BLASTP results (used to prepare the table)

Gene descriptions are from NCBI RefSeq. Search results were obtained with NCBI BLAST and RefSeq entries. When identical proteins are present, the self-match may not be listed first in BLASTP output. In such cases, the table above has been reordered to place it first.

See About the Figures for the scoring system used in the comparative genomics figure above. The same scoring system was used in the table of BLASTP results.



Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press