Guide to the Human Genome
Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help
Name: C1orf226 Sequence: fasta or formatted (315aa) NCBI GI: 207028635
Description:

hypothetical protein LOC400793 isoform 1

Not currently referenced in the text

Other entries for this name:
alt prot {272aa} hypothetical protein LOC400793 isoform 2
Composition:

amino acid map
 Amino acid        Percentage    Count  Longest homopolymer
 A alanine             7.6         24           2
 C cysteine            0.6          2           1
 D aspartate           5.1         16           1
 E glutamate           7.3         23           2
 F phenylalanine       2.5          8           1
 G glycine             7.6         24           2
 H histidine           2.2          7           1
 I isoleucine          1.9          6           1
 K lysine              5.1         16           1
 L leucine            11.7         37           2
 M methionine          1.9          6           1
 N asparagine          2.5          8           1
 P proline            10.2         32           4
 Q glutamine           4.1         13           1
 R arginine            4.8         15           2
 S serine             11.7         37           2
 T threonine           5.7         18           2
 V valine              5.7         18           1
 W tryptophan          1.0          3           1
 Y tyrosine            0.6          2           1
Comparative genomics:

Search single species RefSeq proteins at NCBI
   H. sapiens
   M. musculus
   D. rerio
   C. intestinalis
   S. purpuratus
   D. melanogaster
   C. elegans
   A. thaliana
   S. cerevisiae
   E. coli W3110
   A. pernix K1

Search summary

comparative genomics plot

   Figure data

Additional searches of
RefSeq proteins at NCBI

   All
   Eukaryotes
   Bacteria
   Archaea
   Viruses
   Primates
   Mammals
   Vertebrates

Related human proteins:
Protein          Relative score         Description

Self-match            1.000   hypothetical protein LOC400793 isoform 1 
C1orf226              0.860   hypothetical protein LOC400793 isoform 2 
SETD1A                0.021   SET domain containing 1A 
DNMT1                 0.020   DNA (cytosine-5-)-methyltransferase 1 isoform a [Ho...
MYLK                  0.020   myosin light chain kinase isoform 3A 
MYLK                  0.020   myosin light chain kinase isoform 1 
MYLK                  0.020   myosin light chain kinase isoform 3B 
MYLK                  0.020   myosin light chain kinase isoform 2 
RAVER1                0.018   RAVER1 
TEAD3                 0.018   TEA domain family member 3 
CRTC1                 0.017   mucoepidermoid carcinoma translocated 1 isoform 3 [...
CRTC1                 0.017   mucoepidermoid carcinoma translocated 1 isoform 1 [...
MBD6                  0.017   methyl-CpG binding domain protein 6 
SDC3                  0.017   syndecan 3 
PUM2                  0.015   pumilio homolog 2 
FAM9A                 0.015   family with sequence similarity 9, member A 
PSD4                  0.015   pleckstrin and Sec7 domain containing 4 
NES                   0.013   nestin 
ZFHX2                 0.013   PREDICTED: zinc finger homeobox 2 
ZFHX2                 0.013   PREDICTED: zinc finger homeobox 2 
ZFHX2                 0.013   PREDICTED: zinc finger homeobox 2 
SYNJ1                 0.013   synaptojanin 1 isoform c 
SYNJ1                 0.013   synaptojanin 1 isoform a 
SYNJ1                 0.013   synaptojanin 1 isoform b 
SYNJ1                 0.013   synaptojanin 1 isoform d 
ISM2                  0.013   isthmin 2 homolog isoform 3 
ISM2                  0.013   isthmin 2 homolog isoform 1 
STIM2                 0.013   stromal interaction molecule 2 
DNMT1                 0.012   DNA (cytosine-5-)-methyltransferase 1 isoform b [Homo...
GOLGA6D               0.012   golgi autoantigen, golgin subfamily a, 6D 
Human BLASTP results (used to prepare the table)

Gene descriptions are from NCBI RefSeq. Search results were obtained with NCBI BLAST and RefSeq entries. When identical proteins are present, the self-match may not be listed first in BLASTP output. In such cases, the table above has been reordered to place it first.

See About the Figures for the scoring system used in the figure above right. The same scoring system was used in the table of BLASTP results.


Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help

Guide to the Human Genome
Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press