Guide to the Human Genome
Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help

Search of human proteins with 40288197

BLASTP 2.2.11 [Jun-05-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= gi|40288197 GATA binding protein 6 [Homo sapiens]
         (595 letters)

Database: hs.faa 
           37,866 sequences; 18,247,518 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gi|40288197 GATA binding protein 6 [Homo sapiens]                    1236   0.0  
gi|33188461 GATA binding protein 4 [Homo sapiens]                     362   e-100
gi|17998698 GATA binding protein 5 [Homo sapiens]                     320   3e-87
gi|224611699 GATA binding protein 2 isoform 1 [Homo sapiens]          221   1e-57
gi|20070352 GATA binding protein 2 isoform 1 [Homo sapiens]           221   1e-57
gi|50541959 GATA binding protein 3 isoform 1 [Homo sapiens]           204   2e-52
gi|4503929 GATA binding protein 3 isoform 2 [Homo sapiens]            202   7e-52
gi|224611701 GATA binding protein 2 isoform 2 [Homo sapiens]          196   7e-50
gi|4503925 GATA binding protein 1 [Homo sapiens]                      192   1e-48
gi|239757043 PREDICTED: functional smad suppressing element 18 [...    86   1e-16
gi|239751555 PREDICTED: functional smad suppressing element 18 [...    86   1e-16
gi|239746067 PREDICTED: functional smad suppressing element 18 [...    86   1e-16
gi|5453936 POU class 3 homeobox 3 [Homo sapiens]                       80   4e-15
gi|90652851 zinc finger transcription factor TRPS1 [Homo sapiens]      72   1e-12
gi|40068464 AT rich interactive domain 1B (SWI1-like) isoform 2 ...    63   9e-10
gi|40068466 AT rich interactive domain 1B (SWI1-like) isoform 1 ...    63   9e-10
gi|40068462 AT rich interactive domain 1B (SWI1-like) isoform 3 ...    63   9e-10
gi|22547197 zinc finger protein of the cerebellum 2 [Homo sapiens]     62   1e-09
gi|110624765 POU domain, class 3, transcription factor 1 [Homo s...    62   1e-09
gi|120587025 SH3 and multiple ankyrin repeat domains 1 [Homo sap...    61   3e-09
gi|111118976 collagen, type II, alpha 1 isoform 1 precursor [Hom...    60   4e-09
gi|111118974 collagen, type II, alpha 1 isoform 2 precursor [Hom...    60   4e-09
gi|110349772 alpha 1 type I collagen preproprotein [Homo sapiens]      60   7e-09
gi|21264565 AT rich interactive domain 1A isoform a [Homo sapiens]     59   1e-08
gi|21264575 AT rich interactive domain 1A isoform b [Homo sapiens]     59   1e-08
gi|5031757 T-cell leukemia homeobox 1 [Homo sapiens]                   58   2e-08
gi|169636435 caudal type homeobox 2 [Homo sapiens]                     58   3e-08
gi|73427806 v-maf musculoaponeurotic fibrosarcoma oncogene homol...    58   3e-08
gi|5453736 v-maf musculoaponeurotic fibrosarcoma oncogene homolo...    58   3e-08
gi|4502951 collagen type III alpha 1 preproprotein [Homo sapiens]      57   4e-08

>gi|40288197 GATA binding protein 6 [Homo sapiens]
          Length = 595

 Score = 1236 bits (3198), Expect = 0.0
 Identities = 595/595 (100%), Positives = 595/595 (100%)

Query: 1   MALTDGGWCLPKRFGAAGADASDSRAFPAREPSTPPSPISSSSSSCSRGGERGPGGASNC 60
           MALTDGGWCLPKRFGAAGADASDSRAFPAREPSTPPSPISSSSSSCSRGGERGPGGASNC
Sbjct: 1   MALTDGGWCLPKRFGAAGADASDSRAFPAREPSTPPSPISSSSSSCSRGGERGPGGASNC 60

Query: 61  GTPQLDTEAAAGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQ 120
           GTPQLDTEAAAGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQ
Sbjct: 61  GTPQLDTEAAAGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQ 120

Query: 121 AATASKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAA 180
           AATASKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAA
Sbjct: 121 AATASKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAA 180

Query: 181 AAASSPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGG 240
           AAASSPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGG
Sbjct: 181 AAASSPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGG 240

Query: 241 AAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSL 300
           AAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSL
Sbjct: 241 AAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSL 300

Query: 301 AAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAGPFETPVLH 360
           AAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAGPFETPVLH
Sbjct: 301 AAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAGPFETPVLH 360

Query: 361 SLQSRAGAPLPVPRGPSADLLEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKM 420
           SLQSRAGAPLPVPRGPSADLLEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKM
Sbjct: 361 SLQSRAGAPLPVPRGPSADLLEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKM 420

Query: 421 NGLSRPLIKPQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 480
           NGLSRPLIKPQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP
Sbjct: 421 NGLSRPLIKPQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 480

Query: 481 LAMKKEGIQTRKRKPKNINKSKTCSGNSNNSIPMTPTSTSSNSDDCSKNTSPTTQPTASG 540
           LAMKKEGIQTRKRKPKNINKSKTCSGNSNNSIPMTPTSTSSNSDDCSKNTSPTTQPTASG
Sbjct: 481 LAMKKEGIQTRKRKPKNINKSKTCSGNSNNSIPMTPTSTSSNSDDCSKNTSPTTQPTASG 540

Query: 541 AGAPVMTGAGESTNPENSELKYSGQDGLYIGVSLASPAEVTSSVRPDSWCALALA 595
           AGAPVMTGAGESTNPENSELKYSGQDGLYIGVSLASPAEVTSSVRPDSWCALALA
Sbjct: 541 AGAPVMTGAGESTNPENSELKYSGQDGLYIGVSLASPAEVTSSVRPDSWCALALA 595


>gi|33188461 GATA binding protein 4 [Homo sapiens]
          Length = 442

 Score =  362 bits (929), Expect = e-100
 Identities = 228/476 (47%), Positives = 277/476 (58%), Gaps = 70/476 (14%)

Query: 147 MYQTLAALSSQGP--AAYD-GAPGGFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPY 203
           MYQ+LA  ++ GP   AY+ G PG F+H A AA       SSPVYVPT RV S + GL Y
Sbjct: 1   MYQSLAMAANHGPPPGAYEAGGPGAFMHGAGAA-------SSPVYVPTPRVPSSVLGLSY 53

Query: 204 HLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARF 263
            LQG G+G A+  GGA        AS   P  G+  G+ G   AG  GA      VS RF
Sbjct: 54  -LQGGGAGSAS--GGASGGSSGGAASGAGP--GTQQGSPGWSQAGADGAAYTPPPVSPRF 108

Query: 264 PYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQYSSLSAARPLNGTY 323
            +  +      AA      AAA    A   SGGG++ A + GRE QY     A    G+Y
Sbjct: 109 SFPGTTGSLAAAAA-----AAAAREAAAYSSGGGAAGAGLAGRE-QYGRAGFA----GSY 158

Query: 324 HHHHHHHHHHPSPYSPYVGAPLTPAWPA------GPFETPVLHSLQSRAGAPLPVPRGPS 377
                      SPY  Y+ A +  +W A      GPF++PVLHSL  RA    P  R P+
Sbjct: 159 S----------SPYPAYM-ADVGASWAAAAAASAGPFDSPVLHSLPGRAN---PAARHPN 204

Query: 378 ADLLEDLSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSS 437
            D+ +D SE RECVNCG++ TPLWRRDGTGHYLCNACGLY KMNG++RPLIKPQ+R+ +S
Sbjct: 205 LDMFDDFSEGRECVNCGAMSTPLWRRDGTGHYLCNACGLYHKMNGINRPLIKPQRRLSAS 264

Query: 438 RRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMKKEGIQTRKRKPKN 497
           RR+GLSCANC TTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAM+KEGIQTRKRKPKN
Sbjct: 265 RRVGLSCANCQTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMRKEGIQTRKRKPKN 324

Query: 498 INKSKTCSGNSNNSIPMTPTSTSSNSDDCSKNTSPTTQPTASGAG--------------- 542
           +NKSKT +  S +      +  SSNS + + ++S   +P  +  G               
Sbjct: 325 LNKSKTPAAPSGSESLPPASGASSNSSNATTSSSEEMRPIKTEPGLSSHYGHSSSVSQTF 384

Query: 543 -APVMTGAGESTNPENSELKYSGQDGLYIGVSLASPAEVT--SSVRPDSWCALALA 595
               M+G G S +P  S LK S Q         ASP   +  +S + DSW +L LA
Sbjct: 385 SVSAMSGHGPSIHPVLSALKLSPQ-------GYASPVSQSPQTSSKQDSWNSLVLA 433


>gi|17998698 GATA binding protein 5 [Homo sapiens]
          Length = 397

 Score =  320 bits (819), Expect = 3e-87
 Identities = 198/455 (43%), Positives = 250/455 (54%), Gaps = 64/455 (14%)

Query: 147 MYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQ 206
           MYQ+LA  +S   AAY  + G F+H        A  A SP++VP  RV SML  L     
Sbjct: 1   MYQSLALAASPRQAAYADS-GSFLH--------APGAGSPMFVPPARVPSMLSYL----- 46

Query: 207 GSGSGPANHAGGAGAHPGWPQ-ASADSPPYGSGGGAAGGGAAGPGGAGSAAAHV--SARF 263
            SG  P+       A PGW Q A+ADS             A GPG     AAH   +  F
Sbjct: 47  -SGCEPSPQPPELAARPGWAQTATADS------------SAFGPGSPHPPAAHPPGATAF 93

Query: 264 PYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQYSSLSAARPLNGTY 323
           P++ SP             +  GSGG+ G   G +   A+  RE   + L   RP+  +Y
Sbjct: 94  PFAHSP-------------SGPGSGGSAGGRDGSAYQGALLPREQFAAPL--GRPVGTSY 138

Query: 324 HHHHHHHHHHPSPYSPYVGAPLTPAWPAGPFETPVLHSLQSRAGAPLPVPRGPSADLLED 383
                      + Y  YV   +  +W AGPF+  VLH L  R       P   S  L E 
Sbjct: 139 ----------SATYPAYVSPDVAQSWTAGPFDGSVLHGLPGRR------PTFVSDFLEEF 182

Query: 384 LSESRECVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGLS 443
             E RECVNCG++ TPLWRRDGTGHYLCNACGLY KMNG++RPL++PQKR+ SSRR GL 
Sbjct: 183 PGEGRECVNCGALSTPLWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRAGLC 242

Query: 444 CANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMKKEGIQTRKRKPKNINKSKT 503
           C NCHTT TTLWRRN+EGEPVCNACGLYMKLHGVPRPLAMKKE IQTRKRKPK I K++ 
Sbjct: 243 CTNCHTTNTTLWRRNSEGEPVCNACGLYMKLHGVPRPLAMKKESIQTRKRKPKTIAKARG 302

Query: 504 CSGNSNNSIPMTPTSTSSNSDDCSKNTSPT-TQPTASGAG-APVMTG-AGESTNPENSEL 560
            SG++ N+        S++S   +    P+   P   G   AP  +G   +S  P + E 
Sbjct: 303 SSGSTRNASASPSAVASTDSSAATSKAKPSLASPVCPGPSMAPQASGQEDDSLAPGHLEF 362

Query: 561 KYSGQDGLYIGVSLASPAEVTSSVRPDSWCALALA 595
           K+  +D  +   + +  A +  ++R ++WCALALA
Sbjct: 363 KFEPEDFAFPSTAPSPQAGLRGALRQEAWCALALA 397


>gi|224611699 GATA binding protein 2 isoform 1 [Homo sapiens]
          Length = 480

 Score =  221 bits (564), Expect = 1e-57
 Identities = 153/389 (39%), Positives = 185/389 (47%), Gaps = 57/389 (14%)

Query: 144 PEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTRVGSML----- 198
           P+E+      L SQG   Y          A  A A A  + SP +   T  G M      
Sbjct: 42  PDEVDVFFNHLDSQGNPYY----------ANPAHARARVSYSPAHARLTG-GQMCRPHLL 90

Query: 199 --PGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPY----------------GSGGG 240
             PGLP+     G   A  A  A  H  W  +     P                 G+GGG
Sbjct: 91  HSPGLPWL---DGGKAALSAAAAHHHNPWTVSPFSKTPLHPSAAGGPGGPLSVYPGAGGG 147

Query: 241 AAGGGAAGPGGAGSAAAHVSAR-FPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSS 299
           + GG  +        AAH  +  F + P+PP          G A+  S  AGG +  G  
Sbjct: 148 SGGGSGSSVASLTPTAAHSGSHLFGFPPTPPKEVSPDPSTTGAASPASSSAGGSAARGED 207

Query: 300 LAAMGGREPQYSSL-----SAARPLNGTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAGPF 354
              +  +     S+     S  RP   T       HH  P+ Y  YV A       A  +
Sbjct: 208 KDGVKYQVSLTESMKMESGSPLRPGLATMGTQPATHHPIPT-YPSYVPAA------AHDY 260

Query: 355 ETPVLHS---LQSRAGAPLPVPRGPSADLLEDLSESRECVNCGSIQTPLWRRDGTGHYLC 411
            + + H    L   A +  P  R  +       SE RECVNCG+  TPLWRRDGTGHYLC
Sbjct: 261 SSGLFHPGGFLGGPASSFTPKQRSKA----RSCSEGRECVNCGATATPLWRRDGTGHYLC 316

Query: 412 NACGLYSKMNGLSRPLIKPQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLY 471
           NACGLY KMNG +RPLIKP++R+ ++RR G  CANC TTTTTLWRRNA G+PVCNACGLY
Sbjct: 317 NACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLY 376

Query: 472 MKLHGVPRPLAMKKEGIQTRKRKPKNINK 500
            KLH V RPL MKKEGIQTR RK  N +K
Sbjct: 377 YKLHNVNRPLTMKKEGIQTRNRKMSNKSK 405


>gi|20070352 GATA binding protein 2 isoform 1 [Homo sapiens]
          Length = 480

 Score =  221 bits (564), Expect = 1e-57
 Identities = 153/389 (39%), Positives = 185/389 (47%), Gaps = 57/389 (14%)

Query: 144 PEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTRVGSML----- 198
           P+E+      L SQG   Y          A  A A A  + SP +   T  G M      
Sbjct: 42  PDEVDVFFNHLDSQGNPYY----------ANPAHARARVSYSPAHARLTG-GQMCRPHLL 90

Query: 199 --PGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPY----------------GSGGG 240
             PGLP+     G   A  A  A  H  W  +     P                 G+GGG
Sbjct: 91  HSPGLPWL---DGGKAALSAAAAHHHNPWTVSPFSKTPLHPSAAGGPGGPLSVYPGAGGG 147

Query: 241 AAGGGAAGPGGAGSAAAHVSAR-FPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSS 299
           + GG  +        AAH  +  F + P+PP          G A+  S  AGG +  G  
Sbjct: 148 SGGGSGSSVASLTPTAAHSGSHLFGFPPTPPKEVSPDPSTTGAASPASSSAGGSAARGED 207

Query: 300 LAAMGGREPQYSSL-----SAARPLNGTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAGPF 354
              +  +     S+     S  RP   T       HH  P+ Y  YV A       A  +
Sbjct: 208 KDGVKYQVSLTESMKMESGSPLRPGLATMGTQPATHHPIPT-YPSYVPAA------AHDY 260

Query: 355 ETPVLHS---LQSRAGAPLPVPRGPSADLLEDLSESRECVNCGSIQTPLWRRDGTGHYLC 411
            + + H    L   A +  P  R  +       SE RECVNCG+  TPLWRRDGTGHYLC
Sbjct: 261 SSGLFHPGGFLGGPASSFTPKQRSKA----RSCSEGRECVNCGATATPLWRRDGTGHYLC 316

Query: 412 NACGLYSKMNGLSRPLIKPQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLY 471
           NACGLY KMNG +RPLIKP++R+ ++RR G  CANC TTTTTLWRRNA G+PVCNACGLY
Sbjct: 317 NACGLYHKMNGQNRPLIKPKRRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLY 376

Query: 472 MKLHGVPRPLAMKKEGIQTRKRKPKNINK 500
            KLH V RPL MKKEGIQTR RK  N +K
Sbjct: 377 YKLHNVNRPLTMKKEGIQTRNRKMSNKSK 405


>gi|50541959 GATA binding protein 3 isoform 1 [Homo sapiens]
          Length = 444

 Score =  204 bits (519), Expect = 2e-52
 Identities = 108/201 (53%), Positives = 131/201 (65%), Gaps = 15/201 (7%)

Query: 332 HHP-SPYSPYVGAPLTPAWPAGPFETPVLHSLQSRAGAPLPVPRGPSADLLEDLSESREC 390
           HHP + Y PYV     P + +G F    L             P+  S+      +E REC
Sbjct: 216 HHPITTYPPYV-----PEYSSGLFPPSSLLGGSPTGFGCKSRPKARSS------TEGREC 264

Query: 391 VNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGLSCANCHTT 450
           VNCG+  TPLWRRDGTGHYLCNACGLY KMNG +RPLIKP++R+ ++RR G SCANC TT
Sbjct: 265 VNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCANCQTT 324

Query: 451 TTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMKKEGIQTRKRKPKNINKSKTCSGNSNN 510
           TTTLWRRNA G+PVCNACGLY KLH + RPL MKKEGIQTR RK    +KSK C    ++
Sbjct: 325 TTTLWRRNANGDPVCNACGLYYKLHNINRPLTMKKEGIQTRNRKMS--SKSKKCK-KVHD 381

Query: 511 SIPMTPTSTSSNSDDCSKNTS 531
           S+   P ++S N    S++ S
Sbjct: 382 SLEDFPKNSSFNPAALSRHMS 402


>gi|4503929 GATA binding protein 3 isoform 2 [Homo sapiens]
          Length = 443

 Score =  202 bits (514), Expect = 7e-52
 Identities = 108/201 (53%), Positives = 129/201 (64%), Gaps = 16/201 (7%)

Query: 332 HHP-SPYSPYVGAPLTPAWPAGPFETPVLHSLQSRAGAPLPVPRGPSADLLEDLSESREC 390
           HHP + Y PYV     P + +G F    L             P+  S       S  REC
Sbjct: 216 HHPITTYPPYV-----PEYSSGLFPPSSLLGGSPTGFGCKSRPKARS-------STGREC 263

Query: 391 VNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGLSCANCHTT 450
           VNCG+  TPLWRRDGTGHYLCNACGLY KMNG +RPLIKP++R+ ++RR G SCANC TT
Sbjct: 264 VNCGATSTPLWRRDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCANCQTT 323

Query: 451 TTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMKKEGIQTRKRKPKNINKSKTCSGNSNN 510
           TTTLWRRNA G+PVCNACGLY KLH + RPL MKKEGIQTR RK    +KSK C    ++
Sbjct: 324 TTTLWRRNANGDPVCNACGLYYKLHNINRPLTMKKEGIQTRNRKMS--SKSKKCK-KVHD 380

Query: 511 SIPMTPTSTSSNSDDCSKNTS 531
           S+   P ++S N    S++ S
Sbjct: 381 SLEDFPKNSSFNPAALSRHMS 401


>gi|224611701 GATA binding protein 2 isoform 2 [Homo sapiens]
          Length = 466

 Score =  196 bits (497), Expect = 7e-50
 Identities = 146/389 (37%), Positives = 176/389 (45%), Gaps = 71/389 (18%)

Query: 144 PEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTRVGSML----- 198
           P+E+      L SQG   Y          A  A A A  + SP +   T  G M      
Sbjct: 42  PDEVDVFFNHLDSQGNPYY----------ANPAHARARVSYSPAHARLTG-GQMCRPHLL 90

Query: 199 --PGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPY----------------GSGGG 240
             PGLP+     G   A  A  A  H  W  +     P                 G+GGG
Sbjct: 91  HSPGLPWL---DGGKAALSAAAAHHHNPWTVSPFSKTPLHPSAAGGPGGPLSVYPGAGGG 147

Query: 241 AAGGGAAGPGGAGSAAAHVSAR-FPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSS 299
           + GG  +        AAH  +  F + P+PP          G A+  S  AGG +  G  
Sbjct: 148 SGGGSGSSVASLTPTAAHSGSHLFGFPPTPPKEVSPDPSTTGAASPASSSAGGSAARGED 207

Query: 300 LAAMGGREPQYSSL-----SAARPLNGTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAGPF 354
              +  +     S+     S  RP   T       HH  P+ Y  YV A       A  +
Sbjct: 208 KDGVKYQVSLTESMKMESGSPLRPGLATMGTQPATHHPIPT-YPSYVPAA------AHDY 260

Query: 355 ETPVLHS---LQSRAGAPLPVPRGPSADLLEDLSESRECVNCGSIQTPLWRRDGTGHYLC 411
            + + H    L   A +  P  R  +       SE RECVNCG+  TPLWRRDGTGHYLC
Sbjct: 261 SSGLFHPGGFLGGPASSFTPKQRSKA----RSCSEGRECVNCGATATPLWRRDGTGHYLC 316

Query: 412 NACGLYSKMNGLSRPLIKPQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLY 471
           NACGLY KMNG +RPLIKP++R+              TTTTTLWRRNA G+PVCNACGLY
Sbjct: 317 NACGLYHKMNGQNRPLIKPKRRL--------------TTTTTLWRRNANGDPVCNACGLY 362

Query: 472 MKLHGVPRPLAMKKEGIQTRKRKPKNINK 500
            KLH V RPL MKKEGIQTR RK  N +K
Sbjct: 363 YKLHNVNRPLTMKKEGIQTRNRKMSNKSK 391


>gi|4503925 GATA binding protein 1 [Homo sapiens]
          Length = 413

 Score =  192 bits (487), Expect = 1e-48
 Identities = 108/230 (46%), Positives = 127/230 (55%), Gaps = 27/230 (11%)

Query: 334 PSPYSPYVGAPLTPAW--PAG-PFETPVLHSLQSRAGAPLPVPRGPSADLLEDLSESREC 390
           P P S Y G   +  +  P G P  +    S + R   PLP              E+REC
Sbjct: 157 PVPNSAYGGPDFSSTFFSPTGSPLNSAAYSSPKLRGTLPLPP------------CEAREC 204

Query: 391 VNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPLIKPQKRVPSSRRLGLSCANCHTT 450
           VNCG+  TPLWRRD TGHYLCNACGLY KMNG +RPLI+P+KR+  S+R G  C NC TT
Sbjct: 205 VNCGATATPLWRRDRTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCTNCQTT 264

Query: 451 TTTLWRRNAEGEPVCNACGLYMKLHGVPRPLAMKKEGIQTRKRKPKNINKSKTCSGNSNN 510
           TTTLWRRNA G+PVCNACGLY KLH V RPL M+K+GIQTR RK     K K  S     
Sbjct: 265 TTTLWRRNASGDPVCNACGLYYKLHQVNRPLTMRKDGIQTRNRKASGKGKKKRGSSLGGT 324

Query: 511 SIPMTP------TSTSSNSDDCSKNTS------PTTQPTASGAGAPVMTG 548
                P       +  S S +C +  S      P T     G G  V++G
Sbjct: 325 GAAEGPAGGFMVVAGGSGSGNCGEVASGLTLGPPGTAHLYQGLGPVVLSG 374


>gi|239757043 PREDICTED: functional smad suppressing element 18
           [Homo sapiens]
          Length = 996

 Score = 85.5 bits (210), Expect = 1e-16
 Identities = 75/262 (28%), Positives = 98/262 (37%), Gaps = 40/262 (15%)

Query: 127 LLWSSR--GAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAAS 184
           + W  R  G    P   + P +    L     + PA    A   F+  A    AA +A +
Sbjct: 469 MFWPPRTPGGLPVPTYLQPPPQPPSALGCALGESPALLRQA---FLDLAEPGGAAGSAEA 525

Query: 185 SPVYVPTTRVGSMLPGLPYHL--QGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAA 242
           +P            PG P  +   G GSGP   AGGAG+      A  +SPP GSGG  +
Sbjct: 526 APP-----------PGQPPQVVANGPGSGPPPPAGGAGSR----DALFESPPGGSGGDCS 570

Query: 243 GGG--------AAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVS 294
            G         AAG G A + +    +R P    P +  G     G Y  + +    G  
Sbjct: 571 AGSTPPADSVAAAGAGAAAAGSGPAGSRVPAPHHPHLLEGRKAGGGSYHHSSAFRPVGGK 630

Query: 295 GGGSSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAGPF 354
               SLA + G        +   P +  + HHHHHHHH P P SP +  P  P  P    
Sbjct: 631 DDAESLAKLHGASAGAPHSAQTHPHHHHHPHHHHHHHHPPQPPSPLLLLPPQPDEPGSER 690

Query: 355 ETPVLHSLQSRAGAPLPVPRGP 376
             P          AP P P  P
Sbjct: 691 HHP----------APPPPPPPP 702



 Score = 33.1 bits (74), Expect = 0.74
 Identities = 32/114 (28%), Positives = 38/114 (33%), Gaps = 37/114 (32%)

Query: 229 SADSPPYGSGGGAAGGG-------AAGPGGAGSAAAHVSARFPYSPSP------------ 269
           +A S   G+GGG AGGG        AG G    A A     +P  P P            
Sbjct: 339 AAASGGAGTGGGGAGGGCVAGVGVGAGAGAGAGAGAKGPRSYPVIPVPSKGSFGGVLQKF 398

Query: 270 -------------PMANGA-----AREPGGYAAAGSGGAGGVSGGGSSLAAMGG 305
                        P A  A      +E  G AA   GGAG    G +  A + G
Sbjct: 399 PGCGGLFPHPYTFPAAAAAFSLCHKKEDAGAAAEALGGAGAGGAGAAPKAGLSG 452



 Score = 31.6 bits (70), Expect = 2.2
 Identities = 41/130 (31%), Positives = 54/130 (41%), Gaps = 11/130 (8%)

Query: 15  GAAG-ADASDSRAFP----AREPSTPPSPISSSSSSCSRGGERGPGGAS-NCG---TPQL 65
           GAAG A+A+     P    A  P + P P +  + S     E  PGG+  +C    TP  
Sbjct: 518 GAAGSAEAAPPPGQPPQVVANGPGSGPPPPAGGAGSRDALFESPPGGSGGDCSAGSTPPA 577

Query: 66  DTEAAAGPPARSLLLSSYASHPFGAPHGPS-APGVAGPGGNLSSWEDLLLFTDLDQAATA 124
           D+ AAAG  A +   S  A     APH P    G    GG+             D A + 
Sbjct: 578 DSVAAAGAGAAAAG-SGPAGSRVPAPHHPHLLEGRKAGGGSYHHSSAFRPVGGKDDAESL 636

Query: 125 SKLLWSSRGA 134
           +KL  +S GA
Sbjct: 637 AKLHGASAGA 646


>gi|239751555 PREDICTED: functional smad suppressing element 18
           [Homo sapiens]
          Length = 1252

 Score = 85.5 bits (210), Expect = 1e-16
 Identities = 75/262 (28%), Positives = 98/262 (37%), Gaps = 40/262 (15%)

Query: 127 LLWSSR--GAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAAS 184
           + W  R  G    P   + P +    L     + PA    A   F+  A    AA +A +
Sbjct: 469 MFWPPRTPGGLPVPTYLQPPPQPPSALGCALGESPALLRQA---FLDLAEPGGAAGSAEA 525

Query: 185 SPVYVPTTRVGSMLPGLPYHL--QGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAA 242
           +P            PG P  +   G GSGP   AGGAG+      A  +SPP GSGG  +
Sbjct: 526 APP-----------PGQPPQVVANGPGSGPPPPAGGAGSR----DALFESPPGGSGGDCS 570

Query: 243 GGG--------AAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVS 294
            G         AAG G A + +    +R P    P +  G     G Y  + +    G  
Sbjct: 571 AGSTPPADSVAAAGAGAAAAGSGPAGSRVPAPHHPHLLEGRKAGGGSYHHSSAFRPVGGK 630

Query: 295 GGGSSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAGPF 354
               SLA + G        +   P +  + HHHHHHHH P P SP +  P  P  P    
Sbjct: 631 DDAESLAKLHGASAGAPHSAQTHPHHHHHPHHHHHHHHPPQPPSPLLLLPPQPDEPGSER 690

Query: 355 ETPVLHSLQSRAGAPLPVPRGP 376
             P          AP P P  P
Sbjct: 691 HHP----------APPPPPPPP 702



 Score = 33.1 bits (74), Expect = 0.74
 Identities = 32/114 (28%), Positives = 38/114 (33%), Gaps = 37/114 (32%)

Query: 229 SADSPPYGSGGGAAGGG-------AAGPGGAGSAAAHVSARFPYSPSP------------ 269
           +A S   G+GGG AGGG        AG G    A A     +P  P P            
Sbjct: 339 AAASGGAGTGGGGAGGGCVAGVGVGAGAGAGAGAGAKGPRSYPVIPVPSKGSFGGVLQKF 398

Query: 270 -------------PMANGA-----AREPGGYAAAGSGGAGGVSGGGSSLAAMGG 305
                        P A  A      +E  G AA   GGAG    G +  A + G
Sbjct: 399 PGCGGLFPHPYTFPAAAAAFSLCHKKEDAGAAAEALGGAGAGGAGAAPKAGLSG 452



 Score = 31.6 bits (70), Expect = 2.2
 Identities = 41/130 (31%), Positives = 54/130 (41%), Gaps = 11/130 (8%)

Query: 15  GAAG-ADASDSRAFP----AREPSTPPSPISSSSSSCSRGGERGPGGAS-NCG---TPQL 65
           GAAG A+A+     P    A  P + P P +  + S     E  PGG+  +C    TP  
Sbjct: 518 GAAGSAEAAPPPGQPPQVVANGPGSGPPPPAGGAGSRDALFESPPGGSGGDCSAGSTPPA 577

Query: 66  DTEAAAGPPARSLLLSSYASHPFGAPHGPS-APGVAGPGGNLSSWEDLLLFTDLDQAATA 124
           D+ AAAG  A +   S  A     APH P    G    GG+             D A + 
Sbjct: 578 DSVAAAGAGAAAAG-SGPAGSRVPAPHHPHLLEGRKAGGGSYHHSSAFRPVGGKDDAESL 636

Query: 125 SKLLWSSRGA 134
           +KL  +S GA
Sbjct: 637 AKLHGASAGA 646


>gi|239746067 PREDICTED: functional smad suppressing element 18
           [Homo sapiens]
          Length = 1252

 Score = 85.5 bits (210), Expect = 1e-16
 Identities = 75/262 (28%), Positives = 98/262 (37%), Gaps = 40/262 (15%)

Query: 127 LLWSSR--GAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAAS 184
           + W  R  G    P   + P +    L     + PA    A   F+  A    AA +A +
Sbjct: 469 MFWPPRTPGGLPVPTYLQPPPQPPSALGCALGESPALLRQA---FLDLAEPGGAAGSAEA 525

Query: 185 SPVYVPTTRVGSMLPGLPYHL--QGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAA 242
           +P            PG P  +   G GSGP   AGGAG+      A  +SPP GSGG  +
Sbjct: 526 APP-----------PGQPPQVVANGPGSGPPPPAGGAGSR----DALFESPPGGSGGDCS 570

Query: 243 GGG--------AAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVS 294
            G         AAG G A + +    +R P    P +  G     G Y  + +    G  
Sbjct: 571 AGSTPPADSVAAAGAGAAAAGSGPAGSRVPAPHHPHLLEGRKAGGGSYHHSSAFRPVGGK 630

Query: 295 GGGSSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAGPF 354
               SLA + G        +   P +  + HHHHHHHH P P SP +  P  P  P    
Sbjct: 631 DDAESLAKLHGASAGAPHSAQTHPHHHHHPHHHHHHHHPPQPPSPLLLLPPQPDEPGSER 690

Query: 355 ETPVLHSLQSRAGAPLPVPRGP 376
             P          AP P P  P
Sbjct: 691 HHP----------APPPPPPPP 702



 Score = 33.1 bits (74), Expect = 0.74
 Identities = 32/114 (28%), Positives = 38/114 (33%), Gaps = 37/114 (32%)

Query: 229 SADSPPYGSGGGAAGGG-------AAGPGGAGSAAAHVSARFPYSPSP------------ 269
           +A S   G+GGG AGGG        AG G    A A     +P  P P            
Sbjct: 339 AAASGGAGTGGGGAGGGCVAGVGVGAGAGAGAGAGAKGPRSYPVIPVPSKGSFGGVLQKF 398

Query: 270 -------------PMANGA-----AREPGGYAAAGSGGAGGVSGGGSSLAAMGG 305
                        P A  A      +E  G AA   GGAG    G +  A + G
Sbjct: 399 PGCGGLFPHPYTFPAAAAAFSLCHKKEDAGAAAEALGGAGAGGAGAAPKAGLSG 452



 Score = 31.6 bits (70), Expect = 2.2
 Identities = 41/130 (31%), Positives = 54/130 (41%), Gaps = 11/130 (8%)

Query: 15  GAAG-ADASDSRAFP----AREPSTPPSPISSSSSSCSRGGERGPGGAS-NCG---TPQL 65
           GAAG A+A+     P    A  P + P P +  + S     E  PGG+  +C    TP  
Sbjct: 518 GAAGSAEAAPPPGQPPQVVANGPGSGPPPPAGGAGSRDALFESPPGGSGGDCSAGSTPPA 577

Query: 66  DTEAAAGPPARSLLLSSYASHPFGAPHGPS-APGVAGPGGNLSSWEDLLLFTDLDQAATA 124
           D+ AAAG  A +   S  A     APH P    G    GG+             D A + 
Sbjct: 578 DSVAAAGAGAAAAG-SGPAGSRVPAPHHPHLLEGRKAGGGSYHHSSAFRPVGGKDDAESL 636

Query: 125 SKLLWSSRGA 134
           +KL  +S GA
Sbjct: 637 AKLHGASAGA 646


>gi|5453936 POU class 3 homeobox 3 [Homo sapiens]
          Length = 500

 Score = 80.5 bits (197), Expect = 4e-15
 Identities = 64/195 (32%), Positives = 77/195 (39%), Gaps = 39/195 (20%)

Query: 173 AAAAAAAAAAASSPVYVPTTRVGSMLPGLPYH-------------LQGSGSGPANHAGGA 219
           AAAAAAAAAAA+     P +     + G P               ++G       HAG A
Sbjct: 104 AAAAAAAAAAAAVEASSPWSGSAVGMAGSPQQPPQPPPPPPQGPDVKGGAGRDDLHAGTA 163

Query: 220 GAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPY---SPSPPMANGAA 276
             H G P      PP   G    G GAA    A +AAA  +A  P       PP  +   
Sbjct: 164 LHHRGPPHLGPPPPPPHQGH-PGGWGAAAAAAAAAAAAAAAAHLPSMAGGQQPPPQSLLY 222

Query: 277 REPGGYAAAG--------SGGAGGVSGGGSSLAAMG---GREPQYSSLSAARPLNGTYHH 325
            +PGG+   G         GG GG  GG  SL   G   G  P+ +            HH
Sbjct: 223 SQPGGFTVNGMLSAPPGPGGGGGGAGGGAQSLVHPGLVRGDTPELAE-----------HH 271

Query: 326 HHHHHHHHPSPYSPY 340
           HHHHHH HP P  P+
Sbjct: 272 HHHHHHAHPHPPHPH 286



 Score = 55.1 bits (131), Expect = 2e-07
 Identities = 79/285 (27%), Positives = 103/285 (36%), Gaps = 67/285 (23%)

Query: 49  GGERGPGGASNCGTPQLDTEAAAGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSS 108
           GG  G GG    G+  + + A  G P+   ++ S          G  A   A  GG++ S
Sbjct: 42  GGAGGGGGGMQPGSAAVTSGAYRGDPSSVKMVQS------DFMQGAMA---ASNGGHMLS 92

Query: 109 ----WEDLLLFTDLDQAATASKLL-----WSSRGAKL--SPFAPEQP------------- 144
               W   L       AA A+  +     WS     +  SP  P QP             
Sbjct: 93  HAHQWVTALPHAAAAAAAAAAAAVEASSPWSGSAVGMAGSPQQPPQPPPPPPQGPDVKGG 152

Query: 145 ---EEMYQTLAA-------LSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTRV 194
              ++++   A        L    P  + G PGG+  +AAAAAAAAAAA++  ++P+   
Sbjct: 153 AGRDDLHAGTALHHRGPPHLGPPPPPPHQGHPGGWGAAAAAAAAAAAAAAA-AHLPSMAG 211

Query: 195 GSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAG------ 248
           G   P  P  L  S  G     G   A PG           G GGG AGGGA        
Sbjct: 212 GQQPP--PQSLLYSQPGGFTVNGMLSAPPG----------PGGGGGGAGGGAQSLVHPGL 259

Query: 249 -----PGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSG 288
                P  A     H     P+ P P  A G     GG   AG G
Sbjct: 260 VRGDTPELAEHHHHHHHHAHPHPPHPHHAQGPPHHGGGGGGAGPG 304



 Score = 35.4 bits (80), Expect = 0.15
 Identities = 50/191 (26%), Positives = 59/191 (30%), Gaps = 52/191 (27%)

Query: 236 GSGGGAAGGGAAGPGGAG--------SAAAHVSARFPYSPSPP-------MANGAAREPG 280
           G+GGG  GGG  G GGAG         +AA  S  +   PS         M    A   G
Sbjct: 28  GAGGGGGGGGGGGGGGAGGGGGGMQPGSAAVTSGAYRGDPSSVKMVQSDFMQGAMAASNG 87

Query: 281 GY----------------AAAGSGGAGGV------SGGGSSLAAMGGREPQYSSLSAARP 318
           G+                AAA +  A  V      SG    +A    + PQ        P
Sbjct: 88  GHMLSHAHQWVTALPHAAAAAAAAAAAAVEASSPWSGSAVGMAGSPQQPPQPPPPPPQGP 147

Query: 319 -----------LNGTYHHHHHHHHHHPSPYSPYVGAP----LTPAWPAGPFETPVLHSLQ 363
                        GT  HH    H  P P  P+ G P       A  A          L 
Sbjct: 148 DVKGGAGRDDLHAGTALHHRGPPHLGPPPPPPHQGHPGGWGAAAAAAAAAAAAAAAAHLP 207

Query: 364 SRAGAPLPVPR 374
           S AG   P P+
Sbjct: 208 SMAGGQQPPPQ 218



 Score = 32.3 bits (72), Expect = 1.3
 Identities = 18/57 (31%), Positives = 28/57 (49%), Gaps = 1/57 (1%)

Query: 260 SARFPYSPSPPM-ANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQYSSLSA 315
           +A  PY P   + A G+        A G GG GG  GGG +    GG +P  +++++
Sbjct: 4   AASNPYLPGNSLLAAGSIVHSDAAGAGGGGGGGGGGGGGGAGGGGGGMQPGSAAVTS 60


>gi|90652851 zinc finger transcription factor TRPS1 [Homo sapiens]
          Length = 1294

 Score = 72.4 bits (176), Expect = 1e-12
 Identities = 38/82 (46%), Positives = 49/82 (59%), Gaps = 5/82 (6%)

Query: 421 NGLSRPLIKPQKRVPSSRRLGLSCANCHTTTTTLWRRNAEGEPVCNACGLYMKLHGVPRP 480
           +G ++   + Q  +   R  G+ CANC TT T+LWR+NA G  VCNACGLY KLH  PRP
Sbjct: 886 SGENKSKDESQSLLRRRRGSGVFCANCLTTKTSLWRKNANGGYVCNACGLYQKLHSTPRP 945

Query: 481 LAMKKEG-----IQTRKRKPKN 497
           L + K+      I+ R RK  N
Sbjct: 946 LNIIKQNNGEQIIRRRTRKRLN 967



 Score = 53.5 bits (127), Expect = 5e-07
 Identities = 20/38 (52%), Positives = 27/38 (71%)

Query: 390 CVNCGSIQTPLWRRDGTGHYLCNACGLYSKMNGLSRPL 427
           C NC + +T LWR++  G Y+CNACGLY K++   RPL
Sbjct: 909 CANCLTTKTSLWRKNANGGYVCNACGLYQKLHSTPRPL 946


>gi|40068464 AT rich interactive domain 1B (SWI1-like) isoform 2
           [Homo sapiens]
          Length = 2191

 Score = 62.8 bits (151), Expect = 9e-10
 Identities = 93/364 (25%), Positives = 122/364 (33%), Gaps = 88/364 (24%)

Query: 38  PISSSSSSCSRGGERGPGGASNCGTPQLDTEAAAGPPARSLLLSSYASHPFGAPHGPSAP 97
           P    +   + GG+  P G      P  + +A   PP           HP     G   P
Sbjct: 99  PQHGGAKDSAAGGQADPPGPPLLSKPGDEDDA---PPKMGEPAGGRYEHPGLGALGTQQP 155

Query: 98  GVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQ 157
            VA PGG                                 P A  +    Y + AA +S 
Sbjct: 156 PVAVPGGGGG------------------------------PAAVPEFNNYYGS-AAPASG 184

Query: 158 GPAAYDG----------APG-GFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQ 206
           GP    G          +PG G +HSA+AAAA A              GSM P    H +
Sbjct: 185 GPGGRAGPCFDQHGGQQSPGMGMMHSASAAAAGAP-------------GSMDPLQNSH-E 230

Query: 207 GSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYS 266
           G  +   NH      +PG+ +  A     G GGG  GGG+ G GG G A A  +     +
Sbjct: 231 GYPNSQCNH------YPGYSRPGAGGG--GGGGGGGGGGSGGGGGGGGAGAGGAGAGAVA 282

Query: 267 PSPPMANGAAREPGGYAAAGSGGAGGV------SGGGSSLAAMGGREPQYSSLSAARPLN 320
            +   A  AA   GG    GS    GV       GGG  +   GG     S  +A     
Sbjct: 283 AAAAAAAAAAGGGGGGGYGGSSAGYGVLSSPRQQGGGMMMGPGGGGAASLSKAAAGSAAG 342

Query: 321 GTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAGPFETPVLHSL------QSRAGAPLPVPR 374
           G         + HPS  +P +   LT         +P++ S        S   AP P P 
Sbjct: 343 G--FQRFAGQNQHPSGATPTLNQLLTS-------PSPMMRSYGGSYPEYSSPSAPPPPPS 393

Query: 375 GPSA 378
            P +
Sbjct: 394 QPQS 397



 Score = 38.9 bits (89), Expect = 0.014
 Identities = 52/208 (25%), Positives = 70/208 (33%), Gaps = 22/208 (10%)

Query: 148 YQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQG 207
           Y  L++   QG     G  GG   S + AAA +AA     +    +  S        L  
Sbjct: 307 YGVLSSPRQQGGGMMMGPGGGGAASLSKAAAGSAAGGFQRFAGQNQHPSGATPTLNQLLT 366

Query: 208 SGSGPANHAGGAGAHPGWPQASADSPPYG---SGGGAAGGGAAGPGGAGSAAAHVSARFP 264
           S S      GG+  +P +   SA  PP     S   AAG  A G   A            
Sbjct: 367 SPSPMMRSYGGS--YPEYSSPSAPPPPPSQPQSQAAAAGAAAGGQQAAAGMGLGKDMGAQ 424

Query: 265 YSPSPPMANGAAREPGGYAAAGSGGAGGVSG--GGSSLAAMGGREPQYSSLSAARPLNGT 322
           Y+ + P    AA +   + A   G  G   G   GS +  M  + PQ   + +       
Sbjct: 425 YAAASPA--WAAAQQRSHPAMSPGTPGPTMGRSQGSPMDPMVMKRPQLYGMGS------- 475

Query: 323 YHHHHHHHHHHPSPYSPYVGAPLTPAWP 350
                 + H  P   SPY G    P  P
Sbjct: 476 ------NPHSQPQQSSPYPGGSYGPPGP 497



 Score = 37.7 bits (86), Expect = 0.030
 Identities = 42/158 (26%), Positives = 55/158 (34%), Gaps = 39/158 (24%)

Query: 214 NHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGG--------------------AG 253
           N  GGAG     P    + P +G    +A GG A P G                    AG
Sbjct: 80  NSLGGAGGGAPQPGPDMEQPQHGGAKDSAAGGQADPPGPPLLSKPGDEDDAPPKMGEPAG 139

Query: 254 SAAAHVSARFPYSPSPPMA-----NGAAREPG-----GYAAAGSGGAGGVSG------GG 297
               H       +  PP+A      G A  P      G AA  SGG GG +G      GG
Sbjct: 140 GRYEHPGLGALGTQQPPVAVPGGGGGPAAVPEFNNYYGSAAPASGGPGGRAGPCFDQHGG 199

Query: 298 SSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPS 335
                MG     +S+ +AA    G+     + H  +P+
Sbjct: 200 QQSPGMG---MMHSASAAAAGAPGSMDPLQNSHEGYPN 234



 Score = 34.7 bits (78), Expect = 0.26
 Identities = 12/19 (63%), Positives = 13/19 (68%)

Query: 315 AARPLNGTYHHHHHHHHHH 333
           AA P    +HHHH HHHHH
Sbjct: 19  AAPPHQQHHHHHHAHHHHH 37



 Score = 32.0 bits (71), Expect = 1.7
 Identities = 57/278 (20%), Positives = 87/278 (31%), Gaps = 38/278 (13%)

Query: 28  PAREPSTPPSPISSSSSSCSRGGERGPGGASNCGTPQLDTEAAAGPPARSLLLSSYASHP 87
           P    +T  S +S+S S+ S+G +  P                    A+S      + H 
Sbjct: 631 PTGTEATLSSAVSASGSTSSQGDQSNP--------------------AQSPFSPHASPHL 670

Query: 88  FGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEEM 147
              P GPS   V  P G+             +Q+ +      S  G+++ P  P    E 
Sbjct: 671 SSIPGGPSPSPVGSPVGS-------------NQSRSGPISPASIPGSQMPPQPPGSQSES 717

Query: 148 YQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQG 207
             +  ALS        G   G   +   A          +    +  G M  G+    Q 
Sbjct: 718 -SSHPALSQSPMPQERGFMAGTQRNPQMAQYGPQQTGPSMSPHPSPGGQMHAGISSFQQS 776

Query: 208 SGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYSP 267
           + SG      G       PQ +   PP  SG  +A     GPG   SA   +  + P  P
Sbjct: 777 NSSGTY----GPQMSQYGPQGNYSRPPAYSGVPSASYSGPGPGMGISANNQMHGQGPSQP 832

Query: 268 SPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGG 305
              +  G     G       G    ++     ++  GG
Sbjct: 833 CGAVPLGRMPSAGMQNRPFPGNMSSMTPSSPGMSQQGG 870



 Score = 29.6 bits (65), Expect = 8.2
 Identities = 10/14 (71%), Positives = 10/14 (71%), Gaps = 4/14 (28%)

Query: 324 HHHHHH----HHHH 333
           HHHHHH    HHHH
Sbjct: 33  HHHHHHAHHLHHHH 46


>gi|40068466 AT rich interactive domain 1B (SWI1-like) isoform 1
           [Homo sapiens]
          Length = 2231

 Score = 62.8 bits (151), Expect = 9e-10
 Identities = 93/364 (25%), Positives = 122/364 (33%), Gaps = 88/364 (24%)

Query: 38  PISSSSSSCSRGGERGPGGASNCGTPQLDTEAAAGPPARSLLLSSYASHPFGAPHGPSAP 97
           P    +   + GG+  P G      P  + +A   PP           HP     G   P
Sbjct: 99  PQHGGAKDSAAGGQADPPGPPLLSKPGDEDDA---PPKMGEPAGGRYEHPGLGALGTQQP 155

Query: 98  GVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQ 157
            VA PGG                                 P A  +    Y + AA +S 
Sbjct: 156 PVAVPGGGGG------------------------------PAAVPEFNNYYGS-AAPASG 184

Query: 158 GPAAYDG----------APG-GFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQ 206
           GP    G          +PG G +HSA+AAAA A              GSM P    H +
Sbjct: 185 GPGGRAGPCFDQHGGQQSPGMGMMHSASAAAAGAP-------------GSMDPLQNSH-E 230

Query: 207 GSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYS 266
           G  +   NH      +PG+ +  A     G GGG  GGG+ G GG G A A  +     +
Sbjct: 231 GYPNSQCNH------YPGYSRPGAGGG--GGGGGGGGGGSGGGGGGGGAGAGGAGAGAVA 282

Query: 267 PSPPMANGAAREPGGYAAAGSGGAGGV------SGGGSSLAAMGGREPQYSSLSAARPLN 320
            +   A  AA   GG    GS    GV       GGG  +   GG     S  +A     
Sbjct: 283 AAAAAAAAAAGGGGGGGYGGSSAGYGVLSSPRQQGGGMMMGPGGGGAASLSKAAAGSAAG 342

Query: 321 GTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAGPFETPVLHSL------QSRAGAPLPVPR 374
           G         + HPS  +P +   LT         +P++ S        S   AP P P 
Sbjct: 343 G--FQRFAGQNQHPSGATPTLNQLLTS-------PSPMMRSYGGSYPEYSSPSAPPPPPS 393

Query: 375 GPSA 378
            P +
Sbjct: 394 QPQS 397



 Score = 38.9 bits (89), Expect = 0.014
 Identities = 52/208 (25%), Positives = 70/208 (33%), Gaps = 22/208 (10%)

Query: 148 YQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQG 207
           Y  L++   QG     G  GG   S + AAA +AA     +    +  S        L  
Sbjct: 307 YGVLSSPRQQGGGMMMGPGGGGAASLSKAAAGSAAGGFQRFAGQNQHPSGATPTLNQLLT 366

Query: 208 SGSGPANHAGGAGAHPGWPQASADSPPYG---SGGGAAGGGAAGPGGAGSAAAHVSARFP 264
           S S      GG+  +P +   SA  PP     S   AAG  A G   A            
Sbjct: 367 SPSPMMRSYGGS--YPEYSSPSAPPPPPSQPQSQAAAAGAAAGGQQAAAGMGLGKDMGAQ 424

Query: 265 YSPSPPMANGAAREPGGYAAAGSGGAGGVSG--GGSSLAAMGGREPQYSSLSAARPLNGT 322
           Y+ + P    AA +   + A   G  G   G   GS +  M  + PQ   + +       
Sbjct: 425 YAAASPA--WAAAQQRSHPAMSPGTPGPTMGRSQGSPMDPMVMKRPQLYGMGS------- 475

Query: 323 YHHHHHHHHHHPSPYSPYVGAPLTPAWP 350
                 + H  P   SPY G    P  P
Sbjct: 476 ------NPHSQPQQSSPYPGGSYGPPGP 497



 Score = 37.7 bits (86), Expect = 0.030
 Identities = 42/158 (26%), Positives = 55/158 (34%), Gaps = 39/158 (24%)

Query: 214 NHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGG--------------------AG 253
           N  GGAG     P    + P +G    +A GG A P G                    AG
Sbjct: 80  NSLGGAGGGAPQPGPDMEQPQHGGAKDSAAGGQADPPGPPLLSKPGDEDDAPPKMGEPAG 139

Query: 254 SAAAHVSARFPYSPSPPMA-----NGAAREPG-----GYAAAGSGGAGGVSG------GG 297
               H       +  PP+A      G A  P      G AA  SGG GG +G      GG
Sbjct: 140 GRYEHPGLGALGTQQPPVAVPGGGGGPAAVPEFNNYYGSAAPASGGPGGRAGPCFDQHGG 199

Query: 298 SSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPS 335
                MG     +S+ +AA    G+     + H  +P+
Sbjct: 200 QQSPGMG---MMHSASAAAAGAPGSMDPLQNSHEGYPN 234



 Score = 34.7 bits (78), Expect = 0.26
 Identities = 12/19 (63%), Positives = 13/19 (68%)

Query: 315 AARPLNGTYHHHHHHHHHH 333
           AA P    +HHHH HHHHH
Sbjct: 19  AAPPHQQHHHHHHAHHHHH 37



 Score = 32.0 bits (71), Expect = 1.7
 Identities = 57/278 (20%), Positives = 87/278 (31%), Gaps = 38/278 (13%)

Query: 28  PAREPSTPPSPISSSSSSCSRGGERGPGGASNCGTPQLDTEAAAGPPARSLLLSSYASHP 87
           P    +T  S +S+S S+ S+G +  P                    A+S      + H 
Sbjct: 618 PTGTEATLSSAVSASGSTSSQGDQSNP--------------------AQSPFSPHASPHL 657

Query: 88  FGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEEM 147
              P GPS   V  P G+             +Q+ +      S  G+++ P  P    E 
Sbjct: 658 SSIPGGPSPSPVGSPVGS-------------NQSRSGPISPASIPGSQMPPQPPGSQSES 704

Query: 148 YQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQG 207
             +  ALS        G   G   +   A          +    +  G M  G+    Q 
Sbjct: 705 -SSHPALSQSPMPQERGFMAGTQRNPQMAQYGPQQTGPSMSPHPSPGGQMHAGISSFQQS 763

Query: 208 SGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYSP 267
           + SG      G       PQ +   PP  SG  +A     GPG   SA   +  + P  P
Sbjct: 764 NSSGTY----GPQMSQYGPQGNYSRPPAYSGVPSASYSGPGPGMGISANNQMHGQGPSQP 819

Query: 268 SPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGG 305
              +  G     G       G    ++     ++  GG
Sbjct: 820 CGAVPLGRMPSAGMQNRPFPGNMSSMTPSSPGMSQQGG 857



 Score = 29.6 bits (65), Expect = 8.2
 Identities = 10/14 (71%), Positives = 10/14 (71%), Gaps = 4/14 (28%)

Query: 324 HHHHHH----HHHH 333
           HHHHHH    HHHH
Sbjct: 33  HHHHHHAHHLHHHH 46


>gi|40068462 AT rich interactive domain 1B (SWI1-like) isoform 3
           [Homo sapiens]
          Length = 2178

 Score = 62.8 bits (151), Expect = 9e-10
 Identities = 93/364 (25%), Positives = 122/364 (33%), Gaps = 88/364 (24%)

Query: 38  PISSSSSSCSRGGERGPGGASNCGTPQLDTEAAAGPPARSLLLSSYASHPFGAPHGPSAP 97
           P    +   + GG+  P G      P  + +A   PP           HP     G   P
Sbjct: 99  PQHGGAKDSAAGGQADPPGPPLLSKPGDEDDA---PPKMGEPAGGRYEHPGLGALGTQQP 155

Query: 98  GVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQ 157
            VA PGG                                 P A  +    Y + AA +S 
Sbjct: 156 PVAVPGGGGG------------------------------PAAVPEFNNYYGS-AAPASG 184

Query: 158 GPAAYDG----------APG-GFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQ 206
           GP    G          +PG G +HSA+AAAA A              GSM P    H +
Sbjct: 185 GPGGRAGPCFDQHGGQQSPGMGMMHSASAAAAGAP-------------GSMDPLQNSH-E 230

Query: 207 GSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYS 266
           G  +   NH      +PG+ +  A     G GGG  GGG+ G GG G A A  +     +
Sbjct: 231 GYPNSQCNH------YPGYSRPGAGGG--GGGGGGGGGGSGGGGGGGGAGAGGAGAGAVA 282

Query: 267 PSPPMANGAAREPGGYAAAGSGGAGGV------SGGGSSLAAMGGREPQYSSLSAARPLN 320
            +   A  AA   GG    GS    GV       GGG  +   GG     S  +A     
Sbjct: 283 AAAAAAAAAAGGGGGGGYGGSSAGYGVLSSPRQQGGGMMMGPGGGGAASLSKAAAGSAAG 342

Query: 321 GTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAGPFETPVLHSL------QSRAGAPLPVPR 374
           G         + HPS  +P +   LT         +P++ S        S   AP P P 
Sbjct: 343 G--FQRFAGQNQHPSGATPTLNQLLTS-------PSPMMRSYGGSYPEYSSPSAPPPPPS 393

Query: 375 GPSA 378
            P +
Sbjct: 394 QPQS 397



 Score = 38.9 bits (89), Expect = 0.014
 Identities = 52/208 (25%), Positives = 70/208 (33%), Gaps = 22/208 (10%)

Query: 148 YQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQG 207
           Y  L++   QG     G  GG   S + AAA +AA     +    +  S        L  
Sbjct: 307 YGVLSSPRQQGGGMMMGPGGGGAASLSKAAAGSAAGGFQRFAGQNQHPSGATPTLNQLLT 366

Query: 208 SGSGPANHAGGAGAHPGWPQASADSPPYG---SGGGAAGGGAAGPGGAGSAAAHVSARFP 264
           S S      GG+  +P +   SA  PP     S   AAG  A G   A            
Sbjct: 367 SPSPMMRSYGGS--YPEYSSPSAPPPPPSQPQSQAAAAGAAAGGQQAAAGMGLGKDMGAQ 424

Query: 265 YSPSPPMANGAAREPGGYAAAGSGGAGGVSG--GGSSLAAMGGREPQYSSLSAARPLNGT 322
           Y+ + P    AA +   + A   G  G   G   GS +  M  + PQ   + +       
Sbjct: 425 YAAASPA--WAAAQQRSHPAMSPGTPGPTMGRSQGSPMDPMVMKRPQLYGMGS------- 475

Query: 323 YHHHHHHHHHHPSPYSPYVGAPLTPAWP 350
                 + H  P   SPY G    P  P
Sbjct: 476 ------NPHSQPQQSSPYPGGSYGPPGP 497



 Score = 37.7 bits (86), Expect = 0.030
 Identities = 42/158 (26%), Positives = 55/158 (34%), Gaps = 39/158 (24%)

Query: 214 NHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGG--------------------AG 253
           N  GGAG     P    + P +G    +A GG A P G                    AG
Sbjct: 80  NSLGGAGGGAPQPGPDMEQPQHGGAKDSAAGGQADPPGPPLLSKPGDEDDAPPKMGEPAG 139

Query: 254 SAAAHVSARFPYSPSPPMA-----NGAAREPG-----GYAAAGSGGAGGVSG------GG 297
               H       +  PP+A      G A  P      G AA  SGG GG +G      GG
Sbjct: 140 GRYEHPGLGALGTQQPPVAVPGGGGGPAAVPEFNNYYGSAAPASGGPGGRAGPCFDQHGG 199

Query: 298 SSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPS 335
                MG     +S+ +AA    G+     + H  +P+
Sbjct: 200 QQSPGMG---MMHSASAAAAGAPGSMDPLQNSHEGYPN 234



 Score = 34.7 bits (78), Expect = 0.26
 Identities = 12/19 (63%), Positives = 13/19 (68%)

Query: 315 AARPLNGTYHHHHHHHHHH 333
           AA P    +HHHH HHHHH
Sbjct: 19  AAPPHQQHHHHHHAHHHHH 37



 Score = 32.0 bits (71), Expect = 1.7
 Identities = 57/278 (20%), Positives = 87/278 (31%), Gaps = 38/278 (13%)

Query: 28  PAREPSTPPSPISSSSSSCSRGGERGPGGASNCGTPQLDTEAAAGPPARSLLLSSYASHP 87
           P    +T  S +S+S S+ S+G +  P                    A+S      + H 
Sbjct: 618 PTGTEATLSSAVSASGSTSSQGDQSNP--------------------AQSPFSPHASPHL 657

Query: 88  FGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEEM 147
              P GPS   V  P G+             +Q+ +      S  G+++ P  P    E 
Sbjct: 658 SSIPGGPSPSPVGSPVGS-------------NQSRSGPISPASIPGSQMPPQPPGSQSES 704

Query: 148 YQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQG 207
             +  ALS        G   G   +   A          +    +  G M  G+    Q 
Sbjct: 705 -SSHPALSQSPMPQERGFMAGTQRNPQMAQYGPQQTGPSMSPHPSPGGQMHAGISSFQQS 763

Query: 208 SGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYSP 267
           + SG      G       PQ +   PP  SG  +A     GPG   SA   +  + P  P
Sbjct: 764 NSSGTY----GPQMSQYGPQGNYSRPPAYSGVPSASYSGPGPGMGISANNQMHGQGPSQP 819

Query: 268 SPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGG 305
              +  G     G       G    ++     ++  GG
Sbjct: 820 CGAVPLGRMPSAGMQNRPFPGNMSSMTPSSPGMSQQGG 857



 Score = 29.6 bits (65), Expect = 8.2
 Identities = 10/14 (71%), Positives = 10/14 (71%), Gaps = 4/14 (28%)

Query: 324 HHHHHH----HHHH 333
           HHHHHH    HHHH
Sbjct: 33  HHHHHHAHHLHHHH 46


>gi|22547197 zinc finger protein of the cerebellum 2 [Homo sapiens]
          Length = 532

 Score = 62.4 bits (150), Expect = 1e-09
 Identities = 87/307 (28%), Positives = 110/307 (35%), Gaps = 100/307 (32%)

Query: 71  AGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTD---LDQAATASKL 127
           AGP   ++ + S+A H     H  SA   A     +   E  L       +D AA     
Sbjct: 5   AGPQFPAIGVGSFARH-----HHHSAAAAAAAAAEMQDRELSLAAAQNGFVDSAA----- 54

Query: 128 LWSSRGA-KLSPFAPE-QPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASS 185
             +  GA KL+P A E  P +     +A +SQGP AY G+         AAAAAAAAA  
Sbjct: 55  --AHMGAFKLNPGAHELSPGQS----SAFTSQGPGAYPGS---------AAAAAAAAALG 99

Query: 186 PVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPY---GSGGGAA 242
           P                            HA   G++ G P  S     +   G G  A 
Sbjct: 100 P----------------------------HAAHVGSYSGPPFNSTRDFLFRSRGFGDSAP 131

Query: 243 GGGAAG---PGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSS 299
           GGG  G   PG  G   AH  A+           G    PG     G  G+  V  G   
Sbjct: 132 GGGQHGLFGPGAGGLHHAHSDAQ-----------GHLLFPGLPEQHGPHGSQNVLNGQMR 180

Query: 300 LAAMG---GREPQYSSLSAAR--------------PLN--------GTYHHHHHHHHHHP 334
           L   G   GR  QY  +++ R              P+N            HHHHHHHHHP
Sbjct: 181 LGLPGEVFGRSEQYRQVASPRTDPYSAAQLHNQYGPMNMNMGMNMAAAAAHHHHHHHHHP 240

Query: 335 SPYSPYV 341
             +  Y+
Sbjct: 241 GAFFRYM 247



 Score = 45.4 bits (106), Expect = 1e-04
 Identities = 40/132 (30%), Positives = 50/132 (37%), Gaps = 28/132 (21%)

Query: 170 VHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQAS 229
           VH ++   + ++ A+S  Y  +T  G + P        S   PA  A  A A       S
Sbjct: 414 VHESSPQGSESSPAASSGYESSTPPGLVSPSAEPQ-SSSNLSPAAAAAAAAAAAAAAAVS 472

Query: 230 ADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGG 289
           A     GSG G AGGG+ G  G+G                          GG   AG GG
Sbjct: 473 AVHRGGGSGSGGAGGGSGGGSGSG--------------------------GGGGGAGGGG 506

Query: 290 AGGVSGGGSSLA 301
            G  SGGGS  A
Sbjct: 507 GGS-SGGGSGTA 517



 Score = 44.7 bits (104), Expect = 2e-04
 Identities = 36/110 (32%), Positives = 47/110 (42%), Gaps = 4/110 (3%)

Query: 206 QGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPY 265
           QGS S PA  +G   + P  P   + S    S    +   AA    A +AAA VSA   +
Sbjct: 420 QGSESSPAASSGYESSTP--PGLVSPSAEPQSSSNLSPAAAAAAAAAAAAAAAVSA--VH 475

Query: 266 SPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQYSSLSA 315
                 + GA    GG + +G GG G   GGG S     G    +S LS+
Sbjct: 476 RGGGSGSGGAGGGSGGGSGSGGGGGGAGGGGGGSSGGGSGTAGGHSGLSS 525



 Score = 43.9 bits (102), Expect = 4e-04
 Identities = 42/129 (32%), Positives = 58/129 (44%), Gaps = 26/129 (20%)

Query: 130 SSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYV 189
           S +G++ SP A       Y++        P+A   +      +AAAAAAAAAAA++ V  
Sbjct: 418 SPQGSESSPAASSG----YESSTPPGLVSPSAEPQSSSNLSPAAAAAAAAAAAAAAAVSA 473

Query: 190 PTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGP 249
                GS   G      G GSG  + +GG G               G  GG  GGG++G 
Sbjct: 474 VHRGGGSGSGGA-----GGGSGGGSGSGGGG---------------GGAGGG-GGGSSG- 511

Query: 250 GGAGSAAAH 258
           GG+G+A  H
Sbjct: 512 GGSGTAGGH 520



 Score = 29.6 bits (65), Expect = 8.2
 Identities = 24/85 (28%), Positives = 32/85 (37%), Gaps = 9/85 (10%)

Query: 222 HPGWPQASADSPPYGSG-GGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPG 280
           H   PQ S  SP   SG   +   G   P     +++++S        P  A  AA    
Sbjct: 415 HESSPQGSESSPAASSGYESSTPPGLVSPSAEPQSSSNLS--------PAAAAAAAAAAA 466

Query: 281 GYAAAGSGGAGGVSGGGSSLAAMGG 305
             AA  +   GG SG G +    GG
Sbjct: 467 AAAAVSAVHRGGGSGSGGAGGGSGG 491


>gi|110624765 POU domain, class 3, transcription factor 1 [Homo
           sapiens]
          Length = 451

 Score = 62.0 bits (149), Expect = 1e-09
 Identities = 73/231 (31%), Positives = 89/231 (38%), Gaps = 60/231 (25%)

Query: 150 TLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTT--RVGSMLPGLPYHLQG 207
           T A    +GP    G  G  +H  AAAAAAAAAA+  ++       V  ++     H + 
Sbjct: 3   TTAQYLPRGPGGGAGGTGPLMHPDAAAAAAAAAAAERLHAGAAYREVQKLM-----HHEW 57

Query: 208 SGSGPANHAGGAGAHPGWPQASADSPPYGSGGGA--AGG-----GAAGPGGAGSA----- 255
            G+G A H  G  AHP W        P G GGG   AGG     G AG GG G A     
Sbjct: 58  LGAG-AGHPVGL-AHPQW-------LPTGGGGGGDWAGGPHLEHGKAGGGGTGRADDGGG 108

Query: 256 ------------AAHVSARF-----------PYSPSPPMANGAAREPGGYAAAGSGGAGG 292
                       AAH  A +             SPSP  + G   +P G  A  +   GG
Sbjct: 109 GGGFHARLVHQGAAHAGAAWAQGSTAHHLGPAMSPSPGASGGHQPQPLGLYAQAAYPGGG 168

Query: 293 VSGGGSSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPSPYSPYVGA 343
             G    LAA GG         A   L+   H   H     PSP  P++GA
Sbjct: 169 GGGLAGMLAAGGG--------GAGPGLHHALHEDGHEAQLEPSP-PPHLGA 210


>gi|120587025 SH3 and multiple ankyrin repeat domains 1 [Homo sapiens]
          Length = 2161

 Score = 61.2 bits (147), Expect = 3e-09
 Identities = 100/383 (26%), Positives = 131/383 (34%), Gaps = 80/383 (20%)

Query: 11   PKRFGAAGADASDSRAFPAREPSTPPSPISSSSSSCSRGGERGPGGASNCGTPQLDTEAA 70
            P   G+AG     S+    R    PP   S++    +R G RG  G      P +     
Sbjct: 1064 PSHHGSAGGGGGSSQGPALRYFQLPPRAASAAMYVPARSG-RGRKG------PLVKQTKV 1116

Query: 71   AGPPARSLLLSSYASHPFGAPHGPSAP----GVAGPGGNLSSWEDLLLFTDLDQAATASK 126
             G P +   L      P  +P  P++P     VA P    S    + + T + +A + S 
Sbjct: 1117 EGEPQKGGGLP-----PAPSPTSPASPQPPPAVAAP----SEKNSIPIPTIIIKAPSTSS 1167

Query: 127  LLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSP 186
               SS+G+      P QPE                  G  GG   S + A A +    SP
Sbjct: 1168 SGRSSQGSSTEAEPPTQPEPT----------------GGGGGGGSSPSPAPAMSPVPPSP 1211

Query: 187  VYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWP-QASADSPPYGSGGGAAGGG 245
              VPT       P  P  L  +    A   G A    GW  +A   S  + S    AG  
Sbjct: 1212 SPVPTPAS----PSGPATLDFTSQFGAALVGAARREGGWQNEARRRSTLFLSTD--AGDE 1265

Query: 246  AAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAA--------AGSGGAGGVSGGG 297
              G GG G+ AA         P P + +  + + G ++A        AGSG   G  G G
Sbjct: 1266 DGGDGGLGTGAA---------PGPRLRHSKSIDEGMFSAEPYLRLESAGSGAGYGGYGAG 1316

Query: 298  SSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAG---PF 354
            S     GG    ++S    RPL                   P  G  L PA P G     
Sbjct: 1317 SRAYGGGGGSSAFTSFLPPRPL-----------------VHPLTGKALDPASPLGLALAA 1359

Query: 355  ETPVLHSLQSRAGAPLPVPRGPS 377
                L       GAP P PR PS
Sbjct: 1360 RERALKESSEGGGAPQPPPRPPS 1382



 Score = 49.3 bits (116), Expect = 1e-05
 Identities = 46/168 (27%), Positives = 56/168 (33%), Gaps = 49/168 (29%)

Query: 226  PQASADSPPYGSGGG----AAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGG 281
            P+    +PP  S  G    +  GG   PG  G   A   A F   PSPP     +RE   
Sbjct: 929  PEPPYSTPPVPSSSGRLTPSPRGGPFNPGSGGPLPASSPASFD-GPSPPDTRVGSRE--- 984

Query: 282  YAAAGSGGAGGVSGGGSSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHHH---PSPYS 338
                                          SL  + PL   +HH  HHHHHH   P P+ 
Sbjct: 985  -----------------------------KSLYHSGPLPPAHHHPPHHHHHHAPPPQPHH 1015

Query: 339  PYVGAPLTPAWPAG--PFETPVLHSLQS-------RAGAPLPVPRGPS 377
             +   P  P    G  P + P   +L         R G P P P  PS
Sbjct: 1016 HHAHPPHPPEMETGGSPDDPPPRLALGPQPSLRGWRGGGPSPTPGAPS 1063



 Score = 46.6 bits (109), Expect = 6e-05
 Identities = 106/426 (24%), Positives = 136/426 (31%), Gaps = 105/426 (24%)

Query: 17   AGADASDSRAFPAREPSTPPSPISSSSSSCSRGGERGPGGASNCGTPQLDTEAAAGPPAR 76
            +G +  DSR+       T  S  + SS S   GG  G GG +  G        A+GP   
Sbjct: 1673 SGIEEVDSRSSSDHPLETISSASTLSSLSAEGGGSAGGGGGAGAG-------VASGPE-- 1723

Query: 77   SLLLSSYASHPFG-APHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKL----LWSS 131
              LL +Y ++  G A  G S PG   P                 Q  T SKL    L +S
Sbjct: 1724 --LLDTYVAYLDGQAFGGSSTPGPPYP----------------PQLMTPSKLRGRALGAS 1765

Query: 132  RGAKLSPFA----PEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPV 187
             G +  P      P  P     ++    + G  A     G      A    A      PV
Sbjct: 1766 GGLRPGPSGGLRDPVTPTSPTVSVTGAGTDGLLALRACSGPPTAGVAGGPVAVEPEVPPV 1825

Query: 188  YVPTTRVGSMLPG--LPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGS-------- 237
             +PT    S LP   LP+  +G G  P    G        PQASA +    S        
Sbjct: 1826 PLPT---ASSLPRKLLPWE-EGPGPPPPPLPGPLAQ----PQASALATVKASIISELSSK 1877

Query: 238  ----GGGAAGGGA-----AGPGGAGSA----AAHVSAR---------------------- 262
                GG +A GGA      G GG G +    A++V  R                      
Sbjct: 1878 LQQFGGSSAAGGALPWARGGSGGGGDSHHGGASYVPERTSSLQRQRLSDDSQSSLLSKPV 1937

Query: 263  ------FPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQYSSLSAA 316
                  +P  P PP+  G    P   AA G+      S   S+    G        L   
Sbjct: 1938 SSLFQNWPKPPLPPLPTGTGVSPTAAAAPGATSPSASSSSTSTRHLQGVEFEMRPPLLRR 1997

Query: 317  RPLNGTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAGPFETPVLHSLQSRAG------APL 370
             P         H     P P S     P+ P+ P  P    +  S    AG      AP+
Sbjct: 1998 APSPSLLPASEHKVSPAPRPSS----LPILPSGPLYPGLFDIRGSPTGGAGGSADPFAPV 2053

Query: 371  PVPRGP 376
             VP  P
Sbjct: 2054 FVPPHP 2059



 Score = 38.9 bits (89), Expect = 0.014
 Identities = 71/288 (24%), Positives = 93/288 (32%), Gaps = 30/288 (10%)

Query: 18   GADASDSRAFPAREP--STPPSPISSSSSSCSRGGERG-PGGASNCGTPQLDTEAAAGPP 74
            G  + D    P   P  S PPSP S  +S      E G P        P +D E      
Sbjct: 1509 GPPSEDGPGVPPPSPRRSVPPSPTSPRASE-----ENGLPLLVLPPPAPSVDVEDGEFLF 1563

Query: 75   ARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSRGA 134
               L      S+ F  P  P  PG   P  +  +    L        A A   L S+  +
Sbjct: 1564 VEPLPPPLEFSNSFEKPESPLTPGPPHPLPDTPAPATPLPPVPPPAVAAAPPTLDSTASS 1623

Query: 135  KLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTRV 194
              S          Y +  A  +QG +A  G P    H     A AA A ++P   P    
Sbjct: 1624 LTS----------YDSEVATLTQGASAAPGDP----HPPGPPAPAAPAPAAPQPGPDPPP 1669

Query: 195  GSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGS 254
            G+         + S   P      A            S   G GGGA  G A+GP    +
Sbjct: 1670 GTDSGIEEVDSRSSSDHPLETISSASTLSSLSAEGGGSA--GGGGGAGAGVASGPELLDT 1727

Query: 255  AAAHVSAR------FPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGG 296
              A++  +       P  P PP     ++  G    A  G   G SGG
Sbjct: 1728 YVAYLDGQAFGGSSTPGPPYPPQLMTPSKLRGRALGASGGLRPGPSGG 1775



 Score = 35.4 bits (80), Expect = 0.15
 Identities = 26/85 (30%), Positives = 32/85 (37%), Gaps = 1/85 (1%)

Query: 218  GAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAR 277
            GA A PG P       P      A   G   P G  S    V +R   S  P     +A 
Sbjct: 1637 GASAAPGDPHPPGPPAPAAPAPAAPQPGPDPPPGTDSGIEEVDSR-SSSDHPLETISSAS 1695

Query: 278  EPGGYAAAGSGGAGGVSGGGSSLAA 302
                 +A G G AGG  G G+ +A+
Sbjct: 1696 TLSSLSAEGGGSAGGGGGAGAGVAS 1720



 Score = 35.0 bits (79), Expect = 0.20
 Identities = 28/101 (27%), Positives = 41/101 (40%), Gaps = 10/101 (9%)

Query: 223 PGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMAN---GAAREP 279
           PG   + A  P  GS G +          +G+  +  S R   + SP        A R+P
Sbjct: 454 PGAASSGAPGPTSGSQGQSQPSAPTTKLSSGTLRSASSPRGARARSPSRGRHPEDAKRQP 513

Query: 280 GGYAAAG-------SGGAGGVSGGGSSLAAMGGREPQYSSL 313
            G  ++        +GG GG  G G SL + G R   YS++
Sbjct: 514 RGRPSSSGTPREGPAGGTGGSGGPGGSLGSRGRRRKLYSAV 554



 Score = 32.0 bits (71), Expect = 1.7
 Identities = 25/91 (27%), Positives = 35/91 (38%), Gaps = 6/91 (6%)

Query: 165 APGGFVHSAAAAAAAAAAASSPVYVPTTRVGS---MLPGLPYHLQGSGSGPANHAGGAGA 221
           APG     A    + +   S P   PTT++ S        P   +        H   A  
Sbjct: 453 APGAASSGAPGPTSGSQGQSQPS-APTTKLSSGTLRSASSPRGARARSPSRGRHPEDAKR 511

Query: 222 HPGWPQASADSPPYGSGGGAAGGGAAGPGGA 252
            P    +S+ +P  G  GG   GG+ GPGG+
Sbjct: 512 QPRGRPSSSGTPREGPAGGT--GGSGGPGGS 540


>gi|111118976 collagen, type II, alpha 1 isoform 1 precursor [Homo
           sapiens]
          Length = 1487

 Score = 60.5 bits (145), Expect = 4e-09
 Identities = 108/399 (27%), Positives = 128/399 (32%), Gaps = 76/399 (19%)

Query: 15  GAAGADASDSRAFPAREPSTPPSPISSSSSSCSRG-----------GERGPGGASNC--- 60
           GAAGA  +D +  PA     PP P+  +      G           G RGP GA      
Sbjct: 339 GAAGARGNDGQPGPAG----PPGPVGPAGGPGFPGAPGAKGEAGPTGARGPEGAQGPRGE 394

Query: 61  -GTPQLDTEA-AAGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDL 118
            GTP     A A+G P    +  +  S   GAP    APG  GP G              
Sbjct: 395 PGTPGSPGPAGASGNPGTDGIPGAKGSA--GAPGIAGAPGFPGPRGPPGP---------- 442

Query: 119 DQAATASKLLWSSRGAK-LSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAA 177
            Q AT         G   ++ F  EQ  +           GPA   GAPG         A
Sbjct: 443 -QGATGPLGPKGQTGEPGIAGFKGEQGPK--------GEPGPAGPQGAPGPAGEEGKRGA 493

Query: 178 AAAAAASSPVYVPTTRVGSMLPGLPYHLQGSGS-------GPANHAGGAGAH-----PGW 225
                   P+  P  R      G P     +G        GP+  AG  GA+     PG 
Sbjct: 494 RGEPGGVGPIGPPGERGAPGNRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGE 553

Query: 226 PQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAA 285
           P         G  G A   G  GP GA              P PP   GA  +PG     
Sbjct: 554 PGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGR--------PGPPGPQGARGQPGVMGFP 605

Query: 286 GSGGAGGVSG--------GGSSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPSPY 337
           G  GA G  G        G   L  + G++       AA P               P P 
Sbjct: 606 GPKGANGEPGKAGEKGLPGAPGLRGLPGKD---GETGAAGPPGPAGPAGERGEQGAPGP- 661

Query: 338 SPYVGAPLTPAWPAGPFETPVLHSLQSRAGAP-LPVPRG 375
           S + G P  P  P G    P    +   AGAP L  PRG
Sbjct: 662 SGFQGLP-GPPGPPGEGGKPGDQGVPGEAGAPGLVGPRG 699



 Score = 57.0 bits (136), Expect = 5e-08
 Identities = 83/319 (26%), Positives = 102/319 (31%), Gaps = 47/319 (14%)

Query: 13  RFGAAGADASDSRAFPAREPSTPPSP-------ISSSSSSCSRGGERG-PGGASNCGTPQ 64
           + G +GA   D R  P         P          ++    + GE+G PG     G P 
Sbjct: 574 KVGPSGAPGEDGRPGPPGPQGARGQPGVMGFPGPKGANGEPGKAGEKGLPGAPGLRGLPG 633

Query: 65  LDTEA-AAGPPARSLLLSSYASHPFGAPHG----PSAPGVAGPGGNLSSWEDLLLFTDLD 119
            D E  AAGPP  +             P G    P  PG  G GG             + 
Sbjct: 634 KDGETGAAGPPGPAGPAGERGEQGAPGPSGFQGLPGPPGPPGEGGKPGD-------QGVP 686

Query: 120 QAATASKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAA 179
             A A  L+   RG +  P     P        A   QGP    G PG      A+  A 
Sbjct: 687 GEAGAPGLV-GPRGERGFPGERGSP-------GAQGLQGPRGLPGTPGTDGPKGASGPAG 738

Query: 180 AAAASSPVYVPTTRVGSMLPGLPYHLQGSG-SGPANHAGGAGAH-----PGWPQASADSP 233
              A  P           L G+P     +G +GP    G  G       PG       + 
Sbjct: 739 PPGAQGP---------PGLQGMPGERGAAGIAGPKGDRGDVGEKGPEGAPGKDGGRGLTG 789

Query: 234 PYGSGGGAAGGGAAG----PGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGG 289
           P G  G A   G  G    PG AGSA A  +        PP   G A  PG     G+ G
Sbjct: 790 PIGPPGPAGANGEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKG 849

Query: 290 AGGVSGGGSSLAAMGGREP 308
             G +G      A G + P
Sbjct: 850 EQGEAGQKGDAGAPGPQGP 868



 Score = 50.8 bits (120), Expect = 3e-06
 Identities = 104/393 (26%), Positives = 124/393 (31%), Gaps = 77/393 (19%)

Query: 15   GAAGADASDSRAFPAREPSTPPSPISSSSSSCSRG--GERGPGGASNCGTPQLDTEAAAG 72
            G AG    D+ A   + PS  P P   +  +  +G  G +GP GA+  G P        G
Sbjct: 852  GEAG-QKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGAT--GFP--GAAGRVG 906

Query: 73   PPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSR 132
            PP              G+   P  PG  GP G            D  + A         R
Sbjct: 907  PP--------------GSNGNPGPPGPPGPSGK-----------DGPKGA---------R 932

Query: 133  GAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAA---AAASSPVYV 189
            G    P    +P             GP    G PG    S A         A     V +
Sbjct: 933  GDSGPPGRAGEP-------GLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGLAGQRGIVGL 985

Query: 190  PTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGP 249
            P  R     PGLP      G   A  A G    PG       + P G  G     GA GP
Sbjct: 986  PGQRGERGFPGLPGPSGEPGKQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGP 1045

Query: 250  GGAGSAAAHVSAR-------FPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAA 302
             G   AA     R        P +P PP + G A   G     G  GA G  G      A
Sbjct: 1046 PGRDGAAGVKGDRGETGAVGAPGAPGPPGSPGPAGPTGKQGDRGEAGAQGPMGPSGPAGA 1105

Query: 303  MGGREPQ-------YSSLSAARPLNGTYHHHHHHHHHHPSPYSPY----VGAPLTPAWPA 351
             G + PQ        +     R L G  H         P P  P        P  P+ P 
Sbjct: 1106 RGIQGPQGPRGDKGEAGEPGERGLKG--HRGFTGLQGLPGPPGPSGDQGASGPAGPSGPR 1163

Query: 352  GPFETPVLHSLQSRA-GAPLPV----PRGPSAD 379
            GP   PV  S +  A G P P+    PRG S +
Sbjct: 1164 GP-PGPVGPSGKDGANGIPGPIGPPGPRGRSGE 1195



 Score = 50.1 bits (118), Expect = 6e-06
 Identities = 77/305 (25%), Positives = 89/305 (29%), Gaps = 17/305 (5%)

Query: 10  LPKRFGAAG-ADASDSRAFPAREPSTPPSPISSSSSSCSRGGERGPGGASNCGTPQLDTE 68
           +P   GA G       R FP    S     +          G  GP GAS    P     
Sbjct: 685 VPGEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTDGPKGASGPAGPP---- 740

Query: 69  AAAGPPARSLLLSSYASHPFGAPHGPSAP-GVAGPGGNLSSWEDLLLFTDLDQ----AAT 123
            A GPP    +     +     P G     G  GP G         L   +       A 
Sbjct: 741 GAQGPPGLQGMPGERGAAGIAGPKGDRGDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGAN 800

Query: 124 ASKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAA 183
             K      G   S  A   P E  +T       GPA + G PG      A      A  
Sbjct: 801 GEKGEVGPPGPAGSAGARGAPGERGET----GPPGPAGFAGPPGADGQPGAKGEQGEAGQ 856

Query: 184 SSPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAG 243
                 P  +  S  PG       +G   A  A G     G+P A+    P GS G    
Sbjct: 857 KGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNPGP 916

Query: 244 GGAAGPGGA-GSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSG--GGSSL 300
            G  GP G  G   A   +  P     P   G A  PG     G  G  G  G  G   L
Sbjct: 917 PGPPGPSGKDGPKGARGDSGPPGRAGEPGLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGL 976

Query: 301 AAMGG 305
           A   G
Sbjct: 977 AGQRG 981



 Score = 49.7 bits (117), Expect = 8e-06
 Identities = 74/297 (24%), Positives = 89/297 (29%), Gaps = 44/297 (14%)

Query: 32  PSTPPSPISSSSSSCSRGGERGPGGASNCGTPQLDTEAAAGPPARSLLLSSYASHPFGAP 91
           P  PP P   +     RG +RG  G      P+   +   G P              G P
Sbjct: 118 PKGPPGPQGPAGEQGPRG-DRGDKGEKGAPGPR-GRDGEPGTPGNP-----------GPP 164

Query: 92  HGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEEMYQTL 151
             P  PG  G GGN ++     +    D+ A  ++L     G    P  P  P       
Sbjct: 165 GPPGPPGPPGLGGNFAAQ----MAGGFDEKAGGAQL-----GVMQGPMGPMGPRGPPGPA 215

Query: 152 AALSSQG------------------PAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTR 193
            A   QG                  P    G PG       A     A    P      R
Sbjct: 216 GAPGPQGFQGNPGEPGEPGVSGPMGPRGPPGPPGKPGDDGEAGKPGKAGERGPPGPQGAR 275

Query: 194 VGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGG-- 251
                PGLP      G    + A G    PG    S      GS G     G  G  G  
Sbjct: 276 GFPGTPGLPGVKGHRGYPGLDGAKGEAGAPGVKGESGSPGENGSPGPMGPRGLPGERGRT 335

Query: 252 --AGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGR 306
             AG+A A  +   P    PP   G A  PG   A G+ G  G +G      A G R
Sbjct: 336 GPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFPGAPGAKGEAGPTGARGPEGAQGPR 392



 Score = 48.5 bits (114), Expect = 2e-05
 Identities = 79/309 (25%), Positives = 98/309 (31%), Gaps = 62/309 (20%)

Query: 15  GAAGADASDSRAFPAREPSTPPSPISSSSSSCSRGGERGPGGAS----NCGTPQLDTEAA 70
           G AGA      +    E  +P  P+        RG   GP GA+    N G P       
Sbjct: 300 GEAGAPGVKGESGSPGENGSP-GPMGPRGLPGERG-RTGPAGAAGARGNDGQP-----GP 352

Query: 71  AGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWS 130
           AGPP            P G P  P APG  G  G                  T ++    
Sbjct: 353 AGPPG--------PVGPAGGPGFPGAPGAKGEAG-----------------PTGARGPEG 387

Query: 131 SRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPG-GFVHSAAAAAAAAAAASSPVYV 189
           ++G +  P  P  P             GPA   G PG   +  A  +A A   A +P   
Sbjct: 388 AQGPRGEPGTPGSP-------------GPAGASGNPGTDGIPGAKGSAGAPGIAGAP--- 431

Query: 190 PTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGP 249
                G   P  P   QG+ +GP    G  G  PG      +  P G  G A   GA  P
Sbjct: 432 -----GFPGPRGPPGPQGA-TGPLGPKGQTG-EPGIAGFKGEQGPKGEPGPAGPQGA--P 482

Query: 250 GGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQ 309
           G AG      +   P    P    G    PG     G  G  G  G        G   P+
Sbjct: 483 GPAGEEGKRGARGEPGGVGPIGPPGERGAPGNRGFPGQDGLAGPKGAPGERGPSGLAGPK 542

Query: 310 YSSLSAARP 318
            ++    RP
Sbjct: 543 GANGDPGRP 551



 Score = 45.1 bits (105), Expect = 2e-04
 Identities = 75/285 (26%), Positives = 92/285 (32%), Gaps = 36/285 (12%)

Query: 32  PSTPPSP--ISSSSSSCSRGGERGPGGASNCGTPQ--LDTEAAAGPPARSLLLSSYASHP 87
           P  PP P  +  + ++   GG     G +  G  Q  +      GPP  +          
Sbjct: 166 PPGPPGPPGLGGNFAAQMAGGFDEKAGGAQLGVMQGPMGPMGPRGPPGPAGAPGPQGFQ- 224

Query: 88  FGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEEM 147
            G P  P  PGV+GP G            D  +A    K     RG    P  P+     
Sbjct: 225 -GNPGEPGEPGVSGPMGPRGPPGPPGKPGDDGEAGKPGKA--GERG----PPGPQGARGF 277

Query: 148 YQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQG 207
             T      +G   Y G  G      A A      + SP        GS  P  P  L G
Sbjct: 278 PGTPGLPGVKGHRGYPGLDGA--KGEAGAPGVKGESGSP-----GENGSPGPMGPRGLPG 330

Query: 208 SG--SGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPG-----GAGSAAAHVS 260
               +GPA  AG  G          D  P  +G     G A GPG     GA   A    
Sbjct: 331 ERGRTGPAGAAGARGN---------DGQPGPAGPPGPVGPAGGPGFPGAPGAKGEAGPTG 381

Query: 261 ARFPYSPSPPMAN-GAAREPGGYAAAGSGGAGGVSGGGSSLAAMG 304
           AR P     P    G    PG   A+G+ G  G+ G   S  A G
Sbjct: 382 ARGPEGAQGPRGEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPG 426



 Score = 44.3 bits (103), Expect = 3e-04
 Identities = 98/352 (27%), Positives = 111/352 (31%), Gaps = 68/352 (19%)

Query: 35   PPSPISSSSSSCSRG--GERGPGG-ASNCGTPQLDTEAAAGPPARSLLLSSYASHPFGAP 91
            PP P  S+ +  + G  GE GP G A   G P  D +  A            A  P   P
Sbjct: 808  PPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAP--GP 865

Query: 92   HGPS-APGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEEMYQT 150
             GPS APG  GP G                  T  K    +RGA+  P A   P      
Sbjct: 866  QGPSGAPGPQGPTG-----------------VTGPK---GARGAQGPPGATGFP------ 899

Query: 151  LAALSSQGPAAYDGAPG--GFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQGS 208
              A    GP   +G PG  G    +       A   S    P  R G   PGL       
Sbjct: 900  -GAAGRVGPPGSNGNPGPPGPPGPSGKDGPKGARGDSG---PPGRAGE--PGL------- 946

Query: 209  GSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAG-PGGAGSAAAHVSARFPYSP 267
              GPA   G  G  PG    S    P G  G A   G  G PG  G         FP  P
Sbjct: 947  -QGPAGPPGEKG-EPGDDGPSGAEGPPGPQGLAGQRGIVGLPGQRGERG------FPGLP 998

Query: 268  SPPMANGAAREPGGYAAAGSGGAGGVSG--GGSSLAAMGGREPQYSSLSAARPLNGTYHH 325
             P      + EPG   A G+ G  G  G  G   L    G   +  S  A    +G    
Sbjct: 999  GP------SGEPGKQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGA----DGPPGR 1048

Query: 326  HHHHHHHHPSPYSPYVGAPLTPAWPAGPFETPVLHSLQSRAGAPLPVPRGPS 377
                        +  VGAP  P  P  P           R  A    P GPS
Sbjct: 1049 DGAAGVKGDRGETGAVGAPGAPGPPGSPGPAGPTGKQGDRGEAGAQGPMGPS 1100



 Score = 42.7 bits (99), Expect = 0.001
 Identities = 77/278 (27%), Positives = 89/278 (32%), Gaps = 56/278 (20%)

Query: 50  GERGPGGASNCGTPQLDTEAAA-------GPPARSLLLSSYASHPFGAPHGPSAPGVAGP 102
           G RGP G    G P  D EA         GPP            P GA   P  PG+ G 
Sbjct: 240 GPRGPPGPP--GKPGDDGEAGKPGKAGERGPPG-----------PQGARGFPGTPGLPGV 286

Query: 103 GGNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAY 162
            G+         +  LD A   +     + G K    +P +            S GP   
Sbjct: 287 KGHRG-------YPGLDGAKGEA----GAPGVKGESGSPGEN----------GSPGPMGP 325

Query: 163 DGAPGGFVHSAAAAAAAAAAASS---PVYVPTTRVGSMLPGLPYHLQGSG-SGPANHAGG 218
            G PG    +  A AA A        P   P     +  PG P      G +GP    G 
Sbjct: 326 RGLPGERGRTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFPGAPGAKGEAGPTGARGP 385

Query: 219 AGAHPGWPQASADSPPYGSGGGAAGGGAAG----PGGAGSAAAHVSA---RFPYSPSPPM 271
            GA    P+    +P  GS G A   G  G    PG  GSA A   A    FP    PP 
Sbjct: 386 EGAQG--PRGEPGTP--GSPGPAGASGNPGTDGIPGAKGSAGAPGIAGAPGFPGPRGPPG 441

Query: 272 ANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQ 309
             GA    G     G  G  G  G        G   PQ
Sbjct: 442 PQGATGPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQ 479



 Score = 42.0 bits (97), Expect = 0.002
 Identities = 66/270 (24%), Positives = 78/270 (28%), Gaps = 55/270 (20%)

Query: 49   GGERGPGGASNCGTPQLDTEAAAGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSS 108
            G +  PG + + G P        GPP          + P G P    +PG  GP G    
Sbjct: 1005 GKQGAPGASGDRGPP-----GPVGPPG--------LTGPAGEPGREGSPGADGPPGR--- 1048

Query: 109  WEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGG 168
                         A   K      GA  +P AP  P           S GPA   G  G 
Sbjct: 1049 -----------DGAAGVKGDRGETGAVGAPGAPGPP----------GSPGPAGPTGKQGD 1087

Query: 169  FVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQGSGSGPAN-----HAGGAGAH- 222
               + A      +        P    G   P  P   +G    P       H G  G   
Sbjct: 1088 RGEAGAQGPMGPSG-------PAGARGIQGPQGPRGDKGEAGEPGERGLKGHRGFTGLQG 1140

Query: 223  -PGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMA----NGAAR 277
             PG P  S D    G  G +   G  GP G            P  P  P       G A 
Sbjct: 1141 LPGPPGPSGDQGASGPAGPSGPRGPPGPVGPSGKDGANGIPGPIGPPGPRGRSGETGPAG 1200

Query: 278  EPGGYAAAGSGGAGGVSGGGSSLAAMGGRE 307
             PG     G  G  G     S+ A +G RE
Sbjct: 1201 PPGNPGPPGPPGPPGPGIDMSAFAGLGPRE 1230



 Score = 36.6 bits (83), Expect = 0.067
 Identities = 36/138 (26%), Positives = 42/138 (30%), Gaps = 9/138 (6%)

Query: 218 GAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAR 277
           G    PG    + +  P G  G     GA GP G            P +P PP   G   
Sbjct: 117 GPKGPPGPQGPAGEQGPRGDRGDKGEKGAPGPRGRDGEPG-----TPGNPGPPGPPGPPG 171

Query: 278 EP--GGYAAAGSGGAGGVSGGGSSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPS 335
            P  GG  AA   G      GG+ L  M G            P           +   P 
Sbjct: 172 PPGLGGNFAAQMAGGFDEKAGGAQLGVMQGPMGPMGPRGPPGPAGAPGPQGFQGNPGEPG 231

Query: 336 PYSPYVGAPLTPAWPAGP 353
              P V  P+ P  P GP
Sbjct: 232 --EPGVSGPMGPRGPPGP 247


>gi|111118974 collagen, type II, alpha 1 isoform 2 precursor [Homo
           sapiens]
          Length = 1418

 Score = 60.5 bits (145), Expect = 4e-09
 Identities = 108/399 (27%), Positives = 128/399 (32%), Gaps = 76/399 (19%)

Query: 15  GAAGADASDSRAFPAREPSTPPSPISSSSSSCSRG-----------GERGPGGASNC--- 60
           GAAGA  +D +  PA     PP P+  +      G           G RGP GA      
Sbjct: 270 GAAGARGNDGQPGPAG----PPGPVGPAGGPGFPGAPGAKGEAGPTGARGPEGAQGPRGE 325

Query: 61  -GTPQLDTEA-AAGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDL 118
            GTP     A A+G P    +  +  S   GAP    APG  GP G              
Sbjct: 326 PGTPGSPGPAGASGNPGTDGIPGAKGSA--GAPGIAGAPGFPGPRGPPGP---------- 373

Query: 119 DQAATASKLLWSSRGAK-LSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAA 177
            Q AT         G   ++ F  EQ  +           GPA   GAPG         A
Sbjct: 374 -QGATGPLGPKGQTGEPGIAGFKGEQGPK--------GEPGPAGPQGAPGPAGEEGKRGA 424

Query: 178 AAAAAASSPVYVPTTRVGSMLPGLPYHLQGSGS-------GPANHAGGAGAH-----PGW 225
                   P+  P  R      G P     +G        GP+  AG  GA+     PG 
Sbjct: 425 RGEPGGVGPIGPPGERGAPGNRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGE 484

Query: 226 PQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAA 285
           P         G  G A   G  GP GA              P PP   GA  +PG     
Sbjct: 485 PGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGR--------PGPPGPQGARGQPGVMGFP 536

Query: 286 GSGGAGGVSG--------GGSSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPSPY 337
           G  GA G  G        G   L  + G++       AA P               P P 
Sbjct: 537 GPKGANGEPGKAGEKGLPGAPGLRGLPGKD---GETGAAGPPGPAGPAGERGEQGAPGP- 592

Query: 338 SPYVGAPLTPAWPAGPFETPVLHSLQSRAGAP-LPVPRG 375
           S + G P  P  P G    P    +   AGAP L  PRG
Sbjct: 593 SGFQGLP-GPPGPPGEGGKPGDQGVPGEAGAPGLVGPRG 630



 Score = 57.0 bits (136), Expect = 5e-08
 Identities = 83/319 (26%), Positives = 102/319 (31%), Gaps = 47/319 (14%)

Query: 13  RFGAAGADASDSRAFPAREPSTPPSP-------ISSSSSSCSRGGERG-PGGASNCGTPQ 64
           + G +GA   D R  P         P          ++    + GE+G PG     G P 
Sbjct: 505 KVGPSGAPGEDGRPGPPGPQGARGQPGVMGFPGPKGANGEPGKAGEKGLPGAPGLRGLPG 564

Query: 65  LDTEA-AAGPPARSLLLSSYASHPFGAPHG----PSAPGVAGPGGNLSSWEDLLLFTDLD 119
            D E  AAGPP  +             P G    P  PG  G GG             + 
Sbjct: 565 KDGETGAAGPPGPAGPAGERGEQGAPGPSGFQGLPGPPGPPGEGGKPGD-------QGVP 617

Query: 120 QAATASKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAA 179
             A A  L+   RG +  P     P        A   QGP    G PG      A+  A 
Sbjct: 618 GEAGAPGLV-GPRGERGFPGERGSP-------GAQGLQGPRGLPGTPGTDGPKGASGPAG 669

Query: 180 AAAASSPVYVPTTRVGSMLPGLPYHLQGSG-SGPANHAGGAGAH-----PGWPQASADSP 233
              A  P           L G+P     +G +GP    G  G       PG       + 
Sbjct: 670 PPGAQGP---------PGLQGMPGERGAAGIAGPKGDRGDVGEKGPEGAPGKDGGRGLTG 720

Query: 234 PYGSGGGAAGGGAAG----PGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGG 289
           P G  G A   G  G    PG AGSA A  +        PP   G A  PG     G+ G
Sbjct: 721 PIGPPGPAGANGEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKG 780

Query: 290 AGGVSGGGSSLAAMGGREP 308
             G +G      A G + P
Sbjct: 781 EQGEAGQKGDAGAPGPQGP 799



 Score = 50.8 bits (120), Expect = 3e-06
 Identities = 104/393 (26%), Positives = 124/393 (31%), Gaps = 77/393 (19%)

Query: 15   GAAGADASDSRAFPAREPSTPPSPISSSSSSCSRG--GERGPGGASNCGTPQLDTEAAAG 72
            G AG    D+ A   + PS  P P   +  +  +G  G +GP GA+  G P        G
Sbjct: 783  GEAG-QKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGAT--GFP--GAAGRVG 837

Query: 73   PPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSR 132
            PP              G+   P  PG  GP G            D  + A         R
Sbjct: 838  PP--------------GSNGNPGPPGPPGPSGK-----------DGPKGA---------R 863

Query: 133  GAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAA---AAASSPVYV 189
            G    P    +P             GP    G PG    S A         A     V +
Sbjct: 864  GDSGPPGRAGEP-------GLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGLAGQRGIVGL 916

Query: 190  PTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGP 249
            P  R     PGLP      G   A  A G    PG       + P G  G     GA GP
Sbjct: 917  PGQRGERGFPGLPGPSGEPGKQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGP 976

Query: 250  GGAGSAAAHVSAR-------FPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAA 302
             G   AA     R        P +P PP + G A   G     G  GA G  G      A
Sbjct: 977  PGRDGAAGVKGDRGETGAVGAPGAPGPPGSPGPAGPTGKQGDRGEAGAQGPMGPSGPAGA 1036

Query: 303  MGGREPQ-------YSSLSAARPLNGTYHHHHHHHHHHPSPYSPY----VGAPLTPAWPA 351
             G + PQ        +     R L G  H         P P  P        P  P+ P 
Sbjct: 1037 RGIQGPQGPRGDKGEAGEPGERGLKG--HRGFTGLQGLPGPPGPSGDQGASGPAGPSGPR 1094

Query: 352  GPFETPVLHSLQSRA-GAPLPV----PRGPSAD 379
            GP   PV  S +  A G P P+    PRG S +
Sbjct: 1095 GP-PGPVGPSGKDGANGIPGPIGPPGPRGRSGE 1126



 Score = 50.1 bits (118), Expect = 6e-06
 Identities = 77/305 (25%), Positives = 89/305 (29%), Gaps = 17/305 (5%)

Query: 10  LPKRFGAAG-ADASDSRAFPAREPSTPPSPISSSSSSCSRGGERGPGGASNCGTPQLDTE 68
           +P   GA G       R FP    S     +          G  GP GAS    P     
Sbjct: 616 VPGEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTDGPKGASGPAGPP---- 671

Query: 69  AAAGPPARSLLLSSYASHPFGAPHGPSAP-GVAGPGGNLSSWEDLLLFTDLDQ----AAT 123
            A GPP    +     +     P G     G  GP G         L   +       A 
Sbjct: 672 GAQGPPGLQGMPGERGAAGIAGPKGDRGDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGAN 731

Query: 124 ASKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAA 183
             K      G   S  A   P E  +T       GPA + G PG      A      A  
Sbjct: 732 GEKGEVGPPGPAGSAGARGAPGERGET----GPPGPAGFAGPPGADGQPGAKGEQGEAGQ 787

Query: 184 SSPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAG 243
                 P  +  S  PG       +G   A  A G     G+P A+    P GS G    
Sbjct: 788 KGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNPGP 847

Query: 244 GGAAGPGGA-GSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSG--GGSSL 300
            G  GP G  G   A   +  P     P   G A  PG     G  G  G  G  G   L
Sbjct: 848 PGPPGPSGKDGPKGARGDSGPPGRAGEPGLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGL 907

Query: 301 AAMGG 305
           A   G
Sbjct: 908 AGQRG 912



 Score = 49.7 bits (117), Expect = 8e-06
 Identities = 74/297 (24%), Positives = 89/297 (29%), Gaps = 44/297 (14%)

Query: 32  PSTPPSPISSSSSSCSRGGERGPGGASNCGTPQLDTEAAAGPPARSLLLSSYASHPFGAP 91
           P  PP P   +     RG +RG  G      P+   +   G P              G P
Sbjct: 49  PKGPPGPQGPAGEQGPRG-DRGDKGEKGAPGPR-GRDGEPGTPGNP-----------GPP 95

Query: 92  HGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEEMYQTL 151
             P  PG  G GGN ++     +    D+ A  ++L     G    P  P  P       
Sbjct: 96  GPPGPPGPPGLGGNFAAQ----MAGGFDEKAGGAQL-----GVMQGPMGPMGPRGPPGPA 146

Query: 152 AALSSQG------------------PAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTR 193
            A   QG                  P    G PG       A     A    P      R
Sbjct: 147 GAPGPQGFQGNPGEPGEPGVSGPMGPRGPPGPPGKPGDDGEAGKPGKAGERGPPGPQGAR 206

Query: 194 VGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGG-- 251
                PGLP      G    + A G    PG    S      GS G     G  G  G  
Sbjct: 207 GFPGTPGLPGVKGHRGYPGLDGAKGEAGAPGVKGESGSPGENGSPGPMGPRGLPGERGRT 266

Query: 252 --AGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGR 306
             AG+A A  +   P    PP   G A  PG   A G+ G  G +G      A G R
Sbjct: 267 GPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFPGAPGAKGEAGPTGARGPEGAQGPR 323



 Score = 48.5 bits (114), Expect = 2e-05
 Identities = 79/309 (25%), Positives = 98/309 (31%), Gaps = 62/309 (20%)

Query: 15  GAAGADASDSRAFPAREPSTPPSPISSSSSSCSRGGERGPGGAS----NCGTPQLDTEAA 70
           G AGA      +    E  +P  P+        RG   GP GA+    N G P       
Sbjct: 231 GEAGAPGVKGESGSPGENGSP-GPMGPRGLPGERG-RTGPAGAAGARGNDGQP-----GP 283

Query: 71  AGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWS 130
           AGPP            P G P  P APG  G  G                  T ++    
Sbjct: 284 AGPPG--------PVGPAGGPGFPGAPGAKGEAG-----------------PTGARGPEG 318

Query: 131 SRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPG-GFVHSAAAAAAAAAAASSPVYV 189
           ++G +  P  P  P             GPA   G PG   +  A  +A A   A +P   
Sbjct: 319 AQGPRGEPGTPGSP-------------GPAGASGNPGTDGIPGAKGSAGAPGIAGAP--- 362

Query: 190 PTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGP 249
                G   P  P   QG+ +GP    G  G  PG      +  P G  G A   GA  P
Sbjct: 363 -----GFPGPRGPPGPQGA-TGPLGPKGQTG-EPGIAGFKGEQGPKGEPGPAGPQGA--P 413

Query: 250 GGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQ 309
           G AG      +   P    P    G    PG     G  G  G  G        G   P+
Sbjct: 414 GPAGEEGKRGARGEPGGVGPIGPPGERGAPGNRGFPGQDGLAGPKGAPGERGPSGLAGPK 473

Query: 310 YSSLSAARP 318
            ++    RP
Sbjct: 474 GANGDPGRP 482



 Score = 45.1 bits (105), Expect = 2e-04
 Identities = 75/285 (26%), Positives = 92/285 (32%), Gaps = 36/285 (12%)

Query: 32  PSTPPSP--ISSSSSSCSRGGERGPGGASNCGTPQ--LDTEAAAGPPARSLLLSSYASHP 87
           P  PP P  +  + ++   GG     G +  G  Q  +      GPP  +          
Sbjct: 97  PPGPPGPPGLGGNFAAQMAGGFDEKAGGAQLGVMQGPMGPMGPRGPPGPAGAPGPQGFQ- 155

Query: 88  FGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEEM 147
            G P  P  PGV+GP G            D  +A    K     RG    P  P+     
Sbjct: 156 -GNPGEPGEPGVSGPMGPRGPPGPPGKPGDDGEAGKPGKA--GERG----PPGPQGARGF 208

Query: 148 YQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQG 207
             T      +G   Y G  G      A A      + SP        GS  P  P  L G
Sbjct: 209 PGTPGLPGVKGHRGYPGLDGA--KGEAGAPGVKGESGSP-----GENGSPGPMGPRGLPG 261

Query: 208 SG--SGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPG-----GAGSAAAHVS 260
               +GPA  AG  G          D  P  +G     G A GPG     GA   A    
Sbjct: 262 ERGRTGPAGAAGARGN---------DGQPGPAGPPGPVGPAGGPGFPGAPGAKGEAGPTG 312

Query: 261 ARFPYSPSPPMAN-GAAREPGGYAAAGSGGAGGVSGGGSSLAAMG 304
           AR P     P    G    PG   A+G+ G  G+ G   S  A G
Sbjct: 313 ARGPEGAQGPRGEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPG 357



 Score = 44.3 bits (103), Expect = 3e-04
 Identities = 98/352 (27%), Positives = 111/352 (31%), Gaps = 68/352 (19%)

Query: 35   PPSPISSSSSSCSRG--GERGPGG-ASNCGTPQLDTEAAAGPPARSLLLSSYASHPFGAP 91
            PP P  S+ +  + G  GE GP G A   G P  D +  A            A  P   P
Sbjct: 739  PPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAP--GP 796

Query: 92   HGPS-APGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEEMYQT 150
             GPS APG  GP G                  T  K    +RGA+  P A   P      
Sbjct: 797  QGPSGAPGPQGPTG-----------------VTGPK---GARGAQGPPGATGFP------ 830

Query: 151  LAALSSQGPAAYDGAPG--GFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQGS 208
              A    GP   +G PG  G    +       A   S    P  R G   PGL       
Sbjct: 831  -GAAGRVGPPGSNGNPGPPGPPGPSGKDGPKGARGDSG---PPGRAGE--PGL------- 877

Query: 209  GSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAG-PGGAGSAAAHVSARFPYSP 267
              GPA   G  G  PG    S    P G  G A   G  G PG  G         FP  P
Sbjct: 878  -QGPAGPPGEKG-EPGDDGPSGAEGPPGPQGLAGQRGIVGLPGQRGERG------FPGLP 929

Query: 268  SPPMANGAAREPGGYAAAGSGGAGGVSG--GGSSLAAMGGREPQYSSLSAARPLNGTYHH 325
             P      + EPG   A G+ G  G  G  G   L    G   +  S  A    +G    
Sbjct: 930  GP------SGEPGKQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGA----DGPPGR 979

Query: 326  HHHHHHHHPSPYSPYVGAPLTPAWPAGPFETPVLHSLQSRAGAPLPVPRGPS 377
                        +  VGAP  P  P  P           R  A    P GPS
Sbjct: 980  DGAAGVKGDRGETGAVGAPGAPGPPGSPGPAGPTGKQGDRGEAGAQGPMGPS 1031



 Score = 42.7 bits (99), Expect = 0.001
 Identities = 77/278 (27%), Positives = 89/278 (32%), Gaps = 56/278 (20%)

Query: 50  GERGPGGASNCGTPQLDTEAAA-------GPPARSLLLSSYASHPFGAPHGPSAPGVAGP 102
           G RGP G    G P  D EA         GPP            P GA   P  PG+ G 
Sbjct: 171 GPRGPPGPP--GKPGDDGEAGKPGKAGERGPPG-----------PQGARGFPGTPGLPGV 217

Query: 103 GGNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAY 162
            G+         +  LD A   +     + G K    +P +            S GP   
Sbjct: 218 KGHRG-------YPGLDGAKGEA----GAPGVKGESGSPGEN----------GSPGPMGP 256

Query: 163 DGAPGGFVHSAAAAAAAAAAASS---PVYVPTTRVGSMLPGLPYHLQGSG-SGPANHAGG 218
            G PG    +  A AA A        P   P     +  PG P      G +GP    G 
Sbjct: 257 RGLPGERGRTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFPGAPGAKGEAGPTGARGP 316

Query: 219 AGAHPGWPQASADSPPYGSGGGAAGGGAAG----PGGAGSAAAHVSA---RFPYSPSPPM 271
            GA    P+    +P  GS G A   G  G    PG  GSA A   A    FP    PP 
Sbjct: 317 EGAQG--PRGEPGTP--GSPGPAGASGNPGTDGIPGAKGSAGAPGIAGAPGFPGPRGPPG 372

Query: 272 ANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQ 309
             GA    G     G  G  G  G        G   PQ
Sbjct: 373 PQGATGPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQ 410



 Score = 42.0 bits (97), Expect = 0.002
 Identities = 66/270 (24%), Positives = 78/270 (28%), Gaps = 55/270 (20%)

Query: 49   GGERGPGGASNCGTPQLDTEAAAGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSS 108
            G +  PG + + G P        GPP          + P G P    +PG  GP G    
Sbjct: 936  GKQGAPGASGDRGPP-----GPVGPPG--------LTGPAGEPGREGSPGADGPPGR--- 979

Query: 109  WEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGG 168
                         A   K      GA  +P AP  P           S GPA   G  G 
Sbjct: 980  -----------DGAAGVKGDRGETGAVGAPGAPGPP----------GSPGPAGPTGKQGD 1018

Query: 169  FVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQGSGSGPAN-----HAGGAGAH- 222
               + A      +        P    G   P  P   +G    P       H G  G   
Sbjct: 1019 RGEAGAQGPMGPSG-------PAGARGIQGPQGPRGDKGEAGEPGERGLKGHRGFTGLQG 1071

Query: 223  -PGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMA----NGAAR 277
             PG P  S D    G  G +   G  GP G            P  P  P       G A 
Sbjct: 1072 LPGPPGPSGDQGASGPAGPSGPRGPPGPVGPSGKDGANGIPGPIGPPGPRGRSGETGPAG 1131

Query: 278  EPGGYAAAGSGGAGGVSGGGSSLAAMGGRE 307
             PG     G  G  G     S+ A +G RE
Sbjct: 1132 PPGNPGPPGPPGPPGPGIDMSAFAGLGPRE 1161



 Score = 36.6 bits (83), Expect = 0.067
 Identities = 36/138 (26%), Positives = 42/138 (30%), Gaps = 9/138 (6%)

Query: 218 GAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAR 277
           G    PG    + +  P G  G     GA GP G            P +P PP   G   
Sbjct: 48  GPKGPPGPQGPAGEQGPRGDRGDKGEKGAPGPRGRDGEPG-----TPGNPGPPGPPGPPG 102

Query: 278 EP--GGYAAAGSGGAGGVSGGGSSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPS 335
            P  GG  AA   G      GG+ L  M G            P           +   P 
Sbjct: 103 PPGLGGNFAAQMAGGFDEKAGGAQLGVMQGPMGPMGPRGPPGPAGAPGPQGFQGNPGEPG 162

Query: 336 PYSPYVGAPLTPAWPAGP 353
              P V  P+ P  P GP
Sbjct: 163 --EPGVSGPMGPRGPPGP 178


>gi|110349772 alpha 1 type I collagen preproprotein [Homo sapiens]
          Length = 1464

 Score = 59.7 bits (143), Expect = 7e-09
 Identities = 102/384 (26%), Positives = 122/384 (31%), Gaps = 71/384 (18%)

Query: 15   GAAGADASDSRAFPAREPSTPPSPISSSSSSCSRG--GERGPGGASNCGTPQLDTEAAAG 72
            G AGA        PA  P+ PP PI +  +  ++G  G  GP GA+  G P        G
Sbjct: 830  GDAGAKGDAGPPGPAG-PAGPPGPIGNVGAPGAKGARGSAGPPGAT--GFP--GAAGRVG 884

Query: 73   PPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSR 132
            PP            P G    P  PG AG  G               +  T      + R
Sbjct: 885  PPG-----------PSGNAGPPGPPGPAGKEGGKGP-----------RGETGP----AGR 918

Query: 133  GAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTT 192
              ++ P  P  P     +  A    GPA   G PG             A     V +P  
Sbjct: 919  PGEVGPPGPPGPAGEKGSPGA---DGPAGAPGTPG---------PQGIAGQRGVVGLPGQ 966

Query: 193  RVGSMLPGLPYHLQGSG-SGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGG 251
            R     PGLP      G  GP+  +G  G            PP  SG   A G    PG 
Sbjct: 967  RGERGFPGLPGPSGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREGAPGAEGSPGR 1026

Query: 252  AGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSG---GAGGVSGGGSSLAAMGGREP 308
             GS  A           PP A GA   PG    AG     G  G +G    +  +G R P
Sbjct: 1027 DGSPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPAGPVGPVGARGP 1086

Query: 309  Q-------YSSLSAARPLNGTYHHHHHHHHHHP--SPYSPYVGAPLTPAWPAGPFETPVL 359
                        +  +   G   H        P   P SP    P   + PAGP   P  
Sbjct: 1087 AGPQGPRGDKGETGEQGDRGIKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAGPRGPP-- 1144

Query: 360  HSLQSRAGAP-------LPVPRGP 376
                  AGAP       LP P GP
Sbjct: 1145 ----GSAGAPGKDGLNGLPGPIGP 1164



 Score = 55.8 bits (133), Expect = 1e-07
 Identities = 79/302 (26%), Positives = 102/302 (33%), Gaps = 30/302 (9%)

Query: 10  LPKRFGAAG-ADASDSRAFPA-REPSTPPSPISSSSSSCSRG-----GERG-PGGASNCG 61
           +P   GA G + A   R FP  R    PP P     ++ + G     G+ G PG   + G
Sbjct: 663 VPGDLGAPGPSGARGERGFPGERGVQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQG 722

Query: 62  TPQLD----TEAAAGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTD 117
            P L        AAG P          + P GA   P   GV G  G +          D
Sbjct: 723 APGLQGMPGERGAAGLPGPKGDRGD--AGPKGADGSPGKDGVRGLTGPIGPPGPAGAPGD 780

Query: 118 LDQAATASKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAA 177
             ++  +        GA+ +P    +P             GPA + G PG      A   
Sbjct: 781 KGESGPSGPA--GPTGARGAPGDRGEP----------GPPGPAGFAGPPGADGQPGAKGE 828

Query: 178 AAAAAASSPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGS 237
              A A      P     +  PG   ++   G+  A  + G     G+P A+    P G 
Sbjct: 829 PGDAGAKGDAGPPGPAGPAGPPGPIGNVGAPGAKGARGSAGPPGATGFPGAAGRVGPPGP 888

Query: 238 GGGAAGGGAAGP----GGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGV 293
            G A   G  GP    GG G       A  P    PP   G A E G   A G  GA G 
Sbjct: 889 SGNAGPPGPPGPAGKEGGKGPRGETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAGAPGT 948

Query: 294 SG 295
            G
Sbjct: 949 PG 950



 Score = 54.3 bits (129), Expect = 3e-07
 Identities = 89/323 (27%), Positives = 109/323 (33%), Gaps = 45/323 (13%)

Query: 15  GAAGADASDSRAFPAREPSTPPSPISSSSSSCS--RG--GERG-PGGASNCGTPQLD-TE 68
           G +G D +   A PA     P SP  + +      RG  GERG PG     G    D   
Sbjct: 269 GFSGLDGAKGDAGPAGPKGEPGSPGENGAPGQMGPRGLPGERGRPGAPGPAGARGNDGAT 328

Query: 69  AAAGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLL 128
            AAGPP       +  + P G P    A G AGP G   S     +  +      A    
Sbjct: 329 GAAGPPG-----PTGPAGPPGFPGAVGAKGEAGPQGPRGSEGPQGVRGEPGPPGPAG--- 380

Query: 129 WSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVY 188
             + G   +P A  QP        A  + G     GAPG         A   +    P  
Sbjct: 381 --AAGPAGNPGADGQP-------GAKGANGAPGIAGAPG------FPGARGPSGPQGPGG 425

Query: 189 VPTTRVGSMLPGLPYHLQGSGS-------------GPANHAGGAGAHPGWPQASADSPPY 235
            P  +  S  PG P     +G+             GPA   G  GA  G P  +    P 
Sbjct: 426 PPGPKGNSGEPGAPGSKGDTGAKGEPGPVGVQGPPGPAGEEGKRGAR-GEPGPTGLPGPP 484

Query: 236 GSGGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSG 295
           G  GG    G   PG  G A     A    SP P    G+  E G    AG  GA G++G
Sbjct: 485 GERGGPGSRGF--PGADGVAGPKGPAGERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTG 542

Query: 296 GGSSLAAMGGREPQYSSLSAARP 318
              S    G   P   +    RP
Sbjct: 543 SPGSPGPDGKTGPPGPAGQDGRP 565



 Score = 51.6 bits (122), Expect = 2e-06
 Identities = 98/368 (26%), Positives = 118/368 (32%), Gaps = 66/368 (17%)

Query: 15  GAAGADASDSRAFPAREPSTP--------PSPISSSSSSCSRGGERG-PGGASNCGTPQL 65
           G  G    D R  P   P           P P   ++    + GERG PG     G    
Sbjct: 554 GPPGPAGQDGRPGPPGPPGARGQAGVMGFPGP-KGAAGEPGKAGERGVPGPPGAVGPAGK 612

Query: 66  DTEAAA-GPPARSLLLSSYASH-PFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLD---- 119
           D EA A GPP  +          P G+P     PG AGP G      +  +  DL     
Sbjct: 613 DGEAGAQGPPGPAGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGP 672

Query: 120 QAATASKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPG--------GFVH 171
             A   +     RG +  P  P  P           ++G A   GAPG        G   
Sbjct: 673 SGARGERGFPGERGVQ-GPPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPG 731

Query: 172 SAAAA-----------AAAAAAASSP----VYVPTTRVGSMLP-GLPYHLQGSG-SGPAN 214
              AA           A    A  SP    V   T  +G   P G P     SG SGPA 
Sbjct: 732 ERGAAGLPGPKGDRGDAGPKGADGSPGKDGVRGLTGPIGPPGPAGAPGDKGESGPSGPAG 791

Query: 215 HAGGAGA-----HPGWPQASADSPPYGSGG-----------GAAG--------GGAAGPG 250
             G  GA      PG P  +  + P G+ G           GA G        G A  PG
Sbjct: 792 PTGARGAPGDRGEPGPPGPAGFAGPPGADGQPGAKGEPGDAGAKGDAGPPGPAGPAGPPG 851

Query: 251 GAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQY 310
             G+  A  +     S  PP A G     G     G  G  G  G        GG+ P+ 
Sbjct: 852 PIGNVGAPGAKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGKGPRG 911

Query: 311 SSLSAARP 318
            +  A RP
Sbjct: 912 ETGPAGRP 919



 Score = 50.4 bits (119), Expect = 4e-06
 Identities = 79/303 (26%), Positives = 94/303 (31%), Gaps = 56/303 (18%)

Query: 15  GAAGADASDSRAFPAREPST-----PPSPISSSSSSCSRGGERGPGGASNCGTPQLDTEA 69
           G  GA  S        EP       PP P        +RG E GP G    G P      
Sbjct: 434 GEPGAPGSKGDTGAKGEPGPVGVQGPPGPAGEEGKRGARG-EPGPTGLP--GPPG----E 486

Query: 70  AAGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLW 129
             GP +R    +   + P G      +PG AGP G+             +        L 
Sbjct: 487 RGGPGSRGFPGADGVAGPKGPAGERGSPGPAGPKGSPG-----------EAGRPGEAGLP 535

Query: 130 SSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYV 189
            ++G   SP +P    +           GPA  DG PG                  P   
Sbjct: 536 GAKGLTGSPGSPGPDGKT-------GPPGPAGQDGRPG------------------PPGP 570

Query: 190 PTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAH--PGWPQASADSPPYGSGGGAAGGGAA 247
           P  R  + + G P        G A   G AG    PG P A   +   G  G     G A
Sbjct: 571 PGARGQAGVMGFP-----GPKGAAGEPGKAGERGVPGPPGAVGPAGKDGEAGAQGPPGPA 625

Query: 248 GPGGA-GSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGR 306
           GP G  G      S  F   P P    G A +PG     G  GA G SG        G R
Sbjct: 626 GPAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSGARGERGFPGER 685

Query: 307 EPQ 309
             Q
Sbjct: 686 GVQ 688



 Score = 50.1 bits (118), Expect = 6e-06
 Identities = 96/379 (25%), Positives = 113/379 (29%), Gaps = 57/379 (15%)

Query: 15  GAAGADASDSRAFP--------AREPSTPPSPISSSSSSCSRG--GERGPGGASN-CGTP 63
           GAAG       A P        A+  + P  P  S      RG  G  GP GA+   G P
Sbjct: 329 GAAGPPGPTGPAGPPGFPGAVGAKGEAGPQGPRGSEGPQGVRGEPGPPGPAGAAGPAGNP 388

Query: 64  QLDTEAAAGPPARSLLLSSYASHPFG-APHGPSAPG-VAGPGGNLSSWEDLLLFTDLDQA 121
             D +  A     +  ++     P    P GP  PG   GP GN             +  
Sbjct: 389 GADGQPGAKGANGAPGIAGAPGFPGARGPSGPQGPGGPPGPKGNSG-----------EPG 437

Query: 122 ATASKLLWSSRGAKLSPF-----APEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAA 176
           A  SK      GAK  P       P  P        A    GP    G PG      +  
Sbjct: 438 APGSK---GDTGAKGEPGPVGVQGPPGPAGEEGKRGARGEPGPTGLPGPPGERGGPGSRG 494

Query: 177 AAAAAAASSPVYVPTTRVGSMLPGLPYHLQGSGSGPANH----AGGAGAHPGWPQASADS 232
              A   + P   P    GS  P  P    G    P       A G    PG P     +
Sbjct: 495 FPGADGVAGPKG-PAGERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSPGSPGPDGKT 553

Query: 233 PPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGG 292
            P G  G     G  GP GA   A  +       P P  A G   + G     G  GA G
Sbjct: 554 GPPGPAGQDGRPGPPGPPGARGQAGVMGF-----PGPKGAAGEPGKAGERGVPGPPGAVG 608

Query: 293 VSGGGSSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAG 352
            +G      A G   P       A P               P+    + G P  PA P G
Sbjct: 609 PAGKDGEAGAQGPPGP-------AGPAG-------ERGEQGPAGSPGFQGLP-GPAGPPG 653

Query: 353 PFETPVLHSLQSRAGAPLP 371
               P    +    GAP P
Sbjct: 654 EAGKPGEQGVPGDLGAPGP 672



 Score = 48.9 bits (115), Expect = 1e-05
 Identities = 91/346 (26%), Positives = 108/346 (31%), Gaps = 50/346 (14%)

Query: 40  SSSSSSCSRGGERGPGGASNCGTPQLDTEAAAGPPARSLLLSSYASHPFGAPHGPSAPGV 99
           S S +     G  GP G +    P+      AGPP R  +         G P  P  PG 
Sbjct: 99  SESPTDQETTGVEGPKGDTGPRGPR----GPAGPPGRDGIPGQPGLP--GPPGPPGPPGP 152

Query: 100 AGPGGNLSSWEDLLLFTDLDQAATASKLL------WSSRGAKLSPFAP-----EQPEEMY 148
            G GGN +      L    D+ +T    +         RG    P AP     + P    
Sbjct: 153 PGLGGNFAP----QLSYGYDEKSTGGISVPGPMGPSGPRGLPGPPGAPGPQGFQGPPGEP 208

Query: 149 QTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTR---VGSMLPGLPYHL 205
               A    GP    G PG       A          P      R     + LPG+  H 
Sbjct: 209 GEPGASGPMGPRGPPGPPGKNGDDGEAGKPGRPGERGPPGPQGARGLPGTAGLPGMKGHR 268

Query: 206 QGSG-------SGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAAH 258
             SG       +GPA   G  G+ PG   A     P G  G     GA GP GA      
Sbjct: 269 GFSGLDGAKGDAGPAGPKGEPGS-PGENGAPGQMGPRGLPGERGRPGAPGPAGARGNDGA 327

Query: 259 VSARFPYSPSPPM-------ANGAAREPGGYAAAGSGGAGGVSG----GGSSLAAMGGRE 307
             A  P  P+ P        A GA  E G     GS G  GV G     G + AA     
Sbjct: 328 TGAAGPPGPTGPAGPPGFPGAVGAKGEAGPQGPRGSEGPQGVRGEPGPPGPAGAAGPAGN 387

Query: 308 PQYSSLSAARPLNGTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAGP 353
           P       A+  NG             +P  P    P  P  P GP
Sbjct: 388 PGADGQPGAKGANGA-------PGIAGAPGFPGARGPSGPQGPGGP 426



 Score = 47.4 bits (111), Expect = 4e-05
 Identities = 78/301 (25%), Positives = 97/301 (32%), Gaps = 51/301 (16%)

Query: 15   GAAGADASDSRAFPAREPSTP-PSPISSSSSSCSRGGERGPGGASNCGTPQLDTEAAAGP 73
            G AG   S     PA  P TP P  I+         G+RG  G      P       +G 
Sbjct: 929  GPAGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGP-------SGE 981

Query: 74   PARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSRG 133
            P +     S AS   G P     PG+AGP G               + A  ++    S G
Sbjct: 982  PGKQG--PSGASGERGPPGPMGPPGLAGPPGESGR-----------EGAPGAE---GSPG 1025

Query: 134  AKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPG-----GFVHSAAAAAAAAAAASSPVY 188
               SP A     E           GPA   GAPG     G V  A  +        +   
Sbjct: 1026 RDGSPGAKGDRGET----------GPAGPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPA 1075

Query: 189  VPTTRVGSMLPGLPYHLQGSGS--------GPANHAGGAGAH--PGWPQASADSPPYGSG 238
             P   VG+  P  P   +G           G   H G +G    PG P +  +  P G+ 
Sbjct: 1076 GPVGPVGARGPAGPQGPRGDKGETGEQGDRGIKGHRGFSGLQGPPGPPGSPGEQGPSGAS 1135

Query: 239  GGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGS 298
            G A   G   PG AG+         P    PP   G   + G     G  G  G  G  S
Sbjct: 1136 GPAGPRGP--PGSAGAPGKDGLNGLPGPIGPPGPRGRTGDAGPVGPPGPPGPPGPPGPPS 1193

Query: 299  S 299
            +
Sbjct: 1194 A 1194



 Score = 43.5 bits (101), Expect = 5e-04
 Identities = 81/317 (25%), Positives = 101/317 (31%), Gaps = 45/317 (14%)

Query: 15   GAAGADASDSRAFPARE-----PSTPPSPISSSSSSCSRGGERGPGGA-SNCGTPQLDTE 68
            GA G   +  R  P        P  PP P         RG E GP G     G P     
Sbjct: 872  GATGFPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGKGPRG-ETGPAGRPGEVGPP----- 925

Query: 69   AAAGPPARSLLLSSY-ASHPFGAPHGPSAPGVAGPGG--NLSSWEDLLLFTDLD-QAATA 124
               GPP  +    S  A  P GAP  P   G+AG  G   L        F  L   +   
Sbjct: 926  ---GPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEP 982

Query: 125  SKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAAS 184
             K   S    +  P  P  P  +         +G    +G+PG      A         +
Sbjct: 983  GKQGPSGASGERGPPGPMGPPGLAGPPGESGREGAPGAEGSPGRDGSPGAKGDRGETGPA 1042

Query: 185  SPVYVPTTRVGSMLPGLPYHLQGSG----------SGPANHAGGAGAH-PGWPQASADSP 233
             P   P        PG P  +  +G          +GPA   G  GA  P  PQ      
Sbjct: 1043 GPPGAPGA------PGAPGPVGPAGKSGDRGETGPAGPAGPVGPVGARGPAGPQG----- 1091

Query: 234  PYGSGGGAAGGGAAG-PGGAGSAAAHVSARFPYSPS---PPMANGAAREPGGYAAAGSGG 289
            P G  G     G  G  G  G +        P SP    P  A+G A   G   +AG+ G
Sbjct: 1092 PRGDKGETGEQGDRGIKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAGPRGPPGSAGAPG 1151

Query: 290  AGGVSGGGSSLAAMGGR 306
              G++G    +   G R
Sbjct: 1152 KDGLNGLPGPIGPPGPR 1168



 Score = 42.4 bits (98), Expect = 0.001
 Identities = 71/283 (25%), Positives = 87/283 (30%), Gaps = 54/283 (19%)

Query: 11   PKRFGAAGADASDSRAFPAREPSTPPSPISSSSSSCSRGGERGPGGASNCGTPQLDTEAA 70
            P + G +GA          R P  P  P   +      G E  PG     G+P  D    
Sbjct: 982  PGKQGPSGASGE-------RGPPGPMGPPGLAGPPGESGREGAPGAE---GSPGRD---- 1027

Query: 71   AGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWS 130
              P A+     +  + P GAP  P APG  GP G                         S
Sbjct: 1028 GSPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGK------------------------S 1063

Query: 131  SRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPG--GFVHSAAAAAAAAAAASSPVY 188
                +  P  P  P      +  + ++GPA   G  G  G                S + 
Sbjct: 1064 GDRGETGPAGPAGP------VGPVGARGPAGPQGPRGDKGETGEQGDRGIKGHRGFSGLQ 1117

Query: 189  VPTTRVGSMLPGLPYHLQGSG-SGPANHAGGAGAH-----PGWPQASADSPPYGSGGGAA 242
             P    GS  PG       SG +GP    G AGA       G P       P G  G A 
Sbjct: 1118 GPPGPPGS--PGEQGPSGASGPAGPRGPPGSAGAPGKDGLNGLPGPIGPPGPRGRTGDAG 1175

Query: 243  GGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAA 285
              G  GP G        SA F +S  P      A + G Y  A
Sbjct: 1176 PVGPPGPPGPPGPPGPPSAGFDFSFLPQPPQEKAHDGGRYYRA 1218



 Score = 41.6 bits (96), Expect = 0.002
 Identities = 76/286 (26%), Positives = 92/286 (32%), Gaps = 62/286 (21%)

Query: 15  GAAGADASDSRAFPAREPSTPPSPISSSSSSCSRGGERGPGGASN----CGTPQLDTEAA 70
           G  GA        P  EP  P      +S      G  GP G +      G P    E  
Sbjct: 191 GPPGAPGPQGFQGPPGEPGEP-----GASGPMGPRGPPGPPGKNGDDGEAGKPGRPGER- 244

Query: 71  AGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWS 130
            GPP            P GA   P   G+ G  G+         F+ LD A   +     
Sbjct: 245 -GPPG-----------PQGARGLPGTAGLPGMKGHRG-------FSGLDGAKGDA----G 281

Query: 131 SRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVP 190
             G K  P +P +     Q    +  +G     G PG    + A     A  A+ P    
Sbjct: 282 PAGPKGEPGSPGENGAPGQ----MGPRGLPGERGRPGAPGPAGARGNDGATGAAGP---- 333

Query: 191 TTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAG-P 249
                   PG         +GPA   G     PG   A  ++ P G  G     G  G P
Sbjct: 334 --------PG--------PTGPAGPPG----FPGAVGAKGEAGPQGPRGSEGPQGVRGEP 373

Query: 250 GGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSG 295
           G  G A A   A  P +   P A GA   PG   A G  GA G SG
Sbjct: 374 GPPGPAGAAGPAGNPGADGQPGAKGANGAPGIAGAPGFPGARGPSG 419


>gi|21264565 AT rich interactive domain 1A isoform a [Homo sapiens]
          Length = 2285

 Score = 59.3 bits (142), Expect = 1e-08
 Identities = 105/404 (25%), Positives = 148/404 (36%), Gaps = 98/404 (24%)

Query: 15  GAAGADASDSRAFPAREP-------STPPSPISSSSSSCSRGGERGPGGASNCGTPQLDT 67
           G  G   + S   P  EP       +  P P  +++ +   GG  G G +   G P    
Sbjct: 81  GGGGGGGAGSGGGPGAEPDLKNSNGNAGPRPALNNNLTEPPGGGGG-GSSDGVGAPPHSA 139

Query: 68  EAAAGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKL 127
            AA  PPA          + FG P+G S   VA     +   +         Q +     
Sbjct: 140 AAALPPPA----------YGFGQPYGRSPSAVAAAAAAVFHQQ------HGGQQSPGLAA 183

Query: 128 LWSSRGAKLSPFA-PEQ-------PEEMYQTL----AALSSQGPA-AYDGAPGGFVHSAA 174
           L S  G  L P+A P+Q       P   Y +     +A     PA A     GG   S A
Sbjct: 184 LQSGGGGGLEPYAGPQQNSHDHGFPNHQYNSYYPNRSAYPPPAPAYALSSPRGGTPGSGA 243

Query: 175 AAAA--------AAAAASSPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWP 226
           AAAA        +A+A+SS       R G+M           G GP+   GG       P
Sbjct: 244 AAAAGSKPPPSSSASASSSSSSFAQQRFGAM----------GGGGPSAAGGGTPQPTATP 293

Query: 227 Q----ASADSPPYGSGGGAAGGGAAGP--GGAGSAAAHVSAR----FPYSPSPPMANGAA 276
                 ++ S   G  G   G  + GP  GGAG   A ++++       + +   A+G A
Sbjct: 294 TLNQLLTSPSSARGYQGYPGGDYSGGPQDGGAGKGPADMASQCWGAAAAAAAAAAASGGA 353

Query: 277 REPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQYSS------LSAARPLNGTYHHHHHHH 330
           ++   +A    G +G   GGG  LA    R PQ SS          +P  GT        
Sbjct: 354 QQRSHHAPMSPGSSG---GGGQPLA----RTPQPSSPMDQMGKMRPQPYGGT-------- 398

Query: 331 HHHPSPYSPYVGAPLTP----AWPAGPF--ETPVLH--SLQSRA 366
               +PYS   G P  P     +P  P+  +TP  +  ++Q RA
Sbjct: 399 ----NPYSQQQGPPSGPQQGHGYPGQPYGSQTPQRYPMTMQGRA 438



 Score = 58.2 bits (139), Expect = 2e-08
 Identities = 59/199 (29%), Positives = 79/199 (39%), Gaps = 30/199 (15%)

Query: 167 GGFVHSAAAA------AAAAAAASSPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAG 220
           GG   +AAAA      AAA   +  P   P   +G  L        G G G A   GG G
Sbjct: 36  GGEAAAAAAAERGEMKAAAGQESEGPAVGPPQPLGKELQDGAESNGGGGGGGAGSGGGPG 95

Query: 221 AHPGWPQASADS------------PPYGSGGGAAGGGAAGPGGAGSAAAHVSARF--PYS 266
           A P    ++ ++            PP G GGG++ G  A P  A +A    +  F  PY 
Sbjct: 96  AEPDLKNSNGNAGPRPALNNNLTEPPGGGGGGSSDGVGAPPHSAAAALPPPAYGFGQPYG 155

Query: 267 PSPPMANGAA-----REPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQYSSLSAARPLNG 321
            SP     AA     ++ GG  + G   A   SGGG  L    G  PQ +S     P N 
Sbjct: 156 RSPSAVAAAAAAVFHQQHGGQQSPGL--AALQSGGGGGLEPYAG--PQQNSHDHGFP-NH 210

Query: 322 TYHHHHHHHHHHPSPYSPY 340
            Y+ ++ +   +P P   Y
Sbjct: 211 QYNSYYPNRSAYPPPAPAY 229



 Score = 45.1 bits (105), Expect = 2e-04
 Identities = 50/186 (26%), Positives = 64/186 (34%), Gaps = 34/186 (18%)

Query: 175 AAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGG-----AGAHPGWPQAS 229
           AA  A AAASS        +G+  P  P  L+ +       AGG     A A  G  +A+
Sbjct: 2   AAQVAPAAASS--------LGNPPPPPPSELKKAEQQQREEAGGEAAAAAAAERGEMKAA 53

Query: 230 A----DSPPYG--------------SGGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPM 271
           A    + P  G              S GG  GGGA   GG G+     ++     P P +
Sbjct: 54  AGQESEGPAVGPPQPLGKELQDGAESNGGGGGGGAGSGGGPGAEPDLKNSNGNAGPRPAL 113

Query: 272 ANGAAREPGGYAAAGSGGAGGVSGGGSSL---AAMGGREPQYSSLSAARPLNGTYHHHHH 328
            N     PGG     S G G      ++     A G  +P   S SA         H  H
Sbjct: 114 NNNLTEPPGGGGGGSSDGVGAPPHSAAAALPPPAYGFGQPYGRSPSAVAAAAAAVFHQQH 173

Query: 329 HHHHHP 334
                P
Sbjct: 174 GGQQSP 179



 Score = 44.7 bits (104), Expect = 2e-04
 Identities = 83/301 (27%), Positives = 111/301 (36%), Gaps = 54/301 (17%)

Query: 27  FPAREPSTPPSPISSSSSSCSRGGERGPGGASNCGTPQLDTEAAAGPPARSLLLSSYASH 86
           +P R    PP+P  + SS   RGG  G G A+  G+    + +A+     S   SS+A  
Sbjct: 216 YPNRSAYPPPAPAYALSSP--RGGTPGSGAAAAAGSKPPPSSSASA----SSSSSSFAQQ 269

Query: 87  PFGA--PHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQ- 143
            FGA    GPSA G   P    +          L+Q  T+     S+RG +  P      
Sbjct: 270 RFGAMGGGGPSAAGGGTPQPTAT--------PTLNQLLTSPS---SARGYQGYPGGDYSG 318

Query: 144 -PEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPV-----YVPTTRVGSM 197
            P++        + +GPA       G    AAAAAAAAAAAS        + P +   S 
Sbjct: 319 GPQD------GGAGKGPADMASQCWG----AAAAAAAAAAASGGAQQRSHHAPMSPGSSG 368

Query: 198 LPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGSA-A 256
             G P       S P +  G        PQ    + PY    G   G   G G  G    
Sbjct: 369 GGGQPLARTPQPSSPMDQMGKMR-----PQPYGGTNPYSQQQGPPSGPQQGHGYPGQPYG 423

Query: 257 AHVSARFPYSPSPPMANGAAREPGGYAAAGS---GGAGGVSGGGSSLAAMGGREPQYSSL 313
           +    R+P +    M   A    GG +        G  G SG G       G+ P Y+  
Sbjct: 424 SQTPQRYPMT----MQGRAQSAMGGLSYTQQIPPYGQQGPSGYGQQ-----GQTPYYNQQ 474

Query: 314 S 314
           S
Sbjct: 475 S 475



 Score = 37.4 bits (85), Expect = 0.039
 Identities = 72/296 (24%), Positives = 97/296 (32%), Gaps = 29/296 (9%)

Query: 73  PPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLS------SWEDLLLFTDLDQAATASK 126
           PP + L   S+ S    AP   S+ G      NLS      S  DL    D     T   
Sbjct: 598 PPPQELSQDSFGSQASSAPSMTSSKG-GQEDMNLSLQSRPSSLPDLSGSIDDLPMGTEGA 656

Query: 127 LLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSP 186
           L   S G   S  +  Q E+     +  S        G  G       + A+ A + S P
Sbjct: 657 L---SPGVSTSGISSSQGEQSNPAQSPFSPHTSPHLPGIRGPSPSPVGSPASVAQSRSGP 713

Query: 187 VYVPTTRVGSMLPGLP--------YHLQGSGSGPANHAGGAGAHPGWPQ------ASADS 232
           +  P    G+ +P  P         H   + S  A   G    +P  PQ       SA S
Sbjct: 714 L-SPAAVPGNQMPPRPPSGQSDSIMHPSMNQSSIAQDRGYMQRNPQMPQYSSPQPGSALS 772

Query: 233 PPYGSGGGA-AGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAG 291
           P   SGG    G G+      GS         P    P   N  A     Y +AG  G  
Sbjct: 773 PRQPSGGQIHTGMGSYQQNSMGSYGPQGGQYGPQGGYPRQPNYNALPNANYPSAGMAGGI 832

Query: 292 GVSGGGSSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPSPYSPYVGAPLTP 347
              G G  +    G  P Y +L   R  + +  +  +  +    P  P VG+ + P
Sbjct: 833 NPMGAGGQMHGQPG-IPPYGTLPPGRMSHASMGNRPYGPNMANMP--PQVGSGMCP 885


>gi|21264575 AT rich interactive domain 1A isoform b [Homo sapiens]
          Length = 2068

 Score = 59.3 bits (142), Expect = 1e-08
 Identities = 105/404 (25%), Positives = 148/404 (36%), Gaps = 98/404 (24%)

Query: 15  GAAGADASDSRAFPAREP-------STPPSPISSSSSSCSRGGERGPGGASNCGTPQLDT 67
           G  G   + S   P  EP       +  P P  +++ +   GG  G G +   G P    
Sbjct: 81  GGGGGGGAGSGGGPGAEPDLKNSNGNAGPRPALNNNLTEPPGGGGG-GSSDGVGAPPHSA 139

Query: 68  EAAAGPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKL 127
            AA  PPA          + FG P+G S   VA     +   +         Q +     
Sbjct: 140 AAALPPPA----------YGFGQPYGRSPSAVAAAAAAVFHQQ------HGGQQSPGLAA 183

Query: 128 LWSSRGAKLSPFA-PEQ-------PEEMYQTL----AALSSQGPA-AYDGAPGGFVHSAA 174
           L S  G  L P+A P+Q       P   Y +     +A     PA A     GG   S A
Sbjct: 184 LQSGGGGGLEPYAGPQQNSHDHGFPNHQYNSYYPNRSAYPPPAPAYALSSPRGGTPGSGA 243

Query: 175 AAAA--------AAAAASSPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWP 226
           AAAA        +A+A+SS       R G+M           G GP+   GG       P
Sbjct: 244 AAAAGSKPPPSSSASASSSSSSFAQQRFGAM----------GGGGPSAAGGGTPQPTATP 293

Query: 227 Q----ASADSPPYGSGGGAAGGGAAGP--GGAGSAAAHVSAR----FPYSPSPPMANGAA 276
                 ++ S   G  G   G  + GP  GGAG   A ++++       + +   A+G A
Sbjct: 294 TLNQLLTSPSSARGYQGYPGGDYSGGPQDGGAGKGPADMASQCWGAAAAAAAAAAASGGA 353

Query: 277 REPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQYSS------LSAARPLNGTYHHHHHHH 330
           ++   +A    G +G   GGG  LA    R PQ SS          +P  GT        
Sbjct: 354 QQRSHHAPMSPGSSG---GGGQPLA----RTPQPSSPMDQMGKMRPQPYGGT-------- 398

Query: 331 HHHPSPYSPYVGAPLTP----AWPAGPF--ETPVLH--SLQSRA 366
               +PYS   G P  P     +P  P+  +TP  +  ++Q RA
Sbjct: 399 ----NPYSQQQGPPSGPQQGHGYPGQPYGSQTPQRYPMTMQGRA 438



 Score = 58.2 bits (139), Expect = 2e-08
 Identities = 59/199 (29%), Positives = 79/199 (39%), Gaps = 30/199 (15%)

Query: 167 GGFVHSAAAA------AAAAAAASSPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAG 220
           GG   +AAAA      AAA   +  P   P   +G  L        G G G A   GG G
Sbjct: 36  GGEAAAAAAAERGEMKAAAGQESEGPAVGPPQPLGKELQDGAESNGGGGGGGAGSGGGPG 95

Query: 221 AHPGWPQASADS------------PPYGSGGGAAGGGAAGPGGAGSAAAHVSARF--PYS 266
           A P    ++ ++            PP G GGG++ G  A P  A +A    +  F  PY 
Sbjct: 96  AEPDLKNSNGNAGPRPALNNNLTEPPGGGGGGSSDGVGAPPHSAAAALPPPAYGFGQPYG 155

Query: 267 PSPPMANGAA-----REPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQYSSLSAARPLNG 321
            SP     AA     ++ GG  + G   A   SGGG  L    G  PQ +S     P N 
Sbjct: 156 RSPSAVAAAAAAVFHQQHGGQQSPGL--AALQSGGGGGLEPYAG--PQQNSHDHGFP-NH 210

Query: 322 TYHHHHHHHHHHPSPYSPY 340
            Y+ ++ +   +P P   Y
Sbjct: 211 QYNSYYPNRSAYPPPAPAY 229



 Score = 45.1 bits (105), Expect = 2e-04
 Identities = 50/186 (26%), Positives = 64/186 (34%), Gaps = 34/186 (18%)

Query: 175 AAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGG-----AGAHPGWPQAS 229
           AA  A AAASS        +G+  P  P  L+ +       AGG     A A  G  +A+
Sbjct: 2   AAQVAPAAASS--------LGNPPPPPPSELKKAEQQQREEAGGEAAAAAAAERGEMKAA 53

Query: 230 A----DSPPYG--------------SGGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPM 271
           A    + P  G              S GG  GGGA   GG G+     ++     P P +
Sbjct: 54  AGQESEGPAVGPPQPLGKELQDGAESNGGGGGGGAGSGGGPGAEPDLKNSNGNAGPRPAL 113

Query: 272 ANGAAREPGGYAAAGSGGAGGVSGGGSSL---AAMGGREPQYSSLSAARPLNGTYHHHHH 328
            N     PGG     S G G      ++     A G  +P   S SA         H  H
Sbjct: 114 NNNLTEPPGGGGGGSSDGVGAPPHSAAAALPPPAYGFGQPYGRSPSAVAAAAAAVFHQQH 173

Query: 329 HHHHHP 334
                P
Sbjct: 174 GGQQSP 179



 Score = 44.7 bits (104), Expect = 2e-04
 Identities = 83/301 (27%), Positives = 111/301 (36%), Gaps = 54/301 (17%)

Query: 27  FPAREPSTPPSPISSSSSSCSRGGERGPGGASNCGTPQLDTEAAAGPPARSLLLSSYASH 86
           +P R    PP+P  + SS   RGG  G G A+  G+    + +A+     S   SS+A  
Sbjct: 216 YPNRSAYPPPAPAYALSSP--RGGTPGSGAAAAAGSKPPPSSSASA----SSSSSSFAQQ 269

Query: 87  PFGA--PHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQ- 143
            FGA    GPSA G   P    +          L+Q  T+     S+RG +  P      
Sbjct: 270 RFGAMGGGGPSAAGGGTPQPTAT--------PTLNQLLTSPS---SARGYQGYPGGDYSG 318

Query: 144 -PEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPV-----YVPTTRVGSM 197
            P++        + +GPA       G    AAAAAAAAAAAS        + P +   S 
Sbjct: 319 GPQD------GGAGKGPADMASQCWG----AAAAAAAAAAASGGAQQRSHHAPMSPGSSG 368

Query: 198 LPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGSA-A 256
             G P       S P +  G        PQ    + PY    G   G   G G  G    
Sbjct: 369 GGGQPLARTPQPSSPMDQMGKMR-----PQPYGGTNPYSQQQGPPSGPQQGHGYPGQPYG 423

Query: 257 AHVSARFPYSPSPPMANGAAREPGGYAAAGS---GGAGGVSGGGSSLAAMGGREPQYSSL 313
           +    R+P +    M   A    GG +        G  G SG G       G+ P Y+  
Sbjct: 424 SQTPQRYPMT----MQGRAQSAMGGLSYTQQIPPYGQQGPSGYGQQ-----GQTPYYNQQ 474

Query: 314 S 314
           S
Sbjct: 475 S 475



 Score = 37.4 bits (85), Expect = 0.039
 Identities = 72/296 (24%), Positives = 97/296 (32%), Gaps = 29/296 (9%)

Query: 73  PPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLS------SWEDLLLFTDLDQAATASK 126
           PP + L   S+ S    AP   S+ G      NLS      S  DL    D     T   
Sbjct: 598 PPPQELSQDSFGSQASSAPSMTSSKG-GQEDMNLSLQSRPSSLPDLSGSIDDLPMGTEGA 656

Query: 127 LLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSP 186
           L   S G   S  +  Q E+     +  S        G  G       + A+ A + S P
Sbjct: 657 L---SPGVSTSGISSSQGEQSNPAQSPFSPHTSPHLPGIRGPSPSPVGSPASVAQSRSGP 713

Query: 187 VYVPTTRVGSMLPGLP--------YHLQGSGSGPANHAGGAGAHPGWPQ------ASADS 232
           +  P    G+ +P  P         H   + S  A   G    +P  PQ       SA S
Sbjct: 714 L-SPAAVPGNQMPPRPPSGQSDSIMHPSMNQSSIAQDRGYMQRNPQMPQYSSPQPGSALS 772

Query: 233 PPYGSGGGA-AGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAG 291
           P   SGG    G G+      GS         P    P   N  A     Y +AG  G  
Sbjct: 773 PRQPSGGQIHTGMGSYQQNSMGSYGPQGGQYGPQGGYPRQPNYNALPNANYPSAGMAGGI 832

Query: 292 GVSGGGSSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHHHPSPYSPYVGAPLTP 347
              G G  +    G  P Y +L   R  + +  +  +  +    P  P VG+ + P
Sbjct: 833 NPMGAGGQMHGQPG-IPPYGTLPPGRMSHASMGNRPYGPNMANMP--PQVGSGMCP 885



 Score = 32.7 bits (73), Expect = 0.97
 Identities = 42/190 (22%), Positives = 59/190 (31%), Gaps = 21/190 (11%)

Query: 185  SPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGG 244
            +P Y P+     M+  + Y       G    A G+      P  S+   P G G G    
Sbjct: 1208 NPGYQPSMNTSDMMGRMSYEPNKDPYGSMRKAPGSD-----PFMSSGQGPNG-GMGDPYS 1261

Query: 245  GAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMG 304
             AAGPG  G+ A      +PY              G Y    +    G  G  S+ A   
Sbjct: 1262 RAAGPG-LGNVAMGPRQHYPYG-------------GPYDRVRTEPGIGPEGNMSTGAPQP 1307

Query: 305  GREPQYSSLSAARPLNGTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAGPFETPVLHSLQS 364
               P         P              H S Y        TP+    P +   ++  Q 
Sbjct: 1308 NLMPSNPDSGMYSPSRYPPQQQQQQQQRHDS-YGNQFSTQGTPSGSPFPSQQTTMYQQQQ 1366

Query: 365  RAGAPLPVPR 374
            +  +P P+PR
Sbjct: 1367 QVSSPAPLPR 1376


>gi|5031757 T-cell leukemia homeobox 1 [Homo sapiens]
          Length = 330

 Score = 58.2 bits (139), Expect = 2e-08
 Identities = 52/156 (33%), Positives = 62/156 (39%), Gaps = 41/156 (26%)

Query: 198 LPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAA 257
           L G  Y   G GS  A  AGGAGA             YG+GG    GG  GP G G A +
Sbjct: 50  LVGGAYTYGGGGSAAATGAGGAGA-------------YGTGGP---GGPGGPAGGGGACS 93

Query: 258 HVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQYSSLSAAR 317
                  Y+ +  +A G           G GG GG SGG  +L+A G        + A R
Sbjct: 94  MGPLTGSYNVNMALAGG----------PGPGGGGGSSGGAGALSAAG-----VIRVPAHR 138

Query: 318 PLNGTYHHHHHHHHHHPSPYSPYVGAPLTPAWPAGP 353
           PL G           HP P +   G P  P+ PA P
Sbjct: 139 PLAGAV--------AHPQPLA--TGLPTVPSVPAMP 164



 Score = 37.7 bits (86), Expect = 0.030
 Identities = 37/137 (27%), Positives = 47/137 (34%), Gaps = 25/137 (18%)

Query: 167 GGFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWP 226
           G + +    +AAA  A  +  Y      G+  PG          GP   AGG GA    P
Sbjct: 53  GAYTYGGGGSAAATGAGGAGAY------GTGGPG----------GPGGPAGGGGACSMGP 96

Query: 227 --------QASADSP-PYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAR 277
                    A A  P P G GG + G GA    G     AH       +   P+A G   
Sbjct: 97  LTGSYNVNMALAGGPGPGGGGGSSGGAGALSAAGVIRVPAHRPLAGAVAHPQPLATGLPT 156

Query: 278 EPGGYAAAGSGGAGGVS 294
            P   A  G     G++
Sbjct: 157 VPSVPAMPGVNNLTGLT 173


>gi|169636435 caudal type homeobox 2 [Homo sapiens]
          Length = 313

 Score = 57.8 bits (138), Expect = 3e-08
 Identities = 54/146 (36%), Positives = 66/146 (45%), Gaps = 28/146 (19%)

Query: 215 HAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANG 274
           H+GG    P   Q     P Y   GG     AA      +AAA++ +     PS P A G
Sbjct: 20  HSGGLNLAP---QNFVSPPQYPDYGGYHVAAAA------AAAANLDSAQSPGPSWPAAYG 70

Query: 275 AA-RE------PGGYAAAGSGGAGGVSGGGSSLAAMGGREPQYSSLSAARPLNGTYHHHH 327
           A  RE      PGG AAA +  A G++GG S  AAMG     YSS +   P     HHH 
Sbjct: 71  APLREDWNGYAPGGAAAAANAVAHGLNGG-SPAAAMG-----YSSPADYHP-----HHHP 119

Query: 328 HHHHHHPSPYSPYVGAPLTPAWPAGP 353
           HHH HHP+  +P   + L      GP
Sbjct: 120 HHHPHHPAA-APSCASGLLQTLNPGP 144


>gi|73427806 v-maf musculoaponeurotic fibrosarcoma oncogene homolog
           isoform b [Homo sapiens]
          Length = 373

 Score = 57.8 bits (138), Expect = 3e-08
 Identities = 67/255 (26%), Positives = 91/255 (35%), Gaps = 46/255 (18%)

Query: 65  LDTEAAAGPPARSLLLSSYASHPFGAPHG--PSAPGVAGPGGNLSSWEDLLLFTDLDQAA 122
           ++T+       R +   S +S P   P    P +P  + P     S          +Q A
Sbjct: 37  VETDRIISQCGRLIAGGSLSSTPMSTPCSSVPPSPSFSAPSPGSGS----------EQKA 86

Query: 123 TASKLLW-SSRGAKLSPFAPE-QPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAA 180
                 W +    +L+P A    PE+  + L + S Q    +DG   G    AAAA A A
Sbjct: 87  HLEDYYWMTGYPQQLNPEALGFSPEDAVEALISNSHQLQGGFDGYARGAQQLAAAAGAGA 146

Query: 181 AAA---SSPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGS 237
            A+   S     P   V S +        G+G    +H   A  H   P A A       
Sbjct: 147 GASLGGSGEEMGPAAAVVSAVIAAAAAQSGAGPHYHHHHHHAAGHHHHPTAGAP------ 200

Query: 238 GGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGG 297
             GAAG  AA  GGAG A                        G  +A G GG GG  GGG
Sbjct: 201 --GAAGSAAASAGGAGGAGGG---------------------GPASAGGGGGGGGGGGGG 237

Query: 298 SSLAAMGGREPQYSS 312
            +  A G   P +++
Sbjct: 238 GAAGAGGALHPHHAA 252



 Score = 47.4 bits (111), Expect = 4e-05
 Identities = 26/74 (35%), Positives = 37/74 (50%), Gaps = 6/74 (8%)

Query: 283 AAAGSGGAGGVSGGGSSLAAMGGREPQYSSLSAARPLNGTYHHHHHH----HHHHPSPYS 338
           AAAG+G    + G G  +           + +AA+   G ++HHHHH    HHHHP+  +
Sbjct: 140 AAAGAGAGASLGGSGEEMGPAAAVVSAVIAAAAAQSGAGPHYHHHHHHAAGHHHHPTAGA 199

Query: 339 PYVGAPLTPAWPAG 352
           P  GA  + A  AG
Sbjct: 200 P--GAAGSAAASAG 211



 Score = 42.4 bits (98), Expect = 0.001
 Identities = 27/74 (36%), Positives = 33/74 (44%), Gaps = 4/74 (5%)

Query: 273 NGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHH 332
           +G AR     AAA   GAG   GG      MG      S++ AA         H+HHHHH
Sbjct: 129 DGYARGAQQLAAAAGAGAGASLGGSGE--EMGPAAAVVSAVIAAAAAQSGAGPHYHHHHH 186

Query: 333 HPS--PYSPYVGAP 344
           H +   + P  GAP
Sbjct: 187 HAAGHHHHPTAGAP 200



 Score = 40.0 bits (92), Expect = 0.006
 Identities = 43/129 (33%), Positives = 50/129 (38%), Gaps = 17/129 (13%)

Query: 149 QTLAALSSQGPAAYDGA------PGGFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLP 202
           Q LAA +  G  A  G       P   V SA  AAAAA + + P Y       +   G  
Sbjct: 136 QQLAAAAGAGAGASLGGSGEEMGPAAAVVSAVIAAAAAQSGAGPHYHHHHHHAA---GHH 192

Query: 203 YH----LQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGG----AAGGGAAGPGGAGS 254
           +H      G+    A  AGGAG   G   ASA     G GGG    AAG G A      +
Sbjct: 193 HHPTAGAPGAAGSAAASAGGAGGAGGGGPASAGGGGGGGGGGGGGGAAGAGGALHPHHAA 252

Query: 255 AAAHVSARF 263
              H   RF
Sbjct: 253 GGLHFDDRF 261


>gi|5453736 v-maf musculoaponeurotic fibrosarcoma oncogene homolog
           isoform a [Homo sapiens]
          Length = 403

 Score = 57.8 bits (138), Expect = 3e-08
 Identities = 67/255 (26%), Positives = 91/255 (35%), Gaps = 46/255 (18%)

Query: 65  LDTEAAAGPPARSLLLSSYASHPFGAPHG--PSAPGVAGPGGNLSSWEDLLLFTDLDQAA 122
           ++T+       R +   S +S P   P    P +P  + P     S          +Q A
Sbjct: 37  VETDRIISQCGRLIAGGSLSSTPMSTPCSSVPPSPSFSAPSPGSGS----------EQKA 86

Query: 123 TASKLLW-SSRGAKLSPFAPE-QPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAA 180
                 W +    +L+P A    PE+  + L + S Q    +DG   G    AAAA A A
Sbjct: 87  HLEDYYWMTGYPQQLNPEALGFSPEDAVEALISNSHQLQGGFDGYARGAQQLAAAAGAGA 146

Query: 181 AAA---SSPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSPPYGS 237
            A+   S     P   V S +        G+G    +H   A  H   P A A       
Sbjct: 147 GASLGGSGEEMGPAAAVVSAVIAAAAAQSGAGPHYHHHHHHAAGHHHHPTAGAP------ 200

Query: 238 GGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGG 297
             GAAG  AA  GGAG A                        G  +A G GG GG  GGG
Sbjct: 201 --GAAGSAAASAGGAGGAGGG---------------------GPASAGGGGGGGGGGGGG 237

Query: 298 SSLAAMGGREPQYSS 312
            +  A G   P +++
Sbjct: 238 GAAGAGGALHPHHAA 252



 Score = 47.4 bits (111), Expect = 4e-05
 Identities = 26/74 (35%), Positives = 37/74 (50%), Gaps = 6/74 (8%)

Query: 283 AAAGSGGAGGVSGGGSSLAAMGGREPQYSSLSAARPLNGTYHHHHHH----HHHHPSPYS 338
           AAAG+G    + G G  +           + +AA+   G ++HHHHH    HHHHP+  +
Sbjct: 140 AAAGAGAGASLGGSGEEMGPAAAVVSAVIAAAAAQSGAGPHYHHHHHHAAGHHHHPTAGA 199

Query: 339 PYVGAPLTPAWPAG 352
           P  GA  + A  AG
Sbjct: 200 P--GAAGSAAASAG 211



 Score = 42.4 bits (98), Expect = 0.001
 Identities = 27/74 (36%), Positives = 33/74 (44%), Gaps = 4/74 (5%)

Query: 273 NGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQYSSLSAARPLNGTYHHHHHHHHH 332
           +G AR     AAA   GAG   GG      MG      S++ AA         H+HHHHH
Sbjct: 129 DGYARGAQQLAAAAGAGAGASLGGSGE--EMGPAAAVVSAVIAAAAAQSGAGPHYHHHHH 186

Query: 333 HPS--PYSPYVGAP 344
           H +   + P  GAP
Sbjct: 187 HAAGHHHHPTAGAP 200



 Score = 40.0 bits (92), Expect = 0.006
 Identities = 43/129 (33%), Positives = 50/129 (38%), Gaps = 17/129 (13%)

Query: 149 QTLAALSSQGPAAYDGA------PGGFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLP 202
           Q LAA +  G  A  G       P   V SA  AAAAA + + P Y       +   G  
Sbjct: 136 QQLAAAAGAGAGASLGGSGEEMGPAAAVVSAVIAAAAAQSGAGPHYHHHHHHAA---GHH 192

Query: 203 YH----LQGSGSGPANHAGGAGAHPGWPQASADSPPYGSGGG----AAGGGAAGPGGAGS 254
           +H      G+    A  AGGAG   G   ASA     G GGG    AAG G A      +
Sbjct: 193 HHPTAGAPGAAGSAAASAGGAGGAGGGGPASAGGGGGGGGGGGGGGAAGAGGALHPHHAA 252

Query: 255 AAAHVSARF 263
              H   RF
Sbjct: 253 GGLHFDDRF 261


>gi|4502951 collagen type III alpha 1 preproprotein [Homo sapiens]
          Length = 1466

 Score = 57.4 bits (137), Expect = 4e-08
 Identities = 85/315 (26%), Positives = 106/315 (33%), Gaps = 44/315 (13%)

Query: 15  GAAGADASDSRAFPAREPSTPPSPISSSSSSCSRG--GERGPGGASNC-------GTPQL 65
           GAAGA  +D       +P  PP P  ++    S G  GE GP G+          G P  
Sbjct: 315 GAAGARGNDGARGSDGQPG-PPGPPGTAGFPGSPGAKGEVGPAGSPGSNGAPGQRGEPGP 373

Query: 66  DTEAAA-GPPARSLLLSSYASHPFGAPHG-PSAPGVAG----PGGNLSSWEDLLLFTDLD 119
              A A GPP    +  S        P G P APG+ G    PG   ++    L     +
Sbjct: 374 QGHAGAQGPPGPPGINGSPGGKGEMGPAGIPGAPGLMGARGPPGPAGANGAPGLRGGAGE 433

Query: 120 QAATASKLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAA 179
                +K     RG +     P  P           ++G    DG+PG     A     A
Sbjct: 434 PGKNGAKGEPGPRGERGEAGIPGVP----------GAKGEDGKDGSPGE--PGANGLPGA 481

Query: 180 AAAASSPVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAGAHPGWPQASADSP-----P 234
           A    +P        G   P  P  + G   GPA   G  G  P  P+ +A  P     P
Sbjct: 482 AGERGAP--------GFRGPAGPNGIPGE-KGPAGERGAPG--PAGPRGAAGEPGRDGVP 530

Query: 235 YGSGGGAAGGGAAGPGGAGSAAAHVSARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVS 294
            G G     G   GPG  G      S      P PP  +G   +PG     G  G  G  
Sbjct: 531 GGPGMRGMPGSPGGPGSDGKPGPPGSQGESGRPGPPGPSGPRGQPGVMGFPGPKGNDGAP 590

Query: 295 GGGSSLAAMGGREPQ 309
           G        GG  PQ
Sbjct: 591 GKNGERGGPGGPGPQ 605



 Score = 54.3 bits (129), Expect = 3e-07
 Identities = 101/380 (26%), Positives = 126/380 (33%), Gaps = 79/380 (20%)

Query: 32   PSTPPSPISSSSSSCSRGGERGPGGASNCGTPQLD----TEAAAGPPARSLLLSSYASH- 86
            P+ PP P         RG   GPG A   G   L     +    GPP  S          
Sbjct: 850  PAGPPGPQGVKGE---RGSPGGPGAAGFPGARGLPGPPGSNGNPGPPGPSGSPGKDGPPG 906

Query: 87   PFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEE 146
            P G    P +PGV+GP G+                           G K SP        
Sbjct: 907  PAGNTGAPGSPGVSGPKGDA-----------------------GQPGEKGSP-------- 935

Query: 147  MYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQ 206
                     +QGP    GAPG       A    A   + P  +P  R GS  PG P  ++
Sbjct: 936  --------GAQGP---PGAPGPL---GIAGITGARGLAGPPGMPGPR-GS--PG-PQGVK 977

Query: 207  G-SGSGPANHAGGAGAHPGWPQ------ASADSPPYGSGGGAAG--GGAAGPGGAGSAAA 257
            G SG   AN   G    PG PQ       +A  P      G+ G  G    PGG G    
Sbjct: 978  GESGKPGANGLSGERGPPG-PQGLPGLAGTAGEPGRDGNPGSDGLPGRDGSPGGKGDRGE 1036

Query: 258  HVSARFPYS---PSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQYSSLS 314
            + S   P +   P PP   G A + G    +G  G  G  G   S  A G + P+     
Sbjct: 1037 NGSPGAPGAPGHPGPPGPVGPAGKSGDRGESGPAGPAGAPGPAGSRGAPGPQGPRGDKGE 1096

Query: 315  AARPLNGTYHHHHHHHHHHPSPYSP-------YVGAPLTPAWPAGPFETPVLHSLQSRAG 367
                       H     +  +P SP        +G+P  PA P GP            +G
Sbjct: 1097 TGERGAAGIKGHRGFPGNPGAPGSPGPAGQQGAIGSP-GPAGPRGPVGPSGPPGKDGTSG 1155

Query: 368  APLPV-PRGPSADLLEDLSE 386
             P P+ P GP  +  E  SE
Sbjct: 1156 HPGPIGPPGPRGNRGERGSE 1175



 Score = 53.5 bits (127), Expect = 5e-07
 Identities = 82/312 (26%), Positives = 105/312 (33%), Gaps = 64/312 (20%)

Query: 15  GAAGADASDSRAFP--AREPSTPPSPISSSSSSCSRGGERGPGGASNCGTPQLDTEAAA- 71
           G+ G   SD +  P  ++  S  P P   S      G    PG   N G P  + E    
Sbjct: 540 GSPGGPGSDGKPGPPGSQGESGRPGPPGPSGPRGQPGVMGFPGPKGNDGAPGKNGERGGP 599

Query: 72  ------GPPARSLLLSSYASHPFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATAS 125
                 GPP ++             P GP  PG  GPGG              D+  T  
Sbjct: 600 GGPGPQGPPGKN---------GETGPQGP--PGPTGPGG--------------DKGDTG- 633

Query: 126 KLLWSSRGAKLSPFAPEQPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASS 185
                          P  P+ +        + GP   +G PG       A A  A     
Sbjct: 634 ---------------PPGPQGLQ---GLPGTGGPPGENGKPGEPGPKGDAGAPGAPGGKG 675

Query: 186 PVYVPTTRVGSMLPGLPYHLQGSGSGPANHAGGAGA--HPGWPQASADSPPYGSGGGAAG 243
               P  R    L G P  L+G G+GP    GG GA   PG P A+      G  G   G
Sbjct: 676 DAGAPGERGPPGLAGAP-GLRG-GAGPPGPEGGKGAAGPPGPPGAAGTPGLQGMPGERGG 733

Query: 244 GGAAGPGG-----AGSAAAHVSAR-FPYSPSPPMA-NGAAREPGGYAAAGSGGAGGVSGG 296
            G+ GP G      G  A  V  +  P  P+ P+   G A +PG     G+ G  G++G 
Sbjct: 734 LGSPGPKGDKGEPGGPGADGVPGKDGPRGPTGPIGPPGPAGQPGDKGEGGAPGLPGIAGP 793

Query: 297 GSSLAAMGGREP 308
             S    G   P
Sbjct: 794 RGSPGERGETGP 805



 Score = 51.2 bits (121), Expect = 3e-06
 Identities = 89/358 (24%), Positives = 108/358 (30%), Gaps = 66/358 (18%)

Query: 10  LPKRFGAAGADASDSRAFPAREPSTPPSPISSSSSSCSRGGERGPGGASNCGTPQLDTEA 69
           LP   G  G +       P  +   P +P           GERGP G +  G P L   A
Sbjct: 643 LPGTGGPPGENGKPGEPGPKGDAGAPGAP--GGKGDAGAPGERGPPGLA--GAPGLRGGA 698

Query: 70  ----------AAGPPA----------------RSLLLSSYASHPFGAPHGPSAPGVAGPG 103
                     AAGPP                 R  L S       G P GP A GV G  
Sbjct: 699 GPPGPEGGKGAAGPPGPPGAAGTPGLQGMPGERGGLGSPGPKGDKGEPGGPGADGVPGKD 758

Query: 104 GNLSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPE------QPEEMYQTLAALSSQ 157
           G       +       Q           +G   +P  P        P E  +T       
Sbjct: 759 GPRGPTGPIGPPGPAGQPG--------DKGEGGAPGLPGIAGPRGSPGERGET----GPP 806

Query: 158 GPAAYDGAPG-----GFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQGS---- 208
           GPA + GAPG     G      A           V  P    G   P  P  ++G     
Sbjct: 807 GPAGFPGAPGQNGEPGGKGERGAPGEKGEGGPPGVAGPPGGSGPAGPPGPQGVKGERGSP 866

Query: 209 -GSGPANHAGGAGAHPGWPQASADSPPYGSGGGAAGGGAAGPGG-------AGSAAAHVS 260
            G G A   G  G  PG P ++ +  P G  G     G  GP G        G +     
Sbjct: 867 GGPGAAGFPGARGL-PGPPGSNGNPGPPGPSGSPGKDGPPGPAGNTGAPGSPGVSGPKGD 925

Query: 261 ARFPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMGGREPQYSSLSAARP 318
           A  P     P A G    PG    AG  GA G++G        G   PQ     + +P
Sbjct: 926 AGQPGEKGSPGAQGPPGAPGPLGIAGITGARGLAGPPGMPGPRGSPGPQGVKGESGKP 983



 Score = 47.4 bits (111), Expect = 4e-05
 Identities = 73/305 (23%), Positives = 87/305 (28%), Gaps = 60/305 (19%)

Query: 28  PAREPSTPPSPISSSSSSCSRGGERG----PGGASNCGTPQLDTEAAAGPPARSLLLSSY 83
           P  +P  P  P         R G+ G    PG   + G P +      GP   S    SY
Sbjct: 107 PKGDPGPPGIP--------GRNGDPGIPGQPGSPGSPGPPGICESCPTGPQNYSPQYDSY 158

Query: 84  ASH-------------PFGAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAATASKLLWS 130
                           P G P  P  PG +G  G+  S                      
Sbjct: 159 DVKSGVAVGGLAGYPGPAGPPGPPGPPGTSGHPGSPGS---------------------- 196

Query: 131 SRGAKLSPFAPEQ--PEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVY 188
             G +  P  P Q  P        A+   GPA  DG  G                  P  
Sbjct: 197 -PGYQGPPGEPGQAGPSGPPGPPGAIGPSGPAGKDGESGRPGRPGERGLPGPPGIKGPAG 255

Query: 189 VPTTRVGSMLPGLPYHLQGSG-SGPANHAGGAG--AHPGWPQASADSPPYGSGGGAAGGG 245
           +P        PG+  H    G +G     G  G     G P  +    P G  G     G
Sbjct: 256 IPG------FPGMKGHRGFDGRNGEKGETGAPGLKGENGLPGENGAPGPMGPRGAPGERG 309

Query: 246 AAGPGGAGSAAAHVSAR-FPYSPSPPMANGAAREPGGYAAAGSGGAGGVSGGGSSLAAMG 304
             G  GA  A  +  AR     P PP   G A  PG   A G  G  G  G   +    G
Sbjct: 310 RPGLPGAAGARGNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGAPGQRG 369

Query: 305 GREPQ 309
              PQ
Sbjct: 370 EPGPQ 374



 Score = 47.4 bits (111), Expect = 4e-05
 Identities = 80/320 (25%), Positives = 96/320 (30%), Gaps = 67/320 (20%)

Query: 15  GAAGADASDSRAFPAREPSTPPSPISSSSSSCS--RG--------GERGPGGASNCGTPQ 64
           G  GA   D +     EP     P ++        RG        GE+GP G      P 
Sbjct: 456 GVPGAKGEDGKDGSPGEPGANGLPGAAGERGAPGFRGPAGPNGIPGEKGPAGERGAPGPA 515

Query: 65  LDTEAAAGPPARSLLLSSYASHPF-GAPHGPSAPGVAGPGGNLSSWEDLLLFTDLDQAAT 123
                AAG P R  +          G+P GP + G  GP G+                  
Sbjct: 516 -GPRGAAGEPGRDGVPGGPGMRGMPGSPGGPGSDGKPGPPGSQGE--------------- 559

Query: 124 ASKLLWSSRGAKLSPFAPE-QPEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAA 182
                 S R     P  P  QP         +   GP   DGAPG               
Sbjct: 560 ------SGRPGPPGPSGPRGQP-------GVMGFPGPKGNDGAPGKNGERGGPGGPGPQG 606

Query: 183 ASSPVYVPTTRVGSMLPGLPYHLQGSG-----SGPANHAGGAGAHPGWPQASADSPPYGS 237
                  P  + G   P  P    G G     +GP    G  G  PG      ++   G 
Sbjct: 607 -------PPGKNGETGPQGPPGPTGPGGDKGDTGPPGPQGLQGL-PGTGGPPGENGKPGE 658

Query: 238 GGGAAGGGAAG-PGGAGSAAAHVSARFP------------YSPSPPMANGAAREPGGYAA 284
            G     GA G PGG G A A      P              P P    GAA  PG   A
Sbjct: 659 PGPKGDAGAPGAPGGKGDAGAPGERGPPGLAGAPGLRGGAGPPGPEGGKGAAGPPGPPGA 718

Query: 285 AGSGGAGGVSGGGSSLAAMG 304
           AG+ G  G+ G    L + G
Sbjct: 719 AGTPGLQGMPGERGGLGSPG 738



 Score = 46.2 bits (108), Expect = 8e-05
 Identities = 86/333 (25%), Positives = 103/333 (30%), Gaps = 41/333 (12%)

Query: 15   GAAGADASDSRAFPAREPSTPPSP-ISSSSSSCSRGGERG-PGGASNCGTPQ-------L 65
            G +G+   D    PA     P SP +S       + GE+G PG     G P         
Sbjct: 894  GPSGSPGKDGPPGPAGNTGAPGSPGVSGPKGDAGQPGEKGSPGAQGPPGAPGPLGIAGIT 953

Query: 66   DTEAAAGPPARSLLLSSYA----------------SHPFGAPHGPSAPGVAG----PGGN 105
                 AGPP       S                  S   G P     PG+AG    PG +
Sbjct: 954  GARGLAGPPGMPGPRGSPGPQGVKGESGKPGANGLSGERGPPGPQGLPGLAGTAGEPGRD 1013

Query: 106  LSSWEDLLLFTDLDQAATASKLLWSSRGAKLSPFAPEQPEEMYQTLAA--LSSQGPAAYD 163
             +   D L   D        +    S GA  +P  P  P  +     +      GPA   
Sbjct: 1014 GNPGSDGLPGRDGSPGGKGDRGENGSPGAPGAPGHPGPPGPVGPAGKSGDRGESGPAGPA 1073

Query: 164  GAPGGFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPYHLQGSGS-GPANHAGGAGAH 222
            GAPG      A                         G P +    GS GPA   G  G+ 
Sbjct: 1074 GAPGPAGSRGAPGPQGPRGDKGETGERGAAGIKGHRGFPGNPGAPGSPGPAGQQGAIGSP 1133

Query: 223  -PGWPQASAD-SPPYGSGGGAAGGGAAGP-GGAGSAAAHVSARFPYSPSPPMANGAAREP 279
             P  P+     S P G  G +   G  GP G  G+     S   P  P  P   G    P
Sbjct: 1134 GPAGPRGPVGPSGPPGKDGTSGHPGPIGPPGPRGNRGERGSEGSPGHPGQPGPPGPPGAP 1193

Query: 280  ----GGYAAAGSGGAGGVSGGGSSLAAMGGREP 308
                GG  AA   G GG   GG   A   G EP
Sbjct: 1194 GPCCGGVGAAAIAGIGGEKAGG--FAPYYGDEP 1224



 Score = 36.2 bits (82), Expect = 0.088
 Identities = 52/182 (28%), Positives = 60/182 (32%), Gaps = 29/182 (15%)

Query: 144 PEEMYQTLAALSSQGPAAYDGAPGGFVHSAAAAAAAAAAASSPVYVPTTRVGSMLPGLPY 203
           PE  +    A+  Q P A    P G                 P  +P       +PG P 
Sbjct: 77  PEIPFGECCAVCPQPPTAPTRPPNG------QGPQGPKGDPGPPGIPGRNGDPGIPGQP- 129

Query: 204 HLQGSGSGPANHAGGAGAHPGWPQASADSPPYGS----GGGAAGGGAAGPGGAGSAAAHV 259
              GS   P    G   + P  PQ    SP Y S     G A GG A  PG AG      
Sbjct: 130 ---GSPGSPGP-PGICESCPTGPQNY--SPQYDSYDVKSGVAVGGLAGYPGPAGP----- 178

Query: 260 SARFPYSPSPPMANGAAREPG--GYAAA-GSGGAGGVSGGGSSLAAMGGREPQYSSLSAA 316
               P  P PP  +G    PG  GY    G  G  G SG      A+G   P      + 
Sbjct: 179 ----PGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPSGPPGPPGAIGPSGPAGKDGESG 234

Query: 317 RP 318
           RP
Sbjct: 235 RP 236


  Database: hs.faa
    Posted date:  Aug 4, 2009  4:42 PM
  Number of letters in database: 18,247,518
  Number of sequences in database:  37,866
  
Lambda     K      H
   0.311    0.128    0.400 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 30,847,413
Number of Sequences: 37866
Number of extensions: 1978544
Number of successful extensions: 31343
Number of sequences better than 10.0: 30
Number of HSP's better than 10.0 without gapping: 502
Number of HSP's successfully gapped in prelim test: 1067
Number of HSP's that attempted gapping in prelim test: 13268
Number of HSP's gapped (non-prelim): 9469
length of query: 595
length of database: 18,247,518
effective HSP length: 108
effective length of query: 487
effective length of database: 14,157,990
effective search space: 6894941130
effective search space used: 6894941130
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 65 (29.6 bits)

Search results were obtained with NCBI BLAST and RefSeq entries.


Home | Table of Contents | Search text | Search genes | Search sequences | Purchase | FAQ | Blog | Help

Guide to the Human Genome
Copyright © 2010 by Stewart Scherer. All rights reserved.

CSHL Press