BLASTP 2.2.11 [Jun-05-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= gi|58331216 ABO blood group (alpha
1-3-N-acetylgalactosaminyltransferase, alpha 1-3-galactosyltransferase)
[Homo sapiens]
         (354 letters)

Database: hs.faa
           37,866 sequences; 18,247,518 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gi|58331216 ABO blood group (alpha 1-3-N-acetylgalactosaminyltra...   728   0.0
gi|32484985 globoside alpha-1,3-N-acetylgalactosaminyltransferas...   292   4e-79
gi|122937275 alpha 1,3-galactosyltransferase 2 [Homo sapiens]         229   2e-60
gi|134288859 glycosyltransferase 6 domain containing 1 [Homo sap...   156   2e-38
gi|239745032 PREDICTED: hypothetical protein XP_002343336 [Homo ...    29   5.6
gi|189083844 cathepsin C isoform a preproprotein [Homo sapiens]        28   9.5

>gi|58331216 ABO blood group (alpha 1-3-N-acetylgalactosaminyltransferase,
            alpha 1-3-galactosyltransferase) [Homo sapiens]
          Length = 354

 Score = 728 bits (1879), Expect = 0.0
 Identities = 354/354 (100%), Positives = 354/354 (100%)

Query: 1   MAEVLRTLAGKPKCHALRPMILFLIMLVLVLFGYGVLSPRSLMPGSLERGFCMAVREPDH 60
           MAEVLRTLAGKPKCHALRPMILFLIMLVLVLFGYGVLSPRSLMPGSLERGFCMAVREPDH
Sbjct: 1   MAEVLRTLAGKPKCHALRPMILFLIMLVLVLFGYGVLSPRSLMPGSLERGFCMAVREPDH 60

Query: 61  LQRVSLPRMVYPQPKVLTPCRKDVLVVTPWLAPIVWEGTFNIDILNEQFRLQNTTIGLTV 120
           LQRVSLPRMVYPQPKVLTPCRKDVLVVTPWLAPIVWEGTFNIDILNEQFRLQNTTIGLTV
Sbjct: 61  LQRVSLPRMVYPQPKVLTPCRKDVLVVTPWLAPIVWEGTFNIDILNEQFRLQNTTIGLTV 120

Query: 121 FAIKKYVAFLKLFLETAEKHFMVGHRVHYYVFTDQPAAVPRVTLGTGRQLSVLEVRAYKR 180
           FAIKKYVAFLKLFLETAEKHFMVGHRVHYYVFTDQPAAVPRVTLGTGRQLSVLEVRAYKR
Sbjct: 121 FAIKKYVAFLKLFLETAEKHFMVGHRVHYYVFTDQPAAVPRVTLGTGRQLSVLEVRAYKR 180

Query: 181 WQDVSMRRMEMISDFCERRFLSEVDYLVCVDVDMEFRDHVGVEILTPLFGTLHPGFYGSS 240
           WQDVSMRRMEMISDFCERRFLSEVDYLVCVDVDMEFRDHVGVEILTPLFGTLHPGFYGSS
Sbjct: 181 WQDVSMRRMEMISDFCERRFLSEVDYLVCVDVDMEFRDHVGVEILTPLFGTLHPGFYGSS 240

Query: 241 REAFTYERRPQSQAYIPKDEGDFYYLGGFFGGSVQEVQRLTRACHQAMMVDQANGIEAVW 300
           REAFTYERRPQSQAYIPKDEGDFYYLGGFFGGSVQEVQRLTRACHQAMMVDQANGIEAVW
Sbjct: 241 REAFTYERRPQSQAYIPKDEGDFYYLGGFFGGSVQEVQRLTRACHQAMMVDQANGIEAVW 300

Query: 301 HDESHLNKYLLRHKPTKVLSPEYLWDQQLLGWPAVLRKLRFTAVPKNHQAVRNP 354
           HDESHLNKYLLRHKPTKVLSPEYLWDQQLLGWPAVLRKLRFTAVPKNHQAVRNP
Sbjct: 301 HDESHLNKYLLRHKPTKVLSPEYLWDQQLLGWPAVLRKLRFTAVPKNHQAVRNP 354

>gi|32484985 globoside alpha-1,3-N-acetylgalactosaminyltransferase 1
            [Homo sapiens]
          Length = 347

 Score = 292 bits (747), Expect = 4e-79
 Identities = 143/283 (50%), Positives = 189/283 (66%), Gaps = 1/283 (0%)

Query: 71  YPQPKVLTPCRKDVLVVTPWLAPIVWEGTFNIDILNEQFRLQNTTIGLTVFAIKKYVAFL 130
           YPQPK+L      +L +TPWLAPIV EGTFN ++L   ++  N TIG+TVFA+ KY  F+
Sbjct: 66  YPQPKLLEHRPTQLLTLTPWLAPIVSEGTFNPELLQHIYQPLNLTIGVTVFAVGKYTHFI 125

Query: 131 KLFLETAEKHFMVGHRVHYYVFTDQPAAVPRVTLGTGRQLSVLEVRAYKRWQDVSMRRME 190
           + FLE+AE+ FM G+RVHYY+FTD PAAVP V LG  R LS + ++ +  W++ SMRRME
Sbjct: 126 QSFLESAEEFFMRGYRVHYYIFTDNPAAVPGVPLGPHRLLSSIPIQGHSHWEETSMRRME 185

Query: 191 MISDFCERRFLSEVDYLVCVDVDMEFRDHVGVEILTPLFGTLHPGFYGSSREAFTYERRP 250
            IS    +R   EVDYL C+DVDM FR+  G E L  L   +HP +Y   R+ F YERR
Sbjct: 186 TISQHIAKRAHREVDYLFCLDVDMVFRNPWGPETLGDLVAAIHPSYYAVPRQQFPYERRR 245

Query: 251 QSQAYIPKDEGDFYYLGGFFGGSVQEVQRLTRACHQAMMVDQANGIEAVWHDESHLNKYL 310
            S A++   EGDFYY G  FGG V  V   TR CH A++ D+ANGI A W +ESHLN++
Sbjct: 246 VSTAFVADSEGDFYYGGAVFGGQVARVYEFTRGCHMAILADKANGIMAAWREESHLNRHF 305

Query: 311 LRHKPTKVLSPEYLWDQQLLGWPAVLRKLRFTAVPKNHQAVRN 353
           + +KP+KVLSPEYLWD +    P  L+ +RF+ + K+   +R+
Sbjct: 306 ISNKPSKVLSPEYLWDDR-KPQPPSLKLIRFSTLDKDISCLRS 347

>gi|122937275 alpha 1,3-galactosyltransferase 2 [Homo sapiens]
          Length = 340

 Score = 229 bits (585), Expect = 2e-60
 Identities = 123/274 (44%), Positives = 161/274 (58%), Gaps = 2/274 (0%)

Query: 81  RKDVLVVTPWLAPIVWEGTFNIDILNEQFRLQNTTIGLTVFAIKKYVA-FLKLFLETAEK 139
           R +VL  TPW API+W+G+F+ D+  ++ R QN TIGLT+FA+ +Y+  +L+ FLETAE+
Sbjct: 68  RPEVLTCTPWGAPIIWDGSFDPDVAKQEARQQNLTIGLTIFAVGRYLEKYLERFLETAEQ 127

Query: 140 HFMVGHRVHYYVFTDQPAAVPRVTLGTGRQLSVLEVRAYKRWQDVSMRRMEMISDFCERR 199
           HFM G  V YYVFT+ P AVPRV LG GR+L V  V   +RWQDVSM RM  +
Sbjct: 128 HFMAGQSVMYYVFTELPGAVPRVALGPGRRLPVERVARERRWQDVSMARMRTLHAALGGL 187

Query: 200 FLSEVDYLVCVDVDMEFRDHVGVEILTPLFGTLHPGFYGSSREAFTYERRPQSQAYIPKD 259
              E  ++ C+DVD  F    G E L      LH   Y        +ER   S A +
Sbjct: 188 PGREAHFMFCMDVDQHFSGTFGPEALAESVAQLHSWHYHWPSWLLPFERDAHSAAAMAWG 247

Query: 260 EGDFYYLGGFFGGSVQEVQRLTRACHQAMMVDQANGIEAVWHDESHLNKYLLRHKPTKVL 319
           +GDFY     FGGSV  ++ LT  C   +  D+A G+EA WHDESHLNK+   HKP KVL
Sbjct: 248 QGDFYNHAAVFGGSVAALRGLTAHCAGGLDWDRARGLEARWHDESHLNKFFWLHKPAKVL 307

Query: 320 SPEYLWDQQLLGWPAVLRKLRFTAVPKNHQAVRN 353
           SPE+ W    +G  A +R+ R    PK ++ +RN
Sbjct: 308 SPEFCWSPD-IGPRAEIRRPRLLWAPKGYRLLRN 340

>gi|134288859 glycosyltransferase 6 domain containing 1 [Homo sapiens]
          Length = 276

 Score = 156 bits (395), Expect = 2e-38
 Identities = 94/292 (32%), Positives = 148/292 (50%), Gaps = 22/292 (7%)

Query: 26  MLVLVLFGYGVLSPRSLMPGSLERGFCMAVREPDHLQRVSLPRMVYPQPKVLTPCRKDVL 85
           ML+LVLF + ++         +ER F         ++ + L    +P+       R DV+
Sbjct: 6   MLLLVLFAFSLML--------VERYF-----RNHQVEELRLSDWFHPRK------RPDVI 46

Query: 86  VVTPWLAPIVWEGTFNIDILNEQFRLQNTTIGLTVFAIKKYVA-FLKLFLETAEKHFMVG 144
             T WLAP++WEGTF+  +L + +R +N T+GL VFA  ++   +L+ FL +A KHFM G
Sbjct: 47  TKTDWLAPVLWEGTFDRRVLEKHYRRRNITVGLAVFATGRFAEEYLRPFLHSANKHFMTG 106

Query: 145 HRVHYYVFTDQPAAVPRVTLGTGRQLSVLEVRAYKRWQDVSMRRMEMISDFCERRFLSEV 204
           +RV +Y+  D    +P +     R     +V   + W D  +  ++ + +        EV
Sbjct: 107 YRVIFYIMVDAFFKLPDIEPSPLRTFKAFKVGTERWWLDGPLVHVKSLGEHIASHIQDEV 166

Query: 205 DYLVCVDVDMEFRDHVGVEILTPLFGTLHPGFYGSSREAFTYERRPQSQAYIPKDEGDFY 264
           D+L  +  +  F++  GVE L PL   LH  +Y  + + F YERRP S A IP  +GDFY
Sbjct: 167 DFLFSMAANQVFQNEFGVETLGPLVAQLHAWWYFRNTKNFPYERRPTSAACIPFGQGDFY 226

Query: 265 YLGGFFGGSVQEVQRLTRACHQAMMVDQANGIEAVWHDESHLNKYLLRHKPT 316
           Y     GG+   +    +     ++ D  NG+ + +  E HLNKY   +KPT
Sbjct: 227 YGNLMVGGTPHNILDFIKEYLNGVIHDIKNGLNSTY--EKHLNKYFYLNKPT 276

>gi|239745032 PREDICTED: hypothetical protein XP_002343336 [Homo sapiens]
          Length = 828

 Score = 29.3 bits (64), Expect = 5.6
 Identities = 20/61 (32%), Positives = 29/61 (47%)

Query: 127 VAFLKLFLETAEKHFMVGHRVHYYVFTDQPAAVPRVTLGTGRQLSVLEVRAYKRWQDVSM 186
           VA L  FLETA    MVG R+  +  T +P  + R    +G  + V E   ++    +S
Sbjct: 283 VAVLAAFLETARASAMVGLRLAQHNSTLRPEKLVRAGAASGSLVPVKEPARFRGMARMSR 342

Query: 187 R 187
           R
Sbjct: 343 R 343

>gi|189083844 cathepsin C isoform a preproprotein [Homo sapiens]
          Length = 463

 Score = 28.5 bits (62), Expect = 9.5
 Identities = 15/52 (28%), Positives = 25/52 (48%), Gaps = 5/52 (9%)

Query: 262 DFYYLGGFFGGSVQEVQRLTRACHQAMMVDQANGIEAVWHDESHLNKYLLRH 313
           +++Y+GGF+GG  + + +L    H  M V        V+ D  H  K +  H
Sbjct: 344 EYHYVGGFYGGCNEALMKLELVHHGPMAV-----AFEVYDDFLHYKKGIYHH 390

  Database: hs.faa
    Posted date: Aug 4, 2009 4:42 PM
  Number of letters in database: 18,247,518
  Number of sequences in database: 37,866

Lambda     K      H
   0.327    0.141    0.444

Gapped
Lambda     K      H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 13,454,839
Number of Sequences: 37866
Number of extensions: 603520
Number of successful extensions: 1467
Number of sequences better than 10.0: 6
Number of HSP's better than 10.0 without gapping: 5
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 1458
Number of HSP's gapped (non-prelim): 6
length of query: 354
length of database: 18,247,518
effective HSP length: 103
effective length of query: 251
effective length of database: 14,347,320
effective search space: 3601177320
effective search space used: 3601177320
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 62 (28.5 bits)
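
The statistics that close the report are enough to re-derive the effective search space and, approximately, the bit scores and E-values listed for each hit. The Python sketch below assumes the standard Karlin-Altschul relations: bit score S' = (lambda * S - ln K) / ln 2, and E = (effective search space) * 2^(-S'). The function and constant names are illustrative only and are not part of BLAST. The gapped Lambda and K are used as printed in the report, where they are rounded, so the recomputed values should be read as approximate.

```python
import math

# Figures copied from the statistics at the end of the report.
QUERY_LENGTH = 354
DB_LETTERS = 18_247_518
DB_SEQUENCES = 37_866
EFFECTIVE_HSP_LENGTH = 103
GAPPED_LAMBDA = 0.267   # rounded as printed in the report
GAPPED_K = 0.0410       # rounded as printed in the report


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Effective query length times effective database length."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized score in bits."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Expected number of chance alignments scoring at least raw_score."""
    return search_space * 2.0 ** (-bit_score(raw_score, lam, k))


SEARCH_SPACE = effective_search_space(
    QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
print("effective search space:", SEARCH_SPACE)

# Raw scores are the values in parentheses on each "Score =" line.
for raw in (1879, 747, 585, 395, 64, 62):
    bits = bit_score(raw)
    evalue = expect_value(raw, SEARCH_SPACE)
    print(f"raw={raw:5d}  bits={bits:7.1f}  E={evalue:.1e}")
```

The recomputed search space matches the reported 3601177320 exactly, since it involves only the integers from the report. The bit scores and E-values land within about one bit and one significant digit of the reported figures, which is as close as the rounded Lambda and K allow.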
Search results were obtained with NCBI BLAST and RefSeq entries.
Guide to the Human Genome
Copyright © 2010 by Stewart Scherer. All rights reserved.