BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy11642
(372 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|157106440|ref|XP_001649323.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108879843|gb|EAT44068.1| AAEL004538-PA [Aedes aegypti]
Length = 596
Score = 580 bits (1494), Expect = e-163, Method: Compositional matrix adjust.
Identities = 272/363 (74%), Positives = 309/363 (85%)
Query: 10 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
LGN EP ++GPGEGGKAY LPE + + EYGMN+ S+ IS DRTI D R+
Sbjct: 75 LGNFEPKEVDRRDGPGEGGKAYILPEDQQNRASDAEMEYGMNIVVSDTISLDRTIRDTRL 134
Query: 70 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
EECK+WDYP +LP SVI+VFHNEGFS LMRTVHS++ R+P L EIILVDDFS K DL
Sbjct: 135 EECKHWDYPHNLPTTSVIIVFHNEGFSVLMRTVHSVLNRSPKHVLHEIILVDDFSDKEDL 194
Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
+KLE+YI+RF+GKV+LIRN EREGLIRTRSRGAKE+ GEVIV+LDAHCEV NWLPPLL
Sbjct: 195 KEKLENYIERFDGKVKLIRNVEREGLIRTRSRGAKEATGEVIVYLDAHCEVNTNWLPPLL 254
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
APIY DR +MTVPVIDGID++T+E+R VY HHYRGIFEWGMLYKENE+P RE K+RK+
Sbjct: 255 APIYRDRTVMTVPVIDGIDHKTFEYRPVYADGHHYRGIFEWGMLYKENEVPRREQKRRKH 314
Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
+SEPYKSPTHAGGLFA++R FFLE+G YDPGLLVWGGENFELSFKIW CGGSIEWVPCSR
Sbjct: 315 DSEPYKSPTHAGGLFAINREFFLEIGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPCSR 374
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHVYR FMPYNFGKLA++ KGPLIT NYKRVIETWFDE++K YFYTREPLA FLDMGDI
Sbjct: 375 VGHVYRGFMPYNFGKLANKKKGPLITINYKRVIETWFDEQYKEYFYTREPLARFLDMGDI 434
Query: 370 SEQ 372
SEQ
Sbjct: 435 SEQ 437
>gi|193683588|ref|XP_001951150.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Acyrthosiphon
pisum]
Length = 588
Score = 576 bits (1485), Expect = e-162, Method: Compositional matrix adjust.
Identities = 272/371 (73%), Positives = 310/371 (83%), Gaps = 1/371 (0%)
Query: 2 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
P+++ D GN E K GPGE GKA+H+P SL EYGMNM S+ IS +
Sbjct: 58 PIYR-DQIFGNFEYSTSTNKPGPGEKGKAHHVPSDRENEALQSLSEYGMNMACSDDISLN 116
Query: 62 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
R+IPD R EECKYW YP LP+ SVI+VFHNEG+SSL+RTVHSI+ RTP Q+LEEI+LVD
Sbjct: 117 RSIPDHREEECKYWTYPEQLPRTSVIIVFHNEGWSSLLRTVHSILNRTPPQFLEEILLVD 176
Query: 122 DFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 181
DFSSK +L +KLE YI++FNGKVRLIRN+EREGLIRTRS+GA +RGEVI+FLDAHCEVG
Sbjct: 177 DFSSKENLKKKLEYYIEKFNGKVRLIRNSEREGLIRTRSKGASNARGEVILFLDAHCEVG 236
Query: 182 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 241
NWLPPL+API DRKIMTVPVIDGID+ TWE+R VYE DH +RGIFEWGMLYKE E+P
Sbjct: 237 YNWLPPLIAPIARDRKIMTVPVIDGIDHNTWEYRPVYEKDHLFRGIFEWGMLYKEIEIPA 296
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
+E +KR Y SEPYKSPTHAGGLFA+DR +FLELG YDPGLLVWGGENFELSFKIW CGGS
Sbjct: 297 QEERKRIYKSEPYKSPTHAGGLFAIDRNYFLELGAYDPGLLVWGGENFELSFKIWQCGGS 356
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
IEWVPCSR+GHVYR FMPYNFG+L +VKGPLITYNYKRVIETWFD KHK +FYTREPLA
Sbjct: 357 IEWVPCSRVGHVYRGFMPYNFGELGKKVKGPLITYNYKRVIETWFDNKHKEFFYTREPLA 416
Query: 362 MFLDMGDISEQ 372
+LDMGDIS+Q
Sbjct: 417 RYLDMGDISKQ 427
>gi|91081797|ref|XP_973938.1| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
castaneum]
gi|270006291|gb|EFA02739.1| hypothetical protein TcasGA2_TC008465 [Tribolium castaneum]
Length = 583
Score = 568 bits (1465), Expect = e-159, Method: Compositional matrix adjust.
Identities = 267/372 (71%), Positives = 310/372 (83%), Gaps = 2/372 (0%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
RP +D LGN EP EGPGEGGK +HL + + D S EYGMN+ S+ IS
Sbjct: 55 RPKLVSD--LGNFEPRDSQEHEGPGEGGKPHHLRQDQQNDADQSESEYGMNVACSDEISL 112
Query: 61 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
DRTI D R+ ECK+W+YP +LP SVI+VFHNEG+S L+RTVHS+I R+P + L+E++LV
Sbjct: 113 DRTILDTRLSECKHWNYPENLPSTSVIIVFHNEGWSVLLRTVHSVINRSPPKILKEVLLV 172
Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
DDFS K +L +LE YI+RFNGKVRLIRN +REGLIRTRSRGAKE+ GEVIVFLDAHCEV
Sbjct: 173 DDFSDKENLKTRLETYIERFNGKVRLIRNAQREGLIRTRSRGAKEATGEVIVFLDAHCEV 232
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
NWLPPLLAPIY DR +MTVPVIDGID++T+E+R VY D H+RGIFEWGMLYKENE+P
Sbjct: 233 NTNWLPPLLAPIYRDRSVMTVPVIDGIDHKTFEYRPVYGEDRHFRGIFEWGMLYKENEVP 292
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
++E RK+NSEPYKSPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 293 QKELNTRKHNSEPYKSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 352
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
SIEWVPCSR+GHVYRSFMPYNFGKLA + KGPLIT NYKRVIETWFD+K+K +FYTREP+
Sbjct: 353 SIEWVPCSRVGHVYRSFMPYNFGKLAQKKKGPLITINYKRVIETWFDDKYKEFFYTREPM 412
Query: 361 AMFLDMGDISEQ 372
A FLDMGDISEQ
Sbjct: 413 ARFLDMGDISEQ 424
>gi|158289457|ref|XP_311182.4| AGAP000656-PA [Anopheles gambiae str. PEST]
gi|157018524|gb|EAA06901.4| AGAP000656-PA [Anopheles gambiae str. PEST]
Length = 598
Score = 568 bits (1464), Expect = e-159, Method: Compositional matrix adjust.
Identities = 267/363 (73%), Positives = 305/363 (84%)
Query: 10 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
LGN EP +P +GPGEGGKAY LPE + + EYGMN+ S+ IS DRTI D R+
Sbjct: 77 LGNFEPADKPMVDGPGEGGKAYVLPEDQQNRATDAEMEYGMNIVVSDAISLDRTIKDTRL 136
Query: 70 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
EECK+WDYP LP+ SV++VFHNEGFS LMRTVHS++ R+P L EIILVDD+S K DL
Sbjct: 137 EECKHWDYPYHLPRTSVVIVFHNEGFSVLMRTVHSVLNRSPKHLLHEIILVDDYSDKEDL 196
Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
KLE YI+RF+G VRLIRN+EREGLIRTRSRGAKE+ GEVIV+LDAHCEV NWLPPLL
Sbjct: 197 KGKLERYIERFDGMVRLIRNSEREGLIRTRSRGAKEATGEVIVYLDAHCEVNTNWLPPLL 256
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
API+ DR +MTVP+IDGID++T+E+R VY HHYRGIFEWGMLYKENE+P RE K+RK+
Sbjct: 257 APIHRDRTVMTVPIIDGIDHKTFEYRPVYADGHHYRGIFEWGMLYKENEVPRREQKRRKH 316
Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
+SEPY+SPTHAGGLFA++R FFLELG YD GLLVWGGENFELSFKIW CGGSIEWVPCSR
Sbjct: 317 DSEPYRSPTHAGGLFAINRKFFLELGAYDSGLLVWGGENFELSFKIWQCGGSIEWVPCSR 376
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHVYR FMPYNFGKLA++ KGPLIT NYKRVIETWFD +K YFYTREPLA FLDMGDI
Sbjct: 377 VGHVYRGFMPYNFGKLANKKKGPLITINYKRVIETWFDGPYKEYFYTREPLARFLDMGDI 436
Query: 370 SEQ 372
SEQ
Sbjct: 437 SEQ 439
>gi|312383497|gb|EFR28562.1| hypothetical protein AND_03374 [Anopheles darlingi]
Length = 874
Score = 564 bits (1454), Expect = e-158, Method: Compositional matrix adjust.
Identities = 264/363 (72%), Positives = 301/363 (82%)
Query: 10 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
LGN EP +EGPGEGG+AY LPE + + EYGMN+ S+ IS DRTI D R+
Sbjct: 75 LGNFEPHEPTVREGPGEGGRAYVLPEDQQNQATDAEMEYGMNIVVSDAISLDRTIRDTRL 134
Query: 70 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
EECK+WDYP LPK SVI+VFHNEGFS LMRTVHS++ R+P L EIILVDD+S K DL
Sbjct: 135 EECKHWDYPYHLPKTSVIIVFHNEGFSVLMRTVHSVLNRSPKHLLHEIILVDDYSDKEDL 194
Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
KLE YI+RF V+LIRN+EREGLIRTRSRGA E+ GEVIV+LDAHCEV NWLPPLL
Sbjct: 195 RGKLERYIERFGSLVKLIRNSEREGLIRTRSRGAHEATGEVIVYLDAHCEVNTNWLPPLL 254
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
API+ DR +MTVP+IDGID++T+E+R VY HHYRGIFEWGMLYKENE+P RE K+RK+
Sbjct: 255 APIHRDRTVMTVPIIDGIDHKTFEYRPVYADGHHYRGIFEWGMLYKENEVPRREQKRRKH 314
Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
+SEPY+SPTHAGGLFA++R FFL+LG YD GLLVWGGENFELSFKIW CGGSIEWVPCSR
Sbjct: 315 DSEPYRSPTHAGGLFAINRKFFLDLGAYDSGLLVWGGENFELSFKIWQCGGSIEWVPCSR 374
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFDE +K YFYTREPLA +LDMGDI
Sbjct: 375 VGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDEPYKEYFYTREPLAQYLDMGDI 434
Query: 370 SEQ 372
SEQ
Sbjct: 435 SEQ 437
>gi|195039904|ref|XP_001990971.1| GH12336 [Drosophila grimshawi]
gi|193900729|gb|EDV99595.1| GH12336 [Drosophila grimshawi]
Length = 591
Score = 561 bits (1446), Expect = e-157, Method: Compositional matrix adjust.
Identities = 265/372 (71%), Positives = 305/372 (81%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
R V K LGN EP + GPGE G AY LP + DAS EYGMN+ S+ IS
Sbjct: 61 REVPKLVEGLGNFEPKDLKPRTGPGENGDAYTLPPEKKNVADASEMEYGMNIACSDDISM 120
Query: 61 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
R++ + R+EECK+WDYP DLP SVI+VFHNEGFS LMRTVHS+I R+P L EIILV
Sbjct: 121 HRSVRETRLEECKHWDYPYDLPPTSVIIVFHNEGFSVLMRTVHSVIDRSPKHMLHEIILV 180
Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
DDFS K +L KL+DY+Q+FNG V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDFSDKENLRSKLDDYVQQFNGLVKIIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
LNWLPPLLAPIY DR +MTVP+IDGID++T+E+R VY D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NLNWLPPLLAPIYRDRTVMTVPIIDGIDHKTFEYRPVYGSDNHFRGIFEWGMLYKENEVP 300
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 301 RREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 360
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
SIEWVPCSR+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+ HK +FYTREPL
Sbjct: 361 SIEWVPCSRVGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEFFYTREPL 420
Query: 361 AMFLDMGDISEQ 372
A +LDMGDISEQ
Sbjct: 421 ARYLDMGDISEQ 432
>gi|195447414|ref|XP_002071203.1| GK25256 [Drosophila willistoni]
gi|194167288|gb|EDW82189.1| GK25256 [Drosophila willistoni]
Length = 587
Score = 557 bits (1436), Expect = e-156, Method: Compositional matrix adjust.
Identities = 264/372 (70%), Positives = 305/372 (81%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
R V K LGN EP + GPGE G A+ L + A DAS EYGMN+ S+ IS
Sbjct: 57 REVPKLIEGLGNFEPKDLKPRSGPGENGDAHVLNANKKNAADASEMEYGMNIACSDDISM 116
Query: 61 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
R++ D R+EECK+WDYP DLP SVI+VFHNEGFS LMRTVHS+I R+P L EIILV
Sbjct: 117 HRSVRDTRLEECKHWDYPYDLPPTSVIIVFHNEGFSVLMRTVHSVIDRSPKHMLHEIILV 176
Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
DDFS K +L KL++YI +F+G V++IRN EREGLIRTRSRGAKE+ GEVIVFLDAHCEV
Sbjct: 177 DDFSDKENLKAKLDEYILQFDGLVKIIRNKEREGLIRTRSRGAKEATGEVIVFLDAHCEV 236
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
LNWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P
Sbjct: 237 NLNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 296
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 297 RREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 356
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
SIEWVPCSR+GHVYR FMPYNFGKLA++ KGPLIT NYKRVIETWFDE HK YFYTREPL
Sbjct: 357 SIEWVPCSRVGHVYRGFMPYNFGKLANKKKGPLITINYKRVIETWFDETHKEYFYTREPL 416
Query: 361 AMFLDMGDISEQ 372
A +LDMGDI+EQ
Sbjct: 417 ARYLDMGDITEQ 428
>gi|195400935|ref|XP_002059071.1| GJ15190 [Drosophila virilis]
gi|194141723|gb|EDW58140.1| GJ15190 [Drosophila virilis]
Length = 591
Score = 555 bits (1429), Expect = e-155, Method: Compositional matrix adjust.
Identities = 262/372 (70%), Positives = 302/372 (81%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
R V K LGN EP + GPGE G A+ L + DAS EYGMN+ S+ IS
Sbjct: 61 REVPKLVEGLGNFEPKDLKPRNGPGENGDAHTLSPDKKNVADASEMEYGMNIACSDEISM 120
Query: 61 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
R++ D R+EECK+WDYP DLP SVI+VFHNEGFS LMRTVHS+I R+P L EIILV
Sbjct: 121 HRSVRDTRLEECKHWDYPYDLPPTSVIIVFHNEGFSVLMRTVHSVIDRSPKHMLHEIILV 180
Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
DDFS K +L KL+DY+ +F G VR+IRNTEREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDFSDKENLRTKLDDYVLQFKGLVRIIRNTEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
LNWLPPLLAPIY DR +MTVP+IDGID++++E+R VY D H+RGIFEWGMLYKENE+P
Sbjct: 241 NLNWLPPLLAPIYRDRTVMTVPIIDGIDHKSFEYRPVYGSDTHFRGIFEWGMLYKENEVP 300
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 301 RREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 360
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
SIEWVPCSR+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+ HK +FYTREPL
Sbjct: 361 SIEWVPCSRVGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEFFYTREPL 420
Query: 361 AMFLDMGDISEQ 372
A +LDMGDI+EQ
Sbjct: 421 ARYLDMGDITEQ 432
>gi|195481361|ref|XP_002101619.1| GE15519 [Drosophila yakuba]
gi|194189143|gb|EDX02727.1| GE15519 [Drosophila yakuba]
Length = 591
Score = 553 bits (1426), Expect = e-155, Method: Compositional matrix adjust.
Identities = 259/372 (69%), Positives = 303/372 (81%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
R V K LGN EP + GPGE G+A+ L + DAS EYGMN+ S+ IS
Sbjct: 61 REVPKLVDGLGNFEPKDVKPRSGPGENGEAHSLSPEKKHMSDASEMEYGMNIACSDEISM 120
Query: 61 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
R++ D R+EEC++WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P L EIILV
Sbjct: 121 HRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPTHMLHEIILV 180
Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
DDFS K +L +L++Y+Q+F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDFSDKENLRSQLDEYVQQFKGLVKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
NWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 301 RREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 360
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
SIEWVPCSR+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+ HK YFYTREPL
Sbjct: 361 SIEWVPCSRVGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEYFYTREPL 420
Query: 361 AMFLDMGDISEQ 372
A +LDMGDISEQ
Sbjct: 421 ARYLDMGDISEQ 432
>gi|194766810|ref|XP_001965517.1| GF22410 [Drosophila ananassae]
gi|190619508|gb|EDV35032.1| GF22410 [Drosophila ananassae]
Length = 591
Score = 552 bits (1423), Expect = e-155, Method: Compositional matrix adjust.
Identities = 256/363 (70%), Positives = 301/363 (82%)
Query: 10 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
LGN EP + GPGE G+A++L + + DAS EYGMN+ S+ IS RT+ D R+
Sbjct: 70 LGNFEPKDLKPRTGPGENGEAHNLSKDKKNKADASEMEYGMNIACSDEISMHRTVKDTRL 129
Query: 70 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
EEC++WDYP DLPK SVI+VFHNEGFS LMRTVHS+I R+P+ L EIILVDDFS K +L
Sbjct: 130 EECRHWDYPYDLPKTSVIIVFHNEGFSVLMRTVHSVIDRSPSHILHEIILVDDFSDKENL 189
Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
+L+ Y+++F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV LNWL PLL
Sbjct: 190 GNQLDKYVEQFKGLVKVIRNKEREGLIRTRSRGATEATGEVIVFLDAHCEVNLNWLAPLL 249
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
APIY DR +MTVP+IDGID++ +E+R VY + H+RGIFEWGMLYKENE+P RE ++R +
Sbjct: 250 APIYRDRTVMTVPIIDGIDHKNFEYRPVYGTETHFRGIFEWGMLYKENEVPRREQRRRSH 309
Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGGSIEWVPCSR
Sbjct: 310 NSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPCSR 369
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+ HK YFYTREPLA +LDMGDI
Sbjct: 370 VGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEYFYTREPLARYLDMGDI 429
Query: 370 SEQ 372
SEQ
Sbjct: 430 SEQ 432
>gi|125980684|ref|XP_001354365.1| GA19561 [Drosophila pseudoobscura pseudoobscura]
gi|54642673|gb|EAL31418.1| GA19561 [Drosophila pseudoobscura pseudoobscura]
Length = 591
Score = 551 bits (1420), Expect = e-154, Method: Compositional matrix adjust.
Identities = 260/372 (69%), Positives = 302/372 (81%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
R V K LGN EP + GPGE G+A+ L + D S EYGMN+ SN IS
Sbjct: 61 REVPKLVEGLGNFEPKDLKPRSGPGENGEAHTLSPDKKNVADDSEMEYGMNIACSNDISM 120
Query: 61 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
R++ D R+EECK+WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P L EIILV
Sbjct: 121 HRSVRDTRLEECKHWDYPYDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPKHMLHEIILV 180
Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
DD+S K DL L++Y ++FNG V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDYSDKEDLRSHLDEYSKQFNGLVKIIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
LNWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NLNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 301 RREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 360
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
SIEWVPCSR+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+ HK YFYTREPL
Sbjct: 361 SIEWVPCSRVGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEYFYTREPL 420
Query: 361 AMFLDMGDISEQ 372
A +LDMGDI+EQ
Sbjct: 421 ARYLDMGDITEQ 432
>gi|194892500|ref|XP_001977673.1| GG18114 [Drosophila erecta]
gi|190649322|gb|EDV46600.1| GG18114 [Drosophila erecta]
Length = 591
Score = 551 bits (1419), Expect = e-154, Method: Compositional matrix adjust.
Identities = 258/372 (69%), Positives = 302/372 (81%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
R V K LGN EP + GPGE G+A+ L + DAS EYGMN+ S+ IS
Sbjct: 61 REVPKLVDGLGNFEPKDVKPRSGPGENGEAHSLSPDKKHMSDASEMEYGMNIACSDEISM 120
Query: 61 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
R++ D R+EEC++WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P L EIILV
Sbjct: 121 HRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPTHMLHEIILV 180
Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
DDFS K +L +L++Y+ +F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDFSDKENLRSQLDEYVLQFKGLVKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
NWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 301 RREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 360
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
SIEWVPCSR+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+ HK YFYTREPL
Sbjct: 361 SIEWVPCSRVGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEYFYTREPL 420
Query: 361 AMFLDMGDISEQ 372
A +LDMGDISEQ
Sbjct: 421 ARYLDMGDISEQ 432
>gi|24643052|ref|NP_573301.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2, isoform A
[Drosophila melanogaster]
gi|24643054|ref|NP_728178.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2, isoform B
[Drosophila melanogaster]
gi|51316019|sp|Q8MV48.2|GALT7_DROME RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 7;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7; Short=pp-GaNTase 7;
AltName: Full=dGalNAc-T2
gi|7293476|gb|AAF48851.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2, isoform A
[Drosophila melanogaster]
gi|22832507|gb|AAN09470.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2, isoform B
[Drosophila melanogaster]
gi|34043004|gb|AAQ56704.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
gi|54650858|gb|AAV37008.1| LD01328p [Drosophila melanogaster]
gi|220950352|gb|ACL87719.1| GalNAc-T2-PA [synthetic construct]
Length = 591
Score = 551 bits (1419), Expect = e-154, Method: Compositional matrix adjust.
Identities = 258/372 (69%), Positives = 302/372 (81%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
R V K LGN EP + GPGE G+A+ L + DAS EYGMN+ S+ IS
Sbjct: 61 REVPKLVDGLGNFEPKDVKPRSGPGENGEAHSLSPDKKHMSDASEMEYGMNIACSDEISM 120
Query: 61 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
R++ D R+EEC++WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P L EIILV
Sbjct: 121 HRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPTHMLHEIILV 180
Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
DDFS K +L +L++Y+ +F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDFSDKENLRSQLDEYVLQFKGLVKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
NWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 301 RREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 360
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
SIEWVPCSR+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+ HK YFYTREPL
Sbjct: 361 SIEWVPCSRVGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEYFYTREPL 420
Query: 361 AMFLDMGDISEQ 372
A +LDMGDISEQ
Sbjct: 421 ARYLDMGDISEQ 432
>gi|195345467|ref|XP_002039290.1| GM22807 [Drosophila sechellia]
gi|194134516|gb|EDW56032.1| GM22807 [Drosophila sechellia]
Length = 591
Score = 550 bits (1416), Expect = e-154, Method: Compositional matrix adjust.
Identities = 257/372 (69%), Positives = 302/372 (81%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
R V K LGN EP + GPGE G+A+ L + DAS EYGMN+ S+ IS
Sbjct: 61 REVPKLVDGLGNFEPKDVKPRSGPGENGEAHSLSPDKKHMSDASEMEYGMNIACSDEISM 120
Query: 61 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
R++ D R+EEC++WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P L EIILV
Sbjct: 121 HRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPTHMLHEIILV 180
Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
DDFS K +L +L++Y+ +F G V++IRN +REGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDFSDKENLRSQLDEYVLQFKGLVKVIRNKQREGLIRTRSRGAMEATGEVIVFLDAHCEV 240
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
NWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 301 RREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 360
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
SIEWVPCSR+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+ HK YFYTREPL
Sbjct: 361 SIEWVPCSRVGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEYFYTREPL 420
Query: 361 AMFLDMGDISEQ 372
A +LDMGDISEQ
Sbjct: 421 ARYLDMGDISEQ 432
>gi|321473823|gb|EFX84789.1| hypothetical protein DAPPUDRAFT_209135 [Daphnia pulex]
Length = 521
Score = 548 bits (1413), Expect = e-153, Method: Compositional matrix adjust.
Identities = 251/363 (69%), Positives = 304/363 (83%)
Query: 10 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
+GN EPP+E + GPGEGGK + L R S+ E+GMNM S+ IS RTI D R
Sbjct: 1 MGNFEPPIEAPRSGPGEGGKPHTLLPDQRNEASQSISEFGMNMVVSDEISLSRTISDTRT 60
Query: 70 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
EC++W YP DLPKASV++VFHNEG+S+L+RTV S+I R+P Q+LEE++LVDDFS KA L
Sbjct: 61 PECQHWSYPEDLPKASVVIVFHNEGWSTLLRTVQSVIDRSPPQFLEEVLLVDDFSEKAHL 120
Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
+KLED+I+R++GKVRLIRN EREGLIRTR+RGA+E+RGEV++FLDAHCEVGLNWLPPLL
Sbjct: 121 KRKLEDFIERYDGKVRLIRNKEREGLIRTRTRGAEEARGEVVLFLDAHCEVGLNWLPPLL 180
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
PIY DR MTVP+IDGID++ +E+R VY+ + ++RG+FEWGMLYKENE+PEREA+ R Y
Sbjct: 181 YPIYLDRTTMTVPLIDGIDHENFEYRPVYQGETNFRGVFEWGMLYKENEVPEREAQSRTY 240
Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
NSEPYK+PTHAGGLFA++RA+FLE+G YDPGLLVWGGENFELSFKIW CGG I WVPCSR
Sbjct: 241 NSEPYKAPTHAGGLFAINRAYFLEIGAYDPGLLVWGGENFELSFKIWQCGGKILWVPCSR 300
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHVYR FMPY FGKLA KG LIT NYKRVIE WFD+K+K +FYTREP A FLDMG+I
Sbjct: 301 VGHVYRGFMPYTFGKLAANKKGSLITINYKRVIEVWFDDKYKEFFYTREPTARFLDMGNI 360
Query: 370 SEQ 372
++Q
Sbjct: 361 TQQ 363
>gi|21552985|gb|AAM62412.1|AF493067_1 UDP-N-acetylgalactosamine: polypeptide
N-acetylgalactosaminyltransferase 2 [Drosophila
melanogaster]
Length = 591
Score = 548 bits (1412), Expect = e-153, Method: Compositional matrix adjust.
Identities = 257/372 (69%), Positives = 301/372 (80%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
R V K LGN EP + G GE G+A+ L + DAS EYGMN+ S+ IS
Sbjct: 61 REVPKLVDGLGNFEPKDVKPRSGSGENGEAHSLSPDKKHMSDASEMEYGMNIACSDEISM 120
Query: 61 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
R++ D R+EEC++WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P L EIILV
Sbjct: 121 HRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPTHMLHEIILV 180
Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
DDFS K +L +L++Y+ +F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDFSDKENLRSQLDEYVLQFKGLVKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
NWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 301 RREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 360
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
SIEWVPCSR+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+ HK YFYTREPL
Sbjct: 361 SIEWVPCSRVGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEYFYTREPL 420
Query: 361 AMFLDMGDISEQ 372
A +LDMGDISEQ
Sbjct: 421 ARYLDMGDISEQ 432
>gi|383860243|ref|XP_003705600.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Megachile
rotundata]
Length = 581
Score = 544 bits (1401), Expect = e-152, Method: Compositional matrix adjust.
Identities = 260/371 (70%), Positives = 301/371 (81%), Gaps = 2/371 (0%)
Query: 2 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
PV D LGN E P + GPGEGGK + L + + S EYGMNM S+ IS D
Sbjct: 54 PVLVKD--LGNFELQHVPIRTGPGEGGKPHILRDDQQNDVQQSESEYGMNMVCSDEISLD 111
Query: 62 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
R++PD RM ECK+W+YP LP+ SVI+VFHNEG+S LMRTVHS+I RTP Q+LEEI+LVD
Sbjct: 112 RSVPDTRMTECKHWNYPEVLPRTSVIIVFHNEGWSVLMRTVHSVINRTPPQFLEEILLVD 171
Query: 122 DFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 181
D+S K +L +LE YI+++ GKV+LIRN EREGLIRTRSRGA+E++GEVIVFLDAHCEV
Sbjct: 172 DYSDKDNLKGELESYIEQWEGKVKLIRNYEREGLIRTRSRGAREAKGEVIVFLDAHCEVN 231
Query: 182 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 241
+NWLPPLLAPI DR +MTVP+IDGID++T+E+R VY+ H YRGIFEWGMLYKENELP
Sbjct: 232 VNWLPPLLAPIAVDRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPA 291
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
RE K R YNS PYKSPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGS
Sbjct: 292 REQKTRPYNSMPYKSPTHAGGLFAINREYFLSLGGYDEGLLVWGGENFELSFKIWQCGGS 351
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
I WVPCS +GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+K+K +FYTREPLA
Sbjct: 352 ILWVPCSHVGHVYRGFMPYNFGKLAQKKKGPLITINYKRVIETWFDDKYKEFFYTREPLA 411
Query: 362 MFLDMGDISEQ 372
LD GDISEQ
Sbjct: 412 QLLDHGDISEQ 422
>gi|156537099|ref|XP_001602659.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Nasonia
vitripennis]
Length = 583
Score = 540 bits (1390), Expect = e-151, Method: Compositional matrix adjust.
Identities = 254/363 (69%), Positives = 302/363 (83%), Gaps = 1/363 (0%)
Query: 10 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
LGN EP ++ ++ GPGE GK + L + + S YGMN+ S+ IS DR++PD R
Sbjct: 63 LGNFEPEIQ-HRTGPGEEGKPHILRDDQQNDVQESETAYGMNIVCSDEISLDRSVPDTRP 121
Query: 70 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
+ECK+W+Y +LPK SVI+VFHNEG+S LMRTVHS++ RTP QYLEEI+LVDDFS K +L
Sbjct: 122 DECKHWNYSKNLPKTSVIIVFHNEGWSVLMRTVHSVLNRTPPQYLEEILLVDDFSDKENL 181
Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
+LE YI+++ KVRL+RN EREGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPPLL
Sbjct: 182 KGELESYIEQWGPKVRLLRNKEREGLIRTRSRGAREAKGEVIVFLDAHCEVNVNWLPPLL 241
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
+PI D K+MTVP+IDGID++T+E+R VY+ H YRGIFEWGMLYKENELP+REAK RK+
Sbjct: 242 SPIAEDNKVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPQREAKTRKH 301
Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
NSEPY+SPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGSI WVPCS
Sbjct: 302 NSEPYRSPTHAGGLFAINREYFLSLGGYDEGLLVWGGENFELSFKIWQCGGSILWVPCSH 361
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFDEKHK +FYTREPLA LD GDI
Sbjct: 362 VGHVYRGFMPYNFGKLAQKKKGPLITINYKRVIETWFDEKHKEFFYTREPLARLLDHGDI 421
Query: 370 SEQ 372
+EQ
Sbjct: 422 TEQ 424
>gi|307212076|gb|EFN87959.1| N-acetylgalactosaminyltransferase 7 [Harpegnathos saltator]
Length = 563
Score = 539 bits (1389), Expect = e-151, Method: Compositional matrix adjust.
Identities = 252/363 (69%), Positives = 299/363 (82%)
Query: 10 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
LGN EP +K GPGEGGK + L E + S +YGMNM S+ IS R+IPD R
Sbjct: 42 LGNFEPRDTSFKAGPGEGGKPHILREDQQNDVQQSESDYGMNMVCSDEISMSRSIPDTRP 101
Query: 70 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
ECK+W+YP +LP+ SVI+VFHNEG+S L+RT+HS+I RTP+++LEE++LVDDFS K +L
Sbjct: 102 AECKHWNYPEELPRTSVIIVFHNEGWSVLLRTIHSVINRTPSKFLEEVLLVDDFSDKENL 161
Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
L+ YI+++ GKVRL+RN ER+GLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPPLL
Sbjct: 162 KDDLDSYIEQWGGKVRLLRNYERQGLIRTRSRGAREAKGEVIVFLDAHCEVNVNWLPPLL 221
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
API +R +MTVPVIDGID++T+E+R VY+ H YRGIFEWGMLYKENELP REAK R +
Sbjct: 222 APIAENRNVMTVPVIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPRREAKTRSH 281
Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
+S PYKSPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGG+I WVPCS
Sbjct: 282 DSMPYKSPTHAGGLFAINRQYFLSLGGYDEGLLVWGGENFELSFKIWQCGGTILWVPCSH 341
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHVYR FMPY FGKLA + KGPLIT NYKRVIETWFDEKHK +FYTREPLA FLD GDI
Sbjct: 342 VGHVYRGFMPYTFGKLAQKKKGPLITINYKRVIETWFDEKHKEFFYTREPLARFLDHGDI 401
Query: 370 SEQ 372
SEQ
Sbjct: 402 SEQ 404
>gi|242016390|ref|XP_002428804.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212513501|gb|EEB16066.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 579
Score = 538 bits (1385), Expect = e-150, Method: Compositional matrix adjust.
Identities = 256/364 (70%), Positives = 300/364 (82%), Gaps = 3/364 (0%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
GN EP E +GPGEGG+ + L E + SL +YGMN+ S+ IS DR+IPD R+
Sbjct: 58 GNFEPREEEISDGPGEGGRPHKLREDQQNDASQSLADYGMNIACSDEISLDRSIPDTRLP 117
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
ECK W YP DLPKASVI+VFHNEG+S+L+RTVHS+I RTP Q+LEE+++VDDFS K +L
Sbjct: 118 ECKRWMYPEDLPKASVIIVFHNEGWSTLLRTVHSVINRTPPQFLEEVLMVDDFSDKENL- 176
Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
++L+DYI RFNGKVRLIRN+ER+GLIRTRSRGA E+RGEVIVFLDAHCEV NWLPPLLA
Sbjct: 177 KELDDYILRFNGKVRLIRNSERQGLIRTRSRGAVEARGEVIVFLDAHCEVNKNWLPPLLA 236
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER--EAKKRK 248
PIY DR +TVPVIDGID+ T+E++ VY HHYRGIFEWGMLYKE EL ++ A RK
Sbjct: 237 PIYYDRTTLTVPVIDGIDHDTFEYKPVYVDGHHYRGIFEWGMLYKEIELTDQFANADNRK 296
Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
YNSEPY+SPTHAGGLFA+DR +FL++G YD GLLVWGGENFELSFK+W CGG I WVPCS
Sbjct: 297 YNSEPYRSPTHAGGLFAIDRNYFLDIGAYDDGLLVWGGENFELSFKVWQCGGRILWVPCS 356
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
R+GHVYRSFMPY FG LA KGPLIT NYKRVIETWFDEK+K +FYTREPLA +L+MGD
Sbjct: 357 RVGHVYRSFMPYTFGSLAKNKKGPLITINYKRVIETWFDEKYKEFFYTREPLARYLNMGD 416
Query: 369 ISEQ 372
IS+Q
Sbjct: 417 ISKQ 420
>gi|307169192|gb|EFN62008.1| N-acetylgalactosaminyltransferase 7 [Camponotus floridanus]
Length = 580
Score = 535 bits (1377), Expect = e-149, Method: Compositional matrix adjust.
Identities = 255/371 (68%), Positives = 302/371 (81%), Gaps = 2/371 (0%)
Query: 2 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
PVF +G LGN EP P + GPGE GK + L + S +YGMNM S+ IS
Sbjct: 53 PVF-VEG-LGNYEPRDVPVRSGPGENGKPHILRDDQLNDVQQSESDYGMNMVCSDEISLS 110
Query: 62 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
R+IPD R ECK+W+YP +LP+ SVI+VFHNEG+S L+RT+HS+I RTP+++LEEI+LVD
Sbjct: 111 RSIPDTRPAECKHWNYPEELPRTSVIIVFHNEGWSVLLRTIHSVINRTPSKFLEEILLVD 170
Query: 122 DFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 181
DFS K +L L+ YI+++NGKVRL+RN ER+GLIRTRSRGA++++GEVIVFLDAHCEV
Sbjct: 171 DFSDKENLKGDLDSYIEQWNGKVRLLRNYERQGLIRTRSRGARDAKGEVIVFLDAHCEVN 230
Query: 182 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 241
+NWLPPLLAPI +R +MTVPVIDGID++T+E+R VY+ H YRGIFEWGMLYKENELP
Sbjct: 231 VNWLPPLLAPIAENRNVMTVPVIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPR 290
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
REAK R Y+S PY+SPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGS
Sbjct: 291 REAKTRAYDSMPYRSPTHAGGLFAINRQYFLSLGGYDEGLLVWGGENFELSFKIWQCGGS 350
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
I WVPCS +GHVYR FMPY FGKLA + KGPLIT NYKRVIETWFDEKHK +FYTREPLA
Sbjct: 351 ILWVPCSHVGHVYRGFMPYTFGKLAQKKKGPLITINYKRVIETWFDEKHKEFFYTREPLA 410
Query: 362 MFLDMGDISEQ 372
LD GDISEQ
Sbjct: 411 RLLDHGDISEQ 421
>gi|340718182|ref|XP_003397550.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Bombus
terrestris]
Length = 581
Score = 534 bits (1376), Expect = e-149, Method: Compositional matrix adjust.
Identities = 254/364 (69%), Positives = 295/364 (81%)
Query: 9 KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
+LGN E + GPGEGGK Y L + + S +YGMNM S+ IS DR+I D R
Sbjct: 59 ELGNFELKHVSIRSGPGEGGKPYILRDDQQNDVQQSEIDYGMNMVCSDEISLDRSILDTR 118
Query: 69 MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
M ECK+W+YP LP+ SVI+VFHNEG+S L+RTVHS+I RTP Q+LEEI+LVDDFS K +
Sbjct: 119 MPECKHWNYPEVLPRTSVIIVFHNEGWSVLLRTVHSVINRTPPQFLEEILLVDDFSDKDN 178
Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
L L+ YI+R+ GKV+LIRN +REGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPPL
Sbjct: 179 LKGDLDSYIERWEGKVKLIRNDKREGLIRTRSRGAREAKGEVIVFLDAHCEVNVNWLPPL 238
Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
LAPI DR +MTVP+IDGID++T+E+R VY+ H YRGIFEWGMLYKENELP RE K R
Sbjct: 239 LAPIAVDRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPAREKKSRP 298
Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
YNS PYKSPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGSI WVPCS
Sbjct: 299 YNSMPYKSPTHAGGLFAINREYFLSLGGYDDGLLVWGGENFELSFKIWQCGGSILWVPCS 358
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
+GHVYR FMPY FGKLA + KGPLIT NYKRV+ETWFD+KHK +FYTREPLA LD GD
Sbjct: 359 HVGHVYRGFMPYTFGKLAQKKKGPLITINYKRVVETWFDDKHKEFFYTREPLAQLLDHGD 418
Query: 369 ISEQ 372
ISEQ
Sbjct: 419 ISEQ 422
>gi|443298648|gb|AGC81884.1| N-acetylgalactosaminyltransferase, partial [Bombyx mori]
Length = 499
Score = 533 bits (1372), Expect = e-149, Method: Compositional matrix adjust.
Identities = 244/326 (74%), Positives = 283/326 (86%)
Query: 47 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
+YGMN+ SN I+ +R+IPD R++ECKYW YP DL K SVI+VFHNEGFS LMRTVHS+I
Sbjct: 15 KYGMNIAASNDIAMNRSIPDTRLDECKYWHYPEDLAKTSVIIVFHNEGFSVLMRTVHSVI 74
Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
RTPAQ+L E++LVDDFS K DL + L++YI+R+NGKVRL+RN +REGLIRTRSRGA+E+
Sbjct: 75 NRTPAQFLHEVVLVDDFSDKDDLKENLDNYIKRWNGKVRLVRNVQREGLIRTRSRGAQEA 134
Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
G+VIVFLDAHCEV +NWLPPLLAPIY D + MTVPVIDGIDY T+E+R VY+ +YRG
Sbjct: 135 TGDVIVFLDAHCEVNVNWLPPLLAPIYRDYRTMTVPVIDGIDYNTFEYRPVYQHGTNYRG 194
Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
IFEWGMLYKENE+P+REA K+ SEPYKSPTHAGGLFA++R +FLE+G YDPGLLVWGG
Sbjct: 195 IFEWGMLYKENEVPDREAHLHKHKSEPYKSPTHAGGLFAINRRYFLEIGAYDPGLLVWGG 254
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
ENFELSFKIW CGGSIEWVPCSR+GHVYR+FMPY FG LA KG LIT NYKRVIETWF
Sbjct: 255 ENFELSFKIWQCGGSIEWVPCSRVGHVYRAFMPYTFGNLAKNRKGSLITINYKRVIETWF 314
Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
DE+HK YFYTREP+A FLDMGDISEQ
Sbjct: 315 DEEHKEYFYTREPMARFLDMGDISEQ 340
>gi|380013105|ref|XP_003690610.1| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase
7-like [Apis florea]
Length = 581
Score = 533 bits (1372), Expect = e-149, Method: Compositional matrix adjust.
Identities = 254/364 (69%), Positives = 296/364 (81%)
Query: 9 KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
+LGN EP + GPGE GK + L + + S +YGMNM S+ IS DR IPD R
Sbjct: 59 ELGNFEPKRISMRNGPGEKGKPHILRDDQQNDVQQSEIDYGMNMVCSDEISLDRLIPDTR 118
Query: 69 MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
M ECK+W+YP LP+ SVI+VFHNEG+S LMRTVHS+I RTP Q+LEEI+LVDDFS K +
Sbjct: 119 MPECKHWNYPEMLPRTSVIIVFHNEGWSVLMRTVHSVINRTPPQFLEEILLVDDFSDKDN 178
Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
L +LE YI+R+ KV+LIRN +REGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPPL
Sbjct: 179 LKGELESYIERWGDKVKLIRNDKREGLIRTRSRGAREAKGEVIVFLDAHCEVNINWLPPL 238
Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
LAPI +DR +MTVP+IDGID++T+E+R VY+ H YRGIFEWGMLYKENELP RE K R
Sbjct: 239 LAPIAADRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPAREKKIRP 298
Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
YNS PYKSPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGSI WVPCS
Sbjct: 299 YNSMPYKSPTHAGGLFAINREYFLSLGGYDDGLLVWGGENFELSFKIWQCGGSILWVPCS 358
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
+GHVYR FMPY FGKLA + KGPLIT NYKRV+ETWFD+K+K +FYTREPLA LD GD
Sbjct: 359 HVGHVYRGFMPYTFGKLAXKKKGPLITINYKRVVETWFDDKYKEFFYTREPLAQLLDHGD 418
Query: 369 ISEQ 372
ISEQ
Sbjct: 419 ISEQ 422
>gi|427797631|gb|JAA64267.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Rhipicephalus pulchellus]
Length = 641
Score = 532 bits (1371), Expect = e-149, Method: Compositional matrix adjust.
Identities = 250/373 (67%), Positives = 303/373 (81%), Gaps = 3/373 (0%)
Query: 2 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
PVF+ D LGN EP ++GPGEGG AYH+PE R + S +YGMN+ S+HIS +
Sbjct: 112 PVFRKDRTLGNFEPKSHETRKGPGEGGVAYHVPERDRNSAADSNMQYGMNVVASDHISPN 171
Query: 62 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
R++PD+R+EECKYWDYP DLP SV++VFHNEG S LMRTVHS+I R+P Q+L+E++LVD
Sbjct: 172 RSVPDMRLEECKYWDYPEDLPTTSVVVVFHNEGLSVLMRTVHSVINRSPRQFLKEVVLVD 231
Query: 122 DFSSKADLDQKLEDYIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
DFS K +L +LE YI G VRL+RN+EREGLIR+RS GA++S G+V++FLDAHCE
Sbjct: 232 DFSDKENLKGELETYIAHNFPRGLVRLLRNSEREGLIRSRSYGAEQSHGDVVLFLDAHCE 291
Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 239
VG+NWLPPLLAPI ++R+ MTVPVIDGID T+E+R VY H+RGIFEWGMLYKE E+
Sbjct: 292 VGINWLPPLLAPIRANRRAMTVPVIDGIDKDTFEYRPVYHGRQHFRGIFEWGMLYKEIEI 351
Query: 240 PEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCG 299
P+ E K+RKY+SEPYKSPTHAGGLFA++R +FLELGGYDPGLLVWGGENFELSFKIW CG
Sbjct: 352 PDEEIKRRKYHSEPYKSPTHAGGLFAINRKYFLELGGYDPGLLVWGGENFELSFKIWQCG 411
Query: 300 GSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
G I WVPCSR+GHVYR FMPY+FGKLA + KGPLIT NYKRV+E W DE +K YFYTREP
Sbjct: 412 GMIYWVPCSRVGHVYRGFMPYSFGKLAQKRKGPLITVNYKRVVEVWMDE-YKEYFYTREP 470
Query: 360 LAMFLDMGDISEQ 372
LA + D GD+ +Q
Sbjct: 471 LATYYDAGDLKQQ 483
>gi|427797629|gb|JAA64266.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Rhipicephalus pulchellus]
Length = 641
Score = 532 bits (1371), Expect = e-149, Method: Compositional matrix adjust.
Identities = 250/373 (67%), Positives = 303/373 (81%), Gaps = 3/373 (0%)
Query: 2 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
PVF+ D LGN EP ++GPGEGG AYH+PE R + S +YGMN+ S+HIS +
Sbjct: 112 PVFRKDRTLGNFEPKSHETRKGPGEGGVAYHVPERDRNSAADSNMQYGMNVVASDHISPN 171
Query: 62 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
R++PD+R+EECKYWDYP DLP SV++VFHNEG S LMRTVHS+I R+P Q+L+E++LVD
Sbjct: 172 RSVPDMRLEECKYWDYPEDLPTTSVVVVFHNEGLSVLMRTVHSVINRSPRQFLKEVVLVD 231
Query: 122 DFSSKADLDQKLEDYIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
DFS K +L +LE YI G VRL+RN+EREGLIR+RS GA++S G+V++FLDAHCE
Sbjct: 232 DFSDKENLKGELETYIAHNFPRGLVRLLRNSEREGLIRSRSYGAEQSHGDVVLFLDAHCE 291
Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 239
VG+NWLPPLLAPI ++R+ MTVPVIDGID T+E+R VY H+RGIFEWGMLYKE E+
Sbjct: 292 VGINWLPPLLAPIRANRRAMTVPVIDGIDKDTFEYRPVYHGRQHFRGIFEWGMLYKEIEI 351
Query: 240 PEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCG 299
P+ E K+RKY+SEPYKSPTHAGGLFA++R +FLELGGYDPGLLVWGGENFELSFKIW CG
Sbjct: 352 PDEEIKRRKYHSEPYKSPTHAGGLFAINRKYFLELGGYDPGLLVWGGENFELSFKIWQCG 411
Query: 300 GSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
G I WVPCSR+GHVYR FMPY+FGKLA + KGPLIT NYKRV+E W DE +K YFYTREP
Sbjct: 412 GMIYWVPCSRVGHVYRGFMPYSFGKLAQKRKGPLITVNYKRVVEVWMDE-YKEYFYTREP 470
Query: 360 LAMFLDMGDISEQ 372
LA + D GD+ +Q
Sbjct: 471 LATYYDAGDLKQQ 483
>gi|328781461|ref|XP_395266.4| PREDICTED: n-acetylgalactosaminyltransferase 7 [Apis mellifera]
Length = 581
Score = 531 bits (1369), Expect = e-148, Method: Compositional matrix adjust.
Identities = 252/364 (69%), Positives = 296/364 (81%)
Query: 9 KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
+LGN EP + GPGE GK + L + + S +YGMN+ S+ IS DR IPD R
Sbjct: 59 ELGNFEPKHISMRNGPGEKGKPHILRDDQQNDVQQSEIDYGMNIVCSDEISLDRLIPDTR 118
Query: 69 MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
M ECK+W+YP LP+ SVI+VFHNEG+S LMRTVHS+I RTP Q+LEEI+LVDDFS K +
Sbjct: 119 MPECKHWNYPEILPRTSVIIVFHNEGWSVLMRTVHSVINRTPPQFLEEILLVDDFSDKDN 178
Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
L +LE YI+++ KV+LIRN +REGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPPL
Sbjct: 179 LKGELESYIEQWGDKVKLIRNDKREGLIRTRSRGAREAKGEVIVFLDAHCEVNINWLPPL 238
Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
LAPI +DR +MTVP+IDGID++T+E+R VY+ H YRGIFEWGMLYKENELP RE K R
Sbjct: 239 LAPIAADRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPAREKKSRS 298
Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
YNS PYKSPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGSI WVPCS
Sbjct: 299 YNSMPYKSPTHAGGLFAINREYFLSLGGYDDGLLVWGGENFELSFKIWQCGGSILWVPCS 358
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
+GHVYR FMPY FGKLA + KGPLIT NYKRV+ETWFD+K+K +FYTREPLA LD GD
Sbjct: 359 HVGHVYRGFMPYTFGKLAQKKKGPLITINYKRVVETWFDDKYKEFFYTREPLAQLLDHGD 418
Query: 369 ISEQ 372
ISEQ
Sbjct: 419 ISEQ 422
>gi|332023194|gb|EGI63450.1| N-acetylgalactosaminyltransferase 7 [Acromyrmex echinatior]
Length = 614
Score = 530 bits (1366), Expect = e-148, Method: Compositional matrix adjust.
Identities = 249/363 (68%), Positives = 294/363 (80%)
Query: 10 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
LGN E P + GPGEGGK + L + S +YGMNM S+ IS R+IPD R+
Sbjct: 93 LGNYELRDVPVRSGPGEGGKPHILKDDQLNDVQQSESDYGMNMVCSDEISLSRSIPDTRL 152
Query: 70 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
+CK+W+YP +LP+ SVI+VFHNEG+S L+RT+ S+I RTP++ LEEI+LVDDFS K +L
Sbjct: 153 AQCKHWNYPEELPRTSVIIVFHNEGWSVLLRTIQSVIDRTPSKLLEEILLVDDFSDKENL 212
Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
L+ YI+++ GKVRL+RN ER+GLIRTRSRGA+E+RGEVIVFLDAHCEV +NWLPPLL
Sbjct: 213 KSDLDSYIEQWGGKVRLLRNHERQGLIRTRSRGAREARGEVIVFLDAHCEVNVNWLPPLL 272
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
API +R +MTVPVIDGID++T+E+R VY+ H YRGIFEWGMLYKENELP REAK R +
Sbjct: 273 APIAENRNVMTVPVIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPRREAKTRAH 332
Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
+S PY+SPTHAGGLFA+ R +FL LGGYD GLLVWGGENFELSFKIW CGGSI WVPCS
Sbjct: 333 DSMPYRSPTHAGGLFAISRQYFLSLGGYDEGLLVWGGENFELSFKIWQCGGSILWVPCSH 392
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHVYR FMPY FGKLA + KGPLIT NYKRVIETWFDEKHK +FYTREPLA LD GDI
Sbjct: 393 VGHVYRGFMPYTFGKLAQKKKGPLITINYKRVIETWFDEKHKEFFYTREPLARLLDHGDI 452
Query: 370 SEQ 372
SEQ
Sbjct: 453 SEQ 455
>gi|350400167|ref|XP_003485756.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Bombus
impatiens]
Length = 582
Score = 530 bits (1364), Expect = e-148, Method: Compositional matrix adjust.
Identities = 252/364 (69%), Positives = 295/364 (81%)
Query: 9 KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
+LGN E + GPGEGGK Y L + + S +YGMNM S+ IS DR+I D R
Sbjct: 60 ELGNFELKHVSIRSGPGEGGKPYILRDDQQNDVQQSEIDYGMNMVCSDEISLDRSILDTR 119
Query: 69 MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
M ECK+W+YP LP+ SVI+VFHNEG+S L+RTVHS+I RTP Q+LEEI+LVDDFS K +
Sbjct: 120 MPECKHWNYPEVLPRTSVIIVFHNEGWSVLLRTVHSVINRTPPQFLEEILLVDDFSDKDN 179
Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
L L+ YI+R+ GKV+LIRN +REGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPPL
Sbjct: 180 LKGDLDSYIERWEGKVKLIRNDKREGLIRTRSRGAREAKGEVIVFLDAHCEVNVNWLPPL 239
Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
LAPI DR +MTVP+IDGID++T+E+R VY+ H YRGIFEWGMLYKENELP RE K R
Sbjct: 240 LAPIAVDRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPAREKKSRP 299
Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
YNS PYKSPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGG+I WVPCS
Sbjct: 300 YNSMPYKSPTHAGGLFAINREYFLSLGGYDDGLLVWGGENFELSFKIWQCGGNILWVPCS 359
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
+GHVYR FMPY FGKLA + KGPLIT NYKRV+ETWFD+K+K +FYTREPLA LD GD
Sbjct: 360 HVGHVYRGFMPYTFGKLAQKKKGPLITINYKRVVETWFDDKYKEFFYTREPLAQLLDHGD 419
Query: 369 ISEQ 372
ISEQ
Sbjct: 420 ISEQ 423
>gi|322798640|gb|EFZ20244.1| hypothetical protein SINV_10970 [Solenopsis invicta]
Length = 580
Score = 530 bits (1364), Expect = e-148, Method: Compositional matrix adjust.
Identities = 254/371 (68%), Positives = 300/371 (80%), Gaps = 2/371 (0%)
Query: 2 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
PVF +G LGN EP P + GPGEGGK + L + S +YGMNM S+ IS
Sbjct: 53 PVF-VEG-LGNYEPRDIPVRSGPGEGGKPHILRDDQLNDVQQSESDYGMNMVCSDEISLS 110
Query: 62 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
R IPD R ECK+W+YP +LP+ SVI+VFHNEG+S L+RT+ S+I RTP+++LEEI+LVD
Sbjct: 111 RAIPDTRPAECKHWNYPEELPRTSVIIVFHNEGWSVLLRTIQSVIDRTPSKFLEEILLVD 170
Query: 122 DFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 181
DFS K +L L+ YI+++ GKVRL+RN ER+GLIRTRSRGA+E++GEVIVFLDAHCEV
Sbjct: 171 DFSDKENLKGDLDSYIEQWEGKVRLLRNYERQGLIRTRSRGAREAKGEVIVFLDAHCEVN 230
Query: 182 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 241
+NWLPPLLAPI +R +MTVPVIDGID++T+E+R VY+ H YRGIFEWGMLYKENELP
Sbjct: 231 VNWLPPLLAPIAENRNVMTVPVIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPR 290
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
REAK R ++S PY+SPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGS
Sbjct: 291 REAKTRAHDSMPYRSPTHAGGLFAINRQYFLSLGGYDEGLLVWGGENFELSFKIWQCGGS 350
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
I WVPCS +GHVYR FMPY FGKLA + KGPLIT NYKRVIETWFDEKHK +FYTREPLA
Sbjct: 351 ILWVPCSHVGHVYRGFMPYTFGKLAQKKKGPLITINYKRVIETWFDEKHKEFFYTREPLA 410
Query: 362 MFLDMGDISEQ 372
LD GDISEQ
Sbjct: 411 RLLDHGDISEQ 421
>gi|16198165|gb|AAL13889.1| LD36616p [Drosophila melanogaster]
Length = 486
Score = 526 bits (1355), Expect = e-147, Method: Compositional matrix adjust.
Identities = 240/326 (73%), Positives = 280/326 (85%)
Query: 47 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
EYGMN+ S+ IS R++ D R+EEC++WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I
Sbjct: 2 EYGMNIACSDEISMHRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVI 61
Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
R+P L EIILVDDFS K +L +L++Y+ +F G V++IRN EREGLIRTRSRGA E+
Sbjct: 62 DRSPTHMLHEIILVDDFSDKENLRSQLDEYVLQFKGLVKVIRNKEREGLIRTRSRGAMEA 121
Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
GEVIVFLDAHCEV NWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RG
Sbjct: 122 TGEVIVFLDAHCEVNTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRG 181
Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
IFEWGMLYKENE+P RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGG
Sbjct: 182 IFEWGMLYKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGG 241
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
ENFELSFKIW CGGSIEWVPCSR+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWF
Sbjct: 242 ENFELSFKIWQCGGSIEWVPCSRVGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWF 301
Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
D+ HK YFYTREPLA +LDMGDISEQ
Sbjct: 302 DDTHKEYFYTREPLARYLDMGDISEQ 327
>gi|357602062|gb|EHJ63261.1| putative n-acetylgalactosaminyltransferase [Danaus plexippus]
Length = 499
Score = 521 bits (1342), Expect = e-145, Method: Compositional matrix adjust.
Identities = 239/329 (72%), Positives = 283/329 (86%)
Query: 44 SLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVH 103
S EYGMN+ SN I+ +R+IPD R++ECKYW YP +LP SVI+VFHNEGFS LMRTVH
Sbjct: 12 SESEYGMNIAASNDIAMNRSIPDTRLDECKYWHYPEELPSTSVIIVFHNEGFSVLMRTVH 71
Query: 104 SIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGA 163
++I R+P L+E+++VDDFS K DL + L++Y++R+ GKVR+IRN+ER+GLIRTRSRGA
Sbjct: 72 TVIDRSPPNILKEVVMVDDFSDKDDLKENLDNYVKRWKGKVRIIRNSERQGLIRTRSRGA 131
Query: 164 KESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHH 223
E+ GEVIVFLDAHCEV +NWLPPLLAPIY D KIMTVPVIDGID++T+E+R VY +
Sbjct: 132 MEATGEVIVFLDAHCEVNVNWLPPLLAPIYRDYKIMTVPVIDGIDHKTFEYRPVYSHGIN 191
Query: 224 YRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLV 283
YRGIFEWGMLYKENE+P+REA K+ SEPYKSPTHAGGLFA++R +FLE+G YDPGLLV
Sbjct: 192 YRGIFEWGMLYKENEVPDREASLHKHKSEPYKSPTHAGGLFAINRNYFLEIGAYDPGLLV 251
Query: 284 WGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIE 343
WGGENFELSFKIW CGGSIEWVPCSR+GHVYR+FMPY+FG LA KG LIT NYKRVIE
Sbjct: 252 WGGENFELSFKIWQCGGSIEWVPCSRVGHVYRAFMPYSFGNLAKNRKGSLITINYKRVIE 311
Query: 344 TWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
TWFDE+HK +FYTREP+A FLDMGDISEQ
Sbjct: 312 TWFDEEHKEFFYTREPMARFLDMGDISEQ 340
>gi|391336074|ref|XP_003742408.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Metaseiulus
occidentalis]
Length = 593
Score = 498 bits (1281), Expect = e-138, Method: Compositional matrix adjust.
Identities = 245/374 (65%), Positives = 293/374 (78%), Gaps = 8/374 (2%)
Query: 2 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASL-GEYGMNMETSNHISF 60
PVF+ D LGN E + P K GPGEGG AY + + R+ L +YGMNM SN IS
Sbjct: 70 PVFR-DDVLGNFEMSM-PKKVGPGEGGAAYVI--SGRSVEQQKLKNQYGMNMVVSNEISP 125
Query: 61 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
+RTIPDLR++ECKYW YP DLP SVI+VFHNEG S LMRTVHS+I R+P Q+L E++LV
Sbjct: 126 NRTIPDLRLDECKYWHYPEDLPGTSVIVVFHNEGLSVLMRTVHSVINRSPRQFLHEVVLV 185
Query: 121 DDFSSKADLDQKLEDYIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 178
DDFS K +L ++LE+YI R G VRL+RN R+GLIR+RS GA+ + GEVI+FLDAHC
Sbjct: 186 DDFSDKLNLREELENYIARNFPKGLVRLVRNKSRQGLIRSRSYGAEVATGEVILFLDAHC 245
Query: 179 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE 238
EVG NWLPPLLAPI ++ K MTVPVIDGID++ +E+R VY H+RGIFEWGMLYKE E
Sbjct: 246 EVGANWLPPLLAPIKANPKTMTVPVIDGIDHENFEYRPVYHGKQHFRGIFEWGMLYKEIE 305
Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
+PE E K+R +SEPYKSPTHAGGLFAM+R +FLELGGYDPGLLVWGGENFELSFK+W C
Sbjct: 306 IPEEEVKRRTKHSEPYKSPTHAGGLFAMNREYFLELGGYDPGLLVWGGENFELSFKLWQC 365
Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
GG I WVPCSR+GHVYR FMPY+FG L + KGPLI NYKRV+E WFDE +K YFYTRE
Sbjct: 366 GGQILWVPCSRVGHVYRGFMPYSFGDLGKKRKGPLIVINYKRVVEVWFDE-YKEYFYTRE 424
Query: 359 PLAMFLDMGDISEQ 372
P+A D G++++Q
Sbjct: 425 PMARDYDAGNLTQQ 438
>gi|241651003|ref|XP_002411252.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215503882|gb|EEC13376.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 478
Score = 495 bits (1275), Expect = e-137, Method: Compositional matrix adjust.
Identities = 244/367 (66%), Positives = 292/367 (79%), Gaps = 5/367 (1%)
Query: 10 LGNLEPPLEPY--KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL 67
LGN EP + K PGEGG YH P + S EYGMN+ S+HIS +RTIPD+
Sbjct: 1 LGNFEPAVADVVDKRKPGEGGFPYHTPPKLKNNVAHSNMEYGMNVVASDHISPNRTIPDM 60
Query: 68 RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
R++ECKYWDYP DLP SV++VFHNEG S LMRTVHS+I R+P Q+L+E++LVDD+S K
Sbjct: 61 RLQECKYWDYPTDLPTTSVVVVFHNEGLSVLMRTVHSVINRSPRQFLKEVVLVDDYSDKE 120
Query: 128 DLDQKLEDYIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
+L +LE YI R G VRL+RN ER+GLIR+RS GA++S G+V++FLDAHCEVG+NWL
Sbjct: 121 NLKGELETYIARNFPVGLVRLLRNEERQGLIRSRSYGAEQSVGDVVLFLDAHCEVGINWL 180
Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
PPLLAPI ++R MTVPVIDGID T+E+R VY H+RGIFEWGMLYKE E+PE E K
Sbjct: 181 PPLLAPIRANRYTMTVPVIDGIDKDTFEYRPVYHGGQHFRGIFEWGMLYKEIEIPEEEIK 240
Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
+RKY+SEPYKSPTHAGGLFA+DR +FL+LGGYDPGLLVWGGENFELSFKIW CGGSI WV
Sbjct: 241 RRKYHSEPYKSPTHAGGLFAIDRKYFLKLGGYDPGLLVWGGENFELSFKIWQCGGSIYWV 300
Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLD 365
PCSR+GHVYR FMPY+FGKLA + KGP++T NYKRV+E W DE +K YFYTREP+A D
Sbjct: 301 PCSRVGHVYRGFMPYSFGKLAHKRKGPIVTVNYKRVVEVWMDE-YKEYFYTREPMARHYD 359
Query: 366 MGDISEQ 372
GD+S Q
Sbjct: 360 PGDLSGQ 366
>gi|195172039|ref|XP_002026809.1| GL27027 [Drosophila persimilis]
gi|194111748|gb|EDW33791.1| GL27027 [Drosophila persimilis]
Length = 567
Score = 490 bits (1261), Expect = e-136, Method: Compositional matrix adjust.
Identities = 239/372 (64%), Positives = 280/372 (75%), Gaps = 24/372 (6%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
R V K LGN EP + GPGE G+A+ L + D S EYGMN+ SN IS
Sbjct: 61 REVPKLIEGLGNFEPKDLKPRSGPGENGEAHSLSPDKKNVADDSEMEYGMNIACSNDISM 120
Query: 61 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
R++ D R+EECK+WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P L EIILV
Sbjct: 121 HRSVRDTRLEECKHWDYPYDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPKHMLHEIILV 180
Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
DD+S K DL L++Y ++FNG V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDYSDKEDLRSHLDEYSKQFNGLVKIIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
LNWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NLNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 301 RREQRRRTHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 360
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
SI D+ KGPLIT NYKRVIETWFD+ HK YFYTREPL
Sbjct: 361 SI------------------------DKKKGPLITINYKRVIETWFDDTHKEYFYTREPL 396
Query: 361 AMFLDMGDISEQ 372
A +LDMGDI+EQ
Sbjct: 397 ARYLDMGDITEQ 408
>gi|324505926|gb|ADY42538.1| N-acetylgalactosaminyltransferase 7 [Ascaris suum]
Length = 640
Score = 456 bits (1173), Expect = e-126, Method: Compositional matrix adjust.
Identities = 226/376 (60%), Positives = 283/376 (75%), Gaps = 16/376 (4%)
Query: 3 VFKADGKLGNLEPPLEPYKEGP-GEGGKAYHL----PEAYRAAGDASLGEYGMNMETSNH 57
VFK G+LGN EP + + G GE G+ ++ PE RA + E+G N S+
Sbjct: 115 VFKK-GELGNFEPKEKQSRPGKHGEMGEPVNVDLNQPEVQRA-----MNEFGFNTFVSDM 168
Query: 58 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
IS +R++PD+RM+ECKYW YP DLP ASV++VFHNEG+S L+RTVHS+I R+P L+EI
Sbjct: 169 ISLNRSVPDVRMDECKYWHYPEDLPTASVVIVFHNEGWSPLLRTVHSVILRSPPNLLKEI 228
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
+LVDDFS K L +L+ YI++FNGKVRL+RN EREGLIRTRS GA+ + G+V++FLDAH
Sbjct: 229 VLVDDFSDKEHLKDRLDRYIEQFNGKVRLVRNNEREGLIRTRSIGAQHAVGDVVIFLDAH 288
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKE 236
CEV +NWLPPLLAPI +RK+MTVPVIDGID TW +R VY D H+RGIFEWG+LYKE
Sbjct: 289 CEVNINWLPPLLAPIRRNRKVMTVPVIDGIDMHTWSYRRVYGSADRHFRGIFEWGLLYKE 348
Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
E+ + EA++RKYNSEP++SPTHAGGLFA+D+ +F ELG YDPGL +WGGE +ELSFKIW
Sbjct: 349 TEITKEEARRRKYNSEPFRSPTHAGGLFAIDKKWFEELGYYDPGLQIWGGEQYELSFKIW 408
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
CGG I +VPCS +GHVYRS MPY FGKL+ + P+I+ N RVI+TW DE K Y+Y
Sbjct: 409 QCGGGILFVPCSHVGHVYRSHMPYGFGKLSGK---PVISTNMVRVIKTWMDEYEK-YYYI 464
Query: 357 REPLAMFLDMGDISEQ 372
REP A GDIS Q
Sbjct: 465 REPSAKHRSPGDISAQ 480
>gi|449664489|ref|XP_002168298.2| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Hydra
magnipapillata]
Length = 599
Score = 444 bits (1142), Expect = e-122, Method: Compositional matrix adjust.
Identities = 216/376 (57%), Positives = 272/376 (72%), Gaps = 6/376 (1%)
Query: 2 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
PVF +D KLGN E E K GPGEGGK + L + + G YG N S+ IS D
Sbjct: 51 PVFLSDNKLGNFEK-YEDVKSGPGEGGKPHRLKPEQKEEEERLKGVYGFNQLVSDEISLD 109
Query: 62 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
R +PD+R EECK+W YP DLP +SVI +FHNEG+S+L+R+VHS+I RTPA L EI+LVD
Sbjct: 110 RVVPDMREEECKHWSYPNDLPSSSVIFIFHNEGWSTLLRSVHSVINRTPAHLLHEIVLVD 169
Query: 122 DFSSKADLDQKLEDYIQR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
D S L ++L++ I++ + KV+L+RN +REGLIR R+ GA + GEV+VFLDAHCE
Sbjct: 170 DKSELEHLHERLDEEIKKPYYQSKVKLVRNKQREGLIRARNIGAIAATGEVLVFLDAHCE 229
Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 239
VG NWLPPL+API D +T P+IDGI++ + VY+ H RGIFEWGMLYKE +L
Sbjct: 230 VGGNWLPPLIAPIQEDPTTLTAPIIDGINWDDFSINPVYQKGSHSRGIFEWGMLYKETDL 289
Query: 240 PEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCG 299
PE+EA+KR Y+SEPY SPTHAGGLFA+ R++F ELG YDPGLL+WGGEN+ELSFK+W CG
Sbjct: 290 PEKEARKRLYHSEPYNSPTHAGGLFAIKRSWFKELGWYDPGLLIWGGENYELSFKLWQCG 349
Query: 300 GSIEWVPCSRIGHVYR--SFMPYNFGKLADRVKG-PLITYNYKRVIETWFDEKHKAYFYT 356
G WVPCS + HVYR S + G + + G PL NYKR+IE WFD+K+K +FYT
Sbjct: 350 GRSLWVPCSHVSHVYRGHSCSSCHSGDMGRKWSGIPLSLRNYKRLIEVWFDDKYKEFFYT 409
Query: 357 REPLAMFLDMGDISEQ 372
REPLA F+D GD+SEQ
Sbjct: 410 REPLARFIDTGDVSEQ 425
>gi|312094065|ref|XP_003147897.1| hypothetical protein LOAG_12336 [Loa loa]
Length = 560
Score = 444 bits (1142), Expect = e-122, Method: Compositional matrix adjust.
Identities = 222/376 (59%), Positives = 281/376 (74%), Gaps = 16/376 (4%)
Query: 3 VFKADGKLGNLEPPLEPYKEGP-GEGGKAY----HLPEAYRAAGDASLGEYGMNMETSNH 57
+FK D ++GN EP ++ G GE GK +LPE +A + EYG N S+
Sbjct: 37 IFKMD-EIGNFEPKEIQWQPGNYGEMGKPVFVDKNLPEVKKA-----MREYGFNTYVSDM 90
Query: 58 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
IS +R+IPD+R++ECKYW YP DLP ASV++ FHNEG++ L+RTVHS++ R+P+Q ++EI
Sbjct: 91 ISLNRSIPDVRLDECKYWHYPEDLPSASVVIAFHNEGWTPLLRTVHSVLLRSPSQLIKEI 150
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
ILVDDFS K L +LE Y+++F GKV+LIRN EREGLIRTRS GAKE+ G+V+VFLDAH
Sbjct: 151 ILVDDFSDKEHLKDRLERYLKQFRGKVKLIRNAEREGLIRTRSIGAKEAVGDVVVFLDAH 210
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP-DHHYRGIFEWGMLYKE 236
CEV +NWLPPLLAPI +RK+MTVPVIDGID W +R VY D HYRGIFEWG+LYKE
Sbjct: 211 CEVNINWLPPLLAPIRQNRKVMTVPVIDGIDKDDWSYRIVYSSVDKHYRGIFEWGLLYKE 270
Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
E+P +E +RK++SEP++SPTHAGGLFA+ + +F ELG YDPGL +WGGE +ELSFKIW
Sbjct: 271 TEIPAQELLRRKHSSEPFRSPTHAGGLFAISKKWFEELGYYDPGLQIWGGEQYELSFKIW 330
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
CGG I ++PCS +GHVYRS MPY FGKL+ + P+I+ N RVI+TW DE K Y+Y
Sbjct: 331 QCGGGILFIPCSHVGHVYRSHMPYGFGKLSGK---PVISTNMLRVIKTWMDEYEK-YYYI 386
Query: 357 REPLAMFLDMGDISEQ 372
REP A GDIS Q
Sbjct: 387 REPSAKHRLPGDISSQ 402
>gi|393911317|gb|EFO16172.2| hypothetical protein LOAG_12336 [Loa loa]
Length = 562
Score = 444 bits (1141), Expect = e-122, Method: Compositional matrix adjust.
Identities = 222/376 (59%), Positives = 281/376 (74%), Gaps = 16/376 (4%)
Query: 3 VFKADGKLGNLEPPLEPYKEGP-GEGGKAY----HLPEAYRAAGDASLGEYGMNMETSNH 57
+FK D ++GN EP ++ G GE GK +LPE +A + EYG N S+
Sbjct: 39 IFKMD-EIGNFEPKEIQWQPGNYGEMGKPVFVDKNLPEVKKA-----MREYGFNTYVSDM 92
Query: 58 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
IS +R+IPD+R++ECKYW YP DLP ASV++ FHNEG++ L+RTVHS++ R+P+Q ++EI
Sbjct: 93 ISLNRSIPDVRLDECKYWHYPEDLPSASVVIAFHNEGWTPLLRTVHSVLLRSPSQLIKEI 152
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
ILVDDFS K L +LE Y+++F GKV+LIRN EREGLIRTRS GAKE+ G+V+VFLDAH
Sbjct: 153 ILVDDFSDKEHLKDRLERYLKQFRGKVKLIRNAEREGLIRTRSIGAKEAVGDVVVFLDAH 212
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP-DHHYRGIFEWGMLYKE 236
CEV +NWLPPLLAPI +RK+MTVPVIDGID W +R VY D HYRGIFEWG+LYKE
Sbjct: 213 CEVNINWLPPLLAPIRQNRKVMTVPVIDGIDKDDWSYRIVYSSVDKHYRGIFEWGLLYKE 272
Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
E+P +E +RK++SEP++SPTHAGGLFA+ + +F ELG YDPGL +WGGE +ELSFKIW
Sbjct: 273 TEIPAQELLRRKHSSEPFRSPTHAGGLFAISKKWFEELGYYDPGLQIWGGEQYELSFKIW 332
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
CGG I ++PCS +GHVYRS MPY FGKL+ + P+I+ N RVI+TW DE K Y+Y
Sbjct: 333 QCGGGILFIPCSHVGHVYRSHMPYGFGKLSGK---PVISTNMLRVIKTWMDEYEK-YYYI 388
Query: 357 REPLAMFLDMGDISEQ 372
REP A GDIS Q
Sbjct: 389 REPSAKHRLPGDISSQ 404
>gi|308506779|ref|XP_003115572.1| CRE-GLY-7 protein [Caenorhabditis remanei]
gi|308256107|gb|EFP00060.1| CRE-GLY-7 protein [Caenorhabditis remanei]
Length = 601
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 213/370 (57%), Positives = 276/370 (74%), Gaps = 9/370 (2%)
Query: 7 DGKLGNLEP--PLEPYKEGPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRT 63
DG+LGN EP P P + PGE G+ + E AAG A+ E+G N S+ IS +RT
Sbjct: 79 DGELGNYEPKTPEIPSNQ-PGEHGRPVPVTDEEGMAAGRAAEKEFGFNTYVSDLISMNRT 137
Query: 64 IPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 123
IPD+R +ECK+WDYP +LP SV++VFHNEG++ L+RTVHS++ R+P + +E I++VDD
Sbjct: 138 IPDIRPKECKHWDYPENLPTVSVVIVFHNEGWTPLLRTVHSVLLRSPPELIESIVMVDDD 197
Query: 124 SSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLN 183
S K L +KL+ Y+ RFNGKV ++R +REGLI RS GAK S GEV++FLDAHCEV N
Sbjct: 198 SDKPHLKEKLDKYVTRFNGKVIVVRTEQREGLINARSIGAKHSTGEVVLFLDAHCEVNTN 257
Query: 184 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELPER 242
WLPPLLAPI +RK+MTVPVIDGID +WE+RSVY P+ H+ GIFEWG+LYKE ++ ER
Sbjct: 258 WLPPLLAPIKQNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHSGIFEWGLLYKETQITER 317
Query: 243 EAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSI 302
E+ RK+NS+P++SPTHAGGLFA++R +F ELG YD GL +WGGE +ELSFKIW CGG I
Sbjct: 318 ESAHRKHNSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSFKIWQCGGGI 377
Query: 303 EWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM 362
+VPCS +GHVYRS MPY FGK + + P+I+ N RV++TW D+ K Y+ TREP A
Sbjct: 378 VFVPCSHVGHVYRSHMPYGFGKFSGK---PVISINMMRVVKTWMDDYSK-YYLTREPQAA 433
Query: 363 FLDMGDISEQ 372
++ GDIS Q
Sbjct: 434 HVNPGDISAQ 443
>gi|170593939|ref|XP_001901721.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158590665|gb|EDP29280.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 645
Score = 437 bits (1124), Expect = e-120, Method: Compositional matrix adjust.
Identities = 214/372 (57%), Positives = 278/372 (74%), Gaps = 8/372 (2%)
Query: 3 VFKADGKLGNLEPPLEPYKEGP-GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
+FK D ++GN EP + G GE G+ + + +A + EYG N S+ IS +
Sbjct: 122 IFKLD-EIGNFEPKETQLQPGDYGEMGEPVLIDKTLTEVKEA-MREYGFNTYVSDMISLN 179
Query: 62 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
R+IPD+RM+ECKYW YP DLP AS+++ FHNEG++ L+RTVHS++ R+P ++EII+VD
Sbjct: 180 RSIPDVRMDECKYWHYPEDLPTASIVIAFHNEGWTPLLRTVHSVLLRSPPHLIKEIIMVD 239
Query: 122 DFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 181
DFS K L +L+ Y+++F+GKV+L+RN+EREGLIRTRS GAKE+ G+V++FLDAHCEV
Sbjct: 240 DFSDKEHLKDRLDVYLKQFDGKVKLVRNSEREGLIRTRSIGAKEAVGDVVIFLDAHCEVN 299
Query: 182 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELP 240
+NWLPPLLAPI +RK+MTVPVIDGID W +R VY D HYRGIFEWG+LYKE EL
Sbjct: 300 VNWLPPLLAPIRQNRKVMTVPVIDGIDKNDWSYRIVYGSADKHYRGIFEWGLLYKETELS 359
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
+E +RK+NSEP++SPTHAGGLFA+++ +F ELG YDPGL +WGGE +ELSFKIW CGG
Sbjct: 360 SQELLRRKHNSEPFRSPTHAGGLFAINKKWFEELGYYDPGLQIWGGEQYELSFKIWQCGG 419
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
I +VPCS +GHVYRS MPY FGKL+ + P+I+ N RVI+TW DE K Y+Y REP
Sbjct: 420 GILFVPCSHVGHVYRSHMPYGFGKLSGK---PVISTNMLRVIKTWMDEYDK-YYYIREPS 475
Query: 361 AMFLDMGDISEQ 372
A G+IS Q
Sbjct: 476 ARHRLPGNISSQ 487
>gi|268555252|ref|XP_002635614.1| C. briggsae CBR-GLY-7 protein [Caenorhabditis briggsae]
Length = 601
Score = 436 bits (1120), Expect = e-119, Method: Compositional matrix adjust.
Identities = 211/369 (57%), Positives = 274/369 (74%), Gaps = 7/369 (1%)
Query: 7 DGKLGNLEPPL-EPYKEGPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTI 64
DG+LGN EP E PGE G+ + E AAG A+ E+G N S+ IS +RTI
Sbjct: 79 DGELGNYEPKTAEIPSNQPGEHGRPVPVTDEEGMAAGRAAEKEFGFNTYVSDMISMNRTI 138
Query: 65 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
PD+R +ECK+WDYP +LP SV++VFHNEG++ L+RTVHS++ R+P + +E+I++VDD S
Sbjct: 139 PDIRPKECKHWDYPENLPTVSVVIVFHNEGWTPLLRTVHSVLLRSPPELIEQIVMVDDDS 198
Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
K L +KL+ Y+ RFNGKV ++R +REGLI RS GAK S GEV++FLDAHCEV NW
Sbjct: 199 DKPHLKEKLDKYVTRFNGKVIVVRTEQREGLINARSIGAKHSTGEVVLFLDAHCEVNTNW 258
Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELPERE 243
LPPLLAPI +RK+MTVPVIDGID +WE+RSVY P+ H+ GIFEWG+LYKE ++ ERE
Sbjct: 259 LPPLLAPIKQNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHSGIFEWGLLYKETQITERE 318
Query: 244 AKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIE 303
+ RK+ S+P++SPTHAGGLFA++R +F ELG YD GL +WGGE +ELSFKIW CGG I
Sbjct: 319 SGHRKHTSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSFKIWQCGGGIV 378
Query: 304 WVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMF 363
+VPCS +GHVYRS MPY FGK + + P+I+ N RV++TW D+ K Y+ TREP A
Sbjct: 379 FVPCSHVGHVYRSHMPYGFGKFSGK---PVISINMMRVVKTWMDDYSK-YYLTREPQAAH 434
Query: 364 LDMGDISEQ 372
++ GDIS Q
Sbjct: 435 VNPGDISAQ 443
>gi|341881851|gb|EGT37786.1| hypothetical protein CAEBREN_30257 [Caenorhabditis brenneri]
gi|341887866|gb|EGT43801.1| CBN-GLY-7 protein [Caenorhabditis brenneri]
Length = 601
Score = 436 bits (1120), Expect = e-119, Method: Compositional matrix adjust.
Identities = 210/369 (56%), Positives = 275/369 (74%), Gaps = 7/369 (1%)
Query: 7 DGKLGNLEPPL-EPYKEGPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTI 64
+G+LGN EP + E PGE G+ + E AAG A+ E+G N S+ IS +RTI
Sbjct: 79 EGELGNYEPKIPEVPSNQPGEHGRPVPVTDEEGMAAGRAAEKEFGFNTYVSDMISMNRTI 138
Query: 65 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
PD+R +ECK+WDYP +LP SV++VFHNEG++ L+RTVHS++ R+P + +E+I++VDD S
Sbjct: 139 PDIRPKECKHWDYPENLPTVSVVIVFHNEGWTPLLRTVHSVLLRSPPELIEQIVMVDDDS 198
Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
K L +KL+ Y+ RFNGKV ++R +REGLI RS GAK S GEV++FLDAHCEV NW
Sbjct: 199 DKQHLKEKLDKYVTRFNGKVIVVRTEQREGLINARSIGAKHSTGEVVLFLDAHCEVNTNW 258
Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELPERE 243
LPPLLAPI +RK+MTVPVIDGID +WE+RSVY P+ H+ GIFEWG+LYKE ++ ERE
Sbjct: 259 LPPLLAPIKRNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHSGIFEWGLLYKETQITERE 318
Query: 244 AKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIE 303
RK++S+P++SPTHAGGLFA++R +F ELG YD GL +WGGE +ELSFKIW CGG I
Sbjct: 319 TAHRKHSSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSFKIWQCGGGIV 378
Query: 304 WVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMF 363
+VPCS +GHVYRS MPY FGK + + P+I+ N RV++TW D+ K Y+ TREP A
Sbjct: 379 FVPCSHVGHVYRSHMPYGFGKFSGK---PVISINMMRVVKTWMDDYEK-YYLTREPQAAH 434
Query: 364 LDMGDISEQ 372
++ GDIS Q
Sbjct: 435 VNPGDISAQ 443
>gi|17561826|ref|NP_503512.1| Protein GLY-7 [Caenorhabditis elegans]
gi|51315810|sp|O61397.1|GALT7_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 7;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 7; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7; Short=pp-GaNTase 7
gi|3047203|gb|AAC13677.1| GLY7 [Caenorhabditis elegans]
gi|373219860|emb|CCD70652.1| Protein GLY-7 [Caenorhabditis elegans]
Length = 601
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 212/368 (57%), Positives = 274/368 (74%), Gaps = 9/368 (2%)
Query: 9 KLGNLEP--PLEPYKEGPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
+LGN EP P P + PGE GK + E AAG A+ E+G N S+ IS +RTIP
Sbjct: 81 ELGNYEPKEPEIPSNQ-PGEHGKPVPVTDEEGMAAGRAAEKEFGFNTYVSDMISMNRTIP 139
Query: 66 DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
D+R EECK+WDYP LP SV++VFHNEG++ L+RTVHS++ R+P + +E++++VDD S
Sbjct: 140 DIRPEECKHWDYPEKLPTVSVVVVFHNEGWTPLLRTVHSVLLRSPPELIEQVVMVDDDSD 199
Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
K L +KL+ Y+ RFNGKV ++R +REGLI RS GAK S GEV++FLDAHCEV NWL
Sbjct: 200 KPHLKEKLDKYVTRFNGKVIVVRTEQREGLINARSIGAKHSTGEVVLFLDAHCEVNTNWL 259
Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELPEREA 244
PPLLAPI +RK+MTVPVIDGID +WE+RSVY P+ H+ GIFEWG+LYKE ++ ERE
Sbjct: 260 PPLLAPIKRNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHSGIFEWGLLYKETQITERET 319
Query: 245 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
RK+NS+P++SPTHAGGLFA++R +F ELG YD GL +WGGE +ELSFKIW CGG I +
Sbjct: 320 AHRKHNSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSFKIWQCGGGIVF 379
Query: 305 VPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
VPCS +GHVYRS MPY+FGK + + P+I+ N RV++TW D+ K Y+ TREP A +
Sbjct: 380 VPCSHVGHVYRSHMPYSFGKFSGK---PVISINMMRVVKTWMDDYSK-YYLTREPQATNV 435
Query: 365 DMGDISEQ 372
+ GDIS Q
Sbjct: 436 NPGDISAQ 443
>gi|195130803|ref|XP_002009840.1| GI15586 [Drosophila mojavensis]
gi|193908290|gb|EDW07157.1| GI15586 [Drosophila mojavensis]
Length = 595
Score = 427 bits (1098), Expect = e-117, Method: Compositional matrix adjust.
Identities = 201/294 (68%), Positives = 238/294 (80%)
Query: 10 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
LGN EP + GPGE G+ + L + DAS EYGMN+ S+ IS R++ D R+
Sbjct: 70 LGNFEPRDLKPRTGPGENGEGHILSPDKKNVADASEMEYGMNIACSDEISMHRSVRDTRL 129
Query: 70 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
EECK+WDYP DLP SVI+VFHNEGFS LMRTVHS+I R+P L EIILVDDFS K +L
Sbjct: 130 EECKHWDYPYDLPPTSVIIVFHNEGFSVLMRTVHSVIDRSPKHMLHEIILVDDFSDKENL 189
Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
KL++Y+ +F G V++IRNTEREGLIRTRSRGA E+ GEVIVFLDAHCEV LNWLPPLL
Sbjct: 190 RSKLDEYVLQFKGLVKIIRNTEREGLIRTRSRGAMEATGEVIVFLDAHCEVNLNWLPPLL 249
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
APIY DR +MTVP+IDGID++T+E+R VY D+H+RGIFEWGMLYKENE+P RE ++R +
Sbjct: 250 APIYRDRTVMTVPIIDGIDHKTFEYRPVYGSDNHFRGIFEWGMLYKENEVPRREQRRRAH 309
Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIE 303
NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGGSIE
Sbjct: 310 NSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIE 363
>gi|443700020|gb|ELT99205.1| hypothetical protein CAPTEDRAFT_172619 [Capitella teleta]
Length = 336
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 184/304 (60%), Positives = 232/304 (76%), Gaps = 2/304 (0%)
Query: 38 RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSS 97
+ A D S+ E+G NM S+ IS +RTIPD RMEECKYW YP LP ASVILVFHNEG+S+
Sbjct: 4 KEAADRSIREFGFNMVASDKISMNRTIPDTRMEECKYWHYPKTLPSASVILVFHNEGWST 63
Query: 98 LMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIR 157
L+RTVHS+I +P + L EI++VDDFS K L +LEDY+++F+GKV+L RN ER GLI
Sbjct: 64 LVRTVHSVIDMSPPELLHEIVMVDDFSDKEHLKTRLEDYLKQFHGKVKLYRNKERLGLIG 123
Query: 158 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 217
TR+ GA+ + G+ IVFLDAHCE NWLPPLLA I DR I+ +PVIDGID+ + + V
Sbjct: 124 TRTLGAQYATGDAIVFLDAHCECNRNWLPPLLARIAYDRTILAIPVIDGIDFDNFRYNPV 183
Query: 218 YEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGY 277
Y +RGIFEWG LYKE+++P + +R++ SE YKSPTHAGGLFA+DR +F ELG Y
Sbjct: 184 YSGRELFRGIFEWGFLYKESKVPGKTLLERQHQSEAYKSPTHAGGLFAIDRKYFFELGAY 243
Query: 278 DPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYN 337
DPGL +WGGENFELSFKIW CGGS+EWVPCS +GHVYR+ MPY FGK+ ++ P++ N
Sbjct: 244 DPGLQIWGGENFELSFKIWQCGGSVEWVPCSHVGHVYRNSMPYGFGKINPKI--PVVLLN 301
Query: 338 YKRV 341
Y R+
Sbjct: 302 YMRL 305
>gi|390332219|ref|XP_781199.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like
[Strongylocentrotus purpuratus]
Length = 606
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 197/374 (52%), Positives = 258/374 (68%), Gaps = 10/374 (2%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
VF+ DG G+ EP P +EGPGEGG A + +A D + EYG N S+ IS DR
Sbjct: 81 VFR-DGVRGDYEPVNLPVREGPGEGGAAVRTQPSEKAKVDRLIQEYGFNQYVSDQISLDR 139
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
I DLR ++CK+W YP LP SVI+VFHNEG+S+L+RTVHS+ R+P+Q L EIILVDD
Sbjct: 140 NIADLRSQQCKHWHYPETLPTTSVIIVFHNEGWSTLLRTVHSVFNRSPSQLLHEIILVDD 199
Query: 123 FSSKADLDQKLEDYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
FS+K L ++LEDY+Q RFNGK++L+RN+ REGLIRTR GA+ S G+V+++LDAHCEV
Sbjct: 200 FSTKEHLKERLEDYVQEARFNGKLKLVRNSRREGLIRTRIIGARHSTGDVLLWLDAHCEV 259
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
G+NWLPPLL PI +R P+ID ID + D RG F+W + +K +P
Sbjct: 260 GVNWLPPLLTPIAVNRTTAVCPIIDVIDNMDYRVYPQGTGDQD-RGGFDWSLYWKHLPVP 318
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
+ E +R++ SEPY+SP AGGLFAMDR +F ELG YD GL +WGGENFELSFKIWMCGG
Sbjct: 319 QFEKSRRQHASEPYRSPAMAGGLFAMDRKYFFELGAYDEGLEIWGGENFELSFKIWMCGG 378
Query: 301 SIEWVPCSRIGHVYRSF--MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
S+ WVPCSR+GHVYR +PY+ + + L N +RV+E WFD+ +K YFY +
Sbjct: 379 SLLWVPCSRVGHVYRILGKVPYSAPNGSMLI---LSERNLRRVVEVWFDD-YKEYFYRSK 434
Query: 359 PLAMFLDMGDISEQ 372
P ++ + G+I +Q
Sbjct: 435 PESLLVSTGNIEKQ 448
>gi|291241093|ref|XP_002740445.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 7-like [Saccoglossus
kowalevskii]
Length = 594
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 198/378 (52%), Positives = 254/378 (67%), Gaps = 16/378 (4%)
Query: 3 VFKADGKLGNLE--PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
VFK+ LGN E PP + + GPGE KA + D S+ EYG N S+ IS
Sbjct: 67 VFKSR-VLGNYENLPPSQEGRTGPGEYAKAVKTTPDEQKQVDRSINEYGFNQYVSDKISL 125
Query: 61 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
DRTI DLR E+CKYW YP LP VI+VFHNEG+S+L+RTVHS+ RTP L E++LV
Sbjct: 126 DRTIKDLREEQCKYWHYPESLPAVGVIIVFHNEGWSTLLRTVHSLFNRTPPTLLHEVVLV 185
Query: 121 DDFSSKADLDQKLEDYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 178
DDFS+K L ++LE+Y++ RF GK++L+RN +REGLIRTR+ GA S +V+V+LDAHC
Sbjct: 186 DDFSNKEHLRERLEEYVKEPRFLGKIKLVRNAKREGLIRTRTVGAIHSTADVLVWLDAHC 245
Query: 179 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE 238
EVG+NWLPPLL+PI +R +TVP+ID ID + RS + RG F+W + +K
Sbjct: 246 EVGINWLPPLLSPIAQNRTTVTVPIIDVIDNMDYTMRSQGSGELS-RGGFDWSLYWKHLP 304
Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
+ + E +KR +SEPY+SP AGGLFAM R +F ELG YDPGL VWGGENFELSFKIW C
Sbjct: 305 MSKEETRKRSLSSEPYRSPAMAGGLFAMARDYFFELGAYDPGLEVWGGENFELSFKIWQC 364
Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY----NYKRVIETWFDEKHKAYF 354
GGS+ WVPCS +GHVYR GK+ R +T NY+RV+E W D+ +K +F
Sbjct: 365 GGSMLWVPCSHVGHVYRI-----LGKVPYRAPNATMTQWSLRNYRRVVEVWMDD-YKEFF 418
Query: 355 YTREPLAMFLDMGDISEQ 372
Y +P + L GDIS+Q
Sbjct: 419 YRSKPESQLLHFGDISKQ 436
>gi|66472462|ref|NP_001018477.1| N-acetylgalactosaminyltransferase 7 [Danio rerio]
gi|63100869|gb|AAH95642.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 7 [Danio rerio]
Length = 652
Score = 380 bits (976), Expect = e-103, Method: Compositional matrix adjust.
Identities = 195/369 (52%), Positives = 258/369 (69%), Gaps = 8/369 (2%)
Query: 8 GKLGNLEPPL-EPY--KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 64
G LGN EP EP+ + GPGEG K + L Y+ A AS+ E+G NM S+ IS DRT+
Sbjct: 125 GTLGNFEPKEPEPHGVQGGPGEGSKPFVLGPEYKDAVQASIKEFGFNMVASDMISLDRTV 184
Query: 65 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
DLR EECKYW+Y +L +SVI+VFHNEG+S+LMRTVHS+IKRTP +YL EI+++DDFS
Sbjct: 185 GDLRHEECKYWNYDENLLTSSVIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIVMIDDFS 244
Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGA-KESRGEVIVFLDAHCEVGLN 183
+KA L ++LE+YI+++NG V++ RN +REGLI+ RS GA K + G+V+++LDAHCEVG+N
Sbjct: 245 NKAHLKERLEEYIKQWNGLVKVFRNEKREGLIQARSIGARKATLGKVLIYLDAHCEVGVN 304
Query: 184 WLPPLLAPIYSDRKIMTVPVIDGIDYQ--TWEFRSVYEPDHHYRGIFEWGMLYKENELPE 241
W PL+API DR + TVP+ID ID T E + + D RG ++W +L+K L
Sbjct: 305 WYAPLVAPISKDRTVCTVPLIDYIDGNDYTIEPQQGGDEDGLARGAWDWSLLWKRVPLSS 364
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
RE KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG
Sbjct: 365 REKAKRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGQ 424
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+ +VPCSR+GH+YR + V NY RV+E W+D+ +K YFY P
Sbjct: 425 LLFVPCSRVGHIYR-LQGWQGNPPPAHVGSSPTLKNYVRVVEVWWDD-YKDYFYASRPET 482
Query: 362 MFLDMGDIS 370
+ L GDIS
Sbjct: 483 LTLAYGDIS 491
>gi|410927898|ref|XP_003977377.1| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase
7-like [Takifugu rubripes]
Length = 664
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 197/376 (52%), Positives = 259/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV K G LGNLEP P P G GEG K + L Y+ A AS+ E+G NM S+ I
Sbjct: 132 PVLKK-GILGNLEPKEPEPPGVPGGLGEGAKPFVLNAEYKDAIQASIKEFGFNMVASDMI 190
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR+I D+R +ECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 191 SLDRSISDIRHDECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRRYLAEIV 250
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKE-SRGEVIVFLDAH 177
++DDFS+K L ++LE+YI+++NG V+L RN +REGLI+ RS GAK+ ++G+V+++LDAH
Sbjct: 251 MIDDFSNKVHLKERLEEYIKQWNGLVKLFRNEKREGLIQARSIGAKKATKGQVLIYLDAH 310
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHH--YRGIFEWGMLYK 235
CEVG+NW PL+API DR + TVP+ID ID Q + D + RG ++W ML+K
Sbjct: 311 CEVGINWYAPLVAPISKDRTVCTVPLIDSIDGQKYTVDPQGGGDQNGFARGAWDWSMLWK 370
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +RE + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 371 RVPLGDREKQLRKTETEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 430
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSRIGH+YR + V NY RV+E W+DE +K YFY
Sbjct: 431 WQCGGQLLFVPCSRIGHIYR-LHGWQGNPPPAHVGSSPTLKNYVRVVEVWWDE-YKDYFY 488
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 489 ASRPETLTLAYGDISE 504
>gi|432847870|ref|XP_004066191.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Oryzias
latipes]
Length = 653
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 195/377 (51%), Positives = 259/377 (68%), Gaps = 11/377 (2%)
Query: 2 PVFKADGKLGNLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
P+ + G LGN EP EP +G PGEG K L Y+ + AS+ E+G NM S+
Sbjct: 122 PILRK-GTLGNFEPK-EPEPQGILNGPGEGAKPLILGSEYKDSVQASIKEFGFNMVASDM 179
Query: 58 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
IS DRTI DLR +ECKYW Y L +SV++VFHNEG+S+LMRTVHS+IKRTP QYL EI
Sbjct: 180 ISMDRTISDLRNDECKYWHYDDRLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRQYLAEI 239
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKE-SRGEVIVFLDA 176
+++DDFS+K L ++LE+YI+++NG V+L RN +REGLI+ RS GAK+ ++G+V+++LDA
Sbjct: 240 VMIDDFSNKVHLKERLEEYIKQWNGLVKLFRNDKREGLIQARSIGAKKATKGQVLIYLDA 299
Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQ--TWEFRSVYEPDHHYRGIFEWGMLY 234
HCEVG+NW PL+API DR + TVP+ID I + T E + + D RG ++W ML+
Sbjct: 300 HCEVGINWYAPLIAPISKDRTVCTVPLIDSIHGERFTIEPQGGGDEDGFARGAWDWSMLW 359
Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
K L +RE K RK +EPY+SP AGGLFA++R +F ELG YDPGL +WGGENFE+S+K
Sbjct: 360 KRVPLGDREKKLRKTQTEPYRSPAMAGGLFAIERDYFFELGLYDPGLQIWGGENFEISYK 419
Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
IW CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K +F
Sbjct: 420 IWQCGGQLLFVPCSRVGHIYR-LQGWQGNPPPAHVGSSPTLKNYVRVVEVWWDE-YKDFF 477
Query: 355 YTREPLAMFLDMGDISE 371
Y P + L GDISE
Sbjct: 478 YASRPETLTLAYGDISE 494
>gi|326918604|ref|XP_003205578.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Meleagris
gallopavo]
Length = 665
Score = 376 bits (966), Expect = e-102, Method: Compositional matrix adjust.
Identities = 196/376 (52%), Positives = 260/376 (69%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K Y L Y+ + AS+ E+G NM S+ I
Sbjct: 133 PVLRP-GVLGNFEPKEPEPHGVVGGPGEEAKPYVLGPDYKESVQASIKEFGFNMVASDMI 191
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 192 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 251
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+KA L ++L+DYI+++NG V++ RN REGLI+ RS GA++++ G+V+V+LDAH
Sbjct: 252 LIDDFSNKAHLKERLDDYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAH 311
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEVG+NW PL+API DR TVP+ID ID T++ + + D RG ++W ML+K
Sbjct: 312 CEVGINWYAPLIAPISKDRTTCTVPLIDVIDGNTFKIVPQGGGDEDGFARGAWDWSMLWK 371
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +RE +KR+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 372 RVPLSKREKEKRETKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 431
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 432 WQCGGKLLFVPCSRVGHIYR-LQGWQGNPPPVYVGSSPTLKNYVRVVEVWWDE-YKDYFY 489
Query: 356 TREPLAMFLDMGDISE 371
P L GDISE
Sbjct: 490 ASRPETKALPYGDISE 505
>gi|198419403|ref|XP_002128971.1| PREDICTED: similar to UDP-N-acetyl-alpha-D-galactosamine:
polypeptide N-acetylgalactosaminyltransferase 7 [Ciona
intestinalis]
Length = 631
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 198/387 (51%), Positives = 259/387 (66%), Gaps = 19/387 (4%)
Query: 1 RPVFKAD---------GKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMN 51
+PV K D KLGN E L + GPGE G A H + A AS+ E+G N
Sbjct: 91 KPVVKEDFSNYPQLNWRKLGNYEESLA-RRNGPGEYGVAVHATNDEKEAVAASIKEFGFN 149
Query: 52 METSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPA 111
M S+ IS DR DLR +EC++WDYP DLP SVI+VFHNEG+S+L+RTVHS+I TP
Sbjct: 150 MVNSDKISLDRLPKDLRHDECRHWDYPSDLPDVSVIIVFHNEGWSTLVRTVHSVINLTPK 209
Query: 112 QYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR---G 168
+ L EI+++DD S+K L QKL +YIQRFNG V+L RN REGLIR RS GA++S G
Sbjct: 210 KLLYEIVMIDDHSNKEHLGQKLTEYIQRFNGLVKLYRNERREGLIRARSIGAQKSTPADG 269
Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHH--YRG 226
V+V+LDAHCEVG NWLPPL+ PI ++RK+ TVP+ID I+ Q + F S D + RG
Sbjct: 270 RVLVYLDAHCEVGYNWLPPLIMPIVNNRKVTTVPLIDVINGQDYTFTSQAGGDANGFARG 329
Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
++W ML+K L + E +RK+ ++PY+SP AGGLFA++R +F ++G YDPGL +WGG
Sbjct: 330 AWDWSMLWKRVPLTKEEHNRRKHTTDPYRSPAMAGGLFAIERQYFFDIGLYDPGLEIWGG 389
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP-YNFGKLADRVKGPLITYNYKRVIETW 345
ENFE+SFKIWMC G + +VPCSR+GHVYR +P ++ + V NY RV+ETW
Sbjct: 390 ENFEMSFKIWMCEGEVLFVPCSRVGHVYR--LPGWSGNPPPEYVPSNPSLRNYIRVVETW 447
Query: 346 FDEKHKAYFYTREPLAMFLDMGDISEQ 372
+DE +K YFY P + + GDIS Q
Sbjct: 448 WDE-YKDYFYASRPETLNMPYGDISAQ 473
>gi|345307492|ref|XP_001507110.2| PREDICTED: N-acetylgalactosaminyltransferase 7-like
[Ornithorhynchus anatinus]
Length = 873
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 194/377 (51%), Positives = 257/377 (68%), Gaps = 11/377 (2%)
Query: 2 PVFKADGKLGNLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
PV + GKLGN EP EP +G PGE K Y L Y+ + AS+ E+G NM S+
Sbjct: 98 PVLQP-GKLGNFEPK-EPEPQGVMGGPGEEAKPYVLGPEYKDSIQASIKEFGFNMVASDM 155
Query: 58 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
IS DR+I DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI
Sbjct: 156 ISLDRSINDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRRYLAEI 215
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDA 176
+L+DDFS+KA L +L+DYI+++NG V++ RN REGLI+ RS GA++++ G+V+++LDA
Sbjct: 216 VLIDDFSNKAHLKDRLDDYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDA 275
Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLY 234
HCEV +NW PL+API DR + TVP+ID I T+ + + D + RG ++W ML+
Sbjct: 276 HCEVAVNWYAPLVAPISKDRTVCTVPLIDVISGNTFNIVPQGGGDEDGYARGAWDWSMLW 335
Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
K L +RE RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+K
Sbjct: 336 KRVPLTQREKTLRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYK 395
Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
IW CGG + +VPCSR+GH+YR + V NY RV+E W+D+ +K YF
Sbjct: 396 IWQCGGKLLFVPCSRVGHIYR-LHGWQGNPPPVYVGSSPTLKNYVRVVEVWWDD-YKDYF 453
Query: 355 YTREPLAMFLDMGDISE 371
Y P L GDISE
Sbjct: 454 YASRPETKALPYGDISE 470
>gi|148237032|ref|NP_001084848.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Xenopus
laevis]
gi|47124654|gb|AAH70527.1| MGC78803 protein [Xenopus laevis]
Length = 653
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 190/376 (50%), Positives = 261/376 (69%), Gaps = 11/376 (2%)
Query: 2 PVFKADGKLGNLEPPLEP----YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
PV + G LGN+EP EP +GPGEGGK + L Y+ A A++ E+G NM S+
Sbjct: 121 PVLRP-GILGNMEPK-EPEPQGVVDGPGEGGKHFMLGPDYKDAIKATIKEFGFNMVASDM 178
Query: 58 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
IS DRTI DLR EECK+W+Y +L +SVI+VFHNEG+S+LMRTVHS+IKRTP +YL EI
Sbjct: 179 ISLDRTINDLRHEECKFWNYDENLLTSSVIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEI 238
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDA 176
+++DDFS+K L ++L++YI+++NG V++ RN REGLI+ RS GA++++ G+V+++LDA
Sbjct: 239 VMIDDFSNKEHLKERLDEYIKQWNGLVKVFRNERREGLIQARSIGAEKAKLGQVLIYLDA 298
Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLY 234
HCEVG+NW PL+API DR TVP+ID I+ T+E ++ + D RG ++W ML+
Sbjct: 299 HCEVGINWYAPLIAPIAKDRTTCTVPLIDVIEGNTYELIPQAGGDEDGFARGAWDWSMLW 358
Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
K L +E ++RK +EPY+SP AGGLFA++R +F ELG YDPGL +WGGENFE+S+K
Sbjct: 359 KRVPLTSKEKEQRKTKTEPYRSPAMAGGLFAIEREYFFELGLYDPGLQIWGGENFEISYK 418
Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
IW CGG + + PCSR+GH+YR + V NY RV+E W+DE ++ YF
Sbjct: 419 IWQCGGKLLFTPCSRVGHIYR-LHGWQGNPTPAHVGSSPTLKNYVRVVEVWWDE-YRDYF 476
Query: 355 YTREPLAMFLDMGDIS 370
Y P L GDIS
Sbjct: 477 YASRPETKALAYGDIS 492
>gi|344235654|gb|EGV91757.1| N-acetylgalactosaminyltransferase 7 [Cricetulus griseus]
Length = 607
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 196/376 (52%), Positives = 257/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 75 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 133
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 134 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 193
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFSSK L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 194 LIDDFSSKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 253
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 254 CEVAVNWYAPLVAPISKDRTICTVPIIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 313
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L RE + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 314 RVPLTSREKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 373
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 374 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 431
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 432 ASRPESKALPYGDISE 447
>gi|449500526|ref|XP_002187477.2| PREDICTED: N-acetylgalactosaminyltransferase 7 [Taeniopygia
guttata]
Length = 828
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 195/376 (51%), Positives = 258/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K Y L Y+ + AS+ E+G NM S+ I
Sbjct: 296 PVLRP-GVLGNFEPKEPEPHGVVGGPGEEAKPYVLGPDYKESVQASIKEFGFNMVASDMI 354
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 355 SLDRSVNDLRQEECKYWHYDDNLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 414
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+KA L ++L+DYI+++NG V++ RN REGLI+ RS GA++++ G+V+V+LDAH
Sbjct: 415 LIDDFSNKAHLQERLDDYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAH 474
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
CEVG+NW PL+API DR TVP+ID ID + E + + D RG ++W +L+K
Sbjct: 475 CEVGINWYAPLIAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 534
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 535 RIPLSHKEKSKRKHKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 594
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 595 WQCGGKLLFVPCSRVGHIYR-LQGWQGNPPPVYVGSSPTLKNYVRVVEVWWDE-YKDYFY 652
Query: 356 TREPLAMFLDMGDISE 371
P L GDISE
Sbjct: 653 ASRPETKALPYGDISE 668
>gi|344288243|ref|XP_003415860.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Loxodonta
africana]
Length = 657
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 195/376 (51%), Positives = 258/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPPL-EPYKE--GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPHGVVGGPGEKAKPLVLGPEFKPAVQASIKEFGFNMVASDMI 183
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L QKL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKQKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR + TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRAVCTVPIIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 363
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L ERE + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTEREKRMRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 423
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 482 ASRPESKALPYGDISE 497
>gi|390345015|ref|XP_787987.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like isoform 2
[Strongylocentrotus purpuratus]
gi|390345017|ref|XP_003726244.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like isoform 1
[Strongylocentrotus purpuratus]
Length = 670
Score = 373 bits (958), Expect = e-101, Method: Compositional matrix adjust.
Identities = 188/377 (49%), Positives = 253/377 (67%), Gaps = 15/377 (3%)
Query: 2 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
PV K GN EPP +PY+ GPGE G L + D + EYG NM S+ IS D
Sbjct: 135 PVLKE--TTGNYEPPRQPYRTGPGEYGLGVLLDHNEKHLYDKAFEEYGFNMVVSDRISLD 192
Query: 62 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
R + DLR +ECK+W YP +LP SV++VFH EG+S+L+RT+HS+ +P + L E++LVD
Sbjct: 193 RIVADLRDKECKHWHYPTNLPNTSVVIVFHQEGWSTLIRTIHSVFNTSPKELLAEVLLVD 252
Query: 122 DFSSKADLDQKLEDYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
D+S K L +KL+DYI+ RF+GK+R++RN +REGLIR+R+ GA+++ G+V+ FLDAHCE
Sbjct: 253 DYSDKVHLKKKLDDYIRDPRFSGKIRIVRNKKREGLIRSRTIGARKAIGQVLTFLDAHCE 312
Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 239
G NWLPPLLA I DR + P +D I T+ + S + D RG F+W YK +
Sbjct: 313 CGPNWLPPLLAEIAVDRSTIVCPTVDAISSDTFAYTS--QGDGLCRGAFDWDFWYK--RI 368
Query: 240 PEREAKKR---KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
P + R K S+PY SP AGGL A+DR++F ELGGYDPGL +WGGENFE+SFK+W
Sbjct: 369 PVKPYWHRLGLKQRSQPYPSPVMAGGLLALDRSYFFELGGYDPGLQIWGGENFEISFKVW 428
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG-PLITYNYKRVIETWFDEKHKAYFY 355
MCGGS+++VPCSR+GHVYR +PY++ + V+G ++ NY RV E W DE +K FY
Sbjct: 429 MCGGSLKFVPCSRVGHVYRKQVPYSYP--SSGVEGVSVVDLNYMRVAEVWLDE-YKDSFY 485
Query: 356 TREPLAMFLDMGDISEQ 372
+PL G+ISEQ
Sbjct: 486 ATKPLLEGKPCGNISEQ 502
>gi|363733313|ref|XP_420521.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Gallus gallus]
Length = 636
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 194/376 (51%), Positives = 259/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K Y L Y+ + AS+ E+G NM S+ I
Sbjct: 104 PVLRP-GVLGNFEPKEPEPHGVVGGPGEEAKPYVLGPDYKESVQASIKEFGFNMVASDMI 162
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECK+W Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 163 SLDRSVNDLRQEECKHWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 222
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L ++L+DYI+++NG V++ RN REGLI+ RS GA++++ G+V+V+LDAH
Sbjct: 223 LIDDFSNKVHLKERLDDYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAH 282
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEVG+NW PL+API DR TVP+ID ID T++ + + D RG ++W ML+K
Sbjct: 283 CEVGINWYAPLIAPISKDRTTCTVPLIDVIDGDTFKIVPQGGGDEDGFARGAWDWSMLWK 342
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +RE +KR+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 343 RVPLSKREKEKRETKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 402
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 403 WQCGGKLLFVPCSRVGHIYR-LQGWQGNPPPVYVGSSPTLKNYVRVVEVWWDE-YKDYFY 460
Query: 356 TREPLAMFLDMGDISE 371
P L GDISE
Sbjct: 461 ASRPETKALPYGDISE 476
>gi|449270894|gb|EMC81540.1| N-acetylgalactosaminyltransferase 7, partial [Columba livia]
Length = 613
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 194/376 (51%), Positives = 259/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K Y L Y+ + AS+ E+G NM S+ I
Sbjct: 81 PVLRP-GVLGNFEPKEPEPHGVVGGPGEEAKPYVLGPDYKESIQASIKEFGFNMVASDMI 139
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 140 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 199
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+KA L ++L++YI+++NG V++ RN REGLI+ RS GA++++ G+V+V+LDAH
Sbjct: 200 LIDDFSNKAHLKERLDEYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAH 259
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
CEVG+NW PL+API DR TVP+ID ID + E + + D RG ++W +L+K
Sbjct: 260 CEVGINWYAPLIAPIAKDRTTCTVPLIDYIDGSDYSIEPQQGGDEDGFARGAWDWSLLWK 319
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L ++E KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 320 RIPLSQKEKSKRKHKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 379
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 380 WQCGGKLLFVPCSRVGHIYR-LQGWQGNPPPVYVGSSPTLKNYVRVVEVWWDE-YKDYFY 437
Query: 356 TREPLAMFLDMGDISE 371
P L GDISE
Sbjct: 438 ASRPETKALPYGDISE 453
>gi|12621080|ref|NP_075215.1| N-acetylgalactosaminyltransferase 7 [Rattus norvegicus]
gi|51315737|sp|Q9R0C5.1|GALT7_RAT RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
Short=pp-GaNTase 7; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 7; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7
gi|4092503|gb|AAC99426.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase T6 [Rattus
norvegicus]
gi|149032267|gb|EDL87173.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7, isoform CRA_a
[Rattus norvegicus]
Length = 657
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 194/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GVLGNFEPKEPEPHGVVGGPGENAKPLVLGPEYKQAAQASIKEFGFNMAASDMI 183
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL +YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLTEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPIIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L RE + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTPREKRLRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 423
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 482 ASRPESKALPYGDISE 497
>gi|327268630|ref|XP_003219099.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Anolis
carolinensis]
Length = 654
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 191/376 (50%), Positives = 257/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K Y L Y+ + AS+ E+G NM S+ I
Sbjct: 122 PVLRP-GILGNFEPKEPEPHGVVNGPGEEAKPYVLGAEYKESVQASIKEFGFNMVASDMI 180
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR+I D+R EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 181 SLDRSINDIRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 240
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+KA L ++LE+YI+++NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 241 LIDDFSNKAHLKERLEEYIKQWNGLVKIFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 300
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR TVP+ID ID T+ + + D + RG ++W ML+K
Sbjct: 301 CEVAVNWYAPLIAPISKDRTTCTVPLIDVIDGNTYNIVPQGGGDDDGYARGAWDWSMLWK 360
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +RE + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 361 RVPLTKREKEMRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 420
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + + PCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 421 WQCGGKLLFTPCSRVGHIYR-LQGWQGNPPPAYVGSSPTLKNYVRVVEVWWDE-YKDYFY 478
Query: 356 TREPLAMFLDMGDISE 371
P L GDI++
Sbjct: 479 ASRPETKALAYGDITD 494
>gi|291244621|ref|XP_002742193.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 7-like
[Saccoglossus kowalevskii]
Length = 634
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 189/374 (50%), Positives = 248/374 (66%), Gaps = 10/374 (2%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
VFK G +GN EPP + G GEG L A + E+G NM S+ IS DR
Sbjct: 115 VFKP-GIVGNFEPPKSERRTGLGEGAIPVQLNPADENKYVKAKREFGFNMVISDQISLDR 173
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
T+ D+R ECKYW YP DLP ASV+LVF NEG+S+LMRTVHS+ +P+ L EI++VDD
Sbjct: 174 TVKDIRDPECKYWHYPTDLPTASVVLVFINEGWSTLMRTVHSVFNTSPSHLLAEIVMVDD 233
Query: 123 FSSKADLDQKLEDYIQ--RFNGKVRLIRNTEREGLIRTRSRGA-KESRGEVIVFLDAHCE 179
FS K L KLE+YI+ RF GK++L+RN +REGLIR R+ GA RGEV+VFLDAHCE
Sbjct: 234 FSDKDHLKSKLEEYIKQDRFEGKIKLVRNAKREGLIRARTIGAINAERGEVVVFLDAHCE 293
Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 239
NWLPPLL+ I +RK + P++D +D + + + D RG+F W YK +
Sbjct: 294 CSPNWLPPLLSRIKQNRKAVVCPLVDAVDADNFGYAP--QADGMARGVFNWDFFYKRIPI 351
Query: 240 PEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCG 299
P +EA +R+ NSEPY+SP AGGLFA+ R+FF ++GGYD GL +WGGE +E+SFKIWMCG
Sbjct: 352 PPKEANRRERNSEPYRSPVMAGGLFALSRSFFFDIGGYDNGLDIWGGEQYEISFKIWMCG 411
Query: 300 GSIEWVPCSRIGHVY-RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
G +E+VPCSR+GH+Y R +PY++ + D + ++ NY RV E W DE +K YFY +
Sbjct: 412 GILEFVPCSRVGHIYRRGGIPYSYPQSDDGIS--IVNKNYLRVAEVWMDE-YKEYFYRMK 468
Query: 359 PLAMFLDMGDISEQ 372
P GDI+EQ
Sbjct: 469 PELRGKPYGDITEQ 482
>gi|345790686|ref|XP_543898.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Canis lupus
familiaris]
Length = 721
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 194/376 (51%), Positives = 257/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 189 PVLRP-GILGNFEPKEPEPHGVVGGPGENAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 247
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 248 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 307
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 308 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 367
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 368 CEVAVNWYAPLVAPISKDRTICTVPIIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 427
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L RE + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 428 RVPLTPREKRMRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 487
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 488 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 545
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 546 ASRPESKALPYGDISE 561
>gi|338722468|ref|XP_001915592.2| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Equus
caballus]
Length = 621
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 194/376 (51%), Positives = 258/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 89 PVLRP-GILGNFEPKEPEPHGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 147
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 148 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 207
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 208 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 267
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ +T+E + + D + RG ++W ML+K
Sbjct: 268 CEVAVNWYAPLIAPISKDRTICTVPIIDVINGKTYEIIPQGGGDEDGYARGAWDWSMLWK 327
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L RE + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 328 RVPLTPREKRMRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 387
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 388 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 445
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 446 ASRPESKALPYGDISE 461
>gi|417411949|gb|JAA52393.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 615
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 193/376 (51%), Positives = 257/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 83 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 141
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 262 CEVAVNWYAPLVAPISKDRTICTVPIIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 321
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 322 RVPLTPQEKRMRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 381
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V+ NY RV+E W+DE +K YFY
Sbjct: 382 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVRSSPTLKNYVRVVEVWWDE-YKDYFY 439
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 440 ASRPESKALAYGDISE 455
>gi|119896052|ref|XP_602855.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Bos taurus]
Length = 772
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 195/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 240 PVLRP-GVLGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 298
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L AS+I+VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 299 SLDRSVNDLRQEECKYWHYDENLLTASIIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 358
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 359 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 418
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 419 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 478
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L RE + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 479 RVPLTLREKRLRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 538
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 539 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPVYVGSSPTLKNYVRVVEVWWDE-YKDYFY 596
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 597 ASRPESKALAYGDISE 612
>gi|440908503|gb|ELR58512.1| N-acetylgalactosaminyltransferase 7, partial [Bos grunniens mutus]
Length = 615
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 195/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 83 PVLRP-GVLGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 141
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L AS+I+VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTASIIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 262 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 321
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L RE + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 322 RVPLTPREKRLRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 381
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 382 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPVYVGSSPTLKNYVRVVEVWWDE-YKDYFY 439
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 440 ASRPESKALAYGDISE 455
>gi|354484375|ref|XP_003504364.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Cricetulus
griseus]
Length = 784
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 195/376 (51%), Positives = 255/376 (67%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 252 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 310
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 311 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 370
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFSSK L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 371 LIDDFSSKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 430
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W ML+K
Sbjct: 431 CEVAVNWYAPLVAPISKDRATCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSMLWK 490
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 491 RIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 550
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 551 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 608
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 609 ASRPESKALPYGDISE 624
>gi|335301041|ref|XP_001926518.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Sus scrofa]
Length = 712
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 193/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K L + A AS+ E+G NM S+ I
Sbjct: 180 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPVVLGPELKHAVQASIKEFGFNMVASDMI 238
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L AS+++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 239 SLDRSVNDLRQEECKYWHYDENLLTASIVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 298
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 299 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 358
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 359 CEVAVNWYAPLVAPISKDRTICTVPIIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 418
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L RE + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 419 RVPLTPREKRMRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 478
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 479 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPISVGSSPTLKNYVRVVEVWWDE-YKDYFY 536
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 537 ASRPESKALPYGDISE 552
>gi|296484976|tpg|DAA27091.1| TPA: N-acetylgalactosaminyltransferase 7-like [Bos taurus]
Length = 781
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 195/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 249 PVLRP-GVLGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 307
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L AS+I+VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 308 SLDRSVNDLRQEECKYWHYDENLLTASIIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 367
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 368 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 427
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 428 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 487
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L RE + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 488 RVPLTLREKRLRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 547
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 548 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPVYVGSSPTLKNYVRVVEVWWDE-YKDYFY 605
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 606 ASRPESKALAYGDISE 621
>gi|359067894|ref|XP_002689501.2| PREDICTED: N-acetylgalactosaminyltransferase 7 [Bos taurus]
Length = 617
Score = 369 bits (948), Expect = e-100, Method: Compositional matrix adjust.
Identities = 195/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 85 PVLRP-GVLGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 143
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L AS+I+VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 144 SLDRSVNDLRQEECKYWHYDENLLTASIIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 203
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 204 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 263
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 264 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 323
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L RE + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 324 RVPLTLREKRLRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 383
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 384 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPVYVGSSPTLKNYVRVVEVWWDE-YKDYFY 441
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 442 ASRPESKALAYGDISE 457
>gi|269784707|ref|NP_653332.3| N-acetylgalactosaminyltransferase 7 isoform 1 [Mus musculus]
gi|51315950|sp|Q80VA0.2|GALT7_MOUSE RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
Short=pp-GaNTase 7; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 7; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7
gi|13650041|gb|AAK37549.1|AF349573_1 UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 [Mus musculus]
gi|30851602|gb|AAH52461.1| Galnt7 protein [Mus musculus]
Length = 657
Score = 369 bits (947), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 193/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPPL-EPYKE--GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 183
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L RE + RK +EPY+SP AGGLFA+++ FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTSREKRLRKTKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWGGENFEISYKI 423
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 482 ASRPESKALPYGDISE 497
>gi|395840002|ref|XP_003792859.1| PREDICTED: N-acetylgalactosaminyltransferase 7 isoform 1 [Otolemur
garnettii]
Length = 657
Score = 369 bits (947), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 194/370 (52%), Positives = 252/370 (68%), Gaps = 8/370 (2%)
Query: 8 GKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 64
G LGN EP EP+ GPGE K L + A AS+ E+G NM S+ IS DR+I
Sbjct: 130 GVLGNFEPKEPEPHGVVGGPGEKAKPVVLGPELKQAAQASIKEFGFNMVASDMISLDRSI 189
Query: 65 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
DLR EECKYW Y +L ASVI+VFHNEG+S+LMRTVHS+IKRTP +YL EI+L+DDFS
Sbjct: 190 NDLRQEECKYWHYDENLLTASVIVVFHNEGWSTLMRTVHSVIKRTPRKYLAEIVLIDDFS 249
Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLN 183
+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +N
Sbjct: 250 NKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVN 309
Query: 184 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPE 241
W PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L
Sbjct: 310 WYAPLVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWKRVPLTL 369
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
RE RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG
Sbjct: 370 REKSLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGK 429
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+ +VPCSR+GH+YR + V NY RV+E W+DE +K YFY P +
Sbjct: 430 LLFVPCSRVGHIYR-LEGWQGNPPPVSVGSSPTLKNYVRVVEVWWDE-YKDYFYASRPES 487
Query: 362 MFLDMGDISE 371
L GDISE
Sbjct: 488 KALPYGDISE 497
>gi|74139820|dbj|BAE31754.1| unnamed protein product [Mus musculus]
gi|74191634|dbj|BAE30388.1| unnamed protein product [Mus musculus]
gi|74198878|dbj|BAE30662.1| unnamed protein product [Mus musculus]
Length = 546
Score = 369 bits (946), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 193/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPPL-EPYKE--GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 14 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 72
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 73 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 132
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 133 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 192
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I T+E + + D + RG ++W ML+K
Sbjct: 193 CEVAVNWYAPLVAPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYARGAWDWSMLWK 252
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L RE + RK +EPY+SP AGGLFA+++ FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 253 RVPLTSREKRLRKTKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWGGENFEISYKI 312
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 313 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 370
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 371 ASRPESKALPYGDISE 386
>gi|301753757|ref|XP_002912714.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Ailuropoda
melanoleuca]
gi|281338294|gb|EFB13878.1| hypothetical protein PANDA_000463 [Ailuropoda melanoleuca]
Length = 657
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 193/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GVLGNFEPKEPEPHGVVGGPGENAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 183
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L KL+DY++ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKGKLDDYLKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPIIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L RE + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTPREKRMRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 423
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 482 ASRPESKALPFGDISE 497
>gi|387019377|gb|AFJ51806.1| n-acetylgalactosaminyltransferase 7-like [Crotalus adamanteus]
Length = 658
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 192/371 (51%), Positives = 250/371 (67%), Gaps = 10/371 (2%)
Query: 8 GKLGNLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRT 63
G LGN EP EP G PGE K + L Y+ + AS+ E+G NM S+ IS DR+
Sbjct: 131 GILGNFEPK-EPESHGVVGGPGEEAKPFVLGPEYKESIQASIKEFGFNMVASDMISLDRS 189
Query: 64 IPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 123
I DLR EECKYW Y +L +SVI+VFHNEG+S+LMRTVHS+IKRTP +YL EI+L+DDF
Sbjct: 190 INDLRQEECKYWHYDENLLTSSVIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIVLIDDF 249
Query: 124 SSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGL 182
S+K L ++LEDYI+++NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +
Sbjct: 250 SNKEHLKERLEDYIKQWNGLVKIFRNERREGLIQARSIGAQKAKLGKVLIYLDAHCEVAV 309
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELP 240
NW PL+API DR TVP+ID ID T+ + + D RG ++W ML+K L
Sbjct: 310 NWYAPLIAPISKDRTACTVPLIDVIDGNTYNIVPQGGGDEDGFARGAWDWSMLWKRVPLT 369
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
+RE RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG
Sbjct: 370 KREKAMRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGG 429
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
+ +VPCSR+GH+YR + V NY RV+E W+DE K YFY P
Sbjct: 430 QLLFVPCSRVGHIYR-LQGWQGNPPPAYVGSSPTLKNYVRVVEVWWDE-FKDYFYASRPE 487
Query: 361 AMFLDMGDISE 371
L GDIS+
Sbjct: 488 TKALAYGDISD 498
>gi|148696676|gb|EDL28623.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 7, isoform CRA_a [Mus
musculus]
Length = 615
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 193/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 83 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 141
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I T+E + + D + RG ++W ML+K
Sbjct: 262 CEVAVNWYAPLVAPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYARGAWDWSMLWK 321
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L RE + RK +EPY+SP AGGLFA+++ FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 322 RVPLTSREKRLRKTKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWGGENFEISYKI 381
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 382 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 439
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 440 ASRPESKALPYGDISE 455
>gi|390370478|ref|XP_793045.3| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like, partial [Strongylocentrotus purpuratus]
Length = 658
Score = 368 bits (945), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 177/315 (56%), Positives = 225/315 (71%), Gaps = 5/315 (1%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
G+ EP P +EGPGEGG A + +A D + EYG N S+ IS DR I DLR +
Sbjct: 333 GDYEPVNLPVREGPGEGGAAVRTQPSEKAKVDRLIQEYGFNQYVSDQISLDRNIADLRSQ 392
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
+CK+W YP LP SVI+VFHNEG+S+L+RTVHS+ R+P+Q L EIILVDDFS+K L
Sbjct: 393 QCKHWHYPETLPTTSVIIVFHNEGWSTLLRTVHSVFNRSPSQLLHEIILVDDFSTKEHLK 452
Query: 131 QKLEDYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
++LEDY+Q RFNGK++L+RN+ REGLIRTR GA+ S G+V+++LDAHCEVG+NWLPPL
Sbjct: 453 ERLEDYVQEARFNGKLKLVRNSRREGLIRTRIIGARHSTGDVLLWLDAHCEVGVNWLPPL 512
Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
L PI +R P+ID ID + D RG F+W + +K +P+ E +R+
Sbjct: 513 LTPIAVNRTTAVCPIIDVIDNMDYRVYPQGTGDQD-RGGFDWSLYWKHLPVPQFEKSRRQ 571
Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
+ SEPY+SP AGGLFAMDR +F ELG YD GL +WGGENFELSFKIWMCGGS+ WVPCS
Sbjct: 572 HASEPYRSPAMAGGLFAMDRKYFFELGAYDEGLEIWGGENFELSFKIWMCGGSLLWVPCS 631
Query: 309 RIGHVYRSF--MPYN 321
R+GHVYR +PY+
Sbjct: 632 RVGHVYRILGKVPYS 646
>gi|26329091|dbj|BAC28284.1| unnamed protein product [Mus musculus]
Length = 657
Score = 367 bits (941), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 192/376 (51%), Positives = 255/376 (67%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPPL-EPYKE--GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 183
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+ PI DR I TVP+ID I T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVPPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L RE + RK +EPY+SP AGGLFA+++ FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTSREKRLRKTKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWGGENFEISYKI 423
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 482 ASRPESKALPYGDISE 497
>gi|332820787|ref|XP_003310650.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Pan troglodytes]
gi|410227832|gb|JAA11135.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
troglodytes]
gi|410262380|gb|JAA19156.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
troglodytes]
gi|410297750|gb|JAA27475.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
troglodytes]
gi|410332293|gb|JAA35093.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
troglodytes]
Length = 657
Score = 367 bits (941), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 192/376 (51%), Positives = 257/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 183
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN +REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNEKREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTTQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 423
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 482 ASRPESQALPYGDISE 497
>gi|296195170|ref|XP_002745262.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Callithrix jacchus]
Length = 657
Score = 366 bits (940), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 193/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 183
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+V+LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAH 303
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 423
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 482 ASRPESQALPYGDISE 497
>gi|355778494|gb|EHH63530.1| hypothetical protein EGM_16517, partial [Macaca fascicularis]
Length = 615
Score = 366 bits (939), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 192/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 83 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 141
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 262 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 321
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 322 RVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 381
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 382 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 439
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 440 ASRPESQALPYGDISE 455
>gi|410956565|ref|XP_003984911.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Felis catus]
Length = 772
Score = 366 bits (939), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 192/376 (51%), Positives = 255/376 (67%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPP-LEPYKE--GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 240 PVLRP-GILGNFEPKEPEPHGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 298
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 299 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 358
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 359 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 418
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W +L+K
Sbjct: 419 CEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 478
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 479 RIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 538
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 539 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 596
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 597 ASRPESKALPYGDISE 612
>gi|395840004|ref|XP_003792860.1| PREDICTED: N-acetylgalactosaminyltransferase 7 isoform 2 [Otolemur
garnettii]
Length = 657
Score = 366 bits (939), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 192/370 (51%), Positives = 251/370 (67%), Gaps = 8/370 (2%)
Query: 8 GKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 64
G LGN EP EP+ GPGE K L + A AS+ E+G NM S+ IS DR+I
Sbjct: 130 GVLGNFEPKEPEPHGVVGGPGEKAKPVVLGPELKQAAQASIKEFGFNMVASDMISLDRSI 189
Query: 65 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
DLR EECKYW Y +L ASVI+VFHNEG+S+LMRTVHS+IKRTP +YL EI+L+DDFS
Sbjct: 190 NDLRQEECKYWHYDENLLTASVIVVFHNEGWSTLMRTVHSVIKRTPRKYLAEIVLIDDFS 249
Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLN 183
+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +N
Sbjct: 250 NKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVN 309
Query: 184 WLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPE 241
W PL+API DR TVP+ID ID + E + + D RG ++W +L+K L
Sbjct: 310 WYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLSH 369
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
+E KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG
Sbjct: 370 KEKAKRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGK 429
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+ +VPCSR+GH+YR + V NY RV+E W+DE +K YFY P +
Sbjct: 430 LLFVPCSRVGHIYR-LEGWQGNPPPVSVGSSPTLKNYVRVVEVWWDE-YKDYFYASRPES 487
Query: 362 MFLDMGDISE 371
L GDISE
Sbjct: 488 KALPYGDISE 497
>gi|397505872|ref|XP_003823466.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Pan paniscus]
Length = 657
Score = 366 bits (939), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 192/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 183
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 423
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 482 ASRPESQALPYGDISE 497
>gi|426222421|ref|XP_004005390.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Ovis aries]
Length = 865
Score = 366 bits (939), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 192/376 (51%), Positives = 254/376 (67%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP P P GPGE + L ++ A AS+ E+G NM S+ I
Sbjct: 333 PVLRP-GVLGNFEPKEPEPPGVVGGPGEKAQPLVLGPEFKHAVQASIKEFGFNMVASDMI 391
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L AS+I+VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 392 SLDRSVNDLRQEECKYWHYDENLLTASIIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 451
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 452 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 511
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W +L+K
Sbjct: 512 CEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 571
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 572 RIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 631
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 632 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPVYVGSSPTLKNYVRVVEVWWDE-YKDYFY 689
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 690 ASRPESKALAYGDISE 705
>gi|197101721|ref|NP_001124628.1| N-acetylgalactosaminyltransferase 7 [Pongo abelii]
gi|75042656|sp|Q5RFJ6.1|GALT7_PONAB RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
Short=pp-GaNTase 7; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 7; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7
gi|55725190|emb|CAH89461.1| hypothetical protein [Pongo abelii]
Length = 657
Score = 366 bits (939), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 192/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 183
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 423
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 482 ASRPESQALPYGDISE 497
>gi|157502212|ref|NP_059119.2| N-acetylgalactosaminyltransferase 7 [Homo sapiens]
gi|51315961|sp|Q86SF2.1|GALT7_HUMAN RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
Short=pp-GaNTase 7; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 7; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7
gi|28279289|gb|AAH46129.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Homo
sapiens]
gi|28704077|gb|AAH47468.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Homo
sapiens]
gi|119625166|gb|EAX04761.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Homo
sapiens]
gi|193786832|dbj|BAG52155.1| unnamed protein product [Homo sapiens]
gi|325464563|gb|ADZ16052.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7)
[synthetic construct]
Length = 657
Score = 366 bits (939), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 192/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 183
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 423
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 482 ASRPESQALPYGDISE 497
>gi|383412007|gb|AFH29217.1| N-acetylgalactosaminyltransferase 7 [Macaca mulatta]
Length = 657
Score = 365 bits (938), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 192/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 183
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 423
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 482 ASRPESQALPYGDISE 497
>gi|402870854|ref|XP_003899414.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Papio anubis]
Length = 657
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 192/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 183
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 423
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 482 ASRPESQALPYGDISE 497
>gi|269784709|ref|NP_001161453.1| N-acetylgalactosaminyltransferase 7 isoform 2 [Mus musculus]
gi|26331462|dbj|BAC29461.1| unnamed protein product [Mus musculus]
Length = 657
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 192/376 (51%), Positives = 255/376 (67%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 183
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRATCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSMLWK 363
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E KRK+ +EPY+SP AGGLFA+++ FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWGGENFEISYKI 423
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 482 ASRPESKALPYGDISE 497
>gi|148696677|gb|EDL28624.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 7, isoform CRA_b [Mus
musculus]
Length = 615
Score = 365 bits (936), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 192/376 (51%), Positives = 255/376 (67%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 83 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 141
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W ML+K
Sbjct: 262 CEVAVNWYAPLVAPISKDRATCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSMLWK 321
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E KRK+ +EPY+SP AGGLFA+++ FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 322 RIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWGGENFEISYKI 381
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 382 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 439
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 440 ASRPESKALPYGDISE 455
>gi|126331345|ref|XP_001372222.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Monodelphis
domestica]
Length = 585
Score = 364 bits (934), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 189/376 (50%), Positives = 257/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G +GN EP EP+ GPGE K Y L Y+ + AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GIIGNFEPKEPEPHGVLGGPGEEAKPYVLGPDYKESIHASIKEFGFNMVASDMI 183
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR+I DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSINDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+KA L ++L++YI+++NG V++ RN REGLI+ RS GA +++ G+V+++LDAH
Sbjct: 244 LIDDFSNKAHLKERLDEYIKQWNGLVKVFRNERREGLIQARSIGAHKAKLGQVLIYLDAH 303
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR + TVP+ID ID ++ + + D + RG ++W +L+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTVCTVPIIDIIDGNNFKIMPQGGGDEDGYARGAWDWSLLWK 363
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +RE RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTQREKTMRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 423
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + + NY RV+E W+D +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYLGSSPTLKNYIRVVEVWWD-GYKDYFY 481
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 482 ASRPESKALPYGDISE 497
>gi|426346015|ref|XP_004040686.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Gorilla gorilla
gorilla]
Length = 650
Score = 364 bits (934), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 191/376 (50%), Positives = 254/376 (67%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 118 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 176
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 177 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 236
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 237 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 296
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W +L+K
Sbjct: 297 CEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 356
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 357 RIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 416
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 417 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 474
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 475 ASRPESQALPYGDISE 490
>gi|403295730|ref|XP_003938783.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Saimiri boliviensis
boliviensis]
Length = 659
Score = 363 bits (932), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 192/376 (51%), Positives = 254/376 (67%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 127 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 185
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 186 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 245
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+V+LDAH
Sbjct: 246 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAH 305
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W +L+K
Sbjct: 306 CEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 365
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 366 RIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 425
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 426 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 483
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 484 ASRPESQALPYGDISE 499
>gi|348538240|ref|XP_003456600.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Oreochromis
niloticus]
Length = 649
Score = 363 bits (932), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 190/376 (50%), Positives = 254/376 (67%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPPLEPYKEGPGEG---GKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV K G LGN EP PG K + L Y+ + AS+ E+G NM S+ I
Sbjct: 117 PVLKK-GILGNFEPKEPEPPGVPGGPGEGAKPFVLGPEYKDSVQASIKEFGFNMVASDMI 175
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DRTI D+R EECKYW Y L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 176 SLDRTINDIRHEECKYWHYDDRLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRRYLAEIV 235
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKE-SRGEVIVFLDAH 177
L+DDFS+K L ++LE+YI+++NG V+L RN +REGLI+ RS GAK+ ++G+V+V+LDAH
Sbjct: 236 LIDDFSNKVHLKERLEEYIKQWNGLVKLFRNEKREGLIQARSIGAKKATKGQVLVYLDAH 295
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
CEVG+NW PL+API DR + TVP+ID ID + E + + D RG ++W +L+K
Sbjct: 296 CEVGINWYAPLIAPISKDRTVCTVPLIDYIDGNEYSMEPQQGGDEDGLARGAWDWSLLWK 355
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +RE KR + ++PY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 356 RVPLSQREKAKRTHTTQPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 415
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+D+ +K YFY
Sbjct: 416 WQCGGQLLFVPCSRVGHIYR-LQGWQGNPPPAHVGSSPTLKNYVRVVEVWWDD-YKDYFY 473
Query: 356 TREPLAMFLDMGDISE 371
P + L GDIS+
Sbjct: 474 ASRPETLTLAYGDISD 489
>gi|395542397|ref|XP_003773119.1| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase 7
[Sarcophilus harrisii]
Length = 797
Score = 363 bits (931), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 190/376 (50%), Positives = 256/376 (68%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEPPL-EPYKE--GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G +GN EP EP+ GPGE K Y L Y+ + AS+ E+G NM S+ I
Sbjct: 265 PVLRP-GIIGNFEPKEPEPHGVLGGPGEEAKPYVLGPDYKESIHASIKEFGFNMVASDMI 323
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR+I DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 324 SLDRSINDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 383
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+KA L ++L++YI+++NG V++ RN REGLI+ RS GA +++ G+V+++LDAH
Sbjct: 384 LIDDFSNKAHLKERLDEYIKQWNGLVKVFRNERREGLIQARSIGAHKAKLGQVLIYLDAH 443
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W +L+K
Sbjct: 444 CEVAVNWYAPLIAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 503
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 504 RIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 563
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + + NY RV+E W+D +K YFY
Sbjct: 564 WQCGGKLLFVPCSRVGHIYR-LSGWQGNPPPIYLGSSPTLKNYIRVVEVWWD-GYKDYFY 621
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 622 ASRPESKALPYGDISE 637
>gi|348566877|ref|XP_003469228.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Cavia
porcellus]
Length = 637
Score = 363 bits (931), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 191/377 (50%), Positives = 255/377 (67%), Gaps = 11/377 (2%)
Query: 2 PVFKADGKLGNLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
PV + G LGN EP EP +G PGE K L ++ A AS+ E+G NM S+
Sbjct: 105 PVLRP-GILGNFEPK-EPEPQGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDM 162
Query: 58 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
IS DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI
Sbjct: 163 ISLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEI 222
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDA 176
+L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDA
Sbjct: 223 VLIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDA 282
Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLY 234
HCEV +NW PL+API DR TVP+ID ID + E + + D RG ++W +L+
Sbjct: 283 HCEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLW 342
Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
K L +E KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+K
Sbjct: 343 KRIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYK 402
Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
IW CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YF
Sbjct: 403 IWQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYF 460
Query: 355 YTREPLAMFLDMGDISE 371
Y P + L GDISE
Sbjct: 461 YASRPESKALLYGDISE 477
>gi|441620192|ref|XP_003258074.2| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase 7
[Nomascus leucogenys]
Length = 636
Score = 361 bits (927), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 190/376 (50%), Positives = 253/376 (67%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G L N EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 104 PVLRP-GILSNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 162
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 163 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 222
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 223 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 282
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W +L+K
Sbjct: 283 CEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 342
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 343 RIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 402
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 403 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 460
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 461 ASRPESQALPYGDISE 476
>gi|351701091|gb|EHB04010.1| N-acetylgalactosaminyltransferase 7, partial [Heterocephalus
glaber]
Length = 616
Score = 361 bits (926), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 192/378 (50%), Positives = 257/378 (67%), Gaps = 12/378 (3%)
Query: 2 PVFKADGKLGNLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGDASL-GEYGMNMETSN 56
PV + G LGN EP EP +G PGE K L ++ A AS+ E+G NM S+
Sbjct: 83 PVLRP-GILGNFEPK-EPEPQGVVGGPGEEAKPLILGPEFKHAVQASIIKEFGFNMVASD 140
Query: 57 HISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 116
IS DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL E
Sbjct: 141 MISLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAE 200
Query: 117 IILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLD 175
I+L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LD
Sbjct: 201 IVLIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLD 260
Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGML 233
AHCEV +NW PL+API DR I TVP+ID I+ T++ + + D + RG ++W ML
Sbjct: 261 AHCEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYQIVPQGGGDEDGYARGAWDWSML 320
Query: 234 YKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSF 293
+K L RE + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+
Sbjct: 321 WKRVPLTPREKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISY 380
Query: 294 KIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAY 353
KIW CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K Y
Sbjct: 381 KIWQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDY 438
Query: 354 FYTREPLAMFLDMGDISE 371
FY P + L GDISE
Sbjct: 439 FYASRPESKALLYGDISE 456
>gi|6318186|emb|CAB60270.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 7 [Homo
sapiens]
Length = 657
Score = 360 bits (925), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 190/376 (50%), Positives = 253/376 (67%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 183
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR + DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRNVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E + RK +EPY+SP AGGL A++R FF ELG YDP L +WGGENFE+S+KI
Sbjct: 364 RVPLTPQEKRLRKTKTEPYRSPAMAGGLCAIEREFFFELGLYDPSLQIWGGENFEISYKI 423
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 482 ASRPESQALPYGDISE 497
>gi|355687724|gb|EHH26308.1| hypothetical protein EGK_16238, partial [Macaca mulatta]
Length = 615
Score = 360 bits (924), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 191/376 (50%), Positives = 254/376 (67%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 83 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 141
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 262 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 321
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 322 RVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 381
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W GG +VPCSR+GH+YR + V NY RV+E W+DE +K YFY
Sbjct: 382 WQGGGKFLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 439
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 440 ASRPESQALPYGDISE 455
>gi|260789880|ref|XP_002589972.1| hypothetical protein BRAFLDRAFT_114654 [Branchiostoma floridae]
gi|229275159|gb|EEN45983.1| hypothetical protein BRAFLDRAFT_114654 [Branchiostoma floridae]
Length = 522
Score = 359 bits (921), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 179/365 (49%), Positives = 242/365 (66%), Gaps = 5/365 (1%)
Query: 10 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
+GN EP E + PGEG Y L Y+ D S+ E+G N+ S+ IS DRTI D+R
Sbjct: 1 MGNWEPEPERISDAPGEGAIPYKLGPEYKDDIDKSIKEFGFNIVASDKISLDRTIKDIRD 60
Query: 70 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
ECKYW Y LP SVI+VF+NE +S +MRTVHS+IKRTP + L EI+LVDDFS+K
Sbjct: 61 PECKYWHYDTKLPNMSVIIVFYNEAWSVVMRTVHSVIKRTPPELLAEIVLVDDFSTKEHW 120
Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKE-SRGEVIVFLDAHCEVGLNWLPPL 188
Q+L+DYI +F G V+L+RN +REGLI+ RS GA+E ++G+++V+LD+HCEVG+NW P L
Sbjct: 121 KQRLDDYIVQFKGLVKLVRNKQREGLIQARSIGAREATKGKILVYLDSHCEVGINWAPAL 180
Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDH--HYRGIFEWGMLYKENELPEREAKK 246
++PI +R TVP+ID ID + + D H RG ++W +L+K+ RE +
Sbjct: 181 ISPIAVNRTTCTVPLIDVIDGNNYNIYAQGGGDEYGHARGAWDWSLLWKKVPNTPRERAR 240
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
KY++EPY+SP AGGLFA+DR +F ELG YDPGL +WGGENFE+S+K+W CGG + + P
Sbjct: 241 HKYHTEPYRSPAMAGGLFAIDREYFFELGLYDPGLKIWGGENFEISYKVWQCGGEVLFTP 300
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDM 366
CSR+GH+YR + ++ NY RV+E W+DE +K YFY P
Sbjct: 301 CSRVGHIYR-LKGWAGNPPPQHSGSSVVLQNYMRVVEVWWDE-YKEYFYASRPEIRNHPY 358
Query: 367 GDISE 371
GDISE
Sbjct: 359 GDISE 363
>gi|313226887|emb|CBY22032.1| unnamed protein product [Oikopleura dioica]
Length = 618
Score = 356 bits (913), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 188/377 (49%), Positives = 247/377 (65%), Gaps = 7/377 (1%)
Query: 1 RPVFKADGKLGNLEPPLEPYKE---GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
+ +++ GKLGN EP KE G G+ GK + + A S+ E+G NM S+
Sbjct: 82 KEIYRDSGKLGNYEPDQATIKEMETGTGDYGKQVNWGKDEEDAVKKSIKEFGFNMVMSDK 141
Query: 58 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
IS DR D+R +CKY DYP LP+ SV++VFHNEG+S+LMRTVHS+IK+TP + L E+
Sbjct: 142 ISLDRVPKDIRDPKCKYVDYPEKLPEVSVVIVFHNEGWSTLMRTVHSVIKQTPKELLGEV 201
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
++VDD S+K L L++Y++R+NG VR+ RN +REGLIR RS GA ES+ EV+VFLDAH
Sbjct: 202 VMVDDASTKEHLKDNLDEYVKRWNGLVRVHRNEQREGLIRARSIGAFESKKEVLVFLDAH 261
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR--GIFEWGMLYK 235
CE NWLPPLLAPI + +I TVP+IDGID + F S D R G ++W L+K
Sbjct: 262 CEAEFNWLPPLLAPIARNDRISTVPMIDGIDGNHYHFTSQGGGDRWGRATGAWDWSFLWK 321
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
LPE E KK +P+ SP AGGLFA++R +F ++ YDPGL +WGGENFELS+K+
Sbjct: 322 RIALPESEDKKLPSKIQPFPSPAMAGGLFAINRQYFKDIMYYDPGLEIWGGENFELSYKL 381
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
WMCGG + +VPCSR+GH+YR + VK NY+RVIETW+D+ K +FY
Sbjct: 382 WMCGGGMLFVPCSRVGHIYR-LEGWEGNPPPKTVKSNPSMRNYRRVIETWWDDWSK-FFY 439
Query: 356 TREPLAMFLDMGDISEQ 372
P A LD GDI Q
Sbjct: 440 VARPEAKTLDFGDIGPQ 456
>gi|109076193|ref|XP_001085532.1| PREDICTED: n-acetylgalactosaminyltransferase 7 [Macaca mulatta]
Length = 630
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 189/376 (50%), Positives = 252/376 (67%), Gaps = 9/376 (2%)
Query: 2 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 98 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 156
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 157 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 216
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 217 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 276
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 277 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 336
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
L +E + RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 337 RVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 396
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
W CGG + +VPCSR+GH+YR + V NY +E DE +K YFY
Sbjct: 397 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYLSSVEVCGDE-YKDYFY 454
Query: 356 TREPLAMFLDMGDISE 371
P + L GDISE
Sbjct: 455 ASRPESQALPYGDISE 470
>gi|198437817|ref|XP_002130165.1| PREDICTED: similar to UDP-N-acetyl-alpha-D-galactosamine:
polypeptide N-acetylgalactosaminyltransferase 7 [Ciona
intestinalis]
Length = 647
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 187/373 (50%), Positives = 242/373 (64%), Gaps = 8/373 (2%)
Query: 2 PVFKADGKLGNLE-PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
P F D LGN E + + G GE G+A L + + + +GE+G N S+ IS
Sbjct: 90 PKFVND-DLGNYELKAPDQKRAGAGEYGEAVQLDSSLDSQVKSVIGEFGFNTVASDRISL 148
Query: 61 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
DR DLR EECK+ DYP LP SVI+VFHNE +S LMRTVH++I TP QYL EI+++
Sbjct: 149 DRAPKDLRHEECKHIDYPSHLPSVSVIIVFHNEAWSPLMRTVHNVINNTPRQYLHEIVMI 208
Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
DD S K L KL++Y+ +FNG V++ RN REGLIR RS GAK+S GE++V+LDAHCE
Sbjct: 209 DDGSHKDHLGSKLDEYVTKFNGIVKVYRNDRREGLIRARSIGAKKSSGEILVYLDAHCEA 268
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHH--YRGIFEWGMLYKENE 238
NWLPPL+ PI +D + TVP+ID ID + F D + RG ++W +K
Sbjct: 269 EPNWLPPLITPILNDHRACTVPLIDVIDGNKYTFTEQAGGDENGLARGAWDWSFQWKRIP 328
Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
+ ++E +R SEPY+SP AGGLFA+DR FF ELG YD GL +WGGENFELS+K+WMC
Sbjct: 329 ITKKEKARRNRMSEPYRSPAMAGGLFAIDRNFFFELGLYDDGLEIWGGENFELSYKVWMC 388
Query: 299 GGSIEWVPCSRIGHVYRSFMP-YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
GG + +VPCSR+GHVYR +P + V + NYKRVIETW+D+ K YFYTR
Sbjct: 389 GGQLLFVPCSRVGHVYR--LPGWRGNPPPAYVPKDAVFRNYKRVIETWWDDYSK-YFYTR 445
Query: 358 EPLAMFLDMGDIS 370
P +D GD+S
Sbjct: 446 RPEVKSIDTGDLS 458
>gi|32425405|gb|AAH35303.1| GALNT7 protein, partial [Homo sapiens]
Length = 495
Score = 353 bits (907), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 176/337 (52%), Positives = 238/337 (70%), Gaps = 5/337 (1%)
Query: 38 RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSS 97
+ A AS+ E+G NM S+ IS DR++ DLR EECKYW Y +L +SV++VFHNEG+S+
Sbjct: 1 KQAIQASIKEFGFNMVASDMISLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWST 60
Query: 98 LMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIR 157
LMRTVHS+IKRTP +YL EI+L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+
Sbjct: 61 LMRTVHSVIKRTPRKYLAEIVLIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQ 120
Query: 158 TRSRGAKESR-GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-- 214
RS GA++++ G+V+++LDAHCEV +NW PL+API DR I TVP+ID I+ T+E
Sbjct: 121 ARSIGAQKAKLGQVLIYLDAHCEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIP 180
Query: 215 RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLEL 274
+ + D + RG ++W ML+K L +E + RK +EPY+SP AGGLFA++R FF EL
Sbjct: 181 QGGGDEDGYARGAWDWSMLWKRVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFEL 240
Query: 275 GGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLI 334
G YDPGL +WGGENFE+S+KIW CGG + +VPCSR+GH+YR + V
Sbjct: 241 GLYDPGLQIWGGENFEISYKIWQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPT 299
Query: 335 TYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
NY RV+E W+DE +K YFY P + L GDISE
Sbjct: 300 LKNYVRVVEVWWDE-YKDYFYASRPESQALPYGDISE 335
>gi|313220437|emb|CBY31290.1| unnamed protein product [Oikopleura dioica]
Length = 618
Score = 353 bits (907), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 187/377 (49%), Positives = 247/377 (65%), Gaps = 7/377 (1%)
Query: 1 RPVFKADGKLGNLEPPLEPYKE---GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
+ +++ GKLGN EP KE G G+ GK + + A S+ E+G NM S+
Sbjct: 82 KEIYRDSGKLGNYEPDQATIKEMETGTGDYGKQVNWGKDEEDAVKKSIKEFGFNMVMSDT 141
Query: 58 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
IS DR D+R +CKY DYP LP+ SV++VFHNEG+S+LMRTVHS+IK+TP + L E+
Sbjct: 142 ISLDRVPKDIRDPKCKYVDYPEKLPEVSVVIVFHNEGWSTLMRTVHSVIKQTPKELLGEV 201
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
++VDD S+K L L++Y++R+NG VR+ RN +REGLIR RS GA ES+ EV+VFLDAH
Sbjct: 202 VMVDDASTKEHLKDNLDEYVKRWNGLVRVHRNEQREGLIRARSIGAFESKKEVLVFLDAH 261
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR--GIFEWGMLYK 235
CE NWLPPLLAPI + +I TVP+IDGID + F + D R G ++W L+K
Sbjct: 262 CEAEFNWLPPLLAPIARNDRISTVPMIDGIDGNHYHFTTQGGGDRWGRATGAWDWSFLWK 321
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
LPE E KK +P+ SP AGGLFA++R +F ++ YDPGL +WGGENFELS+K+
Sbjct: 322 RIALPEPEDKKLPSKIQPFPSPAMAGGLFAINRQYFKDIMYYDPGLEIWGGENFELSYKL 381
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
WMCGG + +VPCSR+GH+YR + VK NY+RVIETW+D+ K +FY
Sbjct: 382 WMCGGGMLFVPCSRVGHIYR-LEGWEGNPPPKTVKSNPSMRNYRRVIETWWDDWSK-FFY 439
Query: 356 TREPLAMFLDMGDISEQ 372
P A LD GDI Q
Sbjct: 440 VARPEAKTLDFGDIGPQ 456
>gi|47575716|ref|NP_001001200.1| polypeptide N-acetylgalactosaminyltransferase 7 [Xenopus (Silurana)
tropicalis]
gi|45501097|gb|AAH67317.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 7 [Xenopus (Silurana)
tropicalis]
Length = 653
Score = 352 bits (902), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 189/375 (50%), Positives = 252/375 (67%), Gaps = 11/375 (2%)
Query: 2 PVFKADGKLGNLEPPLEPYKEGPGEGG----KAYHLPEAYRAAGDASLGEYGMNMETSNH 57
PV + G LGNLEP EP +G G K + L Y+ A AS+ E+G NM S+
Sbjct: 121 PVLRP-GILGNLEPK-EPEPQGVVGGPGEGGKPFELGPDYKDAVKASIKEFGFNMVASDM 178
Query: 58 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
IS DRTI DLR EECKYW+Y +L +SV++VFHNEG+S+L+RT+HS+IKRTP QYL EI
Sbjct: 179 ISMDRTINDLRHEECKYWNYDENLLTSSVVIVFHNEGWSTLVRTIHSVIKRTPRQYLAEI 238
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDA 176
+++DDFS+K L +L++Y++++NG V++ RN REGLI+ RS GA++++ G+V+++LDA
Sbjct: 239 VMIDDFSNKEHLKGRLDEYLKQWNGLVKVFRNERREGLIQARSIGAEKAKLGQVLIYLDA 298
Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID--YQTWEFRSVYEPDHHYRGIFEWGMLY 234
HCEVG+NW PL+API DR TVP+ID ID T E + + D RG ++W ML+
Sbjct: 299 HCEVGINWYAPLIAPIAKDRTACTVPLIDYIDGNLYTIEPQQGGDEDGFARGAWDWSMLW 358
Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
K L RE KRK+ +EPY SP AGGLFA++R +F ELG YDPGL +WGGENFE+S+K
Sbjct: 359 KRIPLTVREKAKRKHKTEPYWSPAMAGGLFAIERDYFFELGLYDPGLQIWGGENFEISYK 418
Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
IW CGG + + PCSR+GH+YR + V NY RV+E W+DE +K YF
Sbjct: 419 IWQCGGKLLFTPCSRVGHIYR-LHGWQGNPTPVYVGASPTLKNYIRVVEVWWDE-YKDYF 476
Query: 355 YTREPLAMFLDMGDI 369
Y P L GDI
Sbjct: 477 YASRPETKALPYGDI 491
>gi|313242250|emb|CBY34413.1| unnamed protein product [Oikopleura dioica]
Length = 644
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 176/370 (47%), Positives = 248/370 (67%), Gaps = 10/370 (2%)
Query: 9 KLGNLEPPLEPYKEGPGEGG----KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 64
++GN EP GPGEGG K E + DA + EYG NM S+ IS DR
Sbjct: 81 EIGNYEPKDWKVPAGPGEGGVEPLKLDDSTEMQKKQKDA-INEYGFNMVASDAISLDRYP 139
Query: 65 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
DLR EECK++ YP LP +SVI VFHNEG+S+L+R++HS+I TP + LEE++L+DD S
Sbjct: 140 ADLRHEECKHYQYPESLPASSVIFVFHNEGWSTLVRSIHSVINYTPPELLEEVVLIDDGS 199
Query: 125 SKADLDQ-KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLN 183
+K + +LE++I+++NG V+L RN REGLIR RS GA+++ G V+++LDAHCEV N
Sbjct: 200 NKEHITGGRLEEHIKQWNGLVKLYRNDRREGLIRARSIGARKAVGSVLIYLDAHCEVEPN 259
Query: 184 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPE 241
W+ PL+ P+ D +I TVP++D ID T+ F ++ + ++ RG ++W +L+K L +
Sbjct: 260 WIVPLVEPMVHDYRICTVPMVDAIDGATYVFTPQAGGDENNFARGAWDWDLLWKRIPLND 319
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
RE ++++ + PY+SP AGGLFA+ R FF ELG YD GL +WGGENFE+S+KIWMC G
Sbjct: 320 RERARQEHMTSPYRSPAMAGGLFAISRKFFFELGLYDEGLDIWGGENFEISYKIWMCHGQ 379
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+ +VPCSR+GH+YR + VKG + NY RV+E W+DE K YFY R+P A
Sbjct: 380 MLFVPCSRVGHIYR-MKGWRGNGTPSYVKGNFVDRNYVRVVEVWWDEYSK-YFYERKPNA 437
Query: 362 MFLDMGDISE 371
+D GD++E
Sbjct: 438 KHVDPGDLTE 447
>gi|313230492|emb|CBY18708.1| unnamed protein product [Oikopleura dioica]
Length = 644
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 176/370 (47%), Positives = 248/370 (67%), Gaps = 10/370 (2%)
Query: 9 KLGNLEPPLEPYKEGPGEGG----KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 64
++GN EP GPGEGG K E + DA + EYG NM S+ IS DR
Sbjct: 81 EIGNYEPKDWKVPAGPGEGGVEPLKLDDSTEMQKKQKDA-INEYGFNMVASDAISLDRYP 139
Query: 65 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
DLR EECK++ YP LP +SVI VFHNEG+S+L+R++HS+I TP + LEE++L+DD S
Sbjct: 140 ADLRHEECKHYQYPESLPASSVIFVFHNEGWSTLVRSIHSVINYTPPELLEEVVLIDDGS 199
Query: 125 SKADLDQ-KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLN 183
+K + +LE++I+++NG V+L RN REGLIR RS GA+++ G V+++LDAHCEV N
Sbjct: 200 NKEHITGGRLEEHIKQWNGLVKLYRNDRREGLIRARSIGARKAVGSVLIYLDAHCEVEPN 259
Query: 184 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPE 241
W+ PL+ P+ D +I TVP++D ID T+ F ++ + ++ RG ++W +L+K L +
Sbjct: 260 WIVPLVEPMVHDYRICTVPMVDAIDGATYVFTPQAGGDENNFARGAWDWDLLWKRIPLND 319
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
RE ++++ + PY+SP AGGLFA+ R FF ELG YD GL +WGGENFE+S+KIWMC G
Sbjct: 320 RERARQEHMTSPYRSPAMAGGLFAISRKFFFELGLYDEGLDIWGGENFEISYKIWMCHGQ 379
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+ +VPCSR+GH+YR + VKG + NY RV+E W+DE K YFY R+P A
Sbjct: 380 MLFVPCSRVGHIYR-MKGWRGNGTPSYVKGNFVDRNYVRVVEVWWDEYSK-YFYERKPNA 437
Query: 362 MFLDMGDISE 371
+D GD++E
Sbjct: 438 KHVDPGDLTE 447
>gi|313230491|emb|CBY18707.1| unnamed protein product [Oikopleura dioica]
Length = 510
Score = 339 bits (869), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 176/370 (47%), Positives = 248/370 (67%), Gaps = 10/370 (2%)
Query: 9 KLGNLEPPLEPYKEGPGEGG----KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 64
++GN EP GPGEGG K E + DA + EYG NM S+ IS DR
Sbjct: 81 EIGNYEPKDWKVPAGPGEGGVEPLKLDDSTEMQKKQKDA-INEYGFNMVASDAISLDRYP 139
Query: 65 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
DLR EECK++ YP LP +SVI VFHNEG+S+L+R++HS+I TP + LEE++L+DD S
Sbjct: 140 ADLRHEECKHYQYPESLPASSVIFVFHNEGWSTLVRSIHSVINYTPPELLEEVVLIDDGS 199
Query: 125 SKADLDQ-KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLN 183
+K + +LE++I+++NG V+L RN REGLIR RS GA+++ G V+++LDAHCEV N
Sbjct: 200 NKEHITGGRLEEHIKQWNGLVKLYRNDRREGLIRARSIGARKAVGSVLIYLDAHCEVEPN 259
Query: 184 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPE 241
W+ PL+ P+ D +I TVP++D ID T+ F ++ + ++ RG ++W +L+K L +
Sbjct: 260 WIVPLVEPMVHDYRICTVPMVDAIDGATYVFTPQAGGDENNFARGAWDWDLLWKRIPLND 319
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
RE ++++ + PY+SP AGGLFA+ R FF ELG YD GL +WGGENFE+S+KIWMC G
Sbjct: 320 RERARQEHMTSPYRSPAMAGGLFAISRKFFFELGLYDEGLDIWGGENFEISYKIWMCHGQ 379
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+ +VPCSR+GH+YR + VKG + NY RV+E W+DE K YFY R+P A
Sbjct: 380 MLFVPCSRVGHIYR-MKGWRGNGTPSYVKGNFVDRNYVRVVEVWWDEYSK-YFYERKPNA 437
Query: 362 MFLDMGDISE 371
+D GD++E
Sbjct: 438 KHVDPGDLTE 447
>gi|156353877|ref|XP_001623135.1| predicted protein [Nematostella vectensis]
gi|156209801|gb|EDO31035.1| predicted protein [Nematostella vectensis]
Length = 454
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 163/350 (46%), Positives = 229/350 (65%), Gaps = 11/350 (3%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
GPGE G+ + DA+ E+G N S+ IS +RTI D R + CK YP++LP
Sbjct: 1 GPGENGEPVETKAEDESKKDAAYSEFGFNQFVSDQISLERTISDTRHQACKQRSYPINLP 60
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
KASV++VFHNEG+S+LMRTVH+++ R+P L+EI++VDDFS+K L QKL+DY ++ G
Sbjct: 61 KASVVIVFHNEGWSTLMRTVHTVLLRSPPHMLQEIVMVDDFSNKDFLKQKLDDYTKKL-G 119
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
K++++R ER GLI+ R GA + GEV++FLDAHCE WLPPLL I +R+ P
Sbjct: 120 KIKIVRTKERVGLIKARVIGANNAVGEVVIFLDAHCECNKGWLPPLLERIALNRRTAVCP 179
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
ID ID++T++++ + D + RG F W YKE + E KR+ ++ KSP AGG
Sbjct: 180 TIDFIDHKTFQYKPM---DPYIRGTFNWRFDYKERAVRPEEMAKRRDPTQEVKSPVMAGG 236
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA++R FF ELG YDPG+ +WGGE +E+SFK+W CGG +E +PCSR+GHVYR +PY +
Sbjct: 237 LFAINREFFSELGQYDPGMFIWGGEQYEISFKLWQCGGQLENIPCSRVGHVYRHHVPYTY 296
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
K N++RV E W DE +K + Y + P +D GDIS++
Sbjct: 297 P------KHDATLVNFRRVAEVWMDE-YKDWLYDKRPEIKSVDYGDISDR 339
>gi|313227738|emb|CBY22887.1| unnamed protein product [Oikopleura dioica]
Length = 1030
Score = 335 bits (860), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 169/352 (48%), Positives = 232/352 (65%), Gaps = 8/352 (2%)
Query: 25 GEGG-KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
GEGG L + A+LG +G NM S+ ++ DR DLRMEECK WDYP LP
Sbjct: 493 GEGGLSPIRLTSEDQTKVTAALGLWGFNMVASDKVNMDRVPADLRMEECKRWDYPDKLPA 552
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ-KLEDYIQRFNG 142
SVILVFHNEGFS+L+RTVHSI+ +P + L E++++DD S++ + ++ YI+R++G
Sbjct: 553 VSVILVFHNEGFSTLLRTVHSIVNYSPPEMLHEVVMLDDGSTREYITNGTIDRYIERWDG 612
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
V++ N +REGLIR R+ G K S G V+VFLDAHCEV NWLPPL+ PI + K+ ++P
Sbjct: 613 LVKIFHNEKREGLIRARTIGGKHSTGSVLVFLDAHCEVEPNWLPPLITPIAKNYKVSSLP 672
Query: 203 VIDGIDYQTWEFRSVYEPDHH--YRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
+ID ID T+ F D + RG ++W +K L +RE +R +EP++SP A
Sbjct: 673 MIDAIDGNTYVFEPQQGGDENNLARGAWDWNFDWKRIPLNQREKARRATITEPFRSPAMA 732
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP- 319
GGLFA+ R +F ELG YD L +WGGENFELS+K+W CGG + +VPCSR+GH+YR MP
Sbjct: 733 GGLFAISRKWFTELGWYDDKLEIWGGENFELSYKLWQCGGELLFVPCSRVGHIYR--MPG 790
Query: 320 YNFGKLADRVKGP-LITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
+ D +KG I NY RVIETW+D+ +K Y+Y R P +D+GD++
Sbjct: 791 WGGNGTPDELKGKNFIAVNYNRVIETWWDDNYKKYYYERRPENKNVDVGDLT 842
>gi|402586829|gb|EJW80766.1| glycosyltransferase [Wuchereria bancrofti]
Length = 409
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 162/255 (63%), Positives = 199/255 (78%), Gaps = 5/255 (1%)
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 178
+VDDFS K L +L+ Y+++F+GKV+L+RN EREGLIRTRS GAKE+ G+V++FLDAHC
Sbjct: 1 MVDDFSDKEHLKDRLDVYLKQFDGKVKLVRNAEREGLIRTRSIGAKEAVGDVVIFLDAHC 60
Query: 179 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKEN 237
EV +NWLPPLLAPI +RKIMTVPVIDGID W +R VY D HYRGIFEWG+LYKE
Sbjct: 61 EVNVNWLPPLLAPIRQNRKIMTVPVIDGIDKNDWSYRIVYGSVDKHYRGIFEWGLLYKET 120
Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
EL +E +RK+NSEP++SPTHAGGLFA+++ +F ELG YDPGL +WGGE +ELSFKIW
Sbjct: 121 ELSSQELLRRKHNSEPFRSPTHAGGLFAINKKWFEELGYYDPGLQIWGGEQYELSFKIWQ 180
Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
CGG I +VPCS +GHVYRS MPY FGKL+ + P+I+ N RVI+TW DE K Y+Y R
Sbjct: 181 CGGGILFVPCSHVGHVYRSHMPYGFGKLSGK---PVISTNMLRVIKTWMDEYDK-YYYIR 236
Query: 358 EPLAMFLDMGDISEQ 372
EP A G+IS Q
Sbjct: 237 EPSAKHRLPGNISSQ 251
>gi|260812139|ref|XP_002600778.1| hypothetical protein BRAFLDRAFT_127524 [Branchiostoma floridae]
gi|229286068|gb|EEN56790.1| hypothetical protein BRAFLDRAFT_127524 [Branchiostoma floridae]
Length = 561
Score = 333 bits (855), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 169/352 (48%), Positives = 226/352 (64%), Gaps = 13/352 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ GPGE GK L R G + E G N++ SN IS DR IPD+R C Y D
Sbjct: 40 RTGPGEQGKPADLTAEER--GPHAYEECGFNIKASNKISLDRAIPDIRHPNCASKKYVRD 97
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP S+++ FHNEG+++L+RTVHS++ R+P Q + EIILVDDFS ++ L + LEDY+ +
Sbjct: 98 LPDVSLVIPFHNEGWTTLLRTVHSVLNRSPEQLIHEIILVDDFSDRSHLGKDLEDYVAKL 157
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
+ KVR++R +REGLIRTR GA+ ++G+V++FLD+HCE +NWLPPLL PI ++K +
Sbjct: 158 SPKVRVVRTKQREGLIRTRLLGAQVAKGQVLIFLDSHCEANVNWLPPLLEPIALNKKTIV 217
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P ID ID + + + + RG F+W M YK +P+ K S+P++SP A
Sbjct: 218 CPNIDVIDKDDFHYET--QAGDAMRGAFDWEMYYKRIPIPDE--IKNPDPSDPFESPVMA 273
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+DR +F ELGGYDPGL +WGGE +ELSFK+W CGG + PCSR+GHVYR F+PY
Sbjct: 274 GGLFAVDREYFEELGGYDPGLDIWGGEQYELSFKVWQCGGRMVDAPCSRVGHVYRKFVPY 333
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE +K + Y R P DMGDIS Q
Sbjct: 334 KVP------AGVNLGKNLKRVAEVWMDE-YKEHLYKRRPHLRKTDMGDISGQ 378
>gi|261244898|ref|NP_778197.2| polypeptide N-acetylgalactosaminyltransferase-like 6 [Mus musculus]
gi|311103009|gb|ADP69005.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 20 [Mus musculus]
Length = 601
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 174/352 (49%), Positives = 228/352 (64%), Gaps = 14/352 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GK Y L E R D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 81 RSGKGEHGKPYPLTEEDR--DDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER 138
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLEDY+ RF
Sbjct: 139 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEDYMARF 198
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K +
Sbjct: 199 S-KVRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIV 257
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP A
Sbjct: 258 CPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESPVMA 313
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR ++PY
Sbjct: 314 GGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKYVPY 373
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 374 KVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418
>gi|348566779|ref|XP_003469179.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like [Cavia porcellus]
Length = 601
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 173/355 (48%), Positives = 230/355 (64%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E + D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDSDDSAYRENGFNIFVSNNIALERSLPDIRHTNCKHKMY 135
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L +KLE+Y+
Sbjct: 136 LETLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKEKLEEYV 195
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRILRTRKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418
>gi|313226886|emb|CBY22031.1| unnamed protein product [Oikopleura dioica]
Length = 685
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 185/380 (48%), Positives = 243/380 (63%), Gaps = 12/380 (3%)
Query: 2 PVFKADGKLGNLE--PPL-EPYKEGPGEGGKAYH-LPEAYRAAGDASLGEYGMNMETSNH 57
P +++DGK GN E P + E +GPGE G A H LPE + + +G N+ S+
Sbjct: 144 PFYRSDGKPGNWEDRPHVDESGHDGPGEHGAAVHTLPEEEEQVKEI-IKTFGFNLVNSDK 202
Query: 58 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
IS DR DLR +EC DYP LP SV++VFHNEG+ L+RT HS++ RTP + L EI
Sbjct: 203 ISMDRLPKDLRDKECINIDYPEKLPMVSVVVVFHNEGWGPLVRTFHSVVNRTPPELLGEI 262
Query: 118 ILVDDFS---SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
+++DD S K L LE+YI+R++GKV+L RN REGLIR RS GA+ + EV+VFL
Sbjct: 263 VIIDDGSVIKDKPHLGDPLEEYIKRWDGKVKLYRNARREGLIRARSIGAQHAIFEVLVFL 322
Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR--GIFEWGM 232
DAHCE G NWLPPL+API + +I TVP+ID ID Q + F DH+ R G +EW
Sbjct: 323 DAHCEAGYNWLPPLIAPIARNDRISTVPLIDSIDGQRYTFSGQAGGDHNGRAQGGWEWNF 382
Query: 233 LYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELS 292
L+K LP++EA+K + +E Y SP AGGLFA++R F +G YDPGL +WGGE +E+S
Sbjct: 383 LWKRYPLPKKEAEKLSHGTEMYPSPAMAGGLFAINREHFNNVGMYDPGLEIWGGEQYEIS 442
Query: 293 FKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKA 352
+K+WMCGG + +VPCSR+GHVYR + + V NY+RVIE W+D+ K
Sbjct: 443 YKLWMCGGGVYFVPCSRVGHVYR-LEGWGGNPPPEYVPSNPSFRNYRRVIEVWWDDWTK- 500
Query: 353 YFYTREPLAMFLDMGDISEQ 372
YFY P L GDISEQ
Sbjct: 501 YFYWNRPELQKLPYGDISEQ 520
>gi|344288241|ref|XP_003415859.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Loxodonta africana]
Length = 601
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 174/355 (49%), Positives = 228/355 (64%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EALRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLEDY+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEDYM 195
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418
>gi|354484373|ref|XP_003504363.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like, partial [Cricetulus griseus]
Length = 555
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 173/352 (49%), Positives = 229/352 (65%), Gaps = 14/352 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 35 RSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER 92
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLEDY+ RF
Sbjct: 93 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIVEIILVDDFSDREHLKDKLEDYMARF 152
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K +
Sbjct: 153 S-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIV 211
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P+ID ID+ + + + + RG F+W M YK+ +P +R S+P++SP A
Sbjct: 212 CPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKKIPIPPE--LQRADPSDPFESPVMA 267
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR ++PY
Sbjct: 268 GGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKYVPY 327
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G ++ N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 328 KVP------SGTILARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 372
>gi|334331052|ref|XP_001372346.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
partial [Monodelphis domestica]
Length = 573
Score = 330 bits (846), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 173/352 (49%), Positives = 229/352 (65%), Gaps = 14/352 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GK Y L E R D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 53 RSGKGEHGKPYPLTEEDR--DDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMYLEK 110
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+ RF
Sbjct: 111 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPDSLIAEIILVDDFSDREHLKDKLEEYMARF 170
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
+ KVR++R +REGLIRTR GA ++GEV+ FLD+HCEV +NWLPPLL I +RK +
Sbjct: 171 S-KVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDSHCEVNVNWLPPLLNQIALNRKTIV 229
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP A
Sbjct: 230 CPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESPVMA 285
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR ++PY
Sbjct: 286 GGLFAVDRRWFWELGGYDPGLEIWGGEQYEISFKVWMCGGGMFDVPCSRVGHIYRKYVPY 345
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 346 KVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 390
>gi|395840006|ref|XP_003792861.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 1 [Otolemur garnettii]
Length = 601
Score = 330 bits (846), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 173/355 (48%), Positives = 229/355 (64%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E R D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTEEDR--DDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDRDHLKDKLEEYM 195
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ +VR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-QVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LRRADPSDPFESP 310
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418
>gi|395840008|ref|XP_003792862.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 2 [Otolemur garnettii]
Length = 600
Score = 330 bits (846), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 173/355 (48%), Positives = 229/355 (64%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E R D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 77 EAMRSGKGEHGKPYPLTEEDR--DDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 134
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 135 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDRDHLKDKLEEYM 194
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ +VR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 195 ARFS-QVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 253
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 254 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LRRADPSDPFESP 309
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 310 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 369
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 370 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 417
>gi|332217746|ref|XP_003258022.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Nomascus leucogenys]
Length = 601
Score = 330 bits (846), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 173/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYM 195
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMLDVPCSRVGHIYRKY 370
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418
>gi|410956556|ref|XP_003984908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Felis catus]
Length = 601
Score = 330 bits (846), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 173/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEEYM 195
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418
>gi|296195172|ref|XP_002745263.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Callithrix jacchus]
Length = 601
Score = 330 bits (845), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 173/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYM 195
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418
>gi|194018457|ref|NP_001030017.2| polypeptide N-acetylgalactosaminyltransferase-like 6 [Homo sapiens]
gi|296434516|sp|Q49A17.2|GLTL6_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase-like 6;
AltName: Full=Polypeptide GalNAc transferase 17;
Short=GalNAc-T17; Short=pp-GaNTase 17; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 17;
AltName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase 17; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 17
gi|311103007|gb|ADP69004.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 20 [Homo sapiens]
Length = 601
Score = 330 bits (845), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 173/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYM 195
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418
>gi|403295707|ref|XP_003938772.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Saimiri boliviensis boliviensis]
Length = 601
Score = 330 bits (845), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 173/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYM 195
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418
>gi|86475571|emb|CAF25036.1| pp-GalNAc-transferase 17 [Homo sapiens]
Length = 584
Score = 330 bits (845), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 173/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 61 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 118
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 119 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYM 178
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 179 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 237
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 238 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 293
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 294 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 353
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 354 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 401
>gi|109076171|ref|XP_001084788.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 isoform 1
[Macaca mulatta]
gi|355687723|gb|EHH26307.1| hypothetical protein EGK_16237 [Macaca mulatta]
Length = 601
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 173/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYM 195
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418
>gi|109076173|ref|XP_001084905.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 isoform 2
[Macaca mulatta]
Length = 584
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 173/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 61 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 118
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 119 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYM 178
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 179 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 237
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 238 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 293
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 294 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 353
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 354 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 401
>gi|300796651|ref|NP_001178227.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Bos taurus]
Length = 601
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 172/355 (48%), Positives = 229/355 (64%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L +KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKEKLEEYM 195
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GD+S Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDLSAQ 418
>gi|149698080|ref|XP_001498934.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Equus
caballus]
Length = 601
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 173/355 (48%), Positives = 227/355 (63%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEEYM 195
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKRREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418
>gi|426220611|ref|XP_004004508.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Ovis aries]
Length = 601
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 172/355 (48%), Positives = 229/355 (64%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L +KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKEKLEEYM 195
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GD+S Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDLSAQ 418
>gi|345790655|ref|XP_543189.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Canis lupus
familiaris]
Length = 601
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 173/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E R D++ E G N+ SN I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTEEDR--DDSAYRENGFNIFVSNSIALERSLPDIRHANCKHKMY 135
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEEYM 195
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418
>gi|402870847|ref|XP_003899411.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 1 [Papio anubis]
Length = 601
Score = 329 bits (843), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 173/355 (48%), Positives = 227/355 (63%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E D++ E G N+ SN I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNSIALERSLPDIRHANCKHKMY 135
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYM 195
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418
>gi|402870849|ref|XP_003899412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 2 [Papio anubis]
Length = 584
Score = 329 bits (843), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 173/355 (48%), Positives = 227/355 (63%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E D++ E G N+ SN I+ +R++PD+R CK+ Y
Sbjct: 61 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNSIALERSLPDIRHANCKHKMY 118
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 119 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYM 178
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 179 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 237
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 238 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 293
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 294 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 353
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 354 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 401
>gi|114596861|ref|XP_001155128.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 isoform 1 [Pan
troglodytes]
Length = 601
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 172/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYM 195
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418
>gi|426346013|ref|XP_004040685.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
partial [Gorilla gorilla gorilla]
Length = 555
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 172/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 32 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 89
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 90 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYM 149
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 150 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 208
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 209 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 264
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 265 VMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 324
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 325 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 372
>gi|351699379|gb|EHB02298.1| Polypeptide N-acetylgalactosaminyltransferase-like 6, partial
[Heterocephalus glaber]
Length = 522
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 172/350 (49%), Positives = 226/350 (64%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y LP
Sbjct: 1 GKGEHGKPYPLTE--EDGDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMYLERLP 58
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+ RF+
Sbjct: 59 NTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEEYMARFS- 117
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K + P
Sbjct: 118 KVRILRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIVCP 177
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ + + + + RG F+W M YK +P +R S+P++SP AGG
Sbjct: 178 MIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESPVMAGG 233
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR ++PY
Sbjct: 234 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKYVPYKV 293
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 294 P------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 336
>gi|209364560|ref|NP_001129228.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Rattus
norvegicus]
Length = 601
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 173/352 (49%), Positives = 225/352 (63%), Gaps = 14/352 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 81 RSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER 138
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLEDY+ RF
Sbjct: 139 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEDYMARF 198
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
VR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K +
Sbjct: 199 -PIVRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIV 257
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P+ID ID+ + + + + RG F+W M YK +P +R SEP++SP A
Sbjct: 258 CPMIDVIDHSHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSEPFESPVMA 313
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR ++PY
Sbjct: 314 GGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKYVPY 373
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 374 KVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418
>gi|224049734|ref|XP_002187605.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 17 isoform
1 [Taeniopygia guttata]
gi|449500484|ref|XP_004176221.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 17 isoform
2 [Taeniopygia guttata]
Length = 601
Score = 327 bits (838), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 171/352 (48%), Positives = 227/352 (64%), Gaps = 14/352 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 81 RSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHPNCKHKVYLEA 138
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L +KLE+Y+ RF
Sbjct: 139 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPDSLIAEIILVDDFSDREHLKEKLEEYMLRF 198
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K +
Sbjct: 199 -AKVRIVRTKKREGLIRTRLLGASLARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIV 257
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP A
Sbjct: 258 CPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESPVMA 313
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR ++PY
Sbjct: 314 GGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGGMYDVPCSRVGHIYRKYVPY 373
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 374 KVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418
>gi|118090108|ref|XP_420520.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Gallus gallus]
Length = 601
Score = 327 bits (838), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 171/352 (48%), Positives = 227/352 (64%), Gaps = 14/352 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 81 RSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHPNCKHKVYLEK 138
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L +KLE+Y+ RF
Sbjct: 139 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPDSLIAEIILVDDFSDREHLKEKLEEYMVRF 198
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K +
Sbjct: 199 -AKVRIVRTKKREGLIRTRLLGASLARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIV 257
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP A
Sbjct: 258 CPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESPVMA 313
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR ++PY
Sbjct: 314 GGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGGMFDVPCSRVGHIYRKYVPY 373
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 374 KVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418
>gi|71297071|gb|AAH47551.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Homo sapiens]
Length = 601
Score = 326 bits (835), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 172/355 (48%), Positives = 227/355 (63%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSRKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYM 195
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIPLNHK 254
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418
>gi|324510655|gb|ADY44456.1| N-acetylgalactosaminyltransferase 9 [Ascaris suum]
Length = 577
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 159/364 (43%), Positives = 232/364 (63%), Gaps = 5/364 (1%)
Query: 9 KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
K+ + P + GPGE G A HL + G+A + ++ MN+ S+ +S DR+IPD R
Sbjct: 59 KMRHKRPDYSKQRSGPGENGAAVHLSGKEKEKGEADMKKWFMNVVASDKLSMDRSIPDTR 118
Query: 69 MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
EC+ Y DLP ASV+++F +E ++ L+RTVHS++ R+P L E+IL+DDFS + +
Sbjct: 119 HAECRSVHYDDDLPSASVVIIFTDEAWTPLLRTVHSVVNRSPLHLLHEVILLDDFSQREE 178
Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
L KL++YI+RF G V+LIR ER GLIR + GA E+ GEVIVFLD+HCE WL PL
Sbjct: 179 LKGKLDEYIKRFGGIVKLIRKKERHGLIRAKLAGAHEATGEVIVFLDSHCEANEGWLEPL 238
Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
LA I R + P+ID I +T ++ + + + G F W + ++ + + + E +RK
Sbjct: 239 LARIKEKRTAVLCPIIDYISAETMQYSG--DANVNAVGGFWWSLHFRWDSIGKAERDRRK 296
Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
EP +SPT AGGL A +R +FLE+GGYDPG+ +WGGEN E+SF++WMCGGSIE++PCS
Sbjct: 297 SAIEPVRSPTMAGGLLAANREYFLEVGGYDPGMDIWGGENLEISFRVWMCGGSIEFIPCS 356
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
+GH++R+ PYN + + N KR+ E W D+ +K +Y P D+GD
Sbjct: 357 HVGHIFRAGHPYNMTGPGGNLD--VHGTNSKRLAEVWMDD-YKRLYYLHRPDLKTKDVGD 413
Query: 369 ISEQ 372
+SE+
Sbjct: 414 LSER 417
>gi|291385920|ref|XP_002709516.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Oryctolagus cuniculus]
Length = 601
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 171/355 (48%), Positives = 227/355 (63%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYKENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDRDHLKDKLEEYM 195
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
RF+ KVR++R +REGLIRTR GA + GEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMAGGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLF++DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR +
Sbjct: 311 VMAGGLFSVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418
>gi|47221376|emb|CAF97294.1| unnamed protein product [Tetraodon nigroviridis]
Length = 675
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 169/352 (48%), Positives = 224/352 (63%), Gaps = 14/352 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GK + + +A R D + E G N+ S+ IS +R+IPD+R CK Y
Sbjct: 158 RSGNGEQGKPFPMTDADRV--DQAYRENGFNIYVSDRISLNRSIPDIRHPNCKQKLYAEK 215
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP SVI+ FHNEG+SSL+RTVHS++ R+P Q + EIILVDDFS + L Q LE+Y+ R
Sbjct: 216 LPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEIILVDDFSDREHLKQPLEEYMVRL 275
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NWLPPLL I +RK +
Sbjct: 276 -PKVRILRTKKREGLIRTRLLGATAAKGEVITFLDSHCEANVNWLPPLLDRIAQNRKTIV 334
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P+ID ID+ + + + + RG F+W M YK +P K+ SEP++SP A
Sbjct: 335 CPMIDVIDHDNFGYET--QAGDAMRGAFDWEMYYKRIPIPPELQKEDP--SEPFESPVMA 390
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 391 GGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGCMEDIPCSRVGHIYRKYVPY 450
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD + Q
Sbjct: 451 KVP------GGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDTAAQ 495
>gi|397506054|ref|XP_003823551.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
partial [Pan paniscus]
Length = 518
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 170/348 (48%), Positives = 225/348 (64%), Gaps = 14/348 (4%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y LP
Sbjct: 2 GEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMYLERLPNT 59
Query: 85 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 144
S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+ RF+ KV
Sbjct: 60 SIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYMARFS-KV 118
Query: 145 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 204
R++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K + P+I
Sbjct: 119 RIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIVCPMI 178
Query: 205 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLF 264
D ID+ + + + + RG F+W M YK +P +R S+P++SP AGGLF
Sbjct: 179 DVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESPVMAGGLF 234
Query: 265 AMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGK 324
A++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR ++PY
Sbjct: 235 AVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKYVPYKVP- 293
Query: 325 LADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 294 -----SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 335
>gi|432901709|ref|XP_004076908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Oryzias latipes]
Length = 677
Score = 324 bits (831), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 169/352 (48%), Positives = 224/352 (63%), Gaps = 14/352 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GKA+ L +A R D + E G N+ S+ IS +R++PD+R CK Y
Sbjct: 158 RSGNGEQGKAFPLTDADRV--DQAYRENGFNIFVSDRISLNRSVPDIRHPNCKQKLYAER 215
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP S+I+ FHNEG+SSL+RTVHS++ R+P Q + EIILVDDFS K L LE+Y+ R
Sbjct: 216 LPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEIILVDDFSDKDHLKGALEEYMVRL 275
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NWLPPLL I +RK +
Sbjct: 276 P-KVRILRTKKREGLIRTRLLGAAAAKGEVITFLDSHCEANINWLPPLLDRIALNRKTIV 334
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P+ID ID+ + + + + RG F+W M YK +P K SEP++SP A
Sbjct: 335 CPMIDVIDHDNFGYET--QAGDAMRGAFDWEMYYKRIPIPAELQKNDP--SEPFESPVMA 390
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 391 GGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPY 450
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 451 KV------PGGVSLARNLKRVAEVWMDE-YAEYVYQRRPEYRHLSAGDVAAQ 495
>gi|301607546|ref|XP_002933365.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like isoform 1 [Xenopus (Silurana) tropicalis]
Length = 600
Score = 324 bits (830), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 171/355 (48%), Positives = 225/355 (63%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E D E G N+ SN I+ R++PD+R CK+ Y
Sbjct: 77 EALRSGKGEHGKPYPLTE--EEQDDTVYRENGFNIFVSNKIALARSLPDIRHPNCKHKLY 134
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HS+I RTP +EE+ILVDDFS + L +KLE+Y+
Sbjct: 135 LERLPNTSIIIPFHNEGWTSLLRTIHSVINRTPDSLIEEMILVDDFSDREHLREKLEEYM 194
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
+ KVR++R +REGLIRTR GA ++GEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 195 AYY-PKVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 253
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R SEP++SP
Sbjct: 254 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRTDPSEPFESP 309
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +ELSFK+WMCGG + VPCSR+GH+YR +
Sbjct: 310 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYELSFKVWMCGGEMFDVPCSRVGHIYRKY 369
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE + Y Y R P L GDIS Q
Sbjct: 370 VPYKVP------TGTSLARNLKRVAETWMDE-YAEYIYQRRPEYRHLSTGDISSQ 417
>gi|301607548|ref|XP_002933366.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like isoform 2 [Xenopus (Silurana) tropicalis]
Length = 601
Score = 323 bits (829), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 171/355 (48%), Positives = 225/355 (63%), Gaps = 14/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK Y L E D E G N+ SN I+ R++PD+R CK+ Y
Sbjct: 78 EALRSGKGEHGKPYPLTE--EEQDDTVYRENGFNIFVSNKIALARSLPDIRHPNCKHKLY 135
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+ FHNEG++SL+RT+HS+I RTP +EE+ILVDDFS + L +KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSVINRTPDSLIEEMILVDDFSDREHLREKLEEYM 195
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
+ KVR++R +REGLIRTR GA ++GEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 AYY-PKVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID+ + + + + RG F+W M YK +P +R SEP++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRTDPSEPFESP 310
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+DR +F ELGGYDPGL +WGGE +ELSFK+WMCGG + VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYELSFKVWMCGGEMFDVPCSRVGHIYRKY 370
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+PY G + N KRV ETW DE + Y Y R P L GDIS Q
Sbjct: 371 VPYKVP------TGTSLARNLKRVAETWMDE-YAEYIYQRRPEYRHLSTGDISSQ 418
>gi|432901498|ref|XP_004076865.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Oryzias latipes]
Length = 607
Score = 323 bits (829), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 170/354 (48%), Positives = 231/354 (65%), Gaps = 18/354 (5%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GK + + E R D + E G N+ SN IS +R++PD+R E C+ Y
Sbjct: 88 RAGNGEQGKPFPVTETDRV--DQAYRENGFNIYVSNRISLNRSLPDIRHENCRQKLYAEK 145
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP ++I+ FHNEG+SSL+RTVHS+I R+P + + EIILVDDFS K L LE+Y++RF
Sbjct: 146 LPNTTIIIPFHNEGWSSLLRTVHSVINRSPPRLVAEIILVDDFSDKEHLKVALEEYMKRF 205
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NWLPPLL I +RK +
Sbjct: 206 P-KVRILRTKKREGLIRTRLLGAGAAKGEVITFLDSHCEANVNWLPPLLDRIVQNRKTIV 264
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
P+ID ID+ + + + + RG F+W M YK +P A+ R + +EP++SP
Sbjct: 265 CPMIDVIDHDNFGYDT--QAGDAMRGAFDWEMYYKRIPIP---AEMRTDDPTEPFESPVM 319
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++P
Sbjct: 320 AGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVP 379
Query: 320 YNFGKLADRVKGPL-ITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
Y +V G + + N KRV E W DE + Y Y R P L GD+S Q
Sbjct: 380 Y-------KVPGGISLAKNLKRVAEVWMDE-YAEYVYQRRPEYRHLSAGDMSAQ 425
>gi|327278031|ref|XP_003223766.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like [Anolis carolinensis]
Length = 602
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 168/352 (47%), Positives = 227/352 (64%), Gaps = 14/352 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 81 RSGKGEQGKPYPLTE--EDNDDSAYRENGFNIFVSNNIALERSLPDIRHPNCKHKVYLEK 138
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L +KLE+Y+ RF
Sbjct: 139 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPNSLIAEIILVDDFSDREHLKEKLEEYMARF 198
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
KVR++R +REGLIRTR GA ++GEV+ FLD+HCEV +NWLPPLL I + K +
Sbjct: 199 -VKVRIVRTKKREGLIRTRLLGASIAKGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIV 257
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP A
Sbjct: 258 CPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRTDPSDPFESPVMA 313
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA++R +F +LGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR ++PY
Sbjct: 314 GGLFAVNRKWFWDLGGYDPGLEIWGGEQYEISFKVWMCGGGMFDVPCSRVGHIYRKYVPY 373
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV ETW DE Y Y R P L GD+S Q
Sbjct: 374 KVP------SGTSLARNLKRVAETWMDE-FAEYVYQRRPEYRHLSTGDLSAQ 418
>gi|402593617|gb|EJW87544.1| glycosyltransferase [Wuchereria bancrofti]
Length = 520
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 166/361 (45%), Positives = 235/361 (65%), Gaps = 13/361 (3%)
Query: 15 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
P + GPGEGG +L + G+A + ++ MN+ S+ IS DR++PD R ++C+
Sbjct: 6 PDYSKKRIGPGEGGTGVYLTGKQKVQGEADMKKWFMNVVASDLISLDRSLPDRRHKQCRK 65
Query: 75 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
Y DLP ASV+++F +E +S LMRTVHS+I RTP + L+EIILVDDFS + +L KLE
Sbjct: 66 ISYSDDLPVASVVIIFTDEAWSPLMRTVHSVINRTPLKLLQEIILVDDFSQRDELKGKLE 125
Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
+YI+RF KVRL+R ER+GLIR + GAKE+ G+V+VFLD+HCEVG WL PLLA I
Sbjct: 126 EYIKRFGDKVRLVRAPERQGLIRAKLLGAKEAVGDVLVFLDSHCEVGEGWLEPLLARIKD 185
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
R + P+I+ I +T + + P H G F W + ++ + +P+ + +EP
Sbjct: 186 KRSAVLCPIINHISPETLTYSANDRPAH--VGGFWWSLHFRWDPMPKEYSDADP--TEPI 241
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
+SPT AGGL A+DR +F E+GGYDP + +WGGEN E+SF++WMCGGS+E++PCS +GH++
Sbjct: 242 RSPTMAGGLLAVDRLYFFEVGGYDPEMDIWGGENLEMSFRVWMCGGSVEFIPCSHVGHIF 301
Query: 315 RSFMPYNF---GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
R+ PYN G D V G N KR+ E W D+ K Y+ R L D+GD+SE
Sbjct: 302 RAGHPYNMIGPGNNKD-VHGT----NSKRLAEVWMDDYKKFYYIHRLDLKE-KDVGDLSE 355
Query: 372 Q 372
+
Sbjct: 356 R 356
>gi|410914862|ref|XP_003970906.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Takifugu rubripes]
Length = 600
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 168/353 (47%), Positives = 230/353 (65%), Gaps = 16/353 (4%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GKA+ L ++ R D + E G N+ S+ IS +R++PD+R +CK Y
Sbjct: 81 RTGNGEQGKAFPLTDSDRV--DQAYRENGFNIYISDRISLNRSLPDIRHADCKQKLYAEK 138
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP SVI+ FHNEG+SSL+RTVHS++ R+P Q + E+ILVDDFS K L LE+Y++R
Sbjct: 139 LPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPQLIAELILVDDFSDKEHLKVPLEEYMKRM 198
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NWLPPLL I +RK +
Sbjct: 199 -PKVRILRTKKREGLIRTRLLGASAAKGEVITFLDSHCEANVNWLPPLLDRIAQNRKSIV 257
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P+ID ID+ + + + + RG F+W M YK +P +R S+P++SP A
Sbjct: 258 CPMIDVIDHDNFGYDT--QAGDAMRGAFDWEMYYKRIPIPAE--MQRDDPSQPFESPVMA 313
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 314 GGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPY 373
Query: 321 NFGKLADRVKGPL-ITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+V G + + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 374 -------KVPGGISLAKNLKRVAEVWMDE-YAEYVYQRRPEYRHLSAGDMTPQ 418
>gi|348533009|ref|XP_003453998.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Oreochromis niloticus]
Length = 600
Score = 320 bits (821), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 167/353 (47%), Positives = 229/353 (64%), Gaps = 16/353 (4%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GK + L E R D + E G N+ S+ IS +R++PD+R E C+ Y
Sbjct: 81 RMGNGEQGKPFPLTENDRV--DQAYRENGFNIYVSDRISLNRSLPDIRHENCRQKLYAEK 138
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP S+I+ FHNEG+SSL+RTVHS++ R+P++ + E+ILVDDFS K L LE+Y++R
Sbjct: 139 LPNTSIIIPFHNEGWSSLLRTVHSVLNRSPSRLITEVILVDDFSDKEHLKVALEEYMKRM 198
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NWLPPLL I +RK +
Sbjct: 199 -PKVRILRTKKREGLIRTRLLGAAAAKGEVITFLDSHCEANVNWLPPLLDRIAQNRKAIV 257
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P+ID ID+ + + + + RG F+W M YK +P +R SEP++SP A
Sbjct: 258 CPMIDVIDHDNFGYDT--QAGDAMRGAFDWEMYYKRIPIPPE--MQRDDPSEPFESPVMA 313
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 314 GGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKLWMCGGRMEDIPCSRVGHIYRKYVPY 373
Query: 321 NFGKLADRVKGPL-ITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+V G + + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 374 -------KVPGGISLAKNLKRVAEVWMDE-YAEYVYQRRPEYRHLSAGDMTAQ 418
>gi|170582702|ref|XP_001896248.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158596593|gb|EDP34915.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 520
Score = 320 bits (820), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 164/360 (45%), Positives = 231/360 (64%), Gaps = 11/360 (3%)
Query: 15 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
P + GPGE G +L + G+A + ++ MN+ S+ IS DR++PD R ++C
Sbjct: 6 PDYSKKRIGPGEDGTGVYLTGKQKVQGEADMKKWFMNLVASDLISLDRSLPDHRHKQCHK 65
Query: 75 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
Y DLP ASV+++F +E +S LMRTVHS+I RTP + L+EIILVDDFS + +L +KLE
Sbjct: 66 ISYSDDLPVASVVIIFTDEAWSPLMRTVHSVINRTPLKLLQEIILVDDFSQRDELKEKLE 125
Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
+YI+RF KVRL+R ER+GLIR + GAKE+ G+V+VFLD+HCEVG WL PLLA I
Sbjct: 126 EYIKRFGNKVRLVRALERQGLIRAKLLGAKEAVGDVLVFLDSHCEVGEGWLEPLLARIKD 185
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
R + P+I+ I +T + + P + G F W + + + +P+ +EP
Sbjct: 186 KRSAVLCPIINHISAETLTYSANDRPTN--VGGFSWSLHFLWDPMPKEYFDADP--TEPI 241
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
+SPT AGGL A+DR++F E+GGYDP + +WGGEN E+SF++WMCGGSIE++PCS +GH++
Sbjct: 242 RSPTMAGGLLAVDRSYFFEVGGYDPKMDIWGGENLEMSFRVWMCGGSIEFIPCSHVGHIF 301
Query: 315 RSFMPYNFGKLADR--VKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
R PYN D V G N KR+ E W D+ K Y+ R L D+GD+SE+
Sbjct: 302 RDGHPYNMIGPGDNKDVHGT----NSKRLAEVWMDDYKKFYYIHRLDLK-GKDVGDLSER 356
>gi|410914790|ref|XP_003970870.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Takifugu rubripes]
Length = 552
Score = 320 bits (819), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 167/352 (47%), Positives = 224/352 (63%), Gaps = 14/352 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GKA+ + +A R D + E G N+ S+ IS +R++PD+R CK Y
Sbjct: 33 RSGNGEQGKAFPMTDADRV--DQAYRENGFNIYVSDRISLNRSVPDIRHPNCKQKLYAEK 90
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP SVI+ FHNEG+SSL+RTVHS++ R+P Q + E+ILVDDFS K L L++Y+ R
Sbjct: 91 LPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEVILVDDFSDKEHLKVPLDEYMVRL 150
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NWLPPLL I +RK +
Sbjct: 151 -PKVRILRTKKREGLIRTRLLGAARAKGEVITFLDSHCEANVNWLPPLLDRIAQNRKTIV 209
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P+ID ID+ + + + + RG F+W M YK +P K+ SEP++SP A
Sbjct: 210 CPMIDVIDHDNFGYET--QAGDAMRGAFDWEMYYKRIPIPLELQKEDP--SEPFESPVMA 265
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E PCSR+GH+YR ++PY
Sbjct: 266 GGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDTPCSRVGHIYRKYVPY 325
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 326 KVP------GGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLAAGDMAVQ 370
>gi|296193322|ref|XP_002744461.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Callithrix jacchus]
Length = 667
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 152 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 209
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 210 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 269
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 270 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 328
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 329 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 384
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 385 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 444
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 445 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVTAQ 487
>gi|402873191|ref|XP_003900469.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Papio
anubis]
Length = 637
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 122 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 179
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 180 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 239
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 240 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 298
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 299 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 354
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 355 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 414
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 415 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAAQ 457
>gi|348533011|ref|XP_003453999.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Oreochromis niloticus]
Length = 587
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 165/352 (46%), Positives = 223/352 (63%), Gaps = 14/352 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GK + L +A R D + E G N+ S+ IS +R++PD+R CK+ Y
Sbjct: 68 RSGNGEQGKPFPLTDADRV--DQAYRENGFNIYVSDRISLNRSVPDIRHPNCKHKLYAEK 125
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP ++I+ FHNEG+SSL+RTVHS++ R+P + EIILVDDFS K L LE+Y+ R
Sbjct: 126 LPNTTIIIPFHNEGWSSLLRTVHSVLNRSPPHLIAEIILVDDFSDKEHLKVALEEYMVRL 185
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
KVR++R +REGLIRTR GA ++GEV+ FLD+HCE +NWLPPLL I +RK +
Sbjct: 186 -PKVRILRTKKREGLIRTRLLGAAAAKGEVLTFLDSHCEANVNWLPPLLDRIAQNRKTIV 244
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P+ID ID+ + + + + RG F+W M YK +P K SEP++SP A
Sbjct: 245 CPMIDVIDHDNFGYET--QAGDAMRGAFDWEMYYKRIPIPTELQKDDP--SEPFESPVMA 300
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 301 GGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPY 360
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 361 KVP------GGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDMTVQ 405
>gi|196001847|ref|XP_002110791.1| hypothetical protein TRIADDRAFT_22565 [Trichoplax adhaerens]
gi|190586742|gb|EDV26795.1| hypothetical protein TRIADDRAFT_22565 [Trichoplax adhaerens]
Length = 556
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 164/371 (44%), Positives = 222/371 (59%), Gaps = 17/371 (4%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
RPVF+ P PGE G+ +P+ Y+ + N S+ IS
Sbjct: 39 RPVFQP-------ALPQNHKPAAPGEYGRPVDVPKEYQQLSEELFQRNHFNQWVSDRISL 91
Query: 61 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
RT+PD R E CK YP+DLP SV++VF+NE +S+LMRTVHS++ R+P L E+ILV
Sbjct: 92 QRTLPDPRPEMCKSMTYPVDLPSTSVVIVFYNEAWSTLMRTVHSVLDRSPPDLLHEVILV 151
Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
DD S +L Q LE+Y+++ + KVRL RN++REGLIR R RG +++ ++ FLDAHCEV
Sbjct: 152 DD--SSDELHQPLEEYVRQLD-KVRLHRNSQREGLIRARLRGLEQTSAPIVTFLDAHCEV 208
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
+ WL PLL I+ DR + P ID ID + ++ Y P RG F W + +K + P
Sbjct: 209 TIGWLEPLLNRIHQDRTTVVCPEIDSIDLNNFAYK--YGPSGVLRGTFNWDLSFKWSIAP 266
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
E +R ++P +SPT AGGLFA+DR +FLELG YD GL +WG EN ELSFK+W CGG
Sbjct: 267 TSERLRRTSATDPMRSPTMAGGLFAIDREYFLELGTYDRGLEIWGAENMELSFKVWQCGG 326
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
+E +PCS +GHV+R PY+ + NY+RV E W D+ +K +FY R P
Sbjct: 327 KLEIIPCSHVGHVFREVQPYDTSVSLHSIANK----NYQRVAEVWMDD-YKKFFYQRHPY 381
Query: 361 AMFLDMGDISE 371
GDISE
Sbjct: 382 LTDQSFGDISE 392
>gi|403285674|ref|XP_003934138.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Saimiri boliviensis boliviensis]
Length = 682
Score = 319 bits (817), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 167 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKHYLETLP 224
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 225 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 284
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 285 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 343
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 344 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 399
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 400 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 459
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 460 ------PAGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVTAQ 502
>gi|395504936|ref|XP_003756802.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Sarcophilus harrisii]
Length = 651
Score = 319 bits (817), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 224/350 (64%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE GK Y + +A R D + E G N+ S+ I+ +R++PD+R C Y LP
Sbjct: 133 GNGEQGKPYPITDAERV--DQAYRENGFNIFVSDKIALNRSLPDIRHPNCNSKLYLEKLP 190
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P Q + EI+LVDDFS + L ++LEDY+ +F
Sbjct: 191 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPQLVAEIVLVDDFSDREHLKKRLEDYMAQF-P 249
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I S+RK + P
Sbjct: 250 NVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIASNRKTIVCP 309
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID + +++ + RG F+W M YK +P K S+P++SP AGG
Sbjct: 310 MIDVIDNDHFGYKT--QAGDAMRGAFDWEMYYKRIPIPLELQKSDP--SDPFESPVMAGG 365
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 366 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYIPYKI 425
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 426 P------TGVSLARNLKRVAEVWMDE-YAEYIYQRLPEYRHLSTGDVTAQ 468
>gi|297477445|ref|XP_002689374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Bos
taurus]
gi|296485129|tpg|DAA27244.1| TPA: polypeptide N-acetylgalactosaminyltransferase 10-like [Bos
taurus]
Length = 620
Score = 319 bits (817), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 165/350 (47%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE GK + L A R D + E G N+ S+ IS +R++PD+R CK Y LP
Sbjct: 105 GDGEQGKPFPLTYAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCKSKRYLETLP 162
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 163 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 222
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 223 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 281
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 282 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 337
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 338 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 397
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + + Y R P L GD++ Q
Sbjct: 398 P------AGVSLARNLKRVAEVWMDE-YAEHIYQRRPEYRHLSAGDVTAQ 440
>gi|345799489|ref|XP_546283.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Canis
lupus familiaris]
Length = 603
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 163/350 (46%), Positives = 222/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P++ + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPSELIAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + + Y R P L GD++ Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEHIYQRRPEYRHLSAGDVAAQ 423
>gi|312087698|ref|XP_003145574.1| glycosyl transferase [Loa loa]
gi|307759263|gb|EFO18497.1| glycosyl transferase [Loa loa]
Length = 520
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 162/360 (45%), Positives = 233/360 (64%), Gaps = 11/360 (3%)
Query: 15 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
P + GPGE G +L + G+A + ++ MN+ S+ IS DR++PD R E+C+
Sbjct: 6 PDYSKKRTGPGEDGSGVYLTGKQKVRGEADMKKWFMNLVASDMISLDRSLPDHRHEQCRK 65
Query: 75 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
+YP +LP ASV+++F +E +S LMRTVHS+I RTP + L+EIILVDDFS + DL +LE
Sbjct: 66 INYPDNLPVASVVIIFTDEAWSPLMRTVHSVINRTPFKLLQEIILVDDFSQRDDLKGRLE 125
Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
+YI+RF KVRLIR ER+GLIR + GAKE+ G+V++FLD+HCEV WL PLLA I
Sbjct: 126 EYIKRFGNKVRLIRARERQGLIRAKLLGAKEAIGDVLIFLDSHCEVSEGWLEPLLARIKE 185
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
+R ++ P+ID I +T + + G F W + ++ + LPE ++P
Sbjct: 186 NRSVVLCPIIDHISAETLAYSGSDRLAN--VGGFWWSLHFRWDPLPEEYYGIDP--TKPI 241
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
+SPT AGGLFA+DR +F E+GGYDP + +WGGEN E+SF++WMCGG IE++PCS +GH++
Sbjct: 242 RSPTMAGGLFAVDRLYFFEVGGYDPKMDIWGGENLEISFRVWMCGGGIEFIPCSHVGHIF 301
Query: 315 RSFMPYNFGKLADR--VKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
R+ PYN + V G N KR+ E W D+ + Y+ R L ++GD+SE+
Sbjct: 302 RAGHPYNMTGPGNNEDVHGT----NSKRLAEVWMDDYKRFYYIHRSDLKE-KNVGDLSER 356
>gi|18543347|ref|NP_570098.1| polypeptide N-acetylgalactosaminyltransferase 10 [Rattus
norvegicus]
gi|51315730|sp|Q925R7.1|GLT10_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
AltName: Full=Polypeptide GalNAc transferase 10;
Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 10;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|14150450|gb|AAK54498.1|AF241241_1 UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase T9 [Rattus
norvegicus]
gi|149052685|gb|EDM04502.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Rattus norvegicus]
Length = 603
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 165/350 (47%), Positives = 220/350 (62%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGKPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLETLP 145
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD+ Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVVAQ 423
>gi|109079467|ref|XP_001111603.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
isoform 5 [Macaca mulatta]
Length = 603
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 381 ------PAGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAAQ 423
>gi|350594474|ref|XP_003134177.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Sus
scrofa]
Length = 624
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 109 GNGEQGKPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLEMLP 166
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 167 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALF-P 225
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 226 NVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 285
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 286 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 341
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 342 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 401
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + + Y R P L GD++ Q
Sbjct: 402 P------AGVSLARNLKRVAEVWMDE-YAEHIYQRRPEYRHLSAGDVAAQ 444
>gi|449679600|ref|XP_004209371.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Hydra magnipapillata]
Length = 565
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 163/361 (45%), Positives = 230/361 (63%), Gaps = 15/361 (4%)
Query: 14 EPPLEPYKEGPGEGGKAYHL-PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
EP + PGE G A PE Y + + YG N TS+ ISF+R++PD R +EC
Sbjct: 37 EPTGISNQSSPGEQGIAVVTSPEDY-GKRNQAYTLYGFNQFTSDKISFNRSLPDPRPQEC 95
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K Y LP SV+++FHNEG+S+L+RTVHS++ R+P++ L EIIL DD+S K L ++
Sbjct: 96 KITKYQSRLPTVSVVIIFHNEGWSTLLRTVHSVLNRSPSKLLHEIILCDDYSQKEHLKKQ 155
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LEDYI + K++L+R +EREGLIR R GA + G++I+FLD+HCE + WLPPL++ I
Sbjct: 156 LEDYIIPY-PKIKLVRTSEREGLIRARVHGANHANGDIIIFLDSHCEANVGWLPPLVSEI 214
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 252
+ + +T P +D ID+ ++ +R V D + RG F W YKE + E + RK +E
Sbjct: 215 EKNYRCVTCPTVDFIDHDSFYYRGV---DPYIRGTFNWRFDYKERGITEHQKAARKSVTE 271
Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
+SP AGGLFA+ + F+ ELG YDPG+ VWGGE +E+SFK+WMCGG + +PCSR+GH
Sbjct: 272 GVRSPVMAGGLFAISKKFWEELGKYDPGMYVWGGEQYEISFKLWMCGGEMLNMPCSRVGH 331
Query: 313 VYRSFMPYNFGKLADRVKGPLITY-NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
VYR +PY + K P + N+KRV E W DE K + Y P+ + G+ISE
Sbjct: 332 VYRRNVPYTYNK-------PFASLINFKRVAEVWMDE-FKEFLYRGNPMVRSQNAGNISE 383
Query: 372 Q 372
+
Sbjct: 384 R 384
>gi|47847466|dbj|BAD21405.1| mFLJ00205 protein [Mus musculus]
Length = 634
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 165/350 (47%), Positives = 220/350 (62%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 119 GYGEQGKPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLETLP 176
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 177 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 236
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 237 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 295
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 296 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 351
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 352 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 411
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD+ Q
Sbjct: 412 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVVAQ 454
>gi|194669011|ref|XP_001788574.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Bos
taurus]
Length = 652
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 165/350 (47%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE GK + L A R D + E G N+ S+ IS +R++PD+R CK Y LP
Sbjct: 137 GDGEQGKPFPLTYAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCKSKRYLETLP 194
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 195 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 254
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 255 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 313
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 314 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 369
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 370 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 429
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + + Y R P L GD++ Q
Sbjct: 430 P------AGVSLARNLKRVAEVWMDE-YAEHIYQRRPEYRHLSAGDVTAQ 472
>gi|149726707|ref|XP_001501206.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Equus
caballus]
Length = 561
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 163/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 46 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 103
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 104 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 163
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 164 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 222
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 223 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 278
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 279 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 338
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + + Y R P L GD++ Q
Sbjct: 339 P------AGVSLARNLKRVAEVWMDE-YAEHIYQRRPEYRHLSAGDVAAQ 381
>gi|355691777|gb|EHH26962.1| hypothetical protein EGK_17053, partial [Macaca mulatta]
gi|355750353|gb|EHH54691.1| hypothetical protein EGM_15579, partial [Macaca fascicularis]
Length = 551
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 36 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 93
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 94 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 153
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 154 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 212
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 213 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 268
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 269 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 328
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 329 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAAQ 371
>gi|109079473|ref|XP_001111560.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
isoform 4 [Macaca mulatta]
Length = 602
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 87 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 144
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 145 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 204
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 205 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 263
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 264 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 319
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 320 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 379
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 380 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAAQ 422
>gi|410255362|gb|JAA15648.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
troglodytes]
gi|410303020|gb|JAA30110.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
troglodytes]
gi|410355291|gb|JAA44249.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
troglodytes]
Length = 603
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAVQ 423
>gi|148675838|gb|EDL07785.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Mus musculus]
Length = 603
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 165/350 (47%), Positives = 220/350 (62%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GYGEQGKPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLETLP 145
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD+ Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVVAQ 423
>gi|119389148|pdb|2D7I|A Chain A, Crsytal Structure Of Pp-Galnac-T10 With Udp, Galnac And
Mn2+
gi|119389151|pdb|2D7R|A Chain A, Crystal Structure Of Pp-galnac-t10 Complexed With
Galnac-ser On Lectin Domain
Length = 570
Score = 318 bits (814), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 55 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 112
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 113 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 172
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 173 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 231
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 232 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 287
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 288 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 347
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 348 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAVQ 390
>gi|410949405|ref|XP_003981412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Felis
catus]
Length = 603
Score = 318 bits (814), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 163/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + + Y R P L GD++ Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEHIYQRRPEYRHLSAGDVAAQ 423
>gi|380800197|gb|AFE71974.1| polypeptide N-acetylgalactosaminyltransferase 10, partial [Macaca
mulatta]
Length = 565
Score = 318 bits (814), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 50 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 107
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 108 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 167
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 168 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 226
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 227 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 282
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 283 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 342
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 343 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAVQ 385
>gi|38195091|ref|NP_938080.1| polypeptide N-acetylgalactosaminyltransferase 10 [Homo sapiens]
gi|51315962|sp|Q86SR1.2|GLT10_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
AltName: Full=Polypeptide GalNAc transferase 10;
Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 10;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|25809274|emb|CAD44532.1| polypeptide N-acetylgalactosaminyltransferase 10 [Homo sapiens]
gi|151556534|gb|AAI48616.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10)
[synthetic construct]
gi|157169754|gb|AAI53182.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10)
[synthetic construct]
gi|193785288|dbj|BAG54441.1| unnamed protein product [Homo sapiens]
gi|261858046|dbj|BAI45545.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [synthetic
construct]
Length = 603
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAVQ 423
>gi|28268676|dbj|BAC56890.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Homo sapiens]
Length = 603
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAVQ 423
>gi|431918071|gb|ELK17299.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Pteropus alecto]
Length = 582
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 163/350 (46%), Positives = 220/350 (62%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ I+ +R++PD+R C Y LP
Sbjct: 67 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKIALNRSLPDIRHPNCNNKRYLETLP 124
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P Q + EI+LVDDFS + L + LEDY+ F
Sbjct: 125 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEIVLVDDFSDREHLKKPLEDYMAHFPS 184
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 185 -VRILRTKKREGLIRTRMLGASAASGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 243
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 244 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 299
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 300 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 359
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y R P L GD++ Q
Sbjct: 360 ------PAGVSLARNLKRVAEVWMDE-FAEHIYQRRPEYRHLSAGDVAAQ 402
>gi|410039926|ref|XP_518048.4| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Pan
troglodytes]
Length = 551
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 36 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 93
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 94 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 153
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 154 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 212
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 213 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 268
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 269 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 328
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 329 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAVQ 371
>gi|281345023|gb|EFB20607.1| hypothetical protein PANDA_005411 [Ailuropoda melanoleuca]
Length = 551
Score = 317 bits (813), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 163/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 36 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNGKRYLETLP 93
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 94 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 153
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 154 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIAQNRKTIVCP 212
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 213 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 268
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 269 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 328
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + + Y R P L GD++ Q
Sbjct: 329 P------AGVSLARNLKRVAEVWMDE-YAEHIYQRRPEYRHLSAGDVAAQ 371
>gi|46877107|ref|NP_598950.2| polypeptide N-acetylgalactosaminyltransferase 10 [Mus musculus]
gi|51315866|sp|Q6P9S7.1|GLT10_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
AltName: Full=Polypeptide GalNAc transferase 10;
Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 10;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|38148689|gb|AAH60617.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Mus musculus]
gi|74196924|dbj|BAE35020.1| unnamed protein product [Mus musculus]
Length = 603
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 220/350 (62%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GYGEQGKPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLETLP 145
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+V+ FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASAATGDVVTFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD+ Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVVAQ 423
>gi|74186700|dbj|BAE34806.1| unnamed protein product [Mus musculus]
Length = 603
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 165/350 (47%), Positives = 219/350 (62%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE K Y + +A R D + E G NM S+ IS +R++PD+R C Y LP
Sbjct: 88 GYGEQAKPYPMTDAERV--DQAYRENGFNMYVSDKISLNRSLPDIRHPNCNSKLYLETLP 145
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD+ Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVVAQ 423
>gi|26329191|dbj|BAC28334.1| unnamed protein product [Mus musculus]
Length = 528
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 220/350 (62%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 13 GYGEQGKPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLETLP 70
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 71 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 130
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+V+ FLD+HCE +NWLPPLL I +RK + P
Sbjct: 131 -VRILRTKKREGLIRTRMLGASAATGDVVTFLDSHCEANVNWLPPLLDRIARNRKTIVCP 189
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 190 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 245
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 246 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 305
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD+ Q
Sbjct: 306 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVVAQ 348
>gi|301763571|ref|XP_002917213.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Ailuropoda melanoleuca]
Length = 598
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 163/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 83 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNGKRYLETLP 140
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 141 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 200
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 201 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIAQNRKTIVCP 259
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 260 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 315
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 316 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 375
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + + Y R P L GD++ Q
Sbjct: 376 ------PAGVSLARNLKRVAEVWMDE-YAEHIYQRRPEYRHLSAGDVAAQ 418
>gi|291387688|ref|XP_002710374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Oryctolagus cuniculus]
Length = 603
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKVDP--SDPFESPVMAGG 320
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAAQ 423
>gi|327277504|ref|XP_003223504.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Anolis carolinensis]
Length = 612
Score = 316 bits (810), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 163/352 (46%), Positives = 223/352 (63%), Gaps = 14/352 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R C Y
Sbjct: 92 RTGNGEQGKPYPMTDAERV--DQAYRENGFNIFVSDKISLNRSLPDIRHPNCNSKLYLEK 149
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP SVI+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L ++LEDY+ +F
Sbjct: 150 LPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLRKRLEDYMAQF 209
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
KVR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I + K +
Sbjct: 210 T-KVRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPLLDRIARNHKTIV 268
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P+ID ID+ + + + + RG F+W M YK +P K S+P++SP A
Sbjct: 269 CPMIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIPPELQKPDP--SDPFESPVMA 324
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 325 GGLFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPY 384
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 385 KVP------TGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVATQ 429
>gi|417411867|gb|JAA52354.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 599
Score = 316 bits (810), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 163/350 (46%), Positives = 220/350 (62%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 84 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNRKRYLETLP 141
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 142 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMAHFPS 201
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 202 -VRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 260
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 261 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 316
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 317 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 376
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y R P L GD++ Q
Sbjct: 377 P------AGVSLARNLKRVAEVWMDE-FAEHIYQRRPEYRHLSAGDVAAQ 419
>gi|326928540|ref|XP_003210435.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Meleagris gallopavo]
Length = 562
Score = 316 bits (810), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 223/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R CK Y LP
Sbjct: 43 GNGEQGKPYPMTDAERV--DQAYRENGFNIFVSDKISLNRSLPDIRHPNCKNKLYLEKLP 100
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
SVI+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L ++LEDY+ +F
Sbjct: 101 NTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKRLEDYMAQF-P 159
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 160 NVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 219
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ + + + + RG F+W M YK +P K S+P++SP AGG
Sbjct: 220 MIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIPPELQKLDP--SDPFESPVMAGG 275
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 276 LFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 335
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 336 P------TGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVTAQ 378
>gi|118097436|ref|XP_414578.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Gallus
gallus]
Length = 611
Score = 316 bits (810), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 223/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R CK Y LP
Sbjct: 93 GNGEQGKPYPMTDAERV--DQAYRENGFNIFVSDKISLNRSLPDIRHPNCKNKLYLEKLP 150
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
SVI+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L ++LEDY+ +F
Sbjct: 151 NTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKRLEDYMAQF-P 209
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 210 NVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 269
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ + + + + RG F+W M YK +P K S+P++SP AGG
Sbjct: 270 MIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIPPELQKLDP--SDPFESPVMAGG 325
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 326 LFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 385
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 386 P------TGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVTAQ 428
>gi|449267121|gb|EMC78087.1| Polypeptide N-acetylgalactosaminyltransferase 10, partial [Columba
livia]
Length = 560
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 223/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R CK Y LP
Sbjct: 31 GNGEQGKPYPMTDAERV--DQAYRENGFNIFVSDKISLNRSLPDIRHPNCKNKLYLEKLP 88
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
SVI+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L ++LEDY+ +F
Sbjct: 89 NTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKRLEDYMAQF-P 147
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 148 NVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 207
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ + + + + RG F+W M YK +P K S+P++SP AGG
Sbjct: 208 MIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIPPELQKLDP--SDPFESPVMAGG 263
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 264 LFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 323
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 324 P------TGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAAQ 366
>gi|307186144|gb|EFN71869.1| N-acetylgalactosaminyltransferase 6 [Camponotus floridanus]
Length = 602
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 169/356 (47%), Positives = 221/356 (62%), Gaps = 14/356 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE G+ L + A + G N S+ IS +R++PD+R +C+ Y
Sbjct: 77 EEKRTGMGEHGRPAFLSPSLDARKEKLYQVNGFNAALSDEISLNRSVPDIRHPDCRKKKY 136
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
+L SVI+ FHNE FS+LMRT S+I R+P LEEIILVDD S+K +L +KL+DYI
Sbjct: 137 SKNLDPVSVIVSFHNEHFSTLMRTCWSVINRSPPSLLEEIILVDDASTKVELKKKLDDYI 196
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
++ KV ++R +R GLIR R GAK +R +V+VFLD+H E +NWLPPLL PI + K
Sbjct: 197 AQYLPKVSIVRLAKRSGLIRGRLAGAKAARAKVLVFLDSHSEANVNWLPPLLEPIAQNYK 256
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
P ID I Y+T+E+R+ D RG F+W + YK L + K+ +EP+KSP
Sbjct: 257 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLKR---PAEPFKSP 310
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+ FF ELGGYDPGL +WGGE +ELSFKIW CGG + PCSR+GH+YR F
Sbjct: 311 IMAGGLFAISAKFFWELGGYDPGLDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKF 370
Query: 318 MPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
P+ N G +G + NYKRV E W DE + Y Y R P LD GD+SEQ
Sbjct: 371 PPFPNPG------RGDFLGKNYKRVAEVWMDE-YAEYIYKRRPHLRALDPGDLSEQ 419
>gi|444727227|gb|ELW67729.1| N-acetylgalactosaminyltransferase 7 [Tupaia chinensis]
Length = 606
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 157/303 (51%), Positives = 213/303 (70%), Gaps = 5/303 (1%)
Query: 72 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
CKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+L+DDFS+K L +
Sbjct: 5 CKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIVLIDDFSNKEHLKE 64
Query: 132 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPPLLA 190
+L++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW PL+A
Sbjct: 65 RLDEYIKMWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAPLVA 124
Query: 191 PIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
PI DR TVP+ID ID + E + + D RG ++W +L+K L +E KRK
Sbjct: 125 PISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLNHKEKAKRK 184
Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +VPCS
Sbjct: 185 HKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFVPCS 244
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
R+GH+YR + V NY RV+E W+DE +K YFY P + L GD
Sbjct: 245 RVGHIYR-LEGWQGNPPPVYVGSSPTLKNYIRVVEVWWDE-YKDYFYASRPESKALPYGD 302
Query: 369 ISE 371
ISE
Sbjct: 303 ISE 305
>gi|345307949|ref|XP_001508273.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Ornithorhynchus anatinus]
Length = 593
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 163/350 (46%), Positives = 223/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGKPYPMTDAERV--DQAYRENGFNIFVSDKISLNRSLPDIRHPNCNNKLYLEKLP 145
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
SVI+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L ++LEDY+ RF
Sbjct: 146 NTSVIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKRLEDYMARF-P 204
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
+VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 205 RVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ + + + + RG F+W M YK +P+ K S+P++SP AGG
Sbjct: 265 MIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIPQELQKPDP--SDPFESPVMAGG 320
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+D+ +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 321 LFAVDKKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD+ Q
Sbjct: 381 P------TGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVVAQ 423
>gi|395817210|ref|XP_003782067.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Otolemur garnettii]
Length = 603
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 163/350 (46%), Positives = 220/350 (62%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGRPYPMSDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LE Y+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEAYMALFPS 205
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAAQ 423
>gi|348575151|ref|XP_003473353.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Cavia porcellus]
Length = 602
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 163/350 (46%), Positives = 218/350 (62%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + E R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 87 GNGEQGRPYPMTEGERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLEVLP 144
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 145 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 204
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 205 -VRILRTKRREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 263
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 264 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 319
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 320 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 379
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W D+ + Y Y R P L GD+ Q
Sbjct: 380 P------AGVSLARNLKRVAEVWMDD-YAEYIYQRRPEYRHLSAGDVVAQ 422
>gi|224496010|ref|NP_001139074.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Danio rerio]
Length = 600
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 167/350 (47%), Positives = 219/350 (62%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE GK Y L E D+ E G N+ SN+I+ DR++PD+R CK Y +LP
Sbjct: 82 GKGEHGKPYPLVED--ECDDSVYKENGFNIYVSNNIALDRSLPDIRHPNCKQKLYLENLP 139
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RT+HSI RTP + EIILVDD+S + L L +Y+ RF
Sbjct: 140 NTSIIIPFHNEGWSSLLRTLHSISNRTPDHLIAEIILVDDYSDREHLKAHLAEYMSRF-P 198
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
KVR++R +REGLIRTR GA +RGEV+ FLD+HCE +NWLPPLL I + K + P
Sbjct: 199 KVRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEANINWLPPLLDQIAQNPKTIVCP 258
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ + + + + RG F+W M YK +P S+PY+SP AGG
Sbjct: 259 MIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPELQGPDP--SDPYQSPVMAGG 314
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA++R +F ELGGYD GL +WGGE FE+SFK+WMCGGS+ VPCSR+GH+YR ++PY
Sbjct: 315 LFAVNRQWFWELGGYDTGLEIWGGEQFEISFKVWMCGGSMYDVPCSRVGHIYRKYVPYKV 374
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV ETW DE + Y Y R P L GD++ Q
Sbjct: 375 P------SGTSLARNLKRVAETWMDE-YTEYIYQRRPEYRHLSTGDLTAQ 417
>gi|354481325|ref|XP_003502852.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Cricetulus griseus]
Length = 715
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 219/350 (62%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 200 GNGEQGRPYPMTDAERE--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLETLP 257
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 258 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 317
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 318 -VRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 376
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 377 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 432
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR +PY
Sbjct: 433 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKSVPYKV 492
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD+ Q
Sbjct: 493 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVVAQ 535
>gi|410897068|ref|XP_003962021.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Takifugu rubripes]
Length = 556
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 161/371 (43%), Positives = 233/371 (62%), Gaps = 13/371 (3%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K G+L P L EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKDGSLLPALRAVISRRHEGPGEMGKAVVIPKDEQEKMKELFKINQFNLMASDMIALNR 95
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R++ CK YP D+P S+++VFHNE +S+L+RTVHS+I R+P L EI+LVDD
Sbjct: 96 SLPDVRLDGCKTKVYPDDVPNTSIVIVFHNEAWSTLLRTVHSVINRSPRHLLVEIVLVDD 155
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L +KLE+Y++ VR++R +R GLIR R RGA ++G+VI FLDAHCE +
Sbjct: 156 ASERDFLKKKLENYVRTLEVPVRILRMEQRSGLIRARLRGAAATKGQVITFLDAHCECTV 215
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA I DR + P+ID I +T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRTAVVCPIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
E +RK + + P ++PT AGGLF++D+ +F E+G YDPG+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGS 332
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+E V CS +GHV+R PY+F G +I N +R+ E W D+ K +FY P
Sbjct: 333 LEIVTCSHVGHVFRKATPYSFPGGT----GQVINKNNRRLAEVWMDD-FKDFFYIISPGV 387
Query: 362 MFLDMGDISEQ 372
M +D GD+S +
Sbjct: 388 MRVDYGDVSSR 398
>gi|427784527|gb|JAA57715.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 612
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 162/352 (46%), Positives = 221/352 (62%), Gaps = 14/352 (3%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
+GPGE G A+ LP D G N S+ I+ +R++PD+R C+ Y L
Sbjct: 100 KGPGEQGAAFFLPAGMEKKKDELYKVNGFNALASDFIALNRSLPDIRNPGCQKKRYVSKL 159
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI+ FHNE +++L+RT S++ R+P + ++EIIL DD+S+K L + LEDYI +
Sbjct: 160 PTVSVIVPFHNEHWTTLLRTATSVLNRSPPELIKEIILADDYSNKEQLKKPLEDYIAKHW 219
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
KVR++R T REGLIR R GA+++ G+V++FLD+H E +NWLPPLL PI D + +
Sbjct: 220 NKVRVVRATRREGLIRARLLGARQATGDVLIFLDSHTEANVNWLPPLLEPIAKDYRTVVC 279
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTHA 260
P ID IDY+T+ +R+ D RG F+W + YK LPE A +EP+KSP A
Sbjct: 280 PFIDVIDYETFAYRA---QDEGARGSFDWELYYKRLPLLPEDLANP----TEPFKSPVMA 332
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+ R +F ELGGYD GL VWGGE +ELSFKIW CGG++ PCSR+GH+YR F P+
Sbjct: 333 GGLFAISRRYFWELGGYDEGLDVWGGEQYELSFKIWQCGGTMVDAPCSRVGHIYRKFAPF 392
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ D + NY+RV E W DE +K Y Y R P L+ GD++ Q
Sbjct: 393 PNPGIGD-----FVGRNYRRVAEVWMDE-YKEYLYMRRPHYRNLEPGDLTAQ 438
>gi|198434303|ref|XP_002132126.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
17 [Ciona intestinalis]
Length = 870
Score = 314 bits (805), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 168/354 (47%), Positives = 221/354 (62%), Gaps = 22/354 (6%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
GPGE G A HL R+ ++ E G N+ SN IS +R++PD+R + C Y LP
Sbjct: 355 GPGELGVAVHLSTEERSR--SAYSENGFNILVSNRISLNRSLPDIRHKNCASRKYLAQLP 412
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS----KADLDQKLEDYIQ 138
AS+I+ FHNEG ++L+RT+HSII RTP L EIILVDD S+ K+ LDQ+L Y Q
Sbjct: 413 DASIIIPFHNEGRTTLLRTIHSIINRTPKILLREIILVDDCSTVDHLKSSLDQELSKYRQ 472
Query: 139 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
V+L+R +REGLIR R G +++G IV LD+H EV NWLPPLL PI DRK+
Sbjct: 473 -----VKLVRLAKREGLIRARLAGVHQAKGNTIVILDSHVEVTNNWLPPLLEPIALDRKV 527
Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPT 258
+T P+ID I+ +F + +P RG F+W + YK +P K+ K S+P++ P
Sbjct: 528 ITCPMIDIINKD--DFHYLTQPGDAMRGAFDWELYYKRIPIPPE--KQLKDPSDPFEDPV 583
Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
AGGLFA+DR +F E+G YD GL +WGGE +ELSFK WMCGG I PCSR+GH+YR FM
Sbjct: 584 MAGGLFAIDRLYFKEIGEYDDGLEIWGGEQYELSFKAWMCGGKILDAPCSRVGHIYREFM 643
Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
PY+ G I N+KRV E W DE + YFY + P + GD+S+Q
Sbjct: 644 PYSLP------PGTNINKNFKRVAEVWMDE-YAEYFYKKRPHVRGIHPGDLSKQ 690
>gi|449474909|ref|XP_002194974.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Taeniopygia guttata]
Length = 555
Score = 313 bits (803), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 162/350 (46%), Positives = 222/350 (63%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R CK Y LP
Sbjct: 37 GNGEQGKPYPMTDAERV--DQAYRENGFNIFVSDKISLNRSLPDIRHPNCKNKLYLEKLP 94
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
SVI+ FHNEG+SSL+RTVHS++ R+P + + E++LVDDFS + L ++LEDY+ +F
Sbjct: 95 NTSVIIPFHNEGWSSLLRTVHSVLNRSPPELVAEVVLVDDFSDREHLKKRLEDYMAQF-P 153
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 154 SVRILRTKRREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 213
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ + + + + RG F+W M YK +P K S+P++SP AGG
Sbjct: 214 MIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIPPELQKPDP--SDPFESPVMAGG 269
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 270 LFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 329
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + + Y R P L GD++ Q
Sbjct: 330 P------TGVSLARNLKRVAEVWMDE-YAEFIYQRRPEYRHLSAGDVAAQ 372
>gi|321476751|gb|EFX87711.1| hypothetical protein DAPPUDRAFT_306553 [Daphnia pulex]
Length = 626
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 164/355 (46%), Positives = 222/355 (62%), Gaps = 8/355 (2%)
Query: 17 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
L + G G GGKA L A + + + N+ SN IS++RT+PD+R CK
Sbjct: 111 LNKIENGLGAGGKAVKLFGAELQEAEEIMKKEAFNLFISNRISYNRTLPDVRDSMCKGLT 170
Query: 77 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
Y LP ASVI++F NE +S L+RT+ S+I R+P ++L+EI+L+DDFS + +L KLE Y
Sbjct: 171 YDTILPSASVIIIFTNEAWSPLIRTIWSVINRSPRKFLKEILLIDDFSDRVELQGKLERY 230
Query: 137 IQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
I+ + VRL+R ER+GLIR R GAKE+ GEVI+FLD+HCE L WL PLL I D
Sbjct: 231 IETQLPSIVRLVRLKERQGLIRARLAGAKEATGEVIIFLDSHCEATLGWLEPLLQRIKED 290
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYK 255
++ + VP+ID ID +T E+ P+ G F W + ++P+RE K+R P
Sbjct: 291 KRAVLVPIIDVIDDKTLEYYH-GSPESFQIGSFTWSGHFTWMDIPKREIKRRGSRVGPTN 349
Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
SPT AGGLFA+DR +F +LG YD G+ VWGGEN E+SF+IWMCGGS+E +PCSR+GH++R
Sbjct: 350 SPTMAGGLFAIDRQYFWDLGSYDEGMDVWGGENLEMSFRIWMCGGSLETIPCSRVGHIFR 409
Query: 316 SFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
SF PY F D N RV+E W D+ +K FY +D+GD S
Sbjct: 410 SFHPYTFPGNKDTH-----GINTARVVEVWMDD-YKELFYMHRGDLKTIDIGDTS 458
>gi|391332245|ref|XP_003740546.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
N-acetylgalactosaminyltransferase 10-like [Metaseiulus
occidentalis]
Length = 590
Score = 312 bits (800), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 163/359 (45%), Positives = 228/359 (63%), Gaps = 16/359 (4%)
Query: 18 EPYKEGPGEGGKAYHLP-EAYRAAGDASLGEY-GMNMETSNHISFDRTIPDLRMEECKYW 75
E +GPGE G A LP +A L + G N S+ I+ +R++PD+R EC+
Sbjct: 70 EKLAQGPGEQGAAVELPKDAETEQRKEKLYKVNGFNAAVSDLIALNRSLPDIRHSECQNI 129
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK-ADLDQKLE 134
Y LP AS+++ FHNE S L+RT+ S+++R+P ++EIILVDDFSSK + + +LE
Sbjct: 130 RYAARLPTASIVIPFHNEHLSVLLRTITSVLRRSPKSLIKEIILVDDFSSKKSXVSTELE 189
Query: 135 DYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIY 193
+Y+ F +V+L+R T+REGLIR R GA+ + G+V++FLD+H E +NWLPPLL PI
Sbjct: 190 NYLSSHFGSQVKLLRATKREGLIRARLLGARAAEGDVLIFLDSHTEANVNWLPPLLDPIA 249
Query: 194 SDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEP 253
+R+ + P ID I Y+T+ +RS D RG F+W + YK L + K+ +EP
Sbjct: 250 RNRRTVVCPFIDVIHYETFAYRS---QDEGARGAFDWELYYKRLPLLSEDLKR---PTEP 303
Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
++SP AGGLFA+DR++F ELGGYD GL VWGGE +ELSFKIW CGG + PCSR+GH+
Sbjct: 304 FRSPVMAGGLFAIDRSYFWELGGYDEGLDVWGGEQYELSFKIWQCGGQMFDAPCSRVGHI 363
Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
YR F P+ + D + NY+RV E W DE +K + Y R P L GD+S+Q
Sbjct: 364 YRKFAPFPNPGIGD-----FVGRNYRRVAEVWMDE-YKEFLYNRRPHYRTLGYGDVSKQ 416
>gi|345308178|ref|XP_003428667.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
2 [Ornithorhynchus anatinus]
Length = 558
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 160/361 (44%), Positives = 225/361 (62%), Gaps = 9/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
L PP++ EGPGE GK +P+ + N+ S I+F+R++PD+R+E C
Sbjct: 46 LRPPIQKPHEGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASERIAFNRSLPDVRLEGC 105
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 106 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 165
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G VI FLDAHCE + WL PLLA I
Sbjct: 166 LESYVRKLRVPVHVIRMEQRSGLIRARLKGAAASKGRVITFLDAHCECTVGWLEPLLARI 225
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 226 KFDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 282
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 283 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 342
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 343 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 397
Query: 372 Q 372
+
Sbjct: 398 R 398
>gi|332251762|ref|XP_003275018.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Nomascus leucogenys]
Length = 557
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 161/361 (44%), Positives = 227/361 (62%), Gaps = 9/361 (2%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
G++ P+ +EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E
Sbjct: 45 GDILKPITKNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLE 104
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L
Sbjct: 105 GCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLK 164
Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA
Sbjct: 165 LTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLA 224
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
+ P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS
Sbjct: 282 RTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSH 341
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHV+R PY F G +I N +R+ E W DE K +FY P + +D GD+
Sbjct: 342 VGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDV 396
Query: 370 S 370
S
Sbjct: 397 S 397
>gi|390361781|ref|XP_790897.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Strongylocentrotus purpuratus]
Length = 521
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 165/350 (47%), Positives = 216/350 (61%), Gaps = 10/350 (2%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
PGE G A L + G N S+ IS DR +PD+R CK Y LP
Sbjct: 1 PGERGVAVKLTPEMKKTEKKDTSANGFNERVSDMISMDRALPDIRNPRCKEITYLAKLPN 60
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
SVI+ FHNE S+L RTVHSI R+P + + EIILVDDFS +A L L+DY+ F K
Sbjct: 61 VSVIIPFHNEALSTLKRTVHSIFNRSPPELIHEIILVDDFSDRAYLKGPLDDYMSAFP-K 119
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
V++IR +REGLIRTR GA + G+V++FLD+HCE NWLPPLL I +R+ + P+
Sbjct: 120 VKIIRLEKREGLIRTRLLGAGPATGDVVLFLDSHCEANYNWLPPLLERIALNRRRIVCPM 179
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 263
ID I + + + S + RG F+W + YK + E E K+R + S+P+++P AGGL
Sbjct: 180 IDVISNEDFHYES--QAGDVMRGAFDWELYYKRIPISEAENKRRSHESDPFRTPIMAGGL 237
Query: 264 FAMDRAFFL-ELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
FA+DR +F+ ELGGYD GL +WGGE ++LSFK+WMCGG +E +PCSR+GH+YR FM Y
Sbjct: 238 FAVDRKYFMEELGGYDEGLEIWGGEQYDLSFKVWMCGGEMEEIPCSRVGHIYRKFMSYTV 297
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
A +I N RV+E W DE K YFY R P D GDIS+Q
Sbjct: 298 PGGAG-----VINKNLLRVVEVWMDEWGK-YFYERRPYLKGQDYGDISKQ 341
>gi|432908535|ref|XP_004077909.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Oryzias latipes]
Length = 557
Score = 311 bits (798), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 160/366 (43%), Positives = 228/366 (62%), Gaps = 13/366 (3%)
Query: 8 GKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL 67
G+ +L P ++GPGEGGK +P+ + N+ S I+ +R++PD+
Sbjct: 44 GRADSLSRP----RDGPGEGGKPVVIPKENQEKMKEMFKINQFNLMASEMIALNRSLPDV 99
Query: 68 RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
R+E CK YP DLP+ SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S +
Sbjct: 100 RLEGCKNKLYPDDLPRTSVVIVFHNEAWSTLLRTVHSVIDRSPRSLLEEIVLVDDASERD 159
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L ++LE Y++R VR++R +R GLIR R +GA S G+VI FLDAHCE L WL P
Sbjct: 160 FLKRQLEQYVRRLEVPVRVVRMEQRSGLIRARLKGASISTGQVITFLDAHCECTLGWLEP 219
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 247
LL I D++ + P+ID I T+E+ + D Y G F W + ++ +P+RE +R
Sbjct: 220 LLTRIKQDKRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRR 276
Query: 248 KYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
K + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V
Sbjct: 277 KGDRTIPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVT 336
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDM 366
CS +GHV+R PY F G +I N +R+ E W DE K +FY P +D
Sbjct: 337 CSHVGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDY 391
Query: 367 GDISEQ 372
GDIS +
Sbjct: 392 GDISTR 397
>gi|449278148|gb|EMC86104.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Columba livia]
Length = 553
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 158/363 (43%), Positives = 228/363 (62%), Gaps = 9/363 (2%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
G++ P++ EGPGE GK +P+ + N+ S I+ +R++PD+R+E
Sbjct: 45 GDVPEPIQKPHEGPGEMGKPVVIPKEEQEKMKEMFKINQFNLMASEIIALNRSLPDVRLE 104
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
CK YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L
Sbjct: 105 GCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLK 164
Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
+ LE+Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA
Sbjct: 165 RPLENYVKKLKVPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPLLA 224
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
I +DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKADRRTVVCPIIDVISDDTFEYMA--GSDKTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
+ P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS
Sbjct: 282 RTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSH 341
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHV+R PY F G +I N +R+ E W DE K +FY P +D GDI
Sbjct: 342 VGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDI 396
Query: 370 SEQ 372
S +
Sbjct: 397 SSR 399
>gi|118404262|ref|NP_001072444.1| polypeptide N-acetylgalactosaminyltransferase 10 [Xenopus
(Silurana) tropicalis]
gi|113197915|gb|AAI21701.1| GalNAc transferase 10 [Xenopus (Silurana) tropicalis]
Length = 603
Score = 311 bits (797), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 161/350 (46%), Positives = 218/350 (62%), Gaps = 14/350 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE GK + + +A D + E G N+ S+ IS +R++PD+R CK Y LP
Sbjct: 85 GNGEQGKPFPMTDADHV--DQAYRENGFNIFVSDKISLNRSLPDIRNSNCKNKFYFSKLP 142
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
SVI+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDD+S KA L +LE Y+ F
Sbjct: 143 NTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDYSDKAHLKSRLEKYMANF-P 201
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
KV+++R +REGLIRTR GA + GEV+ FLD+HCE +NWLPPLL P+ + K + P
Sbjct: 202 KVKIVRTKKREGLIRTRMLGATVASGEVLTFLDSHCEANVNWLPPLLDPLVQNYKTVVCP 261
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID F V + RG F+W M YK +P K S+P+ SP AGG
Sbjct: 262 MIDVIDSDN--FGYVTQAGDAMRGAFDWEMFYKRIPIPPELQKGDP--SDPFDSPVMAGG 317
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA++R +F +LGGYDPGL +WGGE +E+SFK+WMCGG + PCSR+GH+YR ++PY
Sbjct: 318 LFAINREWFWQLGGYDPGLEIWGGEQYEISFKVWMCGGRMVDSPCSRVGHIYRKYVPYKV 377
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L +GD++ Q
Sbjct: 378 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPDYRHLSVGDVAAQ 420
>gi|405975554|gb|EKC40113.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Crassostrea gigas]
Length = 624
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 160/353 (45%), Positives = 224/353 (63%), Gaps = 11/353 (3%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYP--LD 80
GPGE GK +P +A N+ S+ IS +R++PD RM+ CK YP D
Sbjct: 114 GPGEMGKPVVIPLDRQAESKEKFKINQFNLVASDMISLNRSLPDYRMDACKRKSYPPNSD 173
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP SV++VFHNE +S+L+RTVHSII R+P + L EI+LVDD S + +L +KLEDYI R
Sbjct: 174 LPDTSVVIVFHNEAWSTLLRTVHSIINRSPRELLNEILLVDDASEREELGKKLEDYIARL 233
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
R+IR+ ER GLIR R +GAK++RG+VI FLDAHCE WL PLL I+ DR +
Sbjct: 234 PVSTRVIRSEERTGLIRARLKGAKQARGKVITFLDAHCECTEGWLEPLLYEIHKDRTAVV 293
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
P+ID I ++E+ + D + G F W + ++ +P+RE +R + S P K+PT
Sbjct: 294 CPIIDVIGDDSFEY--ITGSDMTWGG-FNWKLNFRWYPVPQRELDRRGGDRSNPTKTPTM 350
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLF++DR +F E+G YD G+ +WGGEN E+SF++WMCGG + V CSR+GHV+R P
Sbjct: 351 AGGLFSIDRDYFYEVGSYDEGMDIWGGENLEMSFRVWMCGGKVYIVTCSRVGHVFRKTSP 410
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
Y++ R+ I +N +R++E W DE +K +FY P GD+SE+
Sbjct: 411 YSWPGGVARI----INHNTQRIVEVWMDE-YKDFFYKINPGVRSTSYGDVSER 458
>gi|326670471|ref|XP_002663357.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Danio rerio]
Length = 556
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 159/364 (43%), Positives = 226/364 (62%), Gaps = 9/364 (2%)
Query: 10 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
L L + E PGE GKA +P+ + N+ S+ I+ +R++PD+R+
Sbjct: 43 LPALRAVMSRAHEAPGEMGKAVVIPKEEQDKMKELFKINQFNLMASDMIALNRSLPDVRL 102
Query: 70 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
+ CK YP DLP S+++VFHNE +S+L+RTVHS I R+P Q L EI+LVDD S + L
Sbjct: 103 DGCKTKTYPDDLPNTSIVIVFHNEAWSTLLRTVHSAINRSPRQLLYEILLVDDASERDFL 162
Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
+KLEDY+ VR++R +R GLIR R RGA +RG+VI FLDAHCE WL PL+
Sbjct: 163 KEKLEDYVATLEVPVRILRMEQRTGLIRARLRGAAATRGQVITFLDAHCECTTGWLEPLM 222
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
A I DR+ + P+ID I +T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 ARIKEDRRAVVCPIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKG 279
Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS
Sbjct: 280 DRTLPVRTPTMAGGLFSIDRTYFEEIGTYDSGMDIWGGENLEMSFRIWQCGGSLEIVTCS 339
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
+GHV+R PY+F G +I N +R+ E W DE K +FY P + +D GD
Sbjct: 340 HVGHVFRKATPYSFPGGT----GQVINKNNRRLAEVWMDE-FKDFFYIISPGVVRVDYGD 394
Query: 369 ISEQ 372
+S +
Sbjct: 395 VSSR 398
>gi|13878612|sp|Q29121.1|GALT1_PIG RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|1339955|dbj|BAA12800.1| N-acetylgalactosaminyl transferase [Sus sp.]
Length = 559
Score = 310 bits (795), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQDKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKTFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|395510712|ref|XP_003759616.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
[Sarcophilus harrisii]
Length = 559
Score = 310 bits (794), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +RT+PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVAIPKEDQEKMKEMFKINQFNLMASEMIALNRTLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVRKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KVDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRHYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDIST 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|322787059|gb|EFZ13283.1| hypothetical protein SINV_13249 [Solenopsis invicta]
Length = 540
Score = 310 bits (794), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 167/356 (46%), Positives = 220/356 (61%), Gaps = 14/356 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE G+ L + + G N S+ IS +R++PD+R +CK Y
Sbjct: 17 EERRTGMGEHGRPAFLSPSLDVRKEKLYQVNGFNAALSDEISVNRSVPDIRHSDCKKKQY 76
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
+L SVI+ FHNE FS+L+RT S++ R+P LEEIILVDD S+K +L +KL+DY+
Sbjct: 77 LKNLDPVSVIVSFHNEHFSTLLRTCWSVVNRSPPSLLEEIILVDDASTKIELKKKLDDYV 136
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
+ KV ++R +R GLIR R GAK++R +V+VFLD+H E +NWLPPLL PI D K
Sbjct: 137 AQHLPKVLIVRLPKRSGLIRGRLAGAKKARAKVLVFLDSHSEANVNWLPPLLEPIARDYK 196
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
P ID I Y+T+E+R+ D RG F+W + YK L + K+ +EP+KSP
Sbjct: 197 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLKR---PAEPFKSP 250
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+ FF ELGGYDPGL +WGGE +ELSFKIW CGG + PCSR+GH+YR F
Sbjct: 251 IMAGGLFAISTKFFWELGGYDPGLDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKF 310
Query: 318 MPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
P+ N G +G + NYKRV E W DE + Y Y R P LD GD+SEQ
Sbjct: 311 PPFPNPG------RGDFLGKNYKRVAEVWMDE-YAEYIYKRRPHLRTLDPGDLSEQ 359
>gi|301766699|ref|XP_002918770.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 2 [Ailuropoda melanoleuca]
Length = 557
Score = 310 bits (794), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 162/361 (44%), Positives = 226/361 (62%), Gaps = 9/361 (2%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
G + PL+ EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E
Sbjct: 45 GFIHIPLQDPHEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLE 104
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L
Sbjct: 105 GCKTKIYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERDFLK 164
Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA
Sbjct: 165 LTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLA 224
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
+ P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS
Sbjct: 282 RTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSH 341
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHV+R PY F G +I N +R+ E W DE K +FY P + +D GD+
Sbjct: 342 VGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDV 396
Query: 370 S 370
S
Sbjct: 397 S 397
>gi|348526962|ref|XP_003450988.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Oreochromis niloticus]
Length = 557
Score = 310 bits (793), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 156/353 (44%), Positives = 223/353 (63%), Gaps = 9/353 (2%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
++GPGEGGK +P+ + N+ S I+ +R++PD+R+E CK YP +
Sbjct: 53 RDGPGEGGKPVVIPKEQQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKNKLYPDN 112
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP+ SV++VFHNE +++L+RTVHS+I R+P LEEI+LVDD S + L Q+LE Y+++
Sbjct: 113 LPRTSVVIVFHNEAWTTLLRTVHSVIDRSPHTLLEEIVLVDDASERDFLKQQLERYVRKL 172
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
VR++R +R GLIR R +GA S G+VI FLDAHCE WL PLLA I DRK +
Sbjct: 173 EVPVRVVRMEQRSGLIRARLKGASISTGQVITFLDAHCECTTGWLEPLLARIKQDRKTVV 232
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + + P ++PT
Sbjct: 233 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTM 289
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +GHV+R P
Sbjct: 290 AGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKATP 349
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
Y F G +I N +R+ E W DE K +FY P +D GDI+ +
Sbjct: 350 YTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDITSR 397
>gi|149412842|ref|XP_001510290.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Ornithorhynchus anatinus]
Length = 559
Score = 310 bits (793), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 159/363 (43%), Positives = 226/363 (62%), Gaps = 9/363 (2%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
G++ P++ EGPGE GK +P+ + N+ S I+F+R++PD+R+E
Sbjct: 45 GDVPEPIQKPHEGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASERIAFNRSLPDVRLE 104
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
CK YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L
Sbjct: 105 GCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLK 164
Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
+ LE Y+++ V +IR +R GLIR R +GA S+G VI FLDAHCE + WL PLLA
Sbjct: 165 RPLESYVRKLRVPVHVIRMEQRSGLIRARLKGAAASKGRVITFLDAHCECTVGWLEPLLA 224
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKFDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
+ P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS
Sbjct: 282 RTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSH 341
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHV+R PY F G +I N +R+ E W DE K +FY P +D GDI
Sbjct: 342 VGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDI 396
Query: 370 SEQ 372
S +
Sbjct: 397 SSR 399
>gi|426253597|ref|XP_004020479.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Ovis
aries]
Length = 559
Score = 310 bits (793), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|29135331|ref|NP_803485.1| polypeptide N-acetylgalactosaminyltransferase 1 precursor [Bos
taurus]
gi|1171989|sp|Q07537.1|GALT1_BOVIN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|289412|gb|AAA30532.1| UDP-GalNAc:polypeptide, N-acetylgalactosaminyltransferase [Bos
taurus]
gi|296473855|tpg|DAA15970.1| TPA: polypeptide N-acetylgalactosaminyltransferase 1 [Bos taurus]
Length = 559
Score = 310 bits (793), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|327281385|ref|XP_003225429.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 2 [Anolis carolinensis]
Length = 557
Score = 310 bits (793), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 160/363 (44%), Positives = 227/363 (62%), Gaps = 9/363 (2%)
Query: 9 KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
+ G PL+ +EGPGE GKA +P+ + N+ S+ I+ +R++PD+R
Sbjct: 43 QAGQTMIPLQRNQEGPGEMGKAVIIPKDDQEKMKELFKINQFNLMASDMIALNRSLPDVR 102
Query: 69 MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
+E CK YP +LP SV++VFHNE +S+L+RT++S+I R P L EIILVDD S +
Sbjct: 103 LEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTIYSVINRAPHYLLAEIILVDDASERDF 162
Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
L LE+Y++ V+++R +R GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 163 LKVPLENYVKTLQVPVKIMRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 222
Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
LA I DRKI+ P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKEDRKIVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 249 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 307
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 339
Query: 308 SRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG 367
S +GHV+R PY F G +I N +R+ E W DE K +FY P + +D G
Sbjct: 340 SHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYG 394
Query: 368 DIS 370
D++
Sbjct: 395 DVT 397
>gi|149720888|ref|XP_001496819.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Equus caballus]
Length = 559
Score = 310 bits (793), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|348519902|ref|XP_003447468.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Oreochromis niloticus]
Length = 556
Score = 310 bits (793), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 155/352 (44%), Positives = 225/352 (63%), Gaps = 9/352 (2%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
EGPGE GKA ++P+ + N+ S+ I+ +R++PD+R++ CK Y DL
Sbjct: 55 EGPGEMGKAVNIPKDDQEKMKELFKINQFNLMASDMIALNRSLPDVRLDGCKTKVYSDDL 114
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P S+++VFHNE +S+L+RTVHS+I R+P L EIILVDD S + L +KLE+Y++
Sbjct: 115 PNTSIVIVFHNEAWSTLLRTVHSVINRSPKHLLVEIILVDDASERDFLKKKLENYVRTLE 174
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
VR++R +R GLIR R RGA + G+VI FLDAHCE + WL PLLA I DR +
Sbjct: 175 VPVRILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTVGWLEPLLARIKEDRTAVVC 234
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
P+ID I +T+E+ + D Y G F W + ++ +P+RE +RK + + P ++PT A
Sbjct: 235 PIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMA 291
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLF++D+ +F E+G YDPG+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R PY
Sbjct: 292 GGLFSIDKTYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPY 351
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+F G +I N +R+ E W D+ K +FY P M ++ GD+S +
Sbjct: 352 SFPGGT----GQVINKNNRRLAEVWMDD-FKDFFYIISPGVMRVEYGDVSSR 398
>gi|73961264|ref|XP_537284.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Canis lupus familiaris]
gi|301764431|ref|XP_002917637.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Ailuropoda melanoleuca]
gi|281348455|gb|EFB24039.1| hypothetical protein PANDA_005970 [Ailuropoda melanoleuca]
Length = 559
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|340713833|ref|XP_003395440.1| PREDICTED: n-acetylgalactosaminyltransferase 6-like [Bombus
terrestris]
Length = 610
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 169/353 (47%), Positives = 218/353 (61%), Gaps = 14/353 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GK L + A + G N S+ IS +R++PD+R +CK Y +
Sbjct: 88 RTGIGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISMNRSVPDIRHPDCKKKKYLKN 147
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
L SVI+ FHNE FS+LMRT S+I R+PA L+EIILVDD S+KA+L + LEDYI
Sbjct: 148 LDSVSVIVSFHNEHFSTLMRTCWSVINRSPAFLLKEIILVDDASTKAELKKPLEDYITER 207
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
KV+L+R ER GLI+ R GAK ++ +V+VFLD+H E +NWLPPLL PI D K
Sbjct: 208 FTKVKLVRLEERSGLIKGRLAGAKIAKAKVLVFLDSHSEANINWLPPLLEPIAQDYKTCV 267
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P ID I Y+T+E+R+ D RG F+W + YK L + + +EP+KSP A
Sbjct: 268 CPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN---PTEPFKSPVMA 321
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+ FF ELGGYDP L +WGGE +ELSFKIW CGG + PCSR+GH+YR F P+
Sbjct: 322 GGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKFPPF 381
Query: 321 -NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
N G KG + NYKRV E W DE + Y YTR P L+ G++ EQ
Sbjct: 382 PNPG------KGDFLGKNYKRVAEVWMDE-YAEYIYTRRPHLRSLNPGNLKEQ 427
>gi|350586068|ref|XP_003482105.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Sus scrofa]
Length = 559
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQDKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|410977586|ref|XP_003995186.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Felis
catus]
Length = 559
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|441596034|ref|XP_003276624.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Nomascus leucogenys]
gi|119582046|gb|EAW61642.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10),
isoform CRA_d [Homo sapiens]
gi|119582047|gb|EAW61643.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10),
isoform CRA_d [Homo sapiens]
Length = 506
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 157/331 (47%), Positives = 211/331 (63%), Gaps = 12/331 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D + E G N+ S+ IS +R++PD+R C Y LP S+I+ FHNEG+SSL+RT
Sbjct: 8 DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRT 67
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSR 161
VHS++ R+P + + EI+LVDDFS + L + LEDY+ F VR++R +REGLIRTR
Sbjct: 68 VHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS-VRILRTKKREGLIRTRML 126
Query: 162 GAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPD 221
GA + G+VI FLD+HCE +NWLPPLL I +RK + P+ID ID+ +FR +
Sbjct: 127 GASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCPMIDVIDHD--DFRYETQAG 184
Query: 222 HHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGL 281
RG F+W M YK +P K S+P++SP AGGLFA+DR +F ELGGYDPGL
Sbjct: 185 DAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGGLFAVDRKWFWELGGYDPGL 242
Query: 282 LVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRV 341
+WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY G + N KRV
Sbjct: 243 EIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKVP------AGVSLARNLKRV 296
Query: 342 IETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
E W DE + Y Y R P L GD++ Q
Sbjct: 297 AEVWMDE-YAEYIYQRRPEYRHLSAGDVAVQ 326
>gi|395846604|ref|XP_003795993.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Otolemur garnettii]
Length = 558
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 163/359 (45%), Positives = 226/359 (62%), Gaps = 10/359 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
L PP YK GPGE GKA +P+ + N+ S+ I+ +R++PD+R+E C
Sbjct: 49 LIPPQRDYK-GPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGC 107
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L
Sbjct: 108 KTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 167
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE+Y++ + V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I
Sbjct: 168 LENYVKNLDVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 227
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 228 KEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 284
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +G
Sbjct: 285 LPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVG 344
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
HV+R PY F G +I N +R+ E W DE K +FY P + +D GD+S
Sbjct: 345 HVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDVS 398
>gi|304259|gb|AAA68489.1| UDP-GalNAc:polypeptide, N-acetylgalactosaminyltransferase, partial
[Bos taurus]
Length = 519
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 8 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 66
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 67 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 126
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 127 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 186
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 187 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 243
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 244 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 303
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 304 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 358
Query: 372 Q 372
+
Sbjct: 359 R 359
>gi|296222514|ref|XP_002757211.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Callithrix jacchus]
gi|403265072|ref|XP_003924779.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Saimiri
boliviensis boliviensis]
Length = 559
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 160/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|355689583|gb|AER98881.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Mustela putorius
furo]
Length = 461
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 159/363 (43%), Positives = 226/363 (62%), Gaps = 9/363 (2%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
G++ P++ EGPGE GK +P+ + N+ S I+ +R++PD+R+E
Sbjct: 45 GDVLEPIQKPHEGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLE 104
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
CK YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L
Sbjct: 105 GCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLK 164
Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
+ LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA
Sbjct: 165 RPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLA 224
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
+ P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS
Sbjct: 282 RTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSH 341
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHV+R PY F G +I N +R+ E W DE K +FY P +D GDI
Sbjct: 342 VGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDI 396
Query: 370 SEQ 372
S +
Sbjct: 397 SSR 399
>gi|444723970|gb|ELW64593.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Tupaia chinensis]
Length = 591
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 160/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 80 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 138
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 139 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 198
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 199 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 258
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 259 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 315
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 316 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 375
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 376 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 430
Query: 372 Q 372
+
Sbjct: 431 R 431
>gi|268370157|ref|NP_001161259.1| polypeptide GalNAc transferase 6-like [Nasonia vitripennis]
Length = 615
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 166/356 (46%), Positives = 218/356 (61%), Gaps = 14/356 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GKA L + D G N S+ IS +R+IPD+R +CK Y
Sbjct: 83 EEKRTGTGEQGKAATLSPSMEDLKDRLYKVNGFNAALSDLISLNRSIPDIRHPDCKNKRY 142
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
DL SV++ FHNE FS+LMRT S+I R+P L EIILVDD S+K +L KL++Y+
Sbjct: 143 LKDLDPVSVVVSFHNEHFSTLMRTCWSVINRSPPSLLHEIILVDDASTKVELKDKLDEYV 202
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
++ KV+++R R GLIR R GA+++ +++VFLD+H E +NWLPPLL PI D K
Sbjct: 203 KKNLPKVKIVRLPRRSGLIRGRLAGARKATAKILVFLDSHSEANVNWLPPLLEPIAKDYK 262
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
P ID I Y+T+E+R+ D RG F+W + YK L + K SEP+KSP
Sbjct: 263 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLKN---PSEPFKSP 316
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+ FF ELGGYDPGL +WGGE +ELSFKIW CGG + PCSR+GH+YR F
Sbjct: 317 VMAGGLFAISAKFFWELGGYDPGLDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKF 376
Query: 318 MPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
P+ N G +G + NYKRV E W DE + + Y R P +D GD++EQ
Sbjct: 377 PPFPNPG------RGDFLGKNYKRVAEVWMDE-YADFIYRRRPHLRAMDPGDLTEQ 425
>gi|417515619|gb|JAA53628.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Sus
scrofa]
Length = 506
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 156/331 (47%), Positives = 211/331 (63%), Gaps = 12/331 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D + E G N+ S+ IS +R++PD+R C Y LP S+I+ FHNEG+SSL+RT
Sbjct: 8 DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLEMLPNTSIIIPFHNEGWSSLLRT 67
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSR 161
VHS++ R+P + + EI+LVDDFS + L + LEDY+ F VR++R +REGLIRTR
Sbjct: 68 VHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALF-PNVRILRTKKREGLIRTRML 126
Query: 162 GAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPD 221
GA + G+VI FLD+HCE +NWLPPLL I +RK + P+ID ID+ +FR +
Sbjct: 127 GASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCPMIDVIDHD--DFRYETQAG 184
Query: 222 HHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGL 281
RG F+W M YK +P K S+P++SP AGGLFA+DR +F ELGGYDPGL
Sbjct: 185 DAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGGLFAVDRKWFWELGGYDPGL 242
Query: 282 LVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRV 341
+WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY G + N KRV
Sbjct: 243 EIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKVP------AGVSLARNLKRV 296
Query: 342 IETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
E W DE + + Y R P L GD++ Q
Sbjct: 297 AEVWMDE-YAEHIYQRRPEYRHLSAGDVAAQ 326
>gi|431896245|gb|ELK05661.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Pteropus alecto]
Length = 559
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 160/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHLLEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDI+
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDIAS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|291391573|ref|XP_002712184.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Oryctolagus cuniculus]
Length = 557
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 9/361 (2%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
G L ++ +EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E
Sbjct: 45 GELLELIKENQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLE 104
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L
Sbjct: 105 GCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLK 164
Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA
Sbjct: 165 LTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLA 224
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
+ P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS
Sbjct: 282 RTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSH 341
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHV+R PY F G +I N +R+ E W DE K +FY P + +D GD+
Sbjct: 342 VGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDV 396
Query: 370 S 370
S
Sbjct: 397 S 397
>gi|395846602|ref|XP_003795992.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Otolemur garnettii]
Length = 556
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 164/369 (44%), Positives = 229/369 (62%), Gaps = 13/369 (3%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R+E CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD
Sbjct: 96 SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L LE+Y++ + V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 156 ASERDFLKLTLENYVKNLDVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
E +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+E V CS +GHV+R PY F G +I N +R+ E W DE K +FY P
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387
Query: 362 MFLDMGDIS 370
+ +D GD+S
Sbjct: 388 VKVDYGDVS 396
>gi|224045872|ref|XP_002187347.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
[Taeniopygia guttata]
Length = 559
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 157/363 (43%), Positives = 226/363 (62%), Gaps = 9/363 (2%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
G++ P++ EGPGE GK +P+ + N+ S I+ +R++PD+R+E
Sbjct: 45 GDVPEPIQKPHEGPGEMGKPVVIPKEEQEKMKEMFKINQFNLMASEMIALNRSLPDVRLE 104
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
CK Y +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L
Sbjct: 105 GCKTKVYADNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLK 164
Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
+ LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA
Sbjct: 165 RPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPLLA 224
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
I +DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKADRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
+ P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS
Sbjct: 282 RTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSH 341
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHV+R PY F G +I N +R+ E W DE K +FY P +D GDI
Sbjct: 342 VGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDI 396
Query: 370 SEQ 372
S +
Sbjct: 397 SSR 399
>gi|344269062|ref|XP_003406374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Loxodonta africana]
Length = 559
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 160/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP LP+ SV++VFHNE +S+L+RTVHS++ R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDALPRTSVVIVFHNEAWSTLLRTVHSVLNRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|395749824|ref|XP_002828218.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Pongo abelii]
Length = 612
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 159/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|1136285|gb|AAC50327.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 559
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 159/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|57530428|ref|NP_001006381.1| polypeptide N-acetylgalactosaminyltransferase 1 [Gallus gallus]
gi|326917238|ref|XP_003204908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Meleagris gallopavo]
gi|53133506|emb|CAG32082.1| hypothetical protein RCJMB04_17f16 [Gallus gallus]
Length = 559
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 157/363 (43%), Positives = 226/363 (62%), Gaps = 9/363 (2%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
G++ P++ EGPGE GK +P+ + N+ S I+ +R++PD+R+E
Sbjct: 45 GDVPEPIQKPHEGPGEMGKPVVIPKEEQEKMKEMFKINQFNLMASEMIALNRSLPDVRLE 104
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
CK Y +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L
Sbjct: 105 GCKTKVYADNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLK 164
Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
+ LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA
Sbjct: 165 RPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPLLA 224
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
I +DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKADRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
+ P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS
Sbjct: 282 RTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSH 341
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHV+R PY F G +I N +R+ E W DE K +FY P +D GDI
Sbjct: 342 VGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDI 396
Query: 370 SEQ 372
S +
Sbjct: 397 SSR 399
>gi|390464496|ref|XP_003733230.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Callithrix jacchus]
Length = 561
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 160/359 (44%), Positives = 224/359 (62%), Gaps = 9/359 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
L + +EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E C
Sbjct: 46 LRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGC 105
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L
Sbjct: 106 KTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 165
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I
Sbjct: 166 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 282
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +G
Sbjct: 283 LPVRTPTMAGGLFSIDRTYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVG 342
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
HV+R PY F G +I N +R+ E W DE K +FY P + +D GD+S
Sbjct: 343 HVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDVS 396
>gi|13124891|ref|NP_065207.2| polypeptide N-acetylgalactosaminyltransferase 1 [Homo sapiens]
gi|386780838|ref|NP_001247531.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|332225596|ref|XP_003261968.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Nomascus leucogenys]
gi|332849764|ref|XP_001135802.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Pan troglodytes]
gi|397520346|ref|XP_003830280.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Pan
paniscus]
gi|426385782|ref|XP_004059381.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Gorilla
gorilla gorilla]
gi|1709558|sp|Q10472.1|GALT1_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|971459|emb|CAA59380.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase [Homo
sapiens]
gi|119621764|gb|EAX01359.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1), isoform
CRA_a [Homo sapiens]
gi|119621765|gb|EAX01360.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1), isoform
CRA_a [Homo sapiens]
gi|261861328|dbj|BAI47186.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [synthetic
construct]
gi|355701910|gb|EHH29263.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|355754989|gb|EHH58856.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Macaca
fascicularis]
gi|380784241|gb|AFE63996.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|383411871|gb|AFH29149.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|384942418|gb|AFI34814.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|410258728|gb|JAA17331.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
troglodytes]
gi|410292416|gb|JAA24808.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
troglodytes]
gi|410338657|gb|JAA38275.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
troglodytes]
Length = 559
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 159/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|348576706|ref|XP_003474127.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Cavia porcellus]
Length = 559
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 159/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|158259585|dbj|BAF85751.1| unnamed protein product [Homo sapiens]
Length = 559
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 159/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|403258987|ref|XP_003922020.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Saimiri boliviensis boliviensis]
Length = 556
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 161/362 (44%), Positives = 225/362 (62%), Gaps = 9/362 (2%)
Query: 10 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
L L + +EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+
Sbjct: 43 LPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRL 102
Query: 70 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
E CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L
Sbjct: 103 EGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASEREFL 162
Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLL
Sbjct: 163 KLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLL 222
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
A I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 ARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKG 279
Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS
Sbjct: 280 DRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCS 339
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
+GHV+R PY F G +I N +R+ E W DE K +FY P + +D GD
Sbjct: 340 HVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGD 394
Query: 369 IS 370
+S
Sbjct: 395 VS 396
>gi|1582794|prf||2119305A UDP-GalNAc/polypeptide N-acetylgalactosaminyltransferase
Length = 559
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 159/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLDFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|27530993|dbj|BAC54545.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Homo sapiens]
gi|193785960|dbj|BAG54747.1| unnamed protein product [Homo sapiens]
Length = 556
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R+E CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD
Sbjct: 96 SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
E +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+E V CS +GHV+R PY F G +I N +R+ E W DE K +FY P
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387
Query: 362 MFLDMGDIS 370
+ +D GD+S
Sbjct: 388 VKVDYGDVS 396
>gi|301766697|ref|XP_002918769.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 1 [Ailuropoda melanoleuca]
Length = 556
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R+E CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD
Sbjct: 96 SLPDVRLEGCKTKIYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDD 155
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
E +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+E V CS +GHV+R PY F G +I N +R+ E W DE K +FY P
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387
Query: 362 MFLDMGDIS 370
+ +D GD+S
Sbjct: 388 VKVDYGDVS 396
>gi|296204781|ref|XP_002749478.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Callithrix jacchus]
Length = 556
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 161/362 (44%), Positives = 225/362 (62%), Gaps = 9/362 (2%)
Query: 10 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
L L + +EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+
Sbjct: 43 LPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRL 102
Query: 70 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
E CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L
Sbjct: 103 EGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFL 162
Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLL
Sbjct: 163 KLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLL 222
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
A I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 ARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKG 279
Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS
Sbjct: 280 DRTLPVRTPTMAGGLFSIDRTYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCS 339
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
+GHV+R PY F G +I N +R+ E W DE K +FY P + +D GD
Sbjct: 340 HVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGD 394
Query: 369 IS 370
+S
Sbjct: 395 VS 396
>gi|332251760|ref|XP_003275017.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Nomascus leucogenys]
Length = 556
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R+E CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD
Sbjct: 96 SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
E +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+E V CS +GHV+R PY F G +I N +R+ E W DE K +FY P
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387
Query: 362 MFLDMGDIS 370
+ +D GD+S
Sbjct: 388 VKVDYGDVS 396
>gi|387017208|gb|AFJ50722.1| Polypeptide N-acetylgalactosaminyltransferase 13-like [Crotalus
adamanteus]
Length = 556
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 160/362 (44%), Positives = 226/362 (62%), Gaps = 9/362 (2%)
Query: 10 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
L L + +EGPGE GKA +P+ + N+ S+ I+F+R++PD+R+
Sbjct: 43 LPALRAVMSRSQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDMIAFNRSLPDVRL 102
Query: 70 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
E CK YP +LP SV++VFHNE +S+L+RT++S++ R+P L EIILVDD S + L
Sbjct: 103 EGCKTKVYPDELPTTSVVIVFHNEAWSTLLRTIYSVMNRSPHYLLSEIILVDDASERDFL 162
Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
LE+Y++ V++IR +R GLIR R RGA S+G+VI FLDAHCE WL PLL
Sbjct: 163 KLPLENYVRNLQVPVKIIRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTTGWLEPLL 222
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
A I DRKI+ P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 ARIKEDRKIVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKG 279
Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS
Sbjct: 280 DRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCS 339
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
+GHV+R PY F G +I N +R+ E W DE K +FY P + +D GD
Sbjct: 340 HVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGD 394
Query: 369 IS 370
+S
Sbjct: 395 VS 396
>gi|116003987|ref|NP_001070354.1| polypeptide N-acetylgalactosaminyltransferase 13 [Bos taurus]
gi|115304963|gb|AAI23663.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Bos
taurus]
gi|296490573|tpg|DAA32686.1| TPA: polypeptide N-acetylgalactosaminyltransferase 13 [Bos taurus]
Length = 556
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R+E CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD
Sbjct: 96 SLPDVRLEGCKTRVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
E +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+E V CS +GHV+R PY F G +I N +R+ E W DE K +FY P
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387
Query: 362 MFLDMGDIS 370
+ +D GD+S
Sbjct: 388 VKVDYGDVS 396
>gi|145309313|ref|NP_443149.2| polypeptide N-acetylgalactosaminyltransferase 13 [Homo sapiens]
gi|114581261|ref|XP_515839.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Pan troglodytes]
gi|297668636|ref|XP_002812536.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Pongo abelii]
gi|297668638|ref|XP_002812537.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Pongo abelii]
gi|397525640|ref|XP_003832767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Pan
paniscus]
gi|116242497|sp|Q8IUC8.2|GLT13_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
AltName: Full=Polypeptide GalNAc transferase 13;
Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 13;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 13
gi|51490969|emb|CAD44533.2| polypeptide N-acetylgalactosaminyltransferase 13 [Homo sapiens]
gi|71680339|gb|AAI01032.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
sapiens]
gi|71681791|gb|AAI01034.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
sapiens]
gi|115528820|gb|AAI01035.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
sapiens]
gi|119631869|gb|EAX11464.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
isoform CRA_a [Homo sapiens]
gi|119631870|gb|EAX11465.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
isoform CRA_a [Homo sapiens]
gi|380783281|gb|AFE63516.1| polypeptide N-acetylgalactosaminyltransferase 13 [Macaca mulatta]
Length = 556
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R+E CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD
Sbjct: 96 SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
E +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+E V CS +GHV+R PY F G +I N +R+ E W DE K +FY P
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387
Query: 362 MFLDMGDIS 370
+ +D GD+S
Sbjct: 388 VKVDYGDVS 396
>gi|332030162|gb|EGI69956.1| N-acetylgalactosaminyltransferase 6 [Acromyrmex echinatior]
Length = 603
Score = 307 bits (787), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 165/356 (46%), Positives = 220/356 (61%), Gaps = 14/356 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK L + + G N S+ IS +R++PD+R +C+ Y
Sbjct: 78 EEKRTGIGEHGKPAFLSPSLDVLKEKLYQVNGFNAAVSDEISMNRSVPDIRHPDCRKKKY 137
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
+L SVI+ FHNE FS+L+RT S++ R+P LEEIILVDD S+K +L +KL+DY+
Sbjct: 138 LKNLDPISVIVSFHNEHFSTLLRTCWSVVNRSPPSLLEEIILVDDASTKIELKKKLDDYV 197
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
+ KV ++R ++R GLIR R GAK++R +V+VFLD+H E +NWLPPLL PI + K
Sbjct: 198 AQHLPKVSIVRLSKRSGLIRGRLAGAKKARAKVLVFLDSHSEANVNWLPPLLEPIAQNYK 257
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
P ID I Y+T+E+ + D RG F+W + YK L + K+ +EP+KSP
Sbjct: 258 TCVCPFIDVIAYETFEYIA---QDEGSRGAFDWELYYKRLPLLPEDLKR---PTEPFKSP 311
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+ FF ELGGYDPGL +WGGE +ELSFKIW CGG + PCSR+GHVYR F
Sbjct: 312 IMAGGLFAISAKFFWELGGYDPGLDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHVYRKF 371
Query: 318 MPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
P+ N G +G + N+KRV E W DE + Y Y R P LD GD+SEQ
Sbjct: 372 PPFPNPG------RGDFLGKNFKRVAEVWMDE-YAEYLYKRRPHLRTLDPGDLSEQ 420
>gi|26337335|dbj|BAC32353.1| unnamed protein product [Mus musculus]
Length = 556
Score = 307 bits (787), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R+E CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD
Sbjct: 96 SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 156 ASERDFLKLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
E +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+E V CS +GHV+R PY F G +I N +R+ E W DE K +FY P
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387
Query: 362 MFLDMGDIS 370
+ +D GD+S
Sbjct: 388 VKVDYGDVS 396
>gi|76677928|ref|NP_766618.2| polypeptide N-acetylgalactosaminyltransferase 13 [Mus musculus]
gi|51315989|sp|Q8CF93.1|GLT13_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
AltName: Full=Polypeptide GalNAc transferase 13;
Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 13;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 13
gi|27531011|dbj|BAC54546.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Mus musculus]
gi|124297181|gb|AAI31652.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Mus musculus]
gi|124297498|gb|AAI31653.1| Galnt13 protein [Mus musculus]
gi|148694972|gb|EDL26919.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
musculus]
gi|148694973|gb|EDL26920.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
musculus]
gi|148694975|gb|EDL26922.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
musculus]
Length = 556
Score = 307 bits (787), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R+E CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD
Sbjct: 96 SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 156 ASERDFLKLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
E +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+E V CS +GHV+R PY F G +I N +R+ E W DE K +FY P
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387
Query: 362 MFLDMGDIS 370
+ +D GD+S
Sbjct: 388 VKVDYGDVS 396
>gi|40018588|ref|NP_954537.1| polypeptide N-acetylgalactosaminyltransferase 13 [Rattus
norvegicus]
gi|51315705|sp|Q6UE39.1|GLT13_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
AltName: Full=Polypeptide GalNAc transferase 13;
Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 13;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 13
gi|34577141|gb|AAQ75749.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Rattus norvegicus]
gi|149047803|gb|EDM00419.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a
[Rattus norvegicus]
gi|149047804|gb|EDM00420.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a
[Rattus norvegicus]
gi|149047805|gb|EDM00421.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a
[Rattus norvegicus]
Length = 556
Score = 307 bits (787), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R+E CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD
Sbjct: 96 SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 156 ASERDFLKLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
E +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+E V CS +GHV+R PY F G +I N +R+ E W DE K +FY P
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387
Query: 362 MFLDMGDIS 370
+ +D GD+S
Sbjct: 388 VKVDYGDVS 396
>gi|426221079|ref|XP_004004739.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Ovis
aries]
Length = 556
Score = 307 bits (787), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R+E CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD
Sbjct: 96 SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
E +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+E V CS +GHV+R PY F G +I N +R+ E W DE K +FY P
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387
Query: 362 MFLDMGDIS 370
+ +D GD+S
Sbjct: 388 VKVDYGDVS 396
>gi|115528959|gb|AAI01033.1| GALNT13 protein [Homo sapiens]
gi|355564904|gb|EHH21393.1| hypothetical protein EGK_04446 [Macaca mulatta]
Length = 561
Score = 307 bits (787), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R+E CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD
Sbjct: 96 SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
E +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+E V CS +GHV+R PY F G +I N +R+ E W DE K +FY P
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387
Query: 362 MFLDMGDIS 370
+ +D GD+S
Sbjct: 388 VKVDYGDVS 396
>gi|281347645|gb|EFB23229.1| hypothetical protein PANDA_007284 [Ailuropoda melanoleuca]
Length = 516
Score = 307 bits (787), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 159/351 (45%), Positives = 222/351 (63%), Gaps = 9/351 (2%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 9 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKIYPDE 68
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 69 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERDFLKLTLENYVKNL 128
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 129 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 188
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + + P ++PT
Sbjct: 189 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTM 245
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R P
Sbjct: 246 AGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATP 305
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
Y F G +I N +R+ E W DE K +FY P + +D GD+S
Sbjct: 306 YTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDVS 351
>gi|402902957|ref|XP_003914352.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Papio
anubis]
Length = 559
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 159/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LERYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|350409603|ref|XP_003488790.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like [Bombus
impatiens]
Length = 610
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 167/353 (47%), Positives = 217/353 (61%), Gaps = 14/353 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GK L + A + G N S+ IS +R++PD+R +CK Y +
Sbjct: 88 RTGIGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISMNRSVPDIRHPDCKKKKYLRN 147
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
L SVI+ FHNE FS+LMRT S+I R+PA L+EIILVDD S+K +L + LEDYI
Sbjct: 148 LDSVSVIVSFHNEHFSTLMRTCWSVINRSPAFLLKEIILVDDASTKVELKKPLEDYITEH 207
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
KV+++R ER GLI+ R GAK ++ +V+VFLD+H E +NWLPPLL PI D K
Sbjct: 208 LTKVKIVRLEERSGLIKGRLAGAKIAKAKVLVFLDSHSEANVNWLPPLLEPIAQDYKTCV 267
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P ID I Y+T+E+R+ D RG F+W + YK L + + +EP+KSP A
Sbjct: 268 CPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN---PTEPFKSPVMA 321
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+ FF ELGGYDP L +WGGE +ELSFKIW CGG + PCSR+GH+YR F P+
Sbjct: 322 GGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKFPPF 381
Query: 321 -NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
N G KG + NYKRV E W DE + Y YTR P L+ G++ EQ
Sbjct: 382 PNPG------KGDFLGKNYKRVAEVWMDE-YAEYIYTRRPHLRSLNPGNLKEQ 427
>gi|15620895|dbj|BAB67811.1| KIAA1918 protein [Homo sapiens]
Length = 516
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 159/351 (45%), Positives = 222/351 (63%), Gaps = 9/351 (2%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 14 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE 73
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 74 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNL 133
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 134 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 193
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + + P ++PT
Sbjct: 194 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTM 250
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R P
Sbjct: 251 AGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATP 310
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
Y F G +I N +R+ E W DE K +FY P + +D GD+S
Sbjct: 311 YTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDVS 356
>gi|148694974|gb|EDL26921.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_b [Mus
musculus]
Length = 594
Score = 307 bits (786), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 38 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 97
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R+E CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD
Sbjct: 98 SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 157
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 158 ASERDFLKLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 217
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 218 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 274
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
E +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 275 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 334
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+E V CS +GHV+R PY F G +I N +R+ E W DE K +FY P
Sbjct: 335 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 389
Query: 362 MFLDMGDIS 370
+ +D GD+S
Sbjct: 390 VKVDYGDVS 398
>gi|126320794|ref|XP_001362869.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
[Monodelphis domestica]
Length = 559
Score = 307 bits (786), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 160/361 (44%), Positives = 225/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LE +P+ EGPGE GK +P+ + N+ S I+ +RT+PD+R+E C
Sbjct: 48 LETVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRTLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVRKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KVDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRHYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDIST 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|26332527|dbj|BAC29981.1| unnamed protein product [Mus musculus]
Length = 592
Score = 307 bits (786), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R+E CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD
Sbjct: 96 SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 156 ASERDFLKLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
E +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+E V CS +GHV+R PY F G +I N +R+ E W DE K +FY P
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387
Query: 362 MFLDMGDIS 370
+ +D GD+S
Sbjct: 388 VKVDYGDVS 396
>gi|417402739|gb|JAA48205.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 559
Score = 306 bits (785), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 158/361 (43%), Positives = 225/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+ R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVTDRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGASVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KQDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GD++
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDVAS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|313230315|emb|CBY08019.1| unnamed protein product [Oikopleura dioica]
Length = 589
Score = 306 bits (784), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 167/354 (47%), Positives = 224/354 (63%), Gaps = 15/354 (4%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK L + ++ + G N+ S+ IS DR++ D+R CK Y
Sbjct: 92 EAARTGLGEQGKPVTLFGHEKL--HSAYKDNGFNILVSDRISLDRSLHDIRHASCKSKKY 149
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
DLP SVI+ FHNEG S+L+RT+HS+ R+P L+EI+LVDD SS+ L ++LE +
Sbjct: 150 YSDLPDVSVIIPFHNEGLSTLLRTIHSLHNRSPESLLKEIVLVDDASSRP-LYKELESSL 208
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
+F KV+LIRN R+GLIR+R RG ++G V+V LD+H EV NWLPPLL PI DRK
Sbjct: 209 AKF-PKVKLIRNPTRQGLIRSRVRGVHLAKGGVVVILDSHVEVSTNWLPPLLHPISLDRK 267
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID + +++ V +P RG F+W + YK +P K+ K SEP++SP
Sbjct: 268 TVVCPMIDIIDNENFQY--VTQPGDAMRGAFDWELYYKRIPIPNE--KRPKDPSEPFESP 323
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA++R +F E+G YD GL +WGGE +ELSFK+WMCGG I PCSRIGH+YR F
Sbjct: 324 VMAGGLFAIERNYFYEIGLYDEGLEIWGGEQYELSFKVWMCGGRILDSPCSRIGHIYRKF 383
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
+PY GP YNYKRV E W DE + +FY R P +D GD+S+
Sbjct: 384 VPYTIPNNG----GP--NYNYKRVAEVWMDE-YAEFFYRRRPYVRKIDAGDLSK 430
>gi|431894826|gb|ELK04619.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Pteropus alecto]
Length = 519
Score = 306 bits (784), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 159/351 (45%), Positives = 221/351 (62%), Gaps = 9/351 (2%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP
Sbjct: 17 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDQ 76
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 77 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNL 136
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 137 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 196
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + + P ++PT
Sbjct: 197 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTM 253
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R P
Sbjct: 254 AGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATP 313
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
Y F G +I N +R+ E W DE K +FY P + +D GD+S
Sbjct: 314 YTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDVS 359
>gi|13242273|ref|NP_077349.1| polypeptide N-acetylgalactosaminyltransferase 1 [Rattus norvegicus]
gi|1709559|sp|Q10473.1|GALT1_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|1141792|gb|AAC52511.1| polypeptide GalNAc transferase [Rattus norvegicus]
gi|149017082|gb|EDL76133.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Rattus norvegicus]
gi|1587757|prf||2207253A UDP-GalNAc polypeptide N-acetylgalactosaminyltransferase
Length = 559
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 159/361 (44%), Positives = 225/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LE +P+ EGPGE GK +P+ + N+ S I+F+R++PD+R+E C
Sbjct: 48 LELVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIAFNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDSLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|449676829|ref|XP_002167311.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Hydra magnipapillata]
Length = 603
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 151/334 (45%), Positives = 214/334 (64%), Gaps = 4/334 (1%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
PGE G + E + ++ N S+ IS R++ D R ++CK YP+DLP
Sbjct: 107 PGELGTGVTVEENEKEKEKLGYEKHAFNQLVSDKISIHRSLKDYRNDQCKVKKYPVDLPP 166
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
SVI+ FHNE +S+L+RTVHS+I RTP QYL+EIILVDD S+ DL Q+L+DYI
Sbjct: 167 TSVIICFHNEAWSTLLRTVHSVINRTPPQYLKEIILVDDASTSDDLKQRLDDYIPNLK-I 225
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
V ++R +R+GLIR R GAK+++G ++ FLDAHCE L W PLLA I DR+ + +PV
Sbjct: 226 VSIVRLRDRQGLIRARLEGAKKAKGPILTFLDAHCECTLGWAEPLLAKIKEDRQNVVMPV 285
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 263
ID I + + +V EP RG+F+W + + +P E ++RK+ S+ K+P AGGL
Sbjct: 286 IDEISETNFNYNAVPEP--FQRGVFKWRLEFTWRPIPSYEEQRRKHESDGIKTPVMAGGL 343
Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
F+++R +F E+G YD G+ +WGGEN E+SF+IWMCGGSIE +PCSR+GHV+R PY+F
Sbjct: 344 FSINRDYFYEMGSYDTGMDIWGGENIEISFRIWMCGGSIEMLPCSRVGHVFRPRFPYSFP 403
Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
G +++ N RV + W DE K ++ R
Sbjct: 404 NRRGG-DGDVVSRNLMRVADVWMDEYAKHFYNIR 436
>gi|33440465|gb|AAH56215.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Mus musculus]
Length = 559
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 159/361 (44%), Positives = 224/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LE +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LELVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKTNQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA SRG+VI FLDAHCE WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVITFLDAHCECTAGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|327281383|ref|XP_003225428.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 1 [Anolis carolinensis]
Length = 556
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 159/362 (43%), Positives = 225/362 (62%), Gaps = 9/362 (2%)
Query: 10 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
L L + +EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+
Sbjct: 43 LPALRAVMSRSQEGPGEMGKAVIIPKDDQEKMKELFKINQFNLMASDMIALNRSLPDVRL 102
Query: 70 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
E CK YP +LP SV++VFHNE +S+L+RT++S+I R P L EIILVDD S + L
Sbjct: 103 EGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTIYSVINRAPHYLLAEIILVDDASERDFL 162
Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
LE+Y++ V+++R +R GLIR R RGA S+G+VI FLDAHCE L WL PLL
Sbjct: 163 KVPLENYVKTLQVPVKIMRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLL 222
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
A I DRKI+ P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 ARIKEDRKIVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKG 279
Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS
Sbjct: 280 DRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCS 339
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
+GHV+R PY F G +I N +R+ E W DE K +FY P + +D GD
Sbjct: 340 HVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGD 394
Query: 369 IS 370
++
Sbjct: 395 VT 396
>gi|156397428|ref|XP_001637893.1| predicted protein [Nematostella vectensis]
gi|156225009|gb|EDO45830.1| predicted protein [Nematostella vectensis]
Length = 398
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 157/326 (48%), Positives = 212/326 (65%), Gaps = 14/326 (4%)
Query: 48 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
Y N S+ ++ DR+IPD R + C YP LP ASVI++FHNE +S+L+RTVHS++
Sbjct: 13 YQFNELASSKVALDRSIPDNRPQSCLSLSYPTKLPTASVIIIFHNEAWSTLLRTVHSVLA 72
Query: 108 RTPAQYLEEIILVDDFS---SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
R+P L EI+LVDD S + L KLE YI +F KV+LIR +REGLIR R GAK
Sbjct: 73 RSPPYLLREIVLVDDHSRLDTYGHLGSKLESYISQFT-KVQLIRAPKREGLIRARLIGAK 131
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+++GEV+VFLD+HCE L WL PLLA I +R I+ P I+ ID +T F +E +
Sbjct: 132 QAKGEVLVFLDSHCEANLGWLEPLLARIGENRSIVVTPDIEVIDLRT--FGYTHEHGANN 189
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RGIF W + +K +PE E ++RK +S+P +SPT AGGLFA+D+++F E+G YD + W
Sbjct: 190 RGIFNWELTFKWRGIPEYERRRRKSDSDPIRSPTMAGGLFAIDKSYFYEIGSYDTEMSFW 249
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
GGEN E+SF+IWMCGGS+E +PCS++GHV+R PY G+ A I N R+ E
Sbjct: 250 GGENVEISFRIWMCGGSLEIIPCSKVGHVFRESQPYKIGEGA-------IDRNNMRLAEV 302
Query: 345 WFDEKHKAYFYTREPLAMFLDMGDIS 370
W D+ +K FY P D GD+S
Sbjct: 303 WMDD-YKKIFYAMRPQLKGKDYGDVS 327
>gi|74004307|ref|XP_855648.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
3 [Canis lupus familiaris]
Length = 556
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 163/369 (44%), Positives = 227/369 (61%), Gaps = 13/369 (3%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R+E CK Y +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD
Sbjct: 96 SLPDVRLEGCKTKVYADELPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDD 155
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
E +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+E V CS +GHV+R PY F G +I N +R+ E W DE K +FY P
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387
Query: 362 MFLDMGDIS 370
+ +D GD+S
Sbjct: 388 VKVDYGDVS 396
>gi|351714454|gb|EHB17373.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Heterocephalus
glaber]
Length = 559
Score = 305 bits (781), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 158/361 (43%), Positives = 225/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVIIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMVEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DR+ + P+I I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KQDRRTVVCPIICVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|126326410|ref|XP_001373038.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Monodelphis domestica]
Length = 556
Score = 304 bits (779), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 159/362 (43%), Positives = 223/362 (61%), Gaps = 9/362 (2%)
Query: 10 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
L L + +EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+
Sbjct: 43 LPALRAVISRNQEGPGEMGKAVRIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRL 102
Query: 70 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
E CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L EIILVDD S + L
Sbjct: 103 EGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEIILVDDASERDFL 162
Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
LE+Y++ V++IR +R GLIR R RGA S+G+VI FLDAHCE L WL PLL
Sbjct: 163 KMALENYVKNLEVPVKIIRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLL 222
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
A I RK + P+ID I +E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 ARIKESRKTVVCPIIDLISDDNFEYTA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKG 279
Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS
Sbjct: 280 DRTLPVRTPTMAGGLFSIDRNYFEEIGAYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCS 339
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
+GHV+R PY F G +I N +R+ E W DE K +FY P + +D GD
Sbjct: 340 HVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGD 394
Query: 369 IS 370
+S
Sbjct: 395 VS 396
>gi|237874259|ref|NP_038842.3| polypeptide N-acetylgalactosaminyltransferase 1 [Mus musculus]
gi|237874270|ref|NP_001153876.1| polypeptide N-acetylgalactosaminyltransferase 1 [Mus musculus]
gi|13878613|sp|O08912.1|GALT1_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|2149049|gb|AAB58477.1| polypeptide GalNAc transferase-T1 [Mus musculus]
gi|60552620|gb|AAH90962.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Mus musculus]
Length = 559
Score = 304 bits (779), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 159/361 (44%), Positives = 224/361 (62%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LE +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LELVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA SRG+VI FLDAHCE WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVITFLDAHCECTAGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R PY F G +I N +R+ E W DE K +FY P +D GDIS
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398
Query: 372 Q 372
+
Sbjct: 399 R 399
>gi|432932493|ref|XP_004081766.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 1 [Oryzias latipes]
Length = 557
Score = 304 bits (779), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 155/363 (42%), Positives = 225/363 (61%), Gaps = 9/363 (2%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
G + + EGPGE GKA ++ + + N+ S+ I+ +R++PD+R++
Sbjct: 45 GQVVTVISRSHEGPGEMGKAVNIAKDDQEKMKELFKINQFNLMASDMIALNRSLPDVRLD 104
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
CK Y DLP S+++VFHNE +S+L+RTVHS+I R+P L EI+LVDD S + L
Sbjct: 105 GCKTKVYADDLPTTSIVIVFHNEAWSTLLRTVHSVISRSPRHLLVEIVLVDDASERDFLK 164
Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
+KLE Y++ V+++R +R GLIR R RGA + G+VI FLDAHCE WL PLLA
Sbjct: 165 KKLEGYVRTLEVPVKILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTEGWLEPLLA 224
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
I DR + P+ID I +T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKEDRTAVVCPIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
+ P ++PT AGGLF++D+ +F E+G YDPG+ +WGGEN E+SF+IW CGGS+E V CS
Sbjct: 282 RTLPVRTPTMAGGLFSIDKMYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTCSH 341
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHV+R PY+F G +I N +R+ E W DE K +FY P M +D GD+
Sbjct: 342 VGHVFRKATPYSFPGGT----GQVINKNNRRLAEVWMDE-FKDFFYIISPGVMRVDYGDV 396
Query: 370 SEQ 372
S +
Sbjct: 397 SSR 399
>gi|327275061|ref|XP_003222292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Anolis carolinensis]
Length = 559
Score = 304 bits (779), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 157/363 (43%), Positives = 226/363 (62%), Gaps = 9/363 (2%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
G++ ++ EGPGE GK +P+ + N+ S I+ +R++PD+R+E
Sbjct: 45 GDVPELVQKPHEGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLE 104
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
CK Y +LP SV++VFHNE +S+L+RTVHS+I R+P LEEIILVDD S + L
Sbjct: 105 GCKTKVYSDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHILEEIILVDDASERDFLK 164
Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
+ LE+Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA
Sbjct: 165 RLLENYVKKLQIPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPLLA 224
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
I +DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKADRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
+ P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS
Sbjct: 282 RTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSH 341
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHV+R PY F G +I N +R+ E W DE K +FY P +D GDI
Sbjct: 342 VGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDI 396
Query: 370 SEQ 372
S +
Sbjct: 397 SSR 399
>gi|443727149|gb|ELU14019.1| hypothetical protein CAPTEDRAFT_197005 [Capitella teleta]
Length = 613
Score = 304 bits (778), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 158/358 (44%), Positives = 219/358 (61%), Gaps = 15/358 (4%)
Query: 17 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
+E + GPGE G A L DA G N S+ IS R++ D+R +C+
Sbjct: 86 IEKQRTGPGEQGAAVILSSDEEKKKDALYKVNGFNGFASDKISLQRSLKDIRHPQCRTQK 145
Query: 77 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
Y LP SV++ FHNE +S+L+RT S++ R+P + + EIILVDDFSSK + L+D+
Sbjct: 146 YWNKLPTVSVVVPFHNEHWSTLLRTAESVLVRSPPELIHEIILVDDFSSKEHCGKPLDDH 205
Query: 137 I-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
+ + GKV++I +REGLIRTR GA+E+ G+V++FLD+HCE +NWLPPLL PI D
Sbjct: 206 LATHYGGKVKVIHQPKREGLIRTRLAGAREATGDVLIFLDSHCEANVNWLPPLLDPIAED 265
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKKRKYNSEPY 254
+ + P ID +DY+T+ +R+ D RG F+W YK L PE K+ + P+
Sbjct: 266 YRTVVCPFIDVVDYETFAYRA---QDEGARGAFDWEFFYKRLPLLPE----DLKHPARPF 318
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
KSP AGGLFA+ +F ELGGYDPGL +WGGE +ELSFK+W CGG + PCSR+GH+Y
Sbjct: 319 KSPVMAGGLFAISAKWFWELGGYDPGLDIWGGEQYELSFKLWQCGGQMLDAPCSRVGHIY 378
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
R F P+ + D + NY+RV E W DE + + Y R P + G+I+EQ
Sbjct: 379 RKFAPFPNPGVGD-----FVGRNYRRVAEVWMDE-YAEFLYKRRPQYRSIQPGNITEQ 430
>gi|432932495|ref|XP_004081767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 2 [Oryzias latipes]
Length = 556
Score = 304 bits (778), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 154/352 (43%), Positives = 222/352 (63%), Gaps = 9/352 (2%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
EGPGE GKA ++ + + N+ S+ I+ +R++PD+R++ CK Y DL
Sbjct: 55 EGPGEMGKAVNIAKDDQEKMKELFKINQFNLMASDMIALNRSLPDVRLDGCKTKVYADDL 114
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P S+++VFHNE +S+L+RTVHS+I R+P L EI+LVDD S + L +KLE Y++
Sbjct: 115 PTTSIVIVFHNEAWSTLLRTVHSVISRSPRHLLVEIVLVDDASERDFLKKKLEGYVRTLE 174
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
V+++R +R GLIR R RGA + G+VI FLDAHCE WL PLLA I DR +
Sbjct: 175 VPVKILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTEGWLEPLLARIKEDRTAVVC 234
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
P+ID I +T+E+ + D Y G F W + ++ +P+RE +RK + + P ++PT A
Sbjct: 235 PIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMA 291
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLF++D+ +F E+G YDPG+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R PY
Sbjct: 292 GGLFSIDKMYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPY 351
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+F G +I N +R+ E W DE K +FY P M +D GD+S +
Sbjct: 352 SFPGGT----GQVINKNNRRLAEVWMDE-FKDFFYIISPGVMRVDYGDVSSR 398
>gi|432932497|ref|XP_004081768.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 3 [Oryzias latipes]
Length = 558
Score = 303 bits (777), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 154/352 (43%), Positives = 222/352 (63%), Gaps = 9/352 (2%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
EGPGE GKA ++ + + N+ S+ I+ +R++PD+R++ CK Y DL
Sbjct: 57 EGPGEMGKAVNIAKDDQEKMKELFKINQFNLMASDMIALNRSLPDVRLDGCKTKVYADDL 116
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P S+++VFHNE +S+L+RTVHS+I R+P L EI+LVDD S + L +KLE Y++
Sbjct: 117 PTTSIVIVFHNEAWSTLLRTVHSVISRSPRHLLVEIVLVDDASERDFLKKKLEGYVRTLE 176
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
V+++R +R GLIR R RGA + G+VI FLDAHCE WL PLLA I DR +
Sbjct: 177 VPVKILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTEGWLEPLLARIKEDRTAVVC 236
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
P+ID I +T+E+ + D Y G F W + ++ +P+RE +RK + + P ++PT A
Sbjct: 237 PIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMA 293
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLF++D+ +F E+G YDPG+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R PY
Sbjct: 294 GGLFSIDKMYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPY 353
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+F G +I N +R+ E W DE K +FY P M +D GD+S +
Sbjct: 354 SFPGGT----GQVINKNNRRLAEVWMDE-FKDFFYIISPGVMRVDYGDVSSR 400
>gi|198415713|ref|XP_002128877.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
1 [Ciona intestinalis]
Length = 573
Score = 303 bits (777), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 158/351 (45%), Positives = 217/351 (61%), Gaps = 9/351 (2%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
GPGE GKA +P+ N+ S I+ +R++PD+RME CK YP LP
Sbjct: 70 GPGEMGKAVIIPKDKEKEKQEKFKINQFNLMASEMIALNRSLPDVRMEGCKSKKYPEKLP 129
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+++VFHNE +S+L+RTVHSII R+P+ LEEIILVDD S + L LE Y+++
Sbjct: 130 TTSIVIVFHNEAWSTLLRTVHSIINRSPSHLLEEIILVDDASERDFLGAPLERYVRKLRT 189
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +R GLIR R RGA S G+VI FLDAHCE WL PLL+ I DR + P
Sbjct: 190 LVRVVRMEKRTGLIRARLRGASVSTGQVITFLDAHCECTEGWLEPLLSEIAKDRTTVVCP 249
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAG 261
+ID I +T+EF + D Y G F W + ++ +P+RE +RK + + P +SPT AG
Sbjct: 250 IIDVISDETFEF--MVGSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRSPTMAG 306
Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
GLF++D+++F ELG YD G+ +WGGEN E+SF+IW CGG++ V CS +GHV+R PY
Sbjct: 307 GLFSIDKSYFEELGTYDAGMDIWGGENLEISFRIWQCGGTLLIVTCSHVGHVFRKATPYT 366
Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
F G +I N +R+ E W D K +FY P + + GDISE+
Sbjct: 367 FPGGT----GQIINKNNRRLAEVWMDS-FKNFFYIITPGVLKQEYGDISER 412
>gi|226482458|emb|CAX73828.1| polypeptide GalNAc transferase 6 [Schistosoma japonicum]
Length = 603
Score = 303 bits (777), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 165/358 (46%), Positives = 221/358 (61%), Gaps = 13/358 (3%)
Query: 17 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
LE + GPGE G L + ++ E G ++ S I DR+I D+R CK
Sbjct: 70 LENSRVGPGENGMPVKLSTHEKKIAAKTINENGFSVYVSTKIKTDRSIKDIRHPNCKGKL 129
Query: 77 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
Y LP ASVI+ F E + +L+RTV S++ R P+ ++E+ILVDD SS+ L +L+ +
Sbjct: 130 YSNKLPTASVIIPFFEEHWETLLRTVASVLNRAPSALIKEVILVDDGSSREYLKDRLDSH 189
Query: 137 IQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
I +GKVR+I ER+GLIR ++ GAKE+ GEV++FLD+HCE G+NWLPPLL PI +
Sbjct: 190 IISAYPDGKVRVIHLKERQGLIRAKTAGAKEATGEVLIFLDSHCEAGINWLPPLLDPIAA 249
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
+ + + P ID ID +E+R+ D RG F+W + YK LP R + + EP+
Sbjct: 250 NYRTVVCPFIDVIDADNFEYRA---QDEGARGAFDWELYYKR--LP-RLPEDNHHPEEPF 303
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
SP AGGLFA+ +F ELGGYDPGL++WGGE +ELSFKIWMCGG + PCSRIGH+Y
Sbjct: 304 DSPVMAGGLFAISAKWFWELGGYDPGLVIWGGEQYELSFKIWMCGGRMIDTPCSRIGHIY 363
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
R + NF K G + NYKRV E W DE +K Y Y R P LD GD++EQ
Sbjct: 364 RKYST-NFPKSQ---LGDFVGRNYKRVAEVWMDE-YKEYLYKRRPSYRHLDPGDLTEQ 416
>gi|405950576|gb|EKC18555.1| Putative polypeptide N-acetylgalactosaminyltransferase 10
[Crassostrea gigas]
Length = 526
Score = 303 bits (776), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 159/350 (45%), Positives = 217/350 (62%), Gaps = 14/350 (4%)
Query: 24 PGEGGKAYHL-PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
PGE G+A L P+ + GD G N S+ IS R++ D+R +CK Y L
Sbjct: 13 PGEQGQALILSPDEEKKKGDL-YKVNGFNAYASDKISLHRSLKDIRHSDCKKKKYLNHLM 71
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
ASVI+ FHNE +S+L+RT S++ R+P + E+ILVDD+SSK Q L+DY++
Sbjct: 72 NASVIVPFHNEHWSTLLRTAWSVLNRSPKHLIHEVILVDDYSSKEHCKQPLDDYVKEHFT 131
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
V+++R +REGLIRTR GA+ + G+V++FLD+HCE +NWLPPLL PI D K + P
Sbjct: 132 NVKVVRAKKREGLIRTRLLGARAATGQVLIFLDSHCEANINWLPPLLEPIAEDYKTVVCP 191
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
ID ID++ + +R+ D RG F+W YK L E + K+ +EP+KSP AGG
Sbjct: 192 FIDVIDFENFAYRA---QDEGARGAFDWEFFYKRLPLLEEDL---KHPAEPFKSPVMAGG 245
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+ +F E+GGYDPGL +WGGE +ELSFK+W CGG + PCSRIGH+YR F P+
Sbjct: 246 LFAISAKWFWEMGGYDPGLDIWGGEQYELSFKLWQCGGMMVDAPCSRIGHIYRKFAPFPN 305
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ D + NY+RV E W DE + Y Y R P +D GD+SEQ
Sbjct: 306 PGVGD-----FVGRNYRRVAEVWMDE-YAEYLYKRRPHYRNIDPGDVSEQ 349
>gi|149639572|ref|XP_001511824.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Ornithorhynchus anatinus]
Length = 556
Score = 303 bits (776), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 161/369 (43%), Positives = 225/369 (60%), Gaps = 13/369 (3%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K +L P L +EGPGE GKA + + + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRSQEGPGEMGKAVLISKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R+E CK YP +LP V++VFHNE +S+L+RTV S+I R+P L E+ILVDD
Sbjct: 96 SLPDVRLEGCKTKIYPDELPNTRVVIVFHNEAWSTLLRTVFSVINRSPRSLLSEVILVDD 155
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L LE+Y++ + V++IR +R GLIR R RGA SRG+VI FLDAHCE
Sbjct: 156 ASERDFLKTSLENYVKNLDVPVKIIRMEQRSGLIRARLRGAAASRGQVITFLDAHCECTF 215
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
E +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+E V CS +GHV+R PY F G +I N +R+ E W DE K +FY P
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387
Query: 362 MFLDMGDIS 370
+ +D GD+S
Sbjct: 388 VKVDYGDVS 396
>gi|432098984|gb|ELK28470.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Myotis davidii]
Length = 501
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 156/348 (44%), Positives = 219/348 (62%), Gaps = 10/348 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KQDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
HV+R PY F G +I N +R+ E W DE K +FY P
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISP 386
>gi|71896287|ref|NP_001025547.1| polypeptide N-acetylgalactosaminyltransferase 1 [Xenopus (Silurana)
tropicalis]
gi|60649677|gb|AAH90583.1| galnt1 protein [Xenopus (Silurana) tropicalis]
Length = 452
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 154/352 (43%), Positives = 219/352 (62%), Gaps = 9/352 (2%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
EGPGE GK +P+ + N+ S I+ +R++PD+R+E CK YP L
Sbjct: 56 EGPGEMGKPVVIPKEEQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDSL 115
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SV++VFHNE +++L+RTVHS+I R+P L+EIILVDD S + L + LE Y+++
Sbjct: 116 PTTSVVIVFHNEAWTTLLRTVHSVINRSPRHLLQEIILVDDASEREFLKRPLETYVKKLT 175
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
V ++R +R GLIR R RGA S+G+VI FLDAHCE + WL PLLA I DR+ +
Sbjct: 176 VPVHVLRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTVGWLEPLLARIKHDRRTVVC 235
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
P+ID I T+E+ + D Y G F W + ++ +P+RE +R+ + + P ++PT A
Sbjct: 236 PIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRRGDRTLPVRTPTMA 292
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +GHV+R PY
Sbjct: 293 GGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKATPY 352
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
F G +I N +R+ E W DE K +FY P +D GDIS +
Sbjct: 353 TFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISTR 399
>gi|260789712|ref|XP_002589889.1| hypothetical protein BRAFLDRAFT_81982 [Branchiostoma floridae]
gi|229275074|gb|EEN45900.1| hypothetical protein BRAFLDRAFT_81982 [Branchiostoma floridae]
Length = 534
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 165/356 (46%), Positives = 220/356 (61%), Gaps = 18/356 (5%)
Query: 23 GPGEGGKAY-HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
GPGE G+ Y + E + LG G N S+ IS +R +PD R + CK YP L
Sbjct: 14 GPGEYGRPYVYTEEDNKRKSFGYLGN-GFNAHVSDKISVERALPDTRDQPCKDRLYPSRL 72
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI+ FHNE +S+L+RTVH +I RTP L E+ILVDDFSSK + + L +Y+ F
Sbjct: 73 PNVSVIIPFHNEHWSTLLRTVHGVIGRTPPHLLGEVILVDDFSSKENCGRPLNEYMATFP 132
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
+VR++R +REGLIR R RG + +RG V+VF+DAHCEV +NWLPPLL PI +T+
Sbjct: 133 -QVRILRMKQREGLIRARLRGVEVARGNVLVFMDAHCEVNVNWLPPLLEPISVSMTTVTI 191
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
P ID ID+ T+E++ + RG+F+W + YK +P + + RK + P+ +P
Sbjct: 192 PTIDVIDHATFEYKE--QQGGPMRGVFDWQLNYKR--IPVLDGRGRKVRPTLPFSTPVMP 247
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GG+FA+D+ FF LGGYD GL +WGGE FELSFKIW CGG ++ VPCSR+GHV+R F PY
Sbjct: 248 GGVFAIDKEFFHHLGGYDSGLEIWGGEQFELSFKIWQCGGVLQEVPCSRVGHVFRKFSPY 307
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR----EPLAMFLDMGDISEQ 372
A I NY RV E W D+ +K Y+Y R D+GD+S Q
Sbjct: 308 -----ATDNDVLQILKNYMRVAEVWMDD-YKQYYYKRMLRGPKNVTNFDLGDLSSQ 357
>gi|383863685|ref|XP_003707310.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like [Megachile
rotundata]
Length = 610
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 164/354 (46%), Positives = 220/354 (62%), Gaps = 16/354 (4%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GK L + + + G N S+ IS +R++PD+R +CK Y +
Sbjct: 88 RSGTGEHGKPAFLSPSLDSLKEKLYQVNGFNAALSDEISMNRSVPDIRHPDCKKKKYLKN 147
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
L SVI+ FHNE FS+LMRT S+I R+PA LEEIILVDD S+K +L ++L+DY+ +
Sbjct: 148 LDAVSVIVSFHNEHFSTLMRTCWSVINRSPASLLEEIILVDDASTKVELKKELDDYVAQR 207
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
KV++IR +R GLI+ R GAK ++ +V+VFLD+H E +NWLPPLL PI + +
Sbjct: 208 LPKVKIIRLPQRSGLIKGRLAGAKVAKAKVLVFLDSHSEANVNWLPPLLEPIAQNYRTCV 267
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTH 259
P ID I Y+T+E+R+ D RG F+W + YK LPE K+ + P+KSP
Sbjct: 268 CPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPE----DLKHPTLPFKSPVM 320
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLFA+ FF ELGGYDP L +WGGE +ELSFKIW CGG + PCSR+GH+YR F P
Sbjct: 321 AGGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGEMYDAPCSRVGHIYRKFPP 380
Query: 320 Y-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ N G KG + NYKRV E W DE + Y Y R P LD G++++Q
Sbjct: 381 FPNPG------KGDFLGKNYKRVAEVWMDE-YAEYIYRRRPHLRSLDPGNLTKQ 427
>gi|196001853|ref|XP_002110794.1| hypothetical protein TRIADDRAFT_23130 [Trichoplax adhaerens]
gi|190586745|gb|EDV26798.1| hypothetical protein TRIADDRAFT_23130 [Trichoplax adhaerens]
Length = 536
Score = 302 bits (774), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 157/361 (43%), Positives = 221/361 (61%), Gaps = 13/361 (3%)
Query: 16 PLEPYKEGP---GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
P P+ P GE G++ +P+ +A D +G N S+H+S RT+PDLR C
Sbjct: 22 PTLPHNFNPNAIGENGESVIVPDKAKAESDKLFKNHGFNQWASDHMSLHRTLPDLRPSLC 81
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K +P DLP+ SV++VFHNE S+L+RTVHS++ R+ + +IILVDDFSS D
Sbjct: 82 KSQVFPKDLPQTSVVIVFHNEALSTLLRTVHSVLDRSAPDLIHQIILVDDFSSIKGHD-P 140
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
L+ YI KV L+RN +REGLIR+R G + ++ FLDAHCEV + WL PLL +
Sbjct: 141 LKKYIADLK-KVILVRNPKREGLIRSRIIGYSRATAPIVTFLDAHCEVTIGWLEPLLDRV 199
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK-YNS 251
+ +R ++ P ID ID +T+++R+ D RG+F W M ++ P +E K+R YN
Sbjct: 200 HQNRSVVVCPEIDVIDDKTFQYRAGSSGD--IRGVFNWDMKFRWRLTPSQEQKRRNNYNV 257
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
+SPT AGGLFA+DR +F E+G YD + +WGGEN ELSF+IW CGG +E +PCS +G
Sbjct: 258 LFARSPTMAGGLFAIDRQYFQEIGLYDSQMDIWGGENLELSFRIWQCGGQLEIMPCSHVG 317
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+R+ +PY F K A G I N R E W D +K + Y R+P + G+I+E
Sbjct: 318 HVFRNVIPYKFPKDA----GLTINKNSVRTAEVWMD-GYKEFVYQRQPYMRNIHFGNITE 372
Query: 372 Q 372
+
Sbjct: 373 R 373
>gi|260789758|ref|XP_002589912.1| hypothetical protein BRAFLDRAFT_156854 [Branchiostoma floridae]
gi|229275097|gb|EEN45923.1| hypothetical protein BRAFLDRAFT_156854 [Branchiostoma floridae]
Length = 292
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 143/284 (50%), Positives = 195/284 (68%), Gaps = 10/284 (3%)
Query: 47 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
E G N++ SN IS DR IPD+R C Y DLP S+++ FHNEG+++L+RTVHS++
Sbjct: 13 ECGFNIKASNKISLDRAIPDIRHPNCASKKYVRDLPDVSLVIPFHNEGWTTLLRTVHSVL 72
Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
R+P Q + EIILVDDFS ++ L + LEDY+ + + KVR++R +REGLIRTR GA+ +
Sbjct: 73 NRSPEQLIHEIILVDDFSDRSHLGKDLEDYVAKLSPKVRVVRTKQREGLIRTRLLGAQVA 132
Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
+G+V++FLD+HCE +NWLPPLL PI ++K + P ID ID + + + + RG
Sbjct: 133 KGQVLIFLDSHCEANVNWLPPLLEPIALNKKTIVCPNIDVIDKDDFHYET--QAGDAMRG 190
Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F+W M YK +P+ K S+P++SP AGGLFA+DR +F ELGGYDPGL +WGG
Sbjct: 191 AFDWEMYYKRIPIPDE--IKNPDPSDPFESPVMAGGLFAVDREYFEELGGYDPGLDIWGG 248
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY------NFGK 324
E +ELSFK+W CGG + PCSR+GHVYR F+PY N GK
Sbjct: 249 EQYELSFKVWQCGGRMVDAPCSRVGHVYRKFVPYKVPAGVNLGK 292
>gi|442756891|gb|JAA70604.1| Putative polypeptide n-acetylgalactosaminyltransferase [Ixodes
ricinus]
Length = 582
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 152/350 (43%), Positives = 219/350 (62%), Gaps = 9/350 (2%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
PGE G+ + + A N+ S+ I+ +R++PD+R+E+CK YP LP
Sbjct: 81 PGENGRGVEIGKDEEALKKEKFKLNQFNLLASDRIALNRSLPDVRLEKCKDKVYPEKLPT 140
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
SV +VFHNE +S+L+RTVHS+I+ +P LEEIILVDD S + L ++LEDY+ + +
Sbjct: 141 TSVDIVFHNEAWSTLLRTVHSVIRTSPRALLEEIILVDDASEREHLGKQLEDYVVKLDTP 200
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
V+++R +R GLIR R GA +G+VI FLDAHCE NWL PLLA I DR + PV
Sbjct: 201 VKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTQNWLEPLLARIAEDRTRVVCPV 260
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
ID I +T+E+ S + G F W + ++ +P+RE +R + + P ++PT AGG
Sbjct: 261 IDVISDETFEYISASDLTW---GGFNWKLNFRGYRVPQRELDRRGGDRTLPVRTPTMAGG 317
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFA+D+ +F+ELG YD G+ +WGGEN ELSF+IWMCGG +E VPCS +GHV+R PY F
Sbjct: 318 LFAIDKDYFVELGKYDEGMDIWGGENLELSFRIWMCGGELEIVPCSHVGHVFRKSTPYTF 377
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
++ + +N R+ E W DE K +++ P A +D GD+S +
Sbjct: 378 PGGTSKI----VNHNNARLAEVWLDE-WKEFYFAINPAAKNVDKGDLSHR 422
>gi|226482456|emb|CAX73827.1| polypeptide GalNAc transferase 6 [Schistosoma japonicum]
Length = 603
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 164/358 (45%), Positives = 221/358 (61%), Gaps = 13/358 (3%)
Query: 17 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
LE + GPGE G L + ++ E G ++ S I DR+I D+R CK
Sbjct: 70 LENSRVGPGENGMPVKLSTHEKKIAAKTINENGFSVYVSTKIKTDRSIKDIRHPNCKGKL 129
Query: 77 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
Y LP ASVI+ F E + +L+RTV S++ R P+ ++E+ILVDD SS+ L +L+ +
Sbjct: 130 YSNKLPTASVIIPFFEEHWETLLRTVASVLNRAPSALIKEVILVDDGSSREYLKDRLDSH 189
Query: 137 IQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
I +GKVR+I ER+GLIR ++ GAKE+ GEV++FLD+HCE G+NWLPPLL PI +
Sbjct: 190 IISAYPDGKVRVIHLKERQGLIRAKTAGAKEATGEVLIFLDSHCEAGINWLPPLLDPIAA 249
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
+ + + P ID ID +E+R+ D RG F+W + YK LP R + + +P+
Sbjct: 250 NYRTVVCPFIDVIDADNFEYRA---QDEGARGAFDWELYYKR--LP-RLPEDSHHPEKPF 303
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
SP AGGLFA+ +F ELGGYDPGL++WGGE +ELSFKIWMCGG + PCSRIGH+Y
Sbjct: 304 DSPVMAGGLFAISAKWFWELGGYDPGLVIWGGEQYELSFKIWMCGGRMIDTPCSRIGHIY 363
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
R + NF K G + NYKRV E W DE +K Y Y R P LD GD++EQ
Sbjct: 364 RKYST-NFPKSQ---LGDFVGRNYKRVAEVWMDE-YKEYLYKRRPSYRHLDPGDLTEQ 416
>gi|440911421|gb|ELR61095.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Bos grunniens
mutus]
Length = 564
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 161/366 (43%), Positives = 225/366 (61%), Gaps = 15/366 (4%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL----- 67
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVSLPDV 106
Query: 68 RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
R+E CK YP +LP SV++VFHNE +S+L+RTVHSII +P LEEI+LVDD S +
Sbjct: 107 RLEGCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSIINHSPRHMLEEIVLVDDASERD 166
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL P
Sbjct: 167 FLKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEP 226
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 247
LLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +R
Sbjct: 227 LLARIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRR 283
Query: 248 KYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
K + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V
Sbjct: 284 KGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVT 343
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDM 366
CS +GHV+R PY F G +I N +R+ E W DE K +FY P +D
Sbjct: 344 CSHVGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDY 398
Query: 367 GDISEQ 372
GDIS +
Sbjct: 399 GDISSR 404
>gi|148223895|ref|NP_001086128.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13)
[Xenopus laevis]
gi|49258003|gb|AAH74234.1| MGC83963 protein [Xenopus laevis]
Length = 556
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 153/352 (43%), Positives = 222/352 (63%), Gaps = 9/352 (2%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +L
Sbjct: 55 EGPGELGKAVIIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDIRLEGCKTKVYPDEL 114
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P S+++VFHNE +S+L+RTVHS+I R+P + + EIILVDD S + L LE+Y++
Sbjct: 115 PNTSIVIVFHNEAWSTLLRTVHSVINRSPHRLISEIILVDDASERDFLKTPLENYVKHLE 174
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
V+++R +R GLIR R GA ++G++I FLDAHCE WL PLLA I DRK +
Sbjct: 175 VAVKILRMEQRSGLIRARLSGANVAKGKIITFLDAHCECTFGWLEPLLARIKEDRKTVVC 234
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + + P ++PT A
Sbjct: 235 PIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMA 291
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLF++D+ +F ELG YD G+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R PY
Sbjct: 292 GGLFSIDKKYFEELGTYDSGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPY 351
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
F G +I N +R+ E W D+ K +FY P + +D GD+SE+
Sbjct: 352 TFPGGT----GHVINKNNRRLAEVWMDD-FKDFFYIISPGVVKVDYGDVSER 398
>gi|291220820|ref|XP_002730422.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Saccoglossus kowalevskii]
Length = 1082
Score = 301 bits (771), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 154/351 (43%), Positives = 215/351 (61%), Gaps = 10/351 (2%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
GPGE G+ L + D + +G N+ S+ IS +R+I D++ C Y DLP
Sbjct: 583 GPGENGQPVLLYGEQKKEADETFDVHGFNVVVSDMISLERSITDVKHSLCDTVRYNKDLP 642
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFN 141
ASVI+ FHNE +S+L+RT++S+I R+ + L+EIILVDD+S + +L L++YIQ FN
Sbjct: 643 TASVIISFHNEAWSTLLRTIYSVINRSKIKLLQEIILVDDYSDRDELKVALDEYIQSNFN 702
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
KV+++ TEREGLIR R GA ++ G+++VFLD+HCEV NWL PL+ IY D +
Sbjct: 703 NKVKILHTTEREGLIRARLIGASKATGKILVFLDSHCEVNYNWLEPLIERIYRDSSTIAC 762
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
PVID ID ++ Y RG WG+ +K +P E +R EP KSP AG
Sbjct: 763 PVIDIIDPDSF----AYSASPLVRGGVNWGLQFKWKNVPPVELLRRNSEIEPIKSPIMAG 818
Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
GLFA+DR +F +G YD + +WGGE+ ELSF+IW CGG++E VPCSR+GH++R PY
Sbjct: 819 GLFAVDRNYFEHIGSYDKDMQIWGGEHLELSFRIWQCGGTLEIVPCSRVGHIFRKSHPYT 878
Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ V T+N RV E W D+ +K +FY P A GD+SE+
Sbjct: 879 IPGGMENV----FTHNSIRVAEVWMDD-YKRFFYATRPDAQGKTYGDLSER 924
>gi|326674972|ref|XP_687472.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
2 [Danio rerio]
Length = 557
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 153/352 (43%), Positives = 220/352 (62%), Gaps = 9/352 (2%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
+GPGE GK + + + N+ S I+ +R++PD+R+E CK YP DL
Sbjct: 54 DGPGEMGKPVVIAKDQQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDDL 113
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P+ SV++VFHNE +++L+RTVHS+I R+P LEEI+LVDD S + L ++LE Y+++
Sbjct: 114 PRTSVVIVFHNEAWTTLLRTVHSVIDRSPRHLLEEIVLVDDASERDFLKRQLEHYVRKLE 173
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
VR++R +R GLIR R +GA S G+VI FLDAHCE WL PLL+ I D+K +
Sbjct: 174 VPVRVVRMEQRSGLIRARLKGASISTGQVITFLDAHCECTTGWLEPLLSRIKLDKKTVVC 233
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + + P ++PT A
Sbjct: 234 PIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMA 290
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +GHV+R PY
Sbjct: 291 GGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKATPY 350
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
F G +I N +R+ E W DE K +FY P +D GDIS +
Sbjct: 351 TFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISTR 397
>gi|348585735|ref|XP_003478626.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Cavia porcellus]
Length = 568
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 160/358 (44%), Positives = 221/358 (61%), Gaps = 13/358 (3%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R+E CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD
Sbjct: 96 SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
E +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
+E V CS +GHV+R PY F G +I N +R+ E W DE K +FY P
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISP 385
>gi|126341064|ref|XP_001364304.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Monodelphis domestica]
Length = 609
Score = 300 bits (769), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 164/378 (43%), Positives = 223/378 (58%), Gaps = 18/378 (4%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAY-------RAAGDASLGEYGMNMETS 55
+ + K+ +EP LE E G+ PE + D ++ N+ S
Sbjct: 66 LLEPQSKVNKIEPILENNGEDAGKEEDTELSPEMGMIFNERDQELRDLGYQKHAFNLLIS 125
Query: 56 NHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
N + + R +PD R ECK YP DLP AS+++ F+NE FS+L+RTVHS+I RTPA L
Sbjct: 126 NRLGYHRDVPDTRNAECKEKSYPSDLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLH 185
Query: 116 EIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
EIILVDD S DL +L+ Y+Q++ GK++++RN +REGLIR R GA + GEV+VFL
Sbjct: 186 EIILVDDNSEFDDLKGELDKYVQKYLPGKIQVVRNEKREGLIRGRMIGAAHATGEVLVFL 245
Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 234
D+HCEV WL PLL PI DR+ + PVID I T +Y RG F WG+ +
Sbjct: 246 DSHCEVNKMWLQPLLVPIQEDRRTVVCPVIDIISADTL----MYSSSPIVRGGFNWGLHF 301
Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
K + +P E + + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+
Sbjct: 302 KWDLVPFSELEGPEGAIAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFR 361
Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
IWMCGG + +PCSR+GH++R PY + D +TYN R+ W DE + YF
Sbjct: 362 IWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTYNSLRLAHVWLDEYKEQYF 416
Query: 355 YTREPLAMFLDMGDISEQ 372
R L + G+ISE+
Sbjct: 417 SLRPELKL-KSYGNISER 433
>gi|380024969|ref|XP_003696257.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like isoform 2 [Apis
florea]
Length = 598
Score = 300 bits (768), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 163/356 (45%), Positives = 214/356 (60%), Gaps = 14/356 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK L + A + G N S+ IS +R++PD+R CK Y
Sbjct: 73 EARRIGKGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISVNRSVPDIRHPGCKDKKY 132
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
+L SVI+ FHNE FS+LMRT S++ R+PA L+EIILVDD S+K L + L+DY+
Sbjct: 133 LRNLDSVSVIVSFHNEHFSTLMRTCWSVVNRSPASLLQEIILVDDASTKVGLKKTLDDYV 192
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
KV+++R +R GLI+ R GAK ++ +V+VFLD+H E +NWLPPLL PI D K
Sbjct: 193 ATHLPKVKIVRLKQRSGLIKGRLAGAKIAKAKVLVFLDSHSEANINWLPPLLEPIAQDYK 252
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
P ID I Y+T+E+R+ D RG F+W + YK L + + +EP+KSP
Sbjct: 253 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN---PTEPFKSP 306
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+ FF ELGGYDP L +WGGE +ELSFKIW CGG + PCSR+GH+YR F
Sbjct: 307 IMAGGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKF 366
Query: 318 MPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
P+ N G KG + NYKRV E W DE + Y Y R P LD G++ Q
Sbjct: 367 PPFPNPG------KGDFLGKNYKRVAEVWMDE-YAEYIYRRRPHLRSLDPGNLKSQ 415
>gi|47226346|emb|CAG09314.1| unnamed protein product [Tetraodon nigroviridis]
Length = 632
Score = 300 bits (768), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 162/394 (41%), Positives = 233/394 (59%), Gaps = 36/394 (9%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K G+L P L EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DRKDGSLLPALRAVISRRHEGPGEMGKAVVIPKDEQEKMKELFKINQFNLMASDMIALNR 95
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R++ CK YP D+P SV++VFHNE +S+L+RTVHS+I R+P L EI+LVDD
Sbjct: 96 SLPDVRLDGCKTKVYPDDVPNTSVVIVFHNEAWSTLLRTVHSVINRSPRHLLVEIVLVDD 155
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L +KLE+Y++ VR++R +R GLIR R RGA ++G+VI FLDAHCE +
Sbjct: 156 ASERDFLKKKLENYVRTLEVPVRILRMEQRSGLIRARLRGAAATKGQVITFLDAHCECTV 215
Query: 183 NWLPPLLAPIYSD-----------------------RKIMTVPVIDGIDYQTWEFRSVYE 219
WL PLLA I D R + P+ID I +T+E+ +
Sbjct: 216 GWLEPLLARIKEDRWDCNTALCVCVFERPSFRCFLFRTAVVCPIIDVISDETFEYMA--G 273
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYD 278
D Y G F W + ++ +P+RE +RK + + P ++PT AGGLF++D+ +F E+G YD
Sbjct: 274 SDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEEIGSYD 332
Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNY 338
PG+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R PY+F G +I N
Sbjct: 333 PGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPYSFPGGT----GQVINKNN 388
Query: 339 KRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+R+ E W D+ K +FY P M +D GD+S +
Sbjct: 389 RRLAEVWMDD-FKDFFYIISPGVMRVDYGDVSSR 421
>gi|380024967|ref|XP_003696256.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like isoform 1 [Apis
florea]
Length = 611
Score = 300 bits (768), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 163/356 (45%), Positives = 214/356 (60%), Gaps = 14/356 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK L + A + G N S+ IS +R++PD+R CK Y
Sbjct: 86 EARRIGKGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISVNRSVPDIRHPGCKDKKY 145
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
+L SVI+ FHNE FS+LMRT S++ R+PA L+EIILVDD S+K L + L+DY+
Sbjct: 146 LRNLDSVSVIVSFHNEHFSTLMRTCWSVVNRSPASLLQEIILVDDASTKVGLKKTLDDYV 205
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
KV+++R +R GLI+ R GAK ++ +V+VFLD+H E +NWLPPLL PI D K
Sbjct: 206 ATHLPKVKIVRLKQRSGLIKGRLAGAKIAKAKVLVFLDSHSEANINWLPPLLEPIAQDYK 265
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
P ID I Y+T+E+R+ D RG F+W + YK L + + +EP+KSP
Sbjct: 266 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN---PTEPFKSP 319
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+ FF ELGGYDP L +WGGE +ELSFKIW CGG + PCSR+GH+YR F
Sbjct: 320 IMAGGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKF 379
Query: 318 MPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
P+ N G KG + NYKRV E W DE + Y Y R P LD G++ Q
Sbjct: 380 PPFPNPG------KGDFLGKNYKRVAEVWMDE-YAEYIYRRRPHLRSLDPGNLKSQ 428
>gi|380024971|ref|XP_003696258.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like isoform 3 [Apis
florea]
Length = 590
Score = 300 bits (767), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 163/356 (45%), Positives = 214/356 (60%), Gaps = 14/356 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK L + A + G N S+ IS +R++PD+R CK Y
Sbjct: 65 EARRIGKGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISVNRSVPDIRHPGCKDKKY 124
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
+L SVI+ FHNE FS+LMRT S++ R+PA L+EIILVDD S+K L + L+DY+
Sbjct: 125 LRNLDSVSVIVSFHNEHFSTLMRTCWSVVNRSPASLLQEIILVDDASTKVGLKKTLDDYV 184
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
KV+++R +R GLI+ R GAK ++ +V+VFLD+H E +NWLPPLL PI D K
Sbjct: 185 ATHLPKVKIVRLKQRSGLIKGRLAGAKIAKAKVLVFLDSHSEANINWLPPLLEPIAQDYK 244
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
P ID I Y+T+E+R+ D RG F+W + YK L + + +EP+KSP
Sbjct: 245 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN---PTEPFKSP 298
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+ FF ELGGYDP L +WGGE +ELSFKIW CGG + PCSR+GH+YR F
Sbjct: 299 IMAGGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKF 358
Query: 318 MPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
P+ N G KG + NYKRV E W DE + Y Y R P LD G++ Q
Sbjct: 359 PPFPNPG------KGDFLGKNYKRVAEVWMDE-YAEYIYRRRPHLRSLDPGNLKSQ 407
>gi|350644736|emb|CCD60531.1| n-acetylgalactosaminyltransferase,putative [Schistosoma mansoni]
Length = 508
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 162/358 (45%), Positives = 221/358 (61%), Gaps = 13/358 (3%)
Query: 17 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
LE + GPGE G + L + + ++ E G ++ S I DR+I D+R CK
Sbjct: 70 LESLRVGPGENGMPFELSYHDKELSNKTINENGFSVYVSGKIKIDRSIKDIRHPRCKGKL 129
Query: 77 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
Y +LP SVI+ F E + +L+RTV S++ R P+ ++E+ILVDD SS+ L +L+ +
Sbjct: 130 YSSNLPTVSVIIPFFEEHWETLLRTVSSVLNRAPSGLIKEVILVDDGSSRKYLKDRLDSH 189
Query: 137 IQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
+ G VR+I R GLIR ++ GA+E+ GEV++FLD+HCE G+NWLPPLL PI +
Sbjct: 190 LATAYPGGIVRVIHLEHRGGLIRAKTAGAREATGEVLIFLDSHCEAGINWLPPLLDPIAA 249
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
+ K + P ID ID T+E+R+ D RG F+W + YK LP R + R + EP+
Sbjct: 250 NYKTVVCPFIDVIDADTFEYRA---QDEGARGAFDWELYYKR--LP-RLPEDRYHPEEPF 303
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
SP AGGLFA+ +F ELGGYDPGL++WGGE +ELSFKIWMCGG + PCSRIGH+Y
Sbjct: 304 DSPVMAGGLFAISAKWFWELGGYDPGLVIWGGEQYELSFKIWMCGGRMVDAPCSRIGHIY 363
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
R + NF K G + NYKRV E W DE +K Y Y R P LD GD+++Q
Sbjct: 364 RKYST-NFPKAE---FGDFVGRNYKRVAEVWMDE-YKEYLYKRRPRYRDLDAGDLTKQ 416
>gi|328781649|ref|XP_003250010.1| PREDICTED: n-acetylgalactosaminyltransferase 6-like isoform 2 [Apis
mellifera]
Length = 598
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 162/356 (45%), Positives = 214/356 (60%), Gaps = 14/356 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK L + A + G N S+ IS +R++PD+R CK Y
Sbjct: 73 EAKRIGKGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISVNRSVPDIRHPGCKDKKY 132
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
+L SVI+ FHNE FS+L+RT S++ R+PA L+EIILVDD S+K L + L+DY+
Sbjct: 133 LRNLDSVSVIVSFHNEHFSTLIRTCWSVVNRSPASLLQEIILVDDASTKVGLKKTLDDYV 192
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
KV+++R +R GLI+ R GAK ++ +V+VFLD+H E +NWLPPLL PI D K
Sbjct: 193 ATHLPKVKIVRLKQRSGLIKGRLAGAKVAKAKVLVFLDSHSEANINWLPPLLEPIAQDYK 252
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
P ID I Y+T+E+R+ D RG F+W + YK L + + +EP+KSP
Sbjct: 253 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN---PTEPFKSP 306
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+ FF ELGGYDP L +WGGE +ELSFKIW CGG + PCSR+GH+YR F
Sbjct: 307 VMAGGLFAISSKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKF 366
Query: 318 MPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
P+ N G KG + NYKRV E W DE + Y Y R P LD G++ Q
Sbjct: 367 PPFPNPG------KGDFLGKNYKRVAEVWMDE-YAEYIYRRRPHLRSLDPGNLKSQ 415
>gi|328781647|ref|XP_003250009.1| PREDICTED: n-acetylgalactosaminyltransferase 6-like isoform 1 [Apis
mellifera]
Length = 611
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 162/356 (45%), Positives = 214/356 (60%), Gaps = 14/356 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK L + A + G N S+ IS +R++PD+R CK Y
Sbjct: 86 EAKRIGKGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISVNRSVPDIRHPGCKDKKY 145
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
+L SVI+ FHNE FS+L+RT S++ R+PA L+EIILVDD S+K L + L+DY+
Sbjct: 146 LRNLDSVSVIVSFHNEHFSTLIRTCWSVVNRSPASLLQEIILVDDASTKVGLKKTLDDYV 205
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
KV+++R +R GLI+ R GAK ++ +V+VFLD+H E +NWLPPLL PI D K
Sbjct: 206 ATHLPKVKIVRLKQRSGLIKGRLAGAKVAKAKVLVFLDSHSEANINWLPPLLEPIAQDYK 265
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
P ID I Y+T+E+R+ D RG F+W + YK L + + +EP+KSP
Sbjct: 266 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN---PTEPFKSP 319
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+ FF ELGGYDP L +WGGE +ELSFKIW CGG + PCSR+GH+YR F
Sbjct: 320 VMAGGLFAISSKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKF 379
Query: 318 MPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
P+ N G KG + NYKRV E W DE + Y Y R P LD G++ Q
Sbjct: 380 PPFPNPG------KGDFLGKNYKRVAEVWMDE-YAEYIYRRRPHLRSLDPGNLKSQ 428
>gi|321455342|gb|EFX66478.1| hypothetical protein DAPPUDRAFT_302681 [Daphnia pulex]
Length = 613
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 161/355 (45%), Positives = 219/355 (61%), Gaps = 13/355 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + GPGE G A++L D+ G N S+ I+ +RT+ D+R +CK +Y
Sbjct: 89 ESKQTGPGEQGLAFYLSPEDEKIKDSLYKVNGFNALVSDRINLNRTLKDIRHPDCKAQNY 148
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
DLP AS+++ FHNE FS L+RT +S + R PA LE +ILVDD S+K + L+DY+
Sbjct: 149 LEDLPTASIVVPFHNEHFSVLLRTAYSALNRAPANLLE-VILVDDASTKEHSKKPLDDYV 207
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
+ +VR+I ER GLIR R GA+ ++G+VI+FLD+H E +NWLPPLL PI D +
Sbjct: 208 TQHMPRVRVIHLAERSGLIRARMAGARRAKGDVIIFLDSHSEANVNWLPPLLDPIAEDYR 267
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P ID ID++T+ +R+ D RG F+W YK L + K + + P+KSP
Sbjct: 268 TVVCPFIDVIDFETFAYRA---QDEGARGAFDWEFFYKRLPLLPDDLK---HPARPFKSP 321
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+ + FF ELGGYD GL +WGGE +ELSFKIW CGG + PCSRIGH+YR +
Sbjct: 322 VMAGGLFAISKKFFFELGGYDEGLEIWGGEQYELSFKIWQCGGQMFDAPCSRIGHIYRKY 381
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
P+ + KG + NYKRV E W DE +K Y Y R P L++GD+S Q
Sbjct: 382 APF-----PNSAKGDFVGRNYKRVAEVWMDE-YKEYLYKRRPQYRNLEVGDLSSQ 430
>gi|443703000|gb|ELU00789.1| hypothetical protein CAPTEDRAFT_190622 [Capitella teleta]
Length = 507
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 149/352 (42%), Positives = 216/352 (61%), Gaps = 15/352 (4%)
Query: 28 GKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVI 87
G+ L + D + N+ S+ I+ +R++ D R +C YP +P ASV+
Sbjct: 2 GRRVELSAEKQEEADKLFKKEAFNIVASDMIALNRSVSDNRDPQCSRVSYPKVMPNASVV 61
Query: 88 LVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF--NGKVR 145
++FHNE +S L+RTVHS++ R+P +YL E+IL+DDFS +A L +KL+ YI+ +G V+
Sbjct: 62 IIFHNEAWSPLLRTVHSVVNRSPPEYLHEVILLDDFSDRAGLGEKLDGYIKDTWPDGIVK 121
Query: 146 LIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVID 205
++R ER+GLIR R GAK + GEV+VFLD+HCE + WL PL+A I R + P+ID
Sbjct: 122 VVRAPERQGLIRARVLGAKAATGEVLVFLDSHCECNVQWLEPLVARIKESRSALLCPMID 181
Query: 206 GIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFA 265
ID + + + G F W + + LP+RE K+RK + E +SPT AGGLFA
Sbjct: 182 VIDAKAMSYNGIGAGS---VGGFWWSLHFSWRPLPQRERKRRKSSVETIRSPTMAGGLFA 238
Query: 266 MDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKL 325
DR +F E+GGYDPG+ VWGGEN E+SF++WMCGG++E+VPCSR+GH++RS PY F
Sbjct: 239 ADRKYFFEIGGYDPGMDVWGGENLEISFRVWMCGGTLEFVPCSRVGHIFRSSHPYTFPGN 298
Query: 326 ADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMF-----LDMGDISEQ 372
D N KR+ E W D + +++ R L + D GD S++
Sbjct: 299 KD-----THGLNSKRLAEVWMDGYKRLFYHHRRDLLVINPQFNADAGDFSDR 345
>gi|147900163|ref|NP_001083410.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Xenopus
laevis]
gi|38014522|gb|AAH60419.1| MGC68664 protein [Xenopus laevis]
Length = 559
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 154/357 (43%), Positives = 223/357 (62%), Gaps = 10/357 (2%)
Query: 17 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
L+P +EGPGE GK + + + N+ S I+ +R++PD+R+E CK
Sbjct: 52 LKP-QEGPGEMGKPVVILKEEQERMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKV 110
Query: 77 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
YP +LP SV++VFHNE +++L+RTVHS+I R+P L EI+LVDD S + L + LE Y
Sbjct: 111 YPDNLPTTSVVIVFHNEAWTTLLRTVHSVINRSPRHLLREIVLVDDASERDFLKRALETY 170
Query: 137 IQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 196
+++ + V +IR +R GLIR R RGA S+G+VI FLDAHCE + WL PLLA I DR
Sbjct: 171 VKKLSVPVHVIRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTVGWLEPLLARINHDR 230
Query: 197 KIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYK 255
+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +R+ + + P +
Sbjct: 231 RTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRRGDRTLPVR 287
Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
+PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +GHV+R
Sbjct: 288 TPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFR 347
Query: 316 SFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
PY F G +I N +R+ E W DE K +FY P +D GDI+ +
Sbjct: 348 KATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDIATR 399
>gi|443720685|gb|ELU10336.1| hypothetical protein CAPTEDRAFT_176696 [Capitella teleta]
Length = 587
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 152/325 (46%), Positives = 212/325 (65%), Gaps = 10/325 (3%)
Query: 48 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
Y N S+ +SF R IPD+R + C+ +YP +LP ASV++ F+NE +S L+RTVHSII
Sbjct: 87 YAFNELISDRLSFHRPIPDVRHQLCQSEEYPAELPSASVVICFYNEAWSVLLRTVHSIID 146
Query: 108 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 167
RTP+ L EIILVDDFS L ++L+ Y+ + +L+RNT REGLIR R G++ +
Sbjct: 147 RTPSALLHEIILVDDFSDLDHLAEQLDAYVSEHLPQTKLVRNTRREGLIRARVIGSEHAT 206
Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
GEV+VFLD+HCEV + W+ PLL+ I+ + K + VP+ID ID T FR YE RG
Sbjct: 207 GEVLVFLDSHCEVNVEWIQPLLSHIHGNHKRVAVPIIDIIDQDT--FR--YESSPLVRGG 262
Query: 228 FEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
F WG+ Y+ +++PE +K++ +P K+PT AGGLFAM+R +F +LG YD G+ VWGGE
Sbjct: 263 FNWGLFYRWDQIPESLLRKQEDYVKPIKTPTMAGGLFAMNRKYFNDLGRYDTGMDVWGGE 322
Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
N E+SF++W CGGS+ +PCSR+GH++R PY V IT N RV W D
Sbjct: 323 NLEISFRVWQCGGSMHILPCSRVGHIFRKRRPY-----GSPVGVDTITKNSLRVAHVWMD 377
Query: 348 EKHKAYFYTREPLAMFLDMGDISEQ 372
E K +F R+ A + GD+S++
Sbjct: 378 EYIKYFFQVRKT-ADHAEYGDVSDR 401
>gi|427796213|gb|JAA63558.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Rhipicephalus pulchellus]
Length = 621
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 153/348 (43%), Positives = 219/348 (62%), Gaps = 9/348 (2%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
PGE G+ + A N+ S+ I+ +R++PD+R+E+CK YP LP
Sbjct: 120 PGERGRGVEIGPEEEALKKEKFKLNQFNLLASDRIALNRSLPDVRLEKCKDKVYPEKLPT 179
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
SV++VFHNE +S+L+RTVHS+I+ +P LEEIILVDD S + L +KLEDY+ +
Sbjct: 180 TSVVIVFHNEAWSTLLRTVHSVIRTSPRALLEEIILVDDASEREHLGKKLEDYVVKLEVP 239
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
V+++R +R GLIR R GA +G+VI FLDAHCE +WL PLLA I DR + PV
Sbjct: 240 VKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTQHWLEPLLARIAEDRTRVVCPV 299
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
ID I +T+E+ S D + G F W + ++ +P+RE ++R + + P ++PT AGG
Sbjct: 300 IDVISDETFEYISA--SDMTWGG-FNWKLNFRWYRVPQREVERRGGDRTLPIRTPTMAGG 356
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LF++D+ +F ELG YD G+ +WGGEN ELSF+IWMCGG +E VPCS +GHV+R PY+F
Sbjct: 357 LFSIDKDYFNELGKYDEGMDIWGGENLELSFRIWMCGGELEIVPCSHVGHVFRKSTPYSF 416
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
R+ + +N R+ E W DE K +++ P A +D GD+S
Sbjct: 417 PGGTSRI----VNHNNARLAEVWLDE-WKDFYFAINPAAKNVDKGDLS 459
>gi|256081587|ref|XP_002577050.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
Length = 469
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 162/358 (45%), Positives = 221/358 (61%), Gaps = 13/358 (3%)
Query: 17 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
LE + GPGE G + L + + ++ E G ++ S I DR+I D+R CK
Sbjct: 31 LESLRVGPGENGMPFELSYHDKELSNKTVNENGFSVYVSGKIKIDRSIKDIRHPRCKGKL 90
Query: 77 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
Y +LP SVI+ F E + +L+RTV S++ R P+ ++E+ILVDD SS+ L +L+ +
Sbjct: 91 YSSNLPTVSVIIPFFEEHWETLLRTVSSVLNRAPSGLIKEVILVDDGSSRKYLKDRLDSH 150
Query: 137 IQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
+ G VR+I R GLIR ++ GA+E+ GEV++FLD+HCE G+NWLPPLL PI +
Sbjct: 151 LATAYPGGIVRVIHLEHRGGLIRAKTAGAREATGEVLIFLDSHCEAGINWLPPLLDPIAA 210
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
+ K + P ID ID T+E+R+ D RG F+W + YK LP R + R + EP+
Sbjct: 211 NYKTVVCPFIDVIDADTFEYRA---QDEGARGAFDWELYYKR--LP-RLPEDRYHPEEPF 264
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
SP AGGLFA+ +F ELGGYDPGL++WGGE +ELSFKIWMCGG + PCSRIGH+Y
Sbjct: 265 DSPVMAGGLFAISAKWFWELGGYDPGLVIWGGEQYELSFKIWMCGGRMVDAPCSRIGHIY 324
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
R + NF K G + NYKRV E W DE +K Y Y R P LD GD+++Q
Sbjct: 325 RKYST-NFPKAE---FGDFVGRNYKRVAEVWMDE-YKEYLYKRRPRYRDLDAGDLTKQ 377
>gi|449683613|ref|XP_002154358.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Hydra magnipapillata]
Length = 641
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 157/376 (41%), Positives = 227/376 (60%), Gaps = 19/376 (5%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHL-PEAYRAAGDASLGEYGMNMETSNHIS 59
RPV+ K + P K GEGG+A +L EA + + + N S+ IS
Sbjct: 110 RPVYDISAKKN-----INPMK---GEGGEASYLDTEAEKQYAEKIFANHSFNSVLSDKIS 161
Query: 60 FDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
DRT+ D+R + C K+ YP LP ASVI+ FHNE +S L+RTVHS++ RTP L +I
Sbjct: 162 LDRTMRDVRGDLCIEKHKTYPRKLPTASVIICFHNEAYSVLLRTVHSVLNRTPPDLLTDI 221
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
ILVDD S +L + L+D++ + + K+++IRN +R GLIR+R GA SRG+V++FLD+H
Sbjct: 222 ILVDDKSEYENLKRPLDDHVAQLSKKIKIIRNAKRSGLIRSRINGADLSRGDVLIFLDSH 281
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKEN 237
CE W PLLA I + VP+I+ I+ T ++ + PD RG F W + YK
Sbjct: 282 CETTPGWAEPLLARIAEKSSNVVVPIIEVINADTLQYAAAANPDQ--RGGFSWDLFYKWK 339
Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
+P E RK + ++PT AGGLFA+DR +F ++G YD + +WGGEN E+SF+IWM
Sbjct: 340 PIPLDEQHLRKSPIDVIRTPTMAGGLFAIDRKYFYDMGTYDEEMDIWGGENLEMSFRIWM 399
Query: 298 CGGSIEWVPCSRIGHVYRSFM-PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
CGG I+ +PCSR+GH++R F PY F ++ ++ N R+ E W DE +K +Y
Sbjct: 400 CGGRIDIIPCSRVGHIFRKFTSPYKFPDGVEKT----LSKNLNRLAEVWLDE-YKELYYQ 454
Query: 357 REPLAMFLDMGDISEQ 372
+ P + D GDIS++
Sbjct: 455 KRPQSKGKDYGDISQR 470
>gi|148230993|ref|NP_001087490.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[Xenopus laevis]
gi|51261644|gb|AAH80006.1| MGC81846 protein [Xenopus laevis]
Length = 603
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 164/374 (43%), Positives = 220/374 (58%), Gaps = 14/374 (3%)
Query: 1 RPVFKADGKLGN-LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHIS 59
+P+ G GN LE E + E G ++ E + D ++ N+ SN +
Sbjct: 66 QPIASHQGLNGNQLETKAEANADLSPELGMIFN--EQDQDVRDVGYQKHAFNLLISNRLG 123
Query: 60 FDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
+ R +PD R +C YP DLP AS+++ F+NE FS+L+RTVHS++ RTPAQ L EIIL
Sbjct: 124 YHRDVPDTRDSKCSKKTYPADLPHASIVICFYNEAFSALLRTVHSVLDRTPAQLLHEIIL 183
Query: 120 VDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 178
VDD S DL + L++Y+Q + KV+L+RN +REGLIR R GA + G+V+VFLD+HC
Sbjct: 184 VDDNSELDDLKKDLDNYMQENLSEKVKLVRNKQREGLIRGRMVGASRATGDVLVFLDSHC 243
Query: 179 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE 238
EV WL PLLAPI + K + PVID I T +Y RG F WG+ +K +
Sbjct: 244 EVNEMWLQPLLAPIRENPKTVVCPVIDIISSDTL----IYSSSPVVRGGFNWGLHFKWDP 299
Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
+P E + + P++SPT AGGLF MDR +F LG YD G+ +WGGEN E+SF+IWMC
Sbjct: 300 VPLSELGGPEGYTAPFRSPTMAGGLFVMDREYFNTLGHYDSGMDIWGGENLEISFRIWMC 359
Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
GGS+ VPCSR+GH++R PY D + YN R+ W DE YF R
Sbjct: 360 GGSLLIVPCSRVGHIFRKRRPYGSPGGHDT-----MAYNSLRLAHVWMDEYKDQYFALR- 413
Query: 359 PLAMFLDMGDISEQ 372
P D GDISE+
Sbjct: 414 PELRNKDYGDISER 427
>gi|358331987|dbj|GAA50722.1| putative polypeptide N-acetylgalactosaminyltransferase 10
[Clonorchis sinensis]
Length = 738
Score = 297 bits (761), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 160/357 (44%), Positives = 223/357 (62%), Gaps = 13/357 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + GPGE G A L + + L + G N S+ I+ DR++ D+R +CK Y
Sbjct: 207 EANRVGPGEQGAAVRLFGEQKVESEKFLNQNGFNTYISDMIAIDRSVADIRHPKCKAMLY 266
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+++ F E +++L+RT S +KR+P ++E+ILVDD S++ L L+ Y+
Sbjct: 267 LAKLPSVSLVIPFFQENWNALLRTFVSSLKRSPPGLIKEVILVDDGSTREYLKGPLDRYL 326
Query: 138 QRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
++ +G VR+IR+ +REGLI R RGA+ + GEV+VFLD+HCE NWLPPL+ PI D
Sbjct: 327 EQHYPDGLVRVIRSPKREGLITARIRGARAATGEVLVFLDSHCEANPNWLPPLVDPIARD 386
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYK 255
K++T P ID I T+E+R+ D RG F+W + YK LP + + + P+
Sbjct: 387 YKVVTCPFIDVISADTFEYRA---QDEGARGAFDWELFYKR--LP-KLPQDLPHPERPFD 440
Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
SP AGGLFA+ +F ELGGYDPGL++WGGE +ELSFKIWMCGG + +PCSRIGH+YR
Sbjct: 441 SPVMAGGLFAISAKWFWELGGYDPGLVIWGGEQYELSFKIWMCGGRMIDIPCSRIGHIYR 500
Query: 316 SFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ P +F G + NYKRV ETW DE +K Y Y+R P +D GD+SEQ
Sbjct: 501 TH-PTDFPSAG---LGDFLGKNYKRVAETWMDE-YKEYIYSRRPHYRHIDAGDLSEQ 552
>gi|357624971|gb|EHJ75544.1| hypothetical protein KGM_17358 [Danaus plexippus]
Length = 626
Score = 297 bits (761), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 155/359 (43%), Positives = 220/359 (61%), Gaps = 9/359 (2%)
Query: 15 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
P ++P +E PGE GKA ++P E N+ S+ IS +R++ D+R E+CK
Sbjct: 113 PFVKPQEETPGEMGKAVNIPIEQEKVMLEKFQENQFNLLASDMISLNRSLTDVRFEKCKA 172
Query: 75 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
YP LP SV++VFHNE +++L+RT+ S I R+P L+EIILVDD S K L +KLE
Sbjct: 173 KRYPTLLPTTSVVIVFHNEAWTTLLRTIWSTINRSPRPLLKEIILVDDASEKEHLGKKLE 232
Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
+YI+ RL R R GLIR R GAK +G+VI FLDAHCE WL PLL+ I
Sbjct: 233 EYIKTLPVSTRLFRTESRSGLIRARLLGAKHVKGDVITFLDAHCECTEGWLEPLLSRIVE 292
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEP 253
DR + P+ID I T+E+ + D + G F W + ++ +PERE ++R + + P
Sbjct: 293 DRSTVVCPIIDVISDTTFEY--IQASDMTWGG-FNWKLNFRWYRVPEREMQRRGGDRTAP 349
Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
++PT AGGLFA+DR +F ++G YD G+ +WGGEN E+SF++W CGG +E VPCS +GHV
Sbjct: 350 LRTPTMAGGLFAIDREYFYKIGSYDEGMDIWGGENLEMSFRVWQCGGVLEIVPCSHVGHV 409
Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+R PY+F V + N RV E W DE + ++Y P A+ + +GD+SE+
Sbjct: 410 FRDKSPYSFPGGVQAV----VLKNAARVAEVWMDEWGE-FYYAMNPGALNVPVGDVSER 463
>gi|291243600|ref|XP_002741689.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Saccoglossus kowalevskii]
Length = 524
Score = 297 bits (761), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 161/367 (43%), Positives = 226/367 (61%), Gaps = 15/367 (4%)
Query: 9 KLGNLEPPLEPYK-EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL 67
++ NL+ P +GPGE G + A N S+ IS +R IPD+
Sbjct: 3 RVQNLDVTTAPRNPKGPGEYGVSVITRPEDEAKVKTGWKHASFNEFVSDMISVERAIPDV 62
Query: 68 RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
R EEC+ Y LP S+I+ F E +S+L+R+VHS+I R+P Q ++EIILVDDFSS+
Sbjct: 63 RPEECQDKLYSDSLPSTSIIICFTEESWSTLVRSVHSVINRSPPQLIKEIILVDDFSSRE 122
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L L+ Y++RF +V+++R REGLIR R RG + ++GEV+ FLD+H E G+ WL P
Sbjct: 123 YLKAPLDKYMKRF-PQVKILRLENREGLIRGRLRGTEIAQGEVLTFLDSHIECGVGWLEP 181
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 247
+L I DR+ + P+IDGID + Y + RG F W M +K +P+ E K+R
Sbjct: 182 MLQRIKEDRRNVVAPMIDGIDATKFS----YAASNLIRGGFSWEMQFKWKPIPDYEMKRR 237
Query: 248 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 307
K + P +SPT AGGLFA+D+++FLE+G YDPGL +WG EN ELSFKIWMCGG++E +PC
Sbjct: 238 KDETWPIRSPTMAGGLFAIDKSYFLEIGTYDPGLEIWGAENLELSFKIWMCGGNLEMIPC 297
Query: 308 SRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLD 365
S +GHV+R+ PY F +G + T+ N RV E W DE +K FY +P D
Sbjct: 298 SHVGHVFRASQPYKFP------EGNIKTFMRNNMRVAEVWMDE-YKDIFYALKPQLKGED 350
Query: 366 MGDISEQ 372
GD++E+
Sbjct: 351 YGDVTER 357
>gi|321456141|gb|EFX67256.1| hypothetical protein DAPPUDRAFT_218737 [Daphnia pulex]
Length = 639
Score = 297 bits (761), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 153/360 (42%), Positives = 228/360 (63%), Gaps = 12/360 (3%)
Query: 16 PLEPYKEG-PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
P+ P + G PGE GK HLP + N+ S+ IS +R++PD+R+E C+
Sbjct: 123 PVVPEQAGQPGEMGKPVHLPADQESLMREKFRLNQFNLLASDSISLNRSLPDVRLEGCRD 182
Query: 75 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
YP LP S+++VFHNE +S+L+RTV SII R+P + L EIILVDD S + L ++LE
Sbjct: 183 KSYPGLLPTTSIVIVFHNEAWSTLLRTVWSIITRSPRELLAEIILVDDASERDYLGKELE 242
Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
D++ F V ++R +R GLIR R GAK+ +G+VI FLDAHCE WL PLLA +
Sbjct: 243 DHVANFPVPVHVLRTHKRSGLIRARLIGAKQVKGQVITFLDAHCECTEGWLEPLLARVAE 302
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEP 253
+RKI+ P+ID I +++E+ V D + G F W + ++ +P+RE +R + ++P
Sbjct: 303 NRKIVVCPIIDVISDESFEY--VTASDMTWGG-FNWKLNFRWYRVPQREMDRRNGDRTQP 359
Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF++W CGG +E +PCS +GHV
Sbjct: 360 LRTPTMAGGLFSIDKDYFEEIGTYDEGMDIWGGENLEMSFRVWQCGGELEIIPCSHVGHV 419
Query: 314 YRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+R PY+F G +A ++ N RV E W D + K +FY P A +++GD+S +
Sbjct: 420 FRDKSPYSFPGGVA-----KIVNKNAARVAEVWMD-RWKDFFYEMNPGARSVEVGDVSSR 473
>gi|355689586|gb|AER98882.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Mustela putorius
furo]
Length = 320
Score = 297 bits (760), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 147/298 (49%), Positives = 200/298 (67%), Gaps = 7/298 (2%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 20 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNGKRYLETLP 77
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 78 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 137
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 138 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 196
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
+ID ID+ +FR + RG F+W M YK +P K S+P++SP AGG
Sbjct: 197 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 252
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 253 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPY 310
>gi|51315700|sp|Q6P6V1.1|GLT11_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
AltName: Full=Polypeptide GalNAc transferase 11;
Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 11;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 11
gi|38303875|gb|AAH62004.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[Rattus norvegicus]
Length = 608
Score = 297 bits (760), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 164/376 (43%), Positives = 221/376 (58%), Gaps = 17/376 (4%)
Query: 2 PVFKA----DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
P FKA D N+E P + + E G ++ E + D ++ NM SN
Sbjct: 69 PQFKANRMDDLMNNNIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLISNR 126
Query: 58 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
+ + R +PD R EC+ YP DLP ASV++ F+NE FS+L+RTVHS++ RTPA L EI
Sbjct: 127 LGYHRDVPDTRNAECRGKSYPTDLPTASVVICFYNEAFSALLRTVHSVVDRTPAHLLHEI 186
Query: 118 ILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 176
ILVDD S DL +L++YIQR+ KV++IRN +REGLIR R GA + GEV+VFLD+
Sbjct: 187 ILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDS 246
Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 236
HCEV + WL PLLA I D + PVID I T Y RG F WG+ +K
Sbjct: 247 HCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKW 302
Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
+ +P + + P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IW
Sbjct: 303 DLVPVSDLGGADSATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIW 362
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
MCGG + +PCSR+GH++R PY + D +T+N R+ W DE + YF
Sbjct: 363 MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYFSL 417
Query: 357 REPLAMFLDMGDISEQ 372
R L G+ISE+
Sbjct: 418 RPDLKT-KSFGNISER 432
>gi|404434384|ref|NP_001258248.1| polypeptide N-acetylgalactosaminyltransferase 11 [Rattus
norvegicus]
gi|404501473|ref|NP_955425.2| polypeptide N-acetylgalactosaminyltransferase 11 [Rattus
norvegicus]
gi|149031397|gb|EDL86387.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
isoform CRA_b [Rattus norvegicus]
Length = 609
Score = 297 bits (760), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 164/376 (43%), Positives = 221/376 (58%), Gaps = 17/376 (4%)
Query: 2 PVFKA----DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
P FKA D N+E P + + E G ++ E + D ++ NM SN
Sbjct: 70 PQFKANRMDDLMNNNIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLISNR 127
Query: 58 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
+ + R +PD R EC+ YP DLP ASV++ F+NE FS+L+RTVHS++ RTPA L EI
Sbjct: 128 LGYHRDVPDTRNAECRGKSYPTDLPTASVVICFYNEAFSALLRTVHSVVDRTPAHLLHEI 187
Query: 118 ILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 176
ILVDD S DL +L++YIQR+ KV++IRN +REGLIR R GA + GEV+VFLD+
Sbjct: 188 ILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDS 247
Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 236
HCEV + WL PLLA I D + PVID I T Y RG F WG+ +K
Sbjct: 248 HCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKW 303
Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
+ +P + + P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IW
Sbjct: 304 DLVPVSDLGGADSATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIW 363
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
MCGG + +PCSR+GH++R PY + D +T+N R+ W DE + YF
Sbjct: 364 MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYFSL 418
Query: 357 REPLAMFLDMGDISEQ 372
R L G+ISE+
Sbjct: 419 RPDLKT-KSFGNISER 433
>gi|344265184|ref|XP_003404666.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 10-like [Loxodonta
africana]
Length = 602
Score = 297 bits (760), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 159/352 (45%), Positives = 215/352 (61%), Gaps = 19/352 (5%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GHGEQGRPYPMTDAERV--DQAYRENGFNIYISDKISLNRSLPDIRHPNCNSKRYLEMLP 145
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + L R +G
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLHKPL----XRLHG 201
Query: 143 KVRLIRNT--EREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
+R + E EGLIRTR GA + +VI FLD+HCE +NWLPPLL I +RK +
Sbjct: 202 PFPSVRISVPETEGLIRTRMLGASAAIXDVITFLDSHCEANVNWLPPLLDRIARNRKTIV 261
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P+ID ID+ +FR + RG F+W M YK +P K S+P++SP A
Sbjct: 262 CPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMA 317
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 318 GGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPY 377
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N KRV E W DE + Y Y R P L GD++ Q
Sbjct: 378 KVP------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAAQ 422
>gi|149031398|gb|EDL86388.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
isoform CRA_c [Rattus norvegicus]
Length = 560
Score = 297 bits (760), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 159/362 (43%), Positives = 216/362 (59%), Gaps = 13/362 (3%)
Query: 12 NLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
N+E P + + E G ++ E + D ++ NM SN + + R +PD R E
Sbjct: 35 NIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLISNRLGYHRDVPDTRNAE 92
Query: 72 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
C+ YP DLP ASV++ F+NE FS+L+RTVHS++ RTPA L EIILVDD S DL
Sbjct: 93 CRGKSYPTDLPTASVVICFYNEAFSALLRTVHSVVDRTPAHLLHEIILVDDSSDFDDLKG 152
Query: 132 KLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
+L++YIQR+ KV++IRN +REGLIR R GA + GEV+VFLD+HCEV + WL PLLA
Sbjct: 153 ELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 212
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
I D + PVID I T Y RG F WG+ +K + +P +
Sbjct: 213 IILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPVSDLGGADSA 268
Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
+ P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IWMCGG + +PCSR+
Sbjct: 269 TAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRV 328
Query: 311 GHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
GH++R PY + D +T+N R+ W DE + YF R L G+IS
Sbjct: 329 GHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYFSLRPDLKT-KSFGNIS 382
Query: 371 EQ 372
E+
Sbjct: 383 ER 384
>gi|91088223|ref|XP_973543.1| PREDICTED: similar to polypeptide GalNAc transferase 5 CG31651-PA
[Tribolium castaneum]
gi|270011823|gb|EFA08271.1| hypothetical protein TcasGA2_TC005902 [Tribolium castaneum]
Length = 602
Score = 296 bits (759), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 151/359 (42%), Positives = 220/359 (61%), Gaps = 9/359 (2%)
Query: 15 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
P + P PGE GKA H+P N+ S+ IS +R++ D+R+E CK
Sbjct: 86 PTVLPAHGLPGEMGKAVHIPPEQEGLMKEKFKLNQFNLLASDMISLNRSLADVRLEGCKD 145
Query: 75 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
YP LP S+++VFHNE +S+L+RTV S+I R+P L+EIILVDD S + L +KLE
Sbjct: 146 KKYPKLLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRPLLKEIILVDDASEREHLGRKLE 205
Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
+Y+Q V ++R +R GLIR R GAK +G+VI FLDAHCE WL PLLA I
Sbjct: 206 EYVQTLPVPVIVLRTHKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIVQ 265
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEP 253
DRK + P+ID I +T+E+ + D + G F W + ++ +P+RE ++R + + P
Sbjct: 266 DRKTVVCPIIDVISDETFEY--ITASDMTWGG-FNWKLNFRWYRVPQREMERRNNDRTAP 322
Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG +E +PCS +GHV
Sbjct: 323 LRTPTMAGGLFSIDKEYFYELGSYDEGMDIWGGENLEMSFRVWQCGGKLEIIPCSHVGHV 382
Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+R PY F ++ + +N RV E W DE + ++Y P A + +GD+S +
Sbjct: 383 FRDKSPYTFPGGVSKI----VLHNAARVAEVWMDE-WRDFYYAMNPGARSVPVGDVSAR 436
>gi|125977364|ref|XP_001352715.1| GA15243 [Drosophila pseudoobscura pseudoobscura]
gi|54641464|gb|EAL30214.1| GA15243 [Drosophila pseudoobscura pseudoobscura]
Length = 676
Score = 296 bits (759), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 165/353 (46%), Positives = 221/353 (62%), Gaps = 14/353 (3%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLG-EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
G GEGGKA L + + + E G N S+ IS +R++PD+R + C+ DY +L
Sbjct: 150 GIGEGGKAAKLEDEATLEQERRMSLENGFNALLSDSISVNRSLPDIRHKLCRQKDYLANL 209
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRF 140
P SVI++F+NE S LMR+VHS+I R+P + L+EIILVDDFS + L +LE YI + F
Sbjct: 210 PTVSVIIIFYNEYLSVLMRSVHSLINRSPKELLKEIILVDDFSDRDYLHAELELYIKEHF 269
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
+ VR++R R GLI RS GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 270 SKIVRVVRLPNRTGLIGARSAGARNATAEVLLFLDSHVEANYNWLPPLLEPIAKNKRTAV 329
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P ID ID+ T+ +R+ D RG F+W YK L + + K Y ++P+KSP A
Sbjct: 330 CPFIDVIDHATFNYRA---QDEGARGAFDWEFYYKRLPLLDEDLK---YPADPFKSPVMA 383
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG + PCSRIGH+YR P
Sbjct: 384 GGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRIGHIYRG--PR 441
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
N + KG + NYKRV E W DE +K Y Y + + +D GD++EQ
Sbjct: 442 NH--VPSPRKGDYLHRNYKRVAEVWMDE-YKNYLYDHADGIYDRIDAGDLTEQ 491
>gi|195167889|ref|XP_002024765.1| GL22638 [Drosophila persimilis]
gi|194108170|gb|EDW30213.1| GL22638 [Drosophila persimilis]
Length = 676
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 165/353 (46%), Positives = 221/353 (62%), Gaps = 14/353 (3%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLG-EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
G GEGGKA L + + + E G N S+ IS +R++PD+R + C+ DY +L
Sbjct: 150 GIGEGGKAAKLEDEATLEQERRMSLENGFNALLSDSISVNRSLPDIRHKLCRQKDYLANL 209
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRF 140
P SVI++F+NE S LMR+VHS+I R+P + L+EIILVDDFS + L +LE YI + F
Sbjct: 210 PTVSVIIIFYNEYLSVLMRSVHSLINRSPKELLKEIILVDDFSDRDYLHAELELYIKEHF 269
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
+ VR++R R GLI RS GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 270 SKIVRVVRLPNRTGLIGARSAGARNATAEVLLFLDSHVEANYNWLPPLLEPIAKNKRTAV 329
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P ID ID+ T+ +R+ D RG F+W YK L + + K Y ++P+KSP A
Sbjct: 330 CPFIDVIDHATFNYRA---QDEGARGAFDWEFYYKRLPLLDEDLK---YPADPFKSPVMA 383
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG + PCSRIGH+YR P
Sbjct: 384 GGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRIGHIYRG--PR 441
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
N + KG + NYKRV E W DE +K Y Y + + +D GD++EQ
Sbjct: 442 NH--VPSPRKGDYLHRNYKRVAEVWMDE-YKNYLYDHADGIYDRIDAGDLTEQ 491
>gi|354486376|ref|XP_003505357.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Cricetulus griseus]
Length = 497
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 154/344 (44%), Positives = 216/344 (62%), Gaps = 9/344 (2%)
Query: 28 GKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVI 87
GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +LP SV+
Sbjct: 2 GKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDELPNTSVV 61
Query: 88 LVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLI 147
+VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++ V++I
Sbjct: 62 IVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKTLEVPVKII 121
Query: 148 RNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGI 207
R ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK + P+ID I
Sbjct: 122 RMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVVCPIIDVI 181
Query: 208 DYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAM 266
T+E+ + D Y G F W + ++ +P+RE +RK + + P ++PT AGGLF++
Sbjct: 182 SDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSI 238
Query: 267 DRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLA 326
DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R PY F
Sbjct: 239 DRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGT 298
Query: 327 DRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
G +I N +R+ E W DE K +FY P + +D GD+S
Sbjct: 299 ----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDVS 337
>gi|432097047|gb|ELK27545.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Myotis davidii]
Length = 558
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 153/332 (46%), Positives = 208/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ N+ SN + R +PD R CK YP DLP ASV++ F+NE S+L+RT
Sbjct: 96 DLGYQKHAFNLLISNRLGHHRDVPDTRNAACKDKIYPTDLPVASVVICFYNEALSALLRT 155
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRS 160
VHS++ RTPA+ L EIILVDD S DL +L++++Q+ GK++LIRNT+REGLIR R
Sbjct: 156 VHSVLDRTPARLLHEIILVDDSSDFDDLKGELDEFVQKHLPGKIKLIRNTKREGLIRGRM 215
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I DR+ + PVID I T Y
Sbjct: 216 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRRTVVCPVIDIISADTL----AYSS 271
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + + P KSPT AGGLFAM+R++F ELG YD G
Sbjct: 272 SPVVRGGFNWGLHFKWDLVPLSELEGPEGATAPIKSPTMAGGLFAMNRSYFSELGQYDSG 331
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 332 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 386
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R P G++SE+
Sbjct: 387 LAHVWLDEYKEQYFSLR-PDLRTRSYGNVSER 417
>gi|339249613|ref|XP_003373794.1| polypeptide N-acetylgalactosaminyltransferase 10 [Trichinella
spiralis]
gi|316970007|gb|EFV54023.1| polypeptide N-acetylgalactosaminyltransferase 10 [Trichinella
spiralis]
Length = 587
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 157/355 (44%), Positives = 219/355 (61%), Gaps = 15/355 (4%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASL--GEYGMNMETSNHISFDRTIPDLRMEECKYWDYP 78
++GPGE G+A++LP + G N S++++ +R+I DLR ++C Y
Sbjct: 75 RQGPGEQGEAFYLPNVSSVDHKKGILYKSNGFNALVSDYLALNRSIKDLRPKQCIGRSYL 134
Query: 79 LDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ 138
L K SV++ F+NE +++L+RTVHS++ R+P + L+E+IL DDFS K L Q LE Y++
Sbjct: 135 AKLEKVSVVIPFYNEHWTTLLRTVHSVVNRSPVELLQEVILADDFSDKPFLKQPLEAYVR 194
Query: 139 -RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
+ G VR++R +REGLIR R G+K + V+VFLD+H E G NWLPPLL P+ + +
Sbjct: 195 DTWPGLVRIVRARKREGLIRARLLGSKAAISSVLVFLDSHSECGYNWLPPLLEPVALNYR 254
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+T P +D ID+ T+ +R D RG F+W + YK L +A Y P+ SP
Sbjct: 255 TVTCPFVDVIDHSTFLYRL---QDQGARGSFDWELYYKRLPLLPEDAA---YPDRPFNSP 308
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGG FA+ +F ELGGYD GL +WGGE +ELSFKIW CGG++ VPCS +GH+YR F
Sbjct: 309 VMAGGYFAISTKWFWELGGYDEGLDIWGGEQYELSFKIWQCGGTLIDVPCSHVGHIYREF 368
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
P+ A+ G + NYKRV E W DE +K Y Y R P LD GDIS+Q
Sbjct: 369 SPF-----ANPGAGDFVGRNYKRVAEVWMDE-YKEYVYMRRPHYRKLDPGDISKQ 417
>gi|242005043|ref|XP_002423384.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212506428|gb|EEB10646.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 573
Score = 295 bits (755), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 159/356 (44%), Positives = 219/356 (61%), Gaps = 15/356 (4%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E ++ G GE GK LP+ + +A G N S+ I + ++PD+R CK Y
Sbjct: 60 ESHRIGVGEQGKPAFLPDKEKVQKEALYAVNGFNALLSDKIYLN-SLPDIRHPGCKEKKY 118
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
+L SV++ FHNE +S+L+RTV+S++ R+P+ L+EIILVDD+SSK L +KL+ Y+
Sbjct: 119 RKNLNTVSVVVPFHNEHWSTLLRTVYSVLNRSPSHLLKEIILVDDYSSKPFLKKKLDIYV 178
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
R KV++IR ER GLIR R GAK+++ +V++FLD+H E +NWLPPLL PI + K
Sbjct: 179 DRHLPKVKIIRLPERMGLIRARLAGAKKAKAQVLLFLDSHTEANVNWLPPLLEPIAENYK 238
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKS 256
P ID I + T+E+R+ D RG F+W YK LPE K+ +EP++S
Sbjct: 239 TCVCPFIDVIAHDTFEYRA---QDEGRRGAFDWEFFYKRLPLLPE----DLKHPTEPFQS 291
Query: 257 PTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRS 316
P AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIW CGG + PCSR+GH+YR
Sbjct: 292 PVMAGGLFAISAKFFWELGGYDEGLAIWGGEQYELSFKIWQCGGKMVDAPCSRVGHIYRK 351
Query: 317 FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
F P+ + D + NY+RV E W DE + Y Y R P +D GD++ Q
Sbjct: 352 FAPFPNPGIGD-----FVGKNYRRVAEVWMDE-YAEYLYKRRPHYRNIDPGDLTVQ 401
>gi|328723396|ref|XP_001946856.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 1 [Acyrthosiphon pisum]
Length = 615
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 149/359 (41%), Positives = 218/359 (60%), Gaps = 9/359 (2%)
Query: 15 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
PP+ + GEGG+ + A E N+ S+ IS +R++ D+R ECK
Sbjct: 103 PPVREKRGKHGEGGRGVTMKPEQEALMKQKFKENQFNIIASDMISLNRSLQDIRQGECKS 162
Query: 75 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
YP +P S+++VFHNE +S+L+RTV S+I R+P L+EI+LVDD S + L +KLE
Sbjct: 163 KQYPTLMPTTSIVIVFHNEAWSTLLRTVWSVINRSPRSLLKEILLVDDASERDFLGKKLE 222
Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
DY+ + +++R +R GLIR R GAK G+VI FLDAHCE WL PLLA I
Sbjct: 223 DYVATLPVETKVLRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECADGWLEPLLARIVL 282
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEP 253
+RK + PVID I T+E+ V D + G F W + ++ +P+RE +R + + P
Sbjct: 283 NRKTVVCPVIDVISDDTFEY--VTASDMTWGG-FNWKLNFRWYRVPQREMTRRNQDRTAP 339
Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
++PT AGGLF++D+ +F +LG YD G+ +WGGEN E+SF+IWMCGG++E PCS +GHV
Sbjct: 340 LRTPTMAGGLFSIDKDYFYQLGSYDEGMDIWGGENLEMSFRIWMCGGTLEISPCSHVGHV 399
Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+R PY F + + +N R+ E W DE K ++Y P A +++GD+SE+
Sbjct: 400 FRKSTPYTFPGGTSHI----VNHNNARLAEVWMDE-WKHFYYAINPGASNVEVGDVSER 453
>gi|221042448|dbj|BAH12901.1| unnamed protein product [Homo sapiens]
Length = 527
Score = 295 bits (754), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 161/351 (45%), Positives = 217/351 (61%), Gaps = 13/351 (3%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G G+GG ++ E + D ++ NM S+ + + R +PD R CK YP DLP
Sbjct: 13 GCGQGGMIFN--ERDQELRDLGYQKHAFNMLISDRLGYHRDVPDTRNAACKEKFYPPDLP 70
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-N 141
ASV++ F+NE FS+L+RTVHS+I RTPA L EIILVDD S DL +L++Y+Q++
Sbjct: 71 AASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLP 130
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL PLLA I DR +
Sbjct: 131 GKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVC 190
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
PVID I T Y RG F WG+ +K + +P E + + + P KSPT AG
Sbjct: 191 PVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGRAEGATAPIKSPTMAG 246
Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
GLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY
Sbjct: 247 GLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 306
Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ D +T+N R+ W DE + YF R L G+ISE+
Sbjct: 307 SPEGQD-----TMTHNSLRLAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 351
>gi|395838351|ref|XP_003792079.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Otolemur garnettii]
Length = 608
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 163/376 (43%), Positives = 229/376 (60%), Gaps = 17/376 (4%)
Query: 2 PVFKA----DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
P F+A D K G++E P++ + + E G ++ E + D ++ N+ SN
Sbjct: 69 PQFRANRIDDMKDGHVEDPVKDHLKFSSELGMIFN--ERDQELRDLGYQKHAFNVLISNR 126
Query: 58 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
+ + R +PD R CK YP DLP ASV++ F+NE FS+L+RTVHS+I RTP L E+
Sbjct: 127 LGYHRDVPDTRNAACKEQSYPTDLPVASVVICFYNEAFSALLRTVHSVIDRTPVHLLHEV 186
Query: 118 ILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 176
ILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R GA ++ GEV+VFLD+
Sbjct: 187 ILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAQATGEVLVFLDS 246
Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 236
HCEV + WL PLLA I D++ + PVID I T Y RG F WG+ +K
Sbjct: 247 HCEVNVMWLQPLLAAIREDQQTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKW 302
Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
+ +P E + + P KSPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IW
Sbjct: 303 DLVPLSELGGEEGATAPIKSPTMAGGLFAMNRQYFHDLGQYDSGMDIWGGENLEISFRIW 362
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
MCGG + +PCSR+GH++R PY + D +T+N R+ W DE + YF
Sbjct: 363 MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYFSL 417
Query: 357 REPLAMFLDMGDISEQ 372
R L G+ISE+
Sbjct: 418 RPDLKT-KSYGNISER 432
>gi|328723394|ref|XP_003247832.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 2 [Acyrthosiphon pisum]
Length = 615
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 147/359 (40%), Positives = 220/359 (61%), Gaps = 9/359 (2%)
Query: 15 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
PP+ + GEGG+ + A E N+ S+ IS +R++ D+R ECK
Sbjct: 103 PPVREKRGKHGEGGRGVTMKPEQEALMKQKFKENQFNIIASDMISLNRSLQDIRQGECKS 162
Query: 75 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
YP +P S+++VFHNE +S+L+RTV S+I R+P L+EI+LVDD S + L +KLE
Sbjct: 163 KQYPTLMPTTSIVIVFHNEAWSTLLRTVWSVINRSPRSLLKEILLVDDASERDFLGKKLE 222
Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
DY+ + +++R +R GLIR R GAK G+VI FLDAHCE WL PLLA I
Sbjct: 223 DYVATLPVETKVLRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECADGWLEPLLARIVL 282
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEP 253
+RK + PVID I T+E+ V D + G F W + ++ +P+RE +R + + P
Sbjct: 283 NRKTVVCPVIDVISDDTFEY--VTASDMTWGG-FNWKLNFRWYRVPQREMTRRNQDRTAP 339
Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
++PT AGGLF++D+ +F +LG YD G+ +WGGEN E+SF++W CGG++E +PCS +GHV
Sbjct: 340 LRTPTMAGGLFSIDKDYFYQLGSYDEGMDIWGGENLEMSFRVWQCGGTLEIIPCSHVGHV 399
Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+R PY+F ++ + +N RV E W DE + ++Y P A +++GD+SE+
Sbjct: 400 FRDKSPYSFPGGVSKI----VLHNAARVAEVWMDE-WRDFYYAMNPGASNVEVGDVSER 453
>gi|3047207|gb|AAC13679.1| GLY9 [Caenorhabditis elegans]
Length = 579
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 151/356 (42%), Positives = 224/356 (62%), Gaps = 13/356 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK--YWDYP 78
+EGPGE GK L G A + ++ MN+ S+ IS DR +PD R++ CK +DY
Sbjct: 72 REGPGEKGKPVVLTGKDAELGQADMKKWFMNVHASDKISLDRDVPDPRIQACKDIKYDYA 131
Query: 79 LDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ 138
LPK SVI++F +E ++ L+RTVHS+I R+P + L+E+IL+DD S + +L + L+++I+
Sbjct: 132 A-LPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQEVILLDDNSKRQELQEPLDEHIK 190
Query: 139 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
RF GKVRLIR +R GLIR + GA+E+ G++IVFLD+HCE WL P++ I +R
Sbjct: 191 RFGGKVRLIRKHDRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEPIVQRISDERTA 250
Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPT 258
+ P+ID I T + + G F W + + L E E K+R ++ +SPT
Sbjct: 251 IVCPMIDSISDNTLAYHGDWSLS---TGGFSWALHFTWEGLSEEEQKRRTKPTDYIRSPT 307
Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
AGGL A +R +F E+GGYD + +WGGEN E+SF+ WMCGGSIE++PCS +GH++R+
Sbjct: 308 MAGGLLAANREYFFEVGGYDEEMDIWGGENLEISFRAWMCGGSIEFIPCSHVGHIFRAGH 367
Query: 319 PYNF-GKLADR-VKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
PYN G+ ++ V G N KR+ E W D+ + Y+ RE L D+GD++ +
Sbjct: 368 PYNMTGRNNNKDVHGT----NSKRLAEVWMDDYKRLYYMHREDLRT-KDVGDLTAR 418
>gi|297682043|ref|XP_002818744.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11,
partial [Pongo abelii]
Length = 587
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 157/332 (47%), Positives = 208/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPDLPAASVVVCFYNEAFSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS+I RTPA L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 171 VHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I DR + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELRGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRSDLKT-KSYGNISER 432
>gi|156397426|ref|XP_001637892.1| predicted protein [Nematostella vectensis]
gi|156225008|gb|EDO45829.1| predicted protein [Nematostella vectensis]
Length = 513
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 150/352 (42%), Positives = 218/352 (61%), Gaps = 12/352 (3%)
Query: 25 GEGGK-AYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK--YWDYPLDL 81
G GGK A+ E + + + N S+ IS DRT+ D+R E CK + YP L
Sbjct: 8 GGGGKPAFLESEENKKLAEKYFANHSFNWLLSDKISLDRTLDDVRSERCKAKHNTYPAKL 67
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI+ FH E S L+RTVHS+I RTP + L E+I+VDDFS A L + L+D++ +F
Sbjct: 68 PTTSVIICFHKERLSVLLRTVHSVINRTPPELLAEVIVVDDFSQDAKLGKPLDDHVAQFT 127
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
KV+++R +REGL+R R +GA ++G+V+ FLD+HCE W PLLA I +DR+ +
Sbjct: 128 -KVKVLRMKKREGLVRARLQGANTAKGDVLTFLDSHCEATPGWAEPLLARIAADRRNVVC 186
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
P I+ I+ T+ ++ D RG F W + +K +P E K R +S+P ++PT AG
Sbjct: 187 PAIEVINADTFAYQGSTNADQ--RGGFSWDLFFKWKGIPPEEQKLRNDDSDPIRTPTMAG 244
Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM-PY 320
GLF++ R +F ++G YD + +WGGEN ELSF++WMCGG +E V CSR+GHV+R + PY
Sbjct: 245 GLFSIHRQYFFDIGSYDEEMDIWGGENLELSFRVWMCGGRLEIVTCSRVGHVFRKYTSPY 304
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
F +R +T N+ R+ E W DE +K +Y ++P A D GDIS++
Sbjct: 305 KFPDGVERT----LTKNFNRLAEVWMDE-YKDLYYNKKPQAKNSDYGDISKR 351
>gi|341878756|gb|EGT34691.1| CBN-GLY-9 protein [Caenorhabditis brenneri]
Length = 579
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 151/356 (42%), Positives = 225/356 (63%), Gaps = 13/356 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK--YWDYP 78
+EGPGE GK L G A + ++ MN+ S+ IS DR +PD R++ CK +DY
Sbjct: 72 REGPGEKGKPVVLTGKDAELGQADMKKWFMNVHASDKISLDRDVPDPRIQACKDIKYDYS 131
Query: 79 LDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ 138
LPK SVI++F +E ++ L+RTVHS+I R+P + L+E+IL+DD S + +L + L+++I+
Sbjct: 132 -SLPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQEVILLDDNSKRQELQEPLDEHIK 190
Query: 139 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
RF GKV+LIR R GLIR + GA+E+ G++IVFLD+HCE WL P++ I +R
Sbjct: 191 RFGGKVKLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEPIVQRISDERTA 250
Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPT 258
+ P+ID I T + + G F W + + +PE E K+RK ++ +SPT
Sbjct: 251 IVCPMIDSISDSTLAYHGDWSLS---VGGFSWALHFTWEGIPEDEQKRRKKPTDYIRSPT 307
Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
AGGL A +R +F E+GGYD + +WGGEN E+SF+ WMCGGSIE++PCS +GH++R+
Sbjct: 308 MAGGLLAANREYFFEVGGYDEEMDIWGGENLEISFRNWMCGGSIEFIPCSHVGHIFRAGH 367
Query: 319 PYNF-GKLADR-VKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
PYN G+ ++ V G N KR+ E W D+ + Y+ RE L D+GD++ +
Sbjct: 368 PYNMTGRNNNKDVHGT----NSKRLAEVWMDDYKRLYYMHREDLRT-KDVGDLTSR 418
>gi|157135226|ref|XP_001663438.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108870268|gb|EAT34493.1| AAEL013274-PA [Aedes aegypti]
Length = 592
Score = 294 bits (752), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 151/351 (43%), Positives = 215/351 (61%), Gaps = 11/351 (3%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
PGE GK +P + + E N+ S+ I +R++ D+R +CK YP LP
Sbjct: 82 PGELGKPVKIPSSQQELMKEKFKENQFNLLASDMIWLNRSLTDVRHHDCKKKHYPTKLPT 141
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
S+++VFHNE +S+L+RT+ S+I R+P L+EIILVDD S + L Q+LEDY+Q
Sbjct: 142 TSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASERDHLGQQLEDYVQTLPVH 201
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
++R +R GLIR R GAK +G+VI FLDAHCE WL PLLA I DRK + P+
Sbjct: 202 TYVLRTGKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIVLDRKTVVCPI 261
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
ID I +T+E+ V D + G F W + ++ +P RE ++R ++ + P ++PT AGG
Sbjct: 262 IDVISDETFEY--VTASDQTWGG-FNWKLNFRWYRVPAREMQRRNHDRTAPLRTPTMAGG 318
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG +E PCS +GHV+R PY F
Sbjct: 319 LFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIAPCSHVGHVFRDKSPYTF 378
Query: 323 -GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G +A+ ++ N RV E W DE K ++Y P A GD+SE+
Sbjct: 379 PGGVAN-----IVLKNAARVAEVWLDE-WKEFYYQMSPGARKASAGDVSER 423
>gi|390333619|ref|XP_785951.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 756
Score = 294 bits (752), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 152/354 (42%), Positives = 213/354 (60%), Gaps = 14/354 (3%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
E PG GK +P ++ D N+ S+ I +R++PD+R ++C Y Y L
Sbjct: 248 ELPGANGKPVQIPSELQSEADDLFIINSFNLMASDMIGINRSLPDVRPKQCLYKQYSSAL 307
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI+VFHNE +S+L+RTVHS+I RTP QYL EIILVDD S A L +L+ Y+ +
Sbjct: 308 PNTSVIIVFHNEAWSALLRTVHSVINRTPRQYLSEIILVDDASIHAHLGHQLDSYVAKLP 367
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
V + R R GLIR R RGA ++G+V+ FLD+HCE WL PLLA I DR +
Sbjct: 368 VPVHVERMGVRSGLIRARMRGALVAQGQVLTFLDSHCEASHGWLEPLLARIAEDRSNVVT 427
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYR--GIFEWGMLYKENELPEREAKKRKYN-SEPYKSPT 258
PVID I+ Q YE D+ G+F+W + ++ + R+ K++ + P SPT
Sbjct: 428 PVIDVINAQNL----AYEADNQTPAIGVFDWSLTFRWQSIQRRDLPLLKHDPTHPIPSPT 483
Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
AGGLFA+DR++F+E G YD G +WG EN E+SFK WMCGG IE +PCS +GH++R
Sbjct: 484 MAGGLFAIDRSYFIETGMYDSGFEIWGAENLEISFKTWMCGGRIEILPCSHVGHIFRKHA 543
Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
PY+ L D I+YN KR+ E W D +K +FY P A+ ++ G+ +++
Sbjct: 544 PYS-NTLTD-----FISYNNKRLAEVWLD-GYKEFFYFMSPSALKVNAGNYTDR 590
>gi|112418488|gb|AAI21876.1| galnt13 protein [Xenopus (Silurana) tropicalis]
Length = 483
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 212/323 (65%), Gaps = 9/323 (2%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N+ S+ I+ +R++PD+R+E CK YP +LP S+++VFHNE +S+L+RTVHS+I R+P
Sbjct: 11 NLMASDLIALNRSLPDVRLEGCKTKVYPDELPNTSIVIVFHNEAWSTLLRTVHSVINRSP 70
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
+ + EIILVDD S + L LE+Y++ V+++R +R GLIR R RGA ++G++
Sbjct: 71 HRLISEIILVDDSSERDFLKSPLENYVKHLEVPVKILRMEQRSGLIRARLRGANVAKGQI 130
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
I FLDAHCE + WL PLLA I DRK + P+ID I T+E+ + D Y G F W
Sbjct: 131 ITFLDAHCECTIGWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNW 187
Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+ ++ +P+RE +RK + + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN
Sbjct: 188 KLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEELGTYDSGMDIWGGENL 247
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF+IW CGGS+E V CS +GHV+R PY F G +I N +R+ E W D+
Sbjct: 248 EMSFRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDD- 302
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
K +FY P + +D GD+SE+
Sbjct: 303 FKDFFYIISPGVVKVDYGDVSER 325
>gi|270006170|gb|EFA02618.1| hypothetical protein TcasGA2_TC008338 [Tribolium castaneum]
Length = 613
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 159/354 (44%), Positives = 211/354 (59%), Gaps = 16/354 (4%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GK L A + G N S+ I+ DR +PD+R CK Y D
Sbjct: 93 RRGTGEQGKPAFLTAAESDNYEKLYKVNGFNAALSDQIAIDRAVPDIRHPGCKSKKYLKD 152
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP SV++ FHNE +++L+RT S++ R+P L+E+ILVDD S+K + L+DY+
Sbjct: 153 LPTVSVVVPFHNEHWTTLLRTAASVVNRSPPHLLKEVILVDDCSTKEFSKKPLDDYLAAN 212
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
KVR I ER GLIR R GA+ + +V++FLD+H E +NWLPPLL PI D K
Sbjct: 213 LTKVRAIHLPERSGLIRARLAGARVATADVLIFLDSHTEANVNWLPPLLEPIAQDYKTCV 272
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTH 259
P ID I Y+T+E+R+ D RG F+W YK LPE ++ +EP+KSP
Sbjct: 273 CPFIDVIQYETFEYRA---QDEGARGAFDWEFFYKRLPLLPE----DLEHPTEPFKSPVM 325
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLFA+ R FF ELGGYD GL +WGGE +ELSFKIW CGG + PCSR+GH+YR + P
Sbjct: 326 AGGLFAISRKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGLMVDAPCSRVGHIYRKYAP 385
Query: 320 Y-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ N G KG + NY+RV E W DE + Y Y R P +D GD+++Q
Sbjct: 386 FPNPG------KGDFVGRNYRRVAEVWMDE-YAEYLYKRRPHYRDIDPGDLTKQ 432
>gi|268370155|ref|NP_001161257.1| polypeptide GalNAc transferase 6-like [Tribolium castaneum]
Length = 591
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 159/354 (44%), Positives = 211/354 (59%), Gaps = 16/354 (4%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE GK L A + G N S+ I+ DR +PD+R CK Y D
Sbjct: 71 RRGTGEQGKPAFLTAAESDNYEKLYKVNGFNAALSDQIAIDRAVPDIRHPGCKSKKYLKD 130
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP SV++ FHNE +++L+RT S++ R+P L+E+ILVDD S+K + L+DY+
Sbjct: 131 LPTVSVVVPFHNEHWTTLLRTAASVVNRSPPHLLKEVILVDDCSTKEFSKKPLDDYLAAN 190
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
KVR I ER GLIR R GA+ + +V++FLD+H E +NWLPPLL PI D K
Sbjct: 191 LTKVRAIHLPERSGLIRARLAGARVATADVLIFLDSHTEANVNWLPPLLEPIAQDYKTCV 250
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTH 259
P ID I Y+T+E+R+ D RG F+W YK LPE ++ +EP+KSP
Sbjct: 251 CPFIDVIQYETFEYRA---QDEGARGAFDWEFFYKRLPLLPE----DLEHPTEPFKSPVM 303
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLFA+ R FF ELGGYD GL +WGGE +ELSFKIW CGG + PCSR+GH+YR + P
Sbjct: 304 AGGLFAISRKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGLMVDAPCSRVGHIYRKYAP 363
Query: 320 Y-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ N G KG + NY+RV E W DE + Y Y R P +D GD+++Q
Sbjct: 364 FPNPG------KGDFVGRNYRRVAEVWMDE-YAEYLYKRRPHYRDIDPGDLTKQ 410
>gi|21450297|ref|NP_659157.1| polypeptide N-acetylgalactosaminyltransferase 11 [Mus musculus]
gi|51316059|sp|Q921L8.1|GLT11_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
AltName: Full=Polypeptide GalNAc transferase 11;
Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 11;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 11
gi|15030306|gb|AAH11428.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 [Mus musculus]
gi|18204499|gb|AAH21504.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 [Mus musculus]
gi|21529335|emb|CAC79626.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Mus
musculus]
gi|21707973|gb|AAH34185.1| Galnt11 protein [Mus musculus]
gi|23274082|gb|AAH36143.1| Galnt11 protein [Mus musculus]
gi|23274085|gb|AAH36145.1| Galnt11 protein [Mus musculus]
gi|33321872|gb|AAQ06668.1| UDP-GalNAc:polypeptide N-Acetylgalactosaminyltransferase T11 [Mus
musculus]
gi|74149639|dbj|BAE36442.1| unnamed protein product [Mus musculus]
gi|148671131|gb|EDL03078.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11, isoform CRA_b [Mus
musculus]
Length = 608
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 164/376 (43%), Positives = 223/376 (59%), Gaps = 17/376 (4%)
Query: 2 PVFKAD--GKLGN--LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
P FKA+ +L N +E P + + E G ++ E + D ++ NM SN
Sbjct: 69 PQFKANRIDRLMNNHIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLISNR 126
Query: 58 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
+ + R +PD R EC+ YP DLP AS+++ F+NE FS+L+RTVHS++ RTPA L EI
Sbjct: 127 LGYHRDVPDTRNAECRRKSYPTDLPTASIVICFYNEAFSALLRTVHSVVDRTPAHLLHEI 186
Query: 118 ILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 176
ILVDD S DL +L++YIQR+ KV++IRN +REGLIR R GA + GEV+VFLD+
Sbjct: 187 ILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDS 246
Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 236
HCEV + WL PLLA I D + PVID I T Y RG F WG+ +K
Sbjct: 247 HCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKW 302
Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
+ +P E + P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IW
Sbjct: 303 DLVPVSELGGPDGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIW 362
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
MCGG + +PCSR+GH++R PY + D +T+N R+ W DE + YF
Sbjct: 363 MCGGKLFILPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYFSL 417
Query: 357 REPLAMFLDMGDISEQ 372
R L G+ISE+
Sbjct: 418 RPDLKN-KSFGNISER 432
>gi|109068965|ref|XP_001105286.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
6 [Macaca mulatta]
gi|355561195|gb|EHH17881.1| hypothetical protein EGK_14364 [Macaca mulatta]
Length = 608
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 166/378 (43%), Positives = 225/378 (59%), Gaps = 21/378 (5%)
Query: 2 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETS 55
P FKA+ ++ ++ + E P EG + E + D ++ NM S
Sbjct: 69 PQFKAN----KIDDVIDSHVEDPEEGHLKFSSELGMIFNERDQELRDLGYQKHAFNMLIS 124
Query: 56 NHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
N + + R +PD R CK YP DLP ASV++ F+NE FS+L+RTVHS+I RTPA L
Sbjct: 125 NRLGYRRNVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLH 184
Query: 116 EIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R GA + GEV+VFL
Sbjct: 185 EIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFL 244
Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 234
D+HCEV + WL PLLA I DR + PVID I T Y RG F WG+ +
Sbjct: 245 DSHCEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHF 300
Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
K + +P E + + + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+
Sbjct: 301 KWDLVPLSELGEAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFR 360
Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
IWMCGG + +PCSR+GH++R PY + D +T+N R+ W DE + YF
Sbjct: 361 IWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYF 415
Query: 355 YTREPLAMFLDMGDISEQ 372
R L G+ISE+
Sbjct: 416 SLRPDLKT-KSYGNISER 432
>gi|26352932|dbj|BAC40096.1| unnamed protein product [Mus musculus]
Length = 608
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 164/376 (43%), Positives = 223/376 (59%), Gaps = 17/376 (4%)
Query: 2 PVFKAD--GKLGN--LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
P FKA+ +L N +E P + + E G ++ E + D ++ NM SN
Sbjct: 69 PQFKANRIDRLMNNHIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLISNR 126
Query: 58 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
+ + R +PD R EC+ YP DLP AS+++ F+NE FS+L+RTVHS++ RTPA L EI
Sbjct: 127 LGYHRDVPDTRNAECRRKSYPTDLPTASIVICFYNEAFSALLRTVHSVVDRTPAHLLHEI 186
Query: 118 ILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 176
ILVDD S DL +L++YIQR+ KV++IRN +REGLIR R GA + GEV+VFLD+
Sbjct: 187 ILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDS 246
Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 236
HCEV + WL PLLA I D + PVID I T Y RG F WG+ +K
Sbjct: 247 HCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKW 302
Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
+ +P E + P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IW
Sbjct: 303 DLVPVSELGGPDGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIW 362
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
MCGG + +PCSR+GH++R PY + D +T+N R+ W DE + YF
Sbjct: 363 MCGGKLFILPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYFSL 417
Query: 357 REPLAMFLDMGDISEQ 372
R L G+ISE+
Sbjct: 418 RPDLKN-KSFGNISER 432
>gi|301759363|ref|XP_002915525.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Ailuropoda melanoleuca]
gi|281339844|gb|EFB15428.1| hypothetical protein PANDA_003531 [Ailuropoda melanoleuca]
Length = 608
Score = 293 bits (751), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 162/377 (42%), Positives = 226/377 (59%), Gaps = 17/377 (4%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETSN 56
V ++ K+ ++ ++ + E P +G + E + D ++ NM SN
Sbjct: 66 VLESQFKVNRIDDMIDSHVEDPEKGNMKFSSELGMIFNERDQELRDLGYQKHAFNMLISN 125
Query: 57 HISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 116
+ + R +PD R CK YP+DLP ASV++ F+NE S+L+RTVHS++ RTPAQ L E
Sbjct: 126 RLGYHRDVPDTRNAACKDKSYPVDLPVASVVICFYNEALSALLRTVHSVLDRTPAQLLHE 185
Query: 117 IILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
IILVDD S DL +LE+Y+Q++ GK+++IRNT+REGLIR R GA + GEV+VFLD
Sbjct: 186 IILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLD 245
Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 235
+HCEV + WL PLLA I D++ + PVID I T Y RG F WG+ +K
Sbjct: 246 SHCEVNVMWLQPLLAAIQQDQRTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFK 301
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
+ +P E + + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+I
Sbjct: 302 WDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRI 361
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
WMCGG + +PCSR+GH++R PY + D +T+N R+ W DE + YF
Sbjct: 362 WMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYFS 416
Query: 356 TREPLAMFLDMGDISEQ 372
R L G+ISE+
Sbjct: 417 LRPDLRT-KSYGNISER 432
>gi|195377912|ref|XP_002047731.1| GJ13596 [Drosophila virilis]
gi|194154889|gb|EDW70073.1| GJ13596 [Drosophila virilis]
Length = 675
Score = 293 bits (750), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 162/352 (46%), Positives = 216/352 (61%), Gaps = 14/352 (3%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+A L E+ R E G N S+ IS +R++PD+R +EC+ Y LP
Sbjct: 152 GIGEHGEAAKLDESLRDKEQVLSLENGFNALLSDSISVNRSLPDIRHKECRKKQYLSKLP 211
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
SVI++F+NE S LMR+VHS+I R+P + L+EIILVDDFS +A L + LEDY+
Sbjct: 212 NVSVIIIFYNEYLSVLMRSVHSLINRSPPELLKEIILVDDFSDRAPLFKPLEDYVAEHFS 271
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VR++R +R GLI RS GA+ + +V++FLD+H E NWLPPLL PI +++ P
Sbjct: 272 MVRIVRLPQRTGLIGARSAGARNATADVLIFLDSHVEANYNWLPPLLDPIAQNKRAAVCP 331
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTHAG 261
ID ID+ + +R+ D RG F+W YK LPE K+ S+P+KSP AG
Sbjct: 332 FIDVIDHSNFNYRA---QDEGARGAFDWDFFYKRLPLLPE----DLKHPSDPFKSPVMAG 384
Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
GLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG + PCSR+GH+YR P
Sbjct: 385 GLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRVGHIYRG--PRQ 442
Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
K + G + NYKRV E W DE +K Y Y + + +D GD++ Q
Sbjct: 443 GVK--NPRSGDYLHKNYKRVAEVWMDE-YKNYLYNHGDGIYDNVDPGDLTAQ 491
>gi|355748155|gb|EHH52652.1| hypothetical protein EGM_13122 [Macaca fascicularis]
Length = 608
Score = 293 bits (750), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 166/378 (43%), Positives = 225/378 (59%), Gaps = 21/378 (5%)
Query: 2 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETS 55
P FKA+ ++ ++ + E P EG + E + D ++ NM S
Sbjct: 69 PQFKAN----KIDDVIDSHVEDPEEGHLKFSSELGMIFNERDQELRDLGYQKHAFNMLIS 124
Query: 56 NHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
N + + R +PD R CK YP DLP ASV++ F+NE FS+L+RTVHS+I RTPA L
Sbjct: 125 NRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLH 184
Query: 116 EIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R GA + GEV+VFL
Sbjct: 185 EIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFL 244
Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 234
D+HCEV + WL PLLA I DR + PVID I T Y RG F WG+ +
Sbjct: 245 DSHCEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHF 300
Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
K + +P E + + + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+
Sbjct: 301 KWDLVPLSELGEAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFR 360
Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
IWMCGG + +PCSR+GH++R PY + D +T+N R+ W DE + YF
Sbjct: 361 IWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYF 415
Query: 355 YTREPLAMFLDMGDISEQ 372
R L G+ISE+
Sbjct: 416 SLRPDLKT-KSYGNISER 432
>gi|62859717|ref|NP_001017277.1| polypeptide N-acetylgalactosaminyltransferase 13 [Xenopus
(Silurana) tropicalis]
gi|89267464|emb|CAJ81616.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13)
[Xenopus (Silurana) tropicalis]
Length = 498
Score = 293 bits (750), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 212/323 (65%), Gaps = 9/323 (2%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N+ S+ I+ +R++PD+R+E CK YP +LP S+++VFHNE +S+L+RTVHS+I R+P
Sbjct: 11 NLMASDLIALNRSLPDVRLEGCKTKVYPDELPNTSIVIVFHNEAWSTLLRTVHSVINRSP 70
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
+ + EIILVDD S + L LE+Y++ V+++R +R GLIR R RGA ++G++
Sbjct: 71 HRLISEIILVDDSSERDFLKSPLENYVKHLEVPVKILRMEQRSGLIRARLRGANVAKGQI 130
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
I FLDAHCE + WL PLLA I DRK + P+ID I T+E+ + D Y G F W
Sbjct: 131 ITFLDAHCECTIGWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNW 187
Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+ ++ +P+RE +RK + + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN
Sbjct: 188 KLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEELGTYDSGMDIWGGENL 247
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF+IW CGGS+E V CS +GHV+R PY F G +I N +R+ E W D+
Sbjct: 248 EMSFRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDD- 302
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
K +FY P + +D GD+SE+
Sbjct: 303 FKDFFYIISPGVVKVDYGDVSER 325
>gi|148671130|gb|EDL03077.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11, isoform CRA_a [Mus
musculus]
Length = 529
Score = 293 bits (750), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 153/332 (46%), Positives = 204/332 (61%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R EC+ YP DLP AS+++ F+NE FS+L+RT
Sbjct: 32 DLGYQKHAFNMLISNRLGYHRDVPDTRNAECRRKSYPTDLPTASIVICFYNEAFSALLRT 91
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS++ RTPA L EIILVDD S DL +L++YIQR+ KV++IRN +REGLIR R
Sbjct: 92 VHSVVDRTPAHLLHEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRM 151
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I D + PVID I T Y
Sbjct: 152 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSS 207
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + P +SPT AGGLFAM+R +F +LG YD G
Sbjct: 208 SPVVRGGFNWGLHFKWDLVPVSELGGPDGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSG 267
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 268 MDIWGGENLEISFRIWMCGGKLFILPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 322
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 323 LAHVWLDEYKEQYFSLRPDLKN-KSFGNISER 353
>gi|328712307|ref|XP_001942933.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
10-like [Acyrthosiphon pisum]
Length = 592
Score = 293 bits (750), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 154/358 (43%), Positives = 217/358 (60%), Gaps = 15/358 (4%)
Query: 17 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
+E + G GE G + L R D G N S+ IS +R+IPD+R + C++
Sbjct: 77 IEKQRTGIGEQGVSASLSSHNRHKYDELYKVNGFNALLSDSISVNRSIPDIRHKLCRFKK 136
Query: 77 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
Y LP SV++ FHNE FS+L+RTV+S++ R+P L+EIILVDD S+K L + L+++
Sbjct: 137 YNSKLPTVSVVIPFHNEHFSTLLRTVYSVLNRSPKILLKEIILVDDSSTKTSLKRPLDNF 196
Query: 137 I-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
+ V++I +R+GLIR R GA+++ E+++FLD+H E NWLPPLL PI D
Sbjct: 197 LSNNLADTVQIIHLKKRQGLIRARLAGARKATSEILIFLDSHTEANANWLPPLLEPITED 256
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKKRKYNSEPY 254
+ P ID I ++T+E+R+ D RG F+W YK L PE Y ++P+
Sbjct: 257 YRTCVCPFIDVIAFETFEYRA---QDEGARGAFDWEFFYKRLPLLPEDLL----YPTKPF 309
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
+SP AGGLFA+ +F ELGGYDPGL +WGGE +ELSFKIW CGG+I PCSR+GH+Y
Sbjct: 310 RSPVMAGGLFAISAKWFWELGGYDPGLDIWGGEQYELSFKIWQCGGTILDAPCSRVGHIY 369
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
R F P+ + D + NY+RV E W DE + Y Y R P ++ GDI++Q
Sbjct: 370 RKFAPFPNPGIGD-----FVGKNYRRVAEVWMDE-YAEYLYLRRPHYRNINTGDITKQ 421
>gi|354478256|ref|XP_003501331.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Cricetulus griseus]
gi|344235668|gb|EGV91771.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Cricetulus
griseus]
Length = 608
Score = 293 bits (750), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 153/332 (46%), Positives = 204/332 (61%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R +C+ YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAKCRGKSYPADLPTASVVICFYNEAFSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS++ RTPA L EIILVDD S DL +L++YIQR+ KV++IRN +REGLIR R
Sbjct: 171 VHSVVDRTPAHLLHEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNRKREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I D + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSS 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + P +SPT AGGLFAM+R +F +LG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPVSELGGADGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-KSFGNISER 432
>gi|268572569|ref|XP_002641355.1| C. briggsae CBR-GLY-9 protein [Caenorhabditis briggsae]
Length = 579
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 153/362 (42%), Positives = 225/362 (62%), Gaps = 13/362 (3%)
Query: 15 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK- 73
P +EGPGE GK L G A + ++ MN+ S+ IS DR +PD R++ CK
Sbjct: 66 PDYSQPREGPGEKGKPVVLSGKEAELGHADMKKWFMNVHASDKISLDRDVPDPRIQACKD 125
Query: 74 -YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
+DY LPK SVI++F +E ++ L+RTVHS+I R+P + L+EIIL+DD S + +L +
Sbjct: 126 IKYDYAT-LPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQEIILLDDNSKRQELQEP 184
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
L+++I+RF GKVRLIR R GLIR + GA+E+ G++IVFLD+HCE WL P++ I
Sbjct: 185 LDEHIKRFGGKVRLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEPIVQRI 244
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 252
+R + P+ID I T + + G F W + + LP+ E K+R ++
Sbjct: 245 SDERTAIVCPMIDSISDSTLAYHGDWSLS---VGGFSWALHFTWEGLPDEELKRRTKVTD 301
Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
+SPT AGGL A +R +F E+GGYD + +WGGEN E+SF+ WMCGGSIE++PCS +GH
Sbjct: 302 YIRSPTMAGGLLAANREYFFEVGGYDEEMDIWGGENLEISFRNWMCGGSIEFIPCSHVGH 361
Query: 313 VYRSFMPYNF-GKLADR-VKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
++R+ PYN G+ ++ V G N KR+ E W D+ + Y+ RE L D+GD++
Sbjct: 362 IFRAGHPYNMTGRNNNKDVHGT----NSKRLAEVWMDDYKRLYYMHREDLRT-KDVGDLT 416
Query: 371 EQ 372
+
Sbjct: 417 AR 418
>gi|380786043|gb|AFE64897.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
gi|383411811|gb|AFH29119.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
gi|384942402|gb|AFI34806.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
Length = 608
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 166/378 (43%), Positives = 225/378 (59%), Gaps = 21/378 (5%)
Query: 2 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETS 55
P FKA+ ++ ++ + E P EG + E + D ++ NM S
Sbjct: 69 PQFKAN----KIDDVIDSHVEDPEEGHLKFSSELGMIFNERDQELRDLGYQKHAFNMLIS 124
Query: 56 NHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
N + + R +PD R CK YP DLP ASV++ F+NE FS+L+RTVHS+I RTPA L
Sbjct: 125 NRLGYRRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLH 184
Query: 116 EIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R GA + GEV+VFL
Sbjct: 185 EIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFL 244
Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 234
D+HCEV + WL PLLA I DR + PVID I T Y RG F WG+ +
Sbjct: 245 DSHCEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHF 300
Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
K + +P E + + + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+
Sbjct: 301 KWDLVPLSELGEAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFR 360
Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
IWMCGG + +PCSR+GH++R PY + D +T+N R+ W DE + YF
Sbjct: 361 IWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYF 415
Query: 355 YTREPLAMFLDMGDISEQ 372
R L G+ISE+
Sbjct: 416 SLRPDLKT-KSYGNISER 432
>gi|327281387|ref|XP_003225430.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 3 [Anolis carolinensis]
Length = 498
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 148/321 (46%), Positives = 209/321 (65%), Gaps = 9/321 (2%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N+ S+ I+ +R++PD+R+E CK YP +LP SV++VFHNE +S+L+RT++S+I R P
Sbjct: 11 NLMASDMIALNRSLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTIYSVINRAP 70
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
L EIILVDD S + L LE+Y++ V+++R +R GLIR R RGA S+G+V
Sbjct: 71 HYLLAEIILVDDASERDFLKVPLENYVKTLQVPVKIMRMEQRSGLIRARLRGAAASKGQV 130
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
I FLDAHCE L WL PLLA I DRKI+ P+ID I T+E+ + D Y G F W
Sbjct: 131 ITFLDAHCECTLGWLEPLLARIKEDRKIVVCPIIDVISDDTFEYMA--GSDMTYGG-FNW 187
Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+ ++ +P+RE +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN
Sbjct: 188 KLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENL 247
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF+IW CGGS+E V CS +GHV+R PY F G +I N +R+ E W DE
Sbjct: 248 EMSFRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE- 302
Query: 350 HKAYFYTREPLAMFLDMGDIS 370
K +FY P + +D GD++
Sbjct: 303 FKDFFYIISPGVVKVDYGDVT 323
>gi|56554527|pdb|1XHB|A Chain A, The Crystal Structure Of Udp-Galnac: Polypeptide Alpha-N-
Acetylgalactosaminyltransferase-T1
Length = 472
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 147/319 (46%), Positives = 206/319 (64%), Gaps = 9/319 (2%)
Query: 55 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 114
S I+ +R++PD+R+E CK YP +LP SV++VFHNE +S+L+RTVHS+I R+P +
Sbjct: 2 SEMIALNRSLPDVRLEGCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMI 61
Query: 115 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
EEI+LVDD S + L + LE Y+++ V +IR +R GLIR R +GA SRG+VI FL
Sbjct: 62 EEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVITFL 121
Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 234
DAHCE WL PLLA I DR+ + P+ID I T+E+ + D Y G F W + +
Sbjct: 122 DAHCECTAGWLEPLLARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNF 178
Query: 235 KENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSF 293
+ +P+RE +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF
Sbjct: 179 RWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISF 238
Query: 294 KIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAY 353
+IW CGG++E V CS +GHV+R PY F G +I N +R+ E W DE K +
Sbjct: 239 RIWQCGGTLEIVTCSHVGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNF 293
Query: 354 FYTREPLAMFLDMGDISEQ 372
FY P +D GDIS +
Sbjct: 294 FYIISPGVTKVDYGDISSR 312
>gi|71994065|ref|NP_001022876.1| Protein GLY-9, isoform a [Caenorhabditis elegans]
gi|51316113|sp|Q9U2C4.1|GALT9_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 9;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 9; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 9; Short=pp-GaNTase 9
gi|6018409|emb|CAB57897.1| Protein GLY-9, isoform a [Caenorhabditis elegans]
Length = 579
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 151/356 (42%), Positives = 223/356 (62%), Gaps = 13/356 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK--YWDYP 78
+EGPGE GK L G A + ++ MN+ S+ IS DR +PD R++ CK +DY
Sbjct: 72 REGPGEKGKPVVLTGKDAELGQADMKKWFMNVHASDKISLDRDVPDPRIQACKDIKYDYA 131
Query: 79 LDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ 138
LPK SVI++F +E ++ L+RTVHS+I R+P + L+E+IL+DD S + +L + L+++I+
Sbjct: 132 A-LPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQEVILLDDNSKRQELQEPLDEHIK 190
Query: 139 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
RF GKVRLIR R GLIR + GA+E+ G++IVFLD+HCE WL P++ I +R
Sbjct: 191 RFGGKVRLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEPIVQRISDERTA 250
Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPT 258
+ P+ID I T + + G F W + + L E E K+R ++ +SPT
Sbjct: 251 IVCPMIDSISDNTLAYHGDWSLS---TGGFSWALHFTWEGLSEEEQKRRTKPTDYIRSPT 307
Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
AGGL A +R +F E+GGYD + +WGGEN E+SF+ WMCGGSIE++PCS +GH++R+
Sbjct: 308 MAGGLLAANREYFFEVGGYDEEMDIWGGENLEISFRAWMCGGSIEFIPCSHVGHIFRAGH 367
Query: 319 PYNF-GKLADR-VKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
PYN G+ ++ V G N KR+ E W D+ + Y+ RE L D+GD++ +
Sbjct: 368 PYNMTGRNNNKDVHGT----NSKRLAEVWMDDYKRLYYMHREDLRT-KDVGDLTAR 418
>gi|156364641|ref|XP_001626455.1| predicted protein [Nematostella vectensis]
gi|156213331|gb|EDO34355.1| predicted protein [Nematostella vectensis]
Length = 512
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 148/325 (45%), Positives = 208/325 (64%), Gaps = 11/325 (3%)
Query: 48 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
+G N+ SN +S RTI D R E C+ YP +LP AS+++ F+NE ++ L+RT+HS++
Sbjct: 21 HGFNLLISNRLSLHRTIKDTRHELCRGKTYPKNLPVASIVICFYNEAWTILLRTIHSVLD 80
Query: 108 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 167
RTP Q+L EIILVDDFS+ +L KL+ Y+ K+R++RN +REGLIR R GA+ +
Sbjct: 81 RTPHQFLHEIILVDDFSNMLELKSKLDRYLSTMP-KIRIVRNNKREGLIRGRIIGAEAAT 139
Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
G+V+VFLD+HCEV +NWL PLL I+ D+K + PVID I T+E+ S RG
Sbjct: 140 GQVLVFLDSHCEVNINWLQPLLQHIHDDQKAVACPVIDVISSDTFEYSS----SPMVRGG 195
Query: 228 FEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
F WG+ + +P K + +P +SPT AGGLFA+DR +F +LG YD G+ +WG E
Sbjct: 196 FNWGLHFTWEPIPPSLLVKPEDYVKPIRSPTMAGGLFAVDREYFTQLGKYDSGMDIWGAE 255
Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
N E+SF+IWMCGGS++ +PCSR+GH++R F PY KG ++ N R+ E W D
Sbjct: 256 NLEISFRIWMCGGSLDILPCSRVGHLFRRFRPY-----GSDSKGDTMSRNSMRLAEVWLD 310
Query: 348 EKHKAYFYTREPLAMFLDMGDISEQ 372
+K YFY GDIS++
Sbjct: 311 -GYKKYFYQIRHDLEGKKFGDISQR 334
>gi|156392174|ref|XP_001635924.1| predicted protein [Nematostella vectensis]
gi|156223022|gb|EDO43861.1| predicted protein [Nematostella vectensis]
Length = 415
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 142/337 (42%), Positives = 218/337 (64%), Gaps = 12/337 (3%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
G+ G+A +P+ + + + N+ S+ +S R +PD R + CK YPL LPK+
Sbjct: 1 GDMGEAVSVPKRLKEKEEEGYELHSFNLVASDMMSLYRRLPDYRNDACKAKKYPLHLPKS 60
Query: 85 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 144
S+I+ FHNE +S+L+RTVHS+I RTP + LEEI+L+DD S++ +L +KLE+Y+ + V
Sbjct: 61 SIIICFHNEAWSTLLRTVHSVINRTPPRLLEEILLIDDASNRDELKEKLEEYVAKLK-VV 119
Query: 145 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 204
R+IR ++R+GLIR R +GA ++G ++ FLDAHCE WL PL A I + + +PVI
Sbjct: 120 RIIRLSKRQGLIRARLKGAAAAKGSILTFLDAHCECSKGWLEPLAAKIAENSSNVVMPVI 179
Query: 205 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLF 264
D I T+ + +V EP H RG+F W + + +P+ E ++RK ++ ++P AGGLF
Sbjct: 180 DEISDTTFYYHAVPEPFH--RGVFRWRLEFGWKPVPQYEMERRKDEADGIRTPVMAGGLF 237
Query: 265 AMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF-- 322
++D+ +F ++G YD G+ +WGGEN E+SF+IWMCGG+IE +PCSR+GHV+R PY+F
Sbjct: 238 SIDKNYFEKIGTYDTGMDIWGGENLEISFRIWMCGGAIEMLPCSRVGHVFRPRFPYSFPA 297
Query: 323 --GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
G D +++ N RV + W DE K ++ R
Sbjct: 298 RPGHNTD-----VVSNNLMRVADVWMDEYKKHFYNIR 329
>gi|402865473|ref|XP_003896947.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Papio
anubis]
Length = 608
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 166/378 (43%), Positives = 224/378 (59%), Gaps = 21/378 (5%)
Query: 2 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETS 55
P FKA+ ++ ++ + E P EG + E + D ++ NM S
Sbjct: 69 PQFKAN----KIDDVIDSHVEDPEEGHLKFSSELGMIFNERDQELRDLGYQKHAFNMLIS 124
Query: 56 NHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
N + + R +PD R CK YP DLP ASV++ F+NE FS+L+RTVHS+I RTPA L
Sbjct: 125 NRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLH 184
Query: 116 EIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R GA + GEV+VFL
Sbjct: 185 EIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFL 244
Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 234
D+HCEV + WL PLLA I DR + PVID I T Y RG F WG+ +
Sbjct: 245 DSHCEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHF 300
Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
K + +P E + + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+
Sbjct: 301 KWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFR 360
Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
IWMCGG + +PCSR+GH++R PY + D +T+N R+ W DE + YF
Sbjct: 361 IWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYF 415
Query: 355 YTREPLAMFLDMGDISEQ 372
R L G+ISE+
Sbjct: 416 SLRPDLKT-KSYGNISER 432
>gi|426358553|ref|XP_004046573.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Gorilla gorilla gorilla]
gi|426358555|ref|XP_004046574.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Gorilla gorilla gorilla]
Length = 608
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 157/332 (47%), Positives = 207/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS+I RTPA L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 171 VHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I DR + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 432
>gi|158300139|ref|XP_320141.4| AGAP012414-PA [Anopheles gambiae str. PEST]
gi|157013013|gb|EAA00190.4| AGAP012414-PA [Anopheles gambiae str. PEST]
Length = 596
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 155/356 (43%), Positives = 216/356 (60%), Gaps = 13/356 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GKA L ++ D + G N S+ IS +R++PD+R C+ Y
Sbjct: 82 EAKRSGIGEHGKAGQLDKSEHEMKDKLFKKNGFNAVLSDKISLNRSLPDIRHRGCRKKQY 141
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
+LP SV++ F+NE +S+L+RT S++ R+P + + EIILVDD S+K L Q+L++Y+
Sbjct: 142 LSELPTVSVVVPFYNEHWSTLLRTASSVLLRSPPELIAEIILVDDCSTKEFLKQQLDEYV 201
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
KV+++R ER GLI R GAK + +V++FLD+H E +NWLPPLL PI D +
Sbjct: 202 TENMPKVKVVRLPERSGLITARLAGAKIATADVLIFLDSHTEANVNWLPPLLEPIAEDYR 261
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
P ID ID+ T+E+R+ D RG F+W YK L R+ + +EP++SP
Sbjct: 262 TCVCPFIDVIDWDTFEYRA---QDEGARGAFDWKFFYKRLPLLPRDLQN---PTEPFESP 315
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+ FF E+GGYD GL +WGGE +ELSFKIW CGG + PCSR+GH+YR +
Sbjct: 316 VMAGGLFAISAKFFWEIGGYDEGLDIWGGEQYELSFKIWQCGGKMYDAPCSRVGHIYRGY 375
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMGDISEQ 372
P+ + D +T NYKRV E W DE +K Y Y R+ D+GDIS Q
Sbjct: 376 APFGNPRKKD-----FLTRNYKRVAEVWMDE-YKEYLYMRDRKKYENTDVGDISRQ 425
>gi|426358557|ref|XP_004046575.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
3 [Gorilla gorilla gorilla]
Length = 527
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 157/332 (47%), Positives = 207/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RT
Sbjct: 30 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRT 89
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS+I RTPA L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 90 VHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 149
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I DR + PVID I T Y
Sbjct: 150 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 205
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 206 SPVVRGGFNWGLHFKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSG 265
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 266 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 320
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 321 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 351
>gi|402592820|gb|EJW86747.1| hypothetical protein WUBG_02341 [Wuchereria bancrofti]
Length = 584
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 155/368 (42%), Positives = 229/368 (62%), Gaps = 12/368 (3%)
Query: 9 KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
+LG L L + GPGE G A + + + E ++ S+ IS +R +PD R
Sbjct: 70 ELGILLKSLNFERNGPGEMGSAVIIDPSQQEERTRKFKENQFDVMASDLISINRALPDYR 129
Query: 69 MEECKYWDYPLD---LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
+C+ D LP S+I+VFHNE +S+L+RT+HS+I R+P ++E+IL+DD S+
Sbjct: 130 SSKCREAARKYDVTSLPMVSIIIVFHNEAWSTLLRTIHSVINRSPLHLIKEVILIDDLSN 189
Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
+ L + L+ YI+RF+ LI ER GLIR R +GAK ++G+V++FLDAH EV WL
Sbjct: 190 RTYLRKPLDTYIKRFSLPFHLIHLPERSGLIRARLQGAKVAKGKVLLFLDAHVEVTEGWL 249
Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
PLL + +DRK + P+ID I + +E+ + D + G F W + ++ +P RE +
Sbjct: 250 EPLLDRVSTDRKRVVAPIIDVISDENFEY--ITASDVTWGG-FNWHLNFRWYPVPMREME 306
Query: 246 KRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
+R ++ S P ++PT AGGLFA+DR FF ++G YD G+ VWGGEN E+SF++WMCGGS+E
Sbjct: 307 RRNHDRSVPLQTPTIAGGLFAIDRQFFYDIGSYDEGMEVWGGENLEISFRVWMCGGSLEI 366
Query: 305 VPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
PCSR+GHV+R PY+F RV I +N R E W DE +K FY+ P A +
Sbjct: 367 HPCSRVGHVFRKHTPYSFPGGTARV----IHHNTARTAEVWMDE-YKDIFYSMVPAARNV 421
Query: 365 DMGDISEQ 372
D+GD++E+
Sbjct: 422 DVGDLTER 429
>gi|341889853|gb|EGT45788.1| hypothetical protein CAEBREN_10062 [Caenorhabditis brenneri]
Length = 597
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 164/384 (42%), Positives = 229/384 (59%), Gaps = 36/384 (9%)
Query: 11 GNLEPP---LEP----YKEG----PGEGGKAYHLPEAYRAAGDASLGEYGM-----NMET 54
GNL P ++P YK+G GE GKA + ++ + ++ + GM N
Sbjct: 96 GNLAKPKFMVDPNDPIYKKGDTSQAGELGKAVVVDKSKLTSEQKAIYDKGMLNNAFNQYA 155
Query: 55 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 114
S+ IS RT+P ECK Y +LP+ SVI+ FHNE +S L+RTVHS+++RTP L
Sbjct: 156 SDMISVHRTLPTNIDAECKTEKYNENLPRTSVIVCFHNEAWSVLLRTVHSVLERTPDHLL 215
Query: 115 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
EEI+LVDDFS + LE+Y+ +F GKV+++R +REGLIR R RGA + GEV+ +L
Sbjct: 216 EEIVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAIATGEVLTYL 275
Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIF 228
D+HCE W+ PLL I D + PVID ID T+E+ HH + G F
Sbjct: 276 DSHCECMEGWIEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGF 328
Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
+WG+ + + +PER+ K R +P +SPT AGGLF++D+ +F +LG YDPG +WGGEN
Sbjct: 329 DWGLQFNWHSIPERDRKNRTRAIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGEN 388
Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
ELSFKIWMCGG++E VPCS +GHV+R PY + R ++ N R+ E W D+
Sbjct: 389 LELSFKIWMCGGTLEIVPCSHVGHVFRKRSPYKW-----RTGVNVLKRNSIRLAEVWLDD 443
Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
+K Y+Y R D GD+S +
Sbjct: 444 -YKTYYYERIN-NQLGDFGDVSAR 465
>gi|170572320|ref|XP_001892064.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158602953|gb|EDP39125.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 576
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 171/378 (45%), Positives = 227/378 (60%), Gaps = 23/378 (6%)
Query: 5 KADGKLGNLEPPLEPYKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGMNMETS 55
K + L N + P+ YK G PGEGGKA L R D + N S
Sbjct: 6 KPNKALFNPDSPI--YKSGDENQPGEGGKAVVIDRNKLSLDERKIYDDGFTKNAFNQYIS 63
Query: 56 NHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
+ IS R++P EECK Y DLP SVI+ FHNE +S L+RTVHS+++RTP L
Sbjct: 64 DMISIHRSLPSYIDEECKNEKYTSDLPNTSVIICFHNEAWSVLLRTVHSVLERTPENLLA 123
Query: 116 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
E+ILVDDFS A L LE Y+++F+ KVR++R +REGLIR R RGA S+G VI +LD
Sbjct: 124 ELILVDDFSDMAHLKADLEIYMRQFS-KVRILRLEKREGLIRARIRGAAISKGSVITYLD 182
Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-GIFEWGMLY 234
+HCE W+ PLL I + K + PVID ID T+E+ Y + G F+W + +
Sbjct: 183 SHCECLEGWVEPLLDRIKRNPKTVVCPVIDVIDDNTFEYH--YSKAYFTNVGGFDWSLQF 240
Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
+ +PE++ K R+ + +P KSPT AGGLF++DR FF ELG YDPGL +WGGEN ELSFK
Sbjct: 241 NWHAIPEKDRKGRR-DIDPVKSPTMAGGLFSIDRTFFEELGSYDPGLDIWGGENLELSFK 299
Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
IWMCGG +E VPCS +GH++R PY + R ++ N R+ E W DE +K Y+
Sbjct: 300 IWMCGGILEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKRNSVRLAEVWMDE-YKKYY 353
Query: 355 YTREPLAMFLDMGDISEQ 372
Y R + D GD+S +
Sbjct: 354 YERINNNLG-DFGDVSSR 370
>gi|71993517|ref|NP_001022852.1| Protein GLY-5, isoform c [Caenorhabditis elegans]
gi|14530627|emb|CAC42369.1| Protein GLY-5, isoform c [Caenorhabditis elegans]
Length = 624
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 165/384 (42%), Positives = 225/384 (58%), Gaps = 36/384 (9%)
Query: 11 GNLEPP---LEP----YKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGMNMET 54
GNL P ++P YK+G GE GKA L +A D + N
Sbjct: 88 GNLAKPKFMVDPNDPIYKKGDAAQAGELGKAVVVDKTKLSTEEKAKYDKGMLNNAFNQYA 147
Query: 55 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 114
S+ IS RT+P ECK Y +LP+ SVI+ FHNE +S L+RTVHS+++RTP L
Sbjct: 148 SDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLL 207
Query: 115 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
EE++LVDDFS + LE+Y+ +F GKV+++R +REGLIR R RGA + GEV+ +L
Sbjct: 208 EEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYL 267
Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIF 228
D+HCE W+ PLL I D + PVID ID T+E+ HH + G F
Sbjct: 268 DSHCECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGF 320
Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
+WG+ + + +PER+ K R +P +SPT AGGLF++D+ +F +LG YDPG +WGGEN
Sbjct: 321 DWGLQFNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGEN 380
Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
ELSFKIWMCGG++E VPCS +GHV+R PY + R ++ N R+ E W D+
Sbjct: 381 LELSFKIWMCGGTLEIVPCSHVGHVFRKRSPYKW-----RTGVNVLKRNSIRLAEVWLDD 435
Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
+K Y+Y R D GDIS +
Sbjct: 436 -YKTYYYERIN-NQLGDFGDISSR 457
>gi|410905319|ref|XP_003966139.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Takifugu rubripes]
Length = 557
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 155/360 (43%), Positives = 216/360 (60%), Gaps = 9/360 (2%)
Query: 14 EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
E L ++GPGEGGK +P+ + N+ S I+ +R++PD+R+E CK
Sbjct: 46 EDTLTRPRDGPGEGGKPVVIPKENQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCK 105
Query: 74 YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKL 133
YP +LP+ SV++VFHNE +S+L+RTVHS+I R+P LEEIILVDD S + L + L
Sbjct: 106 NKLYPDNLPRTSVVIVFHNEAWSTLLRTVHSVIDRSPHTLLEEIILVDDASERDFLKRPL 165
Query: 134 EDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIY 193
E Y++R VR++R +R GLIR R +GA S G+VI FLDAHCE WL PLLA I
Sbjct: 166 EQYVRRLEVPVRVVRMDQRSGLIRARLKGASLSTGQVITFLDAHCECTTGWLEPLLARIK 225
Query: 194 SDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SE 252
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 226 KDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTL 282
Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
P + AGG R +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +GH
Sbjct: 283 PVRWVRCAGGXXXXXRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 342
Query: 313 VYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
V+R PY F G +I N +R+ E W DE K +FY P +D GDI+ +
Sbjct: 343 VFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDIATR 397
>gi|71993511|ref|NP_001022850.1| Protein GLY-5, isoform a [Caenorhabditis elegans]
gi|51316068|sp|Q95ZJ1.2|GALT5_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
Short=pp-GaNTase 5; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 5; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|5824785|emb|CAB54435.1| Protein GLY-5, isoform a [Caenorhabditis elegans]
Length = 626
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 165/384 (42%), Positives = 225/384 (58%), Gaps = 36/384 (9%)
Query: 11 GNLEPP---LEP----YKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGMNMET 54
GNL P ++P YK+G GE GKA L +A D + N
Sbjct: 88 GNLAKPKFMVDPNDPIYKKGDAAQAGELGKAVVVDKTKLSTEEKAKYDKGMLNNAFNQYA 147
Query: 55 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 114
S+ IS RT+P ECK Y +LP+ SVI+ FHNE +S L+RTVHS+++RTP L
Sbjct: 148 SDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLL 207
Query: 115 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
EE++LVDDFS + LE+Y+ +F GKV+++R +REGLIR R RGA + GEV+ +L
Sbjct: 208 EEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYL 267
Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIF 228
D+HCE W+ PLL I D + PVID ID T+E+ HH + G F
Sbjct: 268 DSHCECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGF 320
Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
+WG+ + + +PER+ K R +P +SPT AGGLF++D+ +F +LG YDPG +WGGEN
Sbjct: 321 DWGLQFNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGEN 380
Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
ELSFKIWMCGG++E VPCS +GHV+R PY + R ++ N R+ E W D+
Sbjct: 381 LELSFKIWMCGGTLEIVPCSHVGHVFRKRSPYKW-----RTGVNVLKRNSIRLAEVWLDD 435
Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
+K Y+Y R D GDIS +
Sbjct: 436 -YKTYYYERIN-NQLGDFGDISSR 457
>gi|332870119|ref|XP_003318977.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Pan
troglodytes]
Length = 527
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 156/332 (46%), Positives = 207/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R CK YP DLP AS+++ F+NE FS+L+RT
Sbjct: 30 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPDLPAASIVICFYNEAFSALLRT 89
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS+I RTPA L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 90 VHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 149
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I DR + PVID I T Y
Sbjct: 150 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 205
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 206 SPVVRGGFNWGLHFKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSG 265
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 266 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 320
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 321 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 351
>gi|3047195|gb|AAC13673.1| GLY5c [Caenorhabditis elegans]
Length = 624
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 165/384 (42%), Positives = 225/384 (58%), Gaps = 36/384 (9%)
Query: 11 GNLEPP---LEP----YKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGMNMET 54
GNL P ++P YK+G GE GKA L +A D + N
Sbjct: 88 GNLAKPKFMVDPNDPIYKKGDAAQAGELGKAVVVDKTKLSTEEKAKYDKGMLNNAFNQYA 147
Query: 55 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 114
S+ IS RT+P ECK Y +LP+ SVI+ FHNE +S L+RTVHS+++RTP L
Sbjct: 148 SDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLL 207
Query: 115 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
EE++LVDDFS + LE+Y+ +F GKV+++R +REGLIR R RGA + GEV+ +L
Sbjct: 208 EEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYL 267
Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIF 228
D+HCE W+ PLL I D + PVID ID T+E+ HH + G F
Sbjct: 268 DSHCECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGF 320
Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
+WG+ + + +PER+ K R +P +SPT AGGLF++D+ +F +LG YDPG +WGGEN
Sbjct: 321 DWGLQFNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGEN 380
Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
ELSFKIWMCGG++E VPCS +GHV+R PY + R ++ N R+ E W D+
Sbjct: 381 LELSFKIWMCGGTLEIVPCSHVGHVFRKRSPYKW-----RTGVNVLKRNSIRLAEVWLDD 435
Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
+K Y+Y R D GDIS +
Sbjct: 436 -YKTYYYERIN-NQLGDFGDISSR 457
>gi|291243604|ref|XP_002741691.1| PREDICTED: Polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 565
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 155/353 (43%), Positives = 217/353 (61%), Gaps = 11/353 (3%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYP--LD 80
PGE GK + N+ SN IS +R++PD+RM+ CK YP
Sbjct: 61 APGEMGKGVVIAPEEEELKKEMFKINQFNLLASNKISVNRSLPDVRMDGCKKKTYPPHNT 120
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LPK S+++VFHNE +S+L+R VHSII R+P LEEIILVDD S + L ++LEDY+++
Sbjct: 121 LPKTSIVIVFHNEAWSTLIRNVHSIINRSPRMLLEEIILVDDASERDFLGKELEDYVKKL 180
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
+VR+ R +R GLIR R RGA S GEVI FLDAHCE WL PL+A I DR +
Sbjct: 181 PVRVRVERMDKRSGLIRARLRGAGVSTGEVITFLDAHCECTQGWLEPLMARIAEDRSRVV 240
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
P+ID I +T+EF + D Y G F W + ++ +P+RE +RK + + P +PT
Sbjct: 241 CPIIDVISDETFEFHA--GSDMTYGG-FNWKLNFRWYSVPKREMDRRKGDRTIPLNTPTM 297
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLFA+ + +F E+G YD G+ +WGGEN E+SF+IWMCGG++E V CS +GHV+R P
Sbjct: 298 AGGLFAIHKDYFEEIGTYDAGMDIWGGENLEMSFRIWMCGGTLEIVTCSHVGHVFRKTTP 357
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
Y+F G +I N +R+ E W D+ +K +FY P + + GD++ +
Sbjct: 358 YSFPGGT----GAIINKNNRRLAEVWMDD-YKTFFYKISPGSKKSEYGDVTNR 405
>gi|324507488|gb|ADY43175.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Ascaris suum]
Length = 632
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 164/377 (43%), Positives = 223/377 (59%), Gaps = 34/377 (9%)
Query: 12 NLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGD-----ASLGEYGMNMETSNHISFDR 62
N + P+ YK+G GEGGK + + +A + A N S+ IS R
Sbjct: 105 NADSPI--YKKGDKNQAGEGGKPVKINQEQLSAQEREKYAAGFRNNAFNQYVSDMISIHR 162
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++P EECK Y DLP SVI+ FHNE +S L+RTVHS+I+RTP L E+ILVDD
Sbjct: 163 SLPSTIDEECKTEKYLDDLPSTSVIICFHNEAWSVLLRTVHSVIERTPEHLLTEVILVDD 222
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
FS L + LE+Y+ KVR++R +REGLIR R +GA S+G V+ FLD+HCE
Sbjct: 223 FSDMDHLKKPLEEYMSALK-KVRIVRMDKREGLIRARLKGAAVSKGAVVTFLDSHCECME 281
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYK 235
W+ PLL I + + PVID ID +T+E+ HY G F+W + +
Sbjct: 282 GWIEPLLDRIKRNSSTVVCPVIDVIDDETFEY--------HYSKAYFTNVGGFDWSLQFN 333
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
+ +PER+ K RK + +P +SPT AGGLF++DRA+F +LG YDPG +WGGEN ELSFKI
Sbjct: 334 WHAIPERDRKNRKRHIDPVRSPTMAGGLFSIDRAYFEKLGTYDPGFDIWGGENLELSFKI 393
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
WMCGG++E VPCS +GHV+R PY + R ++ N R+ E W DE +K Y+Y
Sbjct: 394 WMCGGTLEIVPCSHVGHVFRKRSPYKW-----RTGVNVLKKNSVRLAEVWLDE-YKVYYY 447
Query: 356 TREPLAMFLDMGDISEQ 372
R D GD+S++
Sbjct: 448 ERIN-NQTGDYGDVSDR 463
>gi|193784963|dbj|BAG54116.1| unnamed protein product [Homo sapiens]
Length = 608
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 156/332 (46%), Positives = 208/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM S+ + + R +PD R CK YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISDRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS+I RTPA L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 171 VHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I DR + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELGRAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 432
>gi|153792095|ref|NP_071370.2| polypeptide N-acetylgalactosaminyltransferase 11 [Homo sapiens]
gi|51316030|sp|Q8NCW6.2|GLT11_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
AltName: Full=Polypeptide GalNAc transferase 11;
Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 11;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 11
gi|5630076|gb|AAD45821.1|AC006017_1 N-acetylgalactosaminyltransferase; similar to Q10473 (PID:g1709559)
[Homo sapiens]
gi|51105934|gb|EAL24518.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Homo
sapiens]
gi|119574361|gb|EAW53976.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
isoform CRA_b [Homo sapiens]
gi|189442406|gb|AAI67834.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[synthetic construct]
gi|345500003|emb|CAC79625.3| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 608
Score = 291 bits (746), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 156/332 (46%), Positives = 208/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM S+ + + R +PD R CK YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISDRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS+I RTPA L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 171 VHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I DR + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELGRAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 432
>gi|10437774|dbj|BAB15105.1| unnamed protein product [Homo sapiens]
Length = 608
Score = 291 bits (746), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 156/332 (46%), Positives = 208/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM S+ + + R +PD R CK YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISDRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS+I RTPA L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 171 VHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I DR + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELGRAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 432
>gi|3047193|gb|AAC13672.1| GLY5b [Caenorhabditis elegans]
Length = 626
Score = 291 bits (746), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 165/384 (42%), Positives = 225/384 (58%), Gaps = 36/384 (9%)
Query: 11 GNLEPP---LEP----YKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGMNMET 54
GNL P ++P YK+G GE GKA L +A D + N
Sbjct: 88 GNLAKPKFMVDPNDPIYKKGDAAQAGELGKAVVVDKTKLSTEEKAKYDKGMLNNAFNQYA 147
Query: 55 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 114
S+ IS RT+P ECK Y +LP+ SVI+ FHNE +S L+RTVHS+++RTP L
Sbjct: 148 SDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLL 207
Query: 115 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
EE++LVDDFS + LE+Y+ +F GKV+++R +REGLIR R RGA + GEV+ +L
Sbjct: 208 EEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYL 267
Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIF 228
D+HCE W+ PLL I D + PVID ID T+E+ HH + G F
Sbjct: 268 DSHCECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGF 320
Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
+WG+ + + +PER+ K R +P +SPT AGGLF++D+ +F +LG YDPG +WGGEN
Sbjct: 321 DWGLQFNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGEN 380
Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
ELSFKIWMCGG++E VPCS +GHV+R PY + R ++ N R+ E W D+
Sbjct: 381 LELSFKIWMCGGTLEIVPCSHVGHVFRKRSPYKW-----RTGVNVLKRNSIRLAEVWLDD 435
Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
+K Y+Y R D GDIS +
Sbjct: 436 -YKTYYYERIN-NQLGDFGDISSR 457
>gi|114616856|ref|XP_001143140.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
3 [Pan troglodytes]
gi|114616860|ref|XP_001143304.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
4 [Pan troglodytes]
gi|410221964|gb|JAA08201.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410256658|gb|JAA16296.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410301646|gb|JAA29423.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410301648|gb|JAA29424.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410348810|gb|JAA41009.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
Length = 608
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 156/332 (46%), Positives = 207/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R CK YP DLP AS+++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPDLPAASIVICFYNEAFSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS+I RTPA L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 171 VHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I DR + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 432
>gi|397469939|ref|XP_003806595.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Pan paniscus]
gi|397469941|ref|XP_003806596.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Pan paniscus]
Length = 608
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 165/378 (43%), Positives = 224/378 (59%), Gaps = 21/378 (5%)
Query: 2 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETS 55
P FKA+ ++ ++ + E P EG + E + D ++ NM S
Sbjct: 69 PQFKAN----KIDDVIDSHVEDPEEGHLKFSSELGMIFNERDQELRDLGYQKHAFNMLIS 124
Query: 56 NHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
N + + R +PD R CK YP DLP AS+++ F+NE FS+L+RTVHS+I RTPA L
Sbjct: 125 NRLGYHRDVPDTRNAACKEKFYPPDLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLH 184
Query: 116 EIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R GA + GEV+VFL
Sbjct: 185 EIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFL 244
Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 234
D+HCEV + WL PLLA I DR + PVID I T Y RG F WG+ +
Sbjct: 245 DSHCEVNVMWLQPLLATIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHF 300
Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
K + +P E + + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+
Sbjct: 301 KWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFR 360
Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
IWMCGG + +PCSR+GH++R PY + D +T+N R+ W DE + YF
Sbjct: 361 IWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYF 415
Query: 355 YTREPLAMFLDMGDISEQ 372
R L G+ISE+
Sbjct: 416 SLRPDLKT-KSYGNISER 432
>gi|116284114|gb|AAH38440.1| GALNT1 protein [Homo sapiens]
Length = 499
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 146/323 (45%), Positives = 208/323 (64%), Gaps = 9/323 (2%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N+ S I+ +R++PD+R+E CK YP +LP SV++VFHNE +S+L+RTVHS+I R+P
Sbjct: 25 NLMASEMIALNRSLPDVRLEGCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSP 84
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
+EEI+LVDD S + L + LE Y+++ V +IR +R GLIR R +GA S+G+V
Sbjct: 85 RHMIEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQV 144
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
I FLDAHCE + WL PLLA I DR+ + P+ID I T+E+ + D Y G F W
Sbjct: 145 ITFLDAHCECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNW 201
Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+ ++ +P+RE +RK + + P ++PT AGGLF++D +F E+G YD G+ +WGGEN
Sbjct: 202 KLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDIDYFQEIGTYDAGMDIWGGENL 261
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF+IW CGG++E V CS +GHV+R PY F G +I N +R+ E W DE
Sbjct: 262 EISFRIWQCGGTLEIVTCSHVGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE- 316
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
K +FY P +D GDIS +
Sbjct: 317 FKNFFYIISPGVTKVDYGDISSR 339
>gi|71993513|ref|NP_001022851.1| Protein GLY-5, isoform b [Caenorhabditis elegans]
gi|14530626|emb|CAC42368.1| Protein GLY-5, isoform b [Caenorhabditis elegans]
Length = 623
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 165/384 (42%), Positives = 225/384 (58%), Gaps = 36/384 (9%)
Query: 11 GNLEPP---LEP----YKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGMNMET 54
GNL P ++P YK+G GE GKA L +A D + N
Sbjct: 88 GNLAKPKFMVDPNDPIYKKGDAAQAGELGKAVVVDKTKLSTEEKAKYDKGMLNNAFNQYA 147
Query: 55 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 114
S+ IS RT+P ECK Y +LP+ SVI+ FHNE +S L+RTVHS+++RTP L
Sbjct: 148 SDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLL 207
Query: 115 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
EE++LVDDFS + LE+Y+ +F GKV+++R +REGLIR R RGA + GEV+ +L
Sbjct: 208 EEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYL 267
Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIF 228
D+HCE W+ PLL I D + PVID ID T+E+ HH + G F
Sbjct: 268 DSHCECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGF 320
Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
+WG+ + + +PER+ K R +P +SPT AGGLF++D+ +F +LG YDPG +WGGEN
Sbjct: 321 DWGLQFNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGEN 380
Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
ELSFKIWMCGG++E VPCS +GHV+R PY + R ++ N R+ E W D+
Sbjct: 381 LELSFKIWMCGGTLEIVPCSHVGHVFRKRSPYKW-----RTGVNVLKRNSIRLAEVWLDD 435
Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
+K Y+Y R D GDIS +
Sbjct: 436 -YKTYYYERIN-NQLGDFGDISSR 457
>gi|449270901|gb|EMC81545.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Columba livia]
Length = 608
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 159/370 (42%), Positives = 220/370 (59%), Gaps = 14/370 (3%)
Query: 5 KADGKLGN-LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRT 63
K LGN ++ P++ E E G ++ E + D ++ NM SN + + R
Sbjct: 75 KIGNALGNHVQDPVKGEVEFSPEMGMIFN--EEDQEVRDLGYQKHAFNMLISNRLGYHRE 132
Query: 64 IPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 123
+PD R +C+ YP DLP ASVI+ F+NE S+L+RTVHS++ RTPA L EIILVDD
Sbjct: 133 VPDTRDVKCREKSYPSDLPSASVIICFYNEALSALLRTVHSVLDRTPAHLLHEIILVDDN 192
Query: 124 SSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S ADL + L++Y++ + +L+RN +REGLIR R GA + G+V+VFLD+HCEV
Sbjct: 193 SELADLKKDLDEYVKTQLPKTTKLVRNEKREGLIRGRMIGASHATGQVLVFLDSHCEVNE 252
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLL PI DR+ + PVID I T Y RG F WG+ +K + +P
Sbjct: 253 MWLQPLLTPIREDRRTVVCPVIDIISADTL----TYSSSPVVRGGFNWGLHFKWDLVPLS 308
Query: 243 EAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSI 302
E + + + P KSPT AGGLFAMDR +F ELG YD G+ +WGGEN E+SF+IWMCGG +
Sbjct: 309 ELEGPEGATAPIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISFRIWMCGGRL 368
Query: 303 EWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM 362
+PCSR+GH++R PY D + +N R+ W DE + YF R L M
Sbjct: 369 LIIPCSRVGHIFRKRRPYGSPGGQD-----TMAHNSLRLAHVWMDEYKEQYFALRPELRM 423
Query: 363 FLDMGDISEQ 372
+ G+I+++
Sbjct: 424 -RNYGNITDR 432
>gi|313227425|emb|CBY22572.1| unnamed protein product [Oikopleura dioica]
Length = 588
Score = 291 bits (745), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 153/354 (43%), Positives = 215/354 (60%), Gaps = 11/354 (3%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL-- 79
+GPGE G +P+ E N+ SN IS +RT+ D+RM CK DY
Sbjct: 82 KGPGEMGAPVKIPKDKEKESKKMFQENQFNLMASNMISLNRTLKDVRMSGCKKHDYANLG 141
Query: 80 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
LPK S+I VFHNE +S+L+R++HS+I R+P + LEEIILVDD S K L ++L+DY++
Sbjct: 142 ALPKTSIIFVFHNEAWSTLLRSIHSVINRSPREMLEEIILVDDKSEKDFLGKQLDDYVKN 201
Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
V +IR REGLIR R GAK ++GEV+ FLDAH E WL PLL I DR +
Sbjct: 202 LPVPVHIIRQQHREGLIRARLEGAKIAKGEVLTFLDAHIEASPGWLEPLLYEIKKDRTNV 261
Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPT 258
P+ID I T+EF + D Y G F W + ++ +P+RE +R + S P ++PT
Sbjct: 262 ICPIIDVISDDTFEF--LTGSDLTYGG-FNWKLNFRWYPVPQREVDRRGGDRSLPMQTPT 318
Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
AGGLF++D+++F E+G YD G+ +WGGEN E+SF+IWMCGG++ CS +GHV+R
Sbjct: 319 MAGGLFSIDKSYFYEIGSYDSGMDIWGGENLEMSFRIWMCGGTVLIATCSHVGHVFRKAT 378
Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
PY F ++ I N +R+ E W D+ +K +FY P M GD+S++
Sbjct: 379 PYTFPGGTSQI----INKNNRRLAEVWMDD-YKKFFYIVNPTVMKHKYGDVSDR 427
>gi|3047191|gb|AAC13671.1| GLY5a [Caenorhabditis elegans]
Length = 623
Score = 291 bits (745), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 165/384 (42%), Positives = 225/384 (58%), Gaps = 36/384 (9%)
Query: 11 GNLEPP---LEP----YKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGMNMET 54
GNL P ++P YK+G GE GKA L +A D + N
Sbjct: 88 GNLAKPKFMVDPNDPIYKKGDAAQAGELGKAVVVDKTKLSTEEKAKYDKGMLNNAFNQYA 147
Query: 55 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 114
S+ IS RT+P ECK Y +LP+ SVI+ FHNE +S L+RTVHS+++RTP L
Sbjct: 148 SDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLL 207
Query: 115 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
EE++LVDDFS + LE+Y+ +F GKV+++R +REGLIR R RGA + GEV+ +L
Sbjct: 208 EEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYL 267
Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIF 228
D+HCE W+ PLL I D + PVID ID T+E+ HH + G F
Sbjct: 268 DSHCECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGF 320
Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
+WG+ + + +PER+ K R +P +SPT AGGLF++D+ +F +LG YDPG +WGGEN
Sbjct: 321 DWGLQFNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGEN 380
Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
ELSFKIWMCGG++E VPCS +GHV+R PY + R ++ N R+ E W D+
Sbjct: 381 LELSFKIWMCGGTLEIVPCSHVGHVFRKRSPYKW-----RTGVNVLKRNSIRLAEVWLDD 435
Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
+K Y+Y R D GDIS +
Sbjct: 436 -YKTYYYERIN-NQLGDFGDISSR 457
>gi|291230378|ref|XP_002735140.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 621
Score = 291 bits (745), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 153/355 (43%), Positives = 216/355 (60%), Gaps = 8/355 (2%)
Query: 19 PYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYP 78
P + GPGE GKA + ++ + N+ SN IS DR++ D R + C Y
Sbjct: 96 PLRVGPGEMGKAVTVAKSEEEEMEKMFKVNYFNLMISNRISNDRSLADYRPQGCFAKKYS 155
Query: 79 LDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ 138
+LPK SVILV+HNE +S LMRTVHS+I R+P LEEI+L+DD S++ L + L+DYI
Sbjct: 156 RNLPKTSVILVYHNEAWSVLMRTVHSVINRSPRHLLEEILLIDDASTREYLGRPLDDYIT 215
Query: 139 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
+ VR+ ER GLI R +GA+ ++ V+ FLD+HCE WL PLL I ++R
Sbjct: 216 KLPVPVRVHHAKERRGLIGARLKGAELAKAPVLTFLDSHCECSKGWLEPLLDRIAANRST 275
Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSP 257
+ PVI+ ID +++ F + E H G F+W +++ +P+ E + + SEP +SP
Sbjct: 276 VVCPVINQIDDRSFAFVNATEVSH--IGGFDWNIIFNWYNIPQSEKDRIGGDKSEPVRSP 333
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
T AGGLF++D+++F ELG YDP WGGEN ELS KIWMCGG +E+VPCS +GHV+R
Sbjct: 334 TMAGGLFSIDKSYFEELGSYDPEFEFWGGENIELSLKIWMCGGILEFVPCSHVGHVFRKH 393
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
P+ + V G N +R+ E W DE +K FY +P M +D GDIS++
Sbjct: 394 NPHKYKNTTYNVVG----RNNRRLAEVWLDE-YKYLFYANQPETMKIDPGDISQR 443
>gi|332243650|ref|XP_003270991.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Nomascus leucogenys]
Length = 527
Score = 291 bits (744), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 158/350 (45%), Positives = 212/350 (60%), Gaps = 11/350 (3%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
PG G + E + D ++ NM SN + + R +PD R C+ YP DLP
Sbjct: 12 PGCGQRGMIFNERDQELRDLGYQKHAFNMLISNRLGYHRDVPDTRNAACQEKFYPPDLPS 71
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NG 142
ASV++ F+NE FS+L+RT HS+I RTPA L EIILVDD S DL +L++Y+Q++ G
Sbjct: 72 ASVVICFYNEAFSALLRTAHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPG 131
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
K+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL PLLA I D+ + P
Sbjct: 132 KIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDQHTVVCP 191
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
VID I T Y RG F WG+ +K + +P E + + P KSPT AGG
Sbjct: 192 VIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGGAEGATAPIKSPTMAGG 247
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY
Sbjct: 248 LFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGS 307
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ D +T+N R+ W DE + YF R L G+ISE+
Sbjct: 308 PEGQD-----TMTHNSLRLAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 351
>gi|55742075|ref|NP_001006904.1| polypeptide N-acetylgalactosaminyltransferase 11 [Xenopus
(Silurana) tropicalis]
gi|49522064|gb|AAH75106.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[Xenopus (Silurana) tropicalis]
Length = 563
Score = 291 bits (744), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 152/329 (46%), Positives = 201/329 (61%), Gaps = 11/329 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ N+ SN + + R +PD R +C YP DLP AS+++ F+NE FS+L+RT
Sbjct: 66 DVGYQKHAFNLLISNRLGYHRDVPDTRDSKCAKKTYPPDLPMASIVICFYNEAFSALLRT 125
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRS 160
VHS++ RTPAQ L EIILVDD S DL + L+ Y+Q + KV+L+RN +REGLIR R
Sbjct: 126 VHSVLDRTPAQLLHEIILVDDNSELDDLKKDLDGYMQENLSKKVKLVRNKQREGLIRGRM 185
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + G+V+VFLD+HCEV WL PLLAPI + + + PVID I T +Y
Sbjct: 186 VGASHATGDVLVFLDSHCEVNEMWLQPLLAPIKENPRTVVCPVIDIISADTL----IYSS 241
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + S P++SPT AGGLFAMDR +F LG YD G
Sbjct: 242 SPVVRGGFNWGLHFKWDPVPLAELGGPEGFSAPFRSPTMAGGLFAMDREYFNMLGQYDSG 301
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGGS+ VPCSR+GH++R PY D + +N R
Sbjct: 302 MDIWGGENLEISFRIWMCGGSLLIVPCSRVGHIFRKRRPYGSPGGHD-----TMAHNSLR 356
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+ W DE YF R P D GDI
Sbjct: 357 LAHVWMDEYKDQYFALR-PELRNRDFGDI 384
>gi|444724231|gb|ELW64842.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Tupaia chinensis]
Length = 654
Score = 290 bits (743), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 154/332 (46%), Positives = 207/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ N+ SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNVLISNRLGYHRDVPDTRSAACKGKSYPADLPVASVVICFYNEAFSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS+I RTPA+ L E+ILVDD S DL +L++Y+Q++ GK+++IRN +REGLIR R
Sbjct: 171 VHSVIDRTPARLLHEVILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNKKREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I DR+ + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRRTVVCPVIDIISADTL----AYSS 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPAVRGGFNWGLHFKWDLVPLSELAGAGGATAPIKSPTMAGGLFAMNRQYFSELGQYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGQLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-RSYGNISER 432
>gi|432950788|ref|XP_004084611.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 11-like [Oryzias
latipes]
Length = 574
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 151/339 (44%), Positives = 203/339 (59%), Gaps = 11/339 (3%)
Query: 35 EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEG 94
EA + DA + N+ SN + R +PD R ++C+ YP LP ASV++ F NE
Sbjct: 71 EADQEVRDAGYHRHAFNVLISNRLGSHRELPDTRDKQCRKRSYPQALPSASVVICFFNEA 130
Query: 95 FSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTERE 153
S+L+RTVHS++ RTPA L EIILVDD S +L + L+ + + GKVRL+RN +RE
Sbjct: 131 LSALLRTVHSVLDRTPAYLLHEIILVDDQSELEELKEGLDRCVREELQGKVRLVRNRKRE 190
Query: 154 GLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWE 213
GLIR R GA + G+V+VFLD+HCEV +WL PLLAPI DR+ + P+ID I T
Sbjct: 191 GLIRGRMIGAAHATGDVLVFLDSHCEVNQDWLQPLLAPIQKDRRTVVCPIIDIISADTL- 249
Query: 214 FRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLE 273
Y RG F WG+ +K + +P E + + P +SPT AGGLFAM+R +F E
Sbjct: 250 ---TYSSSPIVRGGFNWGLHFKWDPVPPSEISGPEGAAGPIRSPTMAGGLFAMNREYFNE 306
Query: 274 LGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPL 333
LG YDPG+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY D
Sbjct: 307 LGRYDPGMDIWGGENLEISFRIWMCGGQLLIIPCSRVGHIFRKRRPYGSPGGQD-----T 361
Query: 334 ITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ +N R+ W DE + Y R P GDISE+
Sbjct: 362 MAHNSLRLAHVWMDEYKEQYLSLR-PELRNRSYGDISER 399
>gi|268576200|ref|XP_002643080.1| C. briggsae CBR-GLY-5 protein [Caenorhabditis briggsae]
Length = 630
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 164/382 (42%), Positives = 225/382 (58%), Gaps = 36/382 (9%)
Query: 11 GNLEPP---LEP----YKEG----PGEGGKAYHLPEAYRAAGDASLGEYGM-----NMET 54
GNL P ++P YK+G GE GKA + + + + GM N
Sbjct: 92 GNLAKPKFMVDPNDPIYKKGDASQAGELGKAVIVDKTKLTPEQKGIYDKGMLNNAFNQYA 151
Query: 55 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 114
S+ IS RT+P ECK Y +LP+ SVI+ FHNE +S L+RTVHS+++RTP L
Sbjct: 152 SDMISVHRTLPTNIDAECKTEKYNENLPRTSVIVCFHNEAWSVLLRTVHSVLERTPEHLL 211
Query: 115 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
EEI+LVDDFS + LE+Y+ +F GKV+++R +REGLIR R RGA + GEV+ +L
Sbjct: 212 EEIVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAIATGEVLTYL 271
Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIF 228
D+HCE W+ PLL I D + PVID ID T+E+ HH + G F
Sbjct: 272 DSHCECMEGWIEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGF 324
Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
+WG+ + + +PER+ K R +P +SPT AGGLF++D+ +F +LG YDPG +WGGEN
Sbjct: 325 DWGLQFNWHSIPERDRKNRTRAIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGEN 384
Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
ELSFKIWMCGG++E VPCS +GHV+R PY + R ++ N R+ E W D+
Sbjct: 385 LELSFKIWMCGGTLEIVPCSHVGHVFRKRSPYKW-----RTGVNVLKRNSIRLAEVWLDD 439
Query: 349 KHKAYFYTREPLAMFLDMGDIS 370
+K Y+Y R D GD+S
Sbjct: 440 -YKTYYYERIN-NQLGDFGDVS 459
>gi|291397404|ref|XP_002715111.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Oryctolagus cuniculus]
Length = 608
Score = 290 bits (742), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 155/332 (46%), Positives = 207/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ N+ SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNLLISNRLGYHRDVPDTRNAACKDKSYPADLPVASVVICFYNEAFSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS++ RTPA L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 171 VHSVLDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I DR + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSEQGGAEGATAPIKSPTMAGGLFAMNRLYFNELGQYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 432
>gi|327274386|ref|XP_003221958.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Anolis carolinensis]
Length = 608
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 151/332 (45%), Positives = 203/332 (61%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R +CK YPLDLP AS+I+ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRDAKCKGKKYPLDLPSASIIICFYNEAFSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRS 160
VHS++ RTP+ L EIILVDD S DL + L+ Y+++ V+L+RN +REGLIR R
Sbjct: 171 VHSVLDRTPSHLLHEIILVDDNSELVDLKEDLDVYLRKNLPNNVKLVRNGKREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + G+V+VFLD+HCEV WL PLL PI RK + PVID I T Y
Sbjct: 231 IGASHATGKVLVFLDSHCEVNELWLQPLLTPIRESRKTVVCPVIDIISADTL----TYSS 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + + P KSPT AGGLFAMDR +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELEGPEGATAPIKSPTMAGGLFAMDREYFNELGQYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY D + +N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLLIIPCSRVGHIFRKRRPYGSPGGQD-----TMAHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE YF R L M + G+I+++
Sbjct: 402 LAHVWMDEYKDQYFALRPELRM-RNYGNITDR 432
>gi|391343213|ref|XP_003745907.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Metaseiulus occidentalis]
Length = 583
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 149/348 (42%), Positives = 212/348 (60%), Gaps = 9/348 (2%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
PGE G+ +PE A + N+ S I+ +R++PD+R+ EC+ YP LP
Sbjct: 78 PGENGEGVEIPEKETALKNEKFKINQFNLLASERIALNRSLPDVRLAECRKKTYPDRLPT 137
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
S+++VFHNE +++L+RTVHSII+ +P + + EIILVDD S L QKLEDY+ +
Sbjct: 138 TSIVIVFHNEAWTTLLRTVHSIIQMSPRELIAEIILVDDASEFDHLGQKLEDYVAKLPVP 197
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
V ++R +R GLIR R GA+ G+VI FLDAHCE WL PLLA I D + PV
Sbjct: 198 VHVLRTGKRSGLIRARLIGAETVTGQVITFLDAHCECTEGWLEPLLARIAEDNTRVVCPV 257
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
ID I + F V D + G F W + ++ +P+RE +R + + P ++PT AGG
Sbjct: 258 IDVISDEN--FAYVPASDQTWGG-FNWKLNFRWYRVPQRENDRRGGDRTLPVRTPTMAGG 314
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LFAMD+A+F +LG YD G+ +WGGEN E+SF+IWMCGG++E V CS +GHV+R PY F
Sbjct: 315 LFAMDKAYFEKLGKYDEGMDIWGGENLEMSFRIWMCGGTLEIVTCSHVGHVFRKSTPYTF 374
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
G ++ +N R+ + W DE K +++ P+A +D GD S
Sbjct: 375 PGGT----GKIVNHNNARLADVWLDE-WKDFYFAINPVAKKVDRGDTS 417
>gi|256083753|ref|XP_002578103.1| peptidase [Schistosoma mansoni]
Length = 1860
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 156/354 (44%), Positives = 220/354 (62%), Gaps = 13/354 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
++GPGE G L + +A +L G N+ S I DR++ D+R CK Y
Sbjct: 2 RQGPGENGLPVRLSNSQKALSKKTLNFNGFNIFVSEKIKTDRSVKDIRYPNCKGALYSKQ 61
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP S+I+ + E + +L+RTV S++ R+P + ++E+ILVDD SS+ L ++L++Y+ R
Sbjct: 62 LPLVSIIIPVYEEHWETLIRTVVSVLNRSPLELIKEVILVDDGSSRRYLKERLDNYLSRT 121
Query: 141 --NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
G V +I EREGLIR R GAK + G+V++FLD+HCE +NWLPPLL PI + +
Sbjct: 122 YPGGLVWVIHLKEREGLIRARLSGAKLATGDVLIFLDSHCETNVNWLPPLLDPISKNYRT 181
Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPT 258
+T P ID ID T+E+R+ D RG F+W YK LP R + + P++SP
Sbjct: 182 VTCPFIDVIDADTFEYRA---QDDGARGAFDWSFYYKR--LP-RLSTDSLHPETPFESPV 235
Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
AGGLFA+ R +F ELGGYDP L +WGGE +ELSFKIWMCGG + VPCSR+GH++R +
Sbjct: 236 MAGGLFAISRKWFWELGGYDPLLHIWGGEQYELSFKIWMCGGRLIDVPCSRVGHIFREY- 294
Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
P NF + ++K + N+KRV E W DE +K Y Y P +D GD+S+Q
Sbjct: 295 PTNFPQ--PKIKN-FLRRNFKRVAEVWMDE-YKEYIYRSLPECRKVDPGDLSQQ 344
>gi|170592315|ref|XP_001900914.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Brugia malayi]
gi|158591609|gb|EDP30214.1| Polypeptide N-acetylgalactosaminyltransferase 3, putative [Brugia
malayi]
Length = 584
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 154/368 (41%), Positives = 228/368 (61%), Gaps = 12/368 (3%)
Query: 9 KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
+LG L L + GPGE G A + + + E ++ S+ IS +R +PD R
Sbjct: 70 ELGILLKSLNFERNGPGEMGSAVIIDPSQQEERARKFKENQFDVMASDLISINRALPDYR 129
Query: 69 MEECKYWDYPLD---LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
+C+ D LP S+I+VFHNE +S+L+RT+HS+I R+P ++E+IL+DD S+
Sbjct: 130 SSKCREAARKYDVTSLPMVSIIIVFHNEAWSTLLRTLHSVINRSPLHLIKEVILIDDLSN 189
Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
+ L + L+ YI+RF+ LI ER GLIR R +GAK ++G+V++FLDAH EV WL
Sbjct: 190 RTYLRKPLDTYIKRFSLPFHLIHLPERSGLIRARLQGAKVAKGKVLLFLDAHVEVTEGWL 249
Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
PLL + +DRK + P+ID I + +E+ + D + G F W + ++ +P RE +
Sbjct: 250 EPLLDRVSTDRKRVVAPIIDVISDENFEY--ITASDVTWGG-FNWHLNFRWYPVPMREME 306
Query: 246 KRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
+R ++ S P ++PT AGGLFA+DR FF ++G YD G+ +WGGEN E+SF++WMCGGS+E
Sbjct: 307 RRNHDRSVPLQTPTIAGGLFAIDRQFFYDIGSYDEGMEIWGGENLEISFRVWMCGGSLEI 366
Query: 305 VPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
PCSR+GHV+R PY+F RV I +N R E W DE +K FY P A +
Sbjct: 367 HPCSRVGHVFRKHTPYSFPGGTARV----IHHNAARTAEVWMDE-YKDIFYGMVPAAKNV 421
Query: 365 DMGDISEQ 372
D+GD++E+
Sbjct: 422 DVGDLTER 429
>gi|156373014|ref|XP_001629329.1| predicted protein [Nematostella vectensis]
gi|156216327|gb|EDO37266.1| predicted protein [Nematostella vectensis]
Length = 499
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 151/353 (42%), Positives = 213/353 (60%), Gaps = 10/353 (2%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK--YWDYPLD 80
G G+ G+A LP ++ + + N+ S+ IS DR + D+R +CK + YP
Sbjct: 1 GLGDLGEAATLPTRFKEHAAHAFDNHSFNVMLSDRISLDRRLKDVRGPKCKRKHKLYPRA 60
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP SVI+ FHNE S L+RTVHS++ +P + + +IILVDD+S DL Q L D+I
Sbjct: 61 LPTTSVIICFHNEALSVLLRTVHSVLNESPPRLIADIILVDDYSEYDDLKQPLIDHISML 120
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
N KV+LIR R+GL+ R RGA+E+RGEV+ FLD+HCE WL PLL I DR+ +
Sbjct: 121 N-KVKLIRMPSRQGLVPARLRGAEEARGEVLTFLDSHCEATPGWLEPLLVRIAEDRRNVV 179
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
PVI+ I+ +FR H RG F W + + +PE E K+RK ++ +SPT A
Sbjct: 180 CPVIEVINAD--DFRYQASDVIHERGGFTWDLFFTWKAIPEAEKKRRKDETDYIRSPTMA 237
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM-P 319
GGLFA+ + +F +LG YD + +WGGEN E+SF+IWMCGG +E VPCSR+GHV+R + P
Sbjct: 238 GGLFAIHKKYFYDLGSYDSKMEIWGGENLEMSFRIWMCGGQLEIVPCSRVGHVFRKYTSP 297
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
Y F K + N+ R+ E W DE Y+ + +D+GDIS++
Sbjct: 298 YKFPKGTTTT----LARNFNRLAEVWMDEYKDHYYRKKTEEERNVDIGDISDR 346
>gi|348513278|ref|XP_003444169.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
[Oreochromis niloticus]
Length = 584
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 162/360 (45%), Positives = 216/360 (60%), Gaps = 20/360 (5%)
Query: 18 EPYKEGPGEGGKAYHL---PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC-- 72
+P PGE G+A HL PE + D S+ Y +N+ S+ IS R I D RM+EC
Sbjct: 74 QPDNNAPGEWGRATHLNLSPEEKKQEQD-SVERYAINIYVSDKISLHRHIQDHRMKECRS 132
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K +DY LP SVI+ F+NE +S+L+RT+HS+++ TPA L+EIILVDDFS + L K
Sbjct: 133 KKFDY-RHLPTTSVIIAFYNEAWSTLLRTIHSVLETTPAILLKEIILVDDFSDRGYLKSK 191
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
L DYI +VRLIR +REGL+R R GA + G+V+ FLD HCE W+ PLL I
Sbjct: 192 LADYISDLQ-RVRLIRTNKREGLVRARLIGATYATGDVLTFLDCHCECVPGWIEPLLERI 250
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 252
+ + PVID ID+ T+EF + D G F+W + ++ + +PE E K+RK +
Sbjct: 251 SENASTIVCPVIDTIDWNTFEF--YMQTDEPMIGGFDWRLTFQWHSVPEMERKRRKSRID 308
Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
P +SPT AGGLFA+ +A+F LG YD G+ VWGGEN ELSF++W CGGS+E PCS +GH
Sbjct: 309 PIRSPTMAGGLFAVSKAYFEYLGTYDMGMDVWGGENLELSFRVWQCGGSLEIHPCSHVGH 368
Query: 313 VYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
V+ PY P N R E W D +K +FY R P A G+ISE+
Sbjct: 369 VFPKKAPY---------ARPNFLQNTVRAAEVWMD-SYKKHFYNRNPPARKEKYGNISER 418
>gi|118093951|ref|XP_422165.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Gallus
gallus]
Length = 556
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 159/371 (42%), Positives = 231/371 (62%), Gaps = 13/371 (3%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R++ CK YP +LP SV++VFHNE +S+L+RTVHS++ R+P + L EIILVDD
Sbjct: 96 SLPDVRLDGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVHSVVARSPRRLLAEIILVDD 155
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L LE+Y+++ V+++R +R GLIR R RGA +RG+VI FLDAHCE
Sbjct: 156 ASEREFLKASLENYVKKLEVPVKILRMEQRSGLIRARLRGAAAARGQVITFLDAHCECTR 215
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA I+ DR+ + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIWEDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
E +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF++W CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGSYDAGMDIWGGENLEMSFRVWQCGGS 332
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+E V CS +GHV+R PY F G +I N +R+ E W DE K +FY P
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387
Query: 362 MFLDMGDISEQ 372
+ +D GD+S +
Sbjct: 388 VKVDYGDVSAR 398
>gi|355689592|gb|AER98884.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 [Mustela putorius
furo]
Length = 609
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 163/380 (42%), Positives = 225/380 (59%), Gaps = 20/380 (5%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETSN 56
V ++ K+ ++ ++ + E P +G + E + D ++ NM SN
Sbjct: 66 VLESQFKVNKIDDTVDNHVEDPEKGNMKFSSELGMIFNERDQELRDLGYQKHAFNMLISN 125
Query: 57 HISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 116
+ + R +PD R CK YP+DLP ASV++ F+NE S+L+RTVHS++ RTPAQ L E
Sbjct: 126 RLGYHRDVPDTRNAACKDKSYPVDLPVASVVICFYNEALSALLRTVHSVLDRTPAQLLHE 185
Query: 117 IILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
IILVDD S DL +LE+Y+Q++ GK+++IRN +REGLIR R GA S GEV+VFLD
Sbjct: 186 IILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNAKREGLIRGRMIGAAHSTGEVLVFLD 245
Query: 176 AHCEVG---LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGM 232
+HCEV L WL PLLA I DR+ + PVID I T Y RG F WG+
Sbjct: 246 SHCEVNVMWLMWLQPLLAAIQQDRRTVVCPVIDIISADTL----AYSSSPVVRGGFNWGL 301
Query: 233 LYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELS 292
+K + +P E + + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+S
Sbjct: 302 HFKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEIS 361
Query: 293 FKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKA 352
F+IWMCGG + +PCSR+GH++R PY + D +T+N R+ W D+ +
Sbjct: 362 FRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDDYKEQ 416
Query: 353 YFYTREPLAMFLDMGDISEQ 372
YF R L G+ISE+
Sbjct: 417 YFSLRPDLRT-KSYGNISER 435
>gi|308481980|ref|XP_003103194.1| CRE-GLY-3 protein [Caenorhabditis remanei]
gi|308260299|gb|EFP04252.1| CRE-GLY-3 protein [Caenorhabditis remanei]
Length = 615
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 157/378 (41%), Positives = 226/378 (59%), Gaps = 16/378 (4%)
Query: 3 VFKADGKLGN-LEPPLEPYKEGPG---EGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
VF D + N L +E GPG +GG +PE + + E N+ S I
Sbjct: 86 VFPVDKETANQLRKLMETQAFGPGYHGQGGTGVTVPEDKKDIKEKRFLENQFNVVASEMI 145
Query: 59 SFDRTIPDLRMEECKYWDYPLD---LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
S +RT+PD R E C+ L LP S+I+VFHNE +++L+RT+HS+I R+P LE
Sbjct: 146 SINRTLPDYRSEACRTTGNSLKTEGLPTTSIIIVFHNEAWTTLLRTLHSVINRSPRHLLE 205
Query: 116 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
EIILVDD S + L + L+ YI++F V L+ +R GLIR R G+ ++G++++FLD
Sbjct: 206 EIILVDDKSDRDYLVKPLDAYIKKFPVPVHLVHLEDRSGLIRARLTGSGMAKGKILLFLD 265
Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 235
AH EV WL PL+ + DRK + P+ID I T+E+ + E G F W + ++
Sbjct: 266 AHVEVTDGWLEPLVTRVAEDRKRVVAPIIDVISDDTFEYVTASETTW---GGFNWHLNFR 322
Query: 236 ENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
+P+RE +R + S P ++PT AGGLFA+D+ FF ++G YD G+ VWGGEN E+SF+
Sbjct: 323 WYAVPKRELNRRGADRSMPIQTPTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEISFR 382
Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
+WMCGGS+E PCSR+GHV+R PY F +V I +N R E W DE +KA+F
Sbjct: 383 VWMCGGSLEIHPCSRVGHVFRKQTPYTFPGGTAKV----IHHNAARTAEVWMDE-YKAFF 437
Query: 355 YTREPLAMFLDMGDISEQ 372
Y P A ++ GD++E+
Sbjct: 438 YKMVPAARNVEAGDVTER 455
>gi|291238116|ref|XP_002738977.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 561
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 153/355 (43%), Positives = 222/355 (62%), Gaps = 13/355 (3%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYP--L 79
+GPGE G+ +P N+ SN IS +RT+PD+R++ CK YP
Sbjct: 52 KGPGEMGQPVIIPPEEEELKKEMFKINQFNLLASNKISVNRTLPDVRIDGCKKKIYPPSQ 111
Query: 80 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
LP S+I+VFHNE +S+L+R +HSII R+P + LEEIILVDD S + L ++L+DY++
Sbjct: 112 KLPTTSIIIVFHNEAWSTLIRNIHSIINRSPREILEEIILVDDASERDFLGKQLDDYVRG 171
Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
+ +VR++R ER G++ R RGA S GEV+ FLDAHCE WL PL+A I DR +
Sbjct: 172 LSVRVRVVRMAERSGIVGARLRGAAISTGEVLTFLDAHCECTKGWLEPLIARIAEDRTRV 231
Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE-PYKSPT 258
PVID I +T+E+ SV E G F W + ++ + +RE K+RK ++ P +PT
Sbjct: 232 VSPVIDSISDETFEYNSVPELGC---GGFNWRLNFRWYPMSKREKKRRKGDATIPINTPT 288
Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
AGGLF++ + +F +G YD G+ +WGGEN E+SF+IWMCGG++E VPCS +GHV+R
Sbjct: 289 MAGGLFSIHKEYFYRIGTYDEGMDIWGGENLEMSFRIWMCGGTLEIVPCSHVGHVFRGKS 348
Query: 319 PYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
PY F G +A ++ N +R+ E W DE +K+++Y P A + GDI ++
Sbjct: 349 PYTFPGGVA-----TVVHNNNRRLAEVWMDE-YKSFYYKTVPNARNAEYGDIEDR 397
>gi|426228257|ref|XP_004008230.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Ovis
aries]
Length = 606
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 155/332 (46%), Positives = 209/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R CK YP+DLP ASV++ F+NE S+L+RT
Sbjct: 109 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYPVDLPVASVVICFYNEALSALLRT 168
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS++ RTPA+ L EIILVDD S DL +L++YIQ++ GK+++IRN +REGLIR R
Sbjct: 169 VHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLPGKIKVIRNPKREGLIRGRM 228
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I DR+ + PVID I T Y
Sbjct: 229 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRRAVVCPVIDIISADTL----AYSS 284
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 285 SPVVRGGFNWGLHFKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSG 344
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 345 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 399
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L + G+ISE+
Sbjct: 400 LAHVWLDEYKEQYFSLRPDLRT-RNYGNISER 430
>gi|351714167|gb|EHB17086.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Heterocephalus
glaber]
Length = 330
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 151/337 (44%), Positives = 210/337 (62%), Gaps = 9/337 (2%)
Query: 28 GKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVI 87
GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +LP SV+
Sbjct: 2 GKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDELPNTSVV 61
Query: 88 LVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLI 147
+VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++ V++I
Sbjct: 62 IVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKFTLENYVKNLEVPVKII 121
Query: 148 RNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGI 207
R ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK + P+ID I
Sbjct: 122 RMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVVCPIIDVI 181
Query: 208 DYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAM 266
T+E+ + D Y G F W + ++ +P+RE +RK + + P ++PT AGGLF++
Sbjct: 182 SDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSI 238
Query: 267 DRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLA 326
DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R PY F
Sbjct: 239 DRNYFEEIGTYDAGMDIWGGENLEISFRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGT 298
Query: 327 DRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMF 363
G +I N +R+ E W DE K +FY P F
Sbjct: 299 ----GHVINKNNRRLAEVWMDE-FKDFFYIISPGMQF 330
>gi|296210174|ref|XP_002751861.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Callithrix jacchus]
Length = 607
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 154/332 (46%), Positives = 207/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RT
Sbjct: 110 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRT 169
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS+I RTPA L E+ILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 170 VHSVIDRTPAHLLHEVILVDDDSDFDDLKGELDEYVQKYLPGKIKIIRNTKREGLIRGRM 229
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I D+ + PVID I T Y
Sbjct: 230 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDQHTVVCPVIDIISADTL----AYSS 285
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ ++ + +P E + + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 286 SPIVRGGFNWGLHFRWDLVPLSELGGAEGATTPIKSPTMAGGLFAMNRQYFHELGQYDSG 345
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 346 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 400
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 401 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 431
>gi|260823684|ref|XP_002606210.1| hypothetical protein BRAFLDRAFT_246892 [Branchiostoma floridae]
gi|229291550|gb|EEN62220.1| hypothetical protein BRAFLDRAFT_246892 [Branchiostoma floridae]
Length = 595
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 149/341 (43%), Positives = 210/341 (61%), Gaps = 11/341 (3%)
Query: 32 HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFH 91
H PE + D + N+ S+ I F R IPD R ++C+ YP LPK S+++ F
Sbjct: 92 HSPED-QETRDMGYRRHAFNLLISDRIGFHRNIPDTRNDKCRGKSYPSGLPKTSIVICFF 150
Query: 92 NEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTE 151
NE +S+L+RTVHS++ RTP + L+EIIL+DDFS ++ L ++LE+YI+ V+L R +
Sbjct: 151 NEAWSTLLRTVHSVLDRTPRELLQEIILIDDFSDQSHLKEELEEYIRDHLPMVQLYRTDK 210
Query: 152 REGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQT 211
REGLIR R +GA + G+V++FLD+HCEV WL PLLA I DR + P+ID I+ T
Sbjct: 211 REGLIRARVKGATHASGDVLMFLDSHCEVSKQWLEPLLARIAEDRTRVVCPIIDIINSDT 270
Query: 212 WEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFF 271
+E Y RG F WG+ +K +++P++ + + P SPT AGGLFA+DR +F
Sbjct: 271 FE----YTASPLVRGGFNWGLHFKWDQVPQQLLQGPDGAAAPINSPTMAGGLFAIDREYF 326
Query: 272 LELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG 331
ELG YD G+ +WGGEN E+SF+IWMCGG++E +PCSR+GHV+R PY D
Sbjct: 327 DELGRYDEGMDIWGGENLEISFRIWMCGGTLEIIPCSRVGHVFRKRRPYGSPNGED---- 382
Query: 332 PLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
++ N R+ W DE YF R P GDIS++
Sbjct: 383 -TMSKNSLRMAHVWMDEYKDQYFSLR-PEMKTRTYGDISDR 421
>gi|260787295|ref|XP_002588689.1| hypothetical protein BRAFLDRAFT_248153 [Branchiostoma floridae]
gi|229273857|gb|EEN44700.1| hypothetical protein BRAFLDRAFT_248153 [Branchiostoma floridae]
Length = 415
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 154/368 (41%), Positives = 224/368 (60%), Gaps = 14/368 (3%)
Query: 12 NLEPPLEPYK-EGPGEGGKAYH--LPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
N++ EP PG G+A +P+ ++A +A N S+ I ++R++PD R
Sbjct: 17 NIDATTEPRDPHAPGARGRAVEDAMPQ-HQADIEAGWKAASFNQFVSDLIPYERSLPDTR 75
Query: 69 MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
C + DLP S+I+ F E +S+L+R+VHS+I R+P +EEI+L+DD S ++
Sbjct: 76 PPRCAEQEVADDLPTTSIIMCFCEESWSTLLRSVHSVINRSPPHLVEEILLIDDASRRSH 135
Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
L QKL+ Y+ +F +VR++ ER GLIR R +GA+ + G V+ FLD+H E + WL PL
Sbjct: 136 LKQKLDQYMSKF-PQVRVVHLKERAGLIRARLKGAELATGTVLTFLDSHIECNVGWLEPL 194
Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
L I DR + P ID ++ T+ + E + RG F+W + ++ LP EAK+R
Sbjct: 195 LDRIREDRTRVVCPSIDRVNEATFAYEVANE---NVRGGFDWELFFQWVSLPAVEAKRRT 251
Query: 249 YN---SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
+N E +SPT AGGLF++DR FF ELGGYDPG +WGGEN ELSFKIWMCGGS+E +
Sbjct: 252 HNVFQHEVIRSPTMAGGLFSIDRGFFYELGGYDPGFQIWGGENLELSFKIWMCGGSLEIL 311
Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL- 364
PCSR+GHV+R PYN+ ++ ++ +N R+ E W DE K Y+ + + L
Sbjct: 312 PCSRVGHVFRKSQPYNYSNATSIME--VVHHNNVRLAEVWLDEYKKIYYALHPGVEVELA 369
Query: 365 DMGDISEQ 372
MGDISE+
Sbjct: 370 KMGDISER 377
>gi|351712481|gb|EHB15400.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Heterocephalus
glaber]
Length = 399
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 150/361 (41%), Positives = 219/361 (60%), Gaps = 10/361 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
LEP +P+ EGPGE GK +P+ + +N+ S I+ +R++P+ R+E C
Sbjct: 27 LEPVQKPH-EGPGEMGKPVDIPKEDQEKMKEMFKINQVNLMASEMIALNRSLPNDRLEGC 85
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
K YP +LP SV++VFHNE +S+L+RTVHS+I +P +EEI+LVDD + + L +
Sbjct: 86 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINCSPRHMVEEIVLVDDANERDFLKRT 145
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE Y+++ V +IR R GLIR R +G S+G+VI+FLDAHCE + WL PLL I
Sbjct: 146 LESYVKKLKVPVHVIRMEHRSGLIRDRLKGDAVSKGQVIIFLDAHCECTVGWLEPLLTRI 205
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
DR+ + P+ID I T F + D Y G F W + ++ +P+RE +RK + +
Sbjct: 206 KQDRRTVVCPIIDVISDDT--FECMAGSDMTYGG-FNWKLNFRWYLVPQREMDRRKGDRT 262
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGG F++DR +F E+G YD G+ +WG EN E+SF+IW CGG++E V CS +G
Sbjct: 263 LPVRTPTMAGGCFSIDRDYFQEIGTYDAGMDIWGRENLEISFRIWQCGGTLEIVTCSHVG 322
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV++ PY F G +I N +R+ E W DE K +FY P +D GD+S
Sbjct: 323 HVFQKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDVSS 377
Query: 372 Q 372
+
Sbjct: 378 R 378
>gi|410953274|ref|XP_003983297.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Felis catus]
Length = 608
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 161/377 (42%), Positives = 224/377 (59%), Gaps = 17/377 (4%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETSN 56
V ++ K+ ++ ++ + E P +G + E + D ++ NM SN
Sbjct: 66 VLESQFKVNRIDDMIDNHVEDPEKGNTKFSSELGMIFDERDQELRDLGYQKHAFNMLISN 125
Query: 57 HISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 116
+ + R +PD R CK YP DLP ASV++ F+NE S+L+RTVHS++ RTPAQ L E
Sbjct: 126 RLGYRRDVPDTRNAACKDKSYPADLPVASVVICFYNEALSALLRTVHSVLDRTPAQLLHE 185
Query: 117 IILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
IILVDD S DL +LE+Y+Q++ GK+++IRNT+REGLIR R GA + GEV+VFLD
Sbjct: 186 IILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLD 245
Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 235
+HCEV + WL PLLA I D + + PVID I T Y RG F WG+ +K
Sbjct: 246 SHCEVNVLWLQPLLAAIREDPRTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFK 301
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
+ +P E + + P +SPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+I
Sbjct: 302 WDLVPLSELGGPEGATAPIRSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRI 361
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
WMCGG + +PCSR+GH++R PY + D +T+N R+ W DE + YF
Sbjct: 362 WMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYFS 416
Query: 356 TREPLAMFLDMGDISEQ 372
R L G+ISE+
Sbjct: 417 LRPDLRT-KSYGNISER 432
>gi|410953276|ref|XP_003983298.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Felis catus]
Length = 527
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 155/332 (46%), Positives = 207/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R CK YP DLP ASV++ F+NE S+L+RT
Sbjct: 30 DLGYQKHAFNMLISNRLGYRRDVPDTRNAACKDKSYPADLPVASVVICFYNEALSALLRT 89
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS++ RTPAQ L EIILVDD S DL +LE+Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 90 VHSVLDRTPAQLLHEIILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNTKREGLIRGRM 149
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I D + + PVID I T Y
Sbjct: 150 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDPRTVVCPVIDIISADTL----AYSS 205
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + P +SPT AGGLFAM+R +F ELG YD G
Sbjct: 206 SPVVRGGFNWGLHFKWDLVPLSELGGPEGATAPIRSPTMAGGLFAMNRHYFNELGQYDSG 265
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 266 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 320
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R P G+ISE+
Sbjct: 321 LAHVWLDEYKEQYFSLR-PDLRTKSYGNISER 351
>gi|345492127|ref|XP_001602037.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Nasonia vitripennis]
Length = 635
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 147/353 (41%), Positives = 214/353 (60%), Gaps = 9/353 (2%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
++ PGE GKA H+P A N+ S+ IS +R++ D+R+ CK +P
Sbjct: 123 RDSPGEMGKAVHIPPEQDAIQQELFKLNQFNLMASDMISLNRSLKDVRLSGCKSKKFPKL 182
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP S+++VFHNE +S+L+RTV S+I R+P L+EIILVDD S + L QKLEDY++
Sbjct: 183 LPDTSIVIVFHNEAWSTLLRTVWSVINRSPRALLKEIILVDDASEREHLKQKLEDYVETL 242
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
+ R +R GLIR R GAK +G+VI FLDAHCE WL PLLA I D+K +
Sbjct: 243 PVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIAHDKKTVV 302
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
P+ID I T+E+ + D + G F W + ++ + +RE +R + + P ++PT
Sbjct: 303 CPIIDVISDDTFEY--ITASDMTWGG-FNWKLNFRWYRVAQREMDRRNGDRTAPLRTPTM 359
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG +E PCS +GHV+R P
Sbjct: 360 AGGLFSIDKDYFYELGAYDEGMDIWGGENLEMSFRVWQCGGILEISPCSHVGHVFRDKSP 419
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
Y F ++ + +N RV E W DE + ++Y P A + +GD+SE+
Sbjct: 420 YTFPGGVSKI----VLHNAARVAEVWMDE-WRDFYYAMNPGARNVPVGDVSER 467
>gi|194210168|ref|XP_001915003.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Equus
caballus]
Length = 609
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 154/332 (46%), Positives = 208/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R CK YP DLP ASV++ F+NE S+L+RT
Sbjct: 112 DLGYQKHAFNMLISNRLGYHREVPDTRNAACKDKSYPTDLPVASVVICFYNEALSALLRT 171
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS++ RTPA+ L E+ILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 172 VHSVLDRTPARLLHEVILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 231
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I DR+++ PVID I T Y
Sbjct: 232 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAVIQEDRRMVVCPVIDIISADTL----AYSS 287
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + P KSPT AGGLFAM R +F ELG YD G
Sbjct: 288 SPVVRGGFNWGLHFKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMSRRYFSELGQYDSG 347
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 348 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 402
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 403 LAYVWLDEYKEQYFSLRPDLRT-KSYGNISER 433
>gi|390350617|ref|XP_784979.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 647
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 149/352 (42%), Positives = 212/352 (60%), Gaps = 14/352 (3%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
GE GK + DA + N+ S+ I+F+R++PD+R ++CK YP LP
Sbjct: 235 GEMGKPVIFEGDMKTHADALYHKNAFNLLASDMIAFNRSLPDVRPQQCKSLVYPEVLPTT 294
Query: 85 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF---N 141
SVI++FHNE FS+L+RTVHS+I R+P L+EIILVDD S++ L KL+DYI R +
Sbjct: 295 SVIIIFHNEAFSALLRTVHSVINRSPRHLLKEIILVDDASTQEHLKVKLDDYISRHFHSS 354
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
+VR+ R R GLIR R GA + G+++ FLD+HCEV + WL PLLA I DR+ +
Sbjct: 355 ARVRIERLPTRSGLIRARIHGALNAIGDILTFLDSHCEVNVGWLEPLLAVIDKDRRNVVT 414
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
P ID ID ++ + G F W M ++ + + ++ K N + P +SPT A
Sbjct: 415 PTIDVIDDNDLAYKGSDQLPQ--VGSFGWTMAFRWTAIQTMDLEEAKRNPTLPIRSPTMA 472
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLF++D+ +F+ELG YDPG +WG EN ELSFK WMCGGS+ + CS +GH++R F PY
Sbjct: 473 GGLFSIDKGYFMELGMYDPGFQIWGAENIELSFKTWMCGGSLYTMACSHVGHIFRKFAPY 532
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ G N KR+IE W + +A++Y P + +D GDI +Q
Sbjct: 533 SG-------MGSYFHRNNKRLIEVWLGDA-RAFYYKLHPDVLRIDAGDIQDQ 576
>gi|196001849|ref|XP_002110792.1| hypothetical protein TRIADDRAFT_22976 [Trichoplax adhaerens]
gi|190586743|gb|EDV26796.1| hypothetical protein TRIADDRAFT_22976 [Trichoplax adhaerens]
Length = 515
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 153/354 (43%), Positives = 210/354 (59%), Gaps = 10/354 (2%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
K+ PGE GKA +P+ + N S+ IS R +PD R + CK YP D
Sbjct: 7 KDAPGENGKAVDIPKEFLIESKRLFERNKFNQWASDKISLHRILPDARPKLCKDKVYPGD 66
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP SV++VFHNE +S+L+RT+HS++ RT L EIILVDD S +L L+ YI +
Sbjct: 67 LPPTSVVIVFHNEAWSTLLRTIHSVLDRTAPDLLIEIILVDDKSVVKELHAPLDAYIAKL 126
Query: 141 NGKVRLIRNTEREGLIRTRSRGAK--ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
KV++IRN +REGLIR+R G S+ V+ FLDAHCE WL PLL IY+DR
Sbjct: 127 -AKVKIIRNKKREGLIRSRLNGKSFAASKAPVVTFLDAHCEANTGWLEPLLERIYNDRST 185
Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPT 258
+ P ID I + + ++ Y P RGIF W + ++ + E K+R+ +P ++PT
Sbjct: 186 VVCPEIDVISDENFAYQ--YGPSGLMRGIFNWDLHFRWRAVSTEEQKRRQSPIDPVRTPT 243
Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
AGGLFA++R +F E+G YD + +WGGEN E+SF+IW CGG++E VPCS +GHV+R
Sbjct: 244 MAGGLFAINRDYFKEIGTYDEEMDIWGGENLEISFRIWQCGGTLEIVPCSHVGHVFRKSQ 303
Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
PY F K G N +RV E W D +K +FY R+P GDIS++
Sbjct: 304 PYGFPKGVVDTLGK----NSQRVAEVWMD-GYKEFFYQRQPHLRGHAYGDISKR 352
>gi|440895697|gb|ELR47827.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Bos grunniens
mutus]
Length = 606
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 155/332 (46%), Positives = 208/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R CK YP DLP ASV++ F+NE S+L+RT
Sbjct: 109 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYPADLPVASVVICFYNEALSALLRT 168
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS++ RTPA+ L EIILVDD S DL +L++YIQ++ GK+++IRN +REGLIR R
Sbjct: 169 VHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLPGKIKVIRNPKREGLIRGRM 228
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I DR+ + PVID I T Y
Sbjct: 229 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRQTVVCPVIDIISADTL----AYSS 284
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 285 SPVVRGGFNWGLHFKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSG 344
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 345 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 399
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L + G+ISE+
Sbjct: 400 LAHVWLDEYKEQYFSLRPDLRT-RNYGNISER 430
>gi|195129477|ref|XP_002009182.1| GI11401 [Drosophila mojavensis]
gi|193920791|gb|EDW19658.1| GI11401 [Drosophila mojavensis]
Length = 673
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 162/353 (45%), Positives = 218/353 (61%), Gaps = 15/353 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLG-EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
G GE G A L + + + +L E G N S+ IS +R++PD+R ++C+ Y L
Sbjct: 146 GIGEQGVAAKLEDESQREYERALSLENGFNALLSDSISVNRSVPDIRHKDCRKKLYLSKL 205
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI++F+NE S LMR+VHS+I R+P + L+EIILVDDFS + L + LEDYI +
Sbjct: 206 PTVSVIIIFYNEYMSVLMRSVHSLINRSPPELLKEIILVDDFSDRDYLFKPLEDYIAQHF 265
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
KVR++R R GLI RS GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 266 TKVRVVRLPRRTGLIGARSAGARNATAEVLIFLDSHVEANYNWLPPLLEPIAQNKRTAVC 325
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTHA 260
P ID ID+ + +R+ D RG F+W YK LPE K+ +EP+KSP A
Sbjct: 326 PFIDVIDHSNFNYRA---QDEGARGAFDWDFFYKRLPLLPE----DLKHPAEPFKSPVMA 378
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+ FF ELGGYD GL +WGGE +ELSFKIWMCGG + PCSRIGH+YR P
Sbjct: 379 GGLFAISAEFFWELGGYDEGLDIWGGEQYELSFKIWMCGGQMYDAPCSRIGHIYRG--PR 436
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT-REPLAMFLDMGDISEQ 372
N +++ G + NYKRV E W DE +K Y Y + + +D GD++ Q
Sbjct: 437 NH--VSNPRGGDYLHKNYKRVAEVWMDE-YKQYLYNGADGVYERIDAGDLTAQ 486
>gi|357625888|gb|EHJ76177.1| hypothetical protein KGM_07902 [Danaus plexippus]
Length = 535
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 154/354 (43%), Positives = 211/354 (59%), Gaps = 16/354 (4%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE G HLP G N S+ I +R++PD+R C+ Y
Sbjct: 12 ERGIGEHGLPAHLPIKDSEIEKDLYAVNGFNGALSDKIPLNRSLPDIRHPGCQNRLYIES 71
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP SV++ FHNE +S+L+RT +S++ R+P ++E+ LVDD S+K L ++L+DY+ +
Sbjct: 72 LPTVSVVVPFHNEHWSTLLRTAYSVLNRSPTFLIKEVFLVDDASTKDFLKEQLDDYVSKH 131
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
KV++IR R GLI R GA+++ +V+VFLD+H E +NWLPPLL PI + K +
Sbjct: 132 MPKVKIIRLKSRSGLIAARLAGAEKATADVLVFLDSHTEANVNWLPPLLEPIALNYKTVV 191
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKKRKYNSEPYKSPTH 259
P ID + Y T+ +R+ D RG F+W + YK LP EA EP+ SP
Sbjct: 192 CPFIDVVAYDTFAYRA---QDEGARGAFDWELFYKRLPVLPADEANM----PEPFPSPVM 244
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLFA+ R FF ELGGYDPGL +WGGE +ELSFK+W CGG + PCSR+GH+YR F P
Sbjct: 245 AGGLFAISRVFFWELGGYDPGLDIWGGEQYELSFKLWQCGGKMLDAPCSRVGHIYRKFAP 304
Query: 320 Y-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ N G G + NY+RV E W DE + Y Y R P + +D GDIS+Q
Sbjct: 305 FPNPG------HGDFVGKNYRRVAEVWMDE-YAQYLYKRRPHYLKIDTGDISKQ 351
>gi|73979014|ref|XP_539924.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Canis
lupus familiaris]
Length = 608
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 159/377 (42%), Positives = 224/377 (59%), Gaps = 17/377 (4%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETSN 56
V ++ K+ ++ ++ + E P +G + E + D ++ NM SN
Sbjct: 66 VLESQFKVNRIDDKIDNHVEDPEKGNIKFSSELGMIFNERDQELRDLGYQKHAFNMLISN 125
Query: 57 HISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 116
+ + R +PD R C+ +P DLP ASV++ F+NE S+L+RTVHS++ RTPAQ L E
Sbjct: 126 RLGYHRDVPDTRNAACRDKSFPADLPAASVVICFYNEALSALLRTVHSVLDRTPAQLLHE 185
Query: 117 IILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
IILVDD S DL +LE+Y+Q++ GK+++IRN +REGLIR R GA + GEV+VFLD
Sbjct: 186 IILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNIKREGLIRGRMIGAAHATGEVLVFLD 245
Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 235
+HCEV + WL PLLA I D++ + PVID I T Y RG F WG+ +K
Sbjct: 246 SHCEVNVMWLQPLLAAIQEDQQTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFK 301
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
+ +P E + + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+I
Sbjct: 302 WDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRI 361
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
WMCGG + +PCSR+GH++R PY + D +T+N R+ W DE + YF
Sbjct: 362 WMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYFS 416
Query: 356 TREPLAMFLDMGDISEQ 372
R L G+ISE+
Sbjct: 417 LRPDLRT-KSYGNISER 432
>gi|332243648|ref|XP_003270990.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Nomascus leucogenys]
Length = 608
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 154/332 (46%), Positives = 206/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R C+ YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACQEKFYPPDLPSASVVICFYNEAFSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
HS+I RTPA L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 171 AHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I D+ + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDQHTVVCPVIDIISADTL----AYSS 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 432
>gi|358412070|ref|XP_870404.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
3 [Bos taurus]
gi|359064998|ref|XP_002687097.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Bos
taurus]
Length = 606
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 154/332 (46%), Positives = 208/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R CK YP DLP AS+++ F+NE S+L+RT
Sbjct: 109 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYPADLPVASIVICFYNEALSALLRT 168
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS++ RTPA+ L EIILVDD S DL +L++YIQ++ GK+++IRN +REGLIR R
Sbjct: 169 VHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLPGKIKVIRNPKREGLIRGRM 228
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I DR+ + PVID I T Y
Sbjct: 229 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRRTVVCPVIDIISADTL----AYSS 284
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 285 SPVVRGGFNWGLHFKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSG 344
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 345 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 399
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L + G+ISE+
Sbjct: 400 LAHVWLDEYKEQYFSLRPDLRT-RNYGNISER 430
>gi|196000745|ref|XP_002110240.1| hypothetical protein TRIADDRAFT_22839 [Trichoplax adhaerens]
gi|190586191|gb|EDV26244.1| hypothetical protein TRIADDRAFT_22839 [Trichoplax adhaerens]
Length = 481
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 155/364 (42%), Positives = 214/364 (58%), Gaps = 14/364 (3%)
Query: 13 LEPPLEPYKEGP-GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
P L Y+ GE G+A +P Y+ D N S+ IS RT+PD R
Sbjct: 5 FRPTLPHYRRNSYGENGQAVVVPAVYKEESDRLFSRNRFNQWASDRISLHRTLPDQRPAA 64
Query: 72 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS---KAD 128
C+ +P +LP AS+++VFHNE +S+L+RTVHS++ R+ + + EIILVDD S +
Sbjct: 65 CRKQLFPTNLPPASLVIVFHNEAWSTLLRTVHSVLDRSDPRLMREIILVDDCSEIKGHEE 124
Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
L LE YIQ+ V+L+RN +R+GLIR R RG KE VIVFLDAHCEV WL PL
Sbjct: 125 LQAPLEKYIQKLK-IVKLVRNKKRQGLIRARLRGYKEVTSPVIVFLDAHCEVVDGWLEPL 183
Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
LA I+ +R + P ID I ++ + Y RG+F W + ++ LP E ++RK
Sbjct: 184 LARIHENRSNVVCPEIDVISFENFG----YSYASGIRGVFNWNLHFRWRTLPAVEQQRRK 239
Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
+P +SPT AGGLFA+ + +F ++G YD + +WGGEN E+SF+IW CGG++E +PCS
Sbjct: 240 SVIDPIRSPTMAGGLFAIHKKYFEDIGLYDDEMDIWGGENLEMSFRIWQCGGNLEIIPCS 299
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
+GHV+R PY F K A G + N +RV E W D +K FY R P GD
Sbjct: 300 HVGHVFRKSQPYTFPKGA----GETLNKNLQRVAEVWMD-NYKDIFYNRFPNLRQHSYGD 354
Query: 369 ISEQ 372
IS++
Sbjct: 355 ISKR 358
>gi|268564602|ref|XP_002647197.1| C. briggsae CBR-GLY-10 protein [Caenorhabditis briggsae]
Length = 623
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 156/355 (43%), Positives = 214/355 (60%), Gaps = 19/355 (5%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKYWDY 77
+EGPGE GK LP+ +A L Y G N S+ IS +R+I D+R +ECK Y
Sbjct: 95 REGPGEWGKPVKLPDDKETEKEA-LSLYKANGYNAYISDMISLNRSIKDIRHKECKKMTY 153
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP SVI FH E S+L+R+V+S+I R+P + L+EIILVDDFS K L Q LED++
Sbjct: 154 SAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLEDFL 213
Query: 138 QR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
++ + V+++R +REGLIR R GA+E+ GE+++FLDAH E NWLPPLL PI D
Sbjct: 214 KKNKIDHIVKILRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLPPLLDPIADD 273
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYK 255
+ + P +D ID +T+E R D RG F+W YK L +++ R+ ++P+
Sbjct: 274 YRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTKKD---RENPTKPFD 327
Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
SP AGG FA+ +F ELGGYD GL +WGGE +ELSFK+W C G + PCSR+ H+YR
Sbjct: 328 SPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGKMVDAPCSRVAHIYR 387
Query: 316 S-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+ P+ + D ++ NYKRV E W DE +K Y P D GD+
Sbjct: 388 CKYAPFKNAGMGD-----FVSRNYKRVAEVWMDE-YKETLYKHRPGIGNADAGDL 436
>gi|17553814|ref|NP_498722.1| Protein GLY-3 [Caenorhabditis elegans]
gi|21264486|sp|P34678.2|GALT3_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
AltName: Full=GalNAc-T1; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 3; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 3; Short=pp-GaNTase 3
gi|3047187|gb|AAC13669.1| GLY3 [Caenorhabditis elegans]
gi|351020565|emb|CCD62541.1| Protein GLY-3 [Caenorhabditis elegans]
Length = 612
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 155/380 (40%), Positives = 228/380 (60%), Gaps = 16/380 (4%)
Query: 1 RPVFKADGKLGN-LEPPLEPYKEGPG---EGGKAYHLPEAYRAAGDASLGEYGMNMETSN 56
+ V+ D + N L +E GPG +GG +PE + + E N+ S
Sbjct: 82 KQVYPVDKETANQLRKLMETQAFGPGYHGQGGTGVTVPEDKKTIKEKRFLENQFNVVASE 141
Query: 57 HISFDRTIPDLRMEECKYWDYPLD---LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 113
IS +RT+PD R + C+ L +PK S+I+VFHNE +++L+RT+HS+I R+P
Sbjct: 142 MISVNRTLPDYRSDACRTSGNNLKTAGMPKTSIIIVFHNEAWTTLLRTLHSVINRSPRHL 201
Query: 114 LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 173
LEEIILVDD S + L + L+ YI+ F + L+ R GLIR R G++ ++G++++F
Sbjct: 202 LEEIILVDDKSDRDYLVKPLDSYIKMFPIPIHLVHLENRSGLIRARLTGSEMAKGKILLF 261
Query: 174 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 233
LDAH EV WL PL++ + DRK + P+ID I T+E+ + E G F W +
Sbjct: 262 LDAHVEVTDGWLEPLVSRVAEDRKRVVAPIIDVISDDTFEYVTASETTW---GGFNWHLN 318
Query: 234 YKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELS 292
++ +P+RE +R + S P ++PT AGGLFA+D+ FF ++G YD G+ VWGGEN E+S
Sbjct: 319 FRWYAVPKRELNRRGSDRSMPIQTPTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEIS 378
Query: 293 FKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKA 352
F++WMCGGS+E PCSR+GHV+R PY F +V I +N R E W DE +KA
Sbjct: 379 FRVWMCGGSLEIHPCSRVGHVFRKQTPYTFPGGTAKV----IHHNAARTAEVWMDE-YKA 433
Query: 353 YFYTREPLAMFLDMGDISEQ 372
+FY P A ++ GD+SE+
Sbjct: 434 FFYKMVPAARNVEAGDVSER 453
>gi|241998138|ref|XP_002433712.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215495471|gb|EEC05112.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 653
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 144/313 (46%), Positives = 196/313 (62%), Gaps = 10/313 (3%)
Query: 47 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
++ N+ SN + F R++PD R C+ ++ +LP ASV++ F+NE +S+L+RTVH+++
Sbjct: 154 QHAFNLLISNRLGFYRSLPDTRNPLCRSEEHGAELPTASVVVCFYNEAWSTLLRTVHTVL 213
Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKE 165
RTP L E+ILVDD S++ DL +L +Y+ + VRLIR +REGLIR R GA+
Sbjct: 214 GRTPRHLLHEVILVDDNSTQVDLGPQLAEYVSSQLPSHVRLIRTRDREGLIRARMFGARN 273
Query: 166 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 225
+ GEV+VFLD+HCEV + WL PLL I ++R +T P+ID I+ T+E Y R
Sbjct: 274 ASGEVLVFLDSHCEVNVGWLEPLLERIRANRATVTCPIIDIINADTFE----YTASPIVR 329
Query: 226 GIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWG 285
G F WG+ +K P A+K + P SPT AGGLFAMDR FF LG YD G+ +WG
Sbjct: 330 GGFNWGLHFKWESPPAGLARKGRGAIAPIPSPTMAGGLFAMDRKFFHRLGEYDDGMDIWG 389
Query: 286 GENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETW 345
GEN E+SF+IWMCGG +E +PCSR+GHV+R PY D +T N RV W
Sbjct: 390 GENLEISFRIWMCGGQLEIIPCSRVGHVFRRRRPYGSPNGED-----TLTKNSLRVAHVW 444
Query: 346 FDEKHKAYFYTRE 358
D+ K YF TR
Sbjct: 445 MDDYKKYYFQTRS 457
>gi|410910794|ref|XP_003968875.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Takifugu rubripes]
Length = 583
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 165/372 (44%), Positives = 220/372 (59%), Gaps = 24/372 (6%)
Query: 8 GKLGNLEPPL----EPYKEGPGEGGKAYHL---PEAYRAAGDASLGEYGMNMETSNHISF 60
G G L PL P PGE G+A HL P+ + D S+ Y +N+ S+ IS
Sbjct: 60 GPEGQLARPLYVKPPPDTNAPGELGRAAHLNLSPDEKKQEED-SIERYAINIFVSDKISL 118
Query: 61 DRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
R I D RM+EC K ++Y LP SVI+ F+NE +S+L+RT+HS+++ TPA L+EII
Sbjct: 119 HRHIQDHRMKECRSKTFNY-RRLPTTSVIIAFYNEAWSTLLRTIHSVLETTPAILLKEII 177
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 178
L+DDFS +A L +L DYI +VRLIR +REGL+R R GA + GEV+ FLD HC
Sbjct: 178 LIDDFSDRAYLKSQLADYISNLE-RVRLIRTKKREGLVRARLIGATYATGEVLTFLDCHC 236
Query: 179 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE 238
E W+ PLL I + + PVID ID+ T+EF + + G F+W + ++ +
Sbjct: 237 ECVPGWIEPLLERIGENSSTIVCPVIDTIDWNTFEF--YMQTEEPMIGGFDWRLTFQWHS 294
Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
+PERE K+RK +P +SPT AGGLFA+++ FF LG YD G+ VWGGEN ELSF++W C
Sbjct: 295 VPERERKRRKSPVDPIRSPTMAGGLFAVNKNFFEYLGTYDMGMEVWGGENLELSFRVWQC 354
Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
GGS+E PCS +GHV+ PY P N R E W D +K +FY R
Sbjct: 355 GGSLEIHPCSHVGHVFPKKAPY---------ARPNFLQNTVRAAEVWMD-SYKQHFYNRN 404
Query: 359 PLAMFLDMGDIS 370
P A GDIS
Sbjct: 405 PPARKETYGDIS 416
>gi|324503401|gb|ADY41481.1| N-acetylgalactosaminyltransferase 6 [Ascaris suum]
Length = 927
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 150/354 (42%), Positives = 219/354 (61%), Gaps = 11/354 (3%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC--KYWDYPLD 80
G GE G+ L E D + G N+ S+ I+ +R++PD+R +C K + P +
Sbjct: 98 GVGEDGRPVKLDELEDRLSDDTFGINQFNLIISDKIALNRSLPDVRKHQCRDKIYPAPSE 157
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP SVI+V+HNE FS+L+RTV S+I R+P + L+EIILVDDFSS++ L L++++
Sbjct: 158 LPTTSVIIVYHNEAFSTLLRTVVSVIDRSPKEVLKEIILVDDFSSRSFLKDDLDNFVVTL 217
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
++++IR R GLIR R GA E+ GEV+ FLD+HCE WL PLLA I +RK +
Sbjct: 218 GIRIKIIRAQRRVGLIRARLMGANEADGEVLTFLDSHCECTKGWLEPLLARIKENRKAVV 277
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
PVID I+ +T+ ++ E +RG F W + ++ +P K R + + P +SPT
Sbjct: 278 CPVIDVINDRTFAYQKGIE---LFRGGFNWNLQFRWYAVPPDIVKGRANDPTMPIQSPTM 334
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLF++D+ +F ELG YDPG+ +WGGEN E+SF+IW CGG IE +PCS +GH++R P
Sbjct: 335 AGGLFSIDKRYFEELGAYDPGMEIWGGENIEISFRIWQCGGRIEILPCSHVGHIFRKASP 394
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG-DISEQ 372
++F + G ++ N RV E W DE K FY P A+ + D+SE+
Sbjct: 395 HDF---PGKSSGKILNSNLLRVAEVWMDE-WKYLFYKTAPQALQMRSSIDVSER 444
>gi|224044641|ref|XP_002188932.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Taeniopygia guttata]
Length = 608
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 155/362 (42%), Positives = 213/362 (58%), Gaps = 17/362 (4%)
Query: 5 KADGKLGN-----LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHIS 59
+ D K+GN ++ P++ E E G ++ E + D ++ NM SN +
Sbjct: 71 QKDNKIGNSFGNHIQDPVKGEIEFSPEMGMIFN--EEDQEVRDLGYQKHAFNMLISNRLG 128
Query: 60 FDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
+ R +PD R +C+ YP DLP ASV++ F+NE S+L+RTVHS++ RTPA L EIIL
Sbjct: 129 YHREVPDTRDAKCREKSYPADLPSASVVICFYNEALSALLRTVHSVLDRTPAHLLHEIIL 188
Query: 120 VDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 178
VDD S ADL + L +Y++ + +L+RN +REGLIR R GA + G+V+VFLD+HC
Sbjct: 189 VDDNSELADLKKDLSEYVKTQLPRTTKLVRNEKREGLIRGRMIGASHATGKVLVFLDSHC 248
Query: 179 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE 238
EV WL PLLAPI D + + PVID I T Y RG F WG+ +K +
Sbjct: 249 EVNEMWLQPLLAPIREDPRTVVCPVIDIISADTL----TYSSSPVVRGGFNWGLHFKWDL 304
Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
+P E + + + P KSPT AGGLFAMDR +F ELG YD G+ +WGGEN E+SF+IWMC
Sbjct: 305 VPLAELEGPEGATAPIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISFRIWMC 364
Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
GG + +PCSR+GH++R PY D + +N R+ W DE + YF R
Sbjct: 365 GGRLLIIPCSRVGHIFRKRRPYGSPGGQD-----TMAHNSLRLAHVWMDEYKEQYFALRP 419
Query: 359 PL 360
L
Sbjct: 420 EL 421
>gi|360043880|emb|CCD81426.1| putative n-acetylgalactosaminyltransferase [Schistosoma mansoni]
Length = 526
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 156/354 (44%), Positives = 220/354 (62%), Gaps = 13/354 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
++GPGE G L + +A +L G N+ S I DR++ D+R CK Y
Sbjct: 2 RQGPGENGLPVRLSNSQKALSKKTLNFNGFNIFVSEKIKTDRSVKDIRYPNCKGALYSKQ 61
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP S+I+ + E + +L+RTV S++ R+P + ++E+ILVDD SS+ L ++L++Y+ R
Sbjct: 62 LPLVSIIIPVYEEHWETLIRTVVSVLNRSPLELIKEVILVDDGSSRRYLKERLDNYLSRT 121
Query: 141 --NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
G V +I EREGLIR R GAK + G+V++FLD+HCE +NWLPPLL PI + +
Sbjct: 122 YPGGLVWVIHLKEREGLIRARLSGAKLATGDVLIFLDSHCETNVNWLPPLLDPISKNYRT 181
Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPT 258
+T P ID ID T+E+R+ D RG F+W YK LP R + + P++SP
Sbjct: 182 VTCPFIDVIDADTFEYRA---QDDGARGAFDWSFYYKR--LP-RLSTDSLHPETPFESPV 235
Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
AGGLFA+ R +F ELGGYDP L +WGGE +ELSFKIWMCGG + VPCSR+GH++R +
Sbjct: 236 MAGGLFAISRKWFWELGGYDPLLHIWGGEQYELSFKIWMCGGRLIDVPCSRVGHIFREY- 294
Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
P NF + ++K + N+KRV E W DE +K Y Y P +D GD+S+Q
Sbjct: 295 PTNFPQ--PKIKN-FLRRNFKRVAEVWMDE-YKEYIYRSLPECRKVDPGDLSQQ 344
>gi|326923136|ref|XP_003207797.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Meleagris gallopavo]
Length = 556
Score = 287 bits (735), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 158/371 (42%), Positives = 230/371 (61%), Gaps = 13/371 (3%)
Query: 7 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
++PD+R++ CK YP +LP SV++VFHNE +S+L+RTVHS++ R+P + L EIILVDD
Sbjct: 96 SLPDVRLDGCKTKVYPEELPNTSVVIVFHNEAWSTLLRTVHSVLARSPRRLLAEIILVDD 155
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L LE+Y+++ V+++R +R GLIR R RGA +RG+V+ FLDAHCE
Sbjct: 156 ASEREFLKASLENYVKKLEVPVKILRMEQRSGLIRARLRGAAAARGQVVTFLDAHCECTR 215
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIREDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
E +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF++W CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGSYDAGMDIWGGENLEMSFRVWQCGGS 332
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+E V CS +GHV+R PY F G +I N +R+ E W DE K +FY P
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387
Query: 362 MFLDMGDISEQ 372
+ +D GD+S +
Sbjct: 388 VKVDYGDVSAR 398
>gi|312372346|gb|EFR20327.1| hypothetical protein AND_20267 [Anopheles darlingi]
Length = 616
Score = 287 bits (735), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 154/356 (43%), Positives = 215/356 (60%), Gaps = 13/356 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GKA L E D + G N S+ IS +R++PD+R C+ Y
Sbjct: 86 EAKRTGIGEQGKAGRLSEKEAEMKDKLFKKNGFNAVLSDLISLNRSLPDIRHRGCRKKKY 145
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
+LP SV++ F+NE +S+L+RT S++ R+P++ + E+ILVDD S+K L +LE Y+
Sbjct: 146 LSELPTVSVVVPFYNEHWSTLLRTASSVLLRSPSELIAEVILVDDCSTKDFLKGQLELYV 205
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
KV+++R ER GLI R GAK + +V++FLD+H E +NWLPPLL PI +D +
Sbjct: 206 GENMPKVKIVRLPERSGLIAARLAGAKVATADVLIFLDSHTEANVNWLPPLLDPIAADYR 265
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
P ID ID+ T+E+R+ D RG F+W YK L ++ +EP++SP
Sbjct: 266 TCVCPFIDVIDWDTFEYRA---QDEGARGAFDWKFFYKRLPLLPKDLAN---PTEPFESP 319
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+ FF E+GGYD GL +WGGE +ELSFKIW CGG + PCSR+GH+YR +
Sbjct: 320 VMAGGLFAISAKFFWEIGGYDEGLDIWGGEQYELSFKIWQCGGKMYDAPCSRVGHIYRGY 379
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMGDISEQ 372
P+ + D +T NYKRV E W DE +K Y Y R+ D+GDIS+Q
Sbjct: 380 APFGNPRKKD-----FLTRNYKRVAEVWMDE-YKEYLYMRDRKKYDNTDVGDISKQ 429
>gi|311275138|ref|XP_003134591.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Sus
scrofa]
Length = 608
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 155/332 (46%), Positives = 207/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R CK Y DLP ASVI+ F+NE S+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYRTDLPVASVIICFYNEALSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS++ RTPA+ L EIILVDD S DL +L++YIQ++ GK+++IRNT+REGLIR R
Sbjct: 171 VHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLTGKIKVIRNTKREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I DR + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSA 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ ++ + +P E + + + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFRWDLVPLSELEGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLRT-RSYGNISER 432
>gi|261260064|sp|A8Y236.2|GLT10_CAEBR RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase 10; Short=pp-GaNTase
10; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 10; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
Length = 629
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 156/355 (43%), Positives = 214/355 (60%), Gaps = 19/355 (5%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKYWDY 77
+EGPGE GK LP+ +A L Y G N S+ IS +R+I D+R +ECK Y
Sbjct: 101 REGPGEWGKPVKLPDDKETEKEA-LSLYKANGYNAYISDMISLNRSIKDIRHKECKKMTY 159
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP SVI FH E S+L+R+V+S+I R+P + L+EIILVDDFS K L Q LED++
Sbjct: 160 SAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLEDFL 219
Query: 138 QR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
++ + V+++R +REGLIR R GA+E+ GE+++FLDAH E NWLPPLL PI D
Sbjct: 220 KKNKIDHIVKILRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLPPLLDPIADD 279
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYK 255
+ + P +D ID +T+E R D RG F+W YK L +++ R+ ++P+
Sbjct: 280 YRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTKKD---RENPTKPFD 333
Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
SP AGG FA+ +F ELGGYD GL +WGGE +ELSFK+W C G + PCSR+ H+YR
Sbjct: 334 SPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGKMVDAPCSRVAHIYR 393
Query: 316 S-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+ P+ + D ++ NYKRV E W DE +K Y P D GD+
Sbjct: 394 CKYAPFKNAGMGD-----FVSRNYKRVAEVWMDE-YKETLYKHRPGIGNADAGDL 442
>gi|170043866|ref|XP_001849590.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
gi|167867153|gb|EDS30536.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
Length = 600
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 148/351 (42%), Positives = 213/351 (60%), Gaps = 11/351 (3%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
PGE GK +P + + E N+ S+ I +R++ D+R +CK Y LP
Sbjct: 87 PGELGKPVKIPSSQQELMKEKFKENQFNLLASDMIWLNRSLTDVRHHDCKKKHYSAKLPT 146
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
S+++VFHNE +S+L+RT+ S+I R+P L+EIILVDD S + L ++LEDY+
Sbjct: 147 TSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASERDHLGKQLEDYVSTLPVS 206
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
++R +R GLIR R GAK +G+VI FLDAHCE WL PLLA I DRK + P+
Sbjct: 207 TFVLRTGKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIVLDRKTVVCPI 266
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
ID I +T+E+ V D + G F W + ++ +P RE ++R ++ + P ++PT AGG
Sbjct: 267 IDVISDETFEY--VTASDQTWGG-FNWKLNFRWYRVPSREMQRRNHDRTAPLRTPTMAGG 323
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG +E PCS +GHV+R PY F
Sbjct: 324 LFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIAPCSHVGHVFRDKSPYTF 383
Query: 323 -GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G +A+ ++ N RV E W DE K ++Y P A GD+SE+
Sbjct: 384 PGGVAN-----IVLKNAARVAEVWLDE-WKEFYYQMSPGARKASAGDVSER 428
>gi|189217666|ref|NP_001121278.1| uncharacterized protein LOC100158361 [Xenopus laevis]
gi|115528277|gb|AAI24896.1| LOC100158361 protein [Xenopus laevis]
Length = 600
Score = 287 bits (734), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 151/332 (45%), Positives = 201/332 (60%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ N+ SN + + R +PD R +C YP DLP AS+++ F+NE S+L+RT
Sbjct: 103 DVGYQKHAFNLLISNRLGYHRDLPDTRDSKCSKKTYPADLPLASIVICFYNEASSALLRT 162
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRS 160
VHS++ RTPAQ L EIILVDD S DL + L+ Y+Q + KV+L+RN REGLIR R
Sbjct: 163 VHSVLDRTPAQLLHEIILVDDNSELDDLKKDLDYYMQENLSKKVKLVRNKRREGLIRGRM 222
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + G+V+VFLD+HCEV WL PLLAPI + K + PVID I T +Y
Sbjct: 223 VGASHATGDVLVFLDSHCEVNEMWLQPLLAPIRENPKTVVCPVIDIISADTL----IYSQ 278
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + P++SPT AGGLFAMDR +F LG YD G
Sbjct: 279 SPVVRGGFNWGLHFKWDPVPLSELGGPEGFTAPFRSPTMAGGLFAMDREYFNTLGQYDSG 338
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGGS+ VPCSR+GH++R PY D + +N R
Sbjct: 339 MDIWGGENLEISFRIWMCGGSLLIVPCSRVGHIFRKRRPYGSPGGHD-----TMAHNSLR 393
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE YF R P D GDI ++
Sbjct: 394 LAHVWMDEYKDQYFALR-PELRNRDFGDIRDR 424
>gi|116007284|ref|NP_001036338.1| polypeptide GalNAc transferase 5, isoform B [Drosophila
melanogaster]
gi|113194958|gb|ABI31292.1| polypeptide GalNAc transferase 5, isoform B [Drosophila
melanogaster]
Length = 630
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 145/362 (40%), Positives = 220/362 (60%), Gaps = 11/362 (3%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
L P ++ K PGE GK +P + E N+ S+ IS +R++ D+R E C
Sbjct: 118 LAPSVQEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 177
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
+ Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++
Sbjct: 178 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE+Y+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 238 LEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 297
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R + +
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDRT 354
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF++WMCGG +E PCSR+G
Sbjct: 355 APLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRVWMCGGVLEIAPCSRVG 414
Query: 312 HVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
HV+R PY F G + ++ +N R++E W D+ K ++Y+ P A GD+S
Sbjct: 415 HVFRKSTPYTFPGGTTE-----IVNHNNARLVEVWLDD-WKEFYYSFYPGARKASAGDVS 468
Query: 371 EQ 372
++
Sbjct: 469 DR 470
>gi|195454523|ref|XP_002074278.1| GK18434 [Drosophila willistoni]
gi|194170363|gb|EDW85264.1| GK18434 [Drosophila willistoni]
Length = 646
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 159/360 (44%), Positives = 218/360 (60%), Gaps = 24/360 (6%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE G+ H+ + D G N S+ IS +R++PD+R EECK Y
Sbjct: 121 RTGMGEHGEPSHIDAQEKELEDKIYRMNGFNGLLSDRISINRSVPDVRREECKSRKYLAK 180
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-R 139
LP+ASVI +F+NE F++L+R+++S+I RTP + L++I+LVDD S L Q+L+DY+
Sbjct: 181 LPQASVIFIFYNEHFNTLLRSIYSVINRTPPELLKQIVLVDDGSDWEVLKQQLDDYVSLH 240
Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
F V ++RN ER GLI R GAK + GEV+VF D+H EV NWLPPLL PI D KI
Sbjct: 241 FPQLVHVVRNPERRGLIGARIAGAKVATGEVLVFFDSHIEVNYNWLPPLLEPIAIDSKIS 300
Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKKRKYNSEPYKSPT 258
T P++D I++ T+ + ++ RG F+W YK+ LPE K S PY++P
Sbjct: 301 TCPIVDSIEHSTFAYSGGHQ--EGSRGGFDWRFYYKQLPVLPEDSLDK----SLPYRNPV 354
Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
GGLFA++ FF +LGGYD L +WGGE +ELSFKIWMCGG + VPCSR+ H++R M
Sbjct: 355 MMGGLFAINTKFFWDLGGYDDELDIWGGEQYELSFKIWMCGGMLLDVPCSRVAHIFRGPM 414
Query: 319 -----PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMGDISEQ 372
P N+ + N+KRV E W D K+K Y YTR+P +D GD+S Q
Sbjct: 415 DARPNPRNYN---------FVARNHKRVAEVWMD-KYKEYVYTRDPETYEKIDAGDLSRQ 464
>gi|196001819|ref|XP_002110777.1| hypothetical protein TRIADDRAFT_22201 [Trichoplax adhaerens]
gi|190586728|gb|EDV26781.1| hypothetical protein TRIADDRAFT_22201 [Trichoplax adhaerens]
Length = 518
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 148/350 (42%), Positives = 209/350 (59%), Gaps = 10/350 (2%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL-DLP 82
PGE G+ +P Y+ N S+ IS R++PD R+ EC YP+ LP
Sbjct: 14 PGENGRGVIVPPEYQEESRKLFQRNRFNQWASDRISLHRSLPDARILECSSLKYPIHKLP 73
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
+ SVI+VFHNE +S+L+RTVHS++ R+P + L EIILVDD S +L LE Y+ + +
Sbjct: 74 QTSVIIVFHNEAWSTLLRTVHSVLDRSPPELLREIILVDDSSDHEELHSTLEKYVAKLS- 132
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
KV+++RN REGLIR+R G + + FLDAHCE + WL PLL I +R I+ P
Sbjct: 133 KVKIVRNKAREGLIRSRLNGFAHATSPTVTFLDAHCEANVGWLEPLLYRIMQNRTIVVCP 192
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
ID I +T+E+ Y + RG F W + ++ +PE E K+R ++ +SPT AGG
Sbjct: 193 EIDVISDETFEY--TYSSGN-VRGSFNWNLNFRWKAVPEYENKRRAARTDGIRSPTMAGG 249
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LF + +F ++G YD + +WGGEN ELSF+IW CGG +E +PCS +GHV+R PY+F
Sbjct: 250 LFTIHSQYFKDIGLYDKQMEIWGGENLELSFRIWQCGGQLEIIPCSHVGHVFRKSQPYSF 309
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
K G ++ N +RV E W D +K YFY R+P GDIS++
Sbjct: 310 PKGT----GETLSKNLQRVAEVWMD-GYKRYFYKRQPHLKGHPFGDISKR 354
>gi|21707970|gb|AAH34184.1| Galnt11 protein [Mus musculus]
Length = 411
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 156/352 (44%), Positives = 212/352 (60%), Gaps = 16/352 (4%)
Query: 2 PVFKAD--GKLGN--LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
P FKA+ +L N +E P + + E G ++ E + D ++ NM SN
Sbjct: 69 PQFKANRIDRLMNNHIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLISNR 126
Query: 58 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
+ + R +PD R EC+ YP DLP AS+++ F+NE FS+L+RTVHS++ RTPA L EI
Sbjct: 127 LGYHRDVPDTRNAECRRKSYPTDLPTASIVICFYNEAFSALLRTVHSVVDRTPAHLLHEI 186
Query: 118 ILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 176
ILVDD S DL +L++YIQR+ KV++IRN +REGLIR R GA + GEV+VFLD+
Sbjct: 187 ILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDS 246
Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 236
HCEV + WL PLLA I D + PVID I T Y RG F WG+ +K
Sbjct: 247 HCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKW 302
Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
+ +P E + P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IW
Sbjct: 303 DLVPVSELGGPDGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIW 362
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
MCGG + +PCSR+GH++R PY + D +T+N R+ W DE
Sbjct: 363 MCGGKLFILPCSRVGHIFRKRRPYGSPEGQDT-----MTHNSLRLAHVWLDE 409
>gi|393908333|gb|EFO20718.2| glycosyl transferase [Loa loa]
Length = 622
Score = 286 bits (733), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 167/373 (44%), Positives = 224/373 (60%), Gaps = 23/373 (6%)
Query: 10 LGNLEPPLEPYKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGMNMETSNHISF 60
L N + P+ YK G PGEGGKA L + + D + N S+ IS
Sbjct: 97 LFNRDSPI--YKSGDEHQPGEGGKAVIIDRNKLAFSEKRIYDDGFNKNAFNQYVSDMISI 154
Query: 61 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
R++P EECK Y DLP SVI+ FHNE +S L+RTVHS+++RTP L EIILV
Sbjct: 155 HRSLPSYIDEECKTEKYANDLPNTSVIICFHNEAWSVLLRTVHSVLERTPENLLAEIILV 214
Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
DDFS A L LE Y+++F KVR++R +REGLIR R +GA S+G VI +LD+HCE
Sbjct: 215 DDFSDMAHLKASLEIYMRQF-PKVRILRLEKREGLIRARIKGAAISKGSVITYLDSHCEC 273
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-GIFEWGMLYKENEL 239
W+ PLL I + K + PVID ID T+E+ Y + G F+W + + + +
Sbjct: 274 LEGWMEPLLDRIKKNPKTVVCPVIDVIDDNTFEYH--YSKAYFTNVGGFDWSLQFNWHAI 331
Query: 240 PEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCG 299
PE++ K R+ + +P KSPT AGGLF++DR FF +LG YDPGL +WGGEN ELSFK WMCG
Sbjct: 332 PEKDRKGRR-DIDPVKSPTMAGGLFSIDRTFFEKLGSYDPGLDIWGGENLELSFKTWMCG 390
Query: 300 GSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
G +E VPCS +GH++R PY + + +K N R+ E W DE +K Y+Y R
Sbjct: 391 GILEIVPCSHVGHIFRKRSPYKWLSGVNVLK-----RNSVRLAEVWMDE-YKKYYYERIN 444
Query: 360 LAMFLDMGDISEQ 372
+ D GD+S +
Sbjct: 445 NNLG-DFGDVSSR 456
>gi|195386582|ref|XP_002051983.1| GJ24116 [Drosophila virilis]
gi|194148440|gb|EDW64138.1| GJ24116 [Drosophila virilis]
Length = 632
Score = 286 bits (733), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 144/360 (40%), Positives = 218/360 (60%), Gaps = 11/360 (3%)
Query: 15 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
P + + PGE GK +P + E N+ S+ IS +R++ D+R E C++
Sbjct: 122 PTVRESRGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHENCRH 181
Query: 75 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
YP LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++LE
Sbjct: 182 KHYPSKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASERDFLGKQLE 241
Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
DY+ + + ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 242 DYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVTGEVITFLDAHCECTEGWLEPLLARIVQ 301
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEP 253
+R+ + P+ID I +T+E+ + D + G F W + ++ +P+RE +R + + P
Sbjct: 302 NRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPQREMARRNNDRTAP 358
Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +GHV
Sbjct: 359 LRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHV 418
Query: 314 YRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+R PY F G +A ++ +N RV E W DE + ++Y A GD+S++
Sbjct: 419 FRDKSPYTFPGGVA-----KIVLHNAARVAEVWLDE-WRDFYYAMSTGARKASAGDVSDR 472
>gi|348539520|ref|XP_003457237.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Oreochromis niloticus]
Length = 619
Score = 286 bits (733), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 150/339 (44%), Positives = 200/339 (58%), Gaps = 11/339 (3%)
Query: 35 EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEG 94
EA + D+ + N+ SN + F R +P+ R +C+ YP+ LP ASV++ F NE
Sbjct: 80 EADQEVRDSGYHRHAFNVLISNRLGFHRQLPETRDAQCREKSYPVALPSASVVICFFNEA 139
Query: 95 FSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTERE 153
S+L+RTVHS++ RTPA L EIILVDD S +L +L+ Y++ GKV+L+RN RE
Sbjct: 140 LSALLRTVHSVLDRTPAYLLHEIILVDDHSELEELKDELDRYVRAELQGKVQLVRNQRRE 199
Query: 154 GLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWE 213
GLIR R GA + GEV+VFLD+HCEV WL PLLAPI D + + PVID I T
Sbjct: 200 GLIRGRMIGASHATGEVLVFLDSHCEVNQAWLQPLLAPIQKDHRTVVCPVIDIISADTL- 258
Query: 214 FRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLE 273
Y P RG F WG+ +K + +P E + S P +SPT AGGLFAM+R +F E
Sbjct: 259 ---AYSPSPIVRGGFNWGLHFKWDPVPPSELSGPEGASGPIRSPTMAGGLFAMNRKYFNE 315
Query: 274 LGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPL 333
LG YD G+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY D
Sbjct: 316 LGQYDAGMDIWGGENLEISFRIWMCGGQLFIIPCSRVGHIFRKRRPYGSPGGHD-----T 370
Query: 334 ITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ +N R+ W D + Y R P GDI E+
Sbjct: 371 MAHNSLRLAHVWMDGYKEQYLSLR-PELRNRSYGDIGER 408
>gi|158293352|ref|XP_314708.4| AGAP008613-PA [Anopheles gambiae str. PEST]
gi|157016664|gb|EAA10180.4| AGAP008613-PA [Anopheles gambiae str. PEST]
Length = 596
Score = 286 bits (733), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 148/351 (42%), Positives = 214/351 (60%), Gaps = 11/351 (3%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
PGE GK +P + E N+ S+ I +R++ D+R +CK YP LP
Sbjct: 85 PGEMGKPVKIPANQQELMKEKFKENQFNLLASDMIWLNRSLTDVRHHDCKKKHYPAKLPT 144
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
S+++VFHNE +S+L+RT+ S+I R+P L+EIILVDD S + L ++LE+Y++
Sbjct: 145 TSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASEREHLGRQLEEYVRTLPVP 204
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
++R +R GLIR R GAK +G+VI FLDAHCE WL PLLA I DRK + P+
Sbjct: 205 TFVLRTGKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIVLDRKTVVCPI 264
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
ID I +T+E+ V D + G F W + ++ +P RE ++R ++ + P ++PT AGG
Sbjct: 265 IDVISDETFEY--VTASDQTWGG-FNWKLNFRWYRVPAREMQRRNHDRTAPLRTPTMAGG 321
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG +E PCS +GHV+R PY F
Sbjct: 322 LFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEISPCSHVGHVFRDKSPYTF 381
Query: 323 -GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G +A+ ++ N RV E W DE K ++Y P A GD+SE+
Sbjct: 382 PGGVAN-----IVLKNAARVAEVWLDE-WKEFYYQMSPGARKASAGDVSER 426
>gi|296488074|tpg|DAA30187.1| TPA: polypeptide N-acetylgalactosaminyltransferase 11-like [Bos
taurus]
Length = 605
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 155/332 (46%), Positives = 209/332 (62%), Gaps = 12/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R CK YP DLP AS+++ F+NE S+L+RT
Sbjct: 109 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYPADLPVASIVICFYNEALSALLRT 168
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS++ RTPA+ L EIILVDD S DL +L++YIQ++ GK+++IRN +REGLIR R
Sbjct: 169 VHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLPGKIKVIRNPKREGLIRGRM 228
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I DR+ + PVID I T Y
Sbjct: 229 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRRTVVCPVIDIISADTL----AYSS 284
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 285 SPVVRGGFNWGLHFKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSG 344
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 345 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 399
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE +K YF R L + G+ISE+
Sbjct: 400 LAHVWLDE-YKQYFSLRPDLRT-RNYGNISER 429
>gi|443704818|gb|ELU01679.1| hypothetical protein CAPTEDRAFT_140956 [Capitella teleta]
Length = 550
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 152/368 (41%), Positives = 223/368 (60%), Gaps = 14/368 (3%)
Query: 10 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGM-----NMETSNHISFDRTI 64
LG +E P E + PGE G A+ + E ++ + ++G N S+ IS RT+
Sbjct: 23 LGKVESP-EHNADDPGEMGVAFQVDEKKLSSAEKEEYDFGFKRNAFNQYASDRISVHRTL 81
Query: 65 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
PD R EC+ + +PKASVI++FHNE +S L+RTV+SI++R+P ++LEE+ILVDD+S
Sbjct: 82 PDYRDVECRAILHSSKMPKASVIVIFHNEAWSVLLRTVYSILERSPPRFLEEVILVDDYS 141
Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
+ L +L++++ KVRL+R+ +REGLIR R GA+ ++G+V+VFLD+HCE W
Sbjct: 142 DQEHLHDQLDEFVAT-QQKVRLVRSEKREGLIRARLIGAEAAKGQVLVFLDSHCECTPGW 200
Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREA 244
L P+L I D + P+ID ID +T + G F+W M + + LP E
Sbjct: 201 LEPMLDRIGQDWSHVVTPIIDVIDDKTLMYNFNPLSRGFSVGGFDWAMGFTWHALPNHEK 260
Query: 245 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
++RK S+P +SPT AGGLFA+DR +F +G YDPG+ +WGGEN E+SF+IWMCGG++E
Sbjct: 261 ERRKKISDPARSPTMAGGLFAIDREYFYHIGSYDPGMEIWGGENLEMSFRIWMCGGTLET 320
Query: 305 VPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
+PCS +GH++R P + K G + N R E W DE Y Y
Sbjct: 321 LPCSHVGHIFRKRNPNHSAK-----HGNFVQRNSVRTAEVWMDE--YKYLYYDRIGNHIG 373
Query: 365 DMGDISEQ 372
D GD+S++
Sbjct: 374 DFGDVSDR 381
>gi|195429102|ref|XP_002062603.1| GK16570 [Drosophila willistoni]
gi|194158688|gb|EDW73589.1| GK16570 [Drosophila willistoni]
Length = 679
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 160/352 (45%), Positives = 212/352 (60%), Gaps = 13/352 (3%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLG-EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
G GE GK L + + + + E G N S+ IS +R+I D+R + C+ +Y L
Sbjct: 151 GIGEQGKPAKLDDENQRELERKMSLENGFNALLSDSISVNRSIADIRHKSCRKKEYLAKL 210
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI++F+NE S LMR+VHS+I R+P + L+EIILVDDFS +A L LE+YI
Sbjct: 211 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELLKEIILVDDFSDRAYLYVPLENYIAEHF 270
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
VR++R T+R GLI RS GA+ + GEV++FLD+H E NWLPPLL PI + +
Sbjct: 271 KNVRVVRLTKRTGLIGARSEGARNATGEVLIFLDSHVEANYNWLPPLLEPIAINERTAVC 330
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTHA 260
P ID ID+ + +R+ D RG F+W YK LPE K+ SEP+KSP A
Sbjct: 331 PFIDVIDHSNFNYRA---QDEGARGAFDWEFFYKRLPLLPE----DLKHPSEPFKSPVMA 383
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+ FF ELGGYD GL +WGGE +ELSFKIWMCGG + PCSR+GH+YR P
Sbjct: 384 GGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRVGHIYRG--PR 441
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
N + G + NYKRV E W DE K + + + +D GD++ Q
Sbjct: 442 NH--VPSPRTGDYLHKNYKRVAEVWMDEYKKYLYDHGDGIYDRVDAGDLTAQ 491
>gi|341897758|gb|EGT53693.1| CBN-GLY-10 protein [Caenorhabditis brenneri]
Length = 620
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 157/358 (43%), Positives = 212/358 (59%), Gaps = 19/358 (5%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKY 74
E +EGPGE GK LP+ +A L Y G N S+ IS +R+I D+R +CK
Sbjct: 89 EKAREGPGEWGKPVKLPDDKETEKEA-LSLYKANGYNAYISDMISLNRSIKDIRHRDCKK 147
Query: 75 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
Y LP SVI FH E S+L+R+V+S+I R+P + L+EIILVDDFS K L Q LE
Sbjct: 148 MTYSAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLE 207
Query: 135 DYIQR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
D++++ + V+++R +REGLIR R GA+E+ GE+++FLDAH E NWLPPLL PI
Sbjct: 208 DFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLPPLLDPI 267
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 252
D + + P +D ID +T+E R D RG F+W YK L + K R+ +
Sbjct: 268 AEDYRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTK---KDRENPTT 321
Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
P+ SP AGG FA+ +F ELGGYD GL +WGGE +ELSFK+W C G + PCSR+ H
Sbjct: 322 PFNSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGRMVDAPCSRVAH 381
Query: 313 VYRS-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+YR + P+ + D ++ NYKRV E W DE +K Y P D GD+
Sbjct: 382 IYRCKYAPFKNAGMGD-----FVSRNYKRVAEVWMDE-YKETLYKHRPGVGNADAGDL 433
>gi|291235412|ref|XP_002737638.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 497
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 147/371 (39%), Positives = 224/371 (60%), Gaps = 17/371 (4%)
Query: 9 KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
K+ ++ P+ ++GPGE GK + ++ D N+ S+ I+ +R++PD+R
Sbjct: 7 KIQDMPKPVN--RDGPGEQGKPVIIEPEFKKERDEKWKINEFNLMASDKIALNRSLPDVR 64
Query: 69 MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
C YP LP SVI+VFHNE +S+L+RT HSII R+P + L E+ILVDD S++
Sbjct: 65 PRGCNDKKYPGKLPTTSVIVVFHNEAWSTLLRTTHSIINRSPRELLMEVILVDDCSTQEH 124
Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
L + L+DY+ + V + R R GLIR+R RG ++G+V+ +LD+HCE WL PL
Sbjct: 125 LKKPLDDYVAKLPVPVHVERMEVRSGLIRSRLRGGSVAKGDVLTYLDSHCECTEGWLEPL 184
Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR- 247
++ I DRK P+ID ID +++ + E + G F W + ++ +PE E +R
Sbjct: 185 VSRIGDDRKTRVQPIIDIIDDRSFAYIGASESNS---GGFTWQLQHQWVRIPEYEQNRRV 241
Query: 248 ------KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
+ + +++PT AGGLF++++ +F ++G YD G+ VWGGEN E+SF+IWMCGG
Sbjct: 242 SEYDNIRQVTLFHRTPTMAGGLFSINKTYFEKMGAYDTGMDVWGGENIEMSFRIWMCGGK 301
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
IE +PCSRIGHVYR ++PY+F +D P I N RV E W D +K +FY +
Sbjct: 302 IEIIPCSRIGHVYRRYIPYSFPNGSD----PTIYRNAMRVAEVWMDH-YKKFFYATQTKL 356
Query: 362 MFLDMGDISEQ 372
+D GD+S++
Sbjct: 357 HMVDYGDVSDR 367
>gi|390341984|ref|XP_003725567.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Strongylocentrotus purpuratus]
Length = 654
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 153/372 (41%), Positives = 220/372 (59%), Gaps = 12/372 (3%)
Query: 3 VFKADGKLGNLEPPLEPYKEGP-GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
V+K GK + L+ +G GE + R+ D ++ N S I F
Sbjct: 109 VYKKQGKPMRAKQRLKADAQGDWGEDELGMVRTDEERSIRDGGYRQHAFNELISQRIGFH 168
Query: 62 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
R + D R CKY Y +LP S+++ F+NE +S+L+RTV+S++ RTP + + E+ILVD
Sbjct: 169 RNVTDTRNPLCKYQVYSEELPTVSIVICFYNEAWSTLLRTVYSVLDRTPRRLIHELILVD 228
Query: 122 DFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
DFS L ++L+ Y+ + FNG V +I N +REGLIR R+ GA+ + G+V++FLD+HCEV
Sbjct: 229 DFSELTHLKKELDQYMSKNFNGLVHVIHNGQREGLIRARTIGARYATGDVLMFLDSHCEV 288
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
WL PLL I +D + P+ID I++ T+ Y +G F WGM +K + +
Sbjct: 289 NEQWLEPLLERIKADSHTVVCPIIDIINHDTF----AYTASPLVKGGFNWGMHFKWDTIR 344
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
R+ ++ +P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IW CGG
Sbjct: 345 SRQLVGKEDYVKPIESPTMAGGLFAMNREYFHKLGDYDEGMDIWGGENLEISFRIWQCGG 404
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
+E VPCSR+GHV+R PY D T N RV E W DE +K +FY +P
Sbjct: 405 KLEIVPCSRVGHVFRKRRPYGSPNRQDTT-----TKNAVRVAEVWMDE-YKEHFYQVQPK 458
Query: 361 AMFLDMGDISEQ 372
A +D GDIS +
Sbjct: 459 AKNIDYGDISSR 470
>gi|115533032|ref|NP_001041036.1| Protein GLY-10, isoform a [Caenorhabditis elegans]
gi|182676440|sp|O45947.3|GLT10_CAEEL RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase 10; Short=pp-GaNTase
10; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 10; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|3880991|emb|CAA16378.1| Protein GLY-10, isoform a [Caenorhabditis elegans]
Length = 684
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 156/358 (43%), Positives = 215/358 (60%), Gaps = 19/358 (5%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKY 74
E +EGPGE GK LPE +A L Y G N S+ IS +R+I D+R +ECK
Sbjct: 153 EKRREGPGEWGKPVKLPEDKEVEKEA-LSLYKANGYNAYISDMISLNRSIKDIRHKECKN 211
Query: 75 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
Y LP SVI FH E S+L+R+V+S+I R+P + L+EIILVDDFS K L Q LE
Sbjct: 212 MMYSAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLE 271
Query: 135 DYIQR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
D++++ + V+++R +REGLIR R GA+++ GE+++FLDAH E NWLPPLL PI
Sbjct: 272 DFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQDATGEILIFLDAHSEANYNWLPPLLDPI 331
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 252
D + + P +D ID +T+E R D RG F+W YK L +++ R+ ++
Sbjct: 332 AEDYRTVVCPFVDVIDCETYEVRP---QDEGARGSFDWAFNYKRLPLTKKD---RESPTK 385
Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
P+ SP AGG FA+ +F ELGGYD GL +WGGE +ELSFK+W C G + PCSR+ H
Sbjct: 386 PFNSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGRMVDAPCSRVAH 445
Query: 313 VYRS-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+YR + P+ + D ++ NYKRV E W D+ +K Y P D GD+
Sbjct: 446 IYRCKYAPFKNAGMGD-----FVSRNYKRVAEVWMDD-YKETLYKHRPGVGNADAGDL 497
>gi|115533034|ref|NP_001041037.1| Protein GLY-10, isoform b [Caenorhabditis elegans]
gi|87251651|emb|CAJ76949.1| Protein GLY-10, isoform b [Caenorhabditis elegans]
Length = 622
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 156/358 (43%), Positives = 215/358 (60%), Gaps = 19/358 (5%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKY 74
E +EGPGE GK LPE +A L Y G N S+ IS +R+I D+R +ECK
Sbjct: 91 EKRREGPGEWGKPVKLPEDKEVEKEA-LSLYKANGYNAYISDMISLNRSIKDIRHKECKN 149
Query: 75 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
Y LP SVI FH E S+L+R+V+S+I R+P + L+EIILVDDFS K L Q LE
Sbjct: 150 MMYSAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLE 209
Query: 135 DYIQR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
D++++ + V+++R +REGLIR R GA+++ GE+++FLDAH E NWLPPLL PI
Sbjct: 210 DFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQDATGEILIFLDAHSEANYNWLPPLLDPI 269
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 252
D + + P +D ID +T+E R D RG F+W YK L +++ R+ ++
Sbjct: 270 AEDYRTVVCPFVDVIDCETYEVRP---QDEGARGSFDWAFNYKRLPLTKKD---RESPTK 323
Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
P+ SP AGG FA+ +F ELGGYD GL +WGGE +ELSFK+W C G + PCSR+ H
Sbjct: 324 PFNSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGRMVDAPCSRVAH 383
Query: 313 VYRS-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+YR + P+ + D ++ NYKRV E W D+ +K Y P D GD+
Sbjct: 384 IYRCKYAPFKNAGMGD-----FVSRNYKRVAEVWMDD-YKETLYKHRPGVGNADAGDL 435
>gi|195114266|ref|XP_002001688.1| GI16986 [Drosophila mojavensis]
gi|193912263|gb|EDW11130.1| GI16986 [Drosophila mojavensis]
Length = 633
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 144/360 (40%), Positives = 217/360 (60%), Gaps = 11/360 (3%)
Query: 15 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
P + + PGE GK +P + E N+ S+ IS +R++ D+R E C++
Sbjct: 123 PTVRESRGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHENCRH 182
Query: 75 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
YP LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++LE
Sbjct: 183 KHYPSKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASERDFLGKQLE 242
Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
DY+ + + ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 243 DYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVTGEVITFLDAHCECTEGWLEPLLARIVQ 302
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEP 253
+R+ + P+ID I T+E+ + D + G F W + ++ +P+RE +R + + P
Sbjct: 303 NRRTVVCPIIDVISDDTFEY--ITASDSTWGG-FNWKLNFRWYRVPQREMARRNNDRTAP 359
Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +GHV
Sbjct: 360 LRTPTMAGGLFSIDKEYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHV 419
Query: 314 YRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+R PY F G +A ++ +N RV E W DE + ++Y A GD+S++
Sbjct: 420 FRDKSPYTFPGGVA-----KIVLHNAARVAEVWLDE-WRDFYYAMSTGARKASAGDVSDR 473
>gi|390336582|ref|XP_001187912.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 490
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 147/323 (45%), Positives = 202/323 (62%), Gaps = 10/323 (3%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N+ S+ I+ +R++PD+R C YP LP SVILV+HNE S+L+R VHSII R+P
Sbjct: 25 NLMASDRIALNRSLPDVRPRGCANKVYPKKLPTTSVILVYHNEARSTLLRNVHSIINRSP 84
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
L EIILVDD S + L + LEDYI + V +++ R GLIR R GA ++G+V
Sbjct: 85 HDLLAEIILVDDASDQEHLGKSLEDYIAKLPVSVYVVKMKGRSGLIRARMAGAAVAKGQV 144
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCEV WL P+LA I DR PVID I T++++ +P G F W
Sbjct: 145 LTFLDSHCEVTEGWLEPMLARIAEDRTTSVCPVIDVISDDTFQYQHGNDPQ---MGGFGW 201
Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+ +K +P+RE +RK + +EP + T AGGLFA+D+++F ELG YDPG +WGGEN
Sbjct: 202 SLFFKWFPVPKREQIRRKGDPTEPVRVSTMAGGLFAIDKSYFEELGQYDPGFNIWGGENL 261
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
ELSFK+WMCGG +E++PCS +GHV+R PY+F + V N KR+ E W DE
Sbjct: 262 ELSFKLWMCGGKLEFIPCSHVGHVFRKKSPYHFPPGTNYV-----NKNNKRLAEVWLDE- 315
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P D GDIS++
Sbjct: 316 YKNFYYRISPSVAKTDPGDISDR 338
>gi|351695439|gb|EHA98357.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Heterocephalus
glaber]
Length = 608
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 154/332 (46%), Positives = 204/332 (61%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAVCKEKSYPTDLPVASVVICFYNEAFSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS++ RTPA L EIILVDD S DL +L++YIQ++ K++LIRN REGLIR R
Sbjct: 171 VHSVLDRTPAYLLHEIILVDDDSDFDDLKGELDEYIQKYLPAKIKLIRNPRREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA ++ D + PVID I T + S
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAVVHGDPHTVVCPVIDIISADTLAYSS---- 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELGGADSATAPIKSPTMAGGLFAMNRQYFNELGQYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDT-----MTHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 432
>gi|387017710|gb|AFJ50973.1| Polypeptide N-acetylgalactosaminyltransferase 11-like [Crotalus
adamanteus]
Length = 608
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 147/332 (44%), Positives = 201/332 (60%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ N+ SN + + R +PD R +CK YP DLP AS+I+ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNVLISNRLGYHRDVPDTRDRKCKEKIYPHDLPSASIIICFYNEAFSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRS 160
+HS++ RTP+ L EIILVDD S ADL + L+ Y+ + KV+L+RN REGLIR R
Sbjct: 171 IHSVLDRTPSHLLHEIILVDDRSELADLKEDLDIYLTKDLPNKVKLVRNENREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + G+V+VFLD+HCEV WL PLL PI R+ + PVID I T Y
Sbjct: 231 VGASHATGKVLVFLDSHCEVNEMWLQPLLTPIQESRRTVVCPVIDIISADTL----TYSS 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + + P KSPT AGGLFAMDR +F LG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLLEMEGPEQATAPIKSPTMAGGLFAMDREYFNALGQYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY D + +N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLVIIPCSRVGHIFRKRRPYGSPGGQD-----TMAHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R P + G+I+++
Sbjct: 402 LAHVWMDEYKEQYFALR-PELRTRNYGNITDR 432
>gi|344249957|gb|EGW06061.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Cricetulus
griseus]
Length = 494
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 147/306 (48%), Positives = 192/306 (62%), Gaps = 6/306 (1%)
Query: 67 LRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
L+ C Y LP S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +
Sbjct: 15 LQNLNCNSKLYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDR 74
Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
L + LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLP
Sbjct: 75 EHLKKPLEDYMALF-PSVRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLP 133
Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
PLL I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 134 PLLDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKA 191
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +P
Sbjct: 192 DP--SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIP 249
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDM 366
CSR+GH+YR +PY L N KRV E W DE + Y Y R P L
Sbjct: 250 CSRVGHIYRKSVPYKVPAGPADPCNCLSLQNLKRVAEVWMDE-YAEYIYQRRPEYRHLSA 308
Query: 367 GDISEQ 372
GD+ Q
Sbjct: 309 GDVVAQ 314
>gi|312075557|ref|XP_003140470.1| Gly-3 protein [Loa loa]
gi|307764367|gb|EFO23601.1| Gly-3 protein [Loa loa]
Length = 584
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 151/357 (42%), Positives = 222/357 (62%), Gaps = 14/357 (3%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ GPGE G A + + + E ++ S+ IS +R +PD R +C+ D
Sbjct: 82 RNGPGEMGSAVIIDPSQQEERKKKFNENQFDVMASDLISINRALPDYRSSKCREAARKYD 141
Query: 81 ---LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
LP S+I+VFHNE +S+L+RT+HS+I R+P ++E+IL+DD S++ L L+ YI
Sbjct: 142 ITSLPTVSIIIVFHNEAWSTLLRTIHSVINRSPLHLIKEVILIDDLSNRTYLRSPLDLYI 201
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
+RF+ LI ER GLIR R +GAK ++G+V++FLDAH EV WL PLL + DRK
Sbjct: 202 KRFSLPFHLIHLPERSGLIRARLQGAKIAKGKVLLFLDAHVEVTEGWLEPLLDRVSVDRK 261
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKS 256
+ P+ID I + +E+ + D + G F W + ++ +P RE ++R ++ S P ++
Sbjct: 262 RVVAPIIDVISDENFEY--ITASDITWGG-FNWHLNFRWYPVPMREMERRNHDRSVPLQT 318
Query: 257 PTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRS 316
PT AGGLFA+DR FF ++G YD G+ VWGGEN E+SF++WMCGGS+E PCSR+GHV+R
Sbjct: 319 PTIAGGLFAIDRQFFYDIGSYDEGMEVWGGENLEISFRVWMCGGSLEIHPCSRVGHVFRK 378
Query: 317 FMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
PY+F G A+ +I N R E W DE +K FY P A +D+GD++E+
Sbjct: 379 HTPYSFPGGTAN-----VIHRNAARTAEVWMDE-YKDIFYKMVPAAKNVDIGDLTER 429
>gi|225007540|ref|NP_001070030.2| polypeptide N-acetylgalactosaminyltransferase 11 [Danio rerio]
Length = 590
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 150/332 (45%), Positives = 200/332 (60%), Gaps = 14/332 (4%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ N+ SN + + R +PD R ++C+ Y + LP AS+++ F NE FS+L+RT
Sbjct: 97 DMGYHKHAFNVLISNRLGYHRDVPDTRTDKCRDRAYSVSLPTASIVICFFNEAFSALLRT 156
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRS 160
VHS++ RTP L EIILVDD S DL + L+ Y+Q+ KV+++RN +REGLIR R
Sbjct: 157 VHSVLDRTPNYLLHEIILVDDHSELDDLKEDLDSYVQQHLQKKVKVVRNEKREGLIRGRM 216
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV WL PLL PI +RK + PVID I T VY P
Sbjct: 217 IGASHATGEVLVFLDSHCEVNEAWLQPLLTPIKENRKTVVCPVIDIISADTL----VYTP 272
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E +SPT AGGLFAMDR +F ELG YD G
Sbjct: 273 SPIVRGGFNWGLHFKWDPVPMSELNS---PDGAIRSPTMAGGLFAMDRNYFYELGQYDRG 329
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + VPCSR+GH++R PY D + +N R
Sbjct: 330 MDIWGGENLEISFRIWMCGGQLLIVPCSRVGHIFRKRRPYGSPGGQD-----TMAHNSLR 384
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W D+ + YF R P D GDISE+
Sbjct: 385 LAHVWMDDYKEQYFALR-PELRNRDYGDISER 415
>gi|115313271|gb|AAI24298.1| Zgc:153274 [Danio rerio]
Length = 590
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 150/332 (45%), Positives = 200/332 (60%), Gaps = 14/332 (4%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ N+ SN + + R +PD R ++C+ Y + LP AS+++ F NE FS+L+RT
Sbjct: 97 DMGYHKHAFNVLISNRLGYHRDVPDTRTDKCRDRAYSVSLPTASIVICFFNEAFSALLRT 156
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRS 160
VHS++ RTP L EIILVDD S DL + L+ Y+Q+ KV+++RN +REGLIR R
Sbjct: 157 VHSVLDRTPNYLLHEIILVDDHSELDDLKEDLDSYVQQHLQKKVKVVRNEKREGLIRGRM 216
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV WL PLL PI +RK + PVID I T VY P
Sbjct: 217 IGASHATGEVLVFLDSHCEVNEAWLQPLLTPIKENRKTVVCPVIDIISADTL----VYTP 272
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E +SPT AGGLFAMDR +F ELG YD G
Sbjct: 273 SPIVRGGFNWGLHFKWDPVPMSELNS---PDGAIRSPTMAGGLFAMDRNYFYELGQYDRG 329
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + VPCSR+GH++R PY D + +N R
Sbjct: 330 MDIWGGENLEISFRIWMCGGQLLIVPCSRVGHIFRKRRPYGSPGGQD-----TMAHNSLR 384
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W D+ + YF R P D GDISE+
Sbjct: 385 LAHVWMDDYKEQYFALR-PELRNRDYGDISER 415
>gi|307204529|gb|EFN83209.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Harpegnathos
saltator]
Length = 605
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 145/353 (41%), Positives = 212/353 (60%), Gaps = 9/353 (2%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
K PGE G A H+P A N+ S+ IS +R++ D+R++ CK Y
Sbjct: 100 KGSPGEMGAAVHIPPENEAKQQELFKLNQFNLMASDMISLNRSLKDIRLDGCKNKKYNKY 159
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP S+++VFHNE +++L+RTV S+I R+P L+E+ILVDD S + L Q LEDYI
Sbjct: 160 LPDTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEVILVDDASERDHLKQDLEDYIATL 219
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
+ R +R GLIR R GAK +G+VI FLDAHCE WL PLL+ I +DR +
Sbjct: 220 PVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSRIANDRHTVV 279
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
P+ID I T+E+ + D + G F W + ++ + +RE +R + + P ++PT
Sbjct: 280 CPIIDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRNSDRTAPLRTPTM 336
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E PCS +GHV+R P
Sbjct: 337 AGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHVFRDKSP 396
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
Y F ++ + +N RV E W DE + ++Y P A +D+GD+SE+
Sbjct: 397 YTFPGGVSKI----VLHNAARVAEVWMDE-WRDFYYAMNPGARNVDVGDVSER 444
>gi|339244173|ref|XP_003378012.1| polypeptide N-acetylgalactosaminyltransferase 3 [Trichinella
spiralis]
gi|316973116|gb|EFV56743.1| polypeptide N-acetylgalactosaminyltransferase 3 [Trichinella
spiralis]
Length = 670
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 151/368 (41%), Positives = 216/368 (58%), Gaps = 11/368 (2%)
Query: 7 DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPD 66
D L L ++ G GE G + + ++ A E N+ S IS +RT+PD
Sbjct: 56 DSALQTLLAAMKSKSPGAGEMGSPVIIQSSLQSEVKARFKENQFNVVASERISLNRTLPD 115
Query: 67 LRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
R C+ Y K SV++VFHNE +S+LMRTV S+I R+ YLEEIILVDD S K
Sbjct: 116 YRSSACRSIKYEKISLKTSVVIVFHNEAWSTLMRTVQSVINRSSVDYLEEIILVDDASEK 175
Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
+L +E +++ LIR +R GLI R RGA+ ++G+V+ FLDAH EV WL
Sbjct: 176 DELIALVESFLKTIPVAHTLIRLPQRSGLIVGRVRGAEIAKGDVLTFLDAHVEVTDGWLE 235
Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
PLL+ I DR + PVID I T+++ + E G F W M ++ + RE K+
Sbjct: 236 PLLSRISEDRTRVVAPVIDVISDDTFQYVTAAESTW---GGFSWTMNFRWYQASAREQKR 292
Query: 247 R-KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
R K + P ++PT AGGLF++DR +F ++G YD G+ +WGGEN E+SF++WMCGG++E
Sbjct: 293 RGKNKTTPIRTPTIAGGLFSIDRKYFFDIGAYDEGMRIWGGENLEISFRVWMCGGTLEIN 352
Query: 306 PCSRIGHVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
PCS +GHV+R PY F G ++ + G N +R E W DE +K ++Y P AMF
Sbjct: 353 PCSHVGHVFRKQTPYTFEGGTSNVIYG-----NARRTAEVWMDE-YKEFYYKMTPSAMFA 406
Query: 365 DMGDISEQ 372
+G+IS++
Sbjct: 407 PLGNISDR 414
>gi|301759365|ref|XP_002915552.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5-like
[Ailuropoda melanoleuca]
Length = 448
Score = 285 bits (728), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 146/330 (44%), Positives = 206/330 (62%), Gaps = 11/330 (3%)
Query: 43 ASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTV 102
A +YG N S + +R +PD R + C YP DLP ASV++ FHNE F++L RT+
Sbjct: 100 AGFLKYGFNAILSKSLGSERDVPDTRNKMCLQKHYPADLPTASVVICFHNEEFNALFRTM 159
Query: 103 HSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRG 162
S+I TP LEEIILVDD SS DL +KL+ ++ F GK++LIRN +REGLIR+R G
Sbjct: 160 SSVINLTPHHILEEIILVDDLSSVDDLKEKLDHRLEIFRGKIKLIRNKKREGLIRSRLIG 219
Query: 163 AKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDH 222
A + G+V+VFLD+HCEV WL PLLAPI D K++ P+ID ID++T E+R P
Sbjct: 220 ASRASGDVLVFLDSHCEVNHVWLQPLLAPIAKDPKMVVCPLIDPIDHKTLEYR----PSP 275
Query: 223 HYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLL 282
RG F W + +K + + E + ++P +SP AGG+FA++R +F E+G YD +
Sbjct: 276 VVRGAFTWHLEFKWDNVLSYEIDGPEGPTKPIRSPAMAGGVFAINRHYFNEIGKYDRDME 335
Query: 283 VWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVI 342
+WG EN ELS +IWMCGG + +PCSR+GH+ + P G + +TYN R++
Sbjct: 336 LWGAENLELSLRIWMCGGQLFILPCSRVGHISKHRFPNQPGLMK------AVTYNNLRLV 389
Query: 343 ETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
W DE +K F+ R+P + G+ISE+
Sbjct: 390 HVWLDE-YKEQFFLRQPGLKSVAYGNISER 418
>gi|281339845|gb|EFB15429.1| hypothetical protein PANDA_003532 [Ailuropoda melanoleuca]
Length = 447
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 146/330 (44%), Positives = 206/330 (62%), Gaps = 11/330 (3%)
Query: 43 ASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTV 102
A +YG N S + +R +PD R + C YP DLP ASV++ FHNE F++L RT+
Sbjct: 100 AGFLKYGFNAILSKSLGSERDVPDTRNKMCLQKHYPADLPTASVVICFHNEEFNALFRTM 159
Query: 103 HSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRG 162
S+I TP LEEIILVDD SS DL +KL+ ++ F GK++LIRN +REGLIR+R G
Sbjct: 160 SSVINLTPHHILEEIILVDDLSSVDDLKEKLDHRLEIFRGKIKLIRNKKREGLIRSRLIG 219
Query: 163 AKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDH 222
A + G+V+VFLD+HCEV WL PLLAPI D K++ P+ID ID++T E+R P
Sbjct: 220 ASRASGDVLVFLDSHCEVNHVWLQPLLAPIAKDPKMVVCPLIDPIDHKTLEYR----PSP 275
Query: 223 HYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLL 282
RG F W + +K + + E + ++P +SP AGG+FA++R +F E+G YD +
Sbjct: 276 VVRGAFTWHLEFKWDNVLSYEIDGPEGPTKPIRSPAMAGGVFAINRHYFNEIGKYDRDME 335
Query: 283 VWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVI 342
+WG EN ELS +IWMCGG + +PCSR+GH+ + P G + +TYN R++
Sbjct: 336 LWGAENLELSLRIWMCGGQLFILPCSRVGHISKHRFPNQPGLMK------AVTYNNLRLV 389
Query: 343 ETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
W DE +K F+ R+P + G+ISE+
Sbjct: 390 HVWLDE-YKEQFFLRQPGLKSVAYGNISER 418
>gi|308457549|ref|XP_003091148.1| CRE-GLY-10 protein [Caenorhabditis remanei]
gi|308258137|gb|EFP02090.1| CRE-GLY-10 protein [Caenorhabditis remanei]
Length = 620
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 155/358 (43%), Positives = 214/358 (59%), Gaps = 19/358 (5%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKY 74
E +EGPGE GK +P+ +A L Y G N S+ IS +R+I D+R ++CK
Sbjct: 89 EKAREGPGEWGKPVKVPDDKETEKEA-LSLYKANGYNAYVSDMISLNRSIKDIRHKDCKK 147
Query: 75 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
Y LP SVI FH E S+L+R+V+S+I R+P + L+EIILVDDFS K L Q LE
Sbjct: 148 MMYSAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLE 207
Query: 135 DYIQR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
D++++ + V+++R +REGLIR R GA+E+ GE+++FLDAH E NWLPPLL PI
Sbjct: 208 DFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLPPLLDPI 267
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 252
D + + P +D ID +T+E R D RG F+W YK L +++ R+ +
Sbjct: 268 AEDYRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTKKD---RENPTT 321
Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
P+ SP AGG FA+ +F ELGGYD GL +WGGE +ELSFK+W C G + PCSR+ H
Sbjct: 322 PFNSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGRMVDAPCSRVAH 381
Query: 313 VYRS-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+YR + P+ + D ++ NYKRV E W DE +K Y P D GD+
Sbjct: 382 IYRCKYAPFKNAGMGD-----FVSRNYKRVAEVWMDE-YKETLYKHRPGVGSADAGDL 433
>gi|354478320|ref|XP_003501363.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5
[Cricetulus griseus]
Length = 435
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 209/339 (61%), Gaps = 17/339 (5%)
Query: 34 PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNE 93
PE Y+ +YG+N+ S + R +PD R + C YP +LP AS+I+ FHNE
Sbjct: 76 PEFYKG-----FAQYGLNVVISRRLGIQREVPDSRDKICHQKHYPFNLPTASIIICFHNE 130
Query: 94 GFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTERE 153
F++L+RTV S+I TP+ +LEEIILVDD S DL +KL+ +++ F GK++LIRN +RE
Sbjct: 131 EFNTLLRTVSSVINLTPSHFLEEIILVDDMSDTDDLKEKLDYHLELFRGKIKLIRNKKRE 190
Query: 154 GLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWE 213
GLIR+R GA + G+++VFLD+HCEV WL PLL I D K++ P+ID ID T +
Sbjct: 191 GLIRSRMIGASRASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPMIDVIDDTTLD 250
Query: 214 FRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLE 273
Y RG F+W ++++ + + E + S+P +SP AGG+FA+DR +F E
Sbjct: 251 ----YTAAPLVRGAFDWDLMFRWDNVFSYEMDGPEGTSKPIRSPAMAGGIFAIDRHYFTE 306
Query: 274 LGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPL 333
LG YD + +WGGEN ELS +IWMCGG + +PCSR+GH+ + NF A +
Sbjct: 307 LGQYDKDMDLWGGENVELSLRIWMCGGQLFILPCSRVGHIAKI---QNFNNAALKA---- 359
Query: 334 ITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+++N RV W DE HK F+ R P + G+ISE+
Sbjct: 360 LSWNLLRVAHVWLDE-HKDNFFLRRPYLKYEPYGNISER 397
>gi|308452095|ref|XP_003088913.1| hypothetical protein CRE_04439 [Caenorhabditis remanei]
gi|308244364|gb|EFO88316.1| hypothetical protein CRE_04439 [Caenorhabditis remanei]
Length = 620
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 155/358 (43%), Positives = 214/358 (59%), Gaps = 19/358 (5%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKY 74
E +EGPGE GK +P+ +A L Y G N S+ IS +R+I D+R ++CK
Sbjct: 89 EKAREGPGEWGKPVKVPDDKETEKEA-LSLYKANGYNAYVSDMISLNRSIKDIRHKDCKK 147
Query: 75 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
Y LP SVI FH E S+L+R+V+S+I R+P + L+EIILVDDFS K L Q LE
Sbjct: 148 MMYSAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLE 207
Query: 135 DYIQR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
D++++ + V+++R +REGLIR R GA+E+ GE+++FLDAH E NWLPPLL PI
Sbjct: 208 DFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLPPLLDPI 267
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 252
D + + P +D ID +T+E R D RG F+W YK L +++ R+ +
Sbjct: 268 AEDYRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTKKD---RENPTT 321
Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
P+ SP AGG FA+ +F ELGGYD GL +WGGE +ELSFK+W C G + PCSR+ H
Sbjct: 322 PFNSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGRMVDAPCSRVAH 381
Query: 313 VYRS-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+YR + P+ + D ++ NYKRV E W DE +K Y P D GD+
Sbjct: 382 IYRCKYAPFKNAGMGD-----FVSRNYKRVAEVWMDE-YKETLYKHRPGVGSADAGDL 433
>gi|417403257|gb|JAA48441.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 608
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 152/332 (45%), Positives = 204/332 (61%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ N+ SN + + R +PD R CK YP DLP ASV++ F+NE S+L+RT
Sbjct: 111 DLGYQKHAFNLLISNRLGYHRDVPDTRSAACKDETYPEDLPVASVVICFYNEALSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRS 160
VHS++ RTPAQ L E+ILVDD S DL +L++++Q + GK+++IRNT+REGLIR R
Sbjct: 171 VHSVLDRTPAQLLREVILVDDDSDFDDLKGQLDEFVQTQLPGKIKVIRNTKREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV WL PLLA I DR+ + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNTMWLQPLLATIQEDRRTVVCPVIDIISADTL----AYSS 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLIPPSELGGPGGATAPIKSPTMAGGLFAMNRDYFDELGRYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D + +N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGRD-----TMAHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLRT-RSYGNISER 432
>gi|118085566|ref|XP_418541.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Gallus
gallus]
Length = 608
Score = 284 bits (727), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 149/332 (44%), Positives = 201/332 (60%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R +C+ YP DLP ASVI+ F+NE S+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHREVPDTRDAKCREKSYPSDLPFASVIICFYNEALSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRS 160
VHS++ RTPA L EIILVDD S DL + L +Y++ R +L+RN +REGLIR R
Sbjct: 171 VHSVLDRTPAHLLHEIILVDDNSELDDLKKDLVEYVKTRLPKTTKLVRNEKREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + G+V+VFLD+HCEV WL PLL PI DR+ + PVID I T Y
Sbjct: 231 IGASHATGKVLVFLDSHCEVNEMWLQPLLTPIKEDRRTVVCPVIDIISADTL----TYSS 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + + P KSPT AGGLFAMDR +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELEGPEGATAPIKSPTMAGGLFAMDREYFNELGQYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY D + +N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGRLLIIPCSRVGHIFRKRRPYGSPGGQD-----TMAHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R P + G+I+++
Sbjct: 402 LAHVWMDEYKEQYFALR-PELRTRNYGNITDR 432
>gi|431895736|gb|ELK05155.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Pteropus alecto]
Length = 608
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 152/332 (45%), Positives = 206/332 (62%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM S+ + + R +PD R CK YP DLP ASV++ F+NE S+L+RT
Sbjct: 111 DLGYQKHAFNMLISDRLGYHRDVPDTRNAACKDKTYPADLPVASVVICFYNEALSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS++ RTPAQ L E+ILVDD S DL +L+ ++Q++ GK+++IRN +REGLIR R
Sbjct: 171 VHSVLDRTPAQLLHEVILVDDDSDFDDLKGELDAFVQKYLPGKIKVIRNRKREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV + WL PLLA I DR+ + PVID I T Y
Sbjct: 231 IGASHATGEVLVFLDSHCEVNVMWLQPLLAAIQEDRRTVVCPVIDIISADTL----AYSS 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLPEPGGPEGATAPIKSPTMAGGLFAMNRDYFSELGQYDRG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLRT-RSYGNISER 432
>gi|195035019|ref|XP_001989024.1| GH11491 [Drosophila grimshawi]
gi|193905024|gb|EDW03891.1| GH11491 [Drosophila grimshawi]
Length = 621
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 144/360 (40%), Positives = 217/360 (60%), Gaps = 11/360 (3%)
Query: 15 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
P + K PGE GK +P + E N+ S+ IS +R++ D+R E C++
Sbjct: 111 PTIRESKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHENCRH 170
Query: 75 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++LE
Sbjct: 171 KHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASERDFLGKQLE 230
Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
DY+ + + ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 231 DYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVAGEVITFLDAHCECTEGWLEPLLARIVQ 290
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEP 253
+R+ + P+ID I +T+E+ + D + G F W + ++ +P+RE +R + + P
Sbjct: 291 NRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPQREMARRNNDRTAP 347
Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +GHV
Sbjct: 348 LRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHV 407
Query: 314 YRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+R PY F G +A ++ +N RV E W DE + ++Y A GD+S++
Sbjct: 408 FRDKSPYTFPGGVA-----KIVLHNAARVAEVWLDE-WRDFYYAMSTGARKASAGDVSDR 461
>gi|431895737|gb|ELK05156.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
5 [Pteropus alecto]
Length = 447
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 146/326 (44%), Positives = 202/326 (61%), Gaps = 11/326 (3%)
Query: 47 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
EYG N S + +R +PD R + C+ YP+ LP AS+++ FHNE F++L RTV S+I
Sbjct: 104 EYGFNAVVSTSLGRERLVPDTRDKMCRRKHYPVSLPTASIVICFHNEEFNALFRTVSSVI 163
Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
TP LEEIILVDD S DL +KL+ +++ F GK++LIRN +REGLIR R GA +
Sbjct: 164 NLTPHHVLEEIILVDDMSEFDDLKEKLDHHLEMFRGKIKLIRNQKREGLIRARLIGASRA 223
Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
G+V+VFLD+HCEV WL PLL I DRK++ PVID ID T E+R P RG
Sbjct: 224 SGDVLVFLDSHCEVNRVWLEPLLYAISKDRKMVVCPVIDVIDSTTLEYR----PSPLVRG 279
Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F+W + +K + + E + + P +SP AGG+FA+ R +F E+G YD G+ +WGG
Sbjct: 280 AFDWYLQFKWDNVFSYELDGPEGLTRPIRSPAMAGGIFAIRRHYFNEIGQYDKGMDLWGG 339
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
EN ELS +IWMCGG I +PCSR+GH+ + ++ G + +TYN R+ W
Sbjct: 340 ENLELSLRIWMCGGQIFILPCSRVGHITKQQFSHSSGVIRA------MTYNSLRLAHVWL 393
Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
DE +K + R P F+ G+ISE+
Sbjct: 394 DE-YKEQVFLRRPGLRFIPYGNISER 418
>gi|194856530|ref|XP_001968770.1| GG24317 [Drosophila erecta]
gi|190660637|gb|EDV57829.1| GG24317 [Drosophila erecta]
Length = 630
Score = 284 bits (726), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 146/362 (40%), Positives = 218/362 (60%), Gaps = 11/362 (3%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
L P ++ K PGE GK +P + E N+ S+ IS +R++ D+R E C
Sbjct: 118 LAPTVKEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 177
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
+ Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++
Sbjct: 178 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE+Y+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 238 LEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 297
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R + +
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDRT 354
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF+IW CGG +E +PCS +G
Sbjct: 355 APLRTPTMAGGLFSIDKDYFYELGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVG 414
Query: 312 HVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
HV+R PY F G +A ++ +N RV E W DE + ++Y+ A GD+S
Sbjct: 415 HVFRDKSPYTFPGGVA-----KIVLHNAARVAEVWLDE-WRDFYYSMSTGARKASAGDVS 468
Query: 371 EQ 372
++
Sbjct: 469 DR 470
>gi|149634819|ref|XP_001513114.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Ornithorhynchus anatinus]
Length = 608
Score = 284 bits (726), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 150/332 (45%), Positives = 199/332 (59%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ N+ SN + R +PD R ECK YP LP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNVLISNRLGSHRDVPDTRDAECKEKSYPPHLPAASVVICFYNEAFSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRS 160
+HS++ RTPA L EIILVDD S DL L++YI+ +++IRN +REGLIR R
Sbjct: 171 IHSVLDRTPAHLLHEIILVDDNSELDDLKSGLDEYIRLHLPRNIQVIRNEKREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA ++ GEV+VFLD+HCEV WL PLL PI DR+ + PVID I T + S
Sbjct: 231 IGAAQATGEVLVFLDSHCEVNAMWLQPLLVPIREDRRTVVCPVIDIIGADTLAYSS---- 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELGGPGRATAPIKSPTMAGGLFAMNREYFRELGQYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY D + +N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGQLFIIPCSRVGHIFRKRRPYGSPGGQD-----TMAHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R P G+ISE+
Sbjct: 402 LAHVWMDEYKEQYFALR-PELRLRSYGNISER 432
>gi|24581865|ref|NP_608906.2| polypeptide GalNAc transferase 5, isoform A [Drosophila
melanogaster]
gi|195342664|ref|XP_002037920.1| GM18035 [Drosophila sechellia]
gi|51315874|sp|Q6WV17.2|GALT5_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
Short=pp-GaNTase 5; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 5; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|22945641|gb|AAF52218.2| polypeptide GalNAc transferase 5, isoform A [Drosophila
melanogaster]
gi|194132770|gb|EDW54338.1| GM18035 [Drosophila sechellia]
Length = 630
Score = 283 bits (725), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 145/362 (40%), Positives = 218/362 (60%), Gaps = 11/362 (3%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
L P ++ K PGE GK +P + E N+ S+ IS +R++ D+R E C
Sbjct: 118 LAPSVQEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 177
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
+ Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++
Sbjct: 178 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE+Y+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 238 LEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 297
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R + +
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDRT 354
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +G
Sbjct: 355 APLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVG 414
Query: 312 HVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
HV+R PY F G +A ++ +N RV E W DE + ++Y+ A GD+S
Sbjct: 415 HVFRDKSPYTFPGGVA-----KIVLHNAARVAEVWLDE-WRDFYYSMSTGARKASAGDVS 468
Query: 371 EQ 372
++
Sbjct: 469 DR 470
>gi|268575444|ref|XP_002642701.1| C. briggsae CBR-GLY-3 protein [Caenorhabditis briggsae]
Length = 611
Score = 283 bits (724), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 146/353 (41%), Positives = 217/353 (61%), Gaps = 13/353 (3%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW----DYPLD 80
G+GG +PE ++ + E N+ S IS +RT+PD R E C+ +
Sbjct: 110 GQGGTGVTVPEDQKSIKEKRFLENQFNVVASEMISVNRTLPDYRSEACRNAAGNEKTTVG 169
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP S+I+VFHNE +++L+RT+HS+I R+P LEEII++DD S + L + L+ YI++F
Sbjct: 170 LPTTSIIIVFHNEAWTTLLRTLHSVINRSPRHLLEEIIMIDDKSDRDYLVKPLDAYIKKF 229
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
V L+ ER GLIR R G+ ++G++++FLDAH EV WL PL+ + DRK +
Sbjct: 230 PIPVHLVHLEERSGLIRARLTGSGMAKGKILLFLDAHVEVTDGWLEPLVHRVAEDRKRVV 289
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
P+ID I T+E+ + E G F W + ++ +P+RE +R + S P ++PT
Sbjct: 290 APIIDVISDDTFEYVTASETTW---GGFNWHLNFRWYAVPKRELNRRGSDRSMPIQTPTI 346
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLFA+D+ FF ++G YD G+ VWGGEN E+SF++WMCGGS+E PCSR+GHV+R P
Sbjct: 347 AGGLFAIDKQFFYDIGSYDEGMQVWGGENLEISFRVWMCGGSLEIHPCSRVGHVFRKQTP 406
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
Y F +V I +N R E W DE +KA+FY P A ++ GD++++
Sbjct: 407 YTFPGGTAKV----IHHNAARTAEVWMDE-YKAFFYKMVPAAKNVEAGDVTDR 454
>gi|341900678|gb|EGT56613.1| CBN-GLY-3 protein [Caenorhabditis brenneri]
Length = 613
Score = 283 bits (724), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 145/352 (41%), Positives = 216/352 (61%), Gaps = 12/352 (3%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD---L 81
G+GG +PE ++ + E N+ S IS +RT+PD R E C+ + +
Sbjct: 111 GQGGTGVTVPEEKKSIKEKRFLENQFNVVASEMISVNRTLPDYRSEACRTAGNSIKTTGM 170
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P S+I+VFHNE +++L+RT+HS+I R+P LEEII++DD S + L + L+ YI+
Sbjct: 171 PTTSIIIVFHNEAWTTLLRTLHSVINRSPRHLLEEIIMIDDKSDRDYLVKPLDAYIKALP 230
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
V L+ ER GLIR R G+ ++G++++FLDAH EV WL PL++ + DRK +
Sbjct: 231 VPVHLVHLEERSGLIRARLTGSGMAKGKILLFLDAHVEVTEGWLEPLISRVAEDRKRVVA 290
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
P+ID I T+E+ + E G F W + ++ +P+RE +R + S P ++PT A
Sbjct: 291 PIIDVISDDTFEYVTASETTW---GGFNWHLNFRWYSVPKRELNRRGSDRSMPIQTPTIA 347
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+D+ FF ++G YD G+ VWGGEN E+SF++WMCGGS+E PCSR+GHV+R PY
Sbjct: 348 GGLFAIDKQFFYDIGSYDEGMQVWGGENLEISFRVWMCGGSLEIHPCSRVGHVFRKQTPY 407
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
F +V I +N R E W DE +KA+FY P A ++ GD++E+
Sbjct: 408 TFPGGTAKV----IHHNAARTAEVWMDE-YKAFFYKMVPAARNVEAGDVTER 454
>gi|195147490|ref|XP_002014712.1| GL18803 [Drosophila persimilis]
gi|194106665|gb|EDW28708.1| GL18803 [Drosophila persimilis]
Length = 630
Score = 283 bits (724), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 145/362 (40%), Positives = 217/362 (59%), Gaps = 11/362 (3%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
L P +E K PGE GK +P + E N+ S+ IS +R++ D+R E C
Sbjct: 118 LAPTVEEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 177
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
+ Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++
Sbjct: 178 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LEDY+ + + ++R +R GLIR R GA+ G+VI FLDAHCE WL PLLA I
Sbjct: 238 LEDYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVSGDVITFLDAHCECTEGWLEPLLARI 297
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R + +
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMSRRNNDRT 354
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +G
Sbjct: 355 APLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVG 414
Query: 312 HVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
HV+R PY F G +A ++ +N RV E W DE + ++Y A GD+S
Sbjct: 415 HVFRDKSPYTFPGGVA-----KIVLHNAARVAEVWLDE-WRDFYYAMSTGARKASAGDVS 468
Query: 371 EQ 372
++
Sbjct: 469 DR 470
>gi|34042969|gb|AAQ56702.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 617
Score = 283 bits (724), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 145/362 (40%), Positives = 218/362 (60%), Gaps = 11/362 (3%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
L P ++ K PGE GK +P + E N+ S+ IS +R++ D+R E C
Sbjct: 105 LAPSVQEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 164
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
+ Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++
Sbjct: 165 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 224
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE+Y+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 225 LEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 284
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R + +
Sbjct: 285 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDRT 341
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +G
Sbjct: 342 APLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVG 401
Query: 312 HVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
HV+R PY F G +A ++ +N RV E W DE + ++Y+ A GD+S
Sbjct: 402 HVFRDKSPYTFPGGVA-----KIVLHNAARVAEVWLDE-WRDFYYSMSTGARKASAGDVS 455
Query: 371 EQ 372
++
Sbjct: 456 DR 457
>gi|196001851|ref|XP_002110793.1| hypothetical protein TRIADDRAFT_11844 [Trichoplax adhaerens]
gi|190586744|gb|EDV26797.1| hypothetical protein TRIADDRAFT_11844, partial [Trichoplax
adhaerens]
Length = 490
Score = 283 bits (724), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 151/351 (43%), Positives = 207/351 (58%), Gaps = 14/351 (3%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
G+ G A +P + A + N S+ IS RT+PD R CK +PL LP
Sbjct: 1 GQNGTAVIVPAESKNASEQLFNRNHFNQWISDRISLHRTLPDPRHPMCKDQIFPLHLPTT 60
Query: 85 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS---KADLDQKLEDYIQRFN 141
SV++VFHNE +S+L+RTVHSI+ R+P L EIIL DD+S A+L LE Y +
Sbjct: 61 SVVVVFHNEAWSTLLRTVHSILSRSPPDLLHEIILQDDYSDPIGHAELFMPLELYTSKLE 120
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
KV++ RN + EGLIR+R G + V+ FLDAHCEV WL PLL IY + +
Sbjct: 121 -KVKIFRNEKHEGLIRSRLNGFSHATAPVVTFLDAHCEVTTGWLEPLLERIYLNETTVVC 179
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
P ID ID +T++++ + P RG+F W + ++ +P E K+RK +P SPT AG
Sbjct: 180 PEIDVIDDRTFQYQ--FGPPALMRGVFNWQLYFRWALIPPEEHKRRKSPIDPVWSPTMAG 237
Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
GLFA+ + FF LG YD VWGGEN E+SFK W+CGG +E VPCSR+GHV+R PY
Sbjct: 238 GLFAISKKFFKRLGTYDDQFDVWGGENMEISFKAWLCGGKLEIVPCSRVGHVFRHNQPYK 297
Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
FG G ++ N +RV E W D+ +K +FY +P + G+I+E+
Sbjct: 298 FG-------GNFLSRNSQRVAEVWLDD-YKEFFYQVQPHLRKEEFGNIAER 340
>gi|157107416|ref|XP_001649767.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108884053|gb|EAT48278.1| AAEL000654-PA [Aedes aegypti]
Length = 607
Score = 283 bits (724), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 152/356 (42%), Positives = 214/356 (60%), Gaps = 13/356 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE G A HL + D + G N S+ IS +R++PD+R + C+ Y
Sbjct: 80 EEKRTGIGEHGIAGHLEKKDEDMKDKLFKKNGFNAVLSDLISLNRSLPDIRHKGCRKKKY 139
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
+LP SV++ F+NE +S+L+RT S++ R+P + + EIILVDD S+K L +L+ Y+
Sbjct: 140 LSELPTVSVVVPFYNEHWSTLLRTASSVLLRSPPELITEIILVDDCSTKEFLKDQLDRYV 199
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
+ KV++I ER GLI R GAK + +V++FLD+H E +NWLPPLL PI D K
Sbjct: 200 EENMPKVKVIHLPERSGLITARLAGAKVATADVLIFLDSHTEANINWLPPLLEPIAEDYK 259
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
P ID ID+ +E+R+ D RG F+W YK L +++ + +EP++SP
Sbjct: 260 TCVCPFIDVIDWDNFEYRA---QDEGARGAFDWKFFYKRLPLLQKDLENP---TEPFESP 313
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIW CGG + PCSR+GH+YR +
Sbjct: 314 VMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRMYDAPCSRVGHIYRGY 373
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMGDISEQ 372
P+ + D ++ NYKRV E W DE +K Y Y R+ D GD+S+Q
Sbjct: 374 APFGNPRKKD-----FLSRNYKRVAEVWMDE-YKEYLYMRDRKKYDNTDAGDLSKQ 423
>gi|170056949|ref|XP_001864263.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
gi|167876550|gb|EDS39933.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
Length = 608
Score = 283 bits (724), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 157/361 (43%), Positives = 222/361 (61%), Gaps = 22/361 (6%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLG-----EYGMNMETSNHISFDRTIPDLRMEEC 72
E + GPGE GK P R GD + E G + S+ I+ +R+IPD+R +C
Sbjct: 82 ETERHGPGEHGK----PVKLRDPGDIKMNDKLYKENGYSAVVSDLIALNRSIPDIRHPQC 137
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
+ Y +LP SVI++F+NE +S+L+RTV+S++ R+P+ L+EIILV+D S+K L +
Sbjct: 138 RKKRYLQELPTVSVIIIFYNEHWSALLRTVYSVLNRSPSHLLKEIILVNDHSTKPFLWKP 197
Query: 133 LEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 191
L+++++ + KV+LI ER GLI R GAK + G+V++ LD+H EV +NWLPPLL P
Sbjct: 198 LQEFVESELSPKVKLIHLPERSGLIIARLAGAKAASGDVLIVLDSHTEVNVNWLPPLLEP 257
Query: 192 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 251
I D + P+ID I + T+E+RS D RG F+W YK LP R +
Sbjct: 258 IAQDYRTCVCPLIDVIVHDTFEYRS---QDEGKRGAFDWKFYYKR--LPLRPGDLDD-PT 311
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
EP++SP AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIW CGG + PCSR+G
Sbjct: 312 EPFESPIMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRMVDAPCSRVG 371
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HVYR + P+ + + +T N+KRV E W DE +K + Y R P + GD+++
Sbjct: 372 HVYRGYSPFPNPRGVN-----FVTRNFKRVAEVWMDE-YKQFLYERNPQFDKTNPGDLTK 425
Query: 372 Q 372
Q
Sbjct: 426 Q 426
>gi|125985507|ref|XP_001356517.1| GA16368 [Drosophila pseudoobscura pseudoobscura]
gi|54644841|gb|EAL33581.1| GA16368 [Drosophila pseudoobscura pseudoobscura]
Length = 630
Score = 283 bits (724), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 145/362 (40%), Positives = 217/362 (59%), Gaps = 11/362 (3%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
L P +E K PGE GK +P + E N+ S+ IS +R++ D+R E C
Sbjct: 118 LAPTVEEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 177
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
+ Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++
Sbjct: 178 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LEDY+ + + ++R +R GLIR R GA+ G+VI FLDAHCE WL PLLA I
Sbjct: 238 LEDYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVSGDVITFLDAHCECTEGWLEPLLARI 297
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R + +
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMSRRNNDRT 354
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +G
Sbjct: 355 APLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVG 414
Query: 312 HVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
HV+R PY F G +A ++ +N RV E W DE + ++Y A GD+S
Sbjct: 415 HVFRDKSPYTFPGGVA-----KIVLHNAARVAEVWLDE-WRDFYYAMSTGARKASAGDVS 468
Query: 371 EQ 372
++
Sbjct: 469 DR 470
>gi|170039457|ref|XP_001847550.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
gi|167863027|gb|EDS26410.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
Length = 619
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 158/361 (43%), Positives = 221/361 (61%), Gaps = 22/361 (6%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLG-----EYGMNMETSNHISFDRTIPDLRMEEC 72
E + GPGE GK P R GD L E G + S+ I+ +R+IPD+R +C
Sbjct: 93 ETERHGPGEHGK----PLKLRDPGDIKLNDKLYKENGYSAVVSDLIALNRSIPDIRHPQC 148
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
+ Y +LP SVI++F+NE +S+L+RTV+S++ R+P L+EIILV+D S+K L +
Sbjct: 149 RKKRYLQELPTVSVIIIFYNEHWSALLRTVYSVLNRSPPHLLKEIILVNDHSTKPFLWKP 208
Query: 133 LEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 191
L+++++ + KV+LI ER GLI R GAK + G+V++ LD+H EV +NWLPPLL P
Sbjct: 209 LQEFVESELSPKVKLIHLPERSGLIIARLAGAKAASGDVLIVLDSHTEVNVNWLPPLLEP 268
Query: 192 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 251
I D + P+ID I + T+E+RS D RG F+W YK LP R +
Sbjct: 269 IAQDYRTCVCPLIDVIVHDTFEYRS---QDEGKRGAFDWKFYYKR--LPLRPGDLDD-PT 322
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
EP++SP AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIW CGG + PCSR+G
Sbjct: 323 EPFESPIMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRMVDAPCSRVG 382
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HVYR + P+ + + +T N+KRV E W DE +K + Y R P + GD+++
Sbjct: 383 HVYRGYSPFPNPRGVN-----FVTRNFKRVAEVWMDE-YKQFLYERNPQFDKTNPGDLTK 436
Query: 372 Q 372
Q
Sbjct: 437 Q 437
>gi|326436254|gb|EGD81824.1| hypothetical protein PTSG_02538 [Salpingoeca sp. ATCC 50818]
Length = 604
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 150/339 (44%), Positives = 202/339 (59%), Gaps = 9/339 (2%)
Query: 35 EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNE 93
E + D N S+ IS R I D R CK YPLD LP +VI+ FHNE
Sbjct: 106 EEVKQEQDEGWKRNNFNQYISDRISLHRPIKDTRHAMCKDRTYPLDKLPDTTVIIPFHNE 165
Query: 94 GFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTERE 153
++L+RTV SI+ R+P + EI+L+DD S+ L L++ + K R++R +ER
Sbjct: 166 ARTTLLRTVWSILDRSPPSLINEILLIDDASTMEHLKAPLDEELATI-PKTRVLRLSERS 224
Query: 154 GLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWE 213
GLIR + GA++++G+V+ FLD+HCE + WL PLL IY DR + PVID ID +T+
Sbjct: 225 GLIRAKVFGAEQAKGKVVTFLDSHCECNVGWLEPLLERIYLDRTTVVTPVIDNIDKKTFA 284
Query: 214 FRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLE 273
+ P RGIF W + + +LP E KKRK P SPT AGGLF+MDR +F E
Sbjct: 285 YTG--SPTVITRGIFTWSLTFSWLDLPWFEQKKRKDPIAPLPSPTMAGGLFSMDREYFFE 342
Query: 274 LGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPL 333
+G YD G+ VWGGEN E+SF+IW CGG++E++PCSR+GHVYR F PY F A +
Sbjct: 343 IGSYDMGMDVWGGENLEISFRIWQCGGTLEFIPCSRVGHVYRDFHPYKFPSGAVQT---- 398
Query: 334 ITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
I N RV E W DE +K +Y P + GDIS++
Sbjct: 399 INKNLNRVAEVWMDE-YKELYYGVRPHHRAIGTGDISDR 436
>gi|308487864|ref|XP_003106127.1| CRE-GLY-6 protein [Caenorhabditis remanei]
gi|308254701|gb|EFO98653.1| CRE-GLY-6 protein [Caenorhabditis remanei]
Length = 693
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 151/365 (41%), Positives = 223/365 (61%), Gaps = 13/365 (3%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
NL P E + EG G HL + D++ N+ S+ IS R++P++R
Sbjct: 89 ANLYSPREEWGEG---GSGVTHLTPEQQKLADSTFAVNQFNLFVSDGISVRRSLPEIRKP 145
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
C+ YP DLP SVI+V+HNE +S+L+RTV S+I R+P L EI+LVDDFS + L
Sbjct: 146 SCRNITYPEDLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKHLLREILLVDDFSDRDFLR 205
Query: 131 -QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
KL++ ++ +++IR+ +R GLIR R GA+E++G+V+ FLD+HCE WL PLL
Sbjct: 206 YPKLDESLKPLPTDIKIIRSNQRVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLL 265
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
I +RK + PVID I+ T++++ E +RG F W + ++ +P AK+
Sbjct: 266 TRIKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPTEMAKQHLL 322
Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
+ + P +SPT AGGLF++DR +F ELG YDPG+ +WGGEN E+SF+IW CGG +E +PCS
Sbjct: 323 DPTGPIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCS 382
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL-DMG 367
+GHV+R P++F + G ++ N RV E W DE K YFY P+A + +
Sbjct: 383 HVGHVFRKSSPHDF---PGKSSGKVLNANLLRVAEVWMDE-WKYYFYKIAPVAFRMRESI 438
Query: 368 DISEQ 372
D+SE+
Sbjct: 439 DVSER 443
>gi|16648224|gb|AAL25377.1| GH23657p [Drosophila melanogaster]
Length = 536
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 145/362 (40%), Positives = 218/362 (60%), Gaps = 11/362 (3%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
L P ++ K PGE GK +P + E N+ S+ IS +R++ D+R E C
Sbjct: 24 LAPSVQEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 83
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
+ Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++
Sbjct: 84 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 143
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE+Y+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 144 LEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 203
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R + +
Sbjct: 204 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDRT 260
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +G
Sbjct: 261 APLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVG 320
Query: 312 HVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
HV+R PY F G +A ++ +N RV E W DE + ++Y+ A GD+S
Sbjct: 321 HVFRDKSPYTFPGGVA-----KIVLHNAARVAEVWLDE-WRDFYYSMSTGARKASAGDVS 374
Query: 371 EQ 372
++
Sbjct: 375 DR 376
>gi|348568063|ref|XP_003469818.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5-like
[Cavia porcellus]
Length = 499
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 151/349 (43%), Positives = 203/349 (58%), Gaps = 16/349 (4%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
PG+ Y PE + EYG N+ S + DR +PD R + C++ YPL LP
Sbjct: 138 PGDQNINYSDPELFNG-----YLEYGFNVIVSRSLGHDREVPDTRDKSCRHRHYPLHLPT 192
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
ASVI+ FHNE F++L+RTV S++ TP LEEIILVDD S DL KL Y++ F K
Sbjct: 193 ASVIICFHNEEFNALLRTVSSVVYLTPPYLLEEIILVDDMSKFDDLKSKLNYYLESFRDK 252
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
V+L+RN +REGLIR R GA + GEV+VFLD+HCEV WL PLLA I D + + PV
Sbjct: 253 VQLVRNKKREGLIRARMIGAWYASGEVLVFLDSHCEVNRVWLEPLLAAISKDSRTVVTPV 312
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 263
ID ID + + Y P RG F+W + +K + + E + P +SP AGG+
Sbjct: 313 IDIIDGISLQ----YLPSPLVRGAFDWKLQFKWDSVFSYETDSEGSPTNPIRSPAMAGGI 368
Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
FAM R FF ELG YD + +WGGEN ELS +IWMCGG + +PCSR+GH+ + +
Sbjct: 369 FAMHRPFFYELGEYDKDMDLWGGENLELSLRIWMCGGQLLIIPCSRVGHITKLY------ 422
Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
D + N+ R++ W DE +K F+ R P + G+ISE+
Sbjct: 423 SKPDSALSKAVARNHLRLVHVWLDE-YKEQFFLRNPDLKSMTYGNISER 470
>gi|198422185|ref|XP_002121130.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
4 [Ciona intestinalis]
Length = 582
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 158/368 (42%), Positives = 216/368 (58%), Gaps = 24/368 (6%)
Query: 13 LEPPLEPYKEGPGEGGKAYHL----PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
L P + GPGEGG A L PE + D S+ Y +N S IS R + D R
Sbjct: 67 LSRPADIDPRGPGEGGSAVRLLNLSPEVSKQQED-SIQTYAVNQFVSERISLHRRLQDPR 125
Query: 69 MEECKY---WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
E CK +DY LP SV++ F+NEG+S+L+RTV S++ +P L EIILVDD+S
Sbjct: 126 HEMCKSRRPFDY-RSLPTTSVVIAFYNEGWSTLIRTVFSVLHNSPDALLTEIILVDDYSD 184
Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
K L KL D+++ +VRL+R T+REGL+R R GA ++GEV+ FLD HCE WL
Sbjct: 185 KVYLKDKLADFLKAL-ARVRLVRTTKREGLVRARLLGASLAKGEVLTFLDCHCECVEGWL 243
Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-GIFEWGMLYKENELPEREA 244
PLL I D ++ VPVID ID+ T+E+ Y H + G F+W + ++ + +P+ E
Sbjct: 244 EPLLERIMEDESVIVVPVIDTIDWNTFEY---YYGGHEPQIGGFDWRLTFQWHTIPDHER 300
Query: 245 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
K+RK +P +SPT AGGLFA+ + +F +G YD G+ +WGGEN ELSF+ WMCGG +E
Sbjct: 301 KRRKSPVDPIRSPTMAGGLFAVSKRYFTRIGTYDAGMEIWGGENLELSFRTWMCGGKLET 360
Query: 305 VPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
+PCS +GHV+ PY P N R E W D+ +K +FY R P A
Sbjct: 361 IPCSHVGHVFPKQSPY---------PRPKFLTNTLRAAEVWMDD-YKRHFYIRNPPASKE 410
Query: 365 DMGDISEQ 372
+ GDIS +
Sbjct: 411 NYGDISAR 418
>gi|449666442|ref|XP_002161887.2| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 6-like [Hydra
magnipapillata]
Length = 591
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 161/370 (43%), Positives = 213/370 (57%), Gaps = 27/370 (7%)
Query: 17 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
L P G G+A +A D S +YG N S+ IS +R+IPD R C D
Sbjct: 67 LNPEPGSAGMEGQAVSNSVNEKAIEDKSFDDYGFNELASSKISLERSIPDNRDSSCFNVD 126
Query: 77 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK---ADLDQKL 133
YP+ L SVI++FHNE +S L+RTVH+++ R+P L+EIILVDD S K L +KL
Sbjct: 127 YPVKLSTTSVIVIFHNEAWSVLLRTVHTVLARSPPHMLKEIILVDDASVKEKYGHLGEKL 186
Query: 134 EDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIY 193
E+Y+ + KV+LIR+ R GL + R GA + GEV+VFLD+HCE WL PLLA +
Sbjct: 187 ENYVNTLS-KVKLIRSPVRVGLTQARLIGADNAVGEVLVFLDSHCEASFGWLEPLLARLQ 245
Query: 194 SDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEP 253
+ K+ VP I+ I ++ +E+ S E + RGIF W +++ LP RE +RKY S+P
Sbjct: 246 ENPKLAVVPDIEVISFKNFEYSS--EKGSYNRGIFSWELMFNWGPLPPREKMRRKYESDP 303
Query: 254 YKSPTHAGGLFAMDRAFFLELGGYD---------PGLLVWGGENFELSFKIWMCGGSIEW 304
KSPT AGGLFAM+R +F E G YD L WGGEN E+SF++WMCG IE
Sbjct: 304 IKSPTMAGGLFAMNRKYFFESGAYDRQNILGRXXXXLTYWGGENVEMSFRLWMCGEGIEI 363
Query: 305 VPCSRIGHVYRSFMPYNFGKLADRVKGP--LITYNYKRVIETWFDEKHKAYFYTREPLAM 362
+PCSR+GHV+R PY K P +N RV E W DE K FY+
Sbjct: 364 IPCSRVGHVFRERAPY---------KSPDGSTDHNSIRVAEVWMDE-FKEIFYSFRANLK 413
Query: 363 FLDMGDISEQ 372
GD+SE+
Sbjct: 414 PEQGGDVSER 423
>gi|443683126|gb|ELT87494.1| hypothetical protein CAPTEDRAFT_198873 [Capitella teleta]
Length = 495
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 147/345 (42%), Positives = 216/345 (62%), Gaps = 10/345 (2%)
Query: 28 GKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD-LPKASV 86
G+A +PE+ A N+ S IS +RT+ D+RM+ CK YP++ LP SV
Sbjct: 2 GQAVIIPESQHAEMKEKFKVNQFNLMASELISVNRTLRDVRMDSCKSKTYPVESLPTTSV 61
Query: 87 ILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRL 146
++VFHNE +S+L+RTVHS+I R+P L+EIILVDD S K L ++L++Y+ + + V +
Sbjct: 62 VIVFHNEAWSTLLRTVHSVINRSPPPLLKEIILVDDASEKDFLGRQLDEYLSKLSVHVYV 121
Query: 147 IRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDG 206
+R +R GLIR R +GA + G+VI FLDAHCE WL PLL I+ +RK + P+ID
Sbjct: 122 LRMEKRTGLIRARLKGAARAEGKVITFLDAHCECTEGWLEPLLFEIHKNRKSVVCPIIDV 181
Query: 207 IDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFA 265
I +T+E+ + D + G F W + ++ +P+RE ++R + S P +SPT AGGL A
Sbjct: 182 ISDETFEY--ITGSDMTWGG-FNWKLNFRWYPVPQREVERRGGDRSLPLRSPTMAGGLLA 238
Query: 266 MDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKL 325
++R +F E+G YD G+ +WGGEN E+SF+IWMCGG++ V CS +GHV+R PY F
Sbjct: 239 IERDYFYEIGSYDDGMDIWGGENLEMSFRIWMCGGTLLIVTCSHVGHVFRKATPYTFPGG 298
Query: 326 ADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
R+ I +N R+ E W DE ++++Y P D GD+S
Sbjct: 299 TGRI----INHNNARLAEVWMDE-WRSFYYKINPGVKQTDYGDLS 338
>gi|312371733|gb|EFR19844.1| hypothetical protein AND_21714 [Anopheles darlingi]
Length = 637
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 154/357 (43%), Positives = 214/357 (59%), Gaps = 14/357 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAY-RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
E + GPGE GK Y L +A D E G + S+ I+ +R++PD+R C+
Sbjct: 89 EANRVGPGEHGKPYRLTGVEEKALNDKLFKENGYSAVVSDMIALNRSVPDIRHISCRTKA 148
Query: 77 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
Y +LP SVI++F+NE +S+L+RTV+S++ R+PA L+E+ILV+D S+K L L ++
Sbjct: 149 YLRELPTVSVIVIFYNEHWSALLRTVYSVLNRSPASLLKEVILVNDHSTKPFLWAPLREF 208
Query: 137 IQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
++ KVRLI ER GLI R GA+E+RG+V++ LD+H EV NWLPPLL PI D
Sbjct: 209 VESELAPKVRLIDLPERSGLILARMAGAREARGDVLIVLDSHTEVNNNWLPPLLEPIAED 268
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYK 255
+ P ID I + T+++R+ D RG F+W YK L + ++P+
Sbjct: 269 YRTCVCPFIDVIAHDTFQYRA---QDEGKRGAFDWKFYYKRLPLLPGDLDD---PTKPFN 322
Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
SP AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIW CGG + PCSR+GHVYR
Sbjct: 323 SPVMAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRLVDAPCSRVGHVYR 382
Query: 316 SFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ P+ + + + N+KRV E W DE K + Y R PL D GD++ Q
Sbjct: 383 GYAPFGNPRGVN-----FVVRNFKRVAEVWMDEYAK-FLYERNPLFEKTDPGDLTAQ 433
>gi|47228512|emb|CAG05332.1| unnamed protein product [Tetraodon nigroviridis]
Length = 595
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 151/352 (42%), Positives = 204/352 (57%), Gaps = 24/352 (6%)
Query: 35 EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEG 94
EA + D+ + N+ S + + R +PD R +C+ YP DLP+ASV++ F NE
Sbjct: 79 EADQQLRDSGYHRHAFNLLISTRLGYHRELPDTRDPQCRDRTYPGDLPRASVVICFFNEA 138
Query: 95 FSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTERE 153
S+L+RTVHS++ RTP L EIILVDD+S +L L+ Y+Q GKVR++RN +RE
Sbjct: 139 LSALLRTVHSVLDRTPPFLLHEIILVDDYSELEELKGDLDRYVQAELRGKVRVLRNQKRE 198
Query: 154 GLIRTRSRGAKESRG-------------EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
GLIR R GA ++ G EV+VFLD+HCEV WL PLLAPI DR+ +
Sbjct: 199 GLIRGRMIGAAQASGVSPDPQILDLCSGEVLVFLDSHCEVNQMWLQPLLAPIRQDRRTVV 258
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
PVID I T Y P RG F WG+ +K + +P E K + P +SPT A
Sbjct: 259 CPVIDIISADTLS----YSPSPIVRGGFNWGLHFKWDPVPPAELKSPQGPVGPIRSPTMA 314
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA++R +F E+G YD G+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY
Sbjct: 315 GGLFAINRKYFNEIGQYDAGMDIWGGENLEISFRIWMCGGQLFIIPCSRVGHIFRKRRPY 374
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
D + +N R+ W DE + Y R L D GDI E+
Sbjct: 375 GSPGGQD-----TMAHNSLRLAHVWMDEYKEQYLSMRPDLRQ-RDYGDIGER 420
>gi|147907290|ref|NP_001085038.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Xenopus
laevis]
gi|47506925|gb|AAH71009.1| MGC81150 protein [Xenopus laevis]
Length = 582
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 161/375 (42%), Positives = 219/375 (58%), Gaps = 26/375 (6%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLP--EAYRAAGDASLGEYGMNMETSNHI 58
+PV+K +PP +P PGE GKA L + + S+ +Y +N+ S+ I
Sbjct: 65 QPVYK--------KPPPDP--NMPGEWGKAARLELGPTEKKMQEESIEKYALNIYLSDQI 114
Query: 59 SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
S R I D RM ECK + LP SVI+ F+NE S+L+RT+HS+++ +PA L EI
Sbjct: 115 SLHRHIMDNRMYECKSKTFSYRKLPTTSVIIAFYNEALSTLLRTIHSVLESSPAVLLREI 174
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
ILVDDFS K L +LEDYI + +VRLIR T+REGL+R R GA + G+V+ FLD H
Sbjct: 175 ILVDDFSDKVYLKSQLEDYIGGLD-RVRLIRTTKREGLVRARIIGATYAIGDVLTFLDCH 233
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKEN 237
CE WL PLL I + + PVID ID+ T+EF + G F+W + ++ +
Sbjct: 234 CECVTGWLEPLLERIGENETAVVCPVIDTIDWNTFEF--YMQTGEPMIGGFDWRLTFQWH 291
Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
+PE+E ++RK +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W
Sbjct: 292 AVPEKERQRRKSRIDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMEVWGGENLELSFRVWQ 351
Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
CGG++E PCS +GHV+ PY P N R E W D +K FY R
Sbjct: 352 CGGTLEIEPCSHVGHVFPKKAPY---------ARPNFLQNTARAAEVWMD-GYKELFYNR 401
Query: 358 EPLAMFLDMGDISEQ 372
P A + GDISE+
Sbjct: 402 NPPAQKENYGDISER 416
>gi|268574330|ref|XP_002642142.1| C. briggsae CBR-GLY-6 protein [Caenorhabditis briggsae]
Length = 617
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 150/353 (42%), Positives = 216/353 (61%), Gaps = 12/353 (3%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
NL P + + EG G HL + D++ N+ S+ IS R++P++R
Sbjct: 89 ANLYSPHDDWGEG---GTGVSHLTPEQQKRADSTFAVNQFNLLVSDGISVRRSLPEIRKP 145
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
C+ YP DLP SVI+V+HNE +S+L+RTV S+I R+P L+EIILVDDFS + L
Sbjct: 146 SCRNITYPEDLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKHLLKEIILVDDFSDREFLR 205
Query: 131 -QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
KL++ I+ +++IR+ ER GLIR R GA+E++G+V+ FLD+HCE WL PLL
Sbjct: 206 YPKLDESIKPIPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLL 265
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
I +RK + PVID I+ T++++ E +RG F W + ++ +P AK+
Sbjct: 266 TRIKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPSSMAKQHLL 322
Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
+ + P +SPT AGGLF++DR +F ELG YDPG+ +WGGEN E+SF+IW CGG +E +PCS
Sbjct: 323 DPTGPIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCS 382
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+GHV+R P++F + G ++ N RV E W DE K YFY P A
Sbjct: 383 HVGHVFRKSSPHDF---PGKSSGKVLNANLLRVAEVWMDE-WKYYFYKIAPQA 431
>gi|410909548|ref|XP_003968252.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Takifugu rubripes]
Length = 580
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 148/339 (43%), Positives = 201/339 (59%), Gaps = 11/339 (3%)
Query: 35 EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEG 94
EA + D+ + N+ S + R +PD R +C+ YP DLP ASV++ F NE
Sbjct: 77 EADQQLRDSGYHRHAFNLLISTRLGPHRDLPDTRDPQCRDRIYPRDLPPASVVICFFNEA 136
Query: 95 FSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTERE 153
S+L+RTVHS++ RT L EIILVDD+S +L L+ Y+Q GKV+++RN RE
Sbjct: 137 LSALLRTVHSVLDRTAPFLLHEIILVDDYSELEELKGDLDRYVQAELQGKVKVLRNQRRE 196
Query: 154 GLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWE 213
GLIR R GA + G+V+VFLD+HCEV WL PLLA I+ DR+ + PVID I T
Sbjct: 197 GLIRGRMIGAAHASGQVLVFLDSHCEVNQMWLEPLLASIHEDRRTVVCPVIDIISADTLS 256
Query: 214 FRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLE 273
Y P RG F WG+ +K + +P E K K +P +SPT AGGLFA++R +F E
Sbjct: 257 ----YSPSPIVRGGFNWGLHFKWDPVPPSELKSPKGPVDPIRSPTMAGGLFAINRKYFNE 312
Query: 274 LGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPL 333
+G YD G+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY D
Sbjct: 313 MGQYDAGMDIWGGENLEISFRIWMCGGQLLIIPCSRVGHIFRKRRPYGSPGGQD-----T 367
Query: 334 ITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ +N R+ W DE + Y R P D GDIS++
Sbjct: 368 MAHNSLRLAHVWMDEYKEQYLSMR-PELRERDYGDISDR 405
>gi|410897066|ref|XP_003962020.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Takifugu rubripes]
Length = 600
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 163/353 (46%), Positives = 215/353 (60%), Gaps = 14/353 (3%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
G+ G+A H+ + A S E N+ SN I DR IPD R E C DLP
Sbjct: 101 GQFGQAVHVSSSEDALVRKSWDEGFFNVYLSNQIPLDRAIPDTRPESCAQTLVHDDLPST 160
Query: 85 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 144
SVI F +E +S+L+R+VHS++ R+P LEEIILVDDFS+K L L+ Y+ +F KV
Sbjct: 161 SVIFCFVDEVWSTLLRSVHSVLNRSPPHLLEEIILVDDFSTKEYLKAPLDKYMSQF-PKV 219
Query: 145 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 204
R+IR ER+GLIR R GA ++GEV+ FLD+H E + WL PLL IY DR+ + PVI
Sbjct: 220 RIIRLRERQGLIRARLAGAAAAKGEVLTFLDSHVECNVGWLEPLLERIYMDRRKVPCPVI 279
Query: 205 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGL 263
+ I+ + + V D+ RGIF W +++ + LPE KK S+P + P AGGL
Sbjct: 280 EVINDKDMSYMLV---DNFQRGIFRWPLVFGWSPLPEAYIKKHNLTISDPIRCPVMAGGL 336
Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
F++D+ +F ELG YD GL VWGGEN E+SFKIWMCGG IE +PCSR+GH++R PY F
Sbjct: 337 FSIDKKYFYELGAYDSGLDVWGGENMEISFKIWMCGGEIEIIPCSRVGHIFRGQNPYKFP 396
Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMF----LDMGDISEQ 372
K DR K + N RV E W DE +K FY + +D+GD+SEQ
Sbjct: 397 K--DRQKT--VERNLARVAEVWLDE-YKDLFYGHGYHHLLDKSVIDIGDLSEQ 444
>gi|390364218|ref|XP_793815.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like,
partial [Strongylocentrotus purpuratus]
Length = 531
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 145/304 (47%), Positives = 195/304 (64%), Gaps = 13/304 (4%)
Query: 72 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
CK YP DLP S+I+ FHNE +S+L+RT++SII R+P + ++EIIL+DD S+ L +
Sbjct: 85 CKNISYPHDLPSTSIIICFHNEAWSTLLRTLNSIIDRSPLRLIKEIILLDDASTMEHLQE 144
Query: 132 KLEDYIQRFNG-KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
+EDYI + + ++R++R +R GLI+ R G S GE FLD+H EV + WL PLLA
Sbjct: 145 PIEDYISQIHSVRIRMVRAEKRLGLIKARMMGVDASEGETFTFLDSHVEVMIGWLEPLLA 204
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
+ SDR I+ +PV+D I+ T+ + V EP RG F W Y+ +P + KR
Sbjct: 205 RLASDRTIVVMPVVDEINKDTFNYNVVPEPLQ--RGGFNWRFEYRWKPIPNYD--KRPSK 260
Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
P KSP GGL MDR+FFLELGG+D G+ VWGGEN E S KIWMCGGSIE +PCSR+
Sbjct: 261 VAPIKSPAMPGGLLTMDRSFFLELGGFDLGMEVWGGENLETSLKIWMCGGSIEIIPCSRV 320
Query: 311 GHVYRSFMPYNFGKLADRVKGPL--ITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
GHVYR PY+F + PL + +N RV+E W DE HK +FY R P+ D GD
Sbjct: 321 GHVYRDTSPYSFLG-----QNPLDIVEHNAMRVVEVWTDE-HKHHFYDRLPMLKNRDFGD 374
Query: 369 ISEQ 372
+S++
Sbjct: 375 VSKR 378
>gi|189237799|ref|XP_001814012.1| PREDICTED: similar to N-acetylgalactosaminyltransferase [Tribolium
castaneum]
gi|270008127|gb|EFA04575.1| PNR-like protein [Tribolium castaneum]
Length = 614
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 151/366 (41%), Positives = 213/366 (58%), Gaps = 13/366 (3%)
Query: 8 GKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL 67
KL + P L K + G ++ + + D ++ N+ S +S+ R +PD
Sbjct: 81 NKLQPVYPKLSTDKNELSQLGLVKNIDDQRKK--DEGYKKHAYNVLISERLSYHRDVPDT 138
Query: 68 RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
R E CK Y DLP A++I+ F+NE + +L+RTVHSII RTPA L+EI+LVDDFS
Sbjct: 139 RNELCKNISYSADLPTAAIIICFYNEHYYTLLRTVHSIIDRTPASVLKEILLVDDFSDLE 198
Query: 128 DLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
+L + L YI + F+ +V+LI+ REGLIR R GA+ ++ +VI+FLD+H EV + W+
Sbjct: 199 NLHENLSTYITKNFDDRVKLIKTERREGLIRARLFGARRTKQDVIIFLDSHIEVNVGWIE 258
Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
PLL I + + +PVID I+ T+ Y RG F WG+ +K LP+
Sbjct: 259 PLLQRIKDNYTNVAMPVIDIINADTF----AYTASPLVRGGFNWGLHFKWENLPKGTLST 314
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
+ +P KSPT AGGLFAM R +F +LG YD G+ +WGGEN E+SF+IWMCGG +E +P
Sbjct: 315 KMDFIKPIKSPTMAGGLFAMSRKYFTDLGEYDAGMNIWGGENLEISFRIWMCGGRLELIP 374
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDM 366
CSR+GHV+R PY D + +N RV W D +K YF P A +D
Sbjct: 375 CSRVGHVFRQRRPYGAPDGQD-----TMLHNSLRVANVWMDS-YKEYFLNHRPDAKRIDF 428
Query: 367 GDISEQ 372
GD+S +
Sbjct: 429 GDVSSR 434
>gi|312377569|gb|EFR24376.1| hypothetical protein AND_11091 [Anopheles darlingi]
Length = 1150
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 155/345 (44%), Positives = 210/345 (60%), Gaps = 14/345 (4%)
Query: 17 LEPYKEGPGEGGKAYHLPEAYRAAGDAS--LGEYGMNMETSNHISFDRTIPDLRMEECKY 74
L+ + GPGE GKA L A + + G N S+ IS +R+I DLR CK
Sbjct: 216 LDRERVGPGEQGKAATLSPAESDSEQRKKLYLQNGFNALLSDKISINRSIADLRHPSCKL 275
Query: 75 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
Y LP ASV++ F+ E +S+L+RTVHS++ R+P+ L+E+I+VDD S+K L +L+
Sbjct: 276 QQYFKHLPTASVVVPFYEEHWSTLLRTVHSVLNRSPSHLLKEVIIVDDGSTKEFLHGQLQ 335
Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
+Y+ + KV+LIR ER GL++ R GAK + G+V+VFLD+H E G NWLPPLL PI
Sbjct: 336 NYVNQNLPKVKLIRQGERTGLMKARLAGAKLASGDVLVFLDSHTEAGYNWLPPLLEPIAE 395
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
+ K P+ID ID QT+ +++ D RG+F+W YK L E + R + P+
Sbjct: 396 NPKTCVCPLIDVIDDQTF---NIHPQDDGGRGLFDWRFHYKRLALKESD---RVSPTAPF 449
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
SP AGGLFA+ FF ELGGYD L +WG E +ELSFKIW CGG + PCSR H+Y
Sbjct: 450 PSPVMAGGLFAIGTNFFWELGGYDEELDIWGAEQYELSFKIWQCGGRMLDAPCSRFSHIY 509
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
RS+ P+ + D IT N+KRV E W DE +K Y Y R+P
Sbjct: 510 RSYSPFPNSRKYD-----FITRNHKRVAEIWMDE-YKQYIYDRDP 548
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 136/301 (45%), Positives = 188/301 (62%), Gaps = 13/301 (4%)
Query: 72 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
C Y LP+ SVI+ F++E +S+L+RTV+S+++R+P+ L EIILVDD S K L +
Sbjct: 675 CHNIKYLQHLPRTSVIIPFYDEHWSTLLRTVYSVMRRSPSSLLLEIILVDDGSMKNFLKE 734
Query: 132 KLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
+L+ Y+ V++I R GLI R GAK ++G+V+VFLD+H E G+NWLPPLL
Sbjct: 735 QLDHYVATHLKHLVKIIHLPTRSGLITARLAGAKIAKGDVLVFLDSHVEAGINWLPPLLE 794
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
PI + + P ID I T+E D RG F+W MLYK LP R + +K
Sbjct: 795 PIAHNPRTCVCPFIDVIMDDTFELTP---QDQGARGAFDWNMLYKR--LPLR-PEDQKDP 848
Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
++P++SP AGGLFA+ FF ELGGYD L +WG E +ELSFKIW CGG + PCSR+
Sbjct: 849 TQPFESPVMAGGLFAISSMFFWELGGYDEMLEIWGAEQYELSFKIWQCGGRMIDAPCSRV 908
Query: 311 GHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
GH+YRS+ P+ K D V N+KRV E W DE +K Y Y ++P+ +D GD++
Sbjct: 909 GHIYRSYSPFPNVKSYDYV-----AKNHKRVAEVWMDE-YKKYVYRKDPMRFSIDAGDLT 962
Query: 371 E 371
+
Sbjct: 963 K 963
>gi|312083982|ref|XP_003144087.1| polypeptide N-acetylgalactosaminyltransferase 5 [Loa loa]
gi|307760750|gb|EFO19984.1| polypeptide N-acetylgalactosaminyltransferase 5 [Loa loa]
Length = 682
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 157/360 (43%), Positives = 215/360 (59%), Gaps = 20/360 (5%)
Query: 20 YKEG----PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
YK+G PGE G P+ R D N SN IS R++P+ E C+
Sbjct: 166 YKQGDPNQPGEFGTGKLSPKE-RKLFDEGFKRNSFNEYVSNMISIHRSLPNNTDELCQKA 224
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
Y DLP SVI+ FHNE +S L+RTVHS+++RTP L+EIILVDDFS L + LED
Sbjct: 225 SYRNDLPDTSVIICFHNEAWSVLLRTVHSVLERTPDHLLKEIILVDDFSDFDHLKKPLED 284
Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
Y+ +F GKVR+IR R GLIR R +GA + G+V+ +LD+HCE WL PLL I +
Sbjct: 285 YMSQF-GKVRIIRLENRMGLIRARLKGASVATGKVLTYLDSHCECMNRWLEPLLDRIAQN 343
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYR---GIFEWGMLYKENELPEREAKKRKYNSE 252
+ PVID I+ +T + Y H R G F WG+++ + LP+R+ + K +
Sbjct: 344 STNVVTPVIDTINLETLQ----YHLSSHRRLSVGGFNWGLVFNWHILPDRDYQAMKSRID 399
Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
P SPT AGGLF++DR +F +LGGYDPG +WG EN E+SFKIWMCGG +E VPCS +GH
Sbjct: 400 PIPSPTMAGGLFSIDRGYFEKLGGYDPGFDIWGSENLEISFKIWMCGGRLEVVPCSHVGH 459
Query: 313 VYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
++R PY + K + ++ N R+ E W D+ +K +Y R + +D GD+SE+
Sbjct: 460 IFRKKSPYKWRKGIN-----VLQRNNIRLAEVWLDD-YKEIYYNRINHKL-VDFGDVSER 512
>gi|170589103|ref|XP_001899313.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158593526|gb|EDP32121.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 636
Score = 281 bits (719), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 154/359 (42%), Positives = 218/359 (60%), Gaps = 16/359 (4%)
Query: 18 EPYKEGPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
+ ++G GEGGK + ++ D G + S+ I+ +R++ D+R CK
Sbjct: 111 DALRQGLGEGGKPVVVAISEFKKLRDDLYRINGYDAYISDLIALNRSVKDIRHSGCKNMV 170
Query: 77 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
Y LP V+ F+NE S+L+R+V+S+I R+P + EIILVDD S+KA L + LE++
Sbjct: 171 YLEKLPTVGVVFPFYNEHNSTLLRSVYSVINRSPKDIMREIILVDDGSTKAFLKEPLEEF 230
Query: 137 IQR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
+++ N V++IR +REGLIR R RGA+ +VIVFLDAH EV NWLPPL+ PI
Sbjct: 231 LKKAGLNHIVKVIRTEKREGLIRARQRGARHITADVIVFLDAHSEVNYNWLPPLVEPIAL 290
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
D K++ P ID ID T+E+R+ D RG F+W YK L E +K + P+
Sbjct: 291 DYKMVVCPFIDVIDCNTYEYRA---QDEGGRGSFDWEFNYKRLPLTE---DNKKNPTRPF 344
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
SP AGG FA+ R +F ELGGYD GL +WGGE +ELSFK+W C G++ PCSR+GH+Y
Sbjct: 345 HSPVMAGGYFAISRKWFWELGGYDEGLEIWGGEQYELSFKVWQCHGTMVDAPCSRVGHIY 404
Query: 315 RS-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
R ++P++ + D I+ NY+RV E W DE K + Y R P + +D GD+SEQ
Sbjct: 405 RCKYIPFSNPGIGD-----FISRNYRRVAEVWMDEYAK-FLYKRRPPLLTVDFGDLSEQ 457
>gi|341896063|gb|EGT51998.1| CBN-GLY-6 protein [Caenorhabditis brenneri]
Length = 617
Score = 281 bits (719), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 149/365 (40%), Positives = 224/365 (61%), Gaps = 13/365 (3%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
NL P + + EG G HL + D++ N+ S+ IS R++P++R
Sbjct: 89 ANLYSPHDDWGEG---GAGVSHLTPEQQKLADSTFAVNQFNLLVSDGISVRRSLPEIRKP 145
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
C+ +P +LP SVI+V+HNE +S+L+RTV S+I R+P + L+EIILVDDFS + L
Sbjct: 146 SCRNITFPDNLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLK 205
Query: 131 -QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
KL++ ++ ++++R+ ER GLIR R GA+E++G+V+ FLD+HCE WL PLL
Sbjct: 206 YPKLDESLKPLPTDIKIVRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLL 265
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
I +RK + PVID I+ T++++ E +RG F W + ++ +P AK+
Sbjct: 266 TRIKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPSSMAKEHLL 322
Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
+ + P +SPT AGGLF++DR +F ELG YDPG+ +WGGEN E+SF+IW CGG +E +PCS
Sbjct: 323 DPTGPIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCS 382
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG- 367
+GHV+R P++F + G ++ N RV E W DE K YFY P+A +
Sbjct: 383 HVGHVFRKSSPHDF---PGKSSGKILNANLLRVAEVWMDE-WKYYFYKLAPVAYRMRQSI 438
Query: 368 DISEQ 372
D+SE+
Sbjct: 439 DVSER 443
>gi|195124241|ref|XP_002006602.1| GI18492 [Drosophila mojavensis]
gi|193911670|gb|EDW10537.1| GI18492 [Drosophila mojavensis]
Length = 670
Score = 281 bits (719), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 160/372 (43%), Positives = 215/372 (57%), Gaps = 28/372 (7%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPE----AYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
L+PP ++ PGE GK LP+ + A D + N S+ IS R++PD R
Sbjct: 155 LDPPAANLEDSPGELGKPVILPKDMSPEMKKAVDDGWTKNAFNQYVSDLISVRRSLPDPR 214
Query: 69 MEECKYWD-YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
CK Y +LPK VI+ FHNE +S L+RTVHS++ R+P + + EIILVDDFS
Sbjct: 215 DAWCKDSALYLSNLPKTDVIICFHNEAWSVLIRTVHSVLDRSPPELIGEIILVDDFSDMP 274
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L ++LEDY + KV+++R +REGLIR R GA+ ++ VI +LD+HCE WL P
Sbjct: 275 HLKKQLEDYFASY-PKVKIVRGPQREGLIRARLLGAEYAKSPVITYLDSHCECAEGWLEP 333
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
LL I + + PVID ID T EF HYR G F+W + + + +P
Sbjct: 334 LLDRIARNSTTVVCPVIDVIDDTTLEF--------HYRDSSGVNVGGFDWNLQFSWHAVP 385
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
ERE K+ SEP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG
Sbjct: 386 EREKKRHNSTSEPVYSPTMAGGLFSIDRKFFERLGTYDSGFDIWGGENLELSFKTWMCGG 445
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
++E VPCS +GH++R PY + R ++ N R+ E W D+ K Y+Y R +
Sbjct: 446 TLEIVPCSHVGHIFRKRSPYKW-----RTGVNVLKKNSVRLAEVWMDDYAK-YYYQRIGM 499
Query: 361 AMFLDMGDISEQ 372
D GD+SE+
Sbjct: 500 DKG-DFGDVSER 510
>gi|432882423|ref|XP_004074023.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Oryzias latipes]
Length = 584
Score = 281 bits (719), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 161/375 (42%), Positives = 223/375 (59%), Gaps = 27/375 (7%)
Query: 7 DGKLGN---LEPPLEPYKEGPGEGGKAYHL---PEAYRAAGDASLGEYGMNMETSNHISF 60
DG L ++PP P PGE G+A L PE + + S+ Y +N+ S+ IS
Sbjct: 62 DGPLARALYIKPP--PDSSAPGEWGRATRLNLSPEE-KKLEEESVESYAINIFVSDKISL 118
Query: 61 DRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
R I D RMEEC K +DY LP SVI+ F+NE +S+L+RT+HS+++ TPA L+EII
Sbjct: 119 HRHIQDNRMEECRNKKFDY-RHLPTTSVIIAFYNEAWSTLLRTIHSVLETTPAILLKEII 177
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 178
L+DD+S + L +L +YI +VRLIR +REGL+R R GA + G+V+ FLD HC
Sbjct: 178 LIDDYSDRGYLKSQLAEYISNLQ-RVRLIRTNKREGLVRARLIGATYATGDVLTFLDCHC 236
Query: 179 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKEN 237
E W+ PLL I + + PVID ID+ ++EF EP G F+W + ++ +
Sbjct: 237 ECVPGWIEPLLERIAENASTIVCPVIDTIDWNSFEFYMQTGEP---MIGGFDWRLTFQWH 293
Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
+PE E K+RK ++P++SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W
Sbjct: 294 SVPESERKRRKSRTDPFRSPTMAGGLFAVSKVYFEYLGTYDMGMEVWGGENLELSFRVWQ 353
Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
CGGS+E PCS +GHV+ PY P N R E W D +K +FY R
Sbjct: 354 CGGSLEIHPCSHVGHVFPKKAPY---------ARPNFLQNTVRAAEVWMD-SYKHHFYNR 403
Query: 358 EPLAMFLDMGDISEQ 372
P A + GDI+E+
Sbjct: 404 NPPAKKENYGDITER 418
>gi|332025155|gb|EGI65335.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Acromyrmex
echinatior]
Length = 605
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 145/350 (41%), Positives = 210/350 (60%), Gaps = 9/350 (2%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
PGE G A +P A N+ S+ IS +R++ D+R+E CK Y LP
Sbjct: 103 PGEMGAAVAIPPENDAKQQELFKLNQFNLMASDMISLNRSLKDIRLEGCKNKKYLKYLPD 162
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L Q LEDY+
Sbjct: 163 TSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASEREHLKQDLEDYVITLPVP 222
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
+ R +R GLIR R GAK +G+VI FLDAHCE WL PLL+ I +DR + P+
Sbjct: 223 TYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSRIANDRHTVVCPI 282
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
ID I T+E+ S D + G F W + ++ + +RE +R + + P ++PT AGG
Sbjct: 283 IDVISDDTFEYISA--SDMTWGG-FNWKLNFRWYRVAQREMDRRNSDRTAPLRTPTMAGG 339
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E PCS +GHV+R PY F
Sbjct: 340 LFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHVFRDKSPYTF 399
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
++ + +N RV E W DE + ++Y P A +D+GD+SE+
Sbjct: 400 PGGVSKI----VLHNAARVAEVWMDE-WRDFYYAMNPGARNVDVGDVSER 444
>gi|383865231|ref|XP_003708078.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Megachile rotundata]
Length = 605
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 147/353 (41%), Positives = 211/353 (59%), Gaps = 9/353 (2%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
K PGE G A H+ A N+ S+ IS +R++ D+R+E CK YP
Sbjct: 100 KGKPGEMGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLRDIRLEGCKTKKYPKY 159
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP S+++VFHNE +S+L+RTV S+I R+P L+EIILVDD S + L Q LEDY++
Sbjct: 160 LPDTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTL 219
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
+ R +R GLIR R GAK +G+VI FLDAHCE WL PLLA I +R +
Sbjct: 220 PVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIAENRSTVV 279
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
P+ID I T+E+ + D + G F W + ++ + +RE +R + + P ++PT
Sbjct: 280 CPIIDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDRTAPLRTPTM 336
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E PCS +GHV+R P
Sbjct: 337 AGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHVFRDKSP 396
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
Y F +V + +N RV E W DE + ++Y P A + +GD+SE+
Sbjct: 397 YTFPGGVSKV----VLHNAARVAEVWMDE-WRDFYYAMNPGARNVAVGDVSER 444
>gi|170056941|ref|XP_001864259.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
gi|167876546|gb|EDS39929.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
Length = 606
Score = 281 bits (718), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 152/356 (42%), Positives = 210/356 (58%), Gaps = 13/356 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK HL + D + G N S+ IS +R++PD+R CK Y
Sbjct: 79 ERSRSGVGEHGKPGHLEKKDEEMQDKLFKKNGFNAVLSDLISLNRSLPDIRHPGCKKKKY 138
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
+LP SV++ F+NE +S+L+RT S++ R+P + + EIILVDD S+K L +L+ Y+
Sbjct: 139 LSELPTVSVVVPFYNEHWSTLLRTASSVLLRSPPELISEIILVDDCSTKEFLKDQLDRYV 198
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
KV++I ER GLI R GAK + +V++FLD+H E +NWLPPLL PI D +
Sbjct: 199 AENMPKVKVIHLPERSGLITARLAGAKAATADVLIFLDSHTEANVNWLPPLLEPIAEDYR 258
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
P ID + + T+E+R+ D RG F+W YK L ++ +EP++SP
Sbjct: 259 TCVCPFIDVVAWDTFEYRA---QDEGARGAFDWKFYYKRLPLLPKDLAN---PTEPFESP 312
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIW CGG + PCSR+GH+YR +
Sbjct: 313 IMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRGY 372
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMGDISEQ 372
P+ + D +T NYKRV E W DE +K Y Y R+ D GD+S+Q
Sbjct: 373 APFGNPRKKD-----FLTRNYKRVAEVWMDE-YKEYLYVRDRKKYDNTDAGDLSKQ 422
>gi|170039452|ref|XP_001847548.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
gi|167863025|gb|EDS26408.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
Length = 606
Score = 281 bits (718), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 152/356 (42%), Positives = 210/356 (58%), Gaps = 13/356 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE GK HL + D + G N S+ IS +R++PD+R CK Y
Sbjct: 79 ERSRSGVGEHGKPGHLEKKDEEMQDKLFKKNGFNAVLSDLISLNRSLPDIRHPGCKKKKY 138
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
+LP SV++ F+NE +S+L+RT S++ R+P + + EIILVDD S+K L +L+ Y+
Sbjct: 139 LSELPTVSVVVPFYNEHWSTLLRTASSVLLRSPPELISEIILVDDCSTKEFLKDQLDRYV 198
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
KV++I ER GLI R GAK + +V++FLD+H E +NWLPPLL PI D +
Sbjct: 199 AENMPKVKVIHLPERSGLITARLAGAKAATADVLIFLDSHTEANVNWLPPLLEPIAEDYR 258
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
P ID + + T+E+R+ D RG F+W YK L ++ +EP++SP
Sbjct: 259 TCVCPFIDVVAWDTFEYRA---QDEGARGAFDWKFYYKRLPLLPKDLAN---PTEPFESP 312
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIW CGG + PCSR+GH+YR +
Sbjct: 313 IMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRGY 372
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMGDISEQ 372
P+ + D +T NYKRV E W DE +K Y Y R+ D GD+S+Q
Sbjct: 373 APFGNPRKKD-----FLTRNYKRVAEVWMDE-YKEYLYVRDRKKYDNTDAGDLSKQ 422
>gi|118404432|ref|NP_001072705.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Xenopus
(Silurana) tropicalis]
gi|115313486|gb|AAI24052.1| polypeptide N-acetylgalactosaminyltransferase 4 [Xenopus (Silurana)
tropicalis]
gi|134026084|gb|AAI35912.1| polypeptide N-acetylgalactosaminyltransferase 4 [Xenopus (Silurana)
tropicalis]
Length = 582
Score = 281 bits (718), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 161/375 (42%), Positives = 219/375 (58%), Gaps = 26/375 (6%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLP--EAYRAAGDASLGEYGMNMETSNHI 58
+PV+K +PP +P PGE GKA L + D S+ +Y +N+ S+ I
Sbjct: 65 QPVYK--------KPPPDP--NMPGEWGKAARLELGPTEKKMQDESIEKYALNIYLSDQI 114
Query: 59 SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
S R I D RM ECK + LP SV++ F+NE S+L+RT+HS+++ +PA L EI
Sbjct: 115 SLHRHIMDNRMYECKSKTFNYRKLPTTSVVIAFYNEALSTLLRTIHSVLETSPAVLLREI 174
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
ILVDDFS K L +LEDYI + +VRLIR T+REGL+R R GA + G+V+ FLD H
Sbjct: 175 ILVDDFSDKVYLKSQLEDYIGGLD-RVRLIRTTKREGLVRARIIGATYAIGDVLTFLDCH 233
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKEN 237
CE WL PLL I + + PVID ID+ T+EF + G F+W + ++ +
Sbjct: 234 CECISGWLEPLLQRIGENETAVVCPVIDTIDWNTFEF--YMQTGEPMIGGFDWRLTFQWH 291
Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
+PE+E ++RK +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W
Sbjct: 292 AVPEKERQRRKSRIDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMEVWGGENLELSFRVWQ 351
Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
CGG++E PCS +GHV+ PY P N R E W D +K FY R
Sbjct: 352 CGGTLEIEPCSHVGHVFPKKAPY---------ARPNFLQNTARAAEVWMD-GYKELFYNR 401
Query: 358 EPLAMFLDMGDISEQ 372
P A + GDISE+
Sbjct: 402 NPPARKENYGDISER 416
>gi|427789023|gb|JAA59963.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 648
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 165/380 (43%), Positives = 222/380 (58%), Gaps = 28/380 (7%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYR---AAGDASLGEYGMNMETSNHIS 59
V A +G L PP P +GPGE G+ L + + A N S+ IS
Sbjct: 120 VDHAPAPVGVLAPPQNP--DGPGEMGRPVVLKDLTKEQEAKVKQGWDRNAFNQYISDMIS 177
Query: 60 FDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
R++PD+R ECK Y DLP SVI+ FHNE +S L+RTVHSII R+P + L EIIL
Sbjct: 178 LHRSLPDVRDSECKDERYLKDLPSTSVIVCFHNEAWSVLLRTVHSIIDRSPPKLLHEIIL 237
Query: 120 VDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
VDD+S L QKLEDY+ F KV+++R +REGLIR R GA + V+ +LD+HCE
Sbjct: 238 VDDYSDMPHLKQKLEDYVAHFP-KVKIVRAQKREGLIRARLLGAAAATAPVLTYLDSHCE 296
Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGM 232
WL PLL I + + PVID I T+E+ HYR G F+W +
Sbjct: 297 CTEGWLEPLLDRIARNSTTVVCPVIDVISDSTFEY--------HYRDSGGVNVGGFDWNL 348
Query: 233 LYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELS 292
+ + +PERE ++RK++ +P SPT AGGLF++D+AFF +LG YD G +WGGEN ELS
Sbjct: 349 QFSWHAVPERERQRRKHSWDPVWSPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 408
Query: 293 FKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKA 352
FK WMCGG++E VPCS +GH++R PY + R ++ N R+ E W DE +K
Sbjct: 409 FKTWMCGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLRRNSVRLAEVWLDE-YKQ 462
Query: 353 YFYTREPLAMFLDMGDISEQ 372
Y+Y R + D GD+S +
Sbjct: 463 YYYQRIGDDLG-DFGDVSAR 481
>gi|312068074|ref|XP_003137043.1| polypeptide N-acetylgalactosaminyltransferase [Loa loa]
Length = 547
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 142/340 (41%), Positives = 209/340 (61%), Gaps = 10/340 (2%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLD 80
G GE G+ L EA D + N+ S+ I+ +R++PD+R +C+ Y +
Sbjct: 73 GAGEDGRPVKLSEADERLSDDTFAINQFNLVVSDRIALNRSLPDIRKHQCRAKTYLPSSE 132
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP SVI+V+HNE FS+LMRTV S+I R+P + L+EIILVDDFS++ L +L++++ +
Sbjct: 133 LPTTSVIIVYHNEAFSTLMRTVMSVILRSPHENLKEIILVDDFSTRTFLKAELDNFVAQL 192
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
+++IR ER GLIR R GA E++G+V+ FLD+HCE W+ PLLA I +RK +
Sbjct: 193 GTHIKVIRANERVGLIRARLIGATEAKGDVLTFLDSHCECTKGWMEPLLARIKENRKAVV 252
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
PVID I+ +T+ ++ E +RG F W + ++ LP K R + ++P SPT
Sbjct: 253 CPVIDVINERTFAYQKGIEL---FRGGFNWNLQFRWYALPPEMIKSRSNDPTKPIISPTM 309
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLF++DR +F E+G YD + +WGGEN E+S ++W CGG IE +PCS +GHV+R P
Sbjct: 310 AGGLFSIDRKYFEEIGTYDHEMNIWGGENIEISLRVWQCGGRIEILPCSHVGHVFRRASP 369
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
++F G ++ N RV E W DE K +FY P
Sbjct: 370 HDF---PSHKSGTILNSNLLRVAEVWMDE-WKFHFYRTAP 405
>gi|194749276|ref|XP_001957065.1| GF24250 [Drosophila ananassae]
gi|190624347|gb|EDV39871.1| GF24250 [Drosophila ananassae]
Length = 662
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 156/353 (44%), Positives = 213/353 (60%), Gaps = 15/353 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLG-EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
G GE G+A L + + + + E G N S+ IS +R++ D+R ++C+ +Y L
Sbjct: 138 GLGEQGQAASLDDESQIETEKRMSLENGFNALLSDSISVNRSLNDIRHKQCRKKEYLTQL 197
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI++F NE S LMR+VHS+I R+P + L+EIILVDD+S + L LE YI
Sbjct: 198 PTVSVIIIFWNEYLSVLMRSVHSLINRSPPELLKEIILVDDYSDREYLGHDLEAYIANHF 257
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
VR++R R GLI RS GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 258 KIVRVVRLPRRTGLIGARSEGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 317
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHA 260
P ID ID+ +++R+ D RG F+W YK LPE K+ ++P+KSP A
Sbjct: 318 PFIDVIDHSNFQYRA---QDEGARGAFDWEFYYKRLRLLPE----DLKHPADPFKSPVMA 370
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+ FF ELGGYD GL +WGGE +ELSFKIWMCGG + PCSRIGH+YR +
Sbjct: 371 GGLFAISAEFFWELGGYDEGLDIWGGEQYELSFKIWMCGGQMYDAPCSRIGHIYRGPRNH 430
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
N KG + NYKRV E W DE +K Y Y+ + + +D GD++ Q
Sbjct: 431 N----PSPRKGDYLHRNYKRVAEVWMDE-YKNYLYSHGDGIYERVDAGDLTAQ 478
>gi|291243602|ref|XP_002741690.1| PREDICTED: polypeptide GalNAc transferase 5-like [Saccoglossus
kowalevskii]
Length = 753
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 153/372 (41%), Positives = 225/372 (60%), Gaps = 14/372 (3%)
Query: 11 GNLEP-----PLEP--YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRT 63
GN+ P PLE Y + PGEGG L A+ ++ N+ S+ I+ +RT
Sbjct: 221 GNILPQLGHRPLEQPWYPDSPGEGGMPVDLTPQEARLSKATFYQFEFNIIASDKIALNRT 280
Query: 64 IPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 123
+PD R C++ +YP LPK SVI+VFHNE +++L+RTV S+I R+P Q LEEI+LVDD
Sbjct: 281 LPDSRPVACEHREYPHILPKTSVIIVFHNEAWTTLLRTVISVIDRSPWQLLEEILLVDDA 340
Query: 124 SSKAD--LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 181
S+ L +L++Y+ + R+IR +R GLI+ R RG +E+RGEV+ FLD+HCE
Sbjct: 341 STSEKYWLQSELDEYVAKLPVITRVIRTGKRVGLIQGRLRGVEEARGEVLTFLDSHCECN 400
Query: 182 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 241
+ WL PLL+ I +DR + P +D I +T+ + + +P+ G F W + +K LP+
Sbjct: 401 IGWLEPLLSEIVNDRTTVVAPNLDVISDKTFGY-TFIKPEQTMIGGFGWLVDFKWYSLPK 459
Query: 242 REAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
RE + + S P ++PT AGGLFA+D +F +G YDPG WG EN ELSF++W CGG
Sbjct: 460 RERLRVNNDMSRPLRTPTIAGGLFAIDADYFHRIGLYDPGFDTWGAENLELSFRVWQCGG 519
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
++E VPCS +GHV+RS +PY + ++ G I N R+++ W D+ K +F P
Sbjct: 520 TLEIVPCSHVGHVFRSSIPYKYKD--NKNPGLTIAKNNMRLMDVWMDD-LKYFFLAILPH 576
Query: 361 AMFLDMGDISEQ 372
+ GD SE+
Sbjct: 577 YAEQEFGDTSER 588
>gi|350400046|ref|XP_003485719.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Bombus impatiens]
Length = 643
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 159/379 (41%), Positives = 224/379 (59%), Gaps = 23/379 (6%)
Query: 1 RPVFK-ADGKLGNLEP-PLEP---YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETS 55
R FK +D L L+P P++P +G E G + + + D Y N+ S
Sbjct: 90 RNAFKNSDKLLQQLQPVPVKPAVTLGQGLDELGMVKNFEDQRKR--DEGYKNYSFNILVS 147
Query: 56 NHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
++I R +PD R + C+ Y LP AS+++ F+NE + +L+R++HSII RTPA L
Sbjct: 148 DNIGLHRELPDTRHKLCEIQKYSSKLPNASIVICFYNEHYMTLLRSLHSIIDRTPASLLH 207
Query: 116 EIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
EIILV+D+S L +K+E YI FNGKV+ + +REGLIR R GA+++ GE+++FL
Sbjct: 208 EIILVNDWSDSKALHEKIETYIANNFNGKVKFFKTEKREGLIRARMFGARKATGEILIFL 267
Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 234
D+H EV W+ PLL+ I + I+ +PVID I+ T++ Y RG F WG+ +
Sbjct: 268 DSHIEVNKRWIEPLLSQIAHSKTIIAMPVIDIINPDTFQ----YTGSPLVRGGFNWGLHF 323
Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
K + +P + +P KSPT AGGLFAMDR +F +LG YD G+ +WGGEN E+SF+
Sbjct: 324 KWDNVPVGTFAHDEDFIKPIKSPTMAGGLFAMDRKYFTKLGEYDAGMDIWGGENLEISFR 383
Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAY 353
IWMCGGSIE +PCSR+GHV+R PY F + +K L RV W DE +K Y
Sbjct: 384 IWMCGGSIELIPCSRVGHVFRRRRPYGTFDQHDTMLKNSL------RVAHVWLDE-YKDY 436
Query: 354 FYTREPLAMFLDMGDISEQ 372
F +D GDISE+
Sbjct: 437 FLKN---VQKVDYGDISER 452
>gi|390347269|ref|XP_781402.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Strongylocentrotus purpuratus]
Length = 749
Score = 280 bits (716), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 143/352 (40%), Positives = 211/352 (59%), Gaps = 12/352 (3%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
GPGE G +A N S+ IS +R++PD+R CK +Y DLP
Sbjct: 251 GPGEHGAGVRTKLEEQAKVKIGWDHAYFNEYVSDMISVERSVPDVRHNLCKTKEYSDDLP 310
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
+ SVI+ F E +S+L+RTVHS++ R+P + + E++LVDDFS + L + L++Y+++
Sbjct: 311 RTSVIICFTEESWSTLLRTVHSVLNRSPPELIAEVLLVDDFSQRDYLKEPLDEYMKKL-P 369
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
KV+++R +REGLIR R GA+ ++G V+ FLD+H E + WL PLL I+ D + P
Sbjct: 370 KVKVVRLPKREGLIRARLIGAEMAQGPVLTFLDSHVECNVGWLEPLLQRIHDDPTNVVCP 429
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
ID ID ++E+ G F W M + N +PE EA++R S P +SP AGG
Sbjct: 430 AIDAIDATSFEYAG---SGATIIGAFNWEMKFTWNGIPEYEARRRDDESWPIRSPAMAGG 486
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LF++D+ FF +G YDPG +WG EN ELSFKIWMCGGS+E +PCSR+ H++R PY F
Sbjct: 487 LFSIDKDFFYRIGTYDPGFDIWGAENLELSFKIWMCGGSLEIIPCSRVAHIFRKQQPYKF 546
Query: 323 GKLADRVKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + T+ N R++ W DE ++ FY+ +P M + GD+S++
Sbjct: 547 P------DGNVKTFMRNTMRLVAVWVDEPYRDIFYSLKPQLMGQEYGDVSDR 592
>gi|405951291|gb|EKC19216.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Crassostrea
gigas]
Length = 613
Score = 280 bits (716), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 148/335 (44%), Positives = 198/335 (59%), Gaps = 10/335 (2%)
Query: 38 RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSS 97
+ A D + N S+ I F R IPD R +C+ +P S+I+ F NE S+
Sbjct: 102 QIARDEGYQNFAFNALVSDKIGFHRAIPDTRYPKCQDVTFPAINLDTSIIVCFFNEQPSA 161
Query: 98 LMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIR 157
L+R VHSI +TP + ++EIILVDD S+ DL ++E+Y+ + VRL+R EREGLIR
Sbjct: 162 LLRLVHSINDQTPQELVKEIILVDDSSTLDDLSCQIENYVNQHFNNVRLVRTPEREGLIR 221
Query: 158 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 217
R GA + G+V+VFLD+HCEV +WL PLL I D + VPVID I++ T E
Sbjct: 222 ARVFGANLASGQVLVFLDSHCEVNTDWLEPLLLRISHDPTTVVVPVIDIINHDTME---- 277
Query: 218 YEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGY 277
Y+ RG F WG+ + + LP+ E S+P SPT AGGLFAM R +F LG Y
Sbjct: 278 YQQSPLVRGGFNWGLHFSWDRLPDNEKNDPDLGSKPILSPTMAGGLFAMKRDYFHHLGEY 337
Query: 278 DPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYN 337
D G+ +WGGEN E+SF+IWMCGG +E +PCSR+GH++R PY K D N
Sbjct: 338 DLGMDIWGGENLEISFRIWMCGGKLEIIPCSRVGHIFRKRRPYGNPKGRDT-----FLKN 392
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
RV W D K+K YF + P A +D GDIS++
Sbjct: 393 SLRVANVWMD-KYKEYFLKQRPQAQVVDYGDISDR 426
>gi|156544564|ref|XP_001602677.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Nasonia vitripennis]
Length = 637
Score = 280 bits (715), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 158/372 (42%), Positives = 220/372 (59%), Gaps = 20/372 (5%)
Query: 6 ADGKLGNLEP-PLEP---YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
+D L L P P++P +G E G ++ E + + + N+ S+++S
Sbjct: 94 SDKLLQQLMPVPVKPSVTVGQGLDELGLVKNMDEQKKR--EEGYKSFAFNVLVSDNLSLH 151
Query: 62 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
R IPD R + CK Y LP AS+++ F+NE +++L+R+++SI+ RTP L EIIL++
Sbjct: 152 RDIPDTRHKLCKNQTYDQKLPNASIVICFYNEHYNTLLRSLYSILDRTPKHLLHEIILIN 211
Query: 122 DFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
DFS L +++ DY+ Q F+ KV+ R REGLIR R GAK++ GEV+VFLD+H EV
Sbjct: 212 DFSDSKSLHEQVRDYVKQNFDNKVKYYRTERREGLIRARMFGAKKATGEVLVFLDSHIEV 271
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
WL PLLA I R I+ +PVID I+ T+++ S RG F WG+ +K + LP
Sbjct: 272 NKMWLEPLLARISHSRTIVPMPVIDIINADTFQYSS----SPLVRGGFNWGLHFKWDSLP 327
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
+ +P KSPT AGGLFAMDR +F ELG YD G+ VWGGEN E+SF+IWMCGG
Sbjct: 328 IGTLSLEQDFVKPIKSPTMAGGLFAMDRKYFFELGEYDAGMDVWGGENLEISFRIWMCGG 387
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
SIE +PCSR+GHV+R PY D + N RV W D+ +K YF
Sbjct: 388 SIELIPCSRVGHVFRRRRPYGGNDQQD-----TMLKNSLRVAYVWMDQ-YKKYFLKN--- 438
Query: 361 AMFLDMGDISEQ 372
+D GDI+E+
Sbjct: 439 VKKIDYGDITER 450
>gi|393911417|gb|EFO27036.2| polypeptide N-acetylgalactosaminyltransferase [Loa loa]
Length = 597
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 142/340 (41%), Positives = 209/340 (61%), Gaps = 10/340 (2%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLD 80
G GE G+ L EA D + N+ S+ I+ +R++PD+R +C+ Y +
Sbjct: 62 GAGEDGRPVKLSEADERLSDDTFAINQFNLVVSDRIALNRSLPDIRKHQCRAKTYLPSSE 121
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP SVI+V+HNE FS+LMRTV S+I R+P + L+EIILVDDFS++ L +L++++ +
Sbjct: 122 LPTTSVIIVYHNEAFSTLMRTVMSVILRSPHENLKEIILVDDFSTRTFLKAELDNFVAQL 181
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
+++IR ER GLIR R GA E++G+V+ FLD+HCE W+ PLLA I +RK +
Sbjct: 182 GTHIKVIRANERVGLIRARLIGATEAKGDVLTFLDSHCECTKGWMEPLLARIKENRKAVV 241
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
PVID I+ +T+ ++ E +RG F W + ++ LP K R + ++P SPT
Sbjct: 242 CPVIDVINERTFAYQKGIEL---FRGGFNWNLQFRWYALPPEMIKSRSNDPTKPIISPTM 298
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLF++DR +F E+G YD + +WGGEN E+S ++W CGG IE +PCS +GHV+R P
Sbjct: 299 AGGLFSIDRKYFEEIGTYDHEMNIWGGENIEISLRVWQCGGRIEILPCSHVGHVFRRASP 358
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
++F G ++ N RV E W DE K +FY P
Sbjct: 359 HDF---PSHKSGTILNSNLLRVAEVWMDE-WKFHFYRTAP 394
>gi|256071383|ref|XP_002572020.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
Length = 697
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 161/379 (42%), Positives = 221/379 (58%), Gaps = 19/379 (5%)
Query: 4 FKADGKLGNLEPPLEP-----YKEGPGEGGKAY-----HLPEAYRAAGDASLGEYGMNME 53
A KLG L P P Y GPGEGGKAY L A + D + N
Sbjct: 163 LSAIAKLG-LSPSTPPPRSDEYSTGPGEGGKAYTINREDLSPAEQIIFDKGWEDNAYNQY 221
Query: 54 TSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 113
S+ IS R +PD R CK Y +LP AS+I+ FHNE +S L+R+VHS+I R+P
Sbjct: 222 ASDRISVRRYLPDYREGTCKVNQYGSNLPSASIIICFHNEAWSVLLRSVHSVIDRSPPNL 281
Query: 114 LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 173
L+EIILVDDFS + L + LE+Y+ N V+++R +REGLIR R GA+ S G+V+VF
Sbjct: 282 LQEIILVDDFSDRPHLKEALEEYMGMLN-IVKIVRTKQREGLIRARMIGAELSTGKVLVF 340
Query: 174 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 233
LD+H E WL PLL I + I+ VPVI I+ +T + + + D+ G F+W +
Sbjct: 341 LDSHIECTTGWLEPLLDRIAYNSSIVVVPVISTINDKTLKM-NFLKADNVQVGGFDWSLT 399
Query: 234 YKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSF 293
++ +E ER+ + P +SPT AGGLFA+ R +F LG YD G+ +WGGEN ELSF
Sbjct: 400 FRWHEQTERDRNRSGAPYSPVRSPTMAGGLFAISREYFSHLGKYDSGMEIWGGENLELSF 459
Query: 294 KIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAY 353
K+WMCGG +E V CS +GH++R PY + VK PL N R+ + W D+ +K +
Sbjct: 460 KVWMCGGILETVVCSLVGHIFRGRSPYKWNV---NVKDPL-KRNLLRLADVWLDD-YKRF 514
Query: 354 FYTREPLAMFLDMGDISEQ 372
+Y R +D GD+SE+
Sbjct: 515 YYARIGFKT-IDFGDVSER 532
>gi|242011902|ref|XP_002426682.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212510853|gb|EEB13944.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 605
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 140/323 (43%), Positives = 207/323 (64%), Gaps = 9/323 (2%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N+ S IS +R++PD+R + CK Y LP SV++VFHNE +S+L+RTV S+I R+P
Sbjct: 127 NLLASERISLNRSLPDVRAKGCKTKKYFELLPTTSVVIVFHNEAWSTLLRTVWSVINRSP 186
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD S + L +KLE+Y++ V ++R +R GLIR R GAK +G+V
Sbjct: 187 KPLIKEIILVDDASVQPHLGKKLENYVKTLPVPVTVLRTPKRSGLIRARLLGAKHVKGQV 246
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
I FLDAHCE WL PLLA I DRK + P+ID I +T+E+ + D + G F W
Sbjct: 247 ITFLDAHCECTEGWLEPLLARITEDRKTVVCPIIDVISDETFEY--ITASDTTWGG-FNW 303
Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+ ++ +P+RE +R + + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN
Sbjct: 304 RLNFRWYRVPKREMDRRNNDKTVPIRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENL 363
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGG++E VPCS +GHV+R PY F ++ + +N RV E W DE
Sbjct: 364 EMSFRVWQCGGTLEIVPCSHVGHVFRDKSPYTFPGGVSQI----VLHNANRVAEVWMDE- 418
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+ ++Y P A +++GDI+ +
Sbjct: 419 WRDFYYAMNPGAKKIEVGDITSR 441
>gi|195020976|ref|XP_001985304.1| GH16989 [Drosophila grimshawi]
gi|193898786|gb|EDV97652.1| GH16989 [Drosophila grimshawi]
Length = 682
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 161/355 (45%), Positives = 215/355 (60%), Gaps = 17/355 (4%)
Query: 23 GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
G GE GK L E+ R E G N S+ IS +R++PD+R +C+ Y L
Sbjct: 155 GIGEQGKIAKLDDESVRENEQKVSIENGFNGLLSDSISVNRSLPDIRHIDCRKKLYLRKL 214
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF- 140
P SV+++F +E S LMR+VHS+I R+P + L+EIILVDDFS +A L+++LEDYI
Sbjct: 215 PTVSVVIIFFDEYLSVLMRSVHSLINRSPPELLKEIILVDDFSDRAYLNKELEDYIVNHF 274
Query: 141 -NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
G VR++R +R GLI RS GA+ + +V++FLD+H E NWLPPLL PI +++
Sbjct: 275 AVGLVRVVRLPQRTGLIGARSAGARNATADVLIFLDSHVEANYNWLPPLLEPIAINKRAA 334
Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKKRKYNSEPYKSPT 258
P ID ID+ + +R+ D RG F+W YK L PE K+ +EP+KSP
Sbjct: 335 VCPFIDVIDHSNFNYRA---QDEGARGGFDWQFFYKRLPLLPE----DLKHPTEPFKSPV 387
Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIWMCGG + PCSR+GH+YR
Sbjct: 388 MAGGLFAISAEFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRVGHIYRG-- 445
Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
P + KG + NYKRV E W DE +K Y Y E + +D GD++ Q
Sbjct: 446 PRK--SIPSPRKGDYLHKNYKRVAEVWMDE-YKNYLYANGEGIYERVDAGDLTAQ 497
>gi|350645519|emb|CCD59759.1| n-acetylgalactosaminyltransferase, putative [Schistosoma mansoni]
Length = 654
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 161/379 (42%), Positives = 221/379 (58%), Gaps = 19/379 (5%)
Query: 4 FKADGKLGNLEPPLEP-----YKEGPGEGGKAY-----HLPEAYRAAGDASLGEYGMNME 53
A KLG L P P Y GPGEGGKAY L A + D + N
Sbjct: 163 LSAIAKLG-LSPSTPPPRSDEYSTGPGEGGKAYTINREDLSPAEQIIFDKGWEDNAYNQY 221
Query: 54 TSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 113
S+ IS R +PD R CK Y +LP AS+I+ FHNE +S L+R+VHS+I R+P
Sbjct: 222 ASDRISVRRYLPDYREGTCKVNQYGSNLPSASIIICFHNEAWSVLLRSVHSVIDRSPPNL 281
Query: 114 LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 173
L+EIILVDDFS + L + LE+Y+ N V+++R +REGLIR R GA+ S G+V+VF
Sbjct: 282 LQEIILVDDFSDRPHLKEALEEYMGMLN-IVKIVRTKQREGLIRARMIGAELSTGKVLVF 340
Query: 174 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 233
LD+H E WL PLL I + I+ VPVI I+ +T + + + D+ G F+W +
Sbjct: 341 LDSHIECTTGWLEPLLDRIAYNSSIVVVPVISTINDKTLKM-NFLKADNVQVGGFDWSLT 399
Query: 234 YKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSF 293
++ +E ER+ + P +SPT AGGLFA+ R +F LG YD G+ +WGGEN ELSF
Sbjct: 400 FRWHEQTERDRNRSGAPYSPVRSPTMAGGLFAISREYFSHLGKYDSGMEIWGGENLELSF 459
Query: 294 KIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAY 353
K+WMCGG +E V CS +GH++R PY + VK PL N R+ + W D+ +K +
Sbjct: 460 KVWMCGGILETVVCSLVGHIFRGRSPYKWNV---NVKDPL-KRNLLRLADVWLDD-YKRF 514
Query: 354 FYTREPLAMFLDMGDISEQ 372
+Y R +D GD+SE+
Sbjct: 515 YYARIGFKT-IDFGDVSER 532
>gi|395539756|ref|XP_003771832.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 11 [Sarcophilus
harrisii]
Length = 970
Score = 279 bits (714), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 147/332 (44%), Positives = 200/332 (60%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ N+ SN + + R +PD R ECK YP LP AS+++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNLLISNRLGYHRDVPDTRNAECKEKSYPTGLPAASIVICFYNEAFSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS+I RTPA L EIILVDD S DL +L+DY+Q++ GK++++RN + EGLI R
Sbjct: 171 VHSVIDRTPAHLLHEIILVDDNSEFDDLKGELDDYVQKYLPGKIQVVRNEKGEGLIXGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA GEV+VFLD+HCEV WL PLL PI+ D + + PVID I T +Y
Sbjct: 231 IGAAHGTGEVLVFLDSHCEVNKMWLQPLLVPIHEDHRTVVCPVIDIISADTL----MYSS 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
G F W + +K + +P + + P KSP AGGLFAM+R +F ELG YD G
Sbjct: 287 SPIVCGGFNWDLHFKWDLVPFSKLGGPEGAIAPIKSPAMAGGLFAMNRHYFNELGQYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDT-----MTNNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L + G+ISE+
Sbjct: 402 MAHVWLDEYKEQYFSLRPELKL-KSYGNISER 432
Score = 273 bits (699), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 142/298 (47%), Positives = 190/298 (63%), Gaps = 11/298 (3%)
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
YP LP AS+++ F+NE FS+L+RTVHS+I RTPA L EIILVDD S DL +L+D
Sbjct: 507 SYPTGLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDNSEFDDLKGELDD 566
Query: 136 YIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
Y+Q++ GK++++RN +REGLIR R GA + GEV+VFLD+HCEV WL PLL PI+
Sbjct: 567 YVQKYLPGKIQVVRNEKREGLIRGRMIGAAHATGEVLVFLDSHCEVNKMWLQPLLVPIHE 626
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
D + + PVID I T +Y RG F WG+ +K + +P E + P
Sbjct: 627 DHRTVVCPVIDIISADTL----MYSSSPIVRGGFNWGLHFKWDLVPFSELGGPEGAIAPI 682
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PCSR+GH++
Sbjct: 683 KSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIF 742
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
R PY + D +T+N R+ W DE + YF R L + G+ISE+
Sbjct: 743 RKRRPYGSPEGQDT-----MTHNSLRLAHVWLDEYKEQYFSLRPELKL-KSYGNISER 794
>gi|410968681|ref|XP_003990830.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Felis
catus]
Length = 546
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 144/304 (47%), Positives = 195/304 (64%), Gaps = 9/304 (2%)
Query: 68 RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
R + CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S +
Sbjct: 91 RFDRCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERD 150
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L LE+Y++ V++IR ER GLIR R RGA SRG+VI FLDAHCE L WL P
Sbjct: 151 FLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASRGQVITFLDAHCECTLGWLEP 210
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 247
LLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +R
Sbjct: 211 LLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRR 267
Query: 248 KYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
K + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V
Sbjct: 268 KGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVT 327
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDM 366
CS +GHV+R PY F G +I N +R+ E W DE K +FY P + +D
Sbjct: 328 CSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDY 382
Query: 367 GDIS 370
GD+S
Sbjct: 383 GDVS 386
>gi|344276552|ref|XP_003410072.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Loxodonta africana]
Length = 527
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 154/332 (46%), Positives = 205/332 (61%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ NM SN + + R +PD R CK YPLDLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKEKSYPLDLPAASVVICFYNEAFSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
VHS+ RTPA L EIILVDD S DL +L++Y+Q++ GK ++IRN +REGLIR R
Sbjct: 171 VHSVTDRTPAHLLHEIILVDDDSDLDDLKGELDEYVQKYLPGKTKVIRNKKREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA ++ GEV+VFLD+HCEV WL PLLA + D + PVID I T +Y
Sbjct: 231 IGAAQATGEVLVFLDSHCEVNEMWLQPLLAAVREDPHTVVCPVIDIISADTL----LYSS 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPIVRGGFNWGLHFKWDLVPFDELGGPEGATAPIKSPTMAGGLFAMNRHYFSELGQYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE + YF R L G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-RSYGNISER 432
>gi|348568069|ref|XP_003469821.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Cavia porcellus]
Length = 608
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 152/332 (45%), Positives = 200/332 (60%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ N+ SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNVLISNRLGYHRDVPDTRNAACKEQSYPADLPVASVVICFYNEAFSALLRT 170
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRS 160
VHS++ RTPA L EIILVDD S DL +L++Y+Q+ K+++IRN +REGLIR R
Sbjct: 171 VHSVLDRTPAYLLHEIILVDDDSDFDDLKGELDEYVQKSLPTKIKVIRNAKREGLIRGRM 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + GEV+VFLD+HCEV WL PLLA I D + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNEMWLQPLLATIRGDPHTVVCPVIDIISADTL----AYSS 286
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P E + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELGGEDGATAPIKSPTMAGGLFAMNRQYFNELGQYDSG 346
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGG + +PCSR+GH++R PY + D +T+N R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE YF R L G+ISE+
Sbjct: 402 LAHVWLDEYKDQYFSLRPDLKT-KSYGNISER 432
>gi|308485401|ref|XP_003104899.1| CRE-GLY-5 protein [Caenorhabditis remanei]
gi|308257220|gb|EFP01173.1| CRE-GLY-5 protein [Caenorhabditis remanei]
Length = 685
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 166/407 (40%), Positives = 225/407 (55%), Gaps = 59/407 (14%)
Query: 1 RPVFKADGKLGNLEPPLEP-YKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGM 50
+PVF D P +P YK+G GE GKA L +A D +
Sbjct: 95 KPVFMVD--------PNDPIYKKGDANQAGELGKAVVVDKTKLTSEQKAIYDKGMLNNAF 146
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ IS RT+P ECK Y +LP+ SVI+ FHNE +S L+RTVHS+++RTP
Sbjct: 147 NQYASDMISVHRTLPTNIDAECKVEKYNENLPRTSVIVCFHNEAWSVLLRTVHSVLERTP 206
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
LEEI+LVDDFS + LE+Y+ +F GKV+++R +REGLIR R RGA + GEV
Sbjct: 207 EHLLEEIVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAIATGEV 266
Query: 171 IVFLDAHCE-----------------VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWE 213
+ +LD+HCE W+ PLL I D + PVID ID T+E
Sbjct: 267 LTYLDSHCECMEGKETENRVRTRNKKCKKRWIEPLLDRIKRDPTTVVCPVIDVIDDNTFE 326
Query: 214 FRSVYEPDHHYR------GIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMD 267
+ HH + G F+WG+ + + +PER+ K R +P +SPT AGGLF++D
Sbjct: 327 Y-------HHSKAYFTSVGGFDWGLQFNWHSIPERDRKNRTRAIDPVRSPTMAGGLFSID 379
Query: 268 RAFFLELGGYDPGLLVWGGENFELSFK----IWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
+ +F +LG YDPG +WGGEN ELSFK IWMCGG++E VPCS +GHV+R PY +
Sbjct: 380 KKYFEKLGTYDPGFDIWGGENLELSFKVRKCIWMCGGTLEIVPCSHVGHVFRKRSPYKW- 438
Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
R ++ N R+ E W D+ +K Y+Y R D GD+S
Sbjct: 439 ----RTGVNVLKRNSIRLAEVWLDD-YKTYYYERIN-NQLGDFGDVS 479
>gi|432934421|ref|XP_004081934.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Oryzias latipes]
Length = 758
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 153/352 (43%), Positives = 211/352 (59%), Gaps = 12/352 (3%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
G+ G+ LP + E N+ S+ I DR IPD R E C DLP
Sbjct: 257 GQFGRGVILPSSEDEEVRKRWDEGHFNVYLSDRIPVDRAIPDTRPEVCSQAVVHDDLPST 316
Query: 85 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 144
SVI F +E +S+L+R+VHS++ R+P L+EIILVDDFS+K L + L+ Y+ +F KV
Sbjct: 317 SVIFCFVDEVWSTLLRSVHSVLNRSPPHLLKEIILVDDFSTKDYLKEPLDKYMSQF-PKV 375
Query: 145 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 204
R++R ER+GLIR R GA + GEV+ FLD+H E + WL PLL IY DR+ + PVI
Sbjct: 376 RIVRLKERQGLIRARLAGAAVATGEVLTFLDSHVECNVGWLEPLLERIYLDRRKVPCPVI 435
Query: 205 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGL 263
+ I+ + + + D+ RGIF+W +++ N L E +K S+P + P AGGL
Sbjct: 436 EVINDKDMSYMLI---DNFQRGIFKWPLVFGWNALSEDYIRKHNITVSDPIRCPVMAGGL 492
Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
F++D+ +F ELG YDPGL VWGGEN E+SFKIWMCGG IE +PCSR+GH++R PY+F
Sbjct: 493 FSIDKKYFYELGTYDPGLDVWGGENMEISFKIWMCGGEIEIIPCSRVGHIFRGQNPYSFP 552
Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYF---YTREPLAMFLDMGDISEQ 372
K DR K + N RV E W DE ++ Y D+G+++EQ
Sbjct: 553 K--DRQKT--VERNLARVAEVWLDEYKDLFYGHGYQHLLDKSVTDIGNLTEQ 600
>gi|340727930|ref|XP_003402286.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Bombus terrestris]
Length = 643
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 156/373 (41%), Positives = 221/373 (59%), Gaps = 22/373 (5%)
Query: 6 ADGKLGNLEP-PLEP---YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
+D L L+P P++P +G E G + + + D Y N+ S++I
Sbjct: 96 SDKLLQQLQPVPVKPAVTLGQGLDELGMVKNFEDQRKR--DEGYKNYSFNILVSDNIGLH 153
Query: 62 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
R IPD R + C+ Y LP AS+++ F+NE + +L+R++HSII RTPA L EIILV+
Sbjct: 154 REIPDTRHKLCEIQKYSSKLPNASIVICFYNEHYMTLLRSLHSIIDRTPASLLHEIILVN 213
Query: 122 DFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
D+S L +K++ YI FNGKV+ + +REGLIR R GA+++ GEV++FLD+H EV
Sbjct: 214 DWSDSKALHEKIKTYIVNNFNGKVKFYKTEKREGLIRARMFGARKATGEVLIFLDSHIEV 273
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
W+ PLL+ I + I+ +P+ID I+ T++ Y RG F WG+ +K + +P
Sbjct: 274 NKRWIEPLLSQIAQSKTIVAMPIIDIINPDTFQ----YTGSPLVRGGFNWGLHFKWDNVP 329
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
+ +P KSPT AGGLFAMDR +F +LG YD G+ +WGGEN E+SF+IWMCGG
Sbjct: 330 VGTFAHDEDFIKPIKSPTMAGGLFAMDRKYFTKLGEYDAGMDIWGGENLEISFRIWMCGG 389
Query: 301 SIEWVPCSRIGHVYRSFMPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
SIE +PCSR+GHV+R PY F + +K L RV W DE +K YF
Sbjct: 390 SIELIPCSRVGHVFRRRRPYGTFDQHDTMLKNSL------RVAHVWLDE-YKDYFLKN-- 440
Query: 360 LAMFLDMGDISEQ 372
+D GDISE+
Sbjct: 441 -VQKVDYGDISER 452
>gi|307186272|gb|EFN71935.1| Polypeptide N-acetylgalactosaminyltransferase 35A [Camponotus
floridanus]
Length = 667
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 155/368 (42%), Positives = 221/368 (60%), Gaps = 20/368 (5%)
Query: 10 LGNLEP-PLEP---YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
L L+P P++P +G E G ++ + + Y N+ S+++ R +P
Sbjct: 124 LKQLQPAPVKPAVTLDQGLDELGMVKNMEDQQKRT--IGYKNYAFNVLISDNLGVRRNVP 181
Query: 66 DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
D R + CK Y +LP AS+I+ F+NE +++L+R++HSI++RTPA L EIILV+DFS
Sbjct: 182 DTRHKLCKTQKYSSNLPNASIIICFYNEHYTTLLRSLHSILERTPAALLHEIILVNDFSD 241
Query: 126 KADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
L +K+ YI+ F KVRL + +REGLIR R GA+++ G+V++FLD+H EV W
Sbjct: 242 SDILHEKIHAYIKNNFGAKVRLFKTKKREGLIRARVFGARKATGDVLIFLDSHIEVNEIW 301
Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREA 244
+ PLL+ I + I+ +PVID I+ T++ Y RG F WG+ +K + LP
Sbjct: 302 IEPLLSRIAYSKTIVPMPVIDIINADTFQ----YTGSPLVRGGFNWGLHFKWDNLPIGTL 357
Query: 245 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
K +P KSPT AGGLFA+DR +F+++G YD G+ VWGGEN E+SF+IWMCGGSIE
Sbjct: 358 KHENDFVKPIKSPTMAGGLFAIDREYFIKIGEYDTGMDVWGGENLEISFRIWMCGGSIEL 417
Query: 305 VPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
+PCSR+GHV+R PY D + N RV W DE +K YF A +
Sbjct: 418 IPCSRVGHVFRRRRPYGSDDPHDT-----MLKNSLRVAHVWMDE-YKDYFLKN---AKAI 468
Query: 365 DMGDISEQ 372
D GDISE+
Sbjct: 469 DYGDISER 476
>gi|426224267|ref|XP_004006295.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Ovis
aries]
Length = 582
Score = 278 bits (712), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 153/352 (43%), Positives = 208/352 (59%), Gaps = 18/352 (5%)
Query: 25 GEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD-L 81
GE GKA L E+ + + Y +N+ S+ IS R I D RM ECK + L
Sbjct: 79 GEWGKASKLQLSESELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSKKFNYRRL 138
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L +LE Y+ +
Sbjct: 139 PTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLEAYVSNLD 198
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
+VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL I+ D ++
Sbjct: 199 -RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLERIHKDETVVIC 257
Query: 202 PVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
PVID ID+ T+EF EP G F+W + ++ + +P+ E +RK EP++SPT A
Sbjct: 258 PVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKSRIEPFRSPTMA 314
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS +GHV+ PY
Sbjct: 315 GGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPKRAPY 374
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
P N R E W DE +K +FY R P A GDISE+
Sbjct: 375 ---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDISER 416
>gi|348519900|ref|XP_003447467.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Oreochromis niloticus]
Length = 777
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 146/306 (47%), Positives = 199/306 (65%), Gaps = 10/306 (3%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N+ S+ I DR IPD R + C+ DLP SVI F +E +S+L+R+VHS++ R+P
Sbjct: 304 NVYLSDKIPVDRAIPDTRPQMCEQSLVHDDLPSTSVIFCFVDEVWSTLLRSVHSVLNRSP 363
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
L+EIILVDDFS+K L ++L+DY+ +F KVR++R ER+GLIR R GA ++GEV
Sbjct: 364 PHLLKEIILVDDFSTKDYLKKQLDDYMAQF-PKVRIVRLKERQGLIRARLAGAAVAKGEV 422
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+H E + WL PLL +Y DRK + PVI+ I + + V D+ RGIF+W
Sbjct: 423 LTFLDSHIECNVGWLEPLLERVYLDRKKVPCPVIEVISDKDMSYMMV---DNFQRGIFKW 479
Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++ + +P + KK S+P + P AGGLF++D+ +F ELG YDPGL VWGGEN
Sbjct: 480 PLVFGWSAVPPEDIKKFNLTISDPIRCPVMAGGLFSIDKQYFFELGTYDPGLDVWGGENM 539
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SFKIWMCGG IE +PCSR+GH++R PY F K DR K + N RV E W DE
Sbjct: 540 EISFKIWMCGGEIEIIPCSRVGHIFRGQNPYKFPK--DRQK--TVERNLARVAEVWLDE- 594
Query: 350 HKAYFY 355
+K FY
Sbjct: 595 YKDLFY 600
>gi|242020636|ref|XP_002430758.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212515955|gb|EEB18020.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 623
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 142/336 (42%), Positives = 208/336 (61%), Gaps = 11/336 (3%)
Query: 38 RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSS 97
R D + N+ S+ I R +PD R CK Y +LP ASVI+ F+NE F++
Sbjct: 114 RRKRDEGYKNFAFNILVSDAIGIHRELPDTRHNLCKKKKYSKNLPTASVIICFYNEHFTT 173
Query: 98 LMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLI 156
L+R+++S+++RTP+ L+EIILV+DFS A L + + +Y+ F KV+L ++ +R GLI
Sbjct: 174 LLRSIYSVLERTPSYLLKEIILVNDFSDLAGLHRNISNYVNTNFTDKVKLFKSKKRLGLI 233
Query: 157 RTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRS 216
R R G++++ G+V+VFLD+H EV +NWL PLL+ I +K + VP+ID I+ T++
Sbjct: 234 RARIFGSRKASGDVLVFLDSHIEVNVNWLQPLLSRIVDSKKNVVVPIIDIINADTFK--- 290
Query: 217 VYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGG 276
Y RG F WG+ +K LP+ K + +P SPT AGGLFA++RA+F ELG
Sbjct: 291 -YSSSPLVRGGFNWGLHFKWENLPKSTLKSNEDFVKPILSPTMAGGLFAINRAYFKELGE 349
Query: 277 YDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY 336
YD G+ +WGGEN E+SF+IWMCGG++E +PCSR+GHV+R PY D +
Sbjct: 350 YDNGMNIWGGENLEISFRIWMCGGNLELIPCSRVGHVFRKRRPYGSPNGEDT-----MMR 404
Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
N RV W D+ +K +FY + P GDIS++
Sbjct: 405 NSLRVANVWMDD-YKEFFYKQHPEGKTFPFGDISDR 439
>gi|380030098|ref|XP_003698695.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Apis florea]
Length = 605
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 145/350 (41%), Positives = 209/350 (59%), Gaps = 9/350 (2%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
PGE G A H+ A N+ S+ IS +R++ D+R+E CK Y LP
Sbjct: 103 PGEMGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYSKYLPD 162
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
S+++VFHNE +S+L+RTV S+I R+P L+EIILVDD S + L Q LE Y++R
Sbjct: 163 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEHYVKRLPVP 222
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
+ R +R GLIR R GAK +G+VI FLDAHCE WL PLL+ I DR + P+
Sbjct: 223 TYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSRIAEDRTTVVCPI 282
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
ID I T+E+ + D + G F W + ++ + +RE +R + + P ++PT AGG
Sbjct: 283 IDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDRTAPLRTPTMAGG 339
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E PCS +GHV+R PY F
Sbjct: 340 LFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHVFRDKSPYTF 399
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+V + +N RV E W DE + ++Y P A + +GD+SE+
Sbjct: 400 PGGVSKV----VLHNAARVAEVWMDE-WRDFYYAMNPGARNVAVGDVSER 444
>gi|312377724|gb|EFR24483.1| hypothetical protein AND_10876 [Anopheles darlingi]
Length = 594
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 139/325 (42%), Positives = 201/325 (61%), Gaps = 10/325 (3%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
PGE GK +P + + E N+ S+ I +R++ D+R +CK YP LP
Sbjct: 92 PGEMGKPVKIPSSQQELMKEKFKENQFNLLASDMIWLNRSLTDVRHHDCKKKHYPAKLPT 151
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
S+++VFHNE +S+L+RT+ S+I R+P L+EIILVDD S + L ++LEDY++
Sbjct: 152 TSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASEREHLGRQLEDYVKTLPVS 211
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
++R +R GLIR R GAK +G+VI FLDAHCE WL PLLA I DRK + P+
Sbjct: 212 TIVLRTVKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIVLDRKTVVCPI 271
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
ID I +T+E+ V D + G F W + ++ +P RE ++R ++ + P ++PT AGG
Sbjct: 272 IDVISDETFEY--VTASDQTWGG-FNWKLNFRWYRVPAREMQRRNHDRTAPLRTPTMAGG 328
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG +E PCS +GHV+R PY F
Sbjct: 329 LFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIAPCSHVGHVFRDKSPYTF 388
Query: 323 -GKLADRVKGPLITYNYKRVIETWF 346
G +A+ ++ N RV E W
Sbjct: 389 PGGVAN-----IVLKNAARVAEVWM 408
>gi|326508656|dbj|BAJ95850.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 637
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 141/351 (40%), Positives = 215/351 (61%), Gaps = 10/351 (2%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
PGEGG++ +PE + E N+ S+ ++ +R+I D R C+ ++P DLP
Sbjct: 133 PGEGGRSVSIPENLKQEAKKRFPENQFNIVASDLMALNRSINDQRSSRCRSHEFPSDLPT 192
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD-LDQKLEDYIQRFNG 142
S+++VFHNEG S+L+RT+ SI+ R+P ++++EII+VDD S + L LE +++
Sbjct: 193 TSIVIVFHNEGNSTLLRTLTSIVMRSPTEFIQEIIMVDDASVDREYLKDILETFVKELPV 252
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
+V +IRNT+R GL+++R +GA+++ G+ + FLDAH E WL LL + DR + P
Sbjct: 253 RVEIIRNTQRLGLMKSRLKGAEKATGDTLTFLDAHIECSPGWLEYLLYEVKKDRTAVVCP 312
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAG 261
+ID I+ +F + D + G F W + ++ +P RE +R Y+ S P SPT AG
Sbjct: 313 IIDVINDD--DFAYLTGSDMTWGG-FNWRLNFRWYPVPNREEVRRNYDHSLPLLSPTMAG 369
Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
GLF +DR +F E+G YDPG+ VWGGEN E+SF++W CGG + PCS +GHV+R PY
Sbjct: 370 GLFTIDRKYFYEIGAYDPGMEVWGGENLEMSFRVWQCGGKVLIHPCSHVGHVFRKQTPYT 429
Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
F G +I +N KR++E W D K+K + Y P +D GD+SE+
Sbjct: 430 FPGGT----GKVIFHNNKRLVEVWLD-KYKDFVYAIMPELKNVDAGDVSER 475
>gi|440896822|gb|ELR48646.1| Polypeptide N-acetylgalactosaminyltransferase 4, partial [Bos
grunniens mutus]
Length = 566
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 155/363 (42%), Positives = 212/363 (58%), Gaps = 20/363 (5%)
Query: 14 EPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
+PP + + GE GKA L E+ + + Y +N+ S+ IS R I D RM E
Sbjct: 54 KPPADSH--ALGEWGKASKLQLSESELKQQEELIERYAINIYLSDRISLHRHIEDKRMYE 111
Query: 72 CKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
CK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L
Sbjct: 112 CKSKKFNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLK 171
Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
+LE Y+ + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL
Sbjct: 172 TQLETYVSNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLE 230
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
I D ++ PVID ID+ T+EF EP G F+W + ++ + +P+ E +RK
Sbjct: 231 RIRKDETVVICPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKS 287
Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
EP++SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS
Sbjct: 288 RIEPFRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSH 347
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHV+ PY P N R E W DE +K +FY R P A GDI
Sbjct: 348 VGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDI 397
Query: 370 SEQ 372
SE+
Sbjct: 398 SER 400
>gi|391345232|ref|XP_003746894.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Metaseiulus occidentalis]
Length = 585
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 150/326 (46%), Positives = 200/326 (61%), Gaps = 12/326 (3%)
Query: 47 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
++ N S I R +PD R CK Y DLP+ASVI+ F+NE +S+L+RTV+S++
Sbjct: 94 QHAFNTLVSERIGLRRRVPDTRDALCKQQKYSKDLPRASVIICFYNEAWSTLIRTVNSVL 153
Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
R+P+ L+EIILVDD S A+L + L ++Q+ + KVR+IR EREGLIR R GA S
Sbjct: 154 DRSPSALLQEIILVDDLSDIAEL-EPLAGFVQK-HEKVRVIRTREREGLIRARMIGAHNS 211
Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
G+V+VFLD+H EV WL PLL PI ++ +T PVID I+ T+E Y P +G
Sbjct: 212 TGDVLVFLDSHVEVNERWLQPLLVPIQQNQTTVTCPVIDIINADTFE----YSPSPLVKG 267
Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F WGM ++ + LP+ K K P SPT AGGLFA+ + F LG YD G+ VWGG
Sbjct: 268 GFNWGMHFRWDNLPKGYFKSEKERIAPLPSPTMAGGLFAIHKDEFRRLGEYDWGMDVWGG 327
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
EN ELSF+IWMCGGS++ +PCSR+GHV+R PY D + N RV W
Sbjct: 328 ENLELSFRIWMCGGSLKIMPCSRVGHVFRKRRPYGASNGED-----TLAKNSLRVANVWM 382
Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
D+ +K Y+Y P +D GDIS +
Sbjct: 383 DD-YKKYYYRMRPDLKDIDFGDISAR 407
>gi|157074156|ref|NP_001096791.1| polypeptide N-acetylgalactosaminyltransferase 4 [Bos taurus]
gi|154426082|gb|AAI51594.1| GALNT4 protein [Bos taurus]
gi|296487968|tpg|DAA30081.1| TPA: polypeptide N-acetylgalactosaminyltransferase 4 [Bos taurus]
Length = 578
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 155/363 (42%), Positives = 212/363 (58%), Gaps = 20/363 (5%)
Query: 14 EPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
+PP + + GE GKA L E+ + + Y +N+ S+ IS R I D RM E
Sbjct: 66 KPPADSH--ALGEWGKASKLQLSESELKQQEELIERYAINIYLSDRISLHRHIEDKRMYE 123
Query: 72 CKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
CK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L
Sbjct: 124 CKSKKFNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLK 183
Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
+LE Y+ + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL
Sbjct: 184 TQLETYVSNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLE 242
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
I D ++ PVID ID+ T+EF EP G F+W + ++ + +P+ E +RK
Sbjct: 243 RIRKDETVVICPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKS 299
Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
EP++SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS
Sbjct: 300 RIEPFRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSH 359
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHV+ PY P N R E W DE +K +FY R P A GDI
Sbjct: 360 VGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDI 409
Query: 370 SEQ 372
SE+
Sbjct: 410 SER 412
>gi|115497708|ref|NP_001069909.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
5 [Bos taurus]
gi|83405338|gb|AAI11261.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [Bos taurus]
gi|440895696|gb|ELR47826.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
5 [Bos grunniens mutus]
Length = 448
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 146/342 (42%), Positives = 203/342 (59%), Gaps = 16/342 (4%)
Query: 31 YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVF 90
Y +PE YG N S ++ R +PD R C+ YP LP AS+I+ F
Sbjct: 93 YSIPEVIHG-----YSTYGFNSIISKNLGHYRNVPDTRNVMCQKKMYPAKLPTASIIICF 147
Query: 91 HNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNT 150
HNE F++L RT+ SI+ T LEEIILVDD S DL +KL+ +++ F GK++LIRN
Sbjct: 148 HNEEFNALFRTLSSIMTLTQQYILEEIILVDDMSDFDDLKEKLDYHLEIFRGKIKLIRNK 207
Query: 151 EREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQ 210
+REGLIR R GA + G+V+VFLD+HCEV WL PLL I D K++ P+ID IDY
Sbjct: 208 KREGLIRARMTGASHASGDVLVFLDSHCEVNKVWLEPLLNAIAKDPKMVVCPLIDVIDYM 267
Query: 211 TWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAF 270
T E Y+P RG F W + +K + + E + + + P +SP AGG+FA++R +
Sbjct: 268 TLE----YQPSPIVRGAFNWRLEFKWDHVLSYEIEGPEGPTTPIRSPAMAGGIFAINRHY 323
Query: 271 FLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVK 330
F E+G YD G+ +WGGEN ELS +IWMCGG + +PCSR+GH+ R + F +
Sbjct: 324 FNEIGQYDKGMNLWGGENLELSLRIWMCGGQLYVIPCSRVGHINRQHVTNRFEIMK---- 379
Query: 331 GPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
++ YN R++ TW DE +K F+ R P G+ISE+
Sbjct: 380 --VVEYNNLRLVHTWLDE-YKGQFFLRRPALKSAAYGNISER 418
>gi|296488205|tpg|DAA30318.1| TPA: polypeptide N-acetylgalactosaminyltransferase-like 5 [Bos
taurus]
Length = 447
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 146/342 (42%), Positives = 203/342 (59%), Gaps = 16/342 (4%)
Query: 31 YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVF 90
Y +PE YG N S ++ R +PD R C+ YP LP AS+I+ F
Sbjct: 93 YSIPEVIHG-----YSTYGFNSIISKNLGHYRNVPDTRNVMCQKKMYPAKLPTASIIICF 147
Query: 91 HNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNT 150
HNE F++L RT+ SI+ T LEEIILVDD S DL +KL+ +++ F GK++LIRN
Sbjct: 148 HNEEFNALFRTLSSIMTLTQQYILEEIILVDDMSDFDDLKEKLDYHLEIFRGKIKLIRNK 207
Query: 151 EREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQ 210
+REGLIR R GA + G+V+VFLD+HCEV WL PLL I D K++ P+ID IDY
Sbjct: 208 KREGLIRARMTGASHASGDVLVFLDSHCEVNKVWLEPLLNAIAKDPKMVVCPLIDVIDYM 267
Query: 211 TWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAF 270
T E Y+P RG F W + +K + + E + + + P +SP AGG+FA++R +
Sbjct: 268 TLE----YQPSPIVRGAFNWRLEFKWDHVLSYEIEGPEGPTTPIRSPAMAGGIFAINRHY 323
Query: 271 FLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVK 330
F E+G YD G+ +WGGEN ELS +IWMCGG + +PCSR+GH+ R + F +
Sbjct: 324 FNEIGQYDKGMNLWGGENLELSLRIWMCGGQLYVIPCSRVGHINRQHVTNRFEIMK---- 379
Query: 331 GPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
++ YN R++ TW DE +K F+ R P G+ISE+
Sbjct: 380 --VVEYNNLRLVHTWLDE-YKGQFFLRRPALKSAAYGNISER 418
>gi|426228255|ref|XP_004008229.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5 [Ovis
aries]
Length = 448
Score = 277 bits (709), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 145/325 (44%), Positives = 197/325 (60%), Gaps = 11/325 (3%)
Query: 48 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
YG N S ++ R++PD R C+ YP LP AS+I+ FHNE FS+L RT+ SI+
Sbjct: 105 YGFNHIISKNLGHYRSVPDTRNVMCRKKTYPARLPTASIIICFHNEEFSALFRTLSSIMA 164
Query: 108 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 167
TP LEEIILVDD S DL +KL+ +++ F GK++LIRN +REGLIR R GA +
Sbjct: 165 LTPQYILEEIILVDDTSDFDDLKEKLDYHLEIFRGKIKLIRNKKREGLIRARMTGASHAS 224
Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
G+V+VFLD+HCEV WL PLL I D K++ P+ID IDY T E Y+P RG
Sbjct: 225 GDVLVFLDSHCEVNKVWLEPLLNAIAKDPKMVVCPLIDVIDYMTLE----YQPSPIVRGA 280
Query: 228 FEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
F W + +K + + E + + + P +SP AGG+FA+ R +F E+G YD G+ +WGGE
Sbjct: 281 FNWHLEFKWDHVLSYEIEGPEGPTTPIRSPAMAGGIFAISRNYFNEIGQYDKGMNLWGGE 340
Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
N ELS +IWMCGG + +PCSR+GH+ R M D ++ YN R+ W D
Sbjct: 341 NLELSLRIWMCGGQLYVIPCSRVGHINRQHMT------NDSEIMKVVEYNSLRLAHIWLD 394
Query: 348 EKHKAYFYTREPLAMFLDMGDISEQ 372
E +K F+ R P G+ISE+
Sbjct: 395 E-YKEEFFLRRPALKSAAYGNISER 418
>gi|383862333|ref|XP_003706638.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Megachile rotundata]
Length = 637
Score = 277 bits (709), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 146/327 (44%), Positives = 209/327 (63%), Gaps = 17/327 (5%)
Query: 48 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
Y N+ S++I DR +PD R + C+ YP LP AS+++ F+NE + +L+R++HSII+
Sbjct: 135 YAFNVLISDNIGLDRKLPDTRHKLCQMQQYPNKLPNASIVICFYNEHYMTLLRSIHSIIE 194
Query: 108 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKES 166
RTP L EIILV+D+S +L +K++ +I F+ KV+ + +REGLIR R GA+++
Sbjct: 195 RTPKHLLHEIILVNDWSDSKELHEKIKAFINNNFDRKVKFFKTEKREGLIRARMFGARKA 254
Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
GEV++FLD+H EV W+ PLL+ I + I+ +PVID I+ T++ Y RG
Sbjct: 255 TGEVLIFLDSHIEVNKMWIEPLLSRIAHSKTIVAMPVIDIINADTFQ----YTASPLVRG 310
Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F WG+ +K +LP + + +P KSPT AGGLFAMDR +F+ELG YD G+ VWGG
Sbjct: 311 GFNWGLHFKWEQLPTKLVHDEDF-IKPIKSPTMAGGLFAMDREYFVELGEYDAGMDVWGG 369
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
EN E+SF+IWMCGGSIE +PCSR+GHV+R PY AD K + N RV W
Sbjct: 370 ENLEISFRIWMCGGSIELIPCSRVGHVFRKRRPYG----ADD-KHDTMLKNSLRVAYVWL 424
Query: 347 DE-KHKAYFYTREPLAMFLDMGDISEQ 372
DE KH +Y ++ +D GDI+++
Sbjct: 425 DEYKH---YYLKD--VNKIDYGDITDR 446
>gi|340712006|ref|XP_003394556.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 1 [Bombus terrestris]
gi|340712008|ref|XP_003394557.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 2 [Bombus terrestris]
Length = 606
Score = 277 bits (709), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 145/350 (41%), Positives = 208/350 (59%), Gaps = 9/350 (2%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
PGE G A H+ A N+ S+ IS +R++ D+R+E CK Y LP
Sbjct: 104 PGEVGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYNKYLPD 163
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
S+++VFHNE +S+L+RTV S+I R+P L+EIILVDD S + L Q LEDY++
Sbjct: 164 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVP 223
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
+ R +R GLIR R GAK G+VI FLDAHCE WL PLL+ I DR + P+
Sbjct: 224 TYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSRIAEDRTTVVCPI 283
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
ID I T+E+ + D + G F W + ++ + +RE +R + + P ++PT AGG
Sbjct: 284 IDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDRTAPLRTPTMAGG 340
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E PCS +GHV+R PY F
Sbjct: 341 LFSIDKDYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHVFRDKSPYTF 400
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+V + +N RV E W DE + ++Y P A + +GD+SE+
Sbjct: 401 PGGVSKV----VLHNAARVAEVWMDE-WRDFYYAMNPGARSVAVGDVSER 445
>gi|195030214|ref|XP_001987963.1| GH10909 [Drosophila grimshawi]
gi|193903963|gb|EDW02830.1| GH10909 [Drosophila grimshawi]
Length = 668
Score = 277 bits (709), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 160/361 (44%), Positives = 216/361 (59%), Gaps = 29/361 (8%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKYWDYPL 79
G GE G + +A A + EY G N S+ IS +R++PD+R EECK Y
Sbjct: 145 GFGEHGLPVQIEDA--AEKELEQKEYRRNGFNGFISDRISVNRSVPDVRREECKTRKYLA 202
Query: 80 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-Q 138
LP+ SV+++F+NE F +L+RTV+SII RTP + L++I+LVDD S L Q+L+DY+ Q
Sbjct: 203 KLPRVSVVIIFYNEHFQTLLRTVYSIINRTPTELLQQIVLVDDGSEWETLKQQLDDYVAQ 262
Query: 139 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
+ V ++ + ER+GLI R GAK S GE IVF D+H EV NWLPPLL PI + KI
Sbjct: 263 HWPHLVDVVHSPERQGLIGARLAGAKVSMGEAIVFFDSHIEVNYNWLPPLLEPIAINNKI 322
Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKKRKYNSEPYKSP 257
T P++D ID+ + + Y+ RG F+W YK+ LPE K S PY++P
Sbjct: 323 ATCPIVDIIDHNNFAYNGGYQEGS--RGGFDWRFFYKQLAVLPEDSVDK----SLPYRNP 376
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
GGLFA+ FF +LGGYD GL +WGGE +ELSFKIWMCGG + VPCSR+ H++R
Sbjct: 377 VMMGGLFAIASEFFWDLGGYDDGLQIWGGEQYELSFKIWMCGGMLLDVPCSRVAHIFRGQ 436
Query: 318 M-----PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMGDISE 371
M P N+ LA N+KRV E W DE +K + Y R+ +D GD++
Sbjct: 437 MDPRPNPLNYNFLA---------RNHKRVAEVWMDE-YKEHVYRRDRTTYDKIDAGDLTR 486
Query: 372 Q 372
Q
Sbjct: 487 Q 487
>gi|170065987|ref|XP_001868085.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
gi|167862691|gb|EDS26074.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
Length = 639
Score = 277 bits (709), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 150/332 (45%), Positives = 204/332 (61%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ N+ S+ I R +PD R + C Y LP AS+I+ F+NE +L+R+
Sbjct: 134 DVGYRKHAFNVLVSSKIGPFREVPDTRHKLCPEQSYDKVLPSASIIMCFYNEHLQTLLRS 193
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG-KVRLIRNTEREGLIRTRS 160
V+S++ RTPA L EIILVDD S DL LE +++FN K+RLIRN +REGL+R+R
Sbjct: 194 VNSVLGRTPAYLLHEIILVDDCSDFDDLGDDLEVGLKKFNNSKIRLIRNRDREGLMRSRV 253
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA+ + G+V+VFLD+H EV ++W+ PLL I +R I+ +PVID I+ T+ Y
Sbjct: 254 YGARNATGDVLVFLDSHIEVNVDWIEPLLQRIKVNRTILAMPVIDIINSDTF----AYTS 309
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + LP+ K P++SPT AGGLFAMDR +F ELG YD G
Sbjct: 310 SPLVRGGFNWGLHFKWDNLPKGSLAKETDFVGPFQSPTMAGGLFAMDRKYFKELGEYDMG 369
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ VWGGEN E+SF+ W CGGSIE +PCSRIGHV+R PY D + N R
Sbjct: 370 MDVWGGENLEISFRAWQCGGSIELLPCSRIGHVFRKRRPYGSPDGTD-----TMIRNSLR 424
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W D+ K YF+ +P A LD GD+SE+
Sbjct: 425 LARVWMDDYIK-YFFENQPHANKLDAGDLSER 455
>gi|48143331|ref|XP_397422.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Apis mellifera]
Length = 606
Score = 277 bits (708), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 144/350 (41%), Positives = 209/350 (59%), Gaps = 9/350 (2%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
PGE G A H+ A N+ S+ IS +R++ D+R++ CK Y LP
Sbjct: 104 PGEMGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLKDIRLDGCKTKKYSKYLPD 163
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
S+++VFHNE +S+L+RTV S+I R+P L+EIILVDD S + L Q LEDY++
Sbjct: 164 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVP 223
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
+ R +R GLIR R GAK +G+VI FLDAHCE WL PLL+ I DR + P+
Sbjct: 224 TYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSRIAEDRTTVVCPI 283
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
ID I T+E+ + D + G F W + ++ + +RE +R + + P ++PT AGG
Sbjct: 284 IDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDRTAPLRTPTMAGG 340
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E PCS +GHV+R PY F
Sbjct: 341 LFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHVFRDKSPYTF 400
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+V + +N RV E W DE + ++Y P A + +GD+SE+
Sbjct: 401 PGGVSKV----VLHNAARVAEVWMDE-WRDFYYAMNPGARNVAVGDVSER 445
>gi|170060398|ref|XP_001865784.1| N-acetyl galactosaminyl transferase 7 [Culex quinquefasciatus]
gi|167878898|gb|EDS42281.1| N-acetyl galactosaminyl transferase 7 [Culex quinquefasciatus]
Length = 356
Score = 277 bits (708), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 125/147 (85%), Positives = 138/147 (93%)
Query: 226 GIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWG 285
GIFEWGMLYKENE+P REAK+RK++SEPYKSPTHAGGLFA++R FFL++G YDPGLLVWG
Sbjct: 51 GIFEWGMLYKENEVPRREAKRRKHDSEPYKSPTHAGGLFAINREFFLKIGAYDPGLLVWG 110
Query: 286 GENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETW 345
GENFELSFKIW CGGSIEWVPCSR+GHVYR FMPYNFGKLA++ KGPLIT NYKRVIETW
Sbjct: 111 GENFELSFKIWQCGGSIEWVPCSRVGHVYRGFMPYNFGKLANKKKGPLITINYKRVIETW 170
Query: 346 FDEKHKAYFYTREPLAMFLDMGDISEQ 372
FDE++K YFYTREPLA FLDMGDISEQ
Sbjct: 171 FDEQYKEYFYTREPLARFLDMGDISEQ 197
>gi|71987795|ref|NP_001022646.1| Protein GLY-6, isoform c [Caenorhabditis elegans]
gi|3047201|gb|AAC13676.1| GLY6c [Caenorhabditis elegans]
gi|14530525|emb|CAC42318.1| Protein GLY-6, isoform c [Caenorhabditis elegans]
Length = 562
Score = 277 bits (708), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 148/365 (40%), Positives = 221/365 (60%), Gaps = 13/365 (3%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
NL P + + EG G HL + D++ N+ S+ IS R++P++R
Sbjct: 89 ANLYAPHDDWGEG---GAGVSHLTPEQQKLADSTFAVNQFNLLVSDGISVRRSLPEIRKP 145
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
C+ YP +LP SVI+V+HNE +S+L+RTV S+I R+P + L+EIILVDDFS + L
Sbjct: 146 SCRNMTYPDNLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLR 205
Query: 131 -QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
L+ ++ +++IR+ ER GLIR R GA+E++G+V+ FLD+HCE WL PLL
Sbjct: 206 YPTLDTTLKPLPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLL 265
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
I +RK + PVID I+ T++++ E +RG F W + ++ +P AK+
Sbjct: 266 TRIKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPTAMAKQHLL 322
Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
+ + P +SPT AGGLF+++R +F ELG YDPG+ +WGGEN E+SF+IW CGG +E +PCS
Sbjct: 323 DPTGPIESPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCS 382
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG- 367
+GHV+R P++F + G ++ N RV E W D+ K YFY P A +
Sbjct: 383 HVGHVFRKSSPHDF---PGKSSGKVLNTNLLRVAEVWMDD-WKHYFYKIAPQAHRMRSSI 438
Query: 368 DISEQ 372
D+SE+
Sbjct: 439 DVSER 443
>gi|350402571|ref|XP_003486531.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 1 [Bombus impatiens]
Length = 606
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 145/350 (41%), Positives = 208/350 (59%), Gaps = 9/350 (2%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
PGE G A H+ A N+ S+ IS +R++ D+R+E CK Y LP
Sbjct: 104 PGEVGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYNKYLPD 163
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
S+++VFHNE +S+L+RTV S+I R+P L+EIILVDD S + L Q LEDY++
Sbjct: 164 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVP 223
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
+ R +R GLIR R GAK G+VI FLDAHCE WL PLL+ I DR + P+
Sbjct: 224 TYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSRIAEDRTTVVCPI 283
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
ID I T+E+ + D + G F W + ++ + +RE +R + + P ++PT AGG
Sbjct: 284 IDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDRTAPLRTPTMAGG 340
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E PCS +GHV+R PY F
Sbjct: 341 LFSIDKDYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHVFRDKSPYTF 400
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+V + +N RV E W DE + ++Y P A + +GD+SE+
Sbjct: 401 PGGVSKV----VLHNAARVAEVWMDE-WRDFYYAMNPGARNVAVGDVSER 445
>gi|307189895|gb|EFN74139.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Camponotus
floridanus]
Length = 608
Score = 276 bits (707), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 143/353 (40%), Positives = 211/353 (59%), Gaps = 9/353 (2%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
K PGE G A H+ A N+ S+ IS +R++ D+R+E CK YP
Sbjct: 103 KGSPGEMGAAVHIAPENEAKQQELFKLNQFNLMASDLISLNRSLKDIRLEGCKNKKYPKY 162
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++LE +I
Sbjct: 163 LPDTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASEREHLKKELEKHITEL 222
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
+ R +R GLIR R GAK +G+VI FLDAHCE WL PLL+ I +DR +
Sbjct: 223 PVPTYVYRTEKRSGLIRARLLGAKYVKGQVITFLDAHCECTEGWLEPLLSRIANDRHTVV 282
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
P+ID I T+E+ + D + G F W + ++ + +RE +R + + P ++PT
Sbjct: 283 CPIIDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRNGDRTAPLRTPTM 339
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E CS +GHV+R P
Sbjct: 340 AGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISSCSHVGHVFRDKSP 399
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
Y F ++ + +N RV E W DE + ++Y P A +D+GD+SE+
Sbjct: 400 YTFPGGVSKI----VLHNAARVAEVWMDE-WRDFYYAMNPGARNVDVGDVSER 447
>gi|195030212|ref|XP_001987962.1| GH10908 [Drosophila grimshawi]
gi|193903962|gb|EDW02829.1| GH10908 [Drosophila grimshawi]
Length = 684
Score = 276 bits (707), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 159/361 (44%), Positives = 216/361 (59%), Gaps = 29/361 (8%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKYWDYPL 79
G GE G + +A A + EY G N S+ IS +R++PD+R EECK Y
Sbjct: 161 GFGEHGLPVQIEDA--AEKELEQKEYRRNGFNGFISDRISVNRSVPDVRREECKTRKYLA 218
Query: 80 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-Q 138
LP+ SV+++F+NE F +L+RTV+SII RTP + L++I+LVDD S L Q+L+DY+ Q
Sbjct: 219 KLPRVSVVIIFYNEHFQTLLRTVYSIINRTPTELLQQIVLVDDGSEWETLKQQLDDYVAQ 278
Query: 139 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
+ V ++ + ER+GLI R GAK S GE +VF D+H EV NWLPPLL PI + KI
Sbjct: 279 HWPHLVDVVHSPERQGLIGARLAGAKVSMGEAMVFFDSHIEVNYNWLPPLLEPIAINNKI 338
Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKKRKYNSEPYKSP 257
T P++D ID+ + + Y+ RG F+W YK+ LPE K S PY++P
Sbjct: 339 ATCPIVDIIDHNNFAYNGGYQEGS--RGGFDWRFFYKQLAVLPEDSVDK----SLPYRNP 392
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
GGLFA+ FF +LGGYD GL +WGGE +ELSFKIWMCGG + VPCSR+ H++R
Sbjct: 393 VMIGGLFAIASEFFWDLGGYDDGLQIWGGEQYELSFKIWMCGGMLLDVPCSRVAHIFRGQ 452
Query: 318 M-----PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMGDISE 371
M P N+ LA N+KRV E W DE +K + Y R+ +D GD++
Sbjct: 453 MDPRPNPLNYNFLA---------RNHKRVAEVWMDE-YKEHVYRRDRTTYDNIDAGDLTR 502
Query: 372 Q 372
Q
Sbjct: 503 Q 503
>gi|242001786|ref|XP_002435536.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215498872|gb|EEC08366.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 460
Score = 276 bits (707), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 139/302 (46%), Positives = 195/302 (64%), Gaps = 9/302 (2%)
Query: 72 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
CK YP LP SV++VFHNE +S+L+RTVHS+I+ +P LEEIILVDD S + L +
Sbjct: 7 CKDKVYPEKLPTTSVVIVFHNEAWSTLLRTVHSVIRTSPRALLEEIILVDDASEREHLGK 66
Query: 132 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 191
+LEDY+ + + V+++R +R GLIR R GA +G+VI FLDAHCE NWL PLLA
Sbjct: 67 QLEDYVVKLDTPVKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTQNWLEPLLAR 126
Query: 192 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN- 250
I DR + PVID I +T+E+ S + G F W + ++ +P+RE +R +
Sbjct: 127 IAEDRTRVVCPVIDVISDETFEYISASDLTW---GGFNWKLNFRWYRVPQRELDRRGGDR 183
Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
+ P ++PT AGGLFA+D+ +F+ELG YD G+ +WGGEN ELSF+IWMCGG +E VPCS +
Sbjct: 184 TLPVRTPTMAGGLFAIDKDYFVELGKYDEGMDIWGGENLELSFRIWMCGGELEIVPCSHV 243
Query: 311 GHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
GHV+R PY F ++ + +N R+ E W DE K +++ P A +D GD+S
Sbjct: 244 GHVFRKSTPYTFPGGTSKI----VNHNNARLAEVWLDE-WKEFYFAINPAAKNVDKGDLS 298
Query: 371 EQ 372
+
Sbjct: 299 HR 300
>gi|391347961|ref|XP_003748222.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Metaseiulus occidentalis]
Length = 658
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 149/351 (42%), Positives = 206/351 (58%), Gaps = 14/351 (3%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL-DLPK 83
G+ G A L + D + N+ S+ + +R++PD R C+ YP+ ++P
Sbjct: 144 GKDGHAVILGRDEQLEADREFSKAAFNVYVSDRLPLNRSLPDTRHRHCRAITYPVAEMPT 203
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNG 142
ASV+++F +E FS+L+RT+ S+I R+P L EIILVDDFS DL +LE YI+ F
Sbjct: 204 ASVVIIFTDEIFSTLLRTIVSVIDRSPRHLLREIILVDDFSQSEDLKDRLERYIEHHFRA 263
Query: 143 KV-RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
V RLIR ER GLIR R GA+ +RG+V++FLD+HCE WL PLL PI DR+ +
Sbjct: 264 DVVRLIRLPERSGLIRARLVGARAARGDVLIFLDSHCETTPGWLEPLLEPIRRDRRAVVC 323
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
PVID IDY+T ++ + E D G F W + + +P + R +EP +SPT AG
Sbjct: 324 PVIDVIDYRTLQYVAA-EGDRFQIGGFNWRGEFTWHNIPSAWRRNRVSVAEPMRSPTMAG 382
Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
GLFA++R +F E G YD + WGGEN E+SF+IW CGG I PCS +GH++R + PY
Sbjct: 383 GLFAINREYFWESGSYDEEMDGWGGENLEMSFRIWQCGGHIVIAPCSHVGHIFRDYQPYK 442
Query: 322 F--GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
GK + + N KR +E W DE K Y Y P + +GDIS
Sbjct: 443 IPGGKDTNAI-------NTKRAVEVWMDE-FKKYIYQARPELKKIRIGDIS 485
>gi|347971870|ref|XP_313714.5| AGAP004429-PA [Anopheles gambiae str. PEST]
gi|333469065|gb|EAA09257.5| AGAP004429-PA [Anopheles gambiae str. PEST]
Length = 663
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 148/334 (44%), Positives = 208/334 (62%), Gaps = 13/334 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ N+ SN + R IPD R + C+ Y LP ASV++ F+NE +L+R+
Sbjct: 154 DIGYRKHAFNVLVSNKLGPFRPIPDTRHKLCQAQVYDKVLPVASVVMCFYNEHLETLVRS 213
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLD---QKLEDYIQRFNGKVRLIRNTEREGLIRT 158
+H+++KRTPA L+E+ILVDD S DL Q ++ Q KVRL+RNT+REGLIR+
Sbjct: 214 IHTVLKRTPAYLLKELILVDDCSDFEDLTVGGQLEKELAQLGTNKVRLLRNTDREGLIRS 273
Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
R GA+ + G+V++FLD+H EV ++W+ PLLA I DR I+ +PVID I+ T+ VY
Sbjct: 274 RVYGARNATGQVLIFLDSHIEVNVDWIEPLLARIKHDRTILAMPVIDIINSDTF----VY 329
Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
RG F WG+ +K + LP+ ++ P+ SPT AGGLFA+DRA+F ELG YD
Sbjct: 330 TASPLVRGGFNWGLHFKWDNLPKGSLERDTDFVGPFNSPTMAGGLFAIDRAYFKELGEYD 389
Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNY 338
G+ VWGGEN E+SF+ W CGGSIE +PCSRIGHV+R PY D + N
Sbjct: 390 MGMDVWGGENLEISFRAWQCGGSIELLPCSRIGHVFRKRRPYGSPDGQD-----TMIRNS 444
Query: 339 KRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
R+ W D+ + YFY ++P A + G++SE+
Sbjct: 445 LRLAHVWMDD-YIRYFYEQQPQAHHVPYGNVSER 477
>gi|71987784|ref|NP_001022644.1| Protein GLY-6, isoform a [Caenorhabditis elegans]
gi|51315809|sp|O61394.1|GALT6_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 6;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 6; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6; Short=pp-GaNTase 6
gi|3047197|gb|AAC13674.1| GLY6a [Caenorhabditis elegans]
gi|3878104|emb|CAA19707.1| Protein GLY-6, isoform a [Caenorhabditis elegans]
Length = 618
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 148/365 (40%), Positives = 221/365 (60%), Gaps = 13/365 (3%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
NL P + + EG G HL + D++ N+ S+ IS R++P++R
Sbjct: 89 ANLYAPHDDWGEG---GAGVSHLTPEQQKLADSTFAVNQFNLLVSDGISVRRSLPEIRKP 145
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
C+ YP +LP SVI+V+HNE +S+L+RTV S+I R+P + L+EIILVDDFS + L
Sbjct: 146 SCRNMTYPDNLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLR 205
Query: 131 -QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
L+ ++ +++IR+ ER GLIR R GA+E++G+V+ FLD+HCE WL PLL
Sbjct: 206 YPTLDTTLKPLPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLL 265
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
I +RK + PVID I+ T++++ E +RG F W + ++ +P AK+
Sbjct: 266 TRIKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPTAMAKQHLL 322
Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
+ + P +SPT AGGLF+++R +F ELG YDPG+ +WGGEN E+SF+IW CGG +E +PCS
Sbjct: 323 DPTGPIESPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCS 382
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG- 367
+GHV+R P++F + G ++ N RV E W D+ K YFY P A +
Sbjct: 383 HVGHVFRKSSPHDF---PGKSSGKVLNTNLLRVAEVWMDD-WKHYFYKIAPQAHRMRSSI 438
Query: 368 DISEQ 372
D+SE+
Sbjct: 439 DVSER 443
>gi|194761562|ref|XP_001962998.1| GF15722 [Drosophila ananassae]
gi|190616695|gb|EDV32219.1| GF15722 [Drosophila ananassae]
Length = 675
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 139/336 (41%), Positives = 205/336 (61%), Gaps = 10/336 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
L P ++ K PGE GK +P + E N+ S+ IS +R++ D+R + C
Sbjct: 118 LAPSVQEAKGKPGEMGKPVKIPADMKDLMKDKFKENQFNLLASDMISLNRSLTDVRHDGC 177
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
+ YP LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++
Sbjct: 178 RRKHYPSKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LEDY+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 238 LEDYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 297
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R + +
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDRT 354
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +G
Sbjct: 355 APLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVG 414
Query: 312 HVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWF 346
HV+R PY F G +A ++ +N RV E W
Sbjct: 415 HVFRDKSPYTFPGGVA-----KIVLHNAARVAEVWM 445
>gi|345483668|ref|XP_001601037.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Nasonia vitripennis]
Length = 587
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 146/335 (43%), Positives = 201/335 (60%), Gaps = 7/335 (2%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
GE G+ +L + G+ L + +N+ SN I R +PD+R CK Y LP A
Sbjct: 75 GEYGRPAYLSGEEKIKGNEVLKKKAVNIILSNKIPLQRKLPDVRDPLCKNVTYDSVLPSA 134
Query: 85 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGK 143
S+I++FHNE FS L+RTV+S+IK TP + L+EIILVDD S +L LE YIQ R K
Sbjct: 135 SIIIIFHNEAFSVLLRTVYSVIKETPPKLLKEIILVDD-KSNEELLGLLEYYIQTRLPKK 193
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
V+L+R ER+GL+R R +GAK + G+V++FLDAHCEV WL PLL I + + P+
Sbjct: 194 VKLLRLDERQGLVRARLKGAKSATGDVLMFLDAHCEVTKQWLEPLLQRIKEKKNAVVTPI 253
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 263
ID I +T+E+ EP G F W + + E + K + P KSPT AGGL
Sbjct: 254 IDNISEETFEYSHSDEPSFFQVGGFTWSGHFTWINIQEADLKSKTSAISPVKSPTMAGGL 313
Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
FA++R +F ++G YD + WGGEN E+SF+IW CGG +E +PCSR+GHV+R+F+PY F
Sbjct: 314 FAINRKYFWDIGSYDDKMEGWGGENLEMSFRIWQCGGVLETIPCSRVGHVFRNFLPYKFP 373
Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
D G N R+ W D+ + Y+ RE
Sbjct: 374 MDKD-THG----INTARLANVWMDDYKRLYYLHRE 403
>gi|71987788|ref|NP_001022645.1| Protein GLY-6, isoform b [Caenorhabditis elegans]
gi|3047199|gb|AAC13675.1| GLY6b [Caenorhabditis elegans]
gi|14530524|emb|CAC42317.1| Protein GLY-6, isoform b [Caenorhabditis elegans]
Length = 617
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 148/365 (40%), Positives = 221/365 (60%), Gaps = 13/365 (3%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
NL P + + EG G HL + D++ N+ S+ IS R++P++R
Sbjct: 89 ANLYAPHDDWGEG---GAGVSHLTPEQQKLADSTFAVNQFNLLVSDGISVRRSLPEIRKP 145
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
C+ YP +LP SVI+V+HNE +S+L+RTV S+I R+P + L+EIILVDDFS + L
Sbjct: 146 SCRNMTYPDNLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLR 205
Query: 131 -QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
L+ ++ +++IR+ ER GLIR R GA+E++G+V+ FLD+HCE WL PLL
Sbjct: 206 YPTLDTTLKPLPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLL 265
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
I +RK + PVID I+ T++++ E +RG F W + ++ +P AK+
Sbjct: 266 TRIKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPTAMAKQHLL 322
Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
+ + P +SPT AGGLF+++R +F ELG YDPG+ +WGGEN E+SF+IW CGG +E +PCS
Sbjct: 323 DPTGPIESPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCS 382
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG- 367
+GHV+R P++F + G ++ N RV E W D+ K YFY P A +
Sbjct: 383 HVGHVFRKSSPHDF---PGKSSGKVLNTNLLRVAEVWMDD-WKHYFYKIAPQAHRMRSSI 438
Query: 368 DISEQ 372
D+SE+
Sbjct: 439 DVSER 443
>gi|332021082|gb|EGI61469.1| Polypeptide N-acetylgalactosaminyltransferase 35A [Acromyrmex
echinatior]
Length = 580
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 150/358 (41%), Positives = 215/358 (60%), Gaps = 16/358 (4%)
Query: 16 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
P ++G E G +L + + D +Y N+ S+++ R IPD R + CK
Sbjct: 47 PAVTLEQGLDELGMVKNLEDQRKR--DEGYKDYAFNILISDNLGVQRNIPDTRHKLCKMQ 104
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
YP +LP AS+I+ F+NE +++L+R++HSI+++TP L EIILV+D+S L + ++
Sbjct: 105 KYPANLPNASIIICFYNEHYTTLLRSLHSILEKTPTVLLHEIILVNDYSDSDTLHENIKV 164
Query: 136 YIQR-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
YI+ FN +VRL + REGLIR R GA+++ G+V++FLD+H EV W+ PLL+ I
Sbjct: 165 YIRNNFNDRVRLFKTERREGLIRARVFGARKATGKVLIFLDSHIEVNEIWIEPLLSRIAY 224
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
R I+ +PVID I+ T++ Y RG F WG+ +K + LP +P
Sbjct: 225 SRNIIPMPVIDIINADTFQ----YTGSPLVRGGFNWGLHFKWDNLPIGTLNHDVDFVKPI 280
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
KSPT AGGLFA+DR +F ++G YD G+ +WGGEN E+SF+IWMCGGSIE +PCSR+GHV+
Sbjct: 281 KSPTMAGGLFAIDREYFTKMGEYDIGMDIWGGENLEISFRIWMCGGSIELIPCSRVGHVF 340
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
R PY D + N RV W DE +K YF A +D GDISE+
Sbjct: 341 RRRRPYGSDDPQD-----TMLKNSLRVAHVWMDE-YKDYFLKN---AKTIDYGDISER 389
>gi|350402574|ref|XP_003486532.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 2 [Bombus impatiens]
Length = 606
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 144/350 (41%), Positives = 208/350 (59%), Gaps = 9/350 (2%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
PGE G A H+ A N+ S+ IS +R++ D+R+E CK Y LP
Sbjct: 104 PGEVGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYNKYLPD 163
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
S+++VFHNE +S+L+RTV S+I R+P L+EIILVDD S + L Q LEDY++
Sbjct: 164 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVP 223
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
+ R +R GLIR R GAK G+VI FLDAHCE WL PLL+ I DR + P+
Sbjct: 224 TYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSRIAEDRTTVVCPI 283
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
ID I T+E+ + D + G F W + ++ + +RE +R + + P ++PT AGG
Sbjct: 284 IDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDRTAPLRTPTMAGG 340
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LF++D+ +F ELG YD G+ +WGGEN E+SF+IWMCGG++E CS +GHV+R PY F
Sbjct: 341 LFSIDKDYFYELGAYDEGMDIWGGENLEMSFRIWMCGGTLEIATCSHVGHVFRKSTPYTF 400
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
++ + +N R+ E W D+ K ++Y P A + +GD+SE+
Sbjct: 401 PGGTSKI----VNHNNARLAEVWLDQ-WKYFYYNINPGARNVAVGDVSER 445
>gi|393910975|gb|EJD76111.1| glycosyl transferase, variant [Loa loa]
Length = 549
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 149/356 (41%), Positives = 214/356 (60%), Gaps = 16/356 (4%)
Query: 21 KEGPGEGGK-AYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL 79
++G GEGG+ A E ++ D G N S+ I+ +R+I D+R C+ Y
Sbjct: 93 RQGLGEGGQPAVVAVEEFKKLRDGLYRSNGYNAYISDFIALNRSIKDIRHSGCRNMVYLE 152
Query: 80 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
LP V+ HNE S+L+R+++S+I R+P ++E+ILVDD S+K L Q LE+++++
Sbjct: 153 KLPTVGVVFPIHNEHNSTLLRSIYSVINRSPKDIMKEVILVDDGSTKPFLKQPLEEFLKK 212
Query: 140 --FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
N V+++R +REGLIR R GA+ +VIVFLDAH E NWLPPL+ PI D +
Sbjct: 213 AGLNHIVKVVRTQKREGLIRARQIGARHVTADVIVFLDAHSETNYNWLPPLVEPIALDYR 272
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID T+E+R+ D RG F+W YK L E +K + P+ +P
Sbjct: 273 TVVCPLIDVIDCDTYEYRA---QDEGGRGSFDWEFNYKRLPLTE---DNKKNPTRPFHNP 326
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRS- 316
AGG FA+ R +F ELGGYD GL +WGGE +ELSFK+W C G++ PCSR+GH+YR
Sbjct: 327 VMAGGYFAISRKWFWELGGYDEGLEIWGGEQYELSFKVWQCHGTMVDAPCSRVGHIYRCK 386
Query: 317 FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
++P+ D G I+ NY+RV E W DE K + Y R P + +D GD+S+Q
Sbjct: 387 YVPF-----PDPGIGDFISKNYRRVAEVWMDEYAK-FLYKRRPPLLTVDFGDLSKQ 436
>gi|156407314|ref|XP_001641489.1| predicted protein [Nematostella vectensis]
gi|156228628|gb|EDO49426.1| predicted protein [Nematostella vectensis]
Length = 353
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 135/303 (44%), Positives = 191/303 (63%), Gaps = 12/303 (3%)
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
+C YP LP +V++ FHNE +S+L+RTVHS+I R+PA L EI+L+DDFS+ L
Sbjct: 25 KCSSKSYPSYLPSTTVVICFHNEAWSTLLRTVHSVIDRSPAHLLREILLIDDFSTHDYLK 84
Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
KL Y+ + VR++R ++REGLIR R GA+ ++G+VI FLDAHCE ++WL PLL+
Sbjct: 85 SKLTAYVAKLR-NVRVLRTSKREGLIRARLIGARAAKGDVITFLDAHCEANVDWLQPLLS 143
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
I+SDR I+ VPVID I + + G F W M + + LP +RK
Sbjct: 144 RIHSDRTIVAVPVIDIISSTNFMYSGTPSA---VIGGFSWDMQFTWHSLPNNRQSERKDR 200
Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
+ P ++PT AGGLF++DR +F E G YD G+ VWGGEN E+SF+IW CGG +E +PCSR+
Sbjct: 201 TAPIRTPTMAGGLFSIDRKYFFESGSYDEGMDVWGGENLEMSFRIWQCGGKLEILPCSRV 260
Query: 311 GHVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
GHV+R+ PY+F G ++ ++ N RV+ W DE + Y Y + P L GDI
Sbjct: 261 GHVFRTRFPYSFPGGYSE------VSVNLARVVHVWMDE-YNQYVYMKRPDLQSLKYGDI 313
Query: 370 SEQ 372
+ +
Sbjct: 314 TSR 316
>gi|302565702|ref|NP_001181690.1| polypeptide N-acetylgalactosaminyltransferase 4 [Macaca mulatta]
gi|380817542|gb|AFE80645.1| polypeptide N-acetylgalactosaminyltransferase 4 [Macaca mulatta]
Length = 578
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 159/376 (42%), Positives = 215/376 (57%), Gaps = 28/376 (7%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHI 58
RP++K +PP + + PGE GKA L E + + Y +N+ S+ I
Sbjct: 61 RPLYK--------KPPADSH--APGEWGKASKLQLNEGELKQQEELIERYAINIYLSDRI 110
Query: 59 SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
S R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EI
Sbjct: 111 SLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEI 170
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
ILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD H
Sbjct: 171 ILVDDLSDRVYLKTQLESYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCH 229
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKE 236
CE WL PLL I D + PVID ID+ T+EF EP G F+W + ++
Sbjct: 230 CECNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQW 286
Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
+ +P+ E +R +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W
Sbjct: 287 HSVPKHERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVW 346
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
CGG +E PCS +GHV+ PY P N RV E W DE +K +FY
Sbjct: 347 QCGGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARVAEVWMDE-YKEHFYN 396
Query: 357 REPLAMFLDMGDISEQ 372
R P A GDISE+
Sbjct: 397 RNPPARKEAYGDISER 412
>gi|402888363|ref|XP_003907534.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13,
partial [Papio anubis]
Length = 444
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 142/300 (47%), Positives = 193/300 (64%), Gaps = 9/300 (3%)
Query: 72 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L
Sbjct: 39 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 98
Query: 132 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 191
LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA
Sbjct: 99 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 158
Query: 192 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN- 250
I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 159 IKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 215
Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
+ P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +
Sbjct: 216 TLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHV 275
Query: 311 GHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
GHV+R PY F G +I N +R+ E W DE K +FY P + +D GD+S
Sbjct: 276 GHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDVS 330
>gi|312065523|ref|XP_003135832.1| glycosyl transferase [Loa loa]
gi|307769015|gb|EFO28249.1| glycosyl transferase [Loa loa]
Length = 614
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 149/356 (41%), Positives = 214/356 (60%), Gaps = 16/356 (4%)
Query: 21 KEGPGEGGK-AYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL 79
++G GEGG+ A E ++ D G N S+ I+ +R+I D+R C+ Y
Sbjct: 93 RQGLGEGGQPAVVAVEEFKKLRDGLYRSNGYNAYISDFIALNRSIKDIRHSGCRNMVYLE 152
Query: 80 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
LP V+ HNE S+L+R+++S+I R+P ++E+ILVDD S+K L Q LE+++++
Sbjct: 153 KLPTVGVVFPIHNEHNSTLLRSIYSVINRSPKDIMKEVILVDDGSTKPFLKQPLEEFLKK 212
Query: 140 --FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
N V+++R +REGLIR R GA+ +VIVFLDAH E NWLPPL+ PI D +
Sbjct: 213 AGLNHIVKVVRTQKREGLIRARQIGARHVTADVIVFLDAHSETNYNWLPPLVEPIALDYR 272
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
+ P+ID ID T+E+R+ D RG F+W YK L E +K + P+ +P
Sbjct: 273 TVVCPLIDVIDCDTYEYRA---QDEGGRGSFDWEFNYKRLPLTE---DNKKNPTRPFHNP 326
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRS- 316
AGG FA+ R +F ELGGYD GL +WGGE +ELSFK+W C G++ PCSR+GH+YR
Sbjct: 327 VMAGGYFAISRKWFWELGGYDEGLEIWGGEQYELSFKVWQCHGTMVDAPCSRVGHIYRCK 386
Query: 317 FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
++P+ D G I+ NY+RV E W DE K + Y R P + +D GD+S+Q
Sbjct: 387 YVPF-----PDPGIGDFISKNYRRVAEVWMDEYAK-FLYKRRPPLLTVDFGDLSKQ 436
>gi|260794623|ref|XP_002592308.1| hypothetical protein BRAFLDRAFT_206872 [Branchiostoma floridae]
gi|229277524|gb|EEN48319.1| hypothetical protein BRAFLDRAFT_206872 [Branchiostoma floridae]
Length = 374
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 150/353 (42%), Positives = 211/353 (59%), Gaps = 17/353 (4%)
Query: 24 PGEGGKAY---HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
PGE G+ +L R + +Y N S I RTIPD R CK +Y +
Sbjct: 1 PGELGQGVVLRNLSPQDRKQLEEGYKKYAFNEFASTKIPLTRTIPDGRHWLCKSKEYDVS 60
Query: 81 -LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
LP SVI+ FHNE +S+LMRTVHS+++ P++ L E+I+VDD S L +L DY+
Sbjct: 61 RLPAVSVIICFHNEAWSTLMRTVHSVLRTAPSELLTEVIMVDDDSQYDHLKAQLTDYVAG 120
Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
KV+LIR +REGLIR R GA +R +V+VFLD+HCE + WL PLL I +R +
Sbjct: 121 LP-KVKLIRTHQREGLIRARLLGASHARADVLVFLDSHCECNIGWLEPLLDRIVQNRSHV 179
Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTH 259
PVID ID++T+E+R + RG F+W ++++ ++P K+R + +P SPT
Sbjct: 180 VTPVIDVIDFKTFEYRHL--AIIQVRG-FDWRLIFRWEKIPASYEKRRGLSVDPILSPTM 236
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLFA+D+ +F LG YD G+ +WGGEN ELSF+IW CGG++E +PCSR+GHV+R P
Sbjct: 237 AGGLFAIDKEYFHHLGLYDTGMEIWGGENLELSFRIWQCGGTLEIMPCSRVGHVFRQRFP 296
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
Y + + T N RV E W D+ +K YFY + GD++E+
Sbjct: 297 Y-------QTSTEVTTRNLMRVAEVWMDQ-YKEYFYQIRHIKK-KSFGDVTER 340
>gi|350584684|ref|XP_003481802.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
1 [Sus scrofa]
gi|350596113|ref|XP_003360781.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Sus scrofa]
Length = 582
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 153/362 (42%), Positives = 208/362 (57%), Gaps = 18/362 (4%)
Query: 14 EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
+PP + + G G L E + + Y +N+ S+ IS R I D RM ECK
Sbjct: 70 KPPADSHALGEWGKGSKLQLNEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECK 129
Query: 74 Y--WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
+DY LP SV++ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L
Sbjct: 130 SKKFDYR-RLPTTSVVIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKT 188
Query: 132 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 191
+LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL
Sbjct: 189 QLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLER 247
Query: 192 IYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +RK
Sbjct: 248 IAEDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKSR 304
Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS +
Sbjct: 305 IDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHV 364
Query: 311 GHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
GHV+ PY P N R E W DE +K +FY R P A GDIS
Sbjct: 365 GHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDIS 414
Query: 371 EQ 372
E+
Sbjct: 415 ER 416
>gi|291389706|ref|XP_002711427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Oryctolagus cuniculus]
Length = 579
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 156/363 (42%), Positives = 211/363 (58%), Gaps = 20/363 (5%)
Query: 14 EPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
+PP + + GE GKA L E + + Y +N+ S+ IS R I D RM E
Sbjct: 67 KPPAD--SQALGEWGKASKLQLSEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYE 124
Query: 72 CKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
CK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S +A L
Sbjct: 125 CKSKTFNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRAYLK 184
Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
+LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL
Sbjct: 185 TQLETYISNLD-RVRLIRTKKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLE 243
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +RK
Sbjct: 244 RIERDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKS 300
Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS
Sbjct: 301 RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSH 360
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GHV+ PY P N R E W D+ +K +FY R P A D GDI
Sbjct: 361 VGHVFPKRAPY---------ARPNFLQNTARAAEVWMDD-YKEHFYNRNPPARKEDYGDI 410
Query: 370 SEQ 372
SE+
Sbjct: 411 SER 413
>gi|195148068|ref|XP_002014996.1| GL18655 [Drosophila persimilis]
gi|194106949|gb|EDW28992.1| GL18655 [Drosophila persimilis]
Length = 646
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 148/332 (44%), Positives = 207/332 (62%), Gaps = 24/332 (7%)
Query: 49 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
G N S+ IS +R++PD+R+E+CK Y LP SVI +F+NE FS+L+R+++S+I R
Sbjct: 149 GFNGLLSDMISVNRSVPDVRLEQCKTRKYLSKLPNISVIFIFYNEHFSALLRSIYSVINR 208
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESR 167
TP + L++I+LVDD S L Q+L+DY+ F V ++RN ER+GLI R GAK +
Sbjct: 209 TPVELLKQIVLVDDGSDWDTLKQQLDDYVSLHFPHVVTVVRNVERKGLIGARLEGAKVAT 268
Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
GEV+VF D+H EV NWLPPLL PI + KI T P++D ID+ + + Y+ RG
Sbjct: 269 GEVLVFFDSHIEVNYNWLPPLLEPIAINPKISTCPIVDIIDHSNFAYNGGYQ--EGSRGG 326
Query: 228 FEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F+W YK+ LPE K S+P+++P GGLFA+ FF +LGGYD L +WGG
Sbjct: 327 FDWRFFYKQLPVLPEDSVDK----SQPFRNPVMMGGLFAIRTDFFWDLGGYDDELDIWGG 382
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM-----PYNFGKLADRVKGPLITYNYKRV 341
E +ELSFKIWMCGG + VPCSR+ H++R M P N+ + N+KRV
Sbjct: 383 EQYELSFKIWMCGGMLLDVPCSRVAHIFRGPMDPRPNPRNYN---------FVGRNHKRV 433
Query: 342 IETWFDEKHKAYFYTREPLAM-FLDMGDISEQ 372
E W DE +K + Y+R+P +D GD++ Q
Sbjct: 434 AEVWMDE-YKEHVYSRDPQTYNNIDAGDLTRQ 464
>gi|350584686|ref|XP_003481803.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
2 [Sus scrofa]
Length = 578
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 153/362 (42%), Positives = 208/362 (57%), Gaps = 18/362 (4%)
Query: 14 EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
+PP + + G G L E + + Y +N+ S+ IS R I D RM ECK
Sbjct: 66 KPPADSHALGEWGKGSKLQLNEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECK 125
Query: 74 Y--WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
+DY LP SV++ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L
Sbjct: 126 SKKFDY-RRLPTTSVVIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKT 184
Query: 132 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 191
+LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL
Sbjct: 185 QLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLER 243
Query: 192 IYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +RK
Sbjct: 244 IAEDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKSR 300
Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS +
Sbjct: 301 IDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHV 360
Query: 311 GHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
GHV+ PY P N R E W DE +K +FY R P A GDIS
Sbjct: 361 GHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDIS 410
Query: 371 EQ 372
E+
Sbjct: 411 ER 412
>gi|410953294|ref|XP_003983307.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5 [Felis
catus]
Length = 443
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 143/326 (43%), Positives = 202/326 (61%), Gaps = 11/326 (3%)
Query: 47 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
+YG N S + +R +PD R ++C YP +LP ASV++ FHNE FS+L RT+ S++
Sbjct: 99 KYGFNTVLSKSLGSEREVPDTRNKKCFQKHYPANLPTASVVVCFHNEEFSALFRTMFSVV 158
Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
TP +LEEIILVDD S DL +KL+ +++ F GK++LIRN +REGLIR+R GA +
Sbjct: 159 NLTPRHFLEEIILVDDMSDSDDLKEKLDHHLEVFRGKIKLIRNKKREGLIRSRMIGASRA 218
Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
G+V+VFLD+HCEV WL PLL I D K++ P+ID ID T E Y P RG
Sbjct: 219 SGDVLVFLDSHCEVNKVWLEPLLHAIAKDPKMVVCPLIDVIDSVTLE----YWPSPVVRG 274
Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F W + +K + + E + + P +SP AGG+FA++R +F E+G YD G+ +WG
Sbjct: 275 AFNWHLQFKWDNVFSYEMDGPEGPTLPIRSPAMAGGIFAINRHYFREIGQYDKGMNLWGA 334
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
EN ELS +IWMCGG + +PCSR+GH+ + P N + A+ +TYN R+ W
Sbjct: 335 ENLELSLRIWMCGGQLFVLPCSRVGHISKQRFP-NQPEFAE-----AMTYNSLRLAHVWL 388
Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
DE +K F+ R P + G+ISE+
Sbjct: 389 DE-YKEQFFLRRPGLKSVAYGNISER 413
>gi|348585731|ref|XP_003478624.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Cavia porcellus]
Length = 937
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 154/373 (41%), Positives = 218/373 (58%), Gaps = 18/373 (4%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
V K D L +P + PG+ G+ +P E N+ S+ I DR
Sbjct: 420 VLKIDVTLSPRDP------KAPGQFGRPVIVPPGKEKEAQKRWKEGNFNVYLSDLIPVDR 473
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
I D R C LP S+I+ F +E +S+L+R+VHS++ R+P ++EI+LVDD
Sbjct: 474 AIEDTRPAGCAEQLVHNQLPTTSIIMCFVDEVWSTLLRSVHSVLNRSPQHLIKEILLVDD 533
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
FS+K L KL+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E +
Sbjct: 534 FSTKDYLKDKLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 592
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-E 241
WL PLL +Y RK + PVI+ I+ + + +V D+ RG+F W M + +P E
Sbjct: 593 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGVFVWPMNFGWRTIPPE 649
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
AK R ++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG
Sbjct: 650 VVAKNRIKETDVIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGE 709
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EP 359
IE VPCSR+GH++R+ PY+F K DR+K + N RV E W DE +K FY
Sbjct: 710 IEIVPCSRVGHIFRNDNPYSFPK--DRLKT--VERNLVRVAEVWLDE-YKELFYGHGDHL 764
Query: 360 LAMFLDMGDISEQ 372
+ LD G++++Q
Sbjct: 765 IDQRLDAGNLTQQ 777
>gi|296210176|ref|XP_002751862.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5
[Callithrix jacchus]
Length = 443
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 140/326 (42%), Positives = 204/326 (62%), Gaps = 11/326 (3%)
Query: 47 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
+YG N+ S + +R +PD R + C YP+ LP AS+++ F+NE F++L RT+ S+
Sbjct: 99 KYGFNIIISRSLGIEREVPDTRNKMCLQKRYPVRLPTASIVICFYNEEFNALFRTMSSVW 158
Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
TP LEEIILVDD S DL +KL+ +++ F GK+++IRN +REGLIR R GA +
Sbjct: 159 NLTPHHLLEEIILVDDMSEVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGASHA 218
Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
G+V+VFLD+HCEV WL PLL I D K++ PVID IDY+T E Y+P RG
Sbjct: 219 SGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPVIDVIDYRTLE----YKPSPVVRG 274
Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F+W + +K + + E + ++P +SP AGG+FA+ R +F E+G YD + WGG
Sbjct: 275 AFDWNLQFKWDNVFSYEMDGPEGPTKPIRSPAMAGGIFAIRRHYFNEIGQYDKDMDFWGG 334
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
EN ELS +IWMCGG + +PCSR+GH+ + GK + + +T+NY R+ W
Sbjct: 335 ENLELSLRIWMCGGQLFIIPCSRVGHISKK----QSGKPSTLINA--VTHNYLRLAHVWL 388
Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
DE +K F+ R+P ++ G+ISE+
Sbjct: 389 DE-YKEQFFLRKPGLKYMTYGNISER 413
>gi|32698686|ref|NP_055383.1| polypeptide N-acetylgalactosaminyltransferase 5 [Homo sapiens]
gi|51315940|sp|Q7Z7M9.1|GALT5_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
AltName: Full=Polypeptide GalNAc transferase 5;
Short=GalNAc-T5; Short=pp-GaNTase 5; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 5;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|30841528|gb|AAP34404.1| GalNAc-T5 [Homo sapiens]
gi|119631854|gb|EAX11449.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Homo
sapiens]
gi|148745655|gb|AAI42677.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Homo
sapiens]
gi|158257740|dbj|BAF84843.1| unnamed protein product [Homo sapiens]
Length = 940
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 151/360 (41%), Positives = 218/360 (60%), Gaps = 14/360 (3%)
Query: 16 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
P +P + PG+ G+ +P + E N+ S+ I DR I D R C
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQ 489
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
+LP SVI+ F +E +S+L+R+VHS+I R+P ++EI+LVDDFS+K L L+
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVINRSPPHLIKEILLVDDFSTKDYLKDNLDK 549
Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y
Sbjct: 550 YMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R ++
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDTI 665
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
+ P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 RCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
R+ PY+F K DR+K + N RV E W DE +K FY + LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 780
>gi|195492881|ref|XP_002094181.1| GE20340 [Drosophila yakuba]
gi|194180282|gb|EDW93893.1| GE20340 [Drosophila yakuba]
Length = 666
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 161/352 (45%), Positives = 214/352 (60%), Gaps = 13/352 (3%)
Query: 23 GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
G GE GKA L E+ R E G N S+ IS +R++PD+R C+ +Y L
Sbjct: 142 GLGEKGKAATLDDESQRDLEKQKSLENGFNALLSDSISVNRSLPDIRHPLCRKKEYVAKL 201
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI++F+NE S LMR+VHS+I R+P + ++EIILVDD S + L ++LE YI
Sbjct: 202 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVDDHSDREYLGKELETYIAEHF 261
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
VR++R R GLI R+ GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 262 KWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 321
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
P ID ID+ + +R+ D RG F+W YK L E + K+ ++P+KSP AG
Sbjct: 322 PFIDVIDHTNFNYRA---QDEGARGAFDWEFFYKRLPLLEEDL---KHPADPFKSPVMAG 375
Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
GLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG + PCSRIGH+YR P N
Sbjct: 376 GLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRIGHIYRG--PRN 433
Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
KG + NYKRV E W DE +K Y Y+ + L +D GD++EQ
Sbjct: 434 HQ--PSPRKGDYLHKNYKRVAEVWMDE-YKNYLYSHGDGLYESVDPGDLTEQ 482
>gi|16769916|gb|AAL29177.1| SD10722p [Drosophila melanogaster]
Length = 666
Score = 275 bits (702), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 163/353 (46%), Positives = 215/353 (60%), Gaps = 15/353 (4%)
Query: 23 GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
G GEGGKA L E+ R E G N S+ IS +R++PD+R C+ +Y L
Sbjct: 142 GLGEGGKASTLDDESQRDLEKRMSLENGFNALLSDSISVNRSVPDIRHPLCRKKEYVAKL 201
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI++F+NE S LMR+VHS+I R+P + ++EIILVDD S + L ++LE YI
Sbjct: 202 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVDDHSDREYLGKELETYIAEHF 261
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
VR++R R GLI R+ GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 262 KWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 321
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTHA 260
P ID ID+ + +R+ D RG F+W YK LPE K+ ++P+KSP A
Sbjct: 322 PFIDVIDHTNFHYRA---QDEGARGAFDWEFFYKRLPLLPE----DLKHPADPFKSPIMA 374
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG + PCSRIGH+YR P
Sbjct: 375 GGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRIGHIYRG--PR 432
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
N KG + NYKRV E W DE +K Y Y+ + L +D GD++EQ
Sbjct: 433 NHQ--PSPRKGDYLHKNYKRVAEVWMDE-YKNYLYSHGDGLYESVDPGDLTEQ 482
>gi|335775065|gb|AEH58447.1| polypeptide N-acetylgalactosaminyltransferase 1-like protein [Equus
caballus]
Length = 453
Score = 275 bits (702), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 139/297 (46%), Positives = 192/297 (64%), Gaps = 9/297 (3%)
Query: 77 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L + LE Y
Sbjct: 5 YPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESY 64
Query: 137 IQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 196
+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I DR
Sbjct: 65 VKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARIKHDR 124
Query: 197 KIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYK 255
K + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + + P +
Sbjct: 125 KTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVR 181
Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
+PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +GHV+R
Sbjct: 182 TPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFR 241
Query: 316 SFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
PY F G +I N +R+ E W DE K +FY P +D GDIS +
Sbjct: 242 KATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISSR 293
>gi|157117587|ref|XP_001658839.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108875983|gb|EAT40208.1| AAEL008037-PA [Aedes aegypti]
Length = 662
Score = 275 bits (702), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 147/332 (44%), Positives = 203/332 (61%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ N+ SN I R +PD R + C Y LP AS+I+ F+NE +L+R+
Sbjct: 157 DVGYRKHAFNVLVSNKIGPFRGVPDTRHKLCHEQSYDKVLPSASIIMCFYNEHLETLVRS 216
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
V SII+RTP+ L EIILVDD S DL LE + N KVRLIRN EREGL+R+R
Sbjct: 217 VTSIIRRTPSYLLHEIILVDDCSDLDDLRDNLEHELNALKNSKVRLIRNAEREGLMRSRV 276
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA+ + G+V++FLD+H EV ++W+ PLL I +++ I+ +PVID I+ T+ +Y
Sbjct: 277 YGARNATGDVLIFLDSHIEVNVDWVEPLLQRIKTNKTILAMPVIDIINSDTF----IYSS 332
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + LP+ K P++SPT AGGLFA+DR +F +LG YD G
Sbjct: 333 SPLVRGGFNWGLHFKWDNLPKGTLAKESDFVGPFQSPTMAGGLFAVDRQYFKDLGEYDMG 392
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ VWGGEN E+SF+ W CGGSIE VPCSRIGHV+R PY +D + N R
Sbjct: 393 MDVWGGENLEISFRTWQCGGSIELVPCSRIGHVFRKRRPYGSPDGSD-----TMIRNSLR 447
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W D+ K YF +P A +D GD++++
Sbjct: 448 LSRVWMDDYIK-YFLENQPQAKKVDPGDLTDR 478
>gi|198474477|ref|XP_001356707.2| GA16586 [Drosophila pseudoobscura pseudoobscura]
gi|198138408|gb|EAL33772.2| GA16586 [Drosophila pseudoobscura pseudoobscura]
Length = 646
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 147/332 (44%), Positives = 207/332 (62%), Gaps = 24/332 (7%)
Query: 49 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
G N S+ IS +R++PD+R+E+CK Y LP SVI +F+NE FS+L+R+++S+I R
Sbjct: 149 GFNGLLSDMISVNRSVPDVRLEQCKTRKYLSKLPNISVIFIFYNEHFSALLRSIYSVINR 208
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESR 167
TP + L++I+LVDD S L Q+L+DY+ F V ++RN ER+GLI R GAK +
Sbjct: 209 TPVELLKQIVLVDDGSDWDTLKQQLDDYVSLHFPHVVTVVRNVERKGLIGARLEGAKVAT 268
Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
GEV+VF D+H EV NWLPPLL PI + KI T P++D ID+ + + Y+ RG
Sbjct: 269 GEVLVFFDSHIEVNYNWLPPLLEPIAINPKISTCPIVDIIDHSNFAYNGGYQ--EGSRGG 326
Query: 228 FEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F+W YK+ LPE K S+P+++P GGLFA+ FF +LGGYD L +WGG
Sbjct: 327 FDWRFFYKQLPVLPEDSVDK----SQPFRNPVMMGGLFAIRTDFFWDLGGYDDELDIWGG 382
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM-----PYNFGKLADRVKGPLITYNYKRV 341
E +ELSFKIWMCGG + +PCSR+ H++R M P N+ + N+KRV
Sbjct: 383 EQYELSFKIWMCGGMLLDIPCSRVAHIFRGPMDPRPNPRNYN---------FVGRNHKRV 433
Query: 342 IETWFDEKHKAYFYTREPLAM-FLDMGDISEQ 372
E W DE +K + Y+R+P +D GD++ Q
Sbjct: 434 AEVWMDE-YKEHVYSRDPQTYNNIDAGDLTRQ 464
>gi|417411769|gb|JAA52311.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 582
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 153/352 (43%), Positives = 206/352 (58%), Gaps = 18/352 (5%)
Query: 25 GEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL-DL 81
GE GKA L EA + + Y +N+ S+ IS R I D RM ECK + L
Sbjct: 79 GEWGKASRLQLNEAELKQQEELIERYAINIYLSDKISLHRHIEDKRMYECKSKTFNYRQL 138
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L +LE Y+ +
Sbjct: 139 PTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYVSNLD 198
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
+VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL I D ++
Sbjct: 199 -RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERISEDETVIIC 257
Query: 202 PVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
PVID ID+ T+EF EP G F+W + ++ + +P+ E +RK +P +SPT A
Sbjct: 258 PVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKSRIDPIRSPTMA 314
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS +GHV+ PY
Sbjct: 315 GGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPKRAPY 374
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
P N R E W DE +K +FY R P A GDISE+
Sbjct: 375 ---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDISER 416
>gi|426337441|ref|XP_004032714.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Gorilla
gorilla gorilla]
Length = 940
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 150/360 (41%), Positives = 219/360 (60%), Gaps = 14/360 (3%)
Query: 16 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
P +P + PG+ G+ +P+ + E N+ S+ I DR I D R C
Sbjct: 432 PRDP--KAPGQFGRPVVVPQGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQ 489
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
+LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDK 549
Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y
Sbjct: 550 YMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R ++
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDTI 665
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
+ P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 RCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
R+ PY+F K DR+K + N RV E W DE +K FY + LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 780
>gi|34042986|gb|AAQ56703.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 666
Score = 274 bits (701), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 163/353 (46%), Positives = 215/353 (60%), Gaps = 15/353 (4%)
Query: 23 GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
G GEGGKA L E+ R E G N S+ IS +R++PD+R C+ +Y L
Sbjct: 142 GLGEGGKASTLDDESQRDLEKRMSLENGFNALLSDSISVNRSVPDIRHPLCRKKEYVAKL 201
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI++F+NE S LMR+VHS+I R+P + ++EIILVDD S + L ++LE YI
Sbjct: 202 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVDDHSDREYLGKELETYIAEHF 261
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
VR++R R GLI R+ GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 262 KWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 321
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTHA 260
P ID ID+ + +R+ D RG F+W YK LPE K+ ++P+KSP A
Sbjct: 322 PFIDVIDHTNFHYRAQ---DEGARGAFDWEFFYKRLPLLPE----DLKHPADPFKSPIMA 374
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG + PCSRIGH+YR P
Sbjct: 375 GGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRIGHIYRG--PR 432
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
N KG + NYKRV E W DE +K Y Y+ + L +D GD++EQ
Sbjct: 433 NHQ--PSPRKGDYLHKNYKRVAEVWMDE-YKNYLYSHGDGLYESVDPGDLTEQ 482
>gi|334348070|ref|XP_001368069.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Monodelphis domestica]
Length = 708
Score = 274 bits (701), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 156/364 (42%), Positives = 213/364 (58%), Gaps = 21/364 (5%)
Query: 14 EPPLEPYKEGPGEGGKAYHLP---EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
+PP +P GE G+A HL +A + + +Y +N+ S+ IS R I D RM
Sbjct: 195 KPPPDP--GALGEWGEASHLQLQGDAEKQQAEELTEKYAINIYLSDRISLHRHIRDDRMY 252
Query: 71 EC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
EC K +DY LP SVI+ F+NE +S+L+RTVHS+++ PA L+EIILVDD S K
Sbjct: 253 ECRLKSFDY-RRLPTTSVIIAFYNEAWSTLLRTVHSVLETAPAVLLKEIILVDDLSDKVY 311
Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
L +LE YI +VRLIR +REGL+R R GA + GEV+ FLD HCE WL PL
Sbjct: 312 LKAQLETYISSLQ-RVRLIRTKKREGLVRARLIGATFATGEVLTFLDCHCECNQGWLEPL 370
Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
L I D ++ PVID ID+ T++F + G F+W + ++ +PE E ++ +
Sbjct: 371 LERIGQDESVIICPVIDTIDWNTFDF--YMQEGEPVIGGFDWHLTFQWQPVPEHERRRWQ 428
Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
++P KSP AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG++E PCS
Sbjct: 429 SRTDPIKSPVMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGGALEIHPCS 488
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
+GHV+ PY P N R E W D+ +K +FY R PLA GD
Sbjct: 489 HVGHVFPKRAPY---------ARPNFRQNTVRAAEVWMDD-YKEHFYNRNPLARKESYGD 538
Query: 369 ISEQ 372
+SE+
Sbjct: 539 VSER 542
>gi|443704264|gb|ELU01402.1| hypothetical protein CAPTEDRAFT_127533 [Capitella teleta]
Length = 390
Score = 274 bits (701), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 155/354 (43%), Positives = 206/354 (58%), Gaps = 13/354 (3%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC--KYWDYPL 79
GPGE G++ AA N S+ +SF+RTIPD R C K +DY
Sbjct: 6 NGPGEHGRSVPTSPKDEAAVKEGFRLASFNQHASDLVSFERTIPDSRPPRCRDKSYDYS- 64
Query: 80 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
LPK SVI+ F E +S+L+R+VHS++ RTP + LEEIILVDDFS + L KL++Y+ R
Sbjct: 65 SLPKMSVIICFTEESWSTLLRSVHSVLNRTPPELLEEIILVDDFSQRGHLHAKLDNYLTR 124
Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
KV LIR R+GLIR R R + +RG V+ FLD+H E + W PLL I +R+++
Sbjct: 125 L-PKVTLIRFPSRQGLIRARLRAIEIARGPVLTFLDSHVECNVGWAEPLLQRISHNRRVI 183
Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPT 258
PVID I + + + + + RG F W ML+K +P E + + + P ++PT
Sbjct: 184 VAPVIDAISSRDFSYIPI---SANQRGGFNWAMLFKWMPVPNYEKSRTGGDPTAPVRTPT 240
Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
AGGLFA+ + FF LG YDPGL +WG EN ELSFK WMCGGS+E +PCSR+GHVYRS
Sbjct: 241 IAGGLFAIHQRFFRSLGFYDPGLDIWGSENLELSFKAWMCGGSMEMIPCSRVGHVYRSTQ 300
Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
PY+F VK + N RV W D + FY +P GDIS +
Sbjct: 301 PYSFP--GGNVK--VFMRNNLRVANVWMD-GYVNLFYLMKPELRNEPFGDISSR 349
>gi|24656262|ref|NP_647749.2| polypeptide GalNAc transferase 6, isoform A [Drosophila
melanogaster]
gi|24656265|ref|NP_728779.1| polypeptide GalNAc transferase 6, isoform B [Drosophila
melanogaster]
gi|442629817|ref|NP_001261342.1| polypeptide GalNAc transferase 6, isoform C [Drosophila
melanogaster]
gi|51315873|sp|Q6WV16.2|GALT6_DROME RecName: Full=N-acetylgalactosaminyltransferase 6; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 6;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6; Short=pp-GaNTase 6
gi|7292281|gb|AAF47689.1| polypeptide GalNAc transferase 6, isoform A [Drosophila
melanogaster]
gi|7292282|gb|AAF47690.1| polypeptide GalNAc transferase 6, isoform B [Drosophila
melanogaster]
gi|440215219|gb|AGB94037.1| polypeptide GalNAc transferase 6, isoform C [Drosophila
melanogaster]
Length = 666
Score = 274 bits (701), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 163/353 (46%), Positives = 215/353 (60%), Gaps = 15/353 (4%)
Query: 23 GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
G GEGGKA L E+ R E G N S+ IS +R++PD+R C+ +Y L
Sbjct: 142 GLGEGGKASTLDDESQRDLEKRMSLENGFNALLSDSISVNRSVPDIRHPLCRKKEYVAKL 201
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI++F+NE S LMR+VHS+I R+P + ++EIILVDD S + L ++LE YI
Sbjct: 202 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVDDHSDREYLGKELETYIAEHF 261
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
VR++R R GLI R+ GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 262 KWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 321
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTHA 260
P ID ID+ + +R+ D RG F+W YK LPE K+ ++P+KSP A
Sbjct: 322 PFIDVIDHTNFHYRA---QDEGARGAFDWEFFYKRLPLLPE----DLKHPADPFKSPIMA 374
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG + PCSRIGH+YR P
Sbjct: 375 GGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRIGHIYRG--PR 432
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
N KG + NYKRV E W DE +K Y Y+ + L +D GD++EQ
Sbjct: 433 NHQ--PSPRKGDYLHKNYKRVAEVWMDE-YKNYLYSHGDGLYESVDPGDLTEQ 482
>gi|6525067|gb|AAF15313.1|AF154107_1 UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 5 [Homo
sapiens]
Length = 610
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 153/373 (41%), Positives = 221/373 (59%), Gaps = 18/373 (4%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
V + D L +P + PG+ G+ +P + E N+ S+ I DR
Sbjct: 93 VLRIDVTLSPRDP------KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDR 146
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
I D R C +LP SVI+ F +E +S+L+R+VHS+I R+P ++EI+LVDD
Sbjct: 147 AIEDTRPAGCAEQLVXNNLPTTSVIMCFVDEVWSTLLRSVHSVINRSPPHLIKEILLVDD 206
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
FS+K L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E +
Sbjct: 207 FSTKDYLKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 265
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-E 241
WL PLL +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P +
Sbjct: 266 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPD 322
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
AK R ++ + P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG
Sbjct: 323 VIAKNRIKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGE 382
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EP 359
IE +PCSR+GH++R+ PY+F K DR+K + N RV E W DE +K FY
Sbjct: 383 IEIIPCSRVGHIFRNDNPYSFPK--DRMKT--VERNLVRVAEVWLDE-YKELFYGHGDHL 437
Query: 360 LAMFLDMGDISEQ 372
+ LD+G++++Q
Sbjct: 438 IDQGLDVGNLTQQ 450
>gi|301780762|ref|XP_002925798.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Ailuropoda melanoleuca]
Length = 578
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 156/375 (41%), Positives = 212/375 (56%), Gaps = 26/375 (6%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
RP++K +PP + + G L E + + Y +N+ S+ IS
Sbjct: 61 RPLYK--------KPPADSHALGEWGKASKLQLSEGELKQQEELIERYAINIYLSDRISL 112
Query: 61 DRTIPDLRMEECKY--WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
R I D RM ECK +DY LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EII
Sbjct: 113 HRHIEDKRMYECKSRKFDY-RRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEII 171
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 178
LVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HC
Sbjct: 172 LVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHC 230
Query: 179 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKEN 237
E WL PLL I D + PVID ID+ T+EF EP G F+W + ++ +
Sbjct: 231 ECNSGWLEPLLERISKDETTVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWH 287
Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
+P+ E +RK +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W
Sbjct: 288 SVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQ 347
Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
CGG +E PCS +GHV+ PY P N R E W DE +K +FY R
Sbjct: 348 CGGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNR 397
Query: 358 EPLAMFLDMGDISEQ 372
P A GDISE+
Sbjct: 398 NPPARKEAYGDISER 412
>gi|194759472|ref|XP_001961971.1| GF15238 [Drosophila ananassae]
gi|190615668|gb|EDV31192.1| GF15238 [Drosophila ananassae]
Length = 663
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 144/326 (44%), Positives = 203/326 (62%), Gaps = 11/326 (3%)
Query: 49 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
G N S+ IS +R++PD+R+EECK Y LP SVI +F NE S+L+R++HS++ R
Sbjct: 164 GFNGLLSDRISVNRSVPDVRLEECKTRKYLAKLPNVSVIFIFFNEYLSTLLRSIHSVVNR 223
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESR 167
TP + L++I+LVDD S L +L+DY+ F G V ++RN ER GLI R GAK +
Sbjct: 224 TPPELLKQIVLVDDGSDWESLKHQLDDYVSIHFPGLVDIVRNPERRGLIGARIAGAKVAV 283
Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
G+V+VF D+H E NWLPPLL PI + KI T P+ID ID+ T+ + ++ RG
Sbjct: 284 GDVMVFFDSHIEANYNWLPPLLEPIAINNKICTCPMIDSIDHATFSYHGGHQ--EGARGG 341
Query: 228 FEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
F+W M YK+ + ++ + S P++SP GGLFA++ FF +LGGYD L +WGGE
Sbjct: 342 FDWKMYYKQLPVLAEDSIDK---SLPFRSPVMMGGLFAINTDFFWDLGGYDDELDIWGGE 398
Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
+ELSFKIWMCGG + VPCS + H++R M + + R + N+KRV E W D
Sbjct: 399 QYELSFKIWMCGGMLLDVPCSHVAHIFRGPMD---PRPSPRENTNFVARNHKRVAEVWMD 455
Query: 348 EKHKAYFYTREPLAM-FLDMGDISEQ 372
E +K Y Y R+P +D GD++ Q
Sbjct: 456 E-YKKYLYERDPETYEKIDAGDLTRQ 480
>gi|402887191|ref|XP_003906986.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Papio
anubis]
Length = 578
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 158/376 (42%), Positives = 214/376 (56%), Gaps = 28/376 (7%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHI 58
RP++K +PP + + PGE GKA L E + + Y +N+ S+ I
Sbjct: 61 RPLYK--------KPPADSH--APGEWGKASKLQLNEGELKQQEELIERYAINIYLSDRI 110
Query: 59 SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
S R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EI
Sbjct: 111 SLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEI 170
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
ILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD H
Sbjct: 171 ILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCH 229
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKE 236
CE WL PLL I D + PVID ID+ T+EF EP G F+W + ++
Sbjct: 230 CECNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQW 286
Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
+ +P+ E +R +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W
Sbjct: 287 HSVPKHERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVW 346
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
CGG +E PCS +GHV+ PY P N R E W DE +K +FY
Sbjct: 347 QCGGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYN 396
Query: 357 REPLAMFLDMGDISEQ 372
R P A GDISE+
Sbjct: 397 RNPPARKEAYGDISER 412
>gi|350402581|ref|XP_003486533.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 3 [Bombus impatiens]
Length = 607
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 144/350 (41%), Positives = 205/350 (58%), Gaps = 8/350 (2%)
Query: 24 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
PGE G A H+ A N+ S+ IS +R++ D+R+E CK Y LP
Sbjct: 104 PGEVGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYNKYLPD 163
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
S+++VFHNE +S+L+RTV S+I R+P L+EIILVDD S + L Q LEDY++
Sbjct: 164 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVP 223
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
+ R +R GLIR R GAK G+VI FLDAHCE WL PLL+ I DR + P+
Sbjct: 224 TYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSRIAEDRTTVVCPI 283
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
ID I T+E+ + D + G F W + ++ + +RE +R + + P ++PT AGG
Sbjct: 284 IDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDRTAPLRTPTMAGG 340
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
LF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E PCS +GHV+R PY F
Sbjct: 341 LFSIDKDYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHVFRDKSPYTF 400
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+V + +N RV E W DE Y+ A + +GD+SE+
Sbjct: 401 PGGVSKV----VLHNAARVAEVWMDEWRDFYYAMNPEGARNVAVGDVSER 446
>gi|443715013|gb|ELU07165.1| hypothetical protein CAPTEDRAFT_143879 [Capitella teleta]
Length = 390
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 155/354 (43%), Positives = 206/354 (58%), Gaps = 13/354 (3%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC--KYWDYPL 79
GPGE G++ AA N S+ +SF+RTIPD R C K +DY
Sbjct: 6 NGPGEHGRSVPTSPKDEAAVKEGFRLASFNQHASDLVSFERTIPDSRPPRCRDKSFDYS- 64
Query: 80 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
LPK SVI+ F E +S+L+R+VHS++ RTP + LEEIILVDDFS + L KL++Y+ R
Sbjct: 65 SLPKMSVIICFTEESWSTLLRSVHSVLNRTPPELLEEIILVDDFSQRGHLHAKLDNYLTR 124
Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
KV LIR R+GLIR R R + +RG V+ FLD+H E + W PLL I +R+++
Sbjct: 125 L-PKVTLIRLPSRQGLIRARLRAIEIARGPVLTFLDSHVECNVGWAEPLLQRISHNRRVI 183
Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPT 258
PVID I + + + + + RG F W ML+K +P E + + + P ++PT
Sbjct: 184 VAPVIDAISSRDFSYIPI---SANQRGGFNWAMLFKWMPVPNYEKSRTGGDPTAPVRTPT 240
Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
AGGLFA+ + FF LG YDPGL +WG EN ELSFK WMCGGS+E +PCSR+GHVYRS
Sbjct: 241 IAGGLFAIHQRFFRSLGFYDPGLDIWGSENLELSFKAWMCGGSMEMIPCSRVGHVYRSTQ 300
Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
PY+F VK + N RV W D + FY +P GDIS +
Sbjct: 301 PYSFP--GGNVK--VFMRNNLRVANVWMD-GYVNLFYLMKPELRNEPFGDISSR 349
>gi|380805795|gb|AFE74773.1| polypeptide N-acetylgalactosaminyltransferase-like 6, partial
[Macaca mulatta]
Length = 336
Score = 273 bits (699), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 141/276 (51%), Positives = 181/276 (65%), Gaps = 12/276 (4%)
Query: 97 SLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLI 156
SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+ RF+ KVR++R +REGLI
Sbjct: 1 SLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYMARFS-KVRIVRTKKREGLI 59
Query: 157 RTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRS 216
RTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K + P+ID ID+ + + +
Sbjct: 60 RTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIVCPMIDVIDHNHFGYEA 119
Query: 217 VYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGG 276
+ RG F+W M YK +P +R S+P++SP AGGLFA+DR +F ELGG
Sbjct: 120 --QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESPVMAGGLFAVDRKWFWELGG 175
Query: 277 YDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY 336
YDPGL +WGGE +E+SFK+WMCGG + VPCSR+GH+YR ++PY G +
Sbjct: 176 YDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKYVPYKVP------SGTSLAR 229
Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
N KRV ETW DE Y Y R P L GDIS Q
Sbjct: 230 NLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 264
>gi|195472767|ref|XP_002088670.1| GE18697 [Drosophila yakuba]
gi|194174771|gb|EDW88382.1| GE18697 [Drosophila yakuba]
Length = 675
Score = 273 bits (699), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 138/336 (41%), Positives = 204/336 (60%), Gaps = 10/336 (2%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
L P ++ K PGE GK +P + E N+ S+ IS +R++ D+R E C
Sbjct: 118 LAPSVQEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 177
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
+ Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++
Sbjct: 178 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE+Y+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 238 LEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 297
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R + +
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDRT 354
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +G
Sbjct: 355 APLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVG 414
Query: 312 HVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWF 346
HV+R PY F G +A ++ +N RV E W
Sbjct: 415 HVFRDKSPYTFPGGVA-----KIVLHNAARVAEVWM 445
>gi|6688167|emb|CAB65104.1| GalNAc-T5 [Homo sapiens]
Length = 668
Score = 273 bits (699), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 149/354 (42%), Positives = 215/354 (60%), Gaps = 12/354 (3%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
+ PG+ G+ +P + E N+ S+ I DR I D R C +L
Sbjct: 164 KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNNL 223
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI+ F +E +S+L+R+VHS+I R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 224 PTTSVIMCFVDEVWSTLLRSVHSVINRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQF- 282
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y RK +
Sbjct: 283 PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVAC 342
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPYKSPTHA 260
PVI+ I+ + + +V D+ RGIF W M + +P + AK R ++ + P A
Sbjct: 343 PVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDTIRCPVMA 399
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++R+ PY
Sbjct: 400 GGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIFRNDNPY 459
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
+F K DR+K + N RV E W DE +K FY + LD+G++++Q
Sbjct: 460 SFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 508
>gi|195057673|ref|XP_001995302.1| GH22705 [Drosophila grimshawi]
gi|193899508|gb|EDV98374.1| GH22705 [Drosophila grimshawi]
Length = 693
Score = 273 bits (699), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 155/363 (42%), Positives = 208/363 (57%), Gaps = 28/363 (7%)
Query: 22 EGPGEGGKAYHLPEAY----RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK-YWD 76
+ GE GK LP+ + A D N S+ IS R++PD R CK
Sbjct: 153 DNAGEMGKPVVLPKEMAPDMKKAVDEGWTNNAFNQYVSDLISVHRSLPDPRDAWCKDSAR 212
Query: 77 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
Y +LPK VI+ FHNE +S L+RTVHS++ R+P++ + EIILVDD+S L +KLEDY
Sbjct: 213 YLSNLPKTDVIICFHNEAWSVLLRTVHSVLDRSPSELIGEIILVDDYSDMTHLKKKLEDY 272
Query: 137 IQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 196
+ V+++R +REGLIR R GAK ++ VI +LD+HCE WL PLL I +
Sbjct: 273 FADY-PMVKIVRGPQREGLIRARLLGAKYAKSPVITYLDSHCECAEGWLEPLLDRIARNS 331
Query: 197 KIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPEREAKKRKY 249
+ PVID ID T EF HYR G F+W + + + +PERE K+
Sbjct: 332 TTVVCPVIDVIDDATLEF--------HYRDSSGVNVGGFDWNLQFSWHSVPEREKKRHNS 383
Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
SEP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG++E VPCS
Sbjct: 384 TSEPVYSPTMAGGLFSIDREFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSH 443
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+GH++R PY + R ++ N R+ E W D+ K Y+Y R + D GD+
Sbjct: 444 VGHIFRKRSPYKW-----RTGVNVLKKNSVRLAEVWMDDYSK-YYYQRIGMDKG-DFGDV 496
Query: 370 SEQ 372
S++
Sbjct: 497 SDR 499
>gi|114581297|ref|XP_525944.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
2 [Pan troglodytes]
gi|410296312|gb|JAA26756.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Pan
troglodytes]
gi|410333399|gb|JAA35646.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Pan
troglodytes]
Length = 940
Score = 273 bits (699), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 150/360 (41%), Positives = 218/360 (60%), Gaps = 14/360 (3%)
Query: 16 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
P +P + PG+ G+ +P + E N+ S+ I DR I D R C
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQ 489
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
+LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDK 549
Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y
Sbjct: 550 YMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R ++
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDTI 665
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
+ P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 RCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
R+ PY+F K DR+K + N RV E W DE +K FY + LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 780
>gi|51316006|sp|Q8IA42.2|GALT4_DROME RecName: Full=N-acetylgalactosaminyltransferase 4; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 4;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 4; Short=pp-GaNTase 4
gi|34042946|gb|AAQ56701.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 659
Score = 273 bits (698), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 152/328 (46%), Positives = 207/328 (63%), Gaps = 16/328 (4%)
Query: 49 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
G N S+ IS +R++PDLR+E CK Y LP SVI +F NE F++L+R+++S+I R
Sbjct: 160 GFNGLISDRISVNRSVPDLRLEACKTRKYLAKLPNISVIFIFFNEHFNTLLRSIYSVINR 219
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESR 167
TP + L++I+LVDD S L Q L+DY+Q+ F V ++RN ER+GLI R GAK +
Sbjct: 220 TPPELLKQIVLVDDGSEWDVLKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAV 279
Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
G+V+VF D+H EV NWLPPL+ PI + KI T P++D I ++ + + S + RG
Sbjct: 280 GQVMVFFDSHIEVNYNWLPPLIEPIAINPKISTCPMVDTISHEDFSYFSGNK--DGARGG 337
Query: 228 FEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F+W MLYK+ LPE K S PY+SP GGLFA++ FF +LGGYD L +WGG
Sbjct: 338 FDWKMLYKQLPVLPEDALDK----SMPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGG 393
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG-PLITYNYKRVIETW 345
E +ELSFKIWMCGG + VPCSR+ H++R M K +G + N+KRV E W
Sbjct: 394 EQYELSFKIWMCGGMLLDVPCSRVAHIFRGPM-----KPRGNPRGHNFVAKNHKRVAEVW 448
Query: 346 FDEKHKAYFYTREPLAM-FLDMGDISEQ 372
DE +K Y Y R+P LD GD++ Q
Sbjct: 449 MDE-YKQYVYKRDPKTYDNLDAGDLTRQ 475
>gi|221330664|ref|NP_001137779.1| polypeptide GalNAc transferase 4, isoform B [Drosophila
melanogaster]
gi|442625712|ref|NP_722910.2| polypeptide GalNAc transferase 4, isoform C [Drosophila
melanogaster]
gi|25987157|gb|AAN75751.1|AF324752_1 N-acetylgalactosaminyltransferase [Drosophila melanogaster]
gi|220901927|gb|ACL82986.1| polypeptide GalNAc transferase 4, isoform B [Drosophila
melanogaster]
gi|440213268|gb|AAN10370.2| polypeptide GalNAc transferase 4, isoform C [Drosophila
melanogaster]
Length = 644
Score = 273 bits (698), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 152/328 (46%), Positives = 207/328 (63%), Gaps = 16/328 (4%)
Query: 49 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
G N S+ IS +R++PDLR+E CK Y LP SVI +F NE F++L+R+++S+I R
Sbjct: 145 GFNGLISDRISVNRSVPDLRLEACKTRKYLAKLPNISVIFIFFNEHFNTLLRSIYSVINR 204
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESR 167
TP + L++I+LVDD S L Q L+DY+Q+ F V ++RN ER+GLI R GAK +
Sbjct: 205 TPPELLKQIVLVDDGSEWDVLKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAV 264
Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
G+V+VF D+H EV NWLPPL+ PI + KI T P++D I ++ + + S + RG
Sbjct: 265 GQVMVFFDSHIEVNYNWLPPLIEPIAINPKISTCPMVDTISHEDFSYFSGNK--DGARGG 322
Query: 228 FEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F+W MLYK+ LPE K S PY+SP GGLFA++ FF +LGGYD L +WGG
Sbjct: 323 FDWKMLYKQLPVLPEDALDK----SMPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGG 378
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG-PLITYNYKRVIETW 345
E +ELSFKIWMCGG + VPCSR+ H++R M K +G + N+KRV E W
Sbjct: 379 EQYELSFKIWMCGGMLLDVPCSRVAHIFRGPM-----KPRGNPRGHNFVAKNHKRVAEVW 433
Query: 346 FDEKHKAYFYTREPLAM-FLDMGDISEQ 372
DE +K Y Y R+P LD GD++ Q
Sbjct: 434 MDE-YKQYVYKRDPKTYDNLDAGDLTRQ 460
>gi|281341921|gb|EFB17505.1| hypothetical protein PANDA_013078 [Ailuropoda melanoleuca]
Length = 936
Score = 273 bits (698), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 154/373 (41%), Positives = 220/373 (58%), Gaps = 18/373 (4%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
V K D L +P + PG+ G+ +P + E N+ S+ I DR
Sbjct: 420 VLKIDVTLSPRDP------KAPGQFGRPVVVPRGKEKEAERRWKEGNFNVYLSDLIPVDR 473
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
I D R C +LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDD
Sbjct: 474 AIEDTRPAGCAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDD 533
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
FS+K L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E +
Sbjct: 534 FSTKDYLKGNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 592
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-E 241
WL PLL +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P +
Sbjct: 593 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPD 649
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
AK R ++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG
Sbjct: 650 VVAKNRIKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGE 709
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EP 359
IE +PCSR+GH++R+ PY+F K DR+K + N RV E W DE +K FY
Sbjct: 710 IEIIPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHL 764
Query: 360 LAMFLDMGDISEQ 372
+ LD+G+++EQ
Sbjct: 765 IDQGLDVGNLTEQ 777
>gi|345797223|ref|XP_545481.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Canis
lupus familiaris]
Length = 602
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 154/375 (41%), Positives = 222/375 (59%), Gaps = 13/375 (3%)
Query: 2 PVFKADGKLGNLEPPLEPYK-EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
P A ++ ++ L P E PG+ G+ +P + E N+ S+ I
Sbjct: 77 PAQPAVRRVSGIDATLSPRDPEAPGQFGRPVVVPRGKEKEAERRWKEGNFNVYLSDLIPV 136
Query: 61 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
DR I D R C +LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LV
Sbjct: 137 DRAIEDTRPAGCAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLV 196
Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
DDFS+K L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E
Sbjct: 197 DDFSTKDYLKDDLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVEC 255
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
+ WL PLL +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 256 NVGWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 312
Query: 241 -EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCG 299
+ AK R ++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCG
Sbjct: 313 PDVVAKNRIKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCG 372
Query: 300 GSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-- 357
G IE +PCSR+GH++R+ PY+F K DR+K + N RV E W DE +K FY
Sbjct: 373 GEIEIIPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGD 427
Query: 358 EPLAMFLDMGDISEQ 372
+ LD+G+++EQ
Sbjct: 428 HLIDQGLDVGNLTEQ 442
>gi|307198758|gb|EFN79561.1| Polypeptide N-acetylgalactosaminyltransferase 35A [Harpegnathos
saltator]
Length = 606
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 151/365 (41%), Positives = 219/365 (60%), Gaps = 20/365 (5%)
Query: 13 LEP-PLEP---YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
L+P P++P +G E G +L + + D Y N+ S+++ RT+PD R
Sbjct: 66 LQPVPVKPAVTLDQGLDELGMVKNLDDQRKR--DEGYKNYSFNVLISDNLGVLRTLPDTR 123
Query: 69 MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
+ C+ YP +LP AS+I+ F+NE + +L+R++HSII +TP L EIILV+D+S
Sbjct: 124 HKLCRARKYPTNLPNASIIICFYNEHYMTLLRSLHSIIDKTPTSLLHEIILVNDYSDSNI 183
Query: 129 LDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L +K++ YI F+ KV+ + +REGLIR R GA+++ G+V++FLD+H EV W+ P
Sbjct: 184 LHEKIKVYITNNFDAKVQFFKTDKREGLIRARVFGARKATGDVLIFLDSHIEVNEVWIEP 243
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 247
LL+ I + I+ +PVID I+ T++ Y RG F WG+ +K + LP K+
Sbjct: 244 LLSRIAHSKTIVAMPVIDIINADTFQ----YTGSPLVRGGFNWGLHFKWDNLPIGTLKQE 299
Query: 248 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 307
+P KSPT AGGLFA+DR +F ++G YD G+ VWGGEN E+SF+IWMCGG+IE +PC
Sbjct: 300 DDFVKPIKSPTMAGGLFAIDREYFTKIGEYDTGMDVWGGENLEISFRIWMCGGNIELIPC 359
Query: 308 SRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG 367
SR+GHV+R PY D + N RV W DE +K YF +D G
Sbjct: 360 SRVGHVFRRRRPYGSDDPQD-----TMLKNSLRVAHVWLDE-YKDYFLRN---VRKIDFG 410
Query: 368 DISEQ 372
DISE+
Sbjct: 411 DISER 415
>gi|354548807|gb|AER27632.1| AT25481p1 [Drosophila melanogaster]
Length = 666
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 152/328 (46%), Positives = 207/328 (63%), Gaps = 16/328 (4%)
Query: 49 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
G N S+ IS +R++PDLR+E CK Y LP SVI +F NE F++L+R+++S+I R
Sbjct: 167 GFNGLISDRISVNRSVPDLRLEACKTRKYLAKLPNISVIFIFFNEHFNTLLRSIYSVINR 226
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESR 167
TP + L++I+LVDD S L Q L+DY+Q+ F V ++RN ER+GLI R GAK +
Sbjct: 227 TPPELLKQIVLVDDGSEWDVLKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAV 286
Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
G+V+VF D+H EV NWLPPL+ PI + KI T P++D I ++ + + S + RG
Sbjct: 287 GQVMVFFDSHIEVNYNWLPPLIEPIAINPKISTCPMVDTISHEDFSYFSGNK--DGARGG 344
Query: 228 FEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F+W MLYK+ LPE K S PY+SP GGLFA++ FF +LGGYD L +WGG
Sbjct: 345 FDWKMLYKQLPVLPEDALDK----SMPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGG 400
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG-PLITYNYKRVIETW 345
E +ELSFKIWMCGG + VPCSR+ H++R M K +G + N+KRV E W
Sbjct: 401 EQYELSFKIWMCGGMLLDVPCSRVAHIFRGPM-----KPRGNPRGHNFVAKNHKRVAEVW 455
Query: 346 FDEKHKAYFYTREPLAM-FLDMGDISEQ 372
DE +K Y Y R+P LD GD++ Q
Sbjct: 456 MDE-YKQYVYKRDPKTYDNLDAGDLTRQ 482
>gi|301776863|ref|XP_002923851.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Ailuropoda melanoleuca]
Length = 937
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 154/373 (41%), Positives = 220/373 (58%), Gaps = 18/373 (4%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
V K D L +P + PG+ G+ +P + E N+ S+ I DR
Sbjct: 420 VLKIDVTLSPRDP------KAPGQFGRPVVVPRGKEKEAERRWKEGNFNVYLSDLIPVDR 473
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
I D R C +LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDD
Sbjct: 474 AIEDTRPAGCAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDD 533
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
FS+K L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E +
Sbjct: 534 FSTKDYLKGNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 592
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-E 241
WL PLL +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P +
Sbjct: 593 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPD 649
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
AK R ++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG
Sbjct: 650 VVAKNRIKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGE 709
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EP 359
IE +PCSR+GH++R+ PY+F K DR+K + N RV E W DE +K FY
Sbjct: 710 IEIIPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHL 764
Query: 360 LAMFLDMGDISEQ 372
+ LD+G+++EQ
Sbjct: 765 IDQGLDVGNLTEQ 777
>gi|195359229|ref|XP_002045319.1| GM11142 [Drosophila sechellia]
gi|194122575|gb|EDW44618.1| GM11142 [Drosophila sechellia]
Length = 658
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 151/328 (46%), Positives = 207/328 (63%), Gaps = 16/328 (4%)
Query: 49 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
G N S+ IS +R++PD+R+E CK Y LP SVI +F NE F++L+R+++S+I R
Sbjct: 160 GFNGLISDRISVNRSVPDVRLEACKTRKYLAKLPNISVIFIFFNEHFNTLLRSIYSVINR 219
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESR 167
TP + L++I+LVDD S L Q L+DY+Q+ F V ++RN ER+GLI R GAK +
Sbjct: 220 TPPELLKQIVLVDDGSEWDVLKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAV 279
Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
G+V+VF D+H EV NWLPPL+ PI + KI T P++D I ++ + + S + RG
Sbjct: 280 GQVMVFFDSHIEVNYNWLPPLIEPIAINPKISTCPIVDTISHEDFSYFSGNK--DGARGG 337
Query: 228 FEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F+W MLYK+ LPE K S PY+SP GGLFA++ FF +LGGYD L +WGG
Sbjct: 338 FDWKMLYKQLPVLPEDALDK----SMPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGG 393
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG-PLITYNYKRVIETW 345
E +ELSFKIWMCGG + VPCSR+ H++R M K +G + N+KRV E W
Sbjct: 394 EQYELSFKIWMCGGMLLDVPCSRVAHIFRGPM-----KPRGNPRGHNFVAKNHKRVAEVW 448
Query: 346 FDEKHKAYFYTREPLAM-FLDMGDISEQ 372
DE +K Y Y R+P LD GD++ Q
Sbjct: 449 MDE-YKQYVYKRDPKTYDSLDAGDLTRQ 475
>gi|157114750|ref|XP_001652403.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108883556|gb|EAT47781.1| AAEL001121-PA [Aedes aegypti]
Length = 647
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 156/374 (41%), Positives = 213/374 (56%), Gaps = 28/374 (7%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLPE----AYRAAGDASLGEYGMNMETSNHISFDRTIPD 66
G + PP E + PG GK LP+ + A D + N ++ IS R++PD
Sbjct: 130 GVIAPPHEDSPDSPGAMGKPVVLPKDMSPEMKKAVDDGWSKNAFNQYAADLISIRRSLPD 189
Query: 67 LRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
R CK Y DLP SVI+ FHNE +S L+RTVHS++ R+P ++E+ILVDDFS
Sbjct: 190 PRDPWCKEPGRYGTDLPATSVIICFHNEAWSVLLRTVHSVLDRSPEHLVKEVILVDDFSD 249
Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
++LEDY + + +V++IR +REGLIR R GA+ + V+ +LD+HCE WL
Sbjct: 250 MPHTQKQLEDYFEAYP-RVKIIRAPKREGLIRARLLGARYATAPVLTYLDSHCECTTGWL 308
Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENE 238
PLL I + + PVID ID T E+ HYR G F+W + + +
Sbjct: 309 EPLLDRIARNSTTVVCPVIDVIDDNTMEY--------HYRDSGGVNVGGFDWNLQFNWHA 360
Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
+P+RE K+ K +EP SPT AGGLF++D+ FF LG YD G +WGGEN ELSFK WMC
Sbjct: 361 VPDREKKRHKSTAEPVFSPTMAGGLFSIDKEFFERLGTYDSGFDIWGGENLELSFKTWMC 420
Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
GG++E VPCS +GH++R PY + R +I N R+ E W DE K Y+Y R
Sbjct: 421 GGTLEIVPCSHVGHIFRKRSPYKW-----RTGVNVIKRNSVRLAEVWLDEYAK-YYYQRI 474
Query: 359 PLAMFLDMGDISEQ 372
D GD+SE+
Sbjct: 475 GNDKG-DYGDVSER 487
>gi|195576344|ref|XP_002078036.1| GD23236 [Drosophila simulans]
gi|194190045|gb|EDX03621.1| GD23236 [Drosophila simulans]
Length = 674
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 151/328 (46%), Positives = 207/328 (63%), Gaps = 16/328 (4%)
Query: 49 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
G N S+ IS +R++PD+R+E CK Y LP SVI +F NE F++L+R+++S+I R
Sbjct: 175 GFNGLISDRISVNRSVPDVRLEACKTRKYLAKLPNISVIFIFFNEHFNTLLRSIYSVINR 234
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESR 167
TP + L++I+LVDD S L Q L+DY+Q+ F V ++RN ER+GLI R GAK +
Sbjct: 235 TPPELLKQIVLVDDGSEWDVLKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAV 294
Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
G+V+VF D+H EV NWLPPL+ PI + KI T P++D I ++ + + S + RG
Sbjct: 295 GQVMVFFDSHIEVNYNWLPPLIEPIAINPKISTCPIVDTISHEDFSYFSGNK--DGARGG 352
Query: 228 FEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F+W MLYK+ LPE K S PY+SP GGLFA++ FF +LGGYD L +WGG
Sbjct: 353 FDWKMLYKQLPVLPEDALDK----SMPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGG 408
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG-PLITYNYKRVIETW 345
E +ELSFKIWMCGG + VPCSR+ H++R M K +G + N+KRV E W
Sbjct: 409 EQYELSFKIWMCGGMLLDVPCSRVAHIFRGPM-----KPRGNPRGHNFVAKNHKRVAEVW 463
Query: 346 FDEKHKAYFYTREPLAM-FLDMGDISEQ 372
DE +K Y Y R+P LD GD++ Q
Sbjct: 464 MDE-YKQYVYKRDPKTYDNLDAGDLTRQ 490
>gi|148356242|ref|NP_001038243.2| polypeptide N-acetylgalactosaminyltransferase 4 precursor [Danio
rerio]
gi|60416047|gb|AAH90692.1| WD repeat domain 51B, like [Danio rerio]
gi|182890540|gb|AAI64662.1| Wdr51bl protein [Danio rerio]
Length = 582
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 152/359 (42%), Positives = 207/359 (57%), Gaps = 16/359 (4%)
Query: 17 LEPYKEGPGEGGKAYHLP--EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
L P PGE G+A L + +AS+ +N+ S+ IS R I D RM ECK
Sbjct: 72 LPPDSNAPGEYGRATRLTLTSEEKKEEEASVERCAINIFISDKISLHRHIQDNRMHECKA 131
Query: 75 WDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKL 133
Y + LP SV++ F+NE +S+L+RT+HS+++ TPA L++IILVDDFS + L +L
Sbjct: 132 KKYNIRRLPTTSVVIAFYNEAWSTLLRTIHSVLETTPAVLLKDIILVDDFSDRGYLKSQL 191
Query: 134 EDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIY 193
YI +VRLIR +REGL+R R GA + G V+ FLD HCE W+ PLL I
Sbjct: 192 AQYISNLE-RVRLIRTKKREGLVRARLIGATYATGSVLTFLDCHCECVPGWIEPLLERIA 250
Query: 194 SDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEP 253
+ + PVID ID+ T+EF + + G F+W + ++ + +PE + K RK +P
Sbjct: 251 ENETTIICPVIDTIDWNTFEF--YMQTEEPMVGGFDWRLTFQWHAVPEIDRKIRKSRIDP 308
Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
+SPT AGGLFA+ +A+F LG YD G+ VWGGEN ELSF++W CGGS+E PCS +GHV
Sbjct: 309 IRSPTMAGGLFAVSKAYFEYLGTYDMGMEVWGGENLELSFRVWQCGGSLEIHPCSHVGHV 368
Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ PY N R E W D +K +FY R P A GDISE+
Sbjct: 369 FPKKAPYARSNFLQ---------NTVRAAEVWMD-TYKQHFYNRNPPARKESYGDISER 417
>gi|195433228|ref|XP_002064617.1| GK23729 [Drosophila willistoni]
gi|194160702|gb|EDW75603.1| GK23729 [Drosophila willistoni]
Length = 677
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 137/334 (41%), Positives = 201/334 (60%), Gaps = 10/334 (2%)
Query: 15 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
P + PGE GK +P + E N+ S+ IS +R++ D+R E C+
Sbjct: 122 PTVREQHGQPGEMGKPVKIPADMKEVMKEKFKENQFNLLASDMISLNRSLTDVRHENCRR 181
Query: 75 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L +KLE
Sbjct: 182 KHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASERDFLGKKLE 241
Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
DY+ + + ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 242 DYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVTGEVITFLDAHCECTEGWLEPLLARIVQ 301
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEP 253
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R + + P
Sbjct: 302 NRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDRTAP 358
Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +GHV
Sbjct: 359 LRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHV 418
Query: 314 YRSFMPYNF-GKLADRVKGPLITYNYKRVIETWF 346
+R PY F G +A ++ +N RV E W
Sbjct: 419 FRDKSPYTFPGGVA-----KIVLHNAARVAEVWM 447
>gi|312379012|gb|EFR25425.1| hypothetical protein AND_09241 [Anopheles darlingi]
Length = 671
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 155/371 (41%), Positives = 210/371 (56%), Gaps = 29/371 (7%)
Query: 1 RPVFKADGKLGNLEPPL--EPYKEGPGEGGKAYHLPE----AYRAAGDASLGEYGMNMET 54
RP + D + G P + P + GPGE GK LP+ + D + N
Sbjct: 142 RPARQPDDQGGLALPGVIAPPSEGGPGELGKPVVLPKDLSPEVKKLVDEGWAKNAFNQYV 201
Query: 55 SNHISFDRTIPDLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 113
++ IS RT+PD R CK Y DLP SVI+ FHNE +S L+RTVHS++ R+P
Sbjct: 202 ADMISIRRTLPDPRDAWCKEPGRYREDLPPTSVIICFHNEAWSVLLRTVHSVLDRSPEHL 261
Query: 114 LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 173
++E+ILVDDFS ++LE+Y + +V+++R +REGLIR R GA+ + V+ +
Sbjct: 262 VKEVILVDDFSDMPHTQKQLEEYFLAY-PRVKIVRAAKREGLIRARLLGARHATAPVLTY 320
Query: 174 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------G 226
LD+HCE WL PLL I + + PVID ID T E+ HYR G
Sbjct: 321 LDSHCECTTGWLEPLLDRIARNSTTVVCPVIDVIDDNTMEY--------HYRDSGGVNVG 372
Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F+W + + + +PERE +K K +EP SPT AGGLFA+DR FF LG YD G +WGG
Sbjct: 373 GFDWNLQFNWHAVPEREKRKHKSAAEPVWSPTMAGGLFAIDRVFFERLGTYDSGFDIWGG 432
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
EN ELSFK WMCGGS+E +PCS +GH++R PY + R +I N R+ E W
Sbjct: 433 ENLELSFKTWMCGGSLEIIPCSHVGHIFRKRSPYKW-----RTGVNVIKRNSVRLAEVWM 487
Query: 347 DEKHKAYFYTR 357
DE + Y+Y R
Sbjct: 488 DE-YAQYYYQR 497
>gi|410968689|ref|XP_003990834.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 5 [Felis catus]
Length = 939
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 151/360 (41%), Positives = 216/360 (60%), Gaps = 14/360 (3%)
Query: 16 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
P +P + PG+ G+ +P + E N+ S+ I DR I D R C
Sbjct: 431 PRDP--KAPGQFGRPVVVPRGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQ 488
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
+LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+
Sbjct: 489 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDK 548
Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y
Sbjct: 549 YMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 607
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R ++
Sbjct: 608 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVVAKNRIKETDII 664
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
+ P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 665 RCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 724
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
R+ PY F K DR+K + N RV E W DE +K FY + LD+G+++EQ
Sbjct: 725 RNDNPYTFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTEQ 779
>gi|281346614|gb|EFB22198.1| hypothetical protein PANDA_015357 [Ailuropoda melanoleuca]
Length = 491
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 153/362 (42%), Positives = 207/362 (57%), Gaps = 18/362 (4%)
Query: 14 EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
+PP + + G L E + + Y +N+ S+ IS R I D RM ECK
Sbjct: 5 KPPADSHALGEWGKASKLQLSEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECK 64
Query: 74 Y--WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
+DY LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L
Sbjct: 65 SRKFDY-RRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKT 123
Query: 132 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 191
+LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL
Sbjct: 124 QLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLER 182
Query: 192 IYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +RK
Sbjct: 183 ISKDETTVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKSR 239
Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS +
Sbjct: 240 IDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHV 299
Query: 311 GHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
GHV+ PY P N R E W DE +K +FY R P A GDIS
Sbjct: 300 GHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDIS 349
Query: 371 EQ 372
E+
Sbjct: 350 ER 351
>gi|332233960|ref|XP_003266176.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 5 [Nomascus
leucogenys]
Length = 940
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 150/360 (41%), Positives = 218/360 (60%), Gaps = 14/360 (3%)
Query: 16 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
P +P + PG+ G+ +P + E N+ S+ I DR I D R C
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQ 489
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
+LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDK 549
Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y
Sbjct: 550 YMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R ++
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWKTIPPDVIAKNRIKETDII 665
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
+ P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 RCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
R+ PY+F K DR+K + N RV E W DE +K FY + LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 780
>gi|149730635|ref|XP_001491185.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Equus
caballus]
Length = 940
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 152/373 (40%), Positives = 221/373 (59%), Gaps = 18/373 (4%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
V K D L +P + PG+ G+ +P + E N+ S+ I DR
Sbjct: 423 VLKIDVTLSPRDP------KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDR 476
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
I D R C +LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDD
Sbjct: 477 AIEDTRPAGCAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDD 536
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
FS+K L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E +
Sbjct: 537 FSTKDYLKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 595
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-E 241
WL PLL +Y RK + PVI+ I+ + + +V D+ RG+F W M + +P +
Sbjct: 596 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGVFVWPMNFGWRTIPPD 652
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
AK R +++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG
Sbjct: 653 IVAKNRIKDTDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGE 712
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EP 359
IE +PCSR+GH++R+ PY+F K DR+K + N RV E W DE +K FY
Sbjct: 713 IEIIPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHL 767
Query: 360 LAMFLDMGDISEQ 372
+ LD+G++++Q
Sbjct: 768 IDQGLDVGNLTQQ 780
>gi|109099754|ref|XP_001087663.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
2 [Macaca mulatta]
Length = 940
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 151/360 (41%), Positives = 217/360 (60%), Gaps = 14/360 (3%)
Query: 16 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
P +P + PG+ G+ +P + E N+ S+ I DR I D R C
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQ 489
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
+LP SVI+ F +E +S+L+R+VHS++ R+P +EEI+LVDDFS+K L L+
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIEEILLVDDFSTKDYLKDNLDK 549
Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y
Sbjct: 550 YMSQF-PKVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R ++
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDAI 665
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
K P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 KCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
R+ PY+F K DR+K + N RV E W DE +K FY + LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 780
>gi|403258969|ref|XP_003922012.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
1 [Saimiri boliviensis boliviensis]
Length = 940
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 148/353 (41%), Positives = 214/353 (60%), Gaps = 12/353 (3%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
PG+ G+ +P + E N+ S+ I DR I D R C +LP
Sbjct: 437 APGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQLVHNNLP 496
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 497 TTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQF-P 555
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y RK + P
Sbjct: 556 KVRILRLRERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVACP 615
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPYKSPTHAG 261
VI+ I+ + + +V D+ RGIF W M + +P + AK R ++ + P AG
Sbjct: 616 VIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDVIRCPVMAG 672
Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
GLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++R+ PY+
Sbjct: 673 GLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIFRNDNPYS 732
Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
F K DR+K + N RV E W DE +K FY + LD+G++++Q
Sbjct: 733 FPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLINQGLDVGNLTQQ 780
>gi|296204771|ref|XP_002749473.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Callithrix jacchus]
Length = 940
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 148/354 (41%), Positives = 215/354 (60%), Gaps = 12/354 (3%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
+ PG+ G+ +P + E N+ S+ I DR I D R C +L
Sbjct: 436 KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQLVHNNL 495
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 496 PTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDDLDKYMSQF- 554
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y RK +
Sbjct: 555 PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVAC 614
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPYKSPTHA 260
PVI+ I+ + + +V D+ RGIF W M + +P + AK R ++ + P A
Sbjct: 615 PVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDVIRCPVMA 671
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++R+ PY
Sbjct: 672 GGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIFRNDNPY 731
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
+F K DR+K + N RV E W DE +K FY + LD+G++++Q
Sbjct: 732 SFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 780
>gi|348522865|ref|XP_003448944.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2-like [Oreochromis niloticus]
Length = 590
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 140/342 (40%), Positives = 206/342 (60%), Gaps = 13/342 (3%)
Query: 25 GEGGKAY--HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
GE GKA HL R +L +YG N S IS R +P+ R +C ++ LP
Sbjct: 126 GEMGKAVRLHLEGLERDMELRALQQYGFNEVVSERISLHRRLPEARHPKCLGVEHIESLP 185
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
ASV++ F++E +S+L+RTVHS++ P QYL+E++LVDD S + L L +Y+ +G
Sbjct: 186 SASVVICFNDEAWSTLLRTVHSVLDTAPKQYLQEVLLVDDLSQQGHLKTGLSEYVSHLDG 245
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
VRLIR+T+R G+ R+ GA + GEV+VF+D+HCE WL PLL I DR + P
Sbjct: 246 -VRLIRSTKRLGVGGCRTLGAARAVGEVVVFMDSHCECQKGWLEPLLERIALDRTRVVSP 304
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
++D ID+QT+ + + P RG+F+W + + +PE + K+ + +P +SP GG
Sbjct: 305 IMDVIDWQTFRYNATQWP---VRGVFDWRLDFFWESIPELQDKEPEMAVQPLQSPALGGG 361
Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
+ A+DR FF +G YDPG+++WG E ELS ++W CGGS+E VPCSR+GH+ R +PY F
Sbjct: 362 VVAIDRHFFQSVGTYDPGMVLWGAEQIELSIRVWSCGGSMEVVPCSRVGHLIRHHLPYRF 421
Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
L+ N R+ ETW D +K +Y R+ LA F+
Sbjct: 422 P------DQDLLQRNKIRIAETWMD-TYKKIYYRRDTLAHFI 456
>gi|195584006|ref|XP_002081807.1| GD25523 [Drosophila simulans]
gi|194193816|gb|EDX07392.1| GD25523 [Drosophila simulans]
Length = 650
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 152/357 (42%), Positives = 206/357 (57%), Gaps = 28/357 (7%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
++PP ++E PGE GK LP E + A D + N S+ IS RT+PD R
Sbjct: 136 IDPPAN-FEENPGELGKPVRLPKEMSEEMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 194
Query: 69 MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
CK Y DLPK VI+ FHNE ++ L+RTVHS++ R+P + +IILVDD+S
Sbjct: 195 DAWCKDEARYLTDLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 254
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL P
Sbjct: 255 HLKRQLEDYFAAY-PKVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 313
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
LL I + + PVID I +T E+ HYR G F+W + + + +P
Sbjct: 314 LLDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 365
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
ERE K+ +EP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG
Sbjct: 366 ERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGG 425
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
++E VPCS +GH++R PY + R ++ N R+ E W DE + Y+Y R
Sbjct: 426 TLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKKNSVRLAEVWMDE-YSQYYYHR 476
>gi|344276550|ref|XP_003410071.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5-like
[Loxodonta africana]
Length = 448
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 142/328 (43%), Positives = 201/328 (61%), Gaps = 11/328 (3%)
Query: 45 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
L +YG N+ S + +R +PD R + C YP LP ASVI+ FHNE F++L RTV S
Sbjct: 102 LLQYGFNIIISRSLGKEREVPDTRNKMCLEKHYPKYLPTASVIICFHNEEFNALFRTVSS 161
Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
++ TP LEEIILVDD S DL +KL+ +++ F GK++LIRN +REGLIR R GA
Sbjct: 162 VMNLTPHYILEEIILVDDMSEFDDLKEKLDYHLEVFRGKIKLIRNKKREGLIRARLIGAS 221
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID I+ T E Y P
Sbjct: 222 RASGDVLVFLDSHCEVNRVWLEPLLFAISKDPKVVVCPLIDVINDTTLE----YTPSPVV 277
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RG F W + +K + + E + + + P +SP AGG+FA+ R +F E+G YD G+ +W
Sbjct: 278 RGAFNWKLQFKWDNVLSYEMEGPEGPTGPIRSPAMAGGIFAIQRKYFNEIGQYDKGMYLW 337
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
GGEN ELS +IWMCGG + +PCSR+GH+ + + NF + + YN R++
Sbjct: 338 GGENLELSLRIWMCGGQLFIIPCSRVGHISKQHIQNNFRFMQS------LRYNNLRLVHV 391
Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
W DE +K F+ + P ++ G+ISE+
Sbjct: 392 WLDE-YKEQFFLQGPGLKSMNYGNISER 418
>gi|443726011|gb|ELU13353.1| hypothetical protein CAPTEDRAFT_91056 [Capitella teleta]
Length = 426
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 152/354 (42%), Positives = 205/354 (57%), Gaps = 13/354 (3%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC--KYWDYPL 79
PGE G++ A N S+ +SF+RTIPD R C K +DY
Sbjct: 42 NSPGEHGRSVRTSPDDEAVVKEGFRLASFNQHASDLVSFERTIPDSRPPRCRDKSYDYS- 100
Query: 80 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
LPK SVI+ F E +S+L+R+VHS++ RTP LEEI+LVDDFS + L KL+DY+ R
Sbjct: 101 SLPKMSVIICFTEESWSTLLRSVHSVLNRTPPDLLEEILLVDDFSQREHLHAKLDDYLTR 160
Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
KV LIR R+GLIR R R + +RG V+ FLD+H E + W PLL I +R+++
Sbjct: 161 L-PKVTLIRLPSRQGLIRARLRAIEIARGPVLTFLDSHVECNVGWAEPLLQRISHNRRVI 219
Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPT 258
PVID I + + + + + RG F W ML+K +P+ E + + + P ++PT
Sbjct: 220 VAPVIDAISSRDFSYIPI---SANQRGGFNWAMLFKWMPVPDYEKSRTGGDPTAPVRTPT 276
Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
AGGLFA+ + FF LG YDPGL +WG EN ELSFK WMCGGS+E +PC+R+GHVYRS
Sbjct: 277 IAGGLFAIHQGFFRSLGFYDPGLHIWGSENLELSFKAWMCGGSMEMIPCARVGHVYRSTQ 336
Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
PY+F VK + N RV W D+ + FY +P GDIS +
Sbjct: 337 PYSFP--GGNVK--VFMRNNLRVANVWMDD-YVDLFYLMKPELRNEPFGDISSR 385
>gi|431894831|gb|ELK04624.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Pteropus alecto]
Length = 939
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 148/354 (41%), Positives = 214/354 (60%), Gaps = 12/354 (3%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
+ PG+ G+ +P + E N+ S+ I DR I D R C +L
Sbjct: 435 KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAKQLVHNNL 494
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 495 PTTSVIMCFVDEVWSTLVRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQF- 553
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y RK +
Sbjct: 554 PKVRILRLRERHGLIRARLAGAQNATGDVLTFLDSHVECNIGWLEPLLERVYLSRKKVAC 613
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPYKSPTHA 260
PVI+ I+ + + +V D+ RGIF W M + +P + AK R ++ + P A
Sbjct: 614 PVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVVAKNRIKETDIIRCPVMA 670
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++R+ PY
Sbjct: 671 GGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIFRNDNPY 730
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
+F K DR+K + N RV E W DE +K FY + LD+G++++Q
Sbjct: 731 SFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 779
>gi|338721407|ref|XP_001494570.3| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 4 [Equus caballus]
Length = 703
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 150/361 (41%), Positives = 204/361 (56%), Gaps = 16/361 (4%)
Query: 14 EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
+PP + + G L E + + Y +N+ S+ IS R I D RM ECK
Sbjct: 191 KPPADSHALGEWGKASKLQLNEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECK 250
Query: 74 YWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
+ LP SV++ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L +
Sbjct: 251 SQKFNYRKLPTTSVVIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQ 310
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL I
Sbjct: 311 LETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERI 369
Query: 193 YSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 251
D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +RK
Sbjct: 370 SKDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKSRI 426
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
+P SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS +G
Sbjct: 427 DPISSPTMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVG 486
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+ PY P N R E W DE +K +FY R P A GDISE
Sbjct: 487 HVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDISE 536
Query: 372 Q 372
+
Sbjct: 537 R 537
>gi|195335001|ref|XP_002034165.1| GM20039 [Drosophila sechellia]
gi|194126135|gb|EDW48178.1| GM20039 [Drosophila sechellia]
Length = 650
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 152/357 (42%), Positives = 206/357 (57%), Gaps = 28/357 (7%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
++PP ++E PGE GK LP E + A D + N S+ IS RT+PD R
Sbjct: 136 IDPPAN-FEENPGELGKPVRLPKEMSEEMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 194
Query: 69 MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
CK Y DLPK VI+ FHNE ++ L+RTVHS++ R+P + +IILVDD+S
Sbjct: 195 DAWCKDEARYLTDLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 254
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL P
Sbjct: 255 HLKRQLEDYFAAY-PKVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 313
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
LL I + + PVID I +T E+ HYR G F+W + + + +P
Sbjct: 314 LLDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 365
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
ERE K+ +EP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG
Sbjct: 366 ERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGG 425
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
++E VPCS +GH++R PY + R ++ N R+ E W DE + Y+Y R
Sbjct: 426 TLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKKNSVRLAEVWMDE-YSQYYYHR 476
>gi|327282475|ref|XP_003225968.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Anolis carolinensis]
Length = 583
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 156/376 (41%), Positives = 221/376 (58%), Gaps = 28/376 (7%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEA--YRAAGDASLGEYGMNMETSNHI 58
RPV++ +PP +P+ G GE GKA L + + + + Y +N+ S+ I
Sbjct: 66 RPVYQ--------KPPPDPH--GLGEWGKAARLTLSPEEKKLEEELVERYAINIYLSDKI 115
Query: 59 SFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 116
S R I D RM EC K +DY LP SVI+ F+NE +S+L+RT+HS+++ +P+ L+E
Sbjct: 116 SLHRHIDDGRMPECRSKTYDY-RRLPTTSVIIAFYNEAWSTLLRTIHSVLESSPSVLLKE 174
Query: 117 IILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 176
IILVDD S K L +LE YI +VRLIR +REGL+R R GA + G+V+ FLD
Sbjct: 175 IILVDDLSDKVYLKGELEKYISNLQ-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDC 233
Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 236
HCE WL PLL + + ++ PVID ID+ T+EF +P G F+W + ++
Sbjct: 234 HCECVPGWLEPLLQRVAENESVIICPVIDTIDWNTFEF--YMQPGEPMIGGFDWRLTFQW 291
Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
+ +P+ E ++RK +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W
Sbjct: 292 HSVPDYERQRRKSKVDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMDVWGGENLELSFRVW 351
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
CGG +E PCS +GHV+ PY P N R E W D+ +K +FY
Sbjct: 352 QCGGILEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDD-YKEHFYN 401
Query: 357 REPLAMFLDMGDISEQ 372
R P A + GD+SE+
Sbjct: 402 RNPPARKENFGDLSER 417
>gi|380786811|gb|AFE65281.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Macaca mulatta]
Length = 558
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 146/346 (42%), Positives = 203/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L + +A G+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAYLLAKQLKA-GEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400
>gi|449667968|ref|XP_002168066.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Hydra magnipapillata]
Length = 548
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 144/379 (37%), Positives = 212/379 (55%), Gaps = 13/379 (3%)
Query: 2 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYH--LPEAYRAAGDASLGEYGMNMETSNHIS 59
P+ D LG L L P P G + Y LP+ ++ + + S+ IS
Sbjct: 57 PIVDVD-VLGQLGIELYPELIDPLLGARGYPAILPDNLKSQSKNLFKNHSFDSLLSDRIS 115
Query: 60 FDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
+R + +++ + C YP +LP SVI+ FHNE S+L+RTVHS+I TP L I+L
Sbjct: 116 LNRRLGNVKGDLCSSKQYPAELPNTSVIICFHNEATSALLRTVHSVINETPPNILSNIVL 175
Query: 120 VDDFSSKADLDQKLEDYIQRFNGK-----VRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
VDD S A L + L +YI N K V L RN +R+GL+R+R +GA+ + G V+ FL
Sbjct: 176 VDDASVGAALKKPLRNYINELNRKLGEEMVILYRNAKRQGLVRSRLKGAELASGTVLTFL 235
Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 234
D+HCE W+ PLL I D++ + PVI+ ID ++ G F W + +
Sbjct: 236 DSHCEATEGWVEPLLFRIKEDKRNVVCPVIEVIDAVDLSYKKTELDRITQVGGFTWDLFF 295
Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
E+ E E + R ++P KSPT AGGLFA+D+++F E+G YD + +WGGEN E+SF+
Sbjct: 296 NWKEITEDEKRLRADGTQPLKSPTMAGGLFAIDKSYFYEIGSYDNQMEIWGGENLEMSFR 355
Query: 295 IWMCGGSIEWVPCSRIGHVYRS-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAY 353
IWMCGG +E +PCSR+GH++R PY+F + + N+ R+ E W DE + Y
Sbjct: 356 IWMCGGKLEIIPCSRVGHIFRKENSPYSFPNGVSKT----LAKNFNRLAEVWMDEYKELY 411
Query: 354 FYTREPLAMFLDMGDISEQ 372
+ + P + GDISE+
Sbjct: 412 YRRKPPEDKLVKYGDISER 430
>gi|402888383|ref|XP_003907542.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Papio
anubis]
Length = 940
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 151/360 (41%), Positives = 217/360 (60%), Gaps = 14/360 (3%)
Query: 16 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
P +P + PG+ G+ +P + E N+ S+ I DR I D R C
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQ 489
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
+LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIKEILLVDDFSTKDYLKDNLDK 549
Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y
Sbjct: 550 YMSQF-PKVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R ++
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDAI 665
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
K P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 KCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM--FLDMGDISEQ 372
R+ PY+F K DR+K + N RV E W DE +K FY M LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLMDQGLDVGNLTQQ 780
>gi|344268422|ref|XP_003406059.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Loxodonta africana]
Length = 939
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 149/360 (41%), Positives = 215/360 (59%), Gaps = 14/360 (3%)
Query: 16 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
P +P + PG+ G+ +P E N+ S+ I DR I D R C
Sbjct: 431 PRDP--KAPGQFGRPVIVPHGKEKEAKRRWKEGNFNVYLSDLIPVDRAIEDTRPTGCAEQ 488
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
+LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+
Sbjct: 489 LVHSNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDK 548
Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y
Sbjct: 549 YMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNIGWLEPLLERVYLS 607
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
RK + PVI+ I+ + + +V D+ RG+F W M + +P + AK R ++
Sbjct: 608 RKKVACPVIEVINDKDMSYMTV---DNFQRGVFVWPMNFGWRTIPPDVVAKNRIKETDVI 664
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
+ P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 665 RCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 724
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
R+ PY F K DR+K + N RV E W DE +K FY + LD+G++++Q
Sbjct: 725 RNDNPYTFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 779
>gi|297298138|ref|XP_001104403.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Macaca
mulatta]
Length = 558
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 146/346 (42%), Positives = 203/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L + +A G+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAYLLAKQLKA-GEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400
>gi|355564907|gb|EHH21396.1| hypothetical protein EGK_04452 [Macaca mulatta]
Length = 940
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 150/360 (41%), Positives = 217/360 (60%), Gaps = 14/360 (3%)
Query: 16 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
P +P + PG+ G+ +P + E N+ S+ I DR I D R C
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQ 489
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
+LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIKEILLVDDFSTKDYLKDNLDK 549
Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y
Sbjct: 550 YMSQF-PKVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R ++
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDAI 665
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
K P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 KCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
R+ PY+F K DR+K + N RV E W DE +K FY + LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 780
>gi|410214072|gb|JAA04255.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410214074|gb|JAA04256.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410295440|gb|JAA26320.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410295442|gb|JAA26321.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410336845|gb|JAA37369.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
Length = 558
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 147/346 (42%), Positives = 203/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS DL+ L + R KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSS--DLEDCL--LLTRI-PKVKCLR 184
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400
>gi|449281639|gb|EMC88675.1| Polypeptide N-acetylgalactosaminyltransferase-like protein 2
[Columba livia]
Length = 640
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 147/370 (39%), Positives = 209/370 (56%), Gaps = 25/370 (6%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLG--EYGMNMETSNHI 58
RP +A+G + + P P + P EG AAG LG +G N S I
Sbjct: 123 RPEARAEGDAESPQLPARPLQ--PAEGA----------AAGQRPLGLETHGFNEALSERI 170
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S R +P++R C +Y LP ASVI+ FH+E +S+L+RTVHSI+ P L++II
Sbjct: 171 SLRRDLPEVRHPLCLQQEYDSSLPTASVIICFHDEAWSTLLRTVHSIMDTAPKASLKDII 230
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 178
LVDD S + L L +YI + +G V+LIR+ +R G+IR R GA + G+V+VF+D+HC
Sbjct: 231 LVDDLSQQGPLKSALSEYISKLDG-VKLIRSNKRLGVIRGRMLGAARATGDVLVFMDSHC 289
Query: 179 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE 238
E WL PLLA + S+R + PVID ID++T+++ Y +RG+F+W + +
Sbjct: 290 ECQKGWLEPLLARLSSNRNSVVSPVIDVIDWKTFQY---YHSVGLHRGVFDWKLDFHWEP 346
Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
+PERE K R+ P +SP AG + AMDR +F G YD + +WG EN ELS + W+C
Sbjct: 347 VPEREEKVRQSPISPIRSPVVAGAVVAMDRHYFQNTGAYDSDMTMWGAENLELSIRTWLC 406
Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
GGS+E +PCSR+GHVYR+ P F I N R+ ETW K FY +
Sbjct: 407 GGSVEIIPCSRVGHVYRNHFPRAFS------YEEAIVRNKIRIAETWLG-SFKDNFYKHD 459
Query: 359 PLAMFLDMGD 368
+A + +
Sbjct: 460 TVAFLISKAE 469
>gi|355693388|gb|EHH27991.1| hypothetical protein EGK_18322, partial [Macaca mulatta]
Length = 499
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 146/346 (42%), Positives = 203/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L + +A G+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 12 KAYLLAKQLKA-GEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 70
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 71 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 125
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 126 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 185
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D+
Sbjct: 186 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 242
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 243 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 297
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 298 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 341
>gi|13929126|ref|NP_113984.1| polypeptide N-acetylgalactosaminyltransferase 5 [Rattus norvegicus]
gi|51315691|sp|O88422.1|GALT5_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
AltName: Full=Polypeptide GalNAc transferase 5;
Short=GalNAc-T5; Short=pp-GaNTase 5; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 5;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|3510639|gb|AAC69708.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase T5 [Rattus
norvegicus]
gi|149047792|gb|EDM00408.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5, isoform CRA_a
[Rattus norvegicus]
gi|149047793|gb|EDM00409.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5, isoform CRA_a
[Rattus norvegicus]
Length = 930
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 146/353 (41%), Positives = 215/353 (60%), Gaps = 12/353 (3%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
PG+ G+ +P + + E N+ S+ I DR I D R C DLP
Sbjct: 427 APGQFGRPVVVPPGKKKEAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNDLP 486
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 487 TTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKANLDKYMSQF-P 545
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y +RK + P
Sbjct: 546 KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLNRKKVACP 605
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPYKSPTHAG 261
VI+ I+ + + +V D+ RG+F W M + +P + AK ++ + P AG
Sbjct: 606 VIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWRTIPPDVIAKNGIKETDIIRCPVMAG 662
Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
GLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++R+ PY+
Sbjct: 663 GLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIFRNDNPYS 722
Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
F K DR+K + N RV E W DE +K FY + LD+G++++Q
Sbjct: 723 FPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 770
>gi|195380503|ref|XP_002049010.1| GJ21354 [Drosophila virilis]
gi|194143807|gb|EDW60203.1| GJ21354 [Drosophila virilis]
Length = 693
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 158/385 (41%), Positives = 217/385 (56%), Gaps = 31/385 (8%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAY----RAAGDASLGEYGMNMETSN 56
+P K D K L+ P+ + PGE GK LP+ + A D + N S+
Sbjct: 134 KPPPKEDDK-SVLDAPVANLNDNPGELGKPVILPKDMPIDMKKAVDDGWTKNAFNQYVSD 192
Query: 57 HISFDRTIPDLRMEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
IS R++PD R CK Y +LPK VI+ FHNE +S L+RTVHS++ R+P + +
Sbjct: 193 LISVHRSLPDPRDAWCKDSARYLSNLPKTDVIICFHNEAWSVLLRTVHSVLDRSPPELIG 252
Query: 116 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
+IILVDD+S L ++LEDY + V+++R +REGLIR R GAK ++ VI +LD
Sbjct: 253 QIILVDDYSDMPHLKKQLEDYFASY-PMVQIVRGPQREGLIRARLLGAKYAKSPVITYLD 311
Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIF 228
+HCE WL PLL I + + PVID ID T EF HYR G F
Sbjct: 312 SHCECAEGWLEPLLDRIARNSTTVVCPVIDVIDDTTLEF--------HYRDSSGVNVGGF 363
Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
+W + + + +PERE ++ +EP SPT AGGLF++DR FF LG YD G +WGGEN
Sbjct: 364 DWNLQFSWHAVPEREKRRHNNTAEPVYSPTMAGGLFSIDREFFERLGTYDSGFDIWGGEN 423
Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
ELSFK WMCGG++E VPCS +GH++R PY + R ++ N R+ E W D+
Sbjct: 424 LELSFKTWMCGGTLEIVPCSHVGHIFRKRSPYKW-----RTGVNVLKKNSVRLAEVWMDD 478
Query: 349 KHKAYFYTREPLAMFL-DMGDISEQ 372
K Y + + M D GD+SE+
Sbjct: 479 YSKYYL---QRIGMDKGDYGDVSER 500
>gi|311275140|ref|XP_003134592.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5-like
[Sus scrofa]
Length = 446
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 138/326 (42%), Positives = 200/326 (61%), Gaps = 11/326 (3%)
Query: 47 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
+YG N S + R +PD R + C YP +LP AS+I+ FHNE F++L+RTV SI+
Sbjct: 103 KYGFNHIVSKSLGNYRNVPDSRNKMCHQKHYPANLPTASIIICFHNEEFNALLRTVSSIM 162
Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
TP +EEIILVDD S DL +KL+ +++ F GK+++IRN +REGLIR R GA +
Sbjct: 163 TLTPHHIIEEIILVDDMSEYDDLKEKLDYHLEIFRGKIKVIRNKKREGLIRARLVGASRA 222
Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
G+++VFLD+HCEV WL PLL I D K++ P++D IDY T E Y+P RG
Sbjct: 223 SGDILVFLDSHCEVNKIWLEPLLDAIVKDPKMVVCPIMDVIDYVTLE----YKPSPVVRG 278
Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
+F W + ++ + + E + P +SP GGLFA+ R +F E+G YD G+ +WGG
Sbjct: 279 VFNWHLQFEWDRVFSYEMDGPDGPTRPIRSPAMVGGLFAIHRHYFNEIGQYDKGMNLWGG 338
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
EN ELS +IWMCGG + +PCSR+GH+ + + N G++ + YN R++ W
Sbjct: 339 ENLELSLRIWMCGGQLFLLPCSRVGHINKPYFT-NQGEIKKA-----MAYNNLRIVHVWL 392
Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
DE +K F+ + P L G++SE+
Sbjct: 393 DE-YKEQFFLQNPRLKSLAYGNVSER 417
>gi|219804492|ref|NP_001137331.1| polypeptide N-acetylgalactosaminyltransferase 5 [Bos taurus]
gi|296490560|tpg|DAA32673.1| TPA: polypeptide N-acetylgalactosaminyltransferase 5 [Bos taurus]
Length = 940
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 150/372 (40%), Positives = 220/372 (59%), Gaps = 16/372 (4%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
V + D L +P + PG+ G+ +P + E N+ S+ I DR
Sbjct: 423 VLRIDATLSPRDP------KAPGQFGRPVVVPHGKEKEVERRWKEGNFNVYLSDLIPVDR 476
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
I D R C +LP S+I+ F +E +S+L+R+VHS++ R+P ++EI+LVDD
Sbjct: 477 AIEDTRPAGCAEQLVHNNLPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDD 536
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
FS+K L L+ Y+ +F KVR++R ER GLIR R GA+++ G+V+ FLD+H E +
Sbjct: 537 FSTKDYLKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQKATGDVLTFLDSHVECNI 595
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-E 241
WL PLL +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P +
Sbjct: 596 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPD 652
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
AK + ++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG
Sbjct: 653 VVAKNKIKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGE 712
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE-KHKAYFYTREPL 360
IE VPCSR+GH++R+ PY+F K DR+K + N RV E W DE K Y + +
Sbjct: 713 IEIVPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLGRVAEVWLDEYKELFYGHGNHLI 768
Query: 361 AMFLDMGDISEQ 372
LD+G++++Q
Sbjct: 769 DQGLDVGNLTQQ 780
>gi|355750550|gb|EHH54877.1| hypothetical protein EGM_03977 [Macaca fascicularis]
Length = 940
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 150/360 (41%), Positives = 217/360 (60%), Gaps = 14/360 (3%)
Query: 16 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
P +P + PG+ G+ +P + E N+ S+ I DR I D R C
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQ 489
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
+LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIKEILLVDDFSTKDYLKDNLDK 549
Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y
Sbjct: 550 YMSQF-PKVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R ++
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDAI 665
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
K P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 KCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
R+ PY+F K DR+K + N RV E W DE +K FY + LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 780
>gi|395838452|ref|XP_003792129.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5
[Otolemur garnettii]
Length = 869
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 142/328 (43%), Positives = 200/328 (60%), Gaps = 11/328 (3%)
Query: 45 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
L +YG N TS ++ F R +PD R + C Y LP ASVI+ FHNE F++L RT+ S
Sbjct: 283 LSKYGFNTITSTNVGFKREVPDTRHKMCLQNHYSTHLPTASVIICFHNEEFNALFRTMFS 342
Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
++ TP LEEIILVDD S DL +KL+ ++ F GK++LIRN +REGLIR R GA
Sbjct: 343 VVNLTPNSLLEEIILVDDMSEFDDLKEKLDYVLEVFRGKIKLIRNQKREGLIRGRMIGAA 402
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID ID T E+R+
Sbjct: 403 RASGDVLVFLDSHCEVNKGWLEPLLYSIAKDHKMVVCPLIDVIDETTLEYRA----SPVV 458
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RG F+W + +K + + E +P +SP AGG+FA+ R +F E+G YD G+ +W
Sbjct: 459 RGAFDWELKFKWDNVFSYEMDGPDRPIKPIRSPAMAGGIFAIYRHYFNEIGQYDKGMDLW 518
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
GGEN ELS +IWMCGG + +PCSR+GH+ + F +++ + T N R++
Sbjct: 519 GGENLELSLRIWMCGGQLFIIPCSRVGHITKK----QFKEVSAITRA--FTRNSLRMVHV 572
Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
W DE +K F+ R+P + G+ISE+
Sbjct: 573 WLDE-YKEQFFLRKPGLRSIAYGNISER 599
>gi|327270185|ref|XP_003219870.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Anolis carolinensis]
Length = 592
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 160/378 (42%), Positives = 228/378 (60%), Gaps = 31/378 (8%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHI 58
RPV++ +PPL E GE G+A L E+ + S+ + +N+ S+ I
Sbjct: 70 RPVYE--------KPPLGRETE-LGELGRAARLELSESELRRQEESVALHQINVYLSDRI 120
Query: 59 SFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 116
S R +P+ R +C K +DY +LPK SVI+ F+NE +S+L+RTVHS+++ +P LEE
Sbjct: 121 SLHRRLPERRHPQCTEKRYDY-YNLPKTSVIIAFYNEAWSTLLRTVHSVLETSPDILLEE 179
Query: 117 IILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 176
IILVDD+S K L +KLE+Y+ KVRLIR +REGL+R R GA ++G+V+ FLD
Sbjct: 180 IILVDDYSDKEHLKEKLENYVANLR-KVRLIRANKREGLVRARLLGASIAKGDVLTFLDC 238
Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYK 235
HCE WL PLL I + + PVID ID+ T+E+ + EP G F+W +++
Sbjct: 239 HCECHEEWLEPLLERIKEEPSAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDWRLVFT 295
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
+ +PERE K+R+ ++ +SPT AGGLFA+++ +F LG YD G+ VWGGEN E SF+I
Sbjct: 296 WHVVPEREQKQRRSKTDVIRSPTMAGGLFAVNKNYFSYLGSYDTGMEVWGGENLEFSFRI 355
Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGK-LADRVKGPLITYNYKRVIETWFDEKHKAYF 354
W CGGS+E PCS +GHV+ PY+ K LA+ V R E W D +K +
Sbjct: 356 WQCGGSLEIHPCSHVGHVFPKQAPYSRAKALANSV----------RAAEVWMD-SYKELY 404
Query: 355 YTREPLAMFLDMGDISEQ 372
Y R P A GD++E+
Sbjct: 405 YHRNPHARMEPYGDVTER 422
>gi|195488108|ref|XP_002092174.1| GE14045 [Drosophila yakuba]
gi|194178275|gb|EDW91886.1| GE14045 [Drosophila yakuba]
Length = 684
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 151/357 (42%), Positives = 207/357 (57%), Gaps = 28/357 (7%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
++PP ++E PGE GK LP E + A D + N S+ IS RT+PD R
Sbjct: 136 IDPPAN-FEENPGELGKPVRLPKEMSEEMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 194
Query: 69 MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
CK Y +LPK VI+ FHNE ++ L+RTVHS++ R+P + +IILVDD+S
Sbjct: 195 DAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 254
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL P
Sbjct: 255 HLKRQLEDYFAAY-PKVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 313
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
LL I + + PVID I+ +T E+ HYR G F+W + + + +P
Sbjct: 314 LLDRIARNSSTVVCPVIDVINDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 365
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
ERE K+ +EP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG
Sbjct: 366 ERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGG 425
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
++E VPCS +GH++R PY + R ++ N R+ E W DE + Y+Y R
Sbjct: 426 TLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKKNSVRLAEVWMDE-YSQYYYHR 476
>gi|194855550|ref|XP_001968569.1| GG24947 [Drosophila erecta]
gi|190660436|gb|EDV57628.1| GG24947 [Drosophila erecta]
Length = 659
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 149/328 (45%), Positives = 205/328 (62%), Gaps = 16/328 (4%)
Query: 49 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
G N S+ IS +R++PD+R+E CK Y LP SV+ VF NE F++L+R+++S+I R
Sbjct: 160 GFNGLISDRISVNRSVPDVRLEACKTRKYLAKLPNISVVFVFFNEHFNTLLRSMYSVINR 219
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESR 167
TP + L++I+LVDD S L Q L+DY+Q+ F V ++ + ER+GLI R GAK +
Sbjct: 220 TPPELLKQIVLVDDGSEWDSLKQPLDDYVQQHFPHLVTVVHSPERQGLIGARIAGAKVAV 279
Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
GEV+VF D+H EV NWLPPL+ PI + KI T P++D I ++ +F RG
Sbjct: 280 GEVMVFFDSHIEVNYNWLPPLIEPIAINPKICTCPIVDSISHE--DFSYFGGNKDGTRGG 337
Query: 228 FEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F+W MLYK+ LPE K S+PY+SP GGLFA++ FF +LGGYD L +WGG
Sbjct: 338 FDWKMLYKQLPVLPEDALDK----SQPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGG 393
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG-PLITYNYKRVIETW 345
E +ELSFKIWMCGG + VPCSR+ H++R M K +G + N+KRV E W
Sbjct: 394 EQYELSFKIWMCGGMLLDVPCSRVAHIFRGPM-----KPRGNPRGHNFVAKNHKRVAEVW 448
Query: 346 FDEKHKAYFYTREPLAM-FLDMGDISEQ 372
DE +K Y Y R+P +D GD++ Q
Sbjct: 449 MDE-YKQYVYNRDPTTYDNVDAGDLTRQ 475
>gi|397525624|ref|XP_003832760.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Pan
paniscus]
Length = 940
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 149/360 (41%), Positives = 217/360 (60%), Gaps = 14/360 (3%)
Query: 16 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
P +P + PG+ G+ +P + E N+ S+ I DR I D R C
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQ 489
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
+LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDK 549
Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y
Sbjct: 550 YMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R ++
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDTI 665
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
+ P AGGLF++ +++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 RCPVMAGGLFSIHKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
R+ PY+F K DR+K + N RV E W DE +K FY + LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 780
>gi|326911650|ref|XP_003202170.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Meleagris gallopavo]
Length = 579
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 153/359 (42%), Positives = 208/359 (57%), Gaps = 20/359 (5%)
Query: 19 PYKEGPGEGGKAYHL---PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
P PGE GK L PE + + + +Y +N+ S+ IS R I D RM CK
Sbjct: 70 PDSYAPGEWGKPTRLQLSPEEKKQEAEL-IDKYAINIYLSDKISLHRHIEDNRMSGCKTK 128
Query: 76 DYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
Y LP SV++ F+NE +S+L+RTVHS+++ +P+ L+EIILVDD S K L LE
Sbjct: 129 SYNYRKLPTTSVVIAFYNEAWSTLLRTVHSVLETSPSVLLKEIILVDDLSDKVYLKTDLE 188
Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
YI +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL I
Sbjct: 189 KYISSLK-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECVSGWLEPLLERIAE 247
Query: 195 DRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEP 253
+ ++ PVID ID+ T+E+ EP G F+W + ++ + +P+ E +RK ++P
Sbjct: 248 NETVVICPVIDTIDWNTFEYYMQSAEP---MIGGFDWRLTFQWHSVPKHERLRRKSETDP 304
Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
+SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS +GHV
Sbjct: 305 IRSPTMAGGLFAVSKKYFEYLGTYDTGMDVWGGENLELSFRVWQCGGMLEIHPCSHVGHV 364
Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ PY P N R E W DE +K +FY R P A + GDISE+
Sbjct: 365 FPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKENYGDISER 413
>gi|440896773|gb|ELR48609.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Bos grunniens
mutus]
Length = 940
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 151/373 (40%), Positives = 221/373 (59%), Gaps = 18/373 (4%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
V + D L +P + PG+ G+ +P + E N+ S+ I DR
Sbjct: 423 VLRIDATLSPRDP------KAPGQFGRPVVVPHGKEKEVERRWKEGNFNVYLSDLIPVDR 476
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
I D R C +LP S+I+ F +E +S+L+R+VHS++ R+P ++EI+LVDD
Sbjct: 477 AIEDTRPAGCAEQLVHNNLPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDD 536
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
FS+K L L+ Y+ +F KVR++R ER GLIR R GA+++ G+V+ FLD+H E +
Sbjct: 537 FSTKDYLKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQKATGDVLTFLDSHVECNI 595
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-E 241
WL PLL +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P +
Sbjct: 596 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPD 652
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
AK + ++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG
Sbjct: 653 VVAKNKIKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGE 712
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EP 359
IE VPCSR+GH++R+ PY+F K DR+K + N RV E W DE +K FY
Sbjct: 713 IEIVPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLGRVAEVWLDE-YKELFYGHGDHL 767
Query: 360 LAMFLDMGDISEQ 372
+ LD+G++++Q
Sbjct: 768 IDQGLDVGNLTQQ 780
>gi|291225677|ref|XP_002732827.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Saccoglossus kowalevskii]
Length = 633
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 144/332 (43%), Positives = 199/332 (59%), Gaps = 11/332 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D ++ N S+ I F R +PD R C Y Y +LP SV++ F NE +S+L+RT
Sbjct: 137 DEGYQQHAFNQLISDRIGFHRGLPDTRNGLCAYQVYSNNLPSTSVVICFFNEAWSTLLRT 196
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRS 160
V+S+I R+PA L EIILVDD+SS L L+D+I+ V++I N +REGLIR R
Sbjct: 197 VYSVIDRSPANLLHEIILVDDYSSSTYLKDYLDDFIKTNLFQIVKIIHNKKREGLIRARM 256
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA + G+V++FLD+HCEV WL PLL I D + P+ID I+ T+E Y+
Sbjct: 257 IGAAAATGDVVMFLDSHCEVSTQWLEPLLERIKFDPHTVVCPIIDIINADTFE----YQQ 312
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P + K ++ +P +SPT AGGLFAMDR +F ELG YD G
Sbjct: 313 SPLVRGGFNWGLHFKWDTIPSSQFKGKEDYIKPVRSPTMAGGLFAMDRKYFHELGEYDDG 372
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IW CGG++E +PCSR+GHV+R PY D ++ N R
Sbjct: 373 MDIWGGENLEISFRIWQCGGTLEIIPCSRVGHVFRKRRPYGSPNGED-----TMSKNSLR 427
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
V W DE + YF ++ D GDIS +
Sbjct: 428 VAHVWMDEYKEHYFELKKD-NRNKDYGDISSR 458
>gi|338724473|ref|XP_001495495.2| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5-like
[Equus caballus]
Length = 448
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 141/328 (42%), Positives = 198/328 (60%), Gaps = 11/328 (3%)
Query: 45 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
YG N S + +R +PD R + C YP LP AS+++ FHNE F++L+RTV S
Sbjct: 102 FSRYGFNAMISQRLGNEREVPDTRNKMCLQKHYPTRLPSASIVICFHNEEFNALLRTVSS 161
Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
++K TP + LEEIILVDD S DL +KL+ +++ F GK++LIRN ++EGLIR R GA
Sbjct: 162 VMKLTPYRVLEEIILVDDMSEFDDLKEKLDHHLEFFRGKIKLIRNKKKEGLIRARLIGAS 221
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID IDY T + Y+P
Sbjct: 222 LASGDVLVFLDSHCEVNKVWLEPLLLAIAKDPKMVVCPLIDVIDYMTLK----YKPSPVV 277
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RG F W + +K + + E + P +SP AGG+FA+DR +F E+G YD + +W
Sbjct: 278 RGAFNWHLQFKWDNVFSYEMDGPEGPIAPIRSPAMAGGIFAIDRQYFNEIGRYDKDMNLW 337
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
GGEN ELS +IWMCGG + +PCSR+GH+ + + R +TYN R++
Sbjct: 338 GGENLELSLRIWMCGGQLFVLPCSRVGHIDKQRIE------NKREYLKAMTYNNLRMVHV 391
Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
W DE HK + R P + G+ISE+
Sbjct: 392 WLDE-HKEQVFLRRPGLKSVAYGNISER 418
>gi|403272081|ref|XP_003927917.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Saimiri
boliviensis boliviensis]
Length = 578
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 151/361 (41%), Positives = 205/361 (56%), Gaps = 16/361 (4%)
Query: 14 EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
+PP + + G HL E + + Y +N+ S+ IS R I D RM ECK
Sbjct: 66 KPPADSHALGEWGKASKLHLNEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECK 125
Query: 74 YWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
+ LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L +
Sbjct: 126 SKKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQ 185
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL I
Sbjct: 186 LETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERI 244
Query: 193 YSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 251
D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 245 GRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKYERDRRISRI 301
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS +G
Sbjct: 302 DPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVG 361
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+ PY P N R E W DE +K +FY R P A GDISE
Sbjct: 362 HVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDISE 411
Query: 372 Q 372
+
Sbjct: 412 R 412
>gi|391342054|ref|XP_003745339.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Metaseiulus occidentalis]
Length = 641
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 151/358 (42%), Positives = 205/358 (57%), Gaps = 18/358 (5%)
Query: 22 EGPGEGGKAYHLPEAYRAAG----DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
PGE GK +P D N S+ IS R++PD+R CK +
Sbjct: 136 NAPGENGKGVIVPTNLTGDAKRRLDIGWQNNAFNQYASDMISLHRSLPDMRDPGCKTQKF 195
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
DLP+ SVI+ FHNE +S LMRTVHS+I R+P L+EIILVDDFS L ++LEDY
Sbjct: 196 RRDLPQTSVIICFHNEAWSVLMRTVHSVIDRSPKNLLKEIILVDDFSDMKHLKEQLEDYT 255
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
++ G V+++R ++REGLIR R GAK + V+ +LD+HCE WL PLL I
Sbjct: 256 RKL-GIVKIVRASKREGLIRARLLGAKFATAPVLTYLDSHCECSTGWLEPLLDRIAEADT 314
Query: 198 IMTVPVIDGIDYQTWEF---RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
+ PVID I T+E+ R+ Y + G F+W + + + LP+R+ RK +
Sbjct: 315 NVVCPVIDVISDSTFEYPHRRAGYTVN---VGGFDWNLQFSWHSLPQRDKDARKQSWSAV 371
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
SPT AGGLF++ +A+F +LG YD G +WG EN ELSFK+WMCGG +E VPCS +GHV+
Sbjct: 372 PSPTMAGGLFSISKAYFEKLGLYDSGFDIWGAENLELSFKVWMCGGRLEIVPCSHVGHVF 431
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
R PY + K + +K N R+ + W DE + YF P D GDISE+
Sbjct: 432 RKRSPYKWLKGVNVLKK-----NSVRLAKVWMDEYAQYYFDRIGP--DLGDYGDISER 482
>gi|194865210|ref|XP_001971316.1| GG14889 [Drosophila erecta]
gi|190653099|gb|EDV50342.1| GG14889 [Drosophila erecta]
Length = 666
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 159/352 (45%), Positives = 214/352 (60%), Gaps = 13/352 (3%)
Query: 23 GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
G GE GKA L E+ R E G N S+ IS +R++PD+R C +Y L
Sbjct: 142 GLGEKGKAASLDDESQRDLEKRMSLENGFNALLSDSISVNRSLPDIRHPLCHKKEYVTKL 201
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI++F+NE S LMR+VHS+I R+P + ++EIILVDD S + L ++LE YI
Sbjct: 202 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVDDHSDREYLGKELETYIAEHF 261
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
VR++R +R GLI R+ GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 262 KWVRVVRLPKRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 321
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
P ID ID+ + +R+ D RG F+W YK L + + K+ ++P+KSP AG
Sbjct: 322 PFIDVIDHSNFNYRA---QDEGARGAFDWEFFYKRLPLLKDDL---KHPADPFKSPIMAG 375
Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
GLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG + PCSRIGH+YR P N
Sbjct: 376 GLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRIGHIYRG--PRN 433
Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
R G + NYKRV E W DE +K Y Y+ + + +D GD++EQ
Sbjct: 434 HQPSPRR--GDYLHRNYKRVAEVWMDE-YKNYLYSHGDGVYESVDPGDLTEQ 482
>gi|344235750|gb|EGV91853.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Cricetulus griseus]
Length = 797
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 202/343 (58%), Gaps = 22/343 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY +A GE + N S+ +S DR I D R C Y LDLP SVI+
Sbjct: 55 KAYLSAKQLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSLSYSLDLPATSVIIT 114
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 115 FHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 169
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
+REGLIR+R RGA + V+ FLD+HCEV + WL P+L + D + P+ID I
Sbjct: 170 DKREGLIRSRVRGADVAGATVLTFLDSHCEVNIEWLQPMLQRVMEDHTRVVSPIIDVISL 229
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D++
Sbjct: 230 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKS 286
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 287 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 340
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
+G +TY N KR E W DE +K Y+Y P A+ G ++
Sbjct: 341 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVA 382
>gi|357619954|gb|EHJ72323.1| putative UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Danaus plexippus]
Length = 533
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 148/333 (44%), Positives = 197/333 (59%), Gaps = 23/333 (6%)
Query: 33 LPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY-WDYPLDLPKASVILVFH 91
+ E + A + N S+ IS RT+PD R E CK Y DLP+ SV++ FH
Sbjct: 1 MSEDAKLAVSEGWKKNAFNQYASDLISIRRTLPDPRDEWCKQPGRYLEDLPQTSVVICFH 60
Query: 92 NEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTE 151
NE +S L+RTVHS+I R+PA ++EIILVDDFS L Q+L+DY+ KVR++R T+
Sbjct: 61 NEAWSVLLRTVHSVIDRSPAHLIKEIILVDDFSDMPHLMQQLDDYMSSL-PKVRIVRATQ 119
Query: 152 REGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQT 211
REGLIR R GAK V+ +LD+HCE WL PLL I ++ + PVID ID T
Sbjct: 120 REGLIRARLLGAKYVTAPVLTYLDSHCECTEGWLEPLLDRIARNKTNVVCPVIDVIDDNT 179
Query: 212 WEFRSVYEPDHHYR-------GIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLF 264
E+ HYR G F+W + + + +P RE + K+ +EP SPT AGGLF
Sbjct: 180 LEY--------HYRDSTSVNVGGFDWNLQFNWHPVPARERARHKHTAEPVWSPTMAGGLF 231
Query: 265 AMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGK 324
A+D+ FF LG YD G +WGGEN ELSFK WMCGG++E VPCS +GH++R PY +
Sbjct: 232 AIDKEFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKRSPYKW-- 289
Query: 325 LADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
R ++ N R+ E W D+ K Y+Y R
Sbjct: 290 ---RTGVNVLKKNSVRLAEVWLDDYSK-YYYQR 318
>gi|34452725|ref|NP_003765.2| polypeptide N-acetylgalactosaminyltransferase 4 [Homo sapiens]
gi|338817878|sp|Q8N4A0.2|GALT4_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
AltName: Full=Polypeptide GalNAc transferase 4;
Short=GalNAc-T4; Short=pp-GaNTase 4; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 4;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 4
gi|119617834|gb|EAW97428.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Homo
sapiens]
Length = 578
Score = 270 bits (691), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 157/376 (41%), Positives = 214/376 (56%), Gaps = 28/376 (7%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHI 58
RP++K +PP + GE GKA L E + + Y +N+ S+ I
Sbjct: 61 RPLYK--------KPPAD--SRALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRI 110
Query: 59 SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
S R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EI
Sbjct: 111 SLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEI 170
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
ILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD H
Sbjct: 171 ILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCH 229
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKE 236
CE WL PLL I D + PVID ID+ T+EF + EP G F+W + ++
Sbjct: 230 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIGEP---MIGGFDWRLTFQW 286
Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
+ +P++E +R +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W
Sbjct: 287 HSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVW 346
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
CGG +E PCS +GHV+ PY P N R E W DE +K +FY
Sbjct: 347 QCGGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYN 396
Query: 357 REPLAMFLDMGDISEQ 372
R P A GDISE+
Sbjct: 397 RNPPARKEAYGDISER 412
>gi|332839987|ref|XP_003313889.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pan
troglodytes]
gi|397505857|ref|XP_003823459.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pan
paniscus]
gi|410207422|gb|JAA00930.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252142|gb|JAA14038.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252144|gb|JAA14039.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252146|gb|JAA14040.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252148|gb|JAA14041.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252150|gb|JAA14042.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410289758|gb|JAA23479.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410355493|gb|JAA44350.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410355495|gb|JAA44351.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
Length = 578
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 153/374 (40%), Positives = 210/374 (56%), Gaps = 24/374 (6%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
RP++K +PP + + G L E + + Y +N+ S+ IS
Sbjct: 61 RPLYK--------KPPADSHALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRISL 112
Query: 61 DRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EIIL
Sbjct: 113 HRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIIL 172
Query: 120 VDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
VDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE
Sbjct: 173 VDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENE 238
WL PLL I D + PVID ID+ T+EF EP G F+W + ++ +
Sbjct: 232 CNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHS 288
Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
+P++E +R +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W C
Sbjct: 289 VPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQC 348
Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
GG +E PCS +GHV+ PY P N R E W DE +K +FY R
Sbjct: 349 GGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRN 398
Query: 359 PLAMFLDMGDISEQ 372
P A GDISE+
Sbjct: 399 PPARKEAYGDISER 412
>gi|240120031|ref|NP_766039.2| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|240120034|ref|NP_001155239.1| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|240120036|ref|NP_001155240.1| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|51315988|sp|Q8C7U7.1|GALT6_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
AltName: Full=Polypeptide GalNAc transferase 6;
Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 6;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6
gi|26339910|dbj|BAC33618.1| unnamed protein product [Mus musculus]
gi|74196150|dbj|BAE32989.1| unnamed protein product [Mus musculus]
gi|74198297|dbj|BAE35316.1| unnamed protein product [Mus musculus]
gi|111601267|gb|AAI19325.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|111601271|gb|AAI19327.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 [Mus musculus]
Length = 622
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 154/370 (41%), Positives = 216/370 (58%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+ E + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NSPGADGKAFQKKEWTNLETKEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ +PA L+EIILVDD S+
Sbjct: 164 ECVDQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASTDE 222
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L ++LE Y+Q+ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKERLEQYVQQLQ-IVRVVRQRERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D+ + P I ID T++F R V H RG F+W + + LPE E ++
Sbjct: 282 LLARIAEDKTAVVSPDIVTIDLNTFQFSRPVQRGKAHSRGNFDWSLTFGWEMLPEHEKQR 341
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +A+F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL-- 364
CS +GHV+R+ P+ F K +I N R+ E W D+ +K FY R A +
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDD-YKKIFYRRNLQAAKMVQ 455
Query: 365 --DMGDISEQ 372
+ GDISE+
Sbjct: 456 ENNFGDISER 465
>gi|22137798|gb|AAH36390.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Homo
sapiens]
gi|123981562|gb|ABM82610.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
gi|123996387|gb|ABM85795.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
gi|124000643|gb|ABM87830.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
gi|157928222|gb|ABW03407.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
Length = 578
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 157/376 (41%), Positives = 214/376 (56%), Gaps = 28/376 (7%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHI 58
RP++K +PP + GE GKA L E + + Y +N+ S+ I
Sbjct: 61 RPLYK--------KPPAD--SRALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRI 110
Query: 59 SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
S R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EI
Sbjct: 111 SLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEI 170
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
ILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD H
Sbjct: 171 ILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCH 229
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKE 236
CE WL PLL I D + PVID ID+ T+EF + EP G F+W + ++
Sbjct: 230 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIGEP---MIGGFDWRLTFQW 286
Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
+ +P++E +R +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W
Sbjct: 287 HSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVW 346
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
CGG +E PCS +GHV+ PY P N R E W DE +K +FY
Sbjct: 347 QCGGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYN 396
Query: 357 REPLAMFLDMGDISEQ 372
R P A GDISE+
Sbjct: 397 RNPPARKEAYGDISER 412
>gi|332221068|ref|XP_003259680.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
1 [Nomascus leucogenys]
Length = 578
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 153/374 (40%), Positives = 210/374 (56%), Gaps = 24/374 (6%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
RP++K +PP + + G L E + + Y +N+ S+ IS
Sbjct: 61 RPLYK--------KPPADSHALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRISL 112
Query: 61 DRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EIIL
Sbjct: 113 HRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIIL 172
Query: 120 VDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
VDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE
Sbjct: 173 VDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENE 238
WL PLL I D + PVID ID+ T+EF EP G F+W + ++ +
Sbjct: 232 CNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHS 288
Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
+P++E +R +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W C
Sbjct: 289 VPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQC 348
Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
GG +E PCS +GHV+ PY P N R E W DE +K +FY R
Sbjct: 349 GGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRN 398
Query: 359 PLAMFLDMGDISEQ 372
P A GDISE+
Sbjct: 399 PPARKEAYGDISER 412
>gi|315221121|ref|NP_001186710.1| POC1B-GALNT4 protein isoform 1 [Homo sapiens]
Length = 575
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 157/376 (41%), Positives = 214/376 (56%), Gaps = 28/376 (7%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHI 58
RP++K +PP + GE GKA L E + + Y +N+ S+ I
Sbjct: 58 RPLYK--------KPPAD--SRALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRI 107
Query: 59 SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
S R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EI
Sbjct: 108 SLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEI 167
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
ILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD H
Sbjct: 168 ILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCH 226
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKE 236
CE WL PLL I D + PVID ID+ T+EF + EP G F+W + ++
Sbjct: 227 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIGEP---MIGGFDWRLTFQW 283
Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
+ +P++E +R +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W
Sbjct: 284 HSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVW 343
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
CGG +E PCS +GHV+ PY P N R E W DE +K +FY
Sbjct: 344 QCGGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYN 393
Query: 357 REPLAMFLDMGDISEQ 372
R P A GDISE+
Sbjct: 394 RNPPARKEAYGDISER 409
>gi|395820104|ref|XP_003783415.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
[Otolemur garnettii]
Length = 582
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 153/352 (43%), Positives = 203/352 (57%), Gaps = 18/352 (5%)
Query: 25 GEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD-L 81
GE GKA L E + + Y +N+ S+ IS R I D RM ECK + L
Sbjct: 79 GEWGKASKLQLNEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSKKFNYRRL 138
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L +LE YI
Sbjct: 139 PTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLE 198
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
+VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL I D +
Sbjct: 199 -RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGRDETAVVC 257
Query: 202 PVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
PVID ID+ T+EF EP G F+W + ++ + +P+ E +RK +P +SPT A
Sbjct: 258 PVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKSRIDPIRSPTMA 314
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS +GHV+ PY
Sbjct: 315 GGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPKRAPY 374
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
P N R E W DE +K +FY R P A GDISE+
Sbjct: 375 ---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKETYGDISER 416
>gi|91089275|ref|XP_970398.1| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
castaneum]
Length = 586
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 152/349 (43%), Positives = 202/349 (57%), Gaps = 13/349 (3%)
Query: 14 EPPLEPYKEGPGEGGKAYHLPEAYRA----AGDASLGEYGMNMETSNHISFDRTIPDLRM 69
+P L P GE GK LP A DA + N S+ IS R++PD R
Sbjct: 72 KPVLLPPASNAGEMGKPVVLPSNLSADVKKLVDAGWQKNAFNQYVSDMISVHRSLPDPRD 131
Query: 70 EECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
E CK + LP+ SVI+ FHNE +S L+RTVHS++ R+P+ ++E+ILVDDFS
Sbjct: 132 EWCKAPGRFQEALPQTSVIICFHNEAWSVLLRTVHSVLDRSPSHLIKEVILVDDFSDMDH 191
Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
L Q+L DY KV++IR +REGLIR R GA + GEV+ +LD+HCE WL PL
Sbjct: 192 LKQQLVDYFAS-EPKVKIIRAKKREGLIRARLLGAAHAEGEVLTYLDSHCECTTGWLEPL 250
Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
L I D + PVID ID T E+ ++ G F+W + + + +PE E K+ K
Sbjct: 251 LDRIARDPTTVVCPVIDVIDDTTLEYH-FHDSGGVNVGGFDWNLQFNWHAVPEHEKKRHK 309
Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
+EP SPT AGGLF++D+ FF LG YD G +WGGEN ELSFK WMCGG++E VPCS
Sbjct: 310 NPAEPVYSPTMAGGLFSIDKKFFERLGTYDNGFDIWGGENLELSFKTWMCGGTLEIVPCS 369
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
+GH++R PY + R ++ N R+ E W DE K Y+Y R
Sbjct: 370 HVGHIFRKRSPYKW-----RSGVNVLRRNSVRLAEVWLDEYAK-YYYQR 412
>gi|402865469|ref|XP_003896945.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5 [Papio
anubis]
Length = 475
Score = 270 bits (690), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 139/331 (41%), Positives = 202/331 (61%), Gaps = 17/331 (5%)
Query: 45 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
L +YG N+ S + +R +PD R + C YP LP AS+++ FHNE F +L RTV S
Sbjct: 129 LLKYGFNVIISRSLGIEREVPDTRNKMCLQKHYPARLPTASIVICFHNEEFHALFRTVSS 188
Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
++ TP +LEEIILVDD S DL +KL+ +++ F GK+++IRN +REGLIR R GA
Sbjct: 189 VMNLTPHYFLEEIILVDDMSEVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGAS 248
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID ID +T E Y+P
Sbjct: 249 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPVV 304
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RG F+W + +K + + E + ++P +SP +GG+FA+ R +F E+G YD + W
Sbjct: 305 RGAFDWNLQFKWDNVFSYEMDGPEGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFW 364
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLIT---YNYKRV 341
GGEN ELS +IWMCGG + +PCSR+GH+ K R +I+ +NY R+
Sbjct: 365 GGENLELSLRIWMCGGQLFIIPCSRVGHI---------SKKQTRKTSAIISATIHNYLRL 415
Query: 342 IETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE +K F+ R+P ++ G+I E+
Sbjct: 416 VHVWLDE-YKEQFFLRKPGLKYVTYGNIHER 445
>gi|426221067|ref|XP_004004733.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Ovis
aries]
Length = 938
Score = 270 bits (690), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 149/360 (41%), Positives = 218/360 (60%), Gaps = 14/360 (3%)
Query: 16 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
P +P + PG+ G+ +P + E N+ S+ I DR I D R C
Sbjct: 430 PRDP--KAPGQFGRPVVVPHGKEKEVERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQ 487
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
+LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+
Sbjct: 488 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDK 547
Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
Y+ +F KVR++R ER GLIR R GA+++ G+V+ FLD+H E + WL PLL +Y
Sbjct: 548 YMSQF-PKVRILRLKERHGLIRARLAGAQKATGDVLTFLDSHVECNIGWLEPLLERVYLS 606
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK + ++
Sbjct: 607 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVVAKNKIKETDII 663
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
+ P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 664 RCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 723
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
R+ PY+F K DR+K + N RV E W DE +K FY + LD+G++++Q
Sbjct: 724 RNDNPYSFPK--DRMK--TVERNLGRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 778
>gi|6329812|dbj|BAA86444.1| KIAA1130 protein [Homo sapiens]
Length = 575
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 146/346 (42%), Positives = 201/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 104 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 162
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 163 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 217
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 218 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 277
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D+
Sbjct: 278 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 334
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 335 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 389
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 390 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 433
>gi|426373643|ref|XP_004053705.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Gorilla
gorilla gorilla]
Length = 578
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 153/374 (40%), Positives = 210/374 (56%), Gaps = 24/374 (6%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
RP++K +PP + + G L E + + Y +N+ S+ IS
Sbjct: 61 RPLYK--------KPPADSHALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRISL 112
Query: 61 DRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EIIL
Sbjct: 113 HRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIIL 172
Query: 120 VDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
VDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE
Sbjct: 173 VDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENE 238
WL PLL I D + PVID ID+ T+EF EP G F+W + ++ +
Sbjct: 232 CNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHS 288
Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
+P++E +R +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W C
Sbjct: 289 VPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQC 348
Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
GG +E PCS +GHV+ PY P N R E W DE +K +FY R
Sbjct: 349 GGKLEIHPCSHVGHVFPKRAPY---------ARPNFLRNTARAAEVWMDE-YKEHFYNRN 398
Query: 359 PLAMFLDMGDISEQ 372
P A GDISE+
Sbjct: 399 PPARKEAYGDISER 412
>gi|62122367|dbj|BAD93178.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 16 [Homo sapiens]
gi|119601393|gb|EAW80987.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1, isoform CRA_b
[Homo sapiens]
gi|168269696|dbj|BAG09975.1| polypeptide N-acetylgalactosaminyltransferase-like protein 1
[synthetic construct]
Length = 542
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 146/346 (42%), Positives = 201/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400
>gi|397513815|ref|XP_003827203.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Pan paniscus]
Length = 532
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 146/366 (39%), Positives = 206/366 (56%), Gaps = 17/366 (4%)
Query: 6 ADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
AD L + +P + + + + +L GD Y N S IS +R +P
Sbjct: 15 ADSGLSSSQPSDADWDDLWDQFDERRYLNAKKWRVGDDPYKLYAFNQRESERISSNRAVP 74
Query: 66 DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
D R C Y DLP S+I+ FHNE S+L+RT+ S+I RTP + EIILVDDFS+
Sbjct: 75 DTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVINRTPTHLIREIILVDDFSN 134
Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
D ++L KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL
Sbjct: 135 DPDDCKQLIKL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWL 189
Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
PLL + D + PVID I+ T+ + E RG F+W + ++ +L +
Sbjct: 190 QPLLHRVKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKA 246
Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
+R +EP ++P AGGLF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E V
Sbjct: 247 RRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIV 306
Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMF 363
PCSR+GHV+R PY F G TY N KR E W DE +K Y+Y P A+
Sbjct: 307 PCSRVGHVFRKKHPYVFP------DGNANTYIKNTKRTAEVWMDE-YKQYYYAARPFALE 359
Query: 364 LDMGDI 369
G++
Sbjct: 360 RPFGNV 365
>gi|397507535|ref|XP_003824250.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 [Pan
paniscus]
Length = 529
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 146/346 (42%), Positives = 201/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 42 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 100
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 101 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 155
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 156 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 215
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D+
Sbjct: 216 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 272
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 273 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 327
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 328 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 371
>gi|403276501|ref|XP_003929936.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5
[Saimiri boliviensis boliviensis]
Length = 455
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 139/326 (42%), Positives = 202/326 (61%), Gaps = 11/326 (3%)
Query: 47 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
+YG N+ S + R +PD R + C YP+ LP AS+++ F+NE F++L RTV SI
Sbjct: 111 KYGFNIIISRSLGIKREVPDTRSKMCLQKRYPVRLPTASIVICFYNEEFNALFRTVSSIW 170
Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
TP LEEIILVDD S DL +KL+ +++ F GK+++IRN +REGLIR R GA +
Sbjct: 171 NLTPHHCLEEIILVDDMSKVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGASHA 230
Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
G+V+VFLD+HCEV WL PLL I D K++ PVID ID +T + Y+P RG
Sbjct: 231 SGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPVIDVIDDRTLK----YKPSPVVRG 286
Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F+W + +K + + E + ++P +SP AGG+FA+ R +F E+G YD + WGG
Sbjct: 287 AFDWNLQFKWDNVFSYEMDGPEGPTKPIRSPAMAGGIFAIRRHYFNEIGQYDKDMDFWGG 346
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
EN ELS +IWMCGG + +PCSR+GH+ + GK ++ + + NY R++ W
Sbjct: 347 ENLELSLRIWMCGGQLFIIPCSRVGHISKK----QPGKGSELINA--VARNYLRLVHVWL 400
Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
DE +K F+ R+P ++ G+ISE+
Sbjct: 401 DE-YKEQFFLRKPGLKYMTYGNISER 425
>gi|270011456|gb|EFA07904.1| hypothetical protein TcasGA2_TC005479 [Tribolium castaneum]
Length = 621
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 152/349 (43%), Positives = 202/349 (57%), Gaps = 13/349 (3%)
Query: 14 EPPLEPYKEGPGEGGKAYHLPEAYRA----AGDASLGEYGMNMETSNHISFDRTIPDLRM 69
+P L P GE GK LP A DA + N S+ IS R++PD R
Sbjct: 72 KPVLLPPASNAGEMGKPVVLPSNLSADVKKLVDAGWQKNAFNQYVSDMISVHRSLPDPRD 131
Query: 70 EECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
E CK + LP+ SVI+ FHNE +S L+RTVHS++ R+P+ ++E+ILVDDFS
Sbjct: 132 EWCKAPGRFQEALPQTSVIICFHNEAWSVLLRTVHSVLDRSPSHLIKEVILVDDFSDMDH 191
Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
L Q+L DY KV++IR +REGLIR R GA + GEV+ +LD+HCE WL PL
Sbjct: 192 LKQQLVDYFAS-EPKVKIIRAKKREGLIRARLLGAAHAEGEVLTYLDSHCECTTGWLEPL 250
Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
L I D + PVID ID T E+ ++ G F+W + + + +PE E K+ K
Sbjct: 251 LDRIARDPTTVVCPVIDVIDDTTLEYH-FHDSGGVNVGGFDWNLQFNWHAVPEHEKKRHK 309
Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
+EP SPT AGGLF++D+ FF LG YD G +WGGEN ELSFK WMCGG++E VPCS
Sbjct: 310 NPAEPVYSPTMAGGLFSIDKKFFERLGTYDNGFDIWGGENLELSFKTWMCGGTLEIVPCS 369
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
+GH++R PY + R ++ N R+ E W DE K Y+Y R
Sbjct: 370 HVGHIFRKRSPYKW-----RSGVNVLRRNSVRLAEVWLDEYAK-YYYQR 412
>gi|242008519|ref|XP_002425051.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212508700|gb|EEB12313.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 657
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 147/338 (43%), Positives = 200/338 (59%), Gaps = 13/338 (3%)
Query: 25 GEGGKAYHLPEAY----RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY-WDYPL 79
GE G+ HLP + D + N S+ IS R +PD R + CK +
Sbjct: 119 GEMGRPVHLPANLTGEIKKLVDEGWSKNAFNQYVSDLISVHRKLPDPRDKWCKEPGRFLQ 178
Query: 80 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
DLP+ SV++ FHNE +S L+RTVHS++ R+P L+EIILVDDFS L ++LEDY+
Sbjct: 179 DLPQTSVVICFHNEAWSVLLRTVHSVLDRSPPNLLKEIILVDDFSDMIHLKKQLEDYMSH 238
Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
+ KV++IR ++REGLIR R GA + V FLD+HCE + WL PLL I D +
Sbjct: 239 YP-KVKIIRASKREGLIRARLLGATRATAPVTTFLDSHCECTVGWLEPLLDRIAKDPTTV 297
Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTH 259
PVID ID T E+ + + G F+W + + + +PERE K+ K +EP SPT
Sbjct: 298 VCPVIDVIDDTTLEY-NFRDSGGVNVGGFDWNLQFNWHAVPEREKKRHKNTAEPVWSPTM 356
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLFA+D+ FF +G YD G +WGGEN ELSFK WMCGG++E VPCS +GH++R P
Sbjct: 357 AGGLFAIDKNFFERIGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRRRSP 416
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
Y + R ++ N R+ E W D+ K Y+Y R
Sbjct: 417 YKW-----RSGVNVLKRNSVRLAEVWLDDYAK-YYYQR 448
>gi|68534728|gb|AAH98578.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
gi|158260513|dbj|BAF82434.1| unnamed protein product [Homo sapiens]
Length = 558
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 146/346 (42%), Positives = 201/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADMAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400
>gi|402876549|ref|XP_003902024.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 [Papio
anubis]
Length = 558
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 146/346 (42%), Positives = 201/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400
>gi|332228990|ref|XP_003263671.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Nomascus leucogenys]
Length = 558
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 146/346 (42%), Positives = 201/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400
>gi|297695402|ref|XP_002824932.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pongo abelii]
Length = 558
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 146/346 (42%), Positives = 201/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400
>gi|270265820|ref|NP_065743.2| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Homo sapiens]
gi|270265827|ref|NP_001161840.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Homo sapiens]
gi|332842578|ref|XP_522885.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|51316024|sp|Q8N428.2|GLTL1_HUMAN RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1;
AltName: Full=Polypeptide GalNAc transferase-like
protein 1; Short=GalNAc-T-like protein 1;
Short=pp-GaNTase-like protein 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase-like
protein 1; AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase-like protein 1
gi|51490858|emb|CAD44534.1| polypeptide N-acetylgalactosaminyltransferase 16 [Homo sapiens]
gi|112180422|gb|AAH36812.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
gi|112818460|gb|AAI22546.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
gi|119601392|gb|EAW80986.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1, isoform CRA_a
[Homo sapiens]
gi|119601394|gb|EAW80988.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1, isoform CRA_a
[Homo sapiens]
gi|164691113|dbj|BAF98739.1| unnamed protein product [Homo sapiens]
gi|410265456|gb|JAA20694.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
Length = 558
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 146/346 (42%), Positives = 201/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400
>gi|15207811|dbj|BAB62930.1| hypothetical protein [Macaca fascicularis]
Length = 373
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 139/331 (41%), Positives = 202/331 (61%), Gaps = 17/331 (5%)
Query: 45 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
L +YG N+ S + +R +PD R + C YP LP AS+++ FHNE F +L RTV S
Sbjct: 27 LLKYGFNVIISRSLGIEREVPDTRNKMCLQKHYPARLPTASIVICFHNEEFHALFRTVSS 86
Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
++ TP +LEEIILVDD S DL +KL+ +++ F GK+++IRN +REGLIR R GA
Sbjct: 87 VMNLTPHYFLEEIILVDDMSEVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGAS 146
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID ID +T E Y+P
Sbjct: 147 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPVV 202
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RG F+W + +K + + E + ++P +SP +GG+FA+ R +F E+G YD + W
Sbjct: 203 RGAFDWNLQFKWDNVFSYEMDGPEGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFW 262
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLIT---YNYKRV 341
GGEN ELS +IWMCGG + +PCSR+GH+ K R +I+ +NY R+
Sbjct: 263 GGENLELSLRIWMCGGQLFIIPCSRVGHI---------SKKQTRKTSAIISATIHNYLRL 313
Query: 342 IETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE +K F+ R+P ++ G+I E+
Sbjct: 314 VHVWLDE-YKEQFFLRKPGLKYVTYGNIHER 343
>gi|354482531|ref|XP_003503451.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Cricetulus griseus]
Length = 929
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 150/373 (40%), Positives = 220/373 (58%), Gaps = 18/373 (4%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
V + D L +P PG+ G+ +P + + E N+ S+ I DR
Sbjct: 412 VLRIDESLSPRDP------NAPGQFGRPVVVPPGKKEEAERRWKEGNFNVYLSDLIPVDR 465
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
I D R EC DLP S+I+ F +E +S+L+R+VHSI+ R+P ++EI+LVDD
Sbjct: 466 AIEDTRPAECAEQLVHNDLPTTSIIMCFVDEVWSALLRSVHSILNRSPPHLIKEILLVDD 525
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
FS+K L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E +
Sbjct: 526 FSTKDYLKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 584
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-E 241
WL PLL +Y +RK + PVI+ I+ + + +V D+ RG+F W M + +P +
Sbjct: 585 GWLEPLLERVYLNRKKVACPVIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWRTIPPD 641
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
AK ++ + P GLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG
Sbjct: 642 VVAKSGIKETDIIRCPVMGCGLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGE 701
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EP 359
IE +PCSR+GH++R+ PY+F K DR+K + N RV E W DE +K FY
Sbjct: 702 IEIIPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHL 756
Query: 360 LAMFLDMGDISEQ 372
+ LD+G++++Q
Sbjct: 757 IDQGLDVGNLTQQ 769
>gi|443683118|gb|ELT87486.1| hypothetical protein CAPTEDRAFT_155466 [Capitella teleta]
Length = 644
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 133/320 (41%), Positives = 198/320 (61%), Gaps = 10/320 (3%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
++ + + N+ S + DR IPD R +C + L +VI+ FHNE +S+L+RT
Sbjct: 165 ESGMQRHSFNVRASELLPLDRPIPDYRPTQCPSINQST-LSPTTVIICFHNEAWSTLLRT 223
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSR 161
+HS+I R+P+ + EIILVDD S+ L + LE+++ + V L+R REGLIR R
Sbjct: 224 LHSVINRSPSHLIMEIILVDDASTFDYLGEPLENHLSQLEN-VYLLRTKIREGLIRARLL 282
Query: 162 GAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPD 221
G ++G+V+VFLD+HCE WLPPLL I +DR + P++D I++QT+E+R+ E
Sbjct: 283 GVSYAKGDVLVFLDSHCECAEGWLPPLLLAIEADRTKIVCPLVDVIEFQTFEYRAAKEEL 342
Query: 222 HHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGL 281
H G F+W + + +LPE E K+R ++ ++PT GGLFA+DR +F +G YD G+
Sbjct: 343 H---GAFDWNLQFIWKDLPEHEMKRRTSPADNIRAPTIIGGLFAVDRLYFKRIGSYDSGM 399
Query: 282 LVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRV 341
+WG EN ELSF++WMCGGS+E PCSR+GHV+R+ +PY F R I N R
Sbjct: 400 DIWGSENLELSFRVWMCGGSLEISPCSRVGHVFRTRIPYGFPNGGKRT----IRNNAMRA 455
Query: 342 IETWFDEKHKAYFYTREPLA 361
E W D+ +K +FY + +
Sbjct: 456 AEVWLDD-YKKFFYASQNIT 474
>gi|344273523|ref|XP_003408571.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Loxodonta africana]
Length = 555
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 201/343 (58%), Gaps = 22/343 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY AA GE + N S+ +S DR I D R C Y LDLP SVI+
Sbjct: 71 KAYLAAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSLDLPATSVIIT 130
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
+REGLIR+R RGA + ++ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 186 DQREGLIRSRVRGADVAVAAILTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 245
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKISRTDPTKPIRTPVIAGGIFVIDKS 302
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
+G +TY N KR E W DE +K Y+Y P A+ G ++
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVA 398
>gi|15207947|dbj|BAB62998.1| hypothetical protein [Macaca fascicularis]
Length = 443
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 139/331 (41%), Positives = 202/331 (61%), Gaps = 17/331 (5%)
Query: 45 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
L +YG N+ S + +R +PD R + C YP LP AS+++ FHNE F +L RTV S
Sbjct: 97 LLKYGFNVIISRSLGIEREVPDTRNKMCLQKHYPARLPTASIVICFHNEEFHALFRTVSS 156
Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
++ TP +LEEIILVDD S DL +KL+ +++ F GK+++IRN +REGLIR R GA
Sbjct: 157 VMNLTPHYFLEEIILVDDMSEVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGAS 216
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID ID +T E Y+P
Sbjct: 217 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPVV 272
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RG F+W + +K + + E + ++P +SP +GG+FA+ R +F E+G YD + W
Sbjct: 273 RGAFDWNLQFKWDNVFSYEMDGPEGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFW 332
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLIT---YNYKRV 341
GGEN ELS +IWMCGG + +PCSR+GH+ K R +I+ +NY R+
Sbjct: 333 GGENLELSLRIWMCGGQLFIIPCSRVGHI---------SKKQTRKTSAIISATIHNYLRL 383
Query: 342 IETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE +K F+ R+P ++ G+I E+
Sbjct: 384 VHVWLDE-YKEQFFLRKPGLKYVTYGNIHER 413
>gi|403264517|ref|XP_003924524.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Saimiri boliviensis boliviensis]
Length = 558
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 146/346 (42%), Positives = 201/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y LDLP SVI+
Sbjct: 71 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSMSYSLDLPATSVII 129
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA + V+ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVIS 244
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVASR 400
>gi|189053556|dbj|BAG35722.1| unnamed protein product [Homo sapiens]
Length = 578
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 157/376 (41%), Positives = 213/376 (56%), Gaps = 28/376 (7%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHI 58
RP++K +PP + GE GKA L E + + Y +N+ S+ I
Sbjct: 61 RPLYK--------KPPAD--SRALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRI 110
Query: 59 SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
S R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EI
Sbjct: 111 SLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEI 170
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
ILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD H
Sbjct: 171 ILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCH 229
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKE 236
CE WL PLL I D + PVID ID+ T+EF EP G F+W + ++
Sbjct: 230 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQW 286
Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
+ +P++E +R +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W
Sbjct: 287 HSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVW 346
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
CGG +E PCS +GHV+ PY P N R E W DE +K +FY
Sbjct: 347 QCGGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYN 396
Query: 357 REPLAMFLDMGDISEQ 372
R P A GDISE+
Sbjct: 397 RNPPARKEAYGDISER 412
>gi|77736615|ref|NP_001020224.2| polypeptide N-acetylgalactosaminyltransferase 4 [Rattus norvegicus]
gi|76780269|gb|AAI05819.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Rattus
norvegicus]
gi|149067086|gb|EDM16819.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Rattus
norvegicus]
Length = 578
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 149/361 (41%), Positives = 204/361 (56%), Gaps = 16/361 (4%)
Query: 14 EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
+PP + + G L E + + Y +N+ S+ IS R I D RM ECK
Sbjct: 66 KPPADSHALGEWGRASKLQLDEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECK 125
Query: 74 YWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
+ LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L +
Sbjct: 126 AKKFHYRSLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRIYLKAQ 185
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL I
Sbjct: 186 LEAYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLERI 244
Query: 193 YSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 251
D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 245 SRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRTSRI 301
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS +G
Sbjct: 302 DPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVG 361
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+ PY P N R E W D+ +K +FY R P A GDISE
Sbjct: 362 HVFPKRAPY---------ARPNFLQNTARAAEVWMDD-YKEHFYNRNPPARKETYGDISE 411
Query: 372 Q 372
+
Sbjct: 412 R 412
>gi|26325284|dbj|BAC26396.1| unnamed protein product [Mus musculus]
Length = 930
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 142/352 (40%), Positives = 215/352 (61%), Gaps = 10/352 (2%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
PG+ G+ +P + + E N+ S+ I DR I D R C DLP
Sbjct: 427 APGQFGRPVVVPPEKKKEAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNDLP 486
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 487 TTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFSTKEYLKADLDKYMSQF-P 545
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y +RK + P
Sbjct: 546 KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLNRKKVACP 605
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPYKSPTHAG 261
VI+ I+ + + +V D+ RG+F W M + +P + AK ++ + P AG
Sbjct: 606 VIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWKTIPPDVVAKNGIKETDIIRCPVMAG 662
Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
GLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++R+ PY+
Sbjct: 663 GLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIELIPCSRVGHIFRNDNPYS 722
Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYF-YTREPLAMFLDMGDISEQ 372
F K DR+K + N RV E W D+ + ++ + + LD+G++++Q
Sbjct: 723 FPK--DRMKT--VERNLVRVAEVWLDDYRELFYGHGDHLIDQGLDVGNLTQQ 770
>gi|345782166|ref|XP_540140.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Canis
lupus familiaris]
Length = 552
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 142/336 (42%), Positives = 194/336 (57%), Gaps = 19/336 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSSRAVPDTRHLRCTMLVYCADLPPTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
RT+ S++ RTP ++EIILVDDFS+ D D +Q KV+ IRN+ER+GL+R+
Sbjct: 129 RTIRSVLNRTPMNLIQEIILVDDFSNDPD------DCLQLIKLPKVKCIRNSERQGLVRS 182
Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
R RGA ++G + FLD+HCEV +WL PLL + D + PVID I + +
Sbjct: 183 RIRGANVAKGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIISLDNFNY---I 239
Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
E RG F+W + ++ +L + +R +EP ++P AGGLF MD+++F LG YD
Sbjct: 240 ESAAELRGGFDWSLHFQWEQLSPEQKARRLDPAEPIRTPIIAGGLFVMDKSWFNYLGKYD 299
Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY
Sbjct: 300 TDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIK 353
Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
N KR E W DE +K Y+Y P A+ G+I +
Sbjct: 354 NTKRTAEVWMDE-YKQYYYAARPFALERPFGNIESR 388
>gi|410965222|ref|XP_003989149.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Felis
catus]
Length = 582
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 151/352 (42%), Positives = 204/352 (57%), Gaps = 18/352 (5%)
Query: 25 GEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD-L 81
GE GKA L + + + Y +N+ S+ IS R I D RM ECK + L
Sbjct: 79 GEWGKASKLQLSQDELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSQKFNYRRL 138
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L +LE YI +
Sbjct: 139 PTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLD 198
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
+VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL I D +
Sbjct: 199 -RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGKDETAIVC 257
Query: 202 PVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
PVID ID+ T+EF EP G F+W + ++ + +P+ E +RK +P +SPT A
Sbjct: 258 PVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKSRIDPIRSPTMA 314
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS +GHV+ PY
Sbjct: 315 GGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPKRAPY 374
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
P N R E W D+ +K +FY R P A GDISE+
Sbjct: 375 ---------ARPNFLQNTARAAEVWMDQ-YKEHFYNRNPPARKEAYGDISER 416
>gi|426335179|ref|XP_004029110.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Gorilla gorilla gorilla]
Length = 532
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 145/366 (39%), Positives = 206/366 (56%), Gaps = 17/366 (4%)
Query: 6 ADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
AD L + +P + + + + +L GD Y N S IS +R +P
Sbjct: 15 ADSGLSSSQPSDADWDDVWDQFDERRYLNAKKWRVGDDPYKLYAFNQRESERISSNRAVP 74
Query: 66 DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
D R C Y DLP S+I+ FHNE S+L+RT+ S++ RTP + EIILVDDFS+
Sbjct: 75 DTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSN 134
Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
D ++L KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL
Sbjct: 135 DPDDCKQLIKL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWL 189
Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
PLL + D + PVID I+ T+ + E RG F+W + ++ +L +
Sbjct: 190 QPLLHRVKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKA 246
Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
+R +EP ++P AGGLF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E V
Sbjct: 247 RRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIV 306
Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMF 363
PCSR+GHV+R PY F G TY N KR E W DE +K Y+Y P A+
Sbjct: 307 PCSRVGHVFRKKHPYVFP------DGNANTYIKNTKRTAEVWMDE-YKQYYYAARPFALE 359
Query: 364 LDMGDI 369
G++
Sbjct: 360 RPFGNV 365
>gi|410955524|ref|XP_003984401.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Felis
catus]
Length = 552
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 141/334 (42%), Positives = 194/334 (58%), Gaps = 17/334 (5%)
Query: 41 GDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMR 100
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+R
Sbjct: 70 GDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCADLPPTSIIITFHNEARSTLLR 129
Query: 101 TVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRS 160
T+ S++ RTP ++EIILVDDFS+ D +L KV+ IRNTER+GL+R+R
Sbjct: 130 TIRSVLNRTPMNLIQEIILVDDFSNDPDDCSQLIKL-----PKVKCIRNTERQGLVRSRI 184
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
RGA ++G + FLD+HCEV +WL PLL + D + PVID I + + E
Sbjct: 185 RGASVAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIISLDNFNY---IES 241
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F+W + ++ +L + +R +EP ++P AGGLF MD+++F LG YD
Sbjct: 242 AAELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFEYLGKYDTD 301
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NY 338
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N
Sbjct: 302 MDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKNT 355
Query: 339 KRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
KR E W DE +K Y+Y P A+ G++ +
Sbjct: 356 KRTAEVWMDE-YKQYYYAARPFALERPFGNVESR 388
>gi|195471079|ref|XP_002087833.1| GE18238 [Drosophila yakuba]
gi|194173934|gb|EDW87545.1| GE18238 [Drosophila yakuba]
Length = 659
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 149/328 (45%), Positives = 205/328 (62%), Gaps = 16/328 (4%)
Query: 49 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
G N S+ IS +R++PD+R+E CK Y LP SV+ +F NE F++L+R+++S+I R
Sbjct: 160 GFNGLISDRISLNRSVPDIRLEACKTRKYLAKLPNISVVFIFFNEHFNTLLRSIYSVINR 219
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESR 167
TP + L +I+LVDD S L Q L+DY+ Q F V ++ + ER+GLI R GAK +
Sbjct: 220 TPPELLRQIVLVDDGSEWDSLKQPLDDYVAQHFPHLVTVVHSPERQGLIGARLAGAKVAV 279
Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
GEV+VF D+H EV NWLPPL+ PI + KI T P++D I ++ + + S + RG
Sbjct: 280 GEVMVFFDSHIEVNYNWLPPLIEPIAINPKIATCPMVDTIAHEDFSYFSGNKDG--ARGG 337
Query: 228 FEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F+W MLYK+ LPE K S PY+SP GGLFA++ FF +LGGYD L +WGG
Sbjct: 338 FDWKMLYKQLPVLPEDALDK----SMPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGG 393
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG-PLITYNYKRVIETW 345
E +ELSFKIWMCGG + VPCSR+GH++R M K +G + N+KRV E W
Sbjct: 394 EQYELSFKIWMCGGLLLDVPCSRVGHIFRGPM-----KPRGNPRGHNFVAKNHKRVAEVW 448
Query: 346 FDEKHKAYFYTREPLAM-FLDMGDISEQ 372
DE +K Y Y R+P +D GD++ Q
Sbjct: 449 MDE-YKEYVYKRDPATYDNVDAGDLTRQ 475
>gi|354472196|ref|XP_003498326.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Cricetulus griseus]
Length = 513
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 145/345 (42%), Positives = 203/345 (58%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY +A GE + N S+ +S DR I D R C Y LDLP SVI+
Sbjct: 26 KAYLSAKQLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSLSYSLDLPATSVIIT 85
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 86 FHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 140
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
+REGLIR+R RGA + V+ FLD+HCEV + WL P+L + D + P+ID I
Sbjct: 141 DKREGLIRSRVRGADVAGATVLTFLDSHCEVNIEWLQPMLQRVMEDHTRVVSPIIDVISL 200
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D++
Sbjct: 201 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKS 257
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 258 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 311
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 312 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 355
>gi|161077154|ref|NP_725603.2| CG30463, isoform B [Drosophila melanogaster]
gi|161077156|ref|NP_001097341.1| CG30463, isoform C [Drosophila melanogaster]
gi|157400365|gb|AAF57964.3| CG30463, isoform B [Drosophila melanogaster]
gi|157400366|gb|ABV53822.1| CG30463, isoform C [Drosophila melanogaster]
Length = 647
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 150/357 (42%), Positives = 206/357 (57%), Gaps = 28/357 (7%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
++PP ++E PGE GK LP + + A D + N S+ IS RT+PD R
Sbjct: 136 IDPPAN-FEENPGELGKPVRLPKEMSDEMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 194
Query: 69 MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
CK Y +LPK VI+ FHNE ++ L+RTVHS++ R+P + +IILVDD+S
Sbjct: 195 DAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 254
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL P
Sbjct: 255 HLKRQLEDYFAAY-PKVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 313
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
LL I + + PVID I +T E+ HYR G F+W + + + +P
Sbjct: 314 LLDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 365
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
ERE K+ +EP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG
Sbjct: 366 ERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGG 425
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
++E VPCS +GH++R PY + R ++ N R+ E W DE + Y+Y R
Sbjct: 426 TLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKKNSVRLAEVWMDE-YSQYYYHR 476
>gi|359465585|ref|NP_001240756.1| polypeptide N-acetylgalactosaminyltransferase 14 isoform 3 [Homo
sapiens]
gi|119620894|gb|EAX00489.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
isoform CRA_d [Homo sapiens]
gi|193783719|dbj|BAG53701.1| unnamed protein product [Homo sapiens]
Length = 532
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 146/366 (39%), Positives = 206/366 (56%), Gaps = 17/366 (4%)
Query: 6 ADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
AD L + +P + + + + +L GD Y N S IS +R IP
Sbjct: 15 ADSGLSSSQPSDADWDDLWDQFDERRYLNAKKWRVGDDPYKLYAFNQRESERISSNRAIP 74
Query: 66 DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
D R C Y DLP S+I+ FHNE S+L+RT+ S++ RTP + EIILVDDFS+
Sbjct: 75 DTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSN 134
Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
D ++L KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL
Sbjct: 135 DPDDCKQLIKL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWL 189
Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
PLL + D + PVID I+ T+ + E RG F+W + ++ +L +
Sbjct: 190 QPLLHRVKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKA 246
Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
+R +EP ++P AGGLF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E V
Sbjct: 247 RRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIV 306
Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMF 363
PCSR+GHV+R PY F G TY N KR E W DE +K Y+Y P A+
Sbjct: 307 PCSRVGHVFRKKHPYVFP------DGNANTYIKNTKRTAEVWMDE-YKQYYYAARPFALE 359
Query: 364 LDMGDI 369
G++
Sbjct: 360 RPFGNV 365
>gi|109732606|gb|AAI16333.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 [Mus musculus]
Length = 930
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 142/352 (40%), Positives = 215/352 (61%), Gaps = 10/352 (2%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
PG+ G+ +P + + E N+ S+ I DR I D R C DLP
Sbjct: 427 APGQFGRPVVVPPEKKKEAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNDLP 486
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 487 TTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFSTKEYLKADLDKYMSQF-P 545
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y +RK + P
Sbjct: 546 KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLNRKKVACP 605
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPYKSPTHAG 261
VI+ I+ + + +V D+ RG+F W M + +P + AK ++ + P AG
Sbjct: 606 VIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWKTIPPDVVAKNGIKETDIIRCPVMAG 662
Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
GLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++R+ PY+
Sbjct: 663 GLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIFRNDNPYS 722
Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYF-YTREPLAMFLDMGDISEQ 372
F K DR+K + N RV E W D+ + ++ + + LD+G++++Q
Sbjct: 723 FPK--DRMKT--VERNLVRVAEVWLDDYRELFYGHGDHLIDQGLDVGNLTQQ 770
>gi|426377334|ref|XP_004055422.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Gorilla gorilla gorilla]
Length = 598
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 146/344 (42%), Positives = 200/344 (58%), Gaps = 18/344 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 111 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 169
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 170 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 224
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 225 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 284
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D+
Sbjct: 285 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 341
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 342 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 396
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
+G +TY N KR E W DE +K Y+Y P A+ G ++
Sbjct: 397 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVA 438
>gi|158749624|ref|NP_766443.2| polypeptide N-acetylgalactosaminyltransferase 5 [Mus musculus]
gi|341940730|sp|Q8C102.2|GALT5_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
AltName: Full=Polypeptide GalNAc transferase 5;
Short=GalNAc-T5; Short=pp-GaNTase 5; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 5;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|148694985|gb|EDL26932.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 [Mus musculus]
Length = 930
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 142/352 (40%), Positives = 215/352 (61%), Gaps = 10/352 (2%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
PG+ G+ +P + + E N+ S+ I DR I D R C DLP
Sbjct: 427 APGQFGRPVVVPPEKKKEAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNDLP 486
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
S+I+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 487 TTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFSTKEYLKADLDKYMSQF-P 545
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y +RK + P
Sbjct: 546 KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLNRKKVACP 605
Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPYKSPTHAG 261
VI+ I+ + + +V D+ RG+F W M + +P + AK ++ + P AG
Sbjct: 606 VIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWKTIPPDVVAKNGIKETDIIRCPVMAG 662
Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
GLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++R+ PY+
Sbjct: 663 GLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIFRNDNPYS 722
Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYF-YTREPLAMFLDMGDISEQ 372
F K DR+K + N RV E W D+ + ++ + + LD+G++++Q
Sbjct: 723 FPK--DRMKT--VERNLVRVAEVWLDDYRELFYGHGDHLIDQGLDVGNLTQQ 770
>gi|332227141|ref|XP_003262749.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Nomascus leucogenys]
Length = 532
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 145/366 (39%), Positives = 206/366 (56%), Gaps = 17/366 (4%)
Query: 6 ADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
AD L + +P + + + + +L GD Y N S IS +R +P
Sbjct: 15 ADSGLSSSQPSDADWDDLWDQFDERRYLNAKKWRVGDDPYKLYAFNQRESERISSNRAVP 74
Query: 66 DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
D R C Y DLP S+I+ FHNE S+L+RT+ S++ RTP + EIILVDDFS+
Sbjct: 75 DTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSN 134
Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
D ++L KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL
Sbjct: 135 DPDDCKQLVKL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWL 189
Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
PLL + D + PVID I+ T+ + E RG F+W + ++ +L +
Sbjct: 190 QPLLHRVKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKA 246
Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
+R +EP ++P AGGLF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E V
Sbjct: 247 RRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIV 306
Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMF 363
PCSR+GHV+R PY F G TY N KR E W DE +K Y+Y P A+
Sbjct: 307 PCSRVGHVFRKKHPYVFP------DGNANTYIKNTKRTAEVWMDE-YKQYYYAARPFALE 359
Query: 364 LDMGDI 369
G++
Sbjct: 360 RPFGNV 365
>gi|324506451|gb|ADY42754.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Ascaris suum]
Length = 618
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 150/363 (41%), Positives = 215/363 (59%), Gaps = 33/363 (9%)
Query: 23 GPGEGGKAYHLP----------EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
GPGEGGK +P E YR G + S+ I +R++ D+R ++C
Sbjct: 96 GPGEGGKPVAIPTDPEIKKKQEELYRVNGYDAF--------VSDLIPLNRSVKDIRHKDC 147
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
+ Y LP SVI FH+E S+L+R+ +S+I RTP + L+EIILVDD S+K L +
Sbjct: 148 QNLRYLEALPSVSVIFPFHDEHNSTLLRSAYSVIARTPKEILKEIILVDDASTKPFLKKP 207
Query: 133 LEDYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
L++Y++ + + V+++R +REGLIR R GA+ + +++VFLDAH E NWLPPL+
Sbjct: 208 LDEYLKSAKLDHIVKVVRTKKREGLIRARQIGAQHATADIMVFLDAHSEPNYNWLPPLIE 267
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
PI D + + P +D ID T+E+R+ D RG F+W YK L E + K +
Sbjct: 268 PITLDYRTVVCPFVDVIDCDTFEYRA---QDEGARGSFDWEFNYKRLPLTEDDLK---HP 321
Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
+ P+KSP AGG FA+ R +F ELGGYD GL +WGGE +ELSFK+W C G++ PCSR+
Sbjct: 322 TRPFKSPVMAGGYFAISRKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGNMVDAPCSRV 381
Query: 311 GHVYRS-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
GH+YR +P+ + D I+ NYKRV E W D+ +K Y Y R D GD+
Sbjct: 382 GHIYRCKHVPFPNPGVGD-----FISRNYKRVAEVWMDD-YKKYLYQRRHGMENADEGDL 435
Query: 370 SEQ 372
++Q
Sbjct: 436 TKQ 438
>gi|194882445|ref|XP_001975321.1| GG22251 [Drosophila erecta]
gi|190658508|gb|EDV55721.1| GG22251 [Drosophila erecta]
Length = 721
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 150/357 (42%), Positives = 206/357 (57%), Gaps = 28/357 (7%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
++PP ++E PGE GK LP + + A D + N S+ IS RT+PD R
Sbjct: 135 IDPPAN-FEENPGELGKPVRLPKEMSDDMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 193
Query: 69 MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
CK Y +LPK VI+ FHNE ++ L+RTVHS++ R+P + +IILVDD+S
Sbjct: 194 DAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 253
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL P
Sbjct: 254 HLKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 312
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
LL I + + PVID I +T E+ HYR G F+W + + + +P
Sbjct: 313 LLDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 364
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
ERE K+ +EP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG
Sbjct: 365 ERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGG 424
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
++E VPCS +GH++R PY + R ++ N R+ E W DE + Y+Y R
Sbjct: 425 TLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKKNSVRLAEVWMDE-YSQYYYHR 475
>gi|24654219|ref|NP_725602.1| CG30463, isoform A [Drosophila melanogaster]
gi|161077158|ref|NP_001097342.1| CG30463, isoform D [Drosophila melanogaster]
gi|51316018|sp|Q8MRC9.2|GALT9_DROME RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase 9; Short=pp-GaNTase 9;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 9; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 9
gi|21627105|gb|AAF57966.2| CG30463, isoform A [Drosophila melanogaster]
gi|157400367|gb|ABV53823.1| CG30463, isoform D [Drosophila melanogaster]
Length = 650
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 150/357 (42%), Positives = 206/357 (57%), Gaps = 28/357 (7%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
++PP ++E PGE GK LP + + A D + N S+ IS RT+PD R
Sbjct: 136 IDPPAN-FEENPGELGKPVRLPKEMSDEMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 194
Query: 69 MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
CK Y +LPK VI+ FHNE ++ L+RTVHS++ R+P + +IILVDD+S
Sbjct: 195 DAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 254
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL P
Sbjct: 255 HLKRQLEDYFAAY-PKVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 313
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
LL I + + PVID I +T E+ HYR G F+W + + + +P
Sbjct: 314 LLDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 365
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
ERE K+ +EP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG
Sbjct: 366 ERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGG 425
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
++E VPCS +GH++R PY + R ++ N R+ E W DE + Y+Y R
Sbjct: 426 TLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKKNSVRLAEVWMDE-YSQYYYHR 476
>gi|350582569|ref|XP_003481303.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Sus scrofa]
Length = 552
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 142/336 (42%), Positives = 196/336 (58%), Gaps = 19/336 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S ++ +R +PD R+ C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERVASNRVVPDTRLFRCTLLVYCADLPPTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
RTV SI+ RTP ++EIILVDDFS+ ED Q KV+ +RN ER+GL+R+
Sbjct: 129 RTVRSILNRTPMNLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNNERQGLVRS 182
Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
R RGA ++G + FLD+HCEV +WL PLL + D + PVID I T+++
Sbjct: 183 RIRGADAAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFDY---I 239
Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
E RG F+W + ++ +L + +R +EP ++P AGGLF MD+++F LG YD
Sbjct: 240 ESATELRGGFDWSLHFQWEQLTPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFDYLGKYD 299
Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY
Sbjct: 300 TDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIK 353
Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
N KR E W DE +K Y+Y P A+ G+I +
Sbjct: 354 NTKRTAEVWMDE-YKQYYYASRPFALERPFGNIESR 388
>gi|195385643|ref|XP_002051514.1| GJ11806 [Drosophila virilis]
gi|194147971|gb|EDW63669.1| GJ11806 [Drosophila virilis]
Length = 653
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 154/365 (42%), Positives = 212/365 (58%), Gaps = 33/365 (9%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEY-----GMNMETSNHISFDRTIPDLRMEECKYW 75
+ G GE G LP + +L E G N S+ IS +R++PD+R E+CK
Sbjct: 128 RSGLGEHG----LPATIEDPAEKTLEEQEYRRNGFNGYLSDRISVNRSLPDVRHEKCKTR 183
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
Y LP SV+++F+NE F +L+RTV+SI+ RTP + L +I+LVDD S L +L+
Sbjct: 184 KYLAKLPNVSVVIIFYNEHFQTLLRTVYSIVNRTPKELLHQIVLVDDGSEWETLKDQLDQ 243
Query: 136 YIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
Y+ ++ V ++ N ER GLI R GA+ + GEV+VF D+H EV NWLPPLL PI
Sbjct: 244 YVALQWPHLVDVVHNPERRGLIGARLAGARVATGEVMVFFDSHIEVNYNWLPPLLEPIVI 303
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKKRKYNSEP 253
+ KI T P++D ID+ + + Y+ RG F+W YK+ LPE K S P
Sbjct: 304 NNKISTCPIVDIIDHNNFAYNGGYQ--EGTRGGFDWRFFYKQLPVLPEDSVDK----SLP 357
Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
Y+SP GGLFA++ FF +LGGYD L +WGGE +ELSFKIWMCGG + VPCSR+ H+
Sbjct: 358 YRSPVMMGGLFAINSEFFWDLGGYDDELDIWGGEQYELSFKIWMCGGMLLDVPCSRVAHI 417
Query: 314 YRSFM-----PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMG 367
+R M P N+ + N+KRV E W DE +K + Y R+P +D G
Sbjct: 418 FRGQMDPRPNPRNYN---------FVARNHKRVAEVWMDE-YKEHVYRRDPATYDNIDAG 467
Query: 368 DISEQ 372
D+S Q
Sbjct: 468 DLSRQ 472
>gi|157107408|ref|XP_001649763.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108884049|gb|EAT48274.1| AAEL000646-PA [Aedes aegypti]
Length = 582
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 152/357 (42%), Positives = 218/357 (61%), Gaps = 14/357 (3%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASL-GEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
E +EGPGE GK L + + L E G + S+ I+ +R++PD R +C+
Sbjct: 56 ESKREGPGEHGKPLKLEKLEDIKLNEKLFKENGYSAVVSDMIALNRSVPDARHVQCRKKR 115
Query: 77 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
Y +LP SVI++F+NE +S+L+RTVHSI+ R+P++ L+EI+LV+D S+K L + L+DY
Sbjct: 116 YLQELPTVSVIVIFYNEHWSTLLRTVHSILNRSPSKLLKEIVLVNDHSTKEFLWEPLQDY 175
Query: 137 IQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
++ + KV+L R GLI R GAK + G+V++ LD+H EV +NWLPPL+ PI +
Sbjct: 176 VRSKLPSKVKLFNLPVRSGLIAARLAGAKAATGDVLIVLDSHTEVNVNWLPPLIEPIAEN 235
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYK 255
+ P IDGI + T+E++ E RG F+W LYK LP R + + +EP+
Sbjct: 236 YRTCVCPYIDGIAHDTFEYKPQSE---GRRGAFDWKFLYKR--LPLR-PQDQTDPTEPFD 289
Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
SP AGGLFA+ FF ELGGYD L +WGGE +ELSFKIW CGG + PCS +GHVYR
Sbjct: 290 SPIMAGGLFAISAKFFWELGGYDEELDIWGGEQYELSFKIWQCGGRMVDAPCSHVGHVYR 349
Query: 316 SFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
P+ + + +T N+KRV E W DE +K + + R P D GD+++Q
Sbjct: 350 GLAPFPNPRGTN-----FVTRNFKRVAEVWMDE-YKQFLFERNPEYDKTDAGDLTKQ 400
>gi|397513817|ref|XP_003827204.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
3 [Pan paniscus]
Length = 517
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 141/332 (42%), Positives = 194/332 (58%), Gaps = 17/332 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 34 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 93
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S+I RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 94 RTIRSVINRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 148
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 149 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 205
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + +R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 206 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 265
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N
Sbjct: 266 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 319
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
KR E W DE +K Y+Y P A+ G++
Sbjct: 320 TKRTAEVWMDE-YKQYYYAARPFALERPFGNV 350
>gi|348574564|ref|XP_003473060.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Cavia porcellus]
Length = 552
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 141/335 (42%), Positives = 197/335 (58%), Gaps = 17/335 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHPRCTLLGYHTDLPPTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP ++EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSNDPDDCKQLVRL-----PKVKCLRNGERQGLVRSR 183
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA+ ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 MRGAEIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + +R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 241 SASELRGGFDWSLHFRWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
KR E W DE +K Y+Y P A+ G+I +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNIESR 388
>gi|296212534|ref|XP_002752871.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
[Callithrix jacchus]
Length = 578
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 157/372 (42%), Positives = 209/372 (56%), Gaps = 25/372 (6%)
Query: 12 NLEPPLEPYKEGP-------GEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
N E +P E P GE GKA L E + + Y +N+ S+ IS R
Sbjct: 55 NTEDLSQPLYEKPPADSHALGEWGKASKLRLNEGELKQQEELIERYAINIYLSDRISLHR 114
Query: 63 TIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVD
Sbjct: 115 HIEDKRMYECKSKKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVD 174
Query: 122 DFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 181
D S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE
Sbjct: 175 DLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECN 233
Query: 182 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELP 240
WL PLL I D + PVID ID+ T+EF EP G F+W + ++ + +P
Sbjct: 234 SGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVP 290
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
+ E +R +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG
Sbjct: 291 KHERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 350
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
+E PCS +GHV+ PY P N R E W DE +K +FY R P
Sbjct: 351 KLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPP 400
Query: 361 AMFLDMGDISEQ 372
A GDISE+
Sbjct: 401 ARKEAYGDISER 412
>gi|296224175|ref|XP_002757934.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Callithrix jacchus]
Length = 552
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 141/335 (42%), Positives = 195/335 (58%), Gaps = 17/335 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP + EIILVDDFS+ D Q+L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIREIILVDDFSNDPDDCQQLIKL-----PKVKCLRNNERQGLVRSR 183
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + +R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
KR E W DE +K Y+Y P A+ G++ +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNVESR 388
>gi|444509912|gb|ELV09433.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Tupaia chinensis]
Length = 566
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 145/346 (41%), Positives = 201/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y +DLP SVI+
Sbjct: 79 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSMSYSVDLPATSVII 137
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 138 TFHNEARSTLLRTVRSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 192
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA + V+ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 193 NDRREGLIRSRVRGADVAAAAVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVIS 252
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D+
Sbjct: 253 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLDQKMTRTDPTRPIRTPVIAGGIFVIDK 309
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 310 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 364
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 365 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 408
>gi|397513813|ref|XP_003827202.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
1 [Pan paniscus]
Length = 552
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 141/335 (42%), Positives = 195/335 (58%), Gaps = 17/335 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S+I RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVINRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 183
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + +R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
KR E W DE +K Y+Y P A+ G++ +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNVESR 388
>gi|363730187|ref|XP_418741.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 2 [Gallus gallus]
Length = 638
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 142/366 (38%), Positives = 206/366 (56%), Gaps = 23/366 (6%)
Query: 5 KADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLG--EYGMNMETSNHISFDR 62
+ + K G+ E L G G G A G+ LG +G N S I R
Sbjct: 123 RPEAKEGDPESQLLSLPLGDGNGA----------ATGERPLGLETHGFNEALSERIPLRR 172
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
+P++R C +Y LP ASVI+ FH+E +S+L+RTVHSI+ P L++IILVDD
Sbjct: 173 ELPEVRHPLCLQQEYDSSLPTASVIICFHDEAWSTLLRTVHSILNTAPKASLKDIILVDD 232
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L L +YI + +G V+LIR+ R G+IR R GA + G+V+VF+D+HCE
Sbjct: 233 LSQQGPLKSALSEYISKLDG-VKLIRSNRRLGVIRGRMLGAARATGDVLVFMDSHCECQK 291
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLLA + S+R + P+ID ID++T+++ Y +RG+F+W + + +PE
Sbjct: 292 GWLEPLLARLSSNRNSVVSPIIDVIDWKTFQY---YHSVSLHRGVFDWKLDFHWEPVPEH 348
Query: 243 EAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSI 302
E K R+ + P +SP AG + AMDR +F +G YD + +WG EN ELS + W+CGGS+
Sbjct: 349 EEKVRQSPTSPIRSPAVAGAVVAMDRHYFQNIGAYDSDMTMWGAENLELSIRTWLCGGSV 408
Query: 303 EWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM 362
E +PCSR+GHVYR +P+ F I N R+ ETW D K FY + +A
Sbjct: 409 EIIPCSRVGHVYRHHIPHAFS------YEEAIVRNKIRIAETWLD-SFKENFYKNDTVAF 461
Query: 363 FLDMGD 368
+ +
Sbjct: 462 LISKAE 467
>gi|269115411|gb|ACZ26277.1| N-acetyl galactosaminyl transferase-like protein [Mayetiola
destructor]
Length = 638
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 157/379 (41%), Positives = 219/379 (57%), Gaps = 37/379 (9%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E + G GE G+ + + A G N S++IS +R++ D+R ++C Y
Sbjct: 88 EKQRTGIGEHGEPAFVADNEEAERKRLFDLNGFNALLSDYISINRSVKDIRHKDCAKIKY 147
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
+LP SV++ F NE FS+L+RTV+S++ R+PA+ + EIILVDD S++ ++ + L++YI
Sbjct: 148 LSELPSVSVVVPFFNEHFSTLLRTVYSVLNRSPAELIMEIILVDDASNRDNVKKPLDNYI 207
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA------- 190
+ KV+LIR ER GLI R GA+ ++G+V++FLD+H E NWLPPLL
Sbjct: 208 AKHLPKVKLIRLPERSGLILARLAGARAAKGDVLIFLDSHTEPNTNWLPPLLGKNEQNEI 267
Query: 191 --------------PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 236
PI + K+ P ID I Y T+E+R+ D RG F+W YK
Sbjct: 268 ILFSENKNKKTQTEPIAENYKVCMCPFIDVISYDTFEYRA---QDEGARGAFDWQFYYKR 324
Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
L E + K+ + P+KSP AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIW
Sbjct: 325 LPLLEDDL---KHPTRPFKSPVMAGGLFAISAKFFWELGGYDDGLDIWGGEQYELSFKIW 381
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV--KGPLITYNYKRVIETWFDEKHKAYF 354
CGG + PCSR+GH+YR G +A KG + NYKRV E W DE +K Y
Sbjct: 382 QCGGEMYDAPCSRVGHIYRG------GGIAQPTGRKGDFLHKNYKRVAEVWMDE-YKEYL 434
Query: 355 YTREPLAM-FLDMGDISEQ 372
Y REP +D GD+++Q
Sbjct: 435 YKREPERYEAIDAGDLTKQ 453
>gi|441661684|ref|XP_004091530.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Nomascus leucogenys]
Length = 535
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 140/332 (42%), Positives = 194/332 (58%), Gaps = 17/332 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 52 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 111
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 112 RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLVKL-----PKVKCLRNNERQGLVRSR 166
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 167 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 223
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + +R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 224 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 283
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N
Sbjct: 284 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 337
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
KR E W DE +K Y+Y P A+ G++
Sbjct: 338 TKRTAEVWMDE-YKQYYYAARPFALERPFGNV 368
>gi|195115611|ref|XP_002002350.1| GI13183 [Drosophila mojavensis]
gi|193912925|gb|EDW11792.1| GI13183 [Drosophila mojavensis]
Length = 655
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 149/358 (41%), Positives = 209/358 (58%), Gaps = 24/358 (6%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ + + + G N S+ IS +R++PD+R E CK Y LP
Sbjct: 130 GLGEHGQPASVDPSEKELEQQEYRRNGFNGYLSDRISVNRSVPDVRKEACKTRKYLAKLP 189
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFN 141
SVI +F+NE F +L+R+++SI+ RTP + L++I+LVDD S L + L+DY+ ++
Sbjct: 190 NVSVIFIFYNEHFQTLLRSIYSIVNRTPPELLKQIVLVDDGSEWDTLKKHLDDYVALQWP 249
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
V ++ N ER GLI R GAK + GEV+VF D+H EV NWLPPLL PI + KI T
Sbjct: 250 KLVDVVHNPERRGLIGARLAGAKVATGEVMVFFDSHIEVNYNWLPPLLEPIVINNKIATC 309
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHA 260
P++D ID+ + + Y+ RG F+W YK+ LPE K S PY+SP
Sbjct: 310 PIVDIIDHNNFAYNGGYQ--EGSRGGFDWRFFYKQLPVLPEDSVDK----SLPYRSPVMM 363
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM-- 318
GGLFA++ +F +LGGYD L +WGGE +ELSFKIWMCGG + VPCSR+ H++R M
Sbjct: 364 GGLFAINSKWFWDLGGYDDELEIWGGEQYELSFKIWMCGGMLLDVPCSRVAHIFRGQMDP 423
Query: 319 ---PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMGDISEQ 372
P N+ + N+KRV E W DE +K + Y R+P +D GD++ Q
Sbjct: 424 RPNPRNYN---------FVARNHKRVAEVWMDE-YKEFVYKRDPATYNNIDAGDLTRQ 471
>gi|7657112|ref|NP_056552.1| polypeptide N-acetylgalactosaminyltransferase 4 [Mus musculus]
gi|51315802|sp|O08832.1|GALT4_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
AltName: Full=Polypeptide GalNAc transferase 4;
Short=GalNAc-T4; Short=pp-GaNTase 4; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 4;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 4
gi|2121220|gb|AAB58301.1| polypeptide GalNAc transferase-T4 [Mus musculus]
gi|26329157|dbj|BAC28317.1| unnamed protein product [Mus musculus]
gi|34786032|gb|AAH57882.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 [Mus musculus]
gi|74140684|dbj|BAE31844.1| unnamed protein product [Mus musculus]
gi|74195122|dbj|BAE28303.1| unnamed protein product [Mus musculus]
gi|148689697|gb|EDL21644.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 [Mus musculus]
Length = 578
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 149/362 (41%), Positives = 204/362 (56%), Gaps = 16/362 (4%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
++PP + + G L E + + Y +N+ S+ IS R I D RM EC
Sbjct: 65 IKPPADSHALGEWGRASKLQLNEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYEC 124
Query: 73 KYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
K + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L
Sbjct: 125 KAKKFHYRSLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRIYLKA 184
Query: 132 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 191
+LE YI +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL
Sbjct: 185 QLETYISNLE-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLER 243
Query: 192 IYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 244 ISRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRTSR 300
Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS +
Sbjct: 301 IDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHV 360
Query: 311 GHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
GHV+ PY P N R E W DE +K +FY R P A GD+S
Sbjct: 361 GHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDLS 410
Query: 371 EQ 372
E+
Sbjct: 411 ER 412
>gi|345781283|ref|XP_853759.2| PREDICTED: LOW QUALITY PROTEIN:
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [Canis lupus
familiaris]
Length = 559
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 207/351 (58%), Gaps = 14/351 (3%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
E G+ GK ++ G L +YG N S + D +PD R + C YP L
Sbjct: 91 ETAGKLGKDFNYSNPEFIDG---LLKYGFNTILSKSLGSDSKVPDTRNKMCLQKRYPAKL 147
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P ASVI+ FHNE F++L RT+ S+ TP LEEIILVDD S DL +KL+ +++ F
Sbjct: 148 PTASVIICFHNEEFNALFRTLSSVGNLTPHYILEEIILVDDMSDFDDLKEKLDHHLEIFR 207
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
GK+++IRN +REGL+R+R GA + G+V+VFLD+HCEV WL PLL I D K++
Sbjct: 208 GKIKVIRNKKREGLVRSRLIGASRASGDVLVFLDSHCEVNTAWLQPLLHAIAKDSKMVVC 267
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
P+ID ID T E Y+ RG F W + +K + + E + + P +SP AG
Sbjct: 268 PLIDVIDSMTLE----YQSSPVVRGAFNWHLDFKWDSVYSYEMDGPEGPTRPIRSPAMAG 323
Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
G+FA++R +F E+G YD G+ +WG EN ELS +IWMCGG + +PCSR+GH+ +
Sbjct: 324 GIFAINRHYFNEIGQYDKGMDLWGAENLELSLRIWMCGGQLFIIPCSRVGHISKQ----R 379
Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
F + VK +TYN R++ W DE +K F+ ++P + G+ISE+
Sbjct: 380 FSNQPELVKA--MTYNNLRLVHVWLDE-YKEQFFLQQPGLKSVAYGNISER 427
>gi|395849607|ref|XP_003797413.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Otolemur garnettii]
Length = 558
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 145/346 (41%), Positives = 201/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y LDLP SVI+
Sbjct: 71 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSLDLPATSVII 129
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA + ++ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADVATAAILTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVIS 244
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400
>gi|221042368|dbj|BAH12861.1| unnamed protein product [Homo sapiens]
Length = 517
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 141/332 (42%), Positives = 194/332 (58%), Gaps = 17/332 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R IPD R C Y DLP S+I+ FHNE S+L+
Sbjct: 34 VGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 93
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 94 RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 148
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 149 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 205
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + +R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 206 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 265
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N
Sbjct: 266 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 319
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
KR E W DE +K Y+Y P A+ G++
Sbjct: 320 TKRTAEVWMDE-YKQYYYAARPFALERPFGNV 350
>gi|297265738|ref|XP_001104879.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Macaca mulatta]
Length = 532
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 146/366 (39%), Positives = 206/366 (56%), Gaps = 17/366 (4%)
Query: 6 ADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
AD L + +P + + + + +L GD Y N S IS +R IP
Sbjct: 15 ADSGLSSSQPSDADWDDLWDQFDERRYLNAKKWRVGDDPYKLYAFNQRESERISSNRAIP 74
Query: 66 DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
D R C Y DLP S+I+ FHNE S+L+RT+ S++ RTP + EIILVDDFS+
Sbjct: 75 DTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIREIILVDDFSN 134
Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
D ++L KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL
Sbjct: 135 DPDDCKQLIRL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWL 189
Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
PLL + D + PVID I+ T+ + E RG F+W + ++ +L +
Sbjct: 190 QPLLHRVKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKA 246
Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
+R +EP ++P AGGLF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E V
Sbjct: 247 RRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIV 306
Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMF 363
PCSR+GHV+R PY F G TY N KR E W DE +K Y+Y P A+
Sbjct: 307 PCSRVGHVFRKKHPYVFP------DGNANTYIKNTKRTAEVWMDE-YKQYYYAARPFALE 359
Query: 364 LDMGDI 369
G++
Sbjct: 360 RPFGNV 365
>gi|432096894|gb|ELK27469.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Myotis davidii]
Length = 940
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 148/359 (41%), Positives = 216/359 (60%), Gaps = 12/359 (3%)
Query: 16 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
P +P + PG+ G+ +P+ + E N+ S+ I DR I D R C
Sbjct: 432 PRDP--KAPGQFGRPVLVPQGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPVGCAKQ 489
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
+LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDK 549
Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y
Sbjct: 550 YMSQF-PKVRILHLKERHGLIRARLAGAQIATGDVLTFLDSHVECNIGWLEPLLERVYLS 608
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R ++
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDVI 665
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
+ P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 RCPVMAGGLFSIDKNYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE-KHKAYFYTREPLAMFLDMGDISEQ 372
R+ PY+F K DR+K + N RV E W DE K Y + + LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMKT--VERNLVRVAEVWLDEYKELFYGHGNHLIDQGLDVGNLTQQ 780
>gi|440907821|gb|ELR57918.1| Polypeptide N-acetylgalactosaminyltransferase 14, partial [Bos
grunniens mutus]
Length = 509
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 142/333 (42%), Positives = 194/333 (58%), Gaps = 19/333 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S I+ +R +PD R+ C Y DLP S+I+ FHNE S+L+
Sbjct: 26 VGDDPYKLYAFNQRESERIASNRVVPDTRLFRCTLLVYCADLPPTSIIIAFHNEARSTLL 85
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
RT+ SI+ RTP ++EIILVDDFS+ ED Q KV+ +RN ER+GL+R+
Sbjct: 86 RTIRSILNRTPMNLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNNERQGLVRS 139
Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
R RGA ++G + FLD+HCEV +WL PLL + D + PVID I T+ +
Sbjct: 140 RIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFNY---I 196
Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
E RG F+W + ++ +L + +R +EP ++P AGGLF MD+++F LG YD
Sbjct: 197 ESASELRGGFDWSLHFQWEQLTPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFYYLGKYD 256
Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY
Sbjct: 257 TDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYIFP------DGNANTYIK 310
Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
N KR E W DE +K Y+Y P A+ G+I
Sbjct: 311 NTKRTAEVWMDE-YKQYYYASRPFALERPFGNI 342
>gi|344288741|ref|XP_003416105.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Loxodonta africana]
Length = 552
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 142/332 (42%), Positives = 196/332 (59%), Gaps = 17/332 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCNLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP ++EIILVDDFSS D D KL + KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSSDPD-DCKLLIKL----PKVKCVRNNERQGLVRSR 183
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
+GA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 IQGAGIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---IE 240
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + +R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDS 300
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGGS+E +PCSR+GHV+R PY F G TY N
Sbjct: 301 EMDIWGGENFEMSFRVWMCGGSLEIIPCSRVGHVFRKKHPYIFP------DGNTNTYIKN 354
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
KR E W DE +K Y+Y P A+ G+I
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNI 385
>gi|68342011|ref|NP_001020319.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [Rattus
norvegicus]
gi|50926898|gb|AAH78995.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [Rattus
norvegicus]
Length = 443
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 137/328 (41%), Positives = 203/328 (61%), Gaps = 11/328 (3%)
Query: 45 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
L YG+N+ TS + +R +PD R + C+ YP +LP ASVI+ F+NE F++L+RTV S
Sbjct: 83 LSRYGLNVITSRRLGIERQVPDSRNKICQQKHYPFNLPTASVIICFYNEEFNTLLRTVSS 142
Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
++ +P LEEIILVDD S DL KL+ +++ F GK++L+RN +REGLIR+R GA
Sbjct: 143 VMNLSPKHLLEEIILVDDMSEFDDLKAKLDYHLEIFRGKIKLVRNKKREGLIRSRMIGAS 202
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+ G+++VFLD+HCEV WL PLL I D K++ PVID ID T ++ V P
Sbjct: 203 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPVIDVIDELTLDY--VGSP--IV 258
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RG F+W + ++ +++ E + S P +SP +GG+FA++R +F ELG YD + +W
Sbjct: 259 RGAFDWNLNFRWDDVFSYELDGPEGPSTPIRSPAMSGGIFAINRHYFNELGQYDKDMDLW 318
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
GGEN ELS +IWMCGG + +PCSR+GH ++ V ++ N RV+
Sbjct: 319 GGENVELSLRIWMCGGQLFILPCSRVGHNNKALSKNRL------VNQSALSKNLLRVVHV 372
Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
W DE +K F+ + P + G+IS++
Sbjct: 373 WLDE-YKENFFLQRPSLTHVSCGNISDR 399
>gi|60498976|ref|NP_078848.2| polypeptide N-acetylgalactosaminyltransferase 14 isoform 1 [Homo
sapiens]
gi|51316071|sp|Q96FL9.1|GLT14_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 14;
AltName: Full=Polypeptide GalNAc transferase 14;
Short=GalNAc-T14; Short=pp-GaNTase 14; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 14;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 14
gi|14714999|gb|AAH10659.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14) [Homo
sapiens]
gi|21749654|dbj|BAC03634.1| unnamed protein product [Homo sapiens]
gi|28268674|dbj|BAC56889.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 [Homo sapiens]
gi|37182635|gb|AAQ89118.1| RRLT2434 [Homo sapiens]
gi|119620891|gb|EAX00486.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
isoform CRA_a [Homo sapiens]
gi|325463357|gb|ADZ15449.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14)
[synthetic construct]
gi|345500006|emb|CAA70505.4| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 14 [Homo
sapiens]
Length = 552
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 141/335 (42%), Positives = 195/335 (58%), Gaps = 17/335 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R IPD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 183
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + +R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
KR E W DE +K Y+Y P A+ G++ +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNVESR 388
>gi|307173963|gb|EFN64693.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Camponotus
floridanus]
Length = 597
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 149/373 (39%), Positives = 218/373 (58%), Gaps = 16/373 (4%)
Query: 5 KADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 64
K + K+ LE + P G GE GK +L + G+A+L + +N+ SN IS R +
Sbjct: 69 KYEDKILKLEYNVVP---GLGENGKPAYLYGKDKFQGEAALKKKALNVILSNKISLTRKL 125
Query: 65 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
PD+R C Y LP ASV+++F+NE +S L+RTVHS++K +P L+EIILVDD S
Sbjct: 126 PDIRNSLCMNITYDKLLPSASVVIIFYNEPWSVLLRTVHSVLKGSPPHLLKEIILVDDHS 185
Query: 125 SKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLN 183
+ +L +L+ Y+ R KV+L+R + R+GLIR R GA+ ++G+V+VFLDAHCEV +
Sbjct: 186 EEEELQGQLDYYLSTRLPAKVKLLRLSHRQGLIRARLHGARNAKGDVLVFLDAHCEVIKD 245
Query: 184 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPERE 243
WL PLL I ++ + +P+ID I +T E+ E G F W + + + E
Sbjct: 246 WLQPLLQRIKDNKNAVLMPIIDNISEETLEYFHDNEASFFQVGGFTWSGHFTWINIQKHE 305
Query: 244 AKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIE 303
+ R P +SPT AGGLFA++R +F E+G YD + WGGEN E+SF+IW CGG++E
Sbjct: 306 VESRPSPISPTRSPTMAGGLFAINRKYFWEIGSYDDKMDGWGGENLEMSFRIWQCGGTLE 365
Query: 304 WVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMF 363
+PCSR+GH++R+F PY F D N R+ W D + + R + F
Sbjct: 366 IIPCSRVGHIFRNFHPYKFPNDKDTH-----GINTARLAFVWMDGYKRLFLLHR---SEF 417
Query: 364 LD----MGDISEQ 372
D GD+SE+
Sbjct: 418 KDNPKLFGDVSER 430
>gi|443720284|gb|ELU10082.1| hypothetical protein CAPTEDRAFT_93071, partial [Capitella teleta]
Length = 518
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 143/353 (40%), Positives = 212/353 (60%), Gaps = 14/353 (3%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYG-----MNMETSNHISFDRTIPDLRMEECKYWDYPL 79
GE GK ++ ++ + + E G N S+ +S RT+PD+R +EC+ +Y
Sbjct: 2 GENGKGLNIDKSKLSPEELKKYEKGYQRNAFNQYASDQMSLHRTLPDVRDKECRDRNYAT 61
Query: 80 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
+LP S+I++FHNE +S L+RTV S + R+P ++EIILVDDFS L L+++
Sbjct: 62 ELPDTSIIVIFHNEAWSVLLRTVFSCLDRSPGHLVKEIILVDDFSDFEHLQAPLQEFADS 121
Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
KVRL+R +REGLIR R GA ++G V+ FLD+HCE + WL PLL I ++ +
Sbjct: 122 -QEKVRLVRAKKREGLIRARLLGASVAQGNVLTFLDSHCECTMGWLEPLLDRISQNKSNV 180
Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTH 259
PVID I+ T +++ G F+W + + + +P+ E K+RK + +P +SPT
Sbjct: 181 VTPVIDVINDDTIQYQYSSAKSTSVGG-FDWNLQFNWHGIPDHEKKRRKSDVDPVRSPTM 239
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLF++ R +F LG YDPG+ +WGGEN ELSF+IWMCGGS++ PCS +GH++R P
Sbjct: 240 AGGLFSISREYFEYLGTYDPGMDIWGGENLELSFRIWMCGGSLDIAPCSHVGHIFRKRSP 299
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
Y++ + VK N R+ E W DE K Y+Y R + D GD+S +
Sbjct: 300 YSWKTGVNVVKK-----NSIRLAEVWLDEFSK-YYYERFNYDLG-DYGDVSAR 345
>gi|148670721|gb|EDL02668.1| mCG7620, isoform CRA_b [Mus musculus]
Length = 667
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 144/345 (41%), Positives = 202/345 (58%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY +A GE + N S+ +S DR I D R C Y DLP SVI+
Sbjct: 180 KAYLSAKQLKPGEDPYRQHAFNQLESDKLSSDRPIRDTRHYSCPSLSYSSDLPATSVIIT 239
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 240 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 294
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
+REGLIR+R RGA + V+ FLD+HCEV + WL P+L + D + P+ID I
Sbjct: 295 DKREGLIRSRVRGADVAGATVLTFLDSHCEVNVEWLQPMLQRVMEDHTRVVSPIIDVISL 354
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D++
Sbjct: 355 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKS 411
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 412 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 465
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 466 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 509
>gi|332227139|ref|XP_003262748.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
1 [Nomascus leucogenys]
Length = 552
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 140/335 (41%), Positives = 195/335 (58%), Gaps = 17/335 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLVKL-----PKVKCLRNNERQGLVRSR 183
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + +R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
KR E W DE +K Y+Y P A+ G++ +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNVESR 388
>gi|50510795|dbj|BAD32383.1| mKIAA1130 protein [Mus musculus]
Length = 655
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 144/343 (41%), Positives = 201/343 (58%), Gaps = 22/343 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY +A GE + N S+ +S DR I D R C Y DLP SVI+
Sbjct: 168 KAYLSAKQLKPGEDPYRQHAFNQLESDKLSSDRPIRDTRHYSCPSLSYSSDLPATSVIIT 227
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 228 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 282
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
+REGLIR+R RGA + V+ FLD+HCEV + WL P+L + D + P+ID I
Sbjct: 283 DKREGLIRSRVRGADVAGATVLTFLDSHCEVNVEWLQPMLQRVMEDHTRVVSPIIDVISL 342
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D++
Sbjct: 343 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKS 399
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 400 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 453
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
+G +TY N KR E W DE +K Y+Y P A+ G ++
Sbjct: 454 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVA 495
>gi|432107114|gb|ELK32537.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Myotis davidii]
Length = 518
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 145/345 (42%), Positives = 200/345 (57%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY AA GE + N S+ ++ DR I D R C Y DLP SVI+
Sbjct: 28 KAYLAAKQLKPGEDPYRQHAFNQLESDKLTSDRPIRDTRHYSCPSLSYSSDLPATSVIIT 87
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 88 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 142
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
REGLIR+R RGA + V+ FLD+HCEV WL PLL + D + P+ID I
Sbjct: 143 DRREGLIRSRVRGADVATAAVLTFLDSHCEVNTEWLQPLLQRVQEDHTRVVSPIIDVISL 202
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D++
Sbjct: 203 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 259
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 260 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 313
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 314 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVASR 357
>gi|285026454|ref|NP_001165534.1| polypeptide N-acetylgalactosaminyltransferase 6 [Rattus norvegicus]
Length = 622
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 152/370 (41%), Positives = 216/370 (58%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+ E D ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NSPGADGKAFQKKEWTLLETQEKDEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P LP SV++VFHNE +S+L+RTV+S++ +PA L+EIILVDD S+
Sbjct: 164 ECVDQKFRRCP-PLPTTSVVIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASTDE 222
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L +KLE Y+Q+ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLERYVQQLQ-IVRVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D+ + P I ID T++F + + H RG F+W + + LPE E ++
Sbjct: 282 LLARIAEDKTAVVSPDIVTIDLNTFQFSKPMRRGKAHSRGNFDWSLTFGWEMLPEHEKQR 341
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +A+F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL-- 364
CS +GHV+R+ P+ F K +I N R+ E W D+ +K FY R A +
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDD-YKKIFYRRNLQAAKMAK 455
Query: 365 --DMGDISEQ 372
+ GD+SE+
Sbjct: 456 ENNFGDVSER 465
>gi|340378190|ref|XP_003387611.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Amphimedon queenslandica]
Length = 512
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 140/326 (42%), Positives = 190/326 (58%), Gaps = 17/326 (5%)
Query: 49 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
N E S+ S DR +PD R C Y LP SVI+ FHNE S+L+RT+ S++ R
Sbjct: 54 AFNQEASDKTSIDRKVPDTRHSWCYNQVYHPTLPSTSVIITFHNEARSTLLRTIVSVLNR 113
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
+P +EEIILVDDFS + L K++LIRN REGL+R+R GA ++G
Sbjct: 114 SPPHLIEEIILVDDFSEDVNTGLLLTQM-----PKIKLIRNERREGLVRSRIFGADAAKG 168
Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
E++ FLD+HCE + WL PLL + DR I+ P+ID I T+++ RG F
Sbjct: 169 EILTFLDSHCECNIGWLEPLLHRVSQDRTIVVSPIIDVISMDTFDYIGASS---ELRGGF 225
Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
+W + +K + + KRK EP K+P AGGLF+++R F+E G YD + +WGGEN
Sbjct: 226 DWSLHFKWDGFTPAQRAKRKSPIEPIKTPMIAGGLFSINRQRFIETGKYDDQMDIWGGEN 285
Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIETWF 346
FE+SF+ WMCGGS+E +PCSR+GHV+R PY F G +TY N KR E W
Sbjct: 286 FEISFRTWMCGGSLEIIPCSRVGHVFRKRHPYVFP------GGNAMTYMKNTKRAAEVWM 339
Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
D +K Y+Y+ P A DMG I +
Sbjct: 340 DN-YKDYYYSARPSAKGRDMGSIKSR 364
>gi|351709330|gb|EHB12249.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Heterocephalus
glaber]
Length = 582
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 149/361 (41%), Positives = 204/361 (56%), Gaps = 16/361 (4%)
Query: 14 EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
+PP + + G L E + + Y +N+ S+ IS R I D RM ECK
Sbjct: 70 KPPADSHALGEWGRASKLELGEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMSECK 129
Query: 74 YWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
Y LP SV++ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L +
Sbjct: 130 SKTYDYRRLPTTSVVIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKAQ 189
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE YI +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL I
Sbjct: 190 LETYISSLE-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERI 248
Query: 193 YSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 251
D + PVID ID+ T+EF EP G F+W + ++ + +P++E +R
Sbjct: 249 GRDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKQERDRRTSRI 305
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS +G
Sbjct: 306 DPIRSPTMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVG 365
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+ PY P N R E W D+ +K +FY R P A GDISE
Sbjct: 366 HVFPKRAPY---------ARPNFLQNTARAAEVWMDD-YKEHFYNRNPPARKEAYGDISE 415
Query: 372 Q 372
+
Sbjct: 416 R 416
>gi|345803601|ref|XP_537492.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Canis lupus
familiaris]
Length = 557
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 145/345 (42%), Positives = 200/345 (57%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY AA GE + N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAYLAAKQLKAGEDPYRQHAFNQLESDKLSPDRAIRDTRHYSCPSVSYSADLPATSVIIT 130
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
REGLIR+R RGA + V+ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 186 DRREGLIRSRVRGADVATAAVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 245
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 302
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400
>gi|327279823|ref|XP_003224655.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Anolis carolinensis]
Length = 941
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 152/373 (40%), Positives = 218/373 (58%), Gaps = 18/373 (4%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
VF D G +P G+ G+ +P + E N+ S+ I DR
Sbjct: 424 VFSIDKTFGPRDP------NAAGQFGRPAVVPNEKQEEAKRRWNEGNFNVYLSDMIPIDR 477
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
I D R C DLP S+I+ F +E +S+L+R+VHS++ R+P Q ++EIILVDD
Sbjct: 478 AIDDTRPIGCSDILVHNDLPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPQLIKEIILVDD 537
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
FS+K L KL+ Y+ +F KVR++ ER GLIR R GA+ ++G+V+ FLD+H E +
Sbjct: 538 FSTKEYLKDKLDKYMAQF-PKVRILHLKERYGLIRARLAGAEIAKGDVLTFLDSHVECNV 596
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
WL PLL I+ +RK + PVI+ I + + +V D+ RGIF W M + +P
Sbjct: 597 GWLEPLLERIHLNRKKVPCPVIEVISDKDMSYMTV---DNFQRGIFNWPMNFGWKPIPPD 653
Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
+K K ++ + P AGGLF++D+ +F ELG YDPGL VWGGEN E+SFK+WMCGG
Sbjct: 654 VIEKNKIKETDVIRCPVMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKVWMCGGE 713
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EP 359
IE +PCSR+GH++RS PY+F K DR+ + N RV E W D+ +K FY
Sbjct: 714 IEIIPCSRVGHIFRSDNPYSFPK--DRLT--TVERNLARVAEVWLDD-YKDLFYGHGYHL 768
Query: 360 LAMFLDMGDISEQ 372
+ LD+GD+++Q
Sbjct: 769 VQKNLDVGDLTQQ 781
>gi|426223372|ref|XP_004005849.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Ovis
aries]
Length = 552
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 141/336 (41%), Positives = 195/336 (58%), Gaps = 19/336 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S I+ +R +PD R+ C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERIASNRVVPDTRLFRCTLLVYCADLPPTSIIIAFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
RT+ SI+ RTP ++EIILVDDFS+ ED Q KV+ +RN ER+GL+R+
Sbjct: 129 RTIRSILNRTPMNLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNNERQGLVRS 182
Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
R RGA ++G + FLD+HCEV +WL PLL + D + PVID I T+ +
Sbjct: 183 RIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFNY---I 239
Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
E RG F+W + ++ +L + +R +EP ++P AGGLF MD+++F LG YD
Sbjct: 240 ESASELRGGFDWSLHFQWEQLTPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFYYLGKYD 299
Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
+ +WGGENFE+SF++WMCGGS+E +PCSR+GHV+R PY F G TY
Sbjct: 300 TDMDIWGGENFEISFRVWMCGGSLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIK 353
Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
N KR E W DE +K Y+Y P A+ G+I +
Sbjct: 354 NTKRTAEVWMDE-YKQYYYASRPFALERPFGNIESR 388
>gi|427779849|gb|JAA55376.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 683
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 163/415 (39%), Positives = 221/415 (53%), Gaps = 63/415 (15%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYR---AAGDASLGEYGMNMETSNHIS 59
V A +G L PP P +GPGE G+ L + + A N S+ IS
Sbjct: 120 VDHAPAPVGVLAPPQNP--DGPGEMGRPVVLKDLTKEQEAKVKQGWDRNAFNQYISDMIS 177
Query: 60 FDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
R++PD+R ECK Y DLP SVI+ FHNE +S L+RTVHSII R+P + L EIIL
Sbjct: 178 LHRSLPDVRDSECKDERYLKDLPSTSVIVCFHNEAWSVLLRTVHSIIDRSPPKLLHEIIL 237
Query: 120 VDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIR---------------------- 157
VDD+S L QKLEDY+ F KV+++R +REGLIR
Sbjct: 238 VDDYSDMPHLKQKLEDYVAHFP-KVKIVRAQKREGLIRARLLGAAAATAPVLTYLDSHCE 296
Query: 158 -------------TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 204
R+ + V+ +LD+HCE WL PLL I + + PVI
Sbjct: 297 CTEGWLEPLLDRIARNSTTVXATAPVLTYLDSHCECTEGWLEPLLDRIARNSTTVVCPVI 356
Query: 205 DGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
D I T+E+ HYR G F+W + + + +PERE ++RK++ +P SP
Sbjct: 357 DVISDSTFEY--------HYRDSGGVNVGGFDWNLQFSWHAVPERERQRRKHSWDPVWSP 408
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
T AGGLF++D+AFF +LG YD G +WGGEN ELSFK WMCGG++E VPCS +GH++R
Sbjct: 409 TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKR 468
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
PY + R ++ N R+ E W DE +K Y+Y R + D GD+S +
Sbjct: 469 SPYKW-----RSGVNVLRRNSVRLAEVWLDE-YKQYYYQRIGDDLG-DFGDVSAR 516
>gi|426335181|ref|XP_004029111.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
3 [Gorilla gorilla gorilla]
Length = 517
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 140/332 (42%), Positives = 194/332 (58%), Gaps = 17/332 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 34 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 93
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 94 RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 148
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 149 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 205
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + +R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 206 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 265
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N
Sbjct: 266 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 319
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
KR E W DE +K Y+Y P A+ G++
Sbjct: 320 TKRTAEVWMDE-YKQYYYAARPFALERPFGNV 350
>gi|291410883|ref|XP_002721722.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 1,
partial [Oryctolagus cuniculus]
Length = 499
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 145/345 (42%), Positives = 200/345 (57%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY +A GE + N S+ +S DR I D R C Y LDLP SVI+
Sbjct: 12 KAYLSAKQLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSMSYSLDLPATSVIIT 71
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 72 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 126
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
REGLIR+R RGA + ++ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 127 DRREGLIRSRVRGADVAAAAILTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 186
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D+A
Sbjct: 187 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKITRTDPTRPIRTPVIAGGIFVIDKA 243
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 244 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 297
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 298 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 341
>gi|21464370|gb|AAM51988.1| RE10344p [Drosophila melanogaster]
Length = 650
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 148/355 (41%), Positives = 204/355 (57%), Gaps = 27/355 (7%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
++PP ++E PGE GK LP + + A D + N S+ IS RT+PD R
Sbjct: 136 IDPPAN-FEEDPGELGKPVRLPKEMSDEMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 194
Query: 69 MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
CK Y +LPK VI+ FHNE ++ L+RTVHS++ R+P + +IILVDD+S
Sbjct: 195 DAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 254
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL P
Sbjct: 255 HLKRQLEDYFAAY-PKVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 313
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
LL I + + PVID I +T E+ HYR G F+W + + + +P
Sbjct: 314 LLDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 365
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
ERE K+ +EP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG
Sbjct: 366 ERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGG 425
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
++E VPCS +GH++R PY + R + N R+ E W DE + Y++
Sbjct: 426 TLEIVPCSHVGHIFRKRSPYKW-----RSGVNVPKKNSVRLAEVWMDEYSQCYYH 475
>gi|195425498|ref|XP_002061038.1| GK10725 [Drosophila willistoni]
gi|194157123|gb|EDW72024.1| GK10725 [Drosophila willistoni]
Length = 644
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 150/357 (42%), Positives = 207/357 (57%), Gaps = 28/357 (7%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
L PP E +E PGE GK LP + + A + + N S+ IS RT+PD R
Sbjct: 130 LLPPSE-LEETPGEMGKPVKLPKDMPDDMKKAVEDGWTKNAFNQYASDLISVHRTLPDPR 188
Query: 69 MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
CK Y DLPK VI+ FHNE +S L+RTVHS++ R+P + ++ILVDD+S
Sbjct: 189 DAWCKDTARYLTDLPKTDVIICFHNEAWSVLLRTVHSVLDRSPEHLIGKVILVDDYSDMP 248
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L ++LEDY + KV+++R +REGLIR R GA+ ++ V+ +LD+HCE WL P
Sbjct: 249 HLKKQLEDYFTAYP-KVQIVRGAKREGLIRARILGAQYAKSPVLTYLDSHCECTEGWLEP 307
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
LL I + + PVID I+ T E+ HYR G F+W + + + +P
Sbjct: 308 LLDRIARNSTTVVCPVIDVINDDTLEY--------HYRDSTGVNVGGFDWNLQFSWHAVP 359
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
ERE K+ ++EP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG
Sbjct: 360 EREKKRHNSSAEPVYSPTMAGGLFSIDRDFFERLGTYDSGFDIWGGENLELSFKTWMCGG 419
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
++E VPCS +GH++R PY + R ++ N R+ E W D+ + Y+Y R
Sbjct: 420 TLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLRKNSVRLAEVWMDD-YAQYYYHR 470
>gi|449276238|gb|EMC84873.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Columba livia]
Length = 522
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 151/361 (41%), Positives = 210/361 (58%), Gaps = 20/361 (5%)
Query: 16 PLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
P +PY PGE GK L + + + +Y +N+ S+ IS R I D R+ CK
Sbjct: 12 PPDPY--SPGEWGKPSRLQLSSEEKKQEEELIEKYAINIYLSDKISLHRHIEDNRLSGCK 69
Query: 74 YWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
Y LP SVI+ F+NE +S+L+RT+HS+++ +P+ L+EIILVDD S K L
Sbjct: 70 AKSYNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPSVLLKEIILVDDLSDKVYLKTD 129
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE YI +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL I
Sbjct: 130 LEKYISSLK-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECVSGWLEPLLERI 188
Query: 193 YSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 251
+ ++ PVID ID++T+E+ EP G F+W + ++ + +P+ E +RK +
Sbjct: 189 AENETVIVCPVIDTIDWKTFEYYMQTAEP---MIGGFDWRLTFQWHSVPKHERLRRKSET 245
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS +G
Sbjct: 246 DPIRSPTMAGGLFAVSKKYFEYLGTYDTGMDVWGGENLELSFRVWQCGGMLEIHPCSHVG 305
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+ PY P N R E W DE +K +FY R P A + GD+SE
Sbjct: 306 HVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPSARKENYGDLSE 355
Query: 372 Q 372
+
Sbjct: 356 R 356
>gi|432096766|gb|ELK27344.1| Polypeptide N-acetylgalactosaminyltransferase 14, partial [Myotis
davidii]
Length = 507
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 144/332 (43%), Positives = 195/332 (58%), Gaps = 17/332 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD + N S IS +R IPD R C Y DLP S+I+ FHNE S+L+
Sbjct: 24 VGDDPYKLHAFNQRESERISSNRAIPDTRHLRCTLLMYCRDLPPTSIIITFHNEARSTLL 83
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP ++EIILVDDFS+ E+ I+ KV+ +RN +REGL+R+R
Sbjct: 84 RTIRSVLNRTPMNLIKEIILVDDFSNDPG---DCEELIKL--PKVKCLRNDQREGLVRSR 138
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 139 IRGADVAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFSY---IE 195
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + +R SEP ++P AGGLF MD+++F LG YD
Sbjct: 196 SATELRGGFDWSLHFQWEQLSPEQKAQRLDPSEPIRTPIIAGGLFVMDKSWFNFLGKYDM 255
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N
Sbjct: 256 DMDIWGGENFEMSFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 309
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
KR E W DE +K YFY P A+ GDI
Sbjct: 310 TKRTAEVWMDE-YKQYFYAARPFALERPFGDI 340
>gi|426335177|ref|XP_004029109.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
1 [Gorilla gorilla gorilla]
Length = 552
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 140/335 (41%), Positives = 195/335 (58%), Gaps = 17/335 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 183
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + +R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
KR E W DE +K Y+Y P A+ G++ +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNVESR 388
>gi|339242863|ref|XP_003377357.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
spiralis]
gi|316973849|gb|EFV57398.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
spiralis]
Length = 383
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 137/349 (39%), Positives = 206/349 (59%), Gaps = 7/349 (2%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
GE G++ +L + N+ S+ I +RT+ D R C+ Y LP
Sbjct: 2 GELGRSVNLNDNDSKLAKHLFQINQFNIVASDRIPLNRTLIDARRAACRNKTYSSALPTT 61
Query: 85 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 144
SVI+VFHNE +S+L+RTV S+I R+P + L+EIILVDD S +A L + L++++ V
Sbjct: 62 SVIIVFHNEAWSTLLRTVFSVINRSPKKLLKEIILVDDCSQRAFLKKALDNFVLNLPVPV 121
Query: 145 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 204
++R+ ER GLI+ R GA+++ G+V+ FLD+HCE WL PLL I DRKI PVI
Sbjct: 122 LIVRSKERIGLIQARILGAEKASGDVLTFLDSHCECTEGWLEPLLDRIAFDRKIAVAPVI 181
Query: 205 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGL 263
D I+ +T++++ + YRG F W + ++ P E K+R + + P ++PT AGGL
Sbjct: 182 DVINDETFQYQKGIDV---YRGGFNWNLQFRWYSSPPSELKRRGNDVTHPVRTPTIAGGL 238
Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
F++DR FF E+G YD + +WGGEN E+SF+IW CGG +E +PCS +GHV+R P++F
Sbjct: 239 FSIDRQFFFEIGAYDKEMKIWGGENLEMSFRIWQCGGQLEIIPCSHVGHVFRKKSPHDFP 298
Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ +T N RV E W DE ++ ++ D+SE+
Sbjct: 299 RGN---SARTLTTNLVRVAEVWMDEWKSLFYIISSAAKNISEIIDVSER 344
>gi|197099330|ref|NP_001124852.1| polypeptide N-acetylgalactosaminyltransferase 14 [Pongo abelii]
gi|55726129|emb|CAH89838.1| hypothetical protein [Pongo abelii]
Length = 552
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 140/335 (41%), Positives = 195/335 (58%), Gaps = 17/335 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 183
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + +R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGHANTYIKN 354
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
KR E W DE +K Y+Y P A+ G++ +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNVESR 388
>gi|334310655|ref|XP_001378662.2| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Monodelphis domestica]
Length = 563
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 143/346 (41%), Positives = 204/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y DLP S+++
Sbjct: 79 KAY-LASKLLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRCTSVHYASDLPTTSIVI 137
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 138 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 192
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA+ + +++ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 193 NDRREGLIRSRVRGAEVATADILTFLDSHCEVNSEWLQPMLQRVKEDYTRVVSPIIDVIS 252
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D+
Sbjct: 253 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSRTDPTQPIRTPVIAGGIFVIDK 309
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
A+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PY+F
Sbjct: 310 AWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYDFP----- 364
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G I+++
Sbjct: 365 -EGNALTYIKNTKRTAEVWMDE-YKQYYYEARPSAIGKSFGSIADR 408
>gi|194225134|ref|XP_001495036.2| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Equus caballus]
Length = 619
Score = 268 bits (684), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 145/345 (42%), Positives = 201/345 (58%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY AA GE + N S+ +S DR I D R C Y +DLP SVI+
Sbjct: 133 KAYLAAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSVDLPATSVIIT 192
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 193 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 247
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
REGLIR+R RGA + V+ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 248 DRREGLIRSRVRGADVATAAVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 307
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D++
Sbjct: 308 DNFAYLAA---SAILRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 364
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 365 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 418
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 419 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 462
>gi|405959954|gb|EKC25926.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
Length = 569
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 144/357 (40%), Positives = 214/357 (59%), Gaps = 15/357 (4%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYG-----MNMETSNHISFDRTIPDLRMEECKYWD 76
+ PGE G Y ++ + + E G N SN IS R++ D R +EC
Sbjct: 59 KAPGELGSPYIFNKSQLTSKEKLEYETGWKKNNFNEFASNRISLQRSLKDPRDKECHNLT 118
Query: 77 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
Y +LP+ S+I+ FHNE +S L+R+V+SI+ RTP L+E+ILVDDFSS L + L+ +
Sbjct: 119 YSENLPEVSIIVTFHNEAWSVLIRSVYSILNRTPDSLLKEVILVDDFSSLEHLKEPLDQF 178
Query: 137 IQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 196
+++F KV+++R TER+GLIR R RG +E+ G+V+VFLD+H E W PL+ PI +
Sbjct: 179 MEQFQ-KVKIVRATERQGLIRARLRGYREAVGDVLVFLDSHIECAEGWFEPLIDPIARNW 237
Query: 197 KIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE-PYK 255
+ PVID ID +T+++ G F+W +++ + +PE E K+R+ P +
Sbjct: 238 STVMTPVIDVIDKETFQY-GFQAASATNVGGFDWSLMFTWHFVPETEQKRRQNKHYLPVR 296
Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
SPT AGGLFA+ R +F +G YD G+ +WGGEN ELSF+IWMCGG++ PCS +GHV+R
Sbjct: 297 SPTMAGGLFAISRKYFEHIGTYDEGMDIWGGENLELSFRIWMCGGTLLTAPCSHVGHVFR 356
Query: 316 SFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
PY+FG + VK L+ R+ E W D+ Y+Y + + GD+S +
Sbjct: 357 HTPPYSFGPKKNVVKNNLV-----RMAEVWLDD--FKYYYYQHINYTLGNYGDVSAR 406
>gi|300794826|ref|NP_001179661.1| polypeptide N-acetylgalactosaminyltransferase 14 [Bos taurus]
gi|296482443|tpg|DAA24558.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14) [Bos
taurus]
Length = 552
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 140/335 (41%), Positives = 196/335 (58%), Gaps = 17/335 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S I+ +R +PD R+ C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERIASNRVVPDTRLFRCTLLVYCADLPPTSIIIAFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ SI+ RTP ++EIILVDDFS+ + ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSILNRTPMNLIQEIILVDDFSNDPEDCKQLIKL-----PKVKCLRNNERQGLVRSR 183
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I T+ + E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFNY---IE 240
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + +R +EP ++P AGGLF MD+++F LG YD
Sbjct: 241 SASELRGGFDWSLHFQWEQLTPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFYYLGKYDM 300
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYIFP------DGNANTYIKN 354
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
KR E W DE +K Y+Y P A+ G+I +
Sbjct: 355 TKRTAEVWMDE-YKQYYYASRPFALERPFGNIESR 388
>gi|297265736|ref|XP_002799240.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Macaca
mulatta]
Length = 517
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 141/332 (42%), Positives = 194/332 (58%), Gaps = 17/332 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R IPD R C Y DLP S+I+ FHNE S+L+
Sbjct: 34 VGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 93
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 94 RTIRSVLNRTPMHLIREIILVDDFSNDPDDCKQLIRL-----PKVKCLRNNERQGLVRSR 148
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 149 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 205
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + +R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 206 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 265
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N
Sbjct: 266 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 319
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
KR E W DE +K Y+Y P A+ G++
Sbjct: 320 TKRTAEVWMDE-YKQYYYAARPFALERPFGNV 350
>gi|241682071|ref|XP_002411622.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215504373|gb|EEC13867.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 473
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 144/351 (41%), Positives = 205/351 (58%), Gaps = 15/351 (4%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
G G+ +L A + DA + G N+ S+ I +R++ DLR C+ +P DLP
Sbjct: 106 GSRGQGVYLGGAEKKEADAQFSKAGFNVYVSDRIPLNRSLADLRPLPCQALRFPKDLPSV 165
Query: 85 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF--NG 142
SV++ F+NE S+L+RTV+S++ R+P + L E+ILVDDFS ++ +L +++R G
Sbjct: 166 SVVITFYNEILSALLRTVYSVVNRSPRRILREVILVDDFSDLPEVKGQLYRFLKRHFRPG 225
Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
V+L+R REGLIR R GAKE+ G V+VFLD+HCE WL PL+ + D + P
Sbjct: 226 FVKLLRLPRREGLIRARLVGAKEAAGHVLVFLDSHCEATRQWLEPLVTAVNDDPTTVASP 285
Query: 203 VIDGIDYQTWEFRSV-YEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
+I ID T+ + + P G FEW + P + + P +SPT AG
Sbjct: 286 IITIIDGNTFAHEDMGFLP----LGSFEWNGDFTWIHPPP--GWRSPDQTAPVRSPTIAG 339
Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
GLFA+DR +F ++GGYDPG+ WGGEN ELSF+IWMCGG + VPCS++GHV+R+ PY
Sbjct: 340 GLFAVDRTYFFQMGGYDPGMNGWGGENLELSFRIWMCGGRLVVVPCSQVGHVFRTDRPYT 399
Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
D N KR E W DE +K FY +P+ +D GD+SE+
Sbjct: 400 IPNETDS-----HARNTKRAAEVWMDE-YKEIFYKEKPVMQTIDAGDVSER 444
>gi|354468358|ref|XP_003496633.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Cricetulus griseus]
Length = 541
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 141/333 (42%), Positives = 194/333 (58%), Gaps = 19/333 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R + C Y DLP S+I+ FHNE S+L+
Sbjct: 58 VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 117
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
RT+ S++ RTP ++EIILVDDFS+ ED Q KV+ +RN ER+GL+R+
Sbjct: 118 RTIRSVLNRTPTHLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNNERQGLVRS 171
Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
R RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ +
Sbjct: 172 RMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---I 228
Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
E RG F+W + ++ +L + R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 229 ESASELRGGFDWSLHFQWEQLSPEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYD 288
Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
+ +WGGENFE+SF++WMCGGS+E +PCSR+GHV+R PY F G TY
Sbjct: 289 VDMDIWGGENFEISFRVWMCGGSLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIK 342
Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
N KR E W DE +K Y+Y P A+ G+I
Sbjct: 343 NTKRTAEVWMDE-YKQYYYAARPFALERPFGNI 374
>gi|355689622|gb|AER98894.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 [Mustela putorius
furo]
Length = 351
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 141/270 (52%), Positives = 188/270 (69%), Gaps = 7/270 (2%)
Query: 2 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
PV + G LGN EP EP+ GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 83 PVLRP-GILGNFEPKEPEPHGVVGGPGENAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 141
Query: 59 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201
Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
L+DDFS+K L KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKGKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
CEV LNW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 262 CEVALNWYAPLVAPISKDRTICTVPIIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 321
Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFA 265
L RE K RK +EPY+SP AGGLF+
Sbjct: 322 RVPLTPREKKMRKTKTEPYRSPAMAGGLFS 351
>gi|321477075|gb|EFX88034.1| hypothetical protein DAPPUDRAFT_305669 [Daphnia pulex]
Length = 553
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 139/349 (39%), Positives = 210/349 (60%), Gaps = 19/349 (5%)
Query: 30 AYHLPEAYRAAGDASLGEYG-----MNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
AY +AY + G GE N E S+ + +R IPD R ++C ++ DLP
Sbjct: 58 AYFNEKAYISKGKLKPGEDAYHNNKFNQEASDTLESNRAIPDYRHKKCLDLEFSKDLPST 117
Query: 85 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 144
SVI+ FHNE S+L+RT+ S++ R+P+ ++EIILVDDFS+ A ++L KV
Sbjct: 118 SVIITFHNEARSTLLRTIVSVLNRSPSHLIKEIILVDDFSNDASDGRELVQI-----EKV 172
Query: 145 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 204
L+RN++REGL+R+R +GA+ + GE + FLD+HCE WL PLLA + DR + PVI
Sbjct: 173 ILVRNSKREGLVRSRVKGAEIATGEFLTFLDSHCECNEGWLEPLLARVVEDRTRIVCPVI 232
Query: 205 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGL 263
D I ++++ + RG F+W +++K LP E RK + + P ++P AGGL
Sbjct: 233 DVIAMDSFQYIAA---STELRGGFDWNLVFKWELLPAEEKANRKTDPTIPIRTPMIAGGL 289
Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
F +DR +F +LG YD + +WGGEN E+SF+ W CGG +E VPCSR+GHV+R PY+F
Sbjct: 290 FVIDRQYFQKLGSYDLQMDIWGGENLEISFRTWQCGGRLEIVPCSRVGHVFRKQHPYSFP 349
Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ G + N +R E W D+ +K Y++ P+A + G+I+++
Sbjct: 350 GGS----GTIFARNTRRAAEVWMDD-YKKYYFAAVPMARTVTFGNITDR 393
>gi|109102562|ref|XP_001105195.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
5 [Macaca mulatta]
Length = 552
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 141/335 (42%), Positives = 195/335 (58%), Gaps = 17/335 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R IPD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIREIILVDDFSNDPDDCKQLIRL-----PKVKCLRNNERQGLVRSR 183
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + +R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
KR E W DE +K Y+Y P A+ G++ +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNVESR 388
>gi|124487253|ref|NP_001074890.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Mus musculus]
gi|341940755|sp|Q9JJ61.2|GLTL1_MOUSE RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1;
AltName: Full=Polypeptide GalNAc transferase-like
protein 1; Short=GalNAc-T-like protein 1;
Short=pp-GaNTase-like protein 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase-like
protein 1; AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase-like protein 1
gi|52851357|dbj|BAD52071.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase [Mus musculus]
gi|74218446|dbj|BAE23810.1| unnamed protein product [Mus musculus]
gi|115527273|gb|AAI10635.1| Galntl1 protein [Mus musculus]
gi|115528977|gb|AAI25016.1| Galntl1 protein [Mus musculus]
Length = 558
Score = 267 bits (683), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 144/345 (41%), Positives = 202/345 (58%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY +A GE + N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAYLSAKQLKPGEDPYRQHAFNQLESDKLSSDRPIRDTRHYSCPSLSYSSDLPATSVIIT 130
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
+REGLIR+R RGA + V+ FLD+HCEV + WL P+L + D + P+ID I
Sbjct: 186 DKREGLIRSRVRGADVAGATVLTFLDSHCEVNVEWLQPMLQRVMEDHTRVVSPIIDVISL 245
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKS 302
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400
>gi|195425502|ref|XP_002061040.1| GK10658 [Drosophila willistoni]
gi|194157125|gb|EDW72026.1| GK10658 [Drosophila willistoni]
Length = 489
Score = 267 bits (683), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 145/358 (40%), Positives = 210/358 (58%), Gaps = 14/358 (3%)
Query: 5 KADGKLGNLEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISF 60
K D L PP E +E PGE GK LP +A + A + + N S+ IS
Sbjct: 92 KEDAAQKVLLPPSE-LEETPGEMGKPVELPTNMSDAMKKAVEDGWTKNAFNQYASDLISV 150
Query: 61 DRTIPDLRMEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
+R +PD R CK Y DLPK VI+ FHNE +S+L+RTVHS++ R+P + ++IL
Sbjct: 151 NRKLPDPRSAWCKDTARYLTDLPKTDVIICFHNEAWSTLLRTVHSVLARSPEHLIGKVIL 210
Query: 120 VDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
VDD+S L +L++Y + KV+L+R +REGL+R R G + + V+ FLD+HCE
Sbjct: 211 VDDYSDMPHLKIQLKEYFSLY-PKVQLVRVAKREGLVRARLFGMEYADSPVVTFLDSHCE 269
Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 239
WL PLL I +R + P ID ID +T+++ Y+ + G+F+W + + +
Sbjct: 270 CTEGWLEPLLDRIARNRNTVASPTIDMIDPKTFQYN--YDGANDVLGVFDWNLEFYWIPI 327
Query: 240 PEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCG 299
P RE K+R + +EP ++PT AGGLFA+D FF +G YDPG +WGG+N ELSFK WMCG
Sbjct: 328 PLRELKRRNHFAEPIQTPTIAGGLFAIDLEFFRSVGTYDPGFNIWGGDNLELSFKTWMCG 387
Query: 300 GSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
G +E +PCS +GH++R PY + + ++ N R+ E W D+ K Y+Y R
Sbjct: 388 GILEIIPCSHVGHIFRDDSPYEWPS----SRAMMVESNLARLAEVWLDDYAK-YYYER 440
>gi|242020557|ref|XP_002430719.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212515909|gb|EEB17981.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 511
Score = 267 bits (683), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 139/307 (45%), Positives = 195/307 (63%), Gaps = 11/307 (3%)
Query: 51 NMETSNHISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
N+ S+ I +RT+PD+R + C KY + P LP SV++VFHNE +S+L+RTV S+I R
Sbjct: 35 NLLASDRIPLNRTLPDVRKKRCLTKYQNLPELLP-TSVVIVFHNEAWSTLLRTVQSVIDR 93
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
+P + L EIILVDD S++ L + L++Y+ R V++IR EREGLIR R GAKE++G
Sbjct: 94 SPRELLTEIILVDDGSTRKFLKEDLDEYVARLPVPVKVIRTKEREGLIRARMIGAKEAKG 153
Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
+V+ FLDAHCE WL PLL + DRK + PVID I+ T+ + +E H+ G F
Sbjct: 154 QVLTFLDAHCECTKGWLEPLLVRVSEDRKKVVCPVIDIINDDTFAYVRSFE--LHW-GAF 210
Query: 229 EWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
W + ++ L E KKRK + +EP+ +P AGGLFA+ R +F E+G YD + +WGGE
Sbjct: 211 NWNLHFRWYTLGTTEIKKRKNDVTEPFPTPAMAGGLFAIRRDYFYEIGAYDEQMKIWGGE 270
Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
N E+SF+ W CGGS+E VPCS +GH++R PY F G ++ N RV W D
Sbjct: 271 NLEMSFRGWQCGGSVEIVPCSHVGHLFRKSSPYTFPGGV----GEILHANLARVALVWMD 326
Query: 348 EKHKAYF 354
E + +F
Sbjct: 327 EWQEFFF 333
>gi|195028169|ref|XP_001986949.1| GH20244 [Drosophila grimshawi]
gi|193902949|gb|EDW01816.1| GH20244 [Drosophila grimshawi]
Length = 599
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 151/357 (42%), Positives = 204/357 (57%), Gaps = 14/357 (3%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC--KYWDYPLDLP 82
G G A HL A +A GD + +N E S +S++RT+ D R C + +D P LP
Sbjct: 87 GNKGVATHLKGAAKARGDKIYKKIALNEELSEQLSYNRTVGDHRNPLCLNQRYDNPATLP 146
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFN 141
ASVI++F+NE +S L+RTVHS + Q L+EIILVDD S A+L KL+ Y++ RF
Sbjct: 147 TASVIVIFYNEPYSVLLRTVHSTLNTCNEQALKEIILVDDGSDNAELGGKLDHYVKTRFP 206
Query: 142 -GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
GKV ++R R GLIR R GA+ + G+V++FLDAHCE W PLL I R +
Sbjct: 207 IGKVTVLRLNNRLGLIRARLAGARIATGDVLIFLDAHCEANEGWCEPLLQRIKDSRTSVL 266
Query: 201 VPVIDGID-----YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYK 255
VP+ID ID Y T ++S + G F+W L + +L + + P
Sbjct: 267 VPIIDVIDSVDFQYSTNGYKSFQVGGFQWNGHFDWVNLPEREKLRQSRECNQPREICPAY 326
Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
SPT AGGLFAMDR +F E+G YD + WGGEN E+SF+IW CGG+IE +PCSR+GH++R
Sbjct: 327 SPTMAGGLFAMDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETIPCSRVGHIFR 386
Query: 316 SFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
F PY F D G N R+ W DE +F R L D+GD++ +
Sbjct: 387 DFHPYKFPNDRD-THG----INTARMALVWMDEYINVFFLNRPDLKFHPDIGDVTHR 438
>gi|417402722|gb|JAA48197.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 557
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 144/345 (41%), Positives = 199/345 (57%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY AA GE + N S+ +S DR D R C Y DLP SVI+
Sbjct: 71 KAYLAAKQLKAGEDPYRQHAFNQLESDKLSSDRPTRDTRHYSCPSLSYSADLPATSVIIT 130
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
REGLIR+R RGA + V+ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 186 DRREGLIRSRVRGADVASAAVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 245
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 302
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400
>gi|296215364|ref|XP_002754093.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Callithrix jacchus]
Length = 558
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 145/346 (41%), Positives = 200/346 (57%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA + V+ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVIS 244
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVASR 400
>gi|195384663|ref|XP_002051034.1| GJ22477 [Drosophila virilis]
gi|194145831|gb|EDW62227.1| GJ22477 [Drosophila virilis]
Length = 598
Score = 267 bits (682), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 152/367 (41%), Positives = 211/367 (57%), Gaps = 16/367 (4%)
Query: 17 LEPYKEGP--GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC-- 72
L+ K+ P G G A HL A +A GD + +N E S +S++RT+ D R C
Sbjct: 76 LDLKKQDPSLGNKGAAVHLHGAAKARGDKIYKKIALNEELSEQLSYNRTVGDHRNPLCLA 135
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
+ +D P LP ASVI++F+NE +S L+RTVHS + + L+E+ILVDD S A+L K
Sbjct: 136 QKYDDPGTLPTASVIIIFYNEPYSVLVRTVHSTLNTCNQKALKEVILVDDGSDNAELGGK 195
Query: 133 LEDYIQ-RF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
L+ Y + RF +GKV ++R R GLIR R GA+ + G+V++FLDAHCE + W PLL
Sbjct: 196 LDHYTRTRFPSGKVTILRLKNRLGLIRARLAGARIASGDVLIFLDAHCEANVGWCEPLLQ 255
Query: 191 PIYSDRKIMTVPVIDGID-----YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
I R + VP+ID ID Y T ++S + G F+W L + +L +
Sbjct: 256 RIKDSRTSVLVPIIDVIDANDFQYSTNGYKSFQVGGFQWNGHFDWVNLSEREKLRQSREC 315
Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
+ P SPT AGGLFAMDR +F E+G YD + WGGEN E+SF+IW CGG+IE +
Sbjct: 316 SQPREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETI 375
Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLD 365
PCSR+GH++R F PY F DR + N R+ W DE +F R L D
Sbjct: 376 PCSRVGHIFRDFHPYKFPN--DRDTHGI---NTARMALVWMDEYINVFFLNRPDLKFHAD 430
Query: 366 MGDISEQ 372
+GD++ +
Sbjct: 431 IGDVTHR 437
>gi|402594510|gb|EJW88436.1| hypothetical protein WUBG_00649 [Wuchereria bancrofti]
Length = 612
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 142/352 (40%), Positives = 208/352 (59%), Gaps = 20/352 (5%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLD 80
G GE G+ L + + + N+ S+ I+ +R++PD+R +C+ Y +
Sbjct: 44 GAGEDGRPVRLSKEDERLSEDTFVINQFNLVVSDRIALNRSLPDIRKHQCRTKTYLPSSE 103
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP SVI+V+HNE FS+LMRTV S+I R+P + L+EIILVDDFS++ L +LE + +
Sbjct: 104 LPTTSVIIVYHNEAFSTLMRTVMSVILRSPRENLKEIILVDDFSTRTFLKVELEKLVAQL 163
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
++++IR ER GLIR R GA E+ G+V+ FLD+HCE W+ PLLA I +RK +
Sbjct: 164 GTRIKIIRANERVGLIRARLMGANEAEGDVLTFLDSHCECTKGWMEPLLARIKENRKAVV 223
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
PVID I+ +T+ ++ E +RG F W + ++ LP K R + ++P SPT
Sbjct: 224 CPVIDIINERTFAYQKGIEL---FRGGFNWNLQFRWYALPPEMIKSRSDDPTKPIISPTM 280
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELS----------FKIWMCGGSIEWVPCSR 309
AGGLF++DR +F E+G YD + +WGGEN E+S F +W CGG +E +PCS
Sbjct: 281 AGGLFSIDRKYFEEIGTYDHEMDIWGGENIEISLRLKLLKKNCFLVWQCGGRVEILPCSH 340
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+GHV+R P++F R G ++ N RV E W DE K +FY P A
Sbjct: 341 VGHVFRRTSPHDF---PGRKSGTILNSNLLRVAEVWMDE-WKFHFYRTAPQA 388
>gi|432097046|gb|ELK27544.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
5, partial [Myotis davidii]
Length = 363
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 139/325 (42%), Positives = 199/325 (61%), Gaps = 11/325 (3%)
Query: 48 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
YG+N S + R +PD R + C YP LP AS+++ FHNE F++L RTV S++
Sbjct: 15 YGLNTIISKSLGNQRPVPDTRDKMCLKKRYPTRLPSASIVICFHNEEFNTLFRTVSSVMN 74
Query: 108 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 167
TP Q LEEIILVDD S DL +KL+ +++ F GK+++IR T+REGLIR R GA +
Sbjct: 75 LTPHQILEEIILVDDMSEFDDLKEKLDYHLEMFRGKIKVIRTTKREGLIRARLIGAAHAS 134
Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
G+V+VFLD+HCEV WL PLLA I DRK++ P++D ID+ T Y P RG
Sbjct: 135 GDVLVFLDSHCEVNRVWLEPLLAAIAKDRKMVVCPMVDSIDHLTLN----YYPAPIVRGA 190
Query: 228 FEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
F+W + + + + E + + P +SP +GG+FA++R +F ELG YD + +WG E
Sbjct: 191 FDWHLRFVWDTVFSYEMDGPEGPTTPIRSPAMSGGIFAINRHYFNELGQYDKDMNLWGAE 250
Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
N ELS +IWMCGG + +PCSR+GHV R + N ++ ++ YN R++ W D
Sbjct: 251 NLELSLRIWMCGGQLFILPCSRVGHVDRHIVQ-NVTQVLRALR-----YNNLRLVHVWLD 304
Query: 348 EKHKAYFYTREPLAMFLDMGDISEQ 372
E +K F+ R P + G+ISE+
Sbjct: 305 E-YKEQFFLRRPDLKSIPYGNISER 328
>gi|281349386|gb|EFB24970.1| hypothetical protein PANDA_005243 [Ailuropoda melanoleuca]
Length = 553
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 145/345 (42%), Positives = 200/345 (57%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY AA GE + N S+ +S DR I D R C Y DLP SVI+
Sbjct: 67 KAYLAAKQLKPGEDPYRQHAFNQLESDKLSPDRAIRDTRHYSCPSVSYSSDLPATSVIIT 126
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 127 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 181
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
REGLIR+R RGA + V+ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 182 DRREGLIRSRVRGADMATAAVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 241
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D++
Sbjct: 242 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 298
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 299 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 352
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 353 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 396
>gi|403296667|ref|XP_003939220.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Saimiri boliviensis boliviensis]
gi|403296669|ref|XP_003939221.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Saimiri boliviensis boliviensis]
Length = 622
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 152/370 (41%), Positives = 216/370 (58%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P GPG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NGPGADGKAFQKRKWTPLETQEKEEGFKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
CS +GHV+R+ P+ F K + +I N R+ E W D +K FY R +A
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTN-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMAQ 455
Query: 363 FLDMGDISEQ 372
GDISE+
Sbjct: 456 EKSFGDISER 465
>gi|301763305|ref|XP_002917071.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Ailuropoda melanoleuca]
Length = 555
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 145/345 (42%), Positives = 200/345 (57%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY AA GE + N S+ +S DR I D R C Y DLP SVI+
Sbjct: 69 KAYLAAKQLKPGEDPYRQHAFNQLESDKLSPDRAIRDTRHYSCPSVSYSSDLPATSVIIT 128
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 129 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 183
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
REGLIR+R RGA + V+ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 184 DRREGLIRSRVRGADMATAAVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 243
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D++
Sbjct: 244 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 300
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 301 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 354
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 355 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 398
>gi|427794265|gb|JAA62584.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Rhipicephalus pulchellus]
Length = 591
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 142/313 (45%), Positives = 190/313 (60%), Gaps = 13/313 (4%)
Query: 47 EYGMNMETSNHISFDRTIPDLRMEECKYWDYP-LDLPKASVILVFHNEGFSSLMRTVHSI 105
++ N+ SN + R++PD R C+ ++ LP ASV++ F+NE +S+L+RTVHSI
Sbjct: 90 QHAFNVLISNRLGKVRSLPDTRNPLCRQQEFQEQSLPTASVVVCFYNEAWSALVRTVHSI 149
Query: 106 IKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAK 164
++RTPA L E+ILVDD S+ +L +L Y+ VRLIR REGLIR R GA
Sbjct: 150 LERTPAALLHELILVDDNSTLPELGLQLSRYVASELPSHVRLIRTPAREGLIRARMYGAH 209
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+ G+V+VFLD+HCEV + WL P+LA I ++R +T PVID I+ T+E Y
Sbjct: 210 NASGQVLVFLDSHCEVNVGWLEPMLARIGANRTTVTCPVIDIINADTFE----YSASPIV 265
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RG F WG+ +K P ++ +P SPT AGGLFAMDR +F ELG YD G+ +W
Sbjct: 266 RGGFNWGLHFKWESPPRLRGPQQAI--DPIPSPTMAGGLFAMDRQYFHELGEYDDGMDIW 323
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
GGEN E+SF+IWMCGG +E +PCSR+GHV+R PY D +T N RV
Sbjct: 324 GGENLEISFRIWMCGGRLEILPCSRVGHVFRRRRPYGSPSGED-----TLTKNSLRVAHV 378
Query: 345 WFDEKHKAYFYTR 357
W DE Y TR
Sbjct: 379 WMDEYKTYYLQTR 391
>gi|291391583|ref|XP_002712189.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Oryctolagus cuniculus]
Length = 941
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 148/360 (41%), Positives = 214/360 (59%), Gaps = 14/360 (3%)
Query: 16 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
P +P + PG+ G +P E N+ S+ I DR I D R C
Sbjct: 433 PRDP--QAPGQFGLPVVVPHGKEKEAKRRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQ 490
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
+LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDD S+K L L+
Sbjct: 491 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDCSTKDYLKDNLDK 550
Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y
Sbjct: 551 YMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 609
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK + ++
Sbjct: 610 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFLWPMNFGWKTIPPDVVAKNKIKETDII 666
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
+ P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 667 RCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 726
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
R+ PY+F K DR+K + N RV E W DE +K FY + LD+G++++Q
Sbjct: 727 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIEQGLDVGNLTQQ 781
>gi|348573294|ref|XP_003472426.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 [Cavia
porcellus]
Length = 556
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 144/345 (41%), Positives = 200/345 (57%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY +A GE + N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAYLSAKQLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSLSYSSDLPATSVIIT 130
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
REGLIR+R RGA + ++ FLD+HCEV + WL P+L + D + P+ID I
Sbjct: 186 DRREGLIRSRVRGADVAAAAILTFLDSHCEVNVEWLQPMLQRVKEDHTRVVSPIIDVISL 245
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D+A
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKA 302
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400
>gi|390347277|ref|XP_780324.3| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 580
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 145/356 (40%), Positives = 214/356 (60%), Gaps = 13/356 (3%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYP--L 79
GPGE GKA +P+ + + N+ S+ IS +RT+PD+RM+ CK YP
Sbjct: 72 NGPGEMGKAVIIPQDKESLKNEMFRINQFNLLASDMISINRTLPDVRMDGCKRKSYPPVS 131
Query: 80 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
+LP S+++VFHNE +S+L+R++HSII R+P + L EIILVDD S + L Q+L+DY++R
Sbjct: 132 ELPSTSIVIVFHNEAWSTLLRSIHSIINRSPRELLTEIILVDDASERDFLGQQLDDYVKR 191
Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP--LLAPIYSDRK 197
V + R R GLIR R RGA +G V+ FL +H + + L P L A DR+
Sbjct: 192 LQVPVHVERMGTRSGLIRARLRGAGLVKGHVLGFLXSHDQCSASSLRPVYLEASRRHDRR 251
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKS 256
+ P+ID I + F + D Y G F W + ++ +P+REA +R + + P +S
Sbjct: 252 NVVCPIIDVISDDNFAFHT--GSDMTYGG-FNWKLQFRWYPVPQREADRRGGDRTIPLRS 308
Query: 257 PTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRS 316
PT AGGLF++D+ +F E+G YD G+ VWGGEN E+SF+IWMCGG++E V CS +GHV+R
Sbjct: 309 PTMAGGLFSIDKTYFEEIGTYDAGMDVWGGENLEISFRIWMCGGTLEIVTCSHVGHVFRK 368
Query: 317 FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
PY F R+ I N +R+ E W D+ + ++Y P + GD+S++
Sbjct: 369 STPYTFPGGTGRI----INRNNQRLAEVWMDD-FRHFYYRISPGVRKTEFGDVSQR 419
>gi|449274705|gb|EMC83783.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Columba livia]
Length = 502
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 141/346 (40%), Positives = 202/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y DLP S+I+
Sbjct: 18 KAY-LSSKQLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRCTSVRYDTDLPATSLII 76
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTP ++EIILVDDFSS + Q L KV+ +R
Sbjct: 77 TFHNEARSTLLRTVKSVLNRTPPSLIQEIILVDDFSSDPEDCQLLTKI-----PKVKCLR 131
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
NT REGLIR+R RGA+ + +++ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 132 NTRREGLIRSRVRGAEVATADILTFLDSHCEVNSEWLQPMLQRVKEDYTRVVSPIIDVIS 191
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R ++ ++P AGG+F +D+
Sbjct: 192 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVIDK 248
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PY+F
Sbjct: 249 SWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYDFP----- 303
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++E+
Sbjct: 304 -EGNALTYIKNTKRTAEVWMDE-YKQYYYEARPSAIGKSFGSVAER 347
>gi|297692565|ref|XP_002823614.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pongo
abelii]
Length = 578
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 156/371 (42%), Positives = 212/371 (57%), Gaps = 26/371 (7%)
Query: 12 NLEPPLEPYKEGP------GEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRT 63
+L PL YK+ P GE GKA L E + + Y +N+ S+ IS R
Sbjct: 58 DLSQPL--YKKPPADSHALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRISLHRH 115
Query: 64 IPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD
Sbjct: 116 IEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDD 175
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE
Sbjct: 176 LSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNS 234
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPE 241
WL PLL I D + PVID ID+ T+EF EP G F+W + ++ + +P+
Sbjct: 235 GWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPK 291
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
++ ++ +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG
Sbjct: 292 QKRDRQISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGK 351
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
+E PCS +GHV+ PY P N R E W DE +K +FY R P A
Sbjct: 352 LEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPA 401
Query: 362 MFLDMGDISEQ 372
GDISE+
Sbjct: 402 RKEAYGDISER 412
>gi|345484986|ref|XP_003425168.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 2 [Nasonia vitripennis]
Length = 610
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 152/360 (42%), Positives = 207/360 (57%), Gaps = 29/360 (8%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
G L P + PGE G+ LP E + D + N S+ IS R++P
Sbjct: 95 GVLVAPRDQDTSAPGEMGRPVILPANLTTEIKKLVDDGWINN-AFNQYASDLISVHRSLP 153
Query: 66 DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
D R CK Y DLP +VI+ FHNE +S L+RTVHS++ R+P ++EIILVDD+S
Sbjct: 154 DPRDPWCKEPGRYQKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPDHLIQEIILVDDYS 213
Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
L ++LEDY+ + KV+++R ++REGLIR R GA ++ V+ +LD+HCE W
Sbjct: 214 DMPHLKRQLEDYMMNYP-KVKILRASKREGLIRARLLGAAMAKAPVLTYLDSHCECTEGW 272
Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 237
L PLL I ++ + PVID ID T E+ H+R G F+W + + +
Sbjct: 273 LEPLLDRIARNQTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 324
Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
+PERE K+ K +EP SPT AGGLFA+DR FF LG YD G +WGGEN ELSFK WM
Sbjct: 325 AVPEREKKRHKNPAEPVWSPTMAGGLFAIDRLFFERLGTYDSGFDIWGGENLELSFKTWM 384
Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
CGG++E VPCS +GH++R PY + R ++ N R+ E W DE K Y+Y R
Sbjct: 385 CGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKRNSIRLSEVWLDEYAK-YYYQR 438
>gi|391342179|ref|XP_003745400.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Metaseiulus occidentalis]
Length = 610
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 143/336 (42%), Positives = 204/336 (60%), Gaps = 10/336 (2%)
Query: 22 EGPGEGGK-AYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL 79
EG G G+ Y LP E R+ S+ + N+ S+ IS DRT+ D R C+ Y
Sbjct: 97 EGAGNMGQPVYPLPSEVVRSKMLYSINRF--NLLVSDKISVDRTLADARKSVCRNISYAY 154
Query: 80 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
DLP SVI+VFHNE +S+L+RTVHS+I R+P ++EI+LVDD S + L + L+ Y++
Sbjct: 155 DLPDTSVIIVFHNEAWSTLLRTVHSVINRSPRDLVKEIMLVDDASDREFLKRSLDAYVRS 214
Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
N +++IR+ +R GLIR R GA+ + G+V+ FLDAHCE WL PLL I DR +
Sbjct: 215 LNFPIKVIRSPKRSGLIRARLMGARAAEGKVLTFLDAHCECTTGWLEPLLQRIKEDRTRV 274
Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPT 258
P+ID I T+ + +E H+ G W M ++ + K+R + SEP+K+P
Sbjct: 275 VCPIIDIIHDDTFAYVKSFE--LHW-GAINWEMHFRWYPVGPHVLKQRHGDPSEPFKTPV 331
Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
AGGLF++D+ +F E+G YD + +WGGEN E+SF+IW CGGS+E VPCS +GHV+R
Sbjct: 332 MAGGLFSIDKEYFYEMGAYDEQMDIWGGENVEMSFRIWQCGGSLEIVPCSHVGHVFRRSS 391
Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
PY F + G ++ N RV E W D+ + YF
Sbjct: 392 PYTFPH--PKGVGGILFSNLARVAEVWMDDWAEFYF 425
>gi|345484988|ref|XP_001605337.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Nasonia vitripennis]
Length = 646
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 152/360 (42%), Positives = 207/360 (57%), Gaps = 29/360 (8%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
G L P + PGE G+ LP E + D + N S+ IS R++P
Sbjct: 94 GVLVAPRDQDTSAPGEMGRPVILPANLTTEIKKLVDDGWINN-AFNQYASDLISVHRSLP 152
Query: 66 DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
D R CK Y DLP +VI+ FHNE +S L+RTVHS++ R+P ++EIILVDD+S
Sbjct: 153 DPRDPWCKEPGRYQKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPDHLIQEIILVDDYS 212
Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
L ++LEDY+ + KV+++R ++REGLIR R GA ++ V+ +LD+HCE W
Sbjct: 213 DMPHLKRQLEDYMMNYP-KVKILRASKREGLIRARLLGAAMAKAPVLTYLDSHCECTEGW 271
Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 237
L PLL I ++ + PVID ID T E+ H+R G F+W + + +
Sbjct: 272 LEPLLDRIARNQTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 323
Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
+PERE K+ K +EP SPT AGGLFA+DR FF LG YD G +WGGEN ELSFK WM
Sbjct: 324 AVPEREKKRHKNPAEPVWSPTMAGGLFAIDRLFFERLGTYDSGFDIWGGENLELSFKTWM 383
Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
CGG++E VPCS +GH++R PY + R ++ N R+ E W DE K Y+Y R
Sbjct: 384 CGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKRNSIRLSEVWLDEYAK-YYYQR 437
>gi|51316066|sp|Q95JX4.2|GLTL5_MACFA RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5;
AltName: Full=Polypeptide GalNAc transferase 15;
Short=GalNAc-T15; Short=pp-GaNTase 15; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 15;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 15
gi|15207881|dbj|BAB62965.1| hypothetical protein [Macaca fascicularis]
Length = 443
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 138/331 (41%), Positives = 201/331 (60%), Gaps = 17/331 (5%)
Query: 45 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
L +YG N+ S + +R +PD R + C YP LP AS+++ FHNE F +L RTV S
Sbjct: 97 LLKYGFNVIISRSLGIEREVPDTRNKMCLQKHYPARLPTASIVICFHNEEFHALFRTVSS 156
Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
++ TP +LEEIILVDD S DL +KL+ +++ F GK+++IRN +REGLIR R GA
Sbjct: 157 VMNLTPHYFLEEIILVDDMSEVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGAS 216
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+ G+V+V LD+HCEV WL PLL I D K++ P+ID ID +T E Y+P
Sbjct: 217 HASGDVLVILDSHCEVNRVWLEPLLHAIAKDPKMVVRPLIDVIDDRTLE----YKPSPVV 272
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RG F+W + +K + + E + ++P +SP +GG+FA+ R +F E+G YD + W
Sbjct: 273 RGAFDWNLQFKWDNVFSYEMDGPEGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFW 332
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLIT---YNYKRV 341
GGEN ELS +IWMCGG + +PCSR+GH+ K R +I+ +NY R+
Sbjct: 333 GGENLELSLRIWMCGGQLFIIPCSRVGHI---------SKKQTRKTSAIISATIHNYLRL 383
Query: 342 IETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ W DE +K F+ R+P ++ G+I E+
Sbjct: 384 VHVWLDE-YKEQFFLRKPGLKYVTYGNIHER 413
>gi|345328051|ref|XP_003431229.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
2 [Ornithorhynchus anatinus]
Length = 863
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 150/368 (40%), Positives = 217/368 (58%), Gaps = 13/368 (3%)
Query: 9 KLGNLEPPLEPYK-EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL 67
K+ L+ L P + PG+ G A +P + E N+ S+ I DR I D
Sbjct: 431 KVLTLDVTLSPRDPKAPGQFGHAAVVPAEKQERAKKRWKEGNFNVYLSDLIPVDRAIEDT 490
Query: 68 RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
R + C DLP ++I+ F +E +S+L+R++HS++ R+P ++EIILVDDFS+K
Sbjct: 491 RPDGCAEQLVHNDLPTTTIIMCFVDEVWSTLLRSIHSVLNRSPPHLIQEIILVDDFSTKE 550
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L L+ Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL P
Sbjct: 551 HLKDNLDKYMAQF-PKVRVLHLKERHGLIRARLAGAEIATGDVLTFLDSHVECNVGWLEP 609
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 247
LL + RK + PVI+ I + +++V D+ RGIF W M + +P +K
Sbjct: 610 LLERVRLHRKKVACPVIEVISDKDLSYQTV---DNFQRGIFTWPMNFGWKSIPPEVIEKN 666
Query: 248 KYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
K ++ + P AGGLF++D+ +F ELG YDPGL VWGGEN E+SFK+WMCGG IE VP
Sbjct: 667 KMKETDIIRCPVMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKVWMCGGEIEIVP 726
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFL 364
CSR+GH++R+ PY+F K DRVK + N RV E W DE +K FY L
Sbjct: 727 CSRVGHIFRNDNPYSFPK--DRVK--TVERNLVRVAEVWLDE-YKDLFYGHGLHLLERRS 781
Query: 365 DMGDISEQ 372
D+G++++Q
Sbjct: 782 DIGNLTQQ 789
>gi|194756744|ref|XP_001960635.1| GF13455 [Drosophila ananassae]
gi|190621933|gb|EDV37457.1| GF13455 [Drosophila ananassae]
Length = 688
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 147/357 (41%), Positives = 205/357 (57%), Gaps = 28/357 (7%)
Query: 13 LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
++PP ++E PGE GK LP + + A D + N S+ +S R++PD R
Sbjct: 138 IDPPGN-FEENPGEMGKPVRLPKEMPDDMKKAVDDGWTKNAFNQYVSDLVSVHRSLPDPR 196
Query: 69 MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
CK Y +LP VI+ FHNE ++ L+RTVHS++ R+P + +IILVDD+S
Sbjct: 197 DAWCKDSTQYLTNLPTTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 256
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL P
Sbjct: 257 HLKKQLEDYFAAY-PKVQIIRGQKREGLIRARILGANHAKSAVLTYLDSHCECTEGWLEP 315
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
LL I + + PVID I T E+ HYR G F+W + + + +P
Sbjct: 316 LLDRIARNSTTVVCPVIDVISDDTLEY--------HYRDSSGVNVGGFDWNLQFSWHSVP 367
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
ERE K+ ++EP SPT AGGLFA+DR FF LG YD G +WGGEN ELSFK WMCGG
Sbjct: 368 ERERKRHNNSAEPVYSPTMAGGLFAIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGG 427
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
++E VPCS +GH++R PY + R ++ N R+ E W D+ + Y+Y R
Sbjct: 428 TLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLRKNSVRLAEVWMDD-YAQYYYHR 478
>gi|156375693|ref|XP_001630214.1| predicted protein [Nematostella vectensis]
gi|156217230|gb|EDO38151.1| predicted protein [Nematostella vectensis]
Length = 575
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 142/333 (42%), Positives = 202/333 (60%), Gaps = 14/333 (4%)
Query: 41 GDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMR 100
GD + + N++ S+ + DR +PD+R ++CK +P DLP ++I+ FHNEG S+L+R
Sbjct: 99 GDDAYAKNAYNIKKSDQLPVDREVPDVRDQQCKSQVWPHDLPTTTIIICFHNEGRSALLR 158
Query: 101 TVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRS 160
TV S + R+P L+EIILVDDFSS ++L KV+LIRNT+REGLIR+R
Sbjct: 159 TVISALNRSPPHLLKEIILVDDFSSDPKDGRRLLKL-----PKVKLIRNTKREGLIRSRV 213
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
+GA +RGEV+ FLD+HCE NWL PLL I K + P+ID I+ T+++
Sbjct: 214 KGANLARGEVLTFLDSHCECNKNWLEPLLLRIKESPKTIVSPIIDVINLDTFDYLG---S 270
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F W + +K + LP +R+ + P KSP AGGLF++ + +F LG YD
Sbjct: 271 SADLRGGFGWNLNFKWDFLPPHILAERQGKPTLPIKSPVIAGGLFSVAKKWFETLGKYDM 330
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYK 339
+ VWGGEN E+SF+ W CGG++E +PCSR+GHV+R+ PY F + V N +
Sbjct: 331 QMDVWGGENLEISFRTWQCGGAMEIIPCSRVGHVFRNRHPYQFPGGSMNV----FQKNTR 386
Query: 340 RVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
R +E W D+ +K Y+Y P A GDI E+
Sbjct: 387 RAVEVWMDD-YKRYYYAAVPYAKNTPYGDIEER 418
>gi|149639580|ref|XP_001512277.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
1 [Ornithorhynchus anatinus]
Length = 949
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 150/368 (40%), Positives = 217/368 (58%), Gaps = 13/368 (3%)
Query: 9 KLGNLEPPLEPYK-EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL 67
K+ L+ L P + PG+ G A +P + E N+ S+ I DR I D
Sbjct: 431 KVLTLDVTLSPRDPKAPGQFGHAAVVPAEKQERAKKRWKEGNFNVYLSDLIPVDRAIEDT 490
Query: 68 RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
R + C DLP ++I+ F +E +S+L+R++HS++ R+P ++EIILVDDFS+K
Sbjct: 491 RPDGCAEQLVHNDLPTTTIIMCFVDEVWSTLLRSIHSVLNRSPPHLIQEIILVDDFSTKE 550
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L L+ Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL P
Sbjct: 551 HLKDNLDKYMAQF-PKVRVLHLKERHGLIRARLAGAEIATGDVLTFLDSHVECNVGWLEP 609
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 247
LL + RK + PVI+ I + +++V D+ RGIF W M + +P +K
Sbjct: 610 LLERVRLHRKKVACPVIEVISDKDLSYQTV---DNFQRGIFTWPMNFGWKSIPPEVIEKN 666
Query: 248 KYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
K ++ + P AGGLF++D+ +F ELG YDPGL VWGGEN E+SFK+WMCGG IE VP
Sbjct: 667 KMKETDIIRCPVMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKVWMCGGEIEIVP 726
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFL 364
CSR+GH++R+ PY+F K DRVK + N RV E W DE +K FY L
Sbjct: 727 CSRVGHIFRNDNPYSFPK--DRVK--TVERNLVRVAEVWLDE-YKDLFYGHGLHLLERRS 781
Query: 365 DMGDISEQ 372
D+G++++Q
Sbjct: 782 DIGNLTQQ 789
>gi|410962531|ref|XP_003987822.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1,
partial [Felis catus]
Length = 553
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 145/345 (42%), Positives = 200/345 (57%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY AA GE + N S+ +S DR I D R C Y DLP SVI+
Sbjct: 67 KAYLAAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVAYSADLPATSVIIT 126
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 127 FHNEARSTLLRTVKSVLNRTPAGLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 181
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
REGLIR+R RGA + V+ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 182 DRREGLIRSRVRGADVATAAVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 241
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D++
Sbjct: 242 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 298
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 299 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 352
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 353 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 396
>gi|291167742|ref|NP_001094333.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Rattus norvegicus]
Length = 558
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 144/345 (41%), Positives = 202/345 (58%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY +A GE + N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAYLSAKQLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSLSYSSDLPATSVIIT 130
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPAGLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
+REGLIR+R RGA + V+ FLD+HCEV + WL P+L + D + P+ID I
Sbjct: 186 DKREGLIRSRVRGADVAGASVLTFLDSHCEVNVEWLQPMLQRVMEDHTRVVSPIIDVISL 245
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKS 302
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400
>gi|403307061|ref|XP_003944030.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Saimiri boliviensis boliviensis]
Length = 552
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 140/335 (41%), Positives = 194/335 (57%), Gaps = 17/335 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R C Y +LP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTELPPTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP + EIILVDDFS+ D Q+L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIREIILVDDFSNDPDDCQQLIKL-----PKVKCLRNNERQGLVRSR 183
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + + +L + +R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 241 SASELRGGFDWSLHFHWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
KR E W DE +K Y+Y P A+ G++ +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNVESR 388
>gi|351702714|gb|EHB05633.1| Polypeptide N-acetylgalactosaminyltransferase 14 [Heterocephalus
glaber]
Length = 553
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 140/332 (42%), Positives = 194/332 (58%), Gaps = 17/332 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS R +PD R C Y LP S+I+ FHNE S+L+
Sbjct: 70 VGDDPYKLYAFNQRESERISSHRAVPDTRHPRCMLLVYHTALPPTSIIITFHNEARSTLL 129
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP ++EIILVDDFS+ D ++L KV+ +RN+ER+GL+R+R
Sbjct: 130 RTIRSVLNRTPMHLIQEIILVDDFSNDPDDCKQLVRL-----PKVKCLRNSERQGLVRSR 184
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 185 MRGADIAQGATLTFLDSHCEVNRDWLEPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 241
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + +R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 242 SASELRGGFDWSLHFRWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 301
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N
Sbjct: 302 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 355
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
KR E W DE +K Y+Y P A+ G+I
Sbjct: 356 TKRTAEVWMDE-YKQYYYAARPFALERPFGNI 386
>gi|291397402|ref|XP_002715124.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Oryctolagus cuniculus]
Length = 439
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 146/360 (40%), Positives = 206/360 (57%), Gaps = 17/360 (4%)
Query: 14 EPPLEPYKEGPGEGGKAYHL-PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
EP E K G H PE Y + +YG+N+ S + R +PD R + C
Sbjct: 66 EPAFEHLKSYSKPIGNFNHSNPEFY-----SGFFKYGLNILISRSVGIRRDVPDTRDKIC 120
Query: 73 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
YP LP AS+I+ FHNE ++L+RT+ S++ TP+ LEEIILVDD S DL ++
Sbjct: 121 HQKRYPHRLPTASIIICFHNEEINALLRTLSSVVNLTPSHLLEEIILVDDMSEFDDLKEE 180
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
L+ ++ F G V+LIRN REGLIR R GA + G+V+VFLD+HCEV WL PLL+ I
Sbjct: 181 LDQKLEDFRGVVKLIRNKRREGLIRARLIGAAHASGDVLVFLDSHCEVNKVWLEPLLSVI 240
Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 252
D + P+ID ID T E Y+P RG F W + +K + + E + + ++
Sbjct: 241 AKDPHTVVCPIIDVIDEMTLE----YKPSPIVRGTFNWMLQFKWDNVFSYEMEGPEGPAK 296
Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
P +SP+ AGG+FA+ R +F E+G YD + +WGGEN E+S +IWMCGG + +PCSR+GH
Sbjct: 297 PIRSPSMAGGIFAIHRHYFKEIGQYDKDMDLWGGENVEISLRIWMCGGQLFIIPCSRVGH 356
Query: 313 VYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ R N +T NY R++ TW DE +K F+ P + G+ISE+
Sbjct: 357 ITRKSPEPNLAVTK------AVTRNYLRLVHTWLDE-YKEQFFLHRPGLRSIPYGNISER 409
>gi|395504161|ref|XP_003756425.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Sarcophilus harrisii]
Length = 563
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 141/346 (40%), Positives = 204/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y DLP S+++
Sbjct: 79 KAY-LASKLLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRCTSVHYASDLPATSIVI 137
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I K++ +R
Sbjct: 138 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KIKCLR 192
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA+ + +++ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 193 NDRREGLIRSRVRGAEVATADILTFLDSHCEVNSEWLQPMLQRVKEDYTRVVSPIIDVIS 252
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D+
Sbjct: 253 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSRTDPTQPIRTPVIAGGIFVIDK 309
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PY+F
Sbjct: 310 SWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYDFP----- 364
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G I+++
Sbjct: 365 -EGNALTYIKNTKRTAEVWMDE-YKQYYYEARPSAIGKSFGSIADR 408
>gi|327281948|ref|XP_003225707.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Anolis carolinensis]
Length = 574
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 144/356 (40%), Positives = 205/356 (57%), Gaps = 21/356 (5%)
Query: 22 EGPGEGG---KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYP 78
E PG G KAY + AG+ ++ N S+ +S DR I D R C Y
Sbjct: 80 EKPGLRGFDEKAY-VSSKLLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRCASIHYG 138
Query: 79 LDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ 138
DLP S+I+ FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + Q L
Sbjct: 139 ADLPSTSIIITFHNEARSTLLRTVTSVLNRTPANLIQEIILVDDFSSDPEDCQLLTKI-- 196
Query: 139 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
KV+ +RN REGLIR+R RGA + +++ FLD+HCEV WL P+L + D
Sbjct: 197 ---PKVKCLRNNRREGLIRSRVRGADMATADILTFLDSHCEVNSEWLQPMLQRVKEDYTR 253
Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPT 258
+ P+ID I + + + RG F+W + +K ++P + R ++ ++P
Sbjct: 254 VVSPIIDVISLDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKLSRTDPTQSIRTPV 310
Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
AGG+F +D+++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R
Sbjct: 311 IAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRH 370
Query: 319 PYNFGKLADRVKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
PY+F +G +TY N KR E W DE +K Y+Y P A+ G I+++
Sbjct: 371 PYDFP------EGNALTYIKNTKRTAEVWMDE-YKQYYYEARPSAIGKSFGSIADR 419
>gi|8918932|dbj|BAA97985.1| unnamed protein product [Mus musculus]
Length = 558
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 144/345 (41%), Positives = 201/345 (58%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
EAY +A GE + N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 EAYLSAKQLKPGEDPYRQHAFNQLESDKLSSDRPIRDTRHYSCPSLSYSSDLPATSVIIT 130
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
+REGLIR+R R A + V+ FLD+HCEV + WL P+L + D + P+ID I
Sbjct: 186 DKREGLIRSRVRRADVAGATVLTFLDSHCEVNVEWLQPMLQRVMEDHTRVVSPIIDVISL 245
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDLTKPIRTPVIAGGIFVIDKS 302
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400
>gi|291389167|ref|XP_002711235.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Oryctolagus cuniculus]
Length = 622
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 151/370 (40%), Positives = 213/370 (57%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P + PG G+A+ E D ++ N S+ IS R + PD R
Sbjct: 106 PPQDP--KSPGADGRAFQKSEWTPQETQEKDEGYKKHCFNAFASDRISLQRALGPDTRPP 163
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ PA L EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLPSTSVIIVFHNEAWSTLLRTVYSVLHTAPAILLREIILVDDASTEE 222
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 YLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFTGWLEP 281
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D ++ P I ID T+EF + V H RG F+W + + +P E ++
Sbjct: 282 LLARIAEDETVVVSPDIVTIDLNTFEFSKPVQRGRVHSRGNFDWSLTFGWEAVPAHENRR 341
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
CS +GHV+R+ P+ F K + +I N R+ E W D +K FY R +A
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTN-----VIARNQVRLAEVWMD-NYKKIFYRRNLQAAKMAQ 455
Query: 363 FLDMGDISEQ 372
GDISE+
Sbjct: 456 EKSFGDISER 465
>gi|307207692|gb|EFN85329.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Harpegnathos
saltator]
Length = 598
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 146/352 (41%), Positives = 203/352 (57%), Gaps = 7/352 (1%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
G GE G+ +L + G+ L + +N+ SN IS R +PD+R C Y LP
Sbjct: 84 GLGENGEPAYLHGKEKVEGETVLAKKALNVVLSNKISLTRKLPDVRNPLCANLTYDTLLP 143
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFN 141
SVI++F+NE +S L+RTVHS++K + L+EIILVDD S + +L +L+ Y+ R
Sbjct: 144 SVSVIIIFYNEPWSVLLRTVHSVLKGSLPHLLKEIILVDDHSEEEELQGQLDYYLSTRLP 203
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
KV+L+R R+GLIR R GAK + G+V+VFLDAHCEV +WL PLL I R + +
Sbjct: 204 TKVKLLRLPYRQGLIRARLHGAKNATGDVLVFLDAHCEVIKDWLQPLLQRIKEKRNAVLM 263
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
P+ID I +T E+ E G F W + + + E K R P +SPT AG
Sbjct: 264 PIIDNISEETLEYFHDNEASFFQVGGFTWSGHFTWINIQKHELKSRLSLISPTRSPTMAG 323
Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
GLFA+DR +F E+G YD + WGGEN E+SF+IW CGG++E +PCSR+GH++R+F PY
Sbjct: 324 GLFAIDRKYFWEVGSYDDKMDGWGGENLEMSFRIWQCGGTLEIIPCSRVGHIFRNFHPYK 383
Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDM-GDISEQ 372
F D G N R+ W DE + + R + GDISE+
Sbjct: 384 FPNDKD-THG----INTARLAFVWMDEYKRLFLLHRSEFKNKSSLFGDISER 430
>gi|391348383|ref|XP_003748427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Metaseiulus occidentalis]
Length = 648
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 150/349 (42%), Positives = 202/349 (57%), Gaps = 10/349 (2%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL-DLPK 83
G+ G A L + D + N+ S+ + +R++ D R CK YP+ +LP
Sbjct: 134 GKNGHAVILGPEEQLEADKEFSKAAFNVYVSDRLPLNRSLRDTRHRHCKAVTYPMAELPT 193
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNG 142
ASV+++F +E FS+L+RT+ S I R+P L EIILVDDFS DL +L+ YI F
Sbjct: 194 ASVVIIFTDEIFSTLLRTIVSTINRSPNHLLREIILVDDFSQSEDLKDRLQRYITHHFRA 253
Query: 143 KV-RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
V RLIR ER GLIR R GA+ ++G+V++FLD+HCE WL PLL PI DR+ +
Sbjct: 254 DVVRLIRLPERSGLIRARLAGARAAKGDVLIFLDSHCETTPGWLEPLLEPIRRDRRAVVC 313
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
PVID ID +T ++ + E D G F W + + +P K R +EP +SPT AG
Sbjct: 314 PVIDIIDDKTLQYVAA-EGDRFQIGGFNWKGEFSWHNIPAAWRKNRTSIAEPMRSPTMAG 372
Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
GLFA++R +F E G YD + WGGEN E+SF+IW CGG I PCS +GH++R + PY
Sbjct: 373 GLFAINREYFWESGSYDEEMDGWGGENLEMSFRIWQCGGHIVIAPCSHVGHIFRDYHPYK 432
Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
F K D N KR +E W DE K YFY P + +GDIS
Sbjct: 433 FPKGKD-----TNAINTKRAVEVWMDE-FKKYFYQTRPELTKMKVGDIS 475
>gi|58865788|ref|NP_001012109.1| polypeptide N-acetylgalactosaminyltransferase 14 [Rattus
norvegicus]
gi|50926091|gb|AAH79128.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14)
[Rattus norvegicus]
gi|149050682|gb|EDM02855.1| rCG61782, isoform CRA_b [Rattus norvegicus]
Length = 552
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 140/336 (41%), Positives = 195/336 (58%), Gaps = 19/336 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R + C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
RT+ S++ RTP ++EIILVDDFS+ ED Q KV+ +RN+ER+GL+R+
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNSERQGLVRS 182
Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
R RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ +
Sbjct: 183 RMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---I 239
Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
E RG F+W + ++ +L + R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 240 ESASELRGGFDWSLHFQWEQLSVEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYD 299
Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
+ +WGGENFE+SF++WMCGG +E +PCSR+GHV+R PY F G TY
Sbjct: 300 VDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIK 353
Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
N KR E W DE +K Y+Y P A+ G+I +
Sbjct: 354 NTKRTAEVWMDE-YKQYYYAARPFALERPFGNIENR 388
>gi|196006600|ref|XP_002113166.1| hypothetical protein TRIADDRAFT_27135 [Trichoplax adhaerens]
gi|190583570|gb|EDV23640.1| hypothetical protein TRIADDRAFT_27135, partial [Trichoplax
adhaerens]
Length = 491
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 142/337 (42%), Positives = 202/337 (59%), Gaps = 17/337 (5%)
Query: 38 RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSS 97
R + D ++ N S+ I R +PD R CK Y L++P SV+++FHNE S+
Sbjct: 11 RGSKDEGYEKHQFNQFESDIIGAYRRVPDTRNPLCKNKIYRLNMPSVSVVIIFHNEARST 70
Query: 98 LMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIR 157
L+RTV S++ RTP L EI+LVDD S A L Q+L KV+LIRN +REGLIR
Sbjct: 71 LLRTVQSVLDRTPPHLLSEIVLVDDNSDDATLGQELLTL-----PKVKLIRNKKREGLIR 125
Query: 158 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 217
+R G K S+G+ I+FLD+HCEV W PLL I + K + PV+D ID T+E++
Sbjct: 126 SRVFGVKSSQGKAIIFLDSHCEVNQQWAEPLLEQIVLNPKAIVSPVLDNIDMNTFEYQ-- 183
Query: 218 YEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGY 277
E RG F+W + ++ + + E +R + P K+PT AGG++A+ + +F +LG Y
Sbjct: 184 -EGTEDVRGGFDWSLTFRWDYMTEAMINQRIDPTSPIKTPTIAGGIYAVSKQWFNDLGEY 242
Query: 278 DPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY- 336
D G +WGGEN ELSF+ WMCGG ++ +PCSR+GHV+R PY F + A R TY
Sbjct: 243 DMGQKIWGGENLELSFRAWMCGGFMKIIPCSRVGHVFRLQHPYIFPEGAGR------TYY 296
Query: 337 -NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
N +RV+E W DE +K YFY + +D G++ +
Sbjct: 297 RNLRRVVEVWLDE-YKVYFYQIRKIIKSIDYGNVKSR 332
>gi|26347119|dbj|BAC37208.1| unnamed protein product [Mus musculus]
Length = 550
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 138/335 (41%), Positives = 195/335 (58%), Gaps = 17/335 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R + C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPHTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP ++EIILVDDFS+ + ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSNDPEDCKQLIKL-----PKVKCLRNNERQGLVRSR 183
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 MRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---IE 240
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 241 SASELRGGFDWSLHFQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDV 300
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGG +E +PCSR+GHV+R PY F G TY N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
KR E W DE +K Y+Y P A+ G+I +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERHFGNIENR 388
>gi|332243646|ref|XP_003270989.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5
[Nomascus leucogenys]
Length = 443
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 136/325 (41%), Positives = 201/325 (61%), Gaps = 11/325 (3%)
Query: 45 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
L +YG N+ S + +R +PD R + C YP LP AS+++ F+NE F++L RTV S
Sbjct: 97 LLKYGFNVIISRSLGIEREVPDTRSKMCLQKHYPARLPTASIVICFYNEEFNALFRTVSS 156
Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
++ TP +LEEIILVDD S DL +KL+ +++ F K+++IRN +REGLIR R GA
Sbjct: 157 VMNLTPHYFLEEIILVDDMSEVDDLKEKLDYHLETFREKIKIIRNKKREGLIRARLIGAS 216
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID ID +T E Y+P
Sbjct: 217 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKVVVCPLIDVIDDRTLE----YKPSPVV 272
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RG F+W + +K + + E + ++P SP +GG+FA+ R +F E+G YD + W
Sbjct: 273 RGTFDWNLQFKWDNVFSYEMDGPEGPTKPIWSPAMSGGIFAIRRHYFNEIGQYDKDMDFW 332
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
GGEN ELS +IWMCGG + +PCSR+GH+ + GK + + +T+NY R++
Sbjct: 333 GGENLELSLRIWMCGGQLFIIPCSRVGHISKK----QTGKPSTIISA--MTHNYLRLVHV 386
Query: 345 WFDEKHKAYFYTREPLAMFLDMGDI 369
W DE +K F+ R+P ++ G+I
Sbjct: 387 WLDE-YKEQFFLRKPGLKYVTYGNI 410
>gi|155371981|ref|NP_001094597.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Bos taurus]
gi|151554939|gb|AAI47930.1| GALNTL1 protein [Bos taurus]
gi|296482974|tpg|DAA25089.1| TPA: polypeptide N-acetylgalactosaminyltransferase-like 1 [Bos
taurus]
Length = 557
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 145/345 (42%), Positives = 199/345 (57%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY AA GE + N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAYLAAKQLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVIIT 130
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
REGLIR+R RGA + V+ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 186 DRREGLIRSRVRGADVAAAAVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 245
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 302
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE K Y+Y P A+ G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-FKQYYYEARPSAIGKAFGSVATR 400
>gi|296211689|ref|XP_002752525.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Callithrix jacchus]
Length = 622
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYH---LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+ L + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKRKLTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASMAQAEVLTFLDAHCECFHGWLEP 281
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D+ ++ P I ID T+EF + + H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPIQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
CS +GHV+R+ P+ F K + +I N R+ E W D K FY R +A
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTN-----VIARNQVRLAEVWMD-SFKKIFYRRNLQAAKMAQ 455
Query: 363 FLDMGDISEQ 372
GDISE+
Sbjct: 456 EKSFGDISER 465
>gi|440897357|gb|ELR49068.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Bos grunniens mutus]
Length = 557
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 145/345 (42%), Positives = 199/345 (57%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY AA GE + N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAYLAAKQLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVIIT 130
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
REGLIR+R RGA + V+ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 186 DRREGLIRSRVRGADVAAAAVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 245
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 302
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE K Y+Y P A+ G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-FKQYYYEARPSAIGKAFGSVATR 400
>gi|224051278|ref|XP_002200509.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Taeniopygia guttata]
Length = 570
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 140/346 (40%), Positives = 202/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y DLP S+I+
Sbjct: 86 KAY-LSSKVLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRCTSVRYDTDLPATSLII 144
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTP ++EIILVDDFSS + Q L KV+ +R
Sbjct: 145 TFHNEARSTLLRTVKSVLNRTPPSLIQEIILVDDFSSDPEDCQLLTKI-----PKVKCLR 199
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
NT REGLIR+R RGA+ + +++ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 200 NTHREGLIRSRVRGAEVATADILTFLDSHCEVNSEWLQPMLQRVKEDYTRVVSPIIDVIS 259
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R ++ ++P AGG+F +D+
Sbjct: 260 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVIDK 316
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PY+F
Sbjct: 317 SWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYDFP----- 371
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++++
Sbjct: 372 -EGNALTYIKNTKRTAEVWMDE-YKQYYYEARPSAIGKSFGSVADR 415
>gi|395828928|ref|XP_003787614.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Otolemur garnettii]
Length = 678
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 138/324 (42%), Positives = 191/324 (58%), Gaps = 17/324 (5%)
Query: 48 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
Y N S +R +PD R C Y DLP S+I+ FHNE S+L+RT+ S++
Sbjct: 77 YAFNQRESERTPSNRAVPDTRHSRCTLLVYYTDLPPTSIIITFHNEARSTLLRTIRSVLN 136
Query: 108 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 167
RTP ++EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R RGA ++
Sbjct: 137 RTPMHLIQEIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSRIRGADVAQ 191
Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
G + FLD+HCEV +WL PLL I D + PVID I+ T+ + E RG
Sbjct: 192 GTTLTFLDSHCEVNRDWLQPLLHRIKEDYTRVVCPVIDIINLDTFTY---IESASELRGG 248
Query: 228 FEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
F+W + ++ +L + +R +EP ++P AGGLF +D+A+F LG YD + +WGGE
Sbjct: 249 FDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGE 308
Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIETW 345
NFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N KR E W
Sbjct: 309 NFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKNTKRTAEVW 362
Query: 346 FDEKHKAYFYTREPLAMFLDMGDI 369
DE +K Y+Y P A+ G+I
Sbjct: 363 MDE-YKQYYYAARPFALERPFGNI 385
>gi|357622639|gb|EHJ74065.1| putative N-acetylgalactosaminyltransferase [Danaus plexippus]
Length = 646
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 146/348 (41%), Positives = 205/348 (58%), Gaps = 29/348 (8%)
Query: 40 AGDASLGEYGMNMET-----SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEG 94
A D + E G NM S I R +PD R + C+ YP LPKAS+I+ F+NE
Sbjct: 132 ADDVRIREKGYNMHAFNTLISQRIGNHRGLPDTRNKLCRSQKYPDKLPKASIIICFYNEH 191
Query: 95 FSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREG 154
F +LMR+VHSI+ RT +YL+EIILVDD+S DL ++++ + NGK+ + + REG
Sbjct: 192 FETLMRSVHSILDRTDLKYLKEIILVDDYSDITDLHEEVQKAVNELNGKMLITLTSTREG 251
Query: 155 LIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI----------YSDRKIMTVPVI 204
LIR R GA S G+V+VFLD+H EV ++WLPPLL + +S R + P+I
Sbjct: 252 LIRARLYGADNSVGDVLVFLDSHIEVNVDWLPPLLTRLSEGVDGVNVRFSPRAV--TPII 309
Query: 205 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLF 264
D I+ T+E+ S RG F WG+ +K + LP+ K + +P +SPT AGGLF
Sbjct: 310 DVINADTFEYTS----SPLVRGGFNWGLHFKWDNLPKGTLKDDEDFIKPIRSPTMAGGLF 365
Query: 265 AMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGK 324
A+ R +F ++G YD G+ +WGGEN E+SF+IWMCGG +E PCSR+GHV+R PY G+
Sbjct: 366 AIYREYFNKIGKYDSGMNLWGGENLEISFRIWMCGGVLELCPCSRVGHVFRKRRPYGAGE 425
Query: 325 LADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ N R+ W DE + + P A + +GDISE+
Sbjct: 426 -------DYMLRNSMRMARVWMDE-YVNKVIEQNPSAAHVSIGDISER 465
>gi|332206188|ref|XP_003252173.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Nomascus leucogenys]
Length = 622
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 153/371 (41%), Positives = 215/371 (57%), Gaps = 24/371 (6%)
Query: 15 PPLEPYKEGPGEGGKAYH----LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRM 69
PP +P PG GKA+ P R + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETREK-EEGYKKHCFNAFASDRISLQRSLGPDTRP 162
Query: 70 EEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 163 PECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTE 221
Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL
Sbjct: 222 EHLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLE 280
Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
PLLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E +
Sbjct: 281 PLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQ 340
Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
+RK + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +
Sbjct: 341 RRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEII 400
Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLA 361
PCS +GHV+R+ P+ F K +I N R+ E W D +K FY R +A
Sbjct: 401 PCSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMA 454
Query: 362 MFLDMGDISEQ 372
GDISE+
Sbjct: 455 QEKSFGDISER 465
>gi|62148928|dbj|BAD93348.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase-4 [Rattus
norvegicus]
Length = 578
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 147/361 (40%), Positives = 202/361 (55%), Gaps = 16/361 (4%)
Query: 14 EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
+PP + + G L E + + Y +N+ S+ IS R I D RM ECK
Sbjct: 66 KPPADSHALGEWGRASKLQLDEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECK 125
Query: 74 YWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
+ LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L +
Sbjct: 126 AKKFHYRSLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRIYLKAQ 185
Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
LE YI + +VRL R +REGL+R R GA + G+V+ FLD HCE WL PLL I
Sbjct: 186 LEAYISNLD-RVRLTRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLERI 244
Query: 193 YSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 251
D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 245 SRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRTSRI 301
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS +G
Sbjct: 302 DPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVG 361
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+ PY P N R E W D+ +K +FY R P A DISE
Sbjct: 362 HVFSKRAPY---------ARPNFLQNTAREAEVWMDD-YKEHFYNRNPPARKETYDDISE 411
Query: 372 Q 372
+
Sbjct: 412 R 412
>gi|432936506|ref|XP_004082149.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Oryzias latipes]
Length = 533
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 135/346 (39%), Positives = 202/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AGD E+ N++ S+ + +R I D R C Y DLP +VI+
Sbjct: 52 KAY-LSAKQLKAGDDPYREHAFNLQESDRLGGERAIRDTRHYRCAALSYDADLPSTTVII 110
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ R+P ++E++L+DDFSS + Q L KVR +R
Sbjct: 111 TFHNEARSTLLRTVKSVLMRSPPSLIQEVLLIDDFSSDLEDCQLLAQI-----PKVRCLR 165
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N+ REGLIR+R +GA + ++ FLD+HCEV +WL P++ + D + P+ID I
Sbjct: 166 NSRREGLIRSRVKGANSASAPILTFLDSHCEVNTDWLQPMIQRVKEDHTRVVSPIIDVIS 225
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R + P ++P AGG+F MD+
Sbjct: 226 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMARSDPTLPIRTPVIAGGIFVMDK 282
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E +PCSR+GHV+R PY+F
Sbjct: 283 SWFNHLGQYDTHMDIWGGENFELSFRVWMCGGSLEILPCSRVGHVFRKRHPYDFP----- 337
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N +R E W DE +K ++Y+ P A G I+E+
Sbjct: 338 -EGNALTYIKNTRRAAEVWMDE-YKQFYYSARPSAQGKAFGSITER 381
>gi|395846631|ref|XP_003796006.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Otolemur garnettii]
Length = 943
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 149/373 (39%), Positives = 220/373 (58%), Gaps = 18/373 (4%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
V K D L +P + PG+ G+ +P + E N+ S+ I DR
Sbjct: 426 VLKIDVTLSPRDP------KAPGQFGRPVVVPLGKEKEAERRWKEGNFNVYLSDLIPVDR 479
Query: 63 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
I D R C +LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDD
Sbjct: 480 AIEDTRPVGCAEQLVHSNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDD 539
Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
S+K L L++Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E +
Sbjct: 540 CSTKDYLKDNLDEYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 598
Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-E 241
WL PLL +Y R+ + PVI+ I+ + + +V D+ RGIF W M + +P +
Sbjct: 599 GWLEPLLERVYLSRQKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWKTIPPD 655
Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
AK + ++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG
Sbjct: 656 VVAKNKIKETDIIRCPVMAGGLFSIDKNYFYELGTYDPGLDVWGGENMELSFKVWMCGGE 715
Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EP 359
IE +PCSR+GH++R+ PY+F K DR+K + N RV E W DE +K FY
Sbjct: 716 IEIIPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHL 770
Query: 360 LAMFLDMGDISEQ 372
+ L++G++++Q
Sbjct: 771 IDQGLEVGNLTQQ 783
>gi|148706466|gb|EDL38413.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14, isoform CRA_b [Mus
musculus]
Length = 551
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 140/336 (41%), Positives = 194/336 (57%), Gaps = 19/336 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R + C Y DLP S+I+ FHNE S+L+
Sbjct: 70 VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 129
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
RT+ S++ RTP ++EIILVDDFS+ ED Q KV+ +RN ER+GL+R+
Sbjct: 130 RTIRSVLNRTPMHLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNNERQGLVRS 183
Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
R RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ +
Sbjct: 184 RMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---I 240
Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
E RG F+W + ++ +L + R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 241 ESASELRGGFDWSLHFQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYD 300
Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
+ +WGGENFE+SF++WMCGG +E +PCSR+GHV+R PY F G TY
Sbjct: 301 VDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIK 354
Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
N KR E W DE +K Y+Y P A+ G+I +
Sbjct: 355 NTKRTAEVWMDE-YKQYYYAARPFALERPFGNIENR 389
>gi|327290100|ref|XP_003229762.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Anolis carolinensis]
Length = 634
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 156/371 (42%), Positives = 217/371 (58%), Gaps = 24/371 (6%)
Query: 15 PPLEPYKEGPGEGGKAY---HLPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP + PG GKA+ +L + + ++ N S+ IS R + PD R
Sbjct: 115 PPQD--SNAPGASGKAFKTINLSPDEQKEKERGDEKHCFNAFASDRISLHRDLGPDTRPP 172
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P LP SVI+VFHNE +S+L+RTVHS++ +PA L+EIILVDD S
Sbjct: 173 ECIEQKFKRCP-PLPTTSVIIVFHNEAWSTLLRTVHSVMYTSPAILLKEIILVDDASVDD 231
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L KL+DY+++F+ V+++R ER+GLI R GA + GE + FLDAHCE WL P
Sbjct: 232 YLQDKLDDYVKQFH-IVKVVRQKERKGLITARLLGASIATGETLTFLDAHCECFYGWLEP 290
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYKENELPEREAK 245
LLA I + + P I ID T+EF S Y H+ RG F+W + + LPE E+K
Sbjct: 291 LLARIAENNTYVVSPDISSIDLNTFEFSKPSPYGQSHN-RGNFDWSLSFGWESLPEHESK 349
Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
KRK + P K+PT AGGLF++ + +F +G YD + +WGGEN E+SF++W CGG +E +
Sbjct: 350 KRKDETYPIKTPTFAGGLFSISKDYFYNIGSYDEEMEIWGGENIEMSFRVWQCGGQLEII 409
Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL- 364
PCS +GHV+RS P++F K +IT N R+ E W DE +K FY R A +
Sbjct: 410 PCSVVGHVFRSKSPHSFPKGTQ-----VITRNQVRLAEVWMDE-YKNIFYRRNTEAAKIV 463
Query: 365 ---DMGDISEQ 372
GDIS++
Sbjct: 464 KQQTFGDISKR 474
>gi|426372562|ref|XP_004053192.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Gorilla
gorilla gorilla]
Length = 622
Score = 264 bits (675), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFQRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
CS +GHV+R+ P+ F K +I N R+ E W D +K FY R +A
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMAQ 455
Query: 363 FLDMGDISEQ 372
GDISE+
Sbjct: 456 EKSFGDISER 465
>gi|108935842|sp|Q8BVG5.2|GLT14_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 14;
AltName: Full=Polypeptide GalNAc transferase 14;
Short=GalNAc-T14; Short=pp-GaNTase 14; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 14;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 14
Length = 550
Score = 264 bits (675), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 140/336 (41%), Positives = 194/336 (57%), Gaps = 19/336 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R + C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
RT+ S++ RTP ++EIILVDDFS+ ED Q KV+ +RN ER+GL+R+
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNNERQGLVRS 182
Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
R RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ +
Sbjct: 183 RMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---I 239
Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
E RG F+W + ++ +L + R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 240 ESASELRGGFDWSLHFQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYD 299
Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
+ +WGGENFE+SF++WMCGG +E +PCSR+GHV+R PY F G TY
Sbjct: 300 VDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIK 353
Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
N KR E W DE +K Y+Y P A+ G+I +
Sbjct: 354 NTKRTAEVWMDE-YKQYYYAARPFALERPFGNIENR 388
>gi|254910954|ref|NP_082140.2| polypeptide N-acetylgalactosaminyltransferase 14 [Mus musculus]
gi|115527999|gb|AAI17801.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 [Mus musculus]
Length = 550
Score = 264 bits (675), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 140/336 (41%), Positives = 194/336 (57%), Gaps = 19/336 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R + C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
RT+ S++ RTP ++EIILVDDFS+ ED Q KV+ +RN ER+GL+R+
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNNERQGLVRS 182
Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
R RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ +
Sbjct: 183 RMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---I 239
Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
E RG F+W + ++ +L + R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 240 ESASELRGGFDWSLHFQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYD 299
Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
+ +WGGENFE+SF++WMCGG +E +PCSR+GHV+R PY F G TY
Sbjct: 300 VDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIK 353
Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
N KR E W DE +K Y+Y P A+ G+I +
Sbjct: 354 NTKRTAEVWMDE-YKQYYYAARPFALERPFGNIENR 388
>gi|198426119|ref|XP_002128247.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
6 [Ciona intestinalis]
Length = 627
Score = 264 bits (675), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 150/364 (41%), Positives = 213/364 (58%), Gaps = 20/364 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH-----ISFDRTIPDLRM 69
P ++P PGE GKAY + + +A L + G + NH IS R++ D R
Sbjct: 115 PKVDP--SAPGEYGKAYKVTD--NSAEVKKLVKEGWDKHAFNHYVCQKISLHRSVGDKRD 170
Query: 70 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
+ECK + LP SVI++FHNE + +L+RTVHS+++ +P L+EIILVDD S+ ++L
Sbjct: 171 QECKVRKWRKPLPDTSVIIIFHNEAWCALLRTVHSVLENSPKILLKEIILVDDASTLSNL 230
Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
++L DY+ + V++IR R GLIR R GA+E++G V+ FLD+HCE +WL P+L
Sbjct: 231 GKELTDYVAKLQ-IVKIIRLPSRAGLIRARLAGAQEAQGSVLTFLDSHCECAPHWLEPML 289
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER-EAKKRK 248
I D + PVI+ ID T+ S+ GI W + + N P + +
Sbjct: 290 ERIAEDNTRVVCPVIEVIDADTFAM-SLTTARSVQTGILSWSLGF--NWAPRKINPGQPI 346
Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
N E S T AGGLFAM R +F LG YD +LVWGGEN E+S +IWMCGGS+E PCS
Sbjct: 347 KNDEALTSATMAGGLFAMSRKYFYHLGSYDNDMLVWGGENIEMSLRIWMCGGSLEIHPCS 406
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
+GHV+R PY+ +D +IT+N KRV E W DE +K +Y R P A ++ GD
Sbjct: 407 HVGHVFRKRAPYSHPGGSD-----VITHNNKRVAEVWLDE-YKEQYYKRVPRARAVEAGD 460
Query: 369 ISEQ 372
++ +
Sbjct: 461 LTAR 464
>gi|326427851|gb|EGD73421.1| GALNT4 protein [Salpingoeca sp. ATCC 50818]
Length = 537
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 193/311 (62%), Gaps = 20/311 (6%)
Query: 51 NMETSNHISFDRTIPDLRMEEC---KYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSII 106
N S+ +S R D R EC KY YPL +LP SVIL+F+NE S+L+RTV S++
Sbjct: 172 NQWISDRLSLHRRAYDTRPVECLHKKY--YPLSELPTVSVILIFYNEARSTLLRTVWSVL 229
Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
R+P ++EI+LVDD SS L L+ + K R+IR ER GLIR + GA+++
Sbjct: 230 DRSPRSLIKEILLVDDHSSMPHLGYPLDQEVAGIP-KTRVIRLPERSGLIRAKVYGAQQA 288
Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRS-VYEPDHHYR 225
RG+V+V+LD+HCEV WL PLL I +RK + +P+ID IDY+TWE R+ + E R
Sbjct: 289 RGDVLVYLDSHCEVNDGWLEPLLDRIRRNRKTVAMPIIDAIDYETWEHRTGLLE-----R 343
Query: 226 GIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWG 285
GIF+W +++K +L + + R +++P+ SP AGGLFAMDR +F E+G YD G+ WG
Sbjct: 344 GIFDWSLVFKWKQLTADDKRGRPDDTDPFASPAMAGGLFAMDRKYFFEVGAYDMGMETWG 403
Query: 286 GENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGP--LITYNYKRVIE 343
GEN E+S ++W CGG IE +PCS + HV+R PY F + K P I N RV E
Sbjct: 404 GENIEMSMRVWACGGRIEALPCSHVAHVFRKKTPYEF-----KTKDPQETIARNLNRVAE 458
Query: 344 TWFDEKHKAYF 354
W DE Y+
Sbjct: 459 VWMDEYKDVYY 469
>gi|410210024|gb|JAA02231.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
troglodytes]
gi|410247040|gb|JAA11487.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
troglodytes]
gi|410351197|gb|JAA42202.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
troglodytes]
Length = 622
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
CS +GHV+R+ P+ F K +I N R+ E W D +K FY R +A
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMAQ 455
Query: 363 FLDMGDISEQ 372
GDISE+
Sbjct: 456 EKSFGDISER 465
>gi|115298684|ref|NP_009141.2| polypeptide N-acetylgalactosaminyltransferase 6 [Homo sapiens]
gi|51316028|sp|Q8NCL4.2|GALT6_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
AltName: Full=Polypeptide GalNAc transferase 6;
Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 6;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6
gi|37572269|gb|AAH35822.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
sapiens]
gi|119578594|gb|EAW58190.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
sapiens]
gi|123980642|gb|ABM82150.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6)
[synthetic construct]
gi|123995463|gb|ABM85333.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6)
[synthetic construct]
Length = 622
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
CS +GHV+R+ P+ F K +I N R+ E W D +K FY R +A
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMAQ 455
Query: 363 FLDMGDISEQ 372
GDISE+
Sbjct: 456 EKSFGDISER 465
>gi|26324460|dbj|BAC25984.1| unnamed protein product [Mus musculus]
Length = 622
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 152/370 (41%), Positives = 213/370 (57%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+ E + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NSPGADGKAFQKKEWTNLETKEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ +PA L EIIL+DD S+
Sbjct: 164 ECVDQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLNEIILMDDASTDE 222
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L ++LE Y+Q+ VR++R ER GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKERLEQYVQQLQ-IVRVVRQRERGGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D+ + P I ID T++F R V H RG F+W + + LPE E ++
Sbjct: 282 LLARIAEDKTAVVSPDIVTIDLNTFQFSRPVQRGKAHSRGNFDWSLTFGWEMLPEHEKQR 341
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +A+F +G YD + +WGGEN E+SF++W CGG + +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLGIIP 401
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL-- 364
CS +GHV+R+ P+ F K +I N R+ E W D+ +K FY R A +
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDD-YKKIFYRRNLQAAKMVQ 455
Query: 365 --DMGDISEQ 372
+ GDISE+
Sbjct: 456 ENNFGDISER 465
>gi|397479051|ref|XP_003810846.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Pan paniscus]
gi|397479053|ref|XP_003810847.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Pan paniscus]
Length = 622
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
CS +GHV+R+ P+ F K +I N R+ E W D +K FY R +A
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMAQ 455
Query: 363 FLDMGDISEQ 372
GDISE+
Sbjct: 456 EKSFGDISER 465
>gi|89365963|gb|AAI14506.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
sapiens]
Length = 622
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
CS +GHV+R+ P+ F K +I N R+ E W D +K FY R +A
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMAQ 455
Query: 363 FLDMGDISEQ 372
GDISE+
Sbjct: 456 EKSFGDISER 465
>gi|281485547|ref|NP_660335.2| putative polypeptide N-acetylgalactosaminyltransferase-like protein
5 [Homo sapiens]
gi|322510123|sp|Q7Z4T8.3|GLTL5_HUMAN RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5;
AltName: Full=Polypeptide GalNAc transferase 15;
Short=GalNAc-T15; Short=pp-GaNTase 15; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 15;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 15
Length = 443
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 135/328 (41%), Positives = 203/328 (61%), Gaps = 11/328 (3%)
Query: 45 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
L +YG N+ S + +R +PD R + C YP LP AS+++ F+NE ++L +T+ S
Sbjct: 97 LLKYGFNVIISRSLGIEREVPDTRSKMCLQKHYPARLPTASIVICFYNEECNALFQTMSS 156
Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
+ TP +LEEIILVDD S DL +KL+ +++ F GKV++IRN +REGLIR R GA
Sbjct: 157 VTNLTPHYFLEEIILVDDMSKVDDLKEKLDYHLETFRGKVKIIRNKKREGLIRARLIGAS 216
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID ID +T E Y+P
Sbjct: 217 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPLV 272
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RG F+W + +K + + E + +++P +SP +GG+FA+ R +F E+G YD + W
Sbjct: 273 RGTFDWNLQFKWDNVFSYEMDGPEGSTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFW 332
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
G EN ELS +IWMCGG + +PCSR+GH+ + GK + + +T+NY R++
Sbjct: 333 GRENLELSLRIWMCGGQLFIIPCSRVGHISKK----QTGKPSTIISA--MTHNYLRLVHV 386
Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
W DE +K F+ R+P ++ G+I E+
Sbjct: 387 WLDE-YKEQFFLRKPGLKYVTYGNIRER 413
>gi|194220840|ref|XP_001500424.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Equus
caballus]
Length = 539
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 137/335 (40%), Positives = 194/335 (57%), Gaps = 17/335 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 56 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTTLVYCTDLPPTSIIITFHNEARSTLL 115
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP ++EIILVDDFS+ D +L KV+ +RN R+GL+R+R
Sbjct: 116 RTIRSVLNRTPMNLIKEIILVDDFSNDPDDCNQLIKL-----PKVKCLRNENRQGLVRSR 170
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA + G ++ F+D+HCEV +WL PLL + D + PVID I+ + + E
Sbjct: 171 IRGADFAEGAILTFMDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDNFNY---IE 227
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + +R +EP ++P AGGLF M++++F LG YD
Sbjct: 228 SATELRGGFDWSLHFQWEQLSPEQKAQRLDPAEPIRTPVIAGGLFVMNKSWFDYLGKYDM 287
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R PY F G TY N
Sbjct: 288 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 341
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
KR +E W DE +K Y+Y P A+ G+I +
Sbjct: 342 TKRTVEVWMDE-YKQYYYAARPFALERPFGNIDSR 375
>gi|297691860|ref|XP_002823292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Pongo abelii]
gi|395744294|ref|XP_002823293.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
3 [Pongo abelii]
Length = 622
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQMEIIP 401
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
CS +GHV+R+ P+ F K +I N R+ E W D +K FY R +A
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMAQ 455
Query: 363 FLDMGDISEQ 372
GDISE+
Sbjct: 456 EKSFGDISER 465
>gi|241746527|ref|XP_002414286.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215508140|gb|EEC17594.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 493
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 136/329 (41%), Positives = 202/329 (61%), Gaps = 15/329 (4%)
Query: 48 YGMNMETSNHISFDRTIPDLRMEECK---YWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
+ N E S+ ++ +R IPD R +C +LP SV++ FHNE S+L+RT+ S
Sbjct: 84 HKFNQEASDALASNRAIPDTRHPQCAKEGLLKPQEELPATSVVITFHNEARSALLRTIVS 143
Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
++ R+PA+ +EEIILVDDFS ++L IQ K+RL+RNT+REGL+R+R RGA+
Sbjct: 144 VLNRSPAELIEEIILVDDFSDDPSDGEELAK-IQ----KIRLLRNTQREGLVRSRVRGAR 198
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
++ V+ FLD+HCE WLPPLL + D + + PVID I+ +++++ +
Sbjct: 199 AAKAPVLTFLDSHCECNQGWLPPLLRRVKEDPRRVVCPVIDVINLESFKY---FGASSDL 255
Query: 225 RGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLV 283
RG F W +++K L +E ++R N + P ++P AGGLF +DRA F LG YD + +
Sbjct: 256 RGGFNWNLVFKWEFLSNKEREERANNPTLPIRTPMIAGGLFVVDRAQFERLGAYDTAMDI 315
Query: 284 WGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIE 343
WGGEN ELSF+ W CGGS+E +PCSR+GHV+R PY+F + V N +R E
Sbjct: 316 WGGENLELSFRAWQCGGSLEILPCSRVGHVFRKQHPYSFPGGSGNVFAR--QANTRRAAE 373
Query: 344 TWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
W D+ +K Y+Y P+A + MG + E+
Sbjct: 374 VWMDD-YKKYYYATVPVARNVPMGSVEER 401
>gi|291230380|ref|XP_002735141.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 510
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 140/349 (40%), Positives = 203/349 (58%), Gaps = 23/349 (6%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
GE GK + ++ + + N+ S+ I+ +R++PD+R C+ +YP L
Sbjct: 6 GEMGKPVFIADSQKEKMNQLFPLNQFNVMASDMIALNRSLPDIRPRGCQNREYPGVLQTT 65
Query: 85 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 144
SV++VFHNE +++L+RTVHS+I R+P L EIILVDD+S++ V
Sbjct: 66 SVVIVFHNEAWTTLLRTVHSVINRSPRHLLTEIILVDDYSNRV---------------PV 110
Query: 145 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 204
+ +REGL R R GA + GEV+ FLD+HCE WL PLLA I D+ + PVI
Sbjct: 111 MVHHCQQREGLTRARLIGAAMATGEVVTFLDSHCECTRGWLEPLLARIAEDKTNVVCPVI 170
Query: 205 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGL 263
+ I T+EF + D G F+W +++ + +P RE ++ K++ + P +SPT AGGL
Sbjct: 171 NIISDTTFEF--INGSDATQVGGFDWRLIFNWHVVPHRELQRIKFDRTSPVRSPTMAGGL 228
Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
F++ + FF LG YDPG VWG EN ELSFK WMCGG++E+VPCS +GHV+R P+ F
Sbjct: 229 FSIHKEFFTRLGTYDPGFDVWGAENLELSFKTWMCGGTLEFVPCSHVGHVFRKRSPHRFP 288
Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
V + N +R+ E W DE +K +Y P + D GDISE+
Sbjct: 289 PTTHNV----MQRNNRRLAEVWLDE-YKYLYYNAHPEILKTDPGDISER 332
>gi|417412000|gb|JAA52417.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 624
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 152/370 (41%), Positives = 214/370 (57%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+ + + ++ N S+ IS R + PD R
Sbjct: 108 PPQDP--NSPGADGKAFQKDKWTPLETQEKEEGYKKHCFNAFASDQISLQRALGPDTRPP 165
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 166 ECVNQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 224
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L + LE Y+Q+ VR++R R+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 225 YLKEPLEQYVQQLR-IVRVVRQERRKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 283
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D + P I ID T+EF + V + H RG F+W + + LP E ++
Sbjct: 284 LLARITEDETAVVSPDIVTIDLNTFEFSKPVQKGRVHSRGNFDWSLTFGWETLPAHERQR 343
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK ++P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 344 RKDETDPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 403
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL-- 364
CS +GHV+R+ P+ F K + +I N R+ E W DE +K FY R A +
Sbjct: 404 CSVVGHVFRTKSPHTFPKGTN-----VIARNQVRLAEVWMDE-YKEIFYRRNIQAAKMAR 457
Query: 365 --DMGDISEQ 372
GDISE+
Sbjct: 458 EKSFGDISER 467
>gi|417403183|gb|JAA48410.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 599
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 152/370 (41%), Positives = 214/370 (57%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+ + + ++ N S+ IS R + PD R
Sbjct: 98 PPQDP--NSPGADGKAFQKDKWTPLETQEKEEGYKKHCFNAFASDQISLQRALGPDTRPP 155
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 156 ECVNQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 214
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L + LE Y+Q+ VR++R R+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 215 YLKEPLEQYVQQLR-IVRVVRQERRKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 273
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D + P I ID T+EF + V + H RG F+W + + LP E ++
Sbjct: 274 LLARITEDETAVVSPDIVTIDLNTFEFSKPVQKGRVHSRGNFDWSLTFGWETLPAHERQR 333
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK ++P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 334 RKDETDPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 393
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL-- 364
CS +GHV+R+ P+ F K + +I N R+ E W DE +K FY R A +
Sbjct: 394 CSVVGHVFRTKSPHTFPKGTN-----VIARNQVRLAEVWMDE-YKEIFYRRNIQAAKMAR 447
Query: 365 --DMGDISEQ 372
GDISE+
Sbjct: 448 EKSFGDISER 457
>gi|328713087|ref|XP_001951943.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Acyrthosiphon pisum]
Length = 674
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 154/360 (42%), Positives = 205/360 (56%), Gaps = 28/360 (7%)
Query: 25 GEGGKAYHLPEAYRA----AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
GE GK LP A D N S+ IS RT+PD R E CK LD
Sbjct: 131 GEMGKPVVLPANLTADVKKLVDEGWKNNAFNQYASDLISLHRTLPDPRDEWCKKPGRYLD 190
Query: 81 -LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
LP+ SVI+ FHNE +S L+RTVHSI+ R+P + EIILVDDFS L +LE+Y +
Sbjct: 191 NLPQTSVIVCFHNEAWSVLLRTVHSILDRSPEHLIREIILVDDFSDMPHLKTQLEEYSEN 250
Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
+ K++++R +REGLIR R GA+ + V+ +LD+HCE WL PLL I + +
Sbjct: 251 Y-PKIKIVRAKKREGLIRARLMGARYASAPVLTYLDSHCECTEGWLEPLLDRIAREASTV 309
Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPEREAKKRKYNSE 252
PVID ID T EF HYR G F+W + + + +P++E K+ K +E
Sbjct: 310 VCPVIDVIDDSTLEF--------HYRDAGGVNVGGFDWNLQFNWHVVPDKEKKRHKNAAE 361
Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
P SPT AGGLFA+D+ FF LG YD G +WGGEN ELSFK WMCGG++E VPCS +GH
Sbjct: 362 PVWSPTMAGGLFAIDKKFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGH 421
Query: 313 VYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
++R PY + R ++ N R+ E W D+ K Y+Y R + D GDI+ +
Sbjct: 422 IFRKRSPYKW-----RTGVNVLKKNSIRLAEVWMDDYAK-YYYERIGNDLG-DYGDITSR 474
>gi|328785249|ref|XP_393950.3| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Apis mellifera]
Length = 635
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 158/360 (43%), Positives = 207/360 (57%), Gaps = 29/360 (8%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
G L P EP PGE G+ LP E + D L N S+ IS RT+P
Sbjct: 83 GVLVAPREPDASAPGEMGRPVILPTNLTAETKKLVDDGWLNN-AFNQYVSDLISVHRTLP 141
Query: 66 DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
D R CK Y DLP +VI+ FHNE +S L+RTVHS++ R+P ++EIILVDDFS
Sbjct: 142 DPRDPWCKEPGRYLTDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPDHLIQEIILVDDFS 201
Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
L ++LEDY+ + KV++IR +REGLIR R GA ++ V+ +LD+HCE W
Sbjct: 202 DMPHLQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGW 260
Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 237
L PLL I + + PVID ID T E+ H+R G F+W + + +
Sbjct: 261 LEPLLDRIARNPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 312
Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
+PERE K+ K +EP SPT AGGLF++DRAFF LG YD G +WGGEN ELSFK WM
Sbjct: 313 AVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFERLGTYDSGFDIWGGENLELSFKTWM 372
Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
CGG++E VPCS +GH++R PY + R ++ N R+ E W DE K Y+Y R
Sbjct: 373 CGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKRNSIRLSEVWLDEYAK-YYYQR 426
>gi|328792011|ref|XP_624873.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Apis mellifera]
Length = 637
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 145/353 (41%), Positives = 208/353 (58%), Gaps = 16/353 (4%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
++G E G +L + + D Y N+ S++I R +PD R + C+ Y
Sbjct: 109 EQGLDELGMIKNLDDQRKR--DEGYKNYSFNILVSDNIGLHRELPDTRHKLCELQKYSSK 166
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QR 139
L AS+++ F+NE + +L+R++HSII RTP L EIILV+D+S L +K++ YI
Sbjct: 167 LSNASIVICFYNEHYMTLLRSLHSIIDRTPTNLLHEIILVNDWSDSKILHEKIKIYIANN 226
Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
FNGKV+ + +REGLIR R GA+++ GE+++FLD+H EV W+ PLL+ I + I
Sbjct: 227 FNGKVKYFKTEKREGLIRARIFGARKATGEILIFLDSHIEVNRQWIEPLLSRIVYSKTIT 286
Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTH 259
+PVID I+ T++ Y RG F WG+ +K + +P + +P KSPT
Sbjct: 287 AMPVIDIINPDTFQ----YTGSPLVRGGFNWGLHFKWDNVPIGTFVHDEDFVKPIKSPTM 342
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IWMCGGSIE +PCSR+GHV+R P
Sbjct: 343 AGGLFAMNREYFTKLGEYDAGMDIWGGENLEISFRIWMCGGSIELIPCSRVGHVFRKRRP 402
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
Y D + N RV W DE +K YF +D GDI+E+
Sbjct: 403 YGAYDQHDT-----MLKNSLRVAHVWLDE-YKDYFLQN---IKKIDYGDITER 446
>gi|427789065|gb|JAA59984.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 626
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 144/352 (40%), Positives = 205/352 (58%), Gaps = 11/352 (3%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL-DLPK 83
G+GG L A + + G N + + +RT+ D R C+ +Y + +LP
Sbjct: 102 GKGGAGVTLTGAEKEKANKEFSRAGFNAYVCDRLPLNRTLGDRRHRSCRNAEYDVENLPT 161
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL-DQKLEDYIQRF-- 140
ASV+++F +E FS+L+RTV+S+I RTP + L EIILVDD+S ++ + +LE +I+R
Sbjct: 162 ASVVIIFTDELFSALLRTVYSVINRTPHRLLREIILVDDYSQIDEMANGRLERFIRRHFR 221
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
G V+LI +REGLIR R GA+ + G+V+VFLD+HCE +WL P++ I DR +
Sbjct: 222 PGFVKLITLPKREGLIRARLTGARAASGDVLVFLDSHCEATDHWLEPMVELIKKDRTTVV 281
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P+ID ID +T ++ D + G F W + PE K RK ++P +SPT A
Sbjct: 282 CPIIDVIDDKTLQYMGT-SSDFYQIGGFNWKGEFIWINTPEAWRKARKSKADPMRSPTMA 340
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+DR +F E G YD + WGGEN E+SF+IWMCGGS+ PCS +GH++R + PY
Sbjct: 341 GGLFAIDRKYFWESGSYDSEMEGWGGENLEMSFRIWMCGGSLVIAPCSHVGHIFRDYHPY 400
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
F D N R+ E W D +K YFY P + GDISE+
Sbjct: 401 KFPSNKD-----THGINTARLAEVWMD-NYKYYFYQNRPELRKISFGDISER 446
>gi|260841393|ref|XP_002613900.1| hypothetical protein BRAFLDRAFT_208719 [Branchiostoma floridae]
gi|229299290|gb|EEN69909.1| hypothetical protein BRAFLDRAFT_208719 [Branchiostoma floridae]
Length = 442
Score = 263 bits (673), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 141/362 (38%), Positives = 216/362 (59%), Gaps = 14/362 (3%)
Query: 15 PPLEPYKEGPGEGGKAYHL----PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
PP++P PGEGG +L PE R + L N S+ IS R++PDLR
Sbjct: 16 PPVDP--TAPGEGGHGVNLQPSTPEEKRLYKEG-LKNNSFNAWASSKISLHRSLPDLRHR 72
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
CK + LP+ SVI++F+NE +S+L+RTVHS+++ +PA+ L E+ILVDD S+ L
Sbjct: 73 LCKQKQFFRPLPQTSVIIIFYNEAWSTLLRTVHSVLEASPAELLREVILVDDCSTFDHLK 132
Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
LE Y+ +VRL+R+ +R+GLIR R GA +RGEV+ FLD+HCE WL P L
Sbjct: 133 APLETYLSTLP-QVRLVRSPKRQGLIRARLLGALHARGEVLTFLDSHCECMHGWLEPQLE 191
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
I + + + V+D I + T+++ + GI + + +PE E +++K
Sbjct: 192 TIARNYTTVPISVLDNILHDTFQYTFMDLQSTQMGGINFKELTFIWEPIPEHERRRQKSP 251
Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
+P +SPT AGG+F++++ +F LG YD G+ VWGGEN E+SF+IW CGG+I +PCS +
Sbjct: 252 VDPIRSPTMAGGIFSINKKYFEYLGAYDTGMEVWGGENIEMSFRIWQCGGTIVVLPCSHV 311
Query: 311 GHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
GHV+R PY+ G + + +N +R+ E W D+ +K +Y + P DMGD++
Sbjct: 312 GHVFRPTSPYSTGDAWKK-----LVHNNRRMAEVWMDD-YKEIYYRKHPEYRKYDMGDVT 365
Query: 371 EQ 372
++
Sbjct: 366 QR 367
>gi|395519600|ref|XP_003763931.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Sarcophilus harrisii]
Length = 945
Score = 263 bits (673), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 147/365 (40%), Positives = 217/365 (59%), Gaps = 13/365 (3%)
Query: 12 NLEPPLEPYK-EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
NL+ L P + PG+ G +P E N+ S+ I DR I D R
Sbjct: 430 NLDVTLSPRNPKAPGQFGNPVVVPFGKEKEVKRRWKEGNFNVYLSDLIPLDRAIDDTRPS 489
Query: 71 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
C +LP S+I+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L
Sbjct: 490 GCADQLVHNNLPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKGYLK 549
Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
+L+ Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PLL
Sbjct: 550 DQLDKYMSQF-PKVRVLHLKERHGLIRARLAGAEIATGDVLTFLDSHVECNVGWLEPLLE 608
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
+Y ++K + PVI+ I+ + + +V D+ RGIF W M + ++P K+ K
Sbjct: 609 RVYLNKKKVACPVIEIINDKDLSYMTV---DNFQRGIFVWPMNFSWKKIPPEIIKQNKIK 665
Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR
Sbjct: 666 ETDVIRCPVMAGGLFSIDKKYFFELGTYDPGLEVWGGENMELSFKVWMCGGEIEIIPCSR 725
Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMG 367
+GH++R PY+F + +R+K I N RV E W DE +K FY L L++G
Sbjct: 726 VGHIFRKDNPYSFPE--NRIK--TIERNLIRVAEVWLDE-YKELFYGHGYHLLDQSLNVG 780
Query: 368 DISEQ 372
++++Q
Sbjct: 781 NLTQQ 785
>gi|431904511|gb|ELK09894.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Pteropus alecto]
Length = 557
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 141/345 (40%), Positives = 198/345 (57%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY AA GE + N S+ +S DR I D R C Y +DLP S ++
Sbjct: 71 KAYLAAKQLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYSCPSVSYSVDLPATSFVIT 130
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTP ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPPNLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
REGLIR+R RGA + ++ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 186 DRREGLIRSRVRGADVASAAILTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 245
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R + P ++P AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKISRTDPTRPIRTPVIAGGIFVIDKS 302
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400
>gi|291386971|ref|XP_002709979.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Oryctolagus cuniculus]
Length = 551
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 139/332 (41%), Positives = 191/332 (57%), Gaps = 17/332 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD + N S I +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 68 VGDDPYKLHAFNQRESERIPSNRVVPDTRHNRCALLVYCKDLPPTSIIITFHNEARSTLL 127
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RTV SI+ RTP ++EIILVDDFSS D +L KV+ +RN ER+GL+R+R
Sbjct: 128 RTVRSILNRTPMHLIQEIILVDDFSSDPDDCNQLIKL-----PKVKCLRNNERQGLVRSR 182
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 183 IRGADIAQGATLTFLDSHCEVNKDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---IE 239
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + + +L + +R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 240 SASELRGGFDWSLHFHWEQLSPEQKARRLDPTEPIRTPVIAGGLFVIDKAWFDYLGKYDT 299
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMC GS+E +PCSR+GHV+R PY F G TY N
Sbjct: 300 DMDIWGGENFEISFRVWMCRGSLEIIPCSRVGHVFRKKHPYAFP------NGNTNTYIKN 353
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
KR E W D+ +K Y+Y P A+ G+I
Sbjct: 354 TKRTAEVWMDD-YKQYYYAARPFALERPFGNI 384
>gi|426233584|ref|XP_004010796.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 [Ovis
aries]
Length = 557
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 144/345 (41%), Positives = 198/345 (57%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY AA GE + N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAYLAAKQLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSMSYSSDLPATSVIIT 130
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
REGLIR+R RGA + FLD+HCEV WL P+L + D + P+ID I
Sbjct: 186 DRREGLIRSRVRGADVAAAAFFTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 245
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R ++P ++P AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 302
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400
>gi|363730612|ref|XP_419065.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Gallus
gallus]
Length = 590
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 155/365 (42%), Positives = 217/365 (59%), Gaps = 24/365 (6%)
Query: 14 EPPLEPYKEGPGEGGKAYHL--PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
+P LEP GE G+A L A + + S+ + +N+ S+ IS R +P+
Sbjct: 74 KPDLEP--GALGELGRAVRLELSPAEKRLQEESIRRHQINIYLSDRISLHRRLPERWHPL 131
Query: 72 CK--YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
CK +DY LPK SV++ F+NE +S+L+RTVHS+++ +P LEE+ILVDD+S K L
Sbjct: 132 CKGKKYDY-YSLPKTSVVIAFYNEAWSTLLRTVHSVLETSPDILLEEVILVDDYSDKDHL 190
Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
+ LE+Y+ KVRLIR +REGL+R R GA +RG+++ FLD HCE WL PLL
Sbjct: 191 KEPLENYVAGLR-KVRLIRANKREGLVRARLLGASIARGDILTFLDCHCECHEGWLEPLL 249
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
I + + PVID ID+ T+E+ + EP G F+W +++ + PERE K+RK
Sbjct: 250 ERIAEEESAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDWRLVFTWHTTPEREQKRRK 306
Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
+ +SPT AGGLF++ + +F LG YD G+ VWGGEN E SF+IW CGGS+E PCS
Sbjct: 307 SKIDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCS 366
Query: 309 RIGHVYRSFMPYNFGK-LADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG 367
+GHV+ PY+ K LA+ V R E W DE +K +Y R P A G
Sbjct: 367 HVGHVFPKQAPYSRSKALANSV----------RAAEVWMDE-YKELYYHRNPHARLEPYG 415
Query: 368 DISEQ 372
D+SE+
Sbjct: 416 DVSER 420
>gi|427789289|gb|JAA60096.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 526
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 144/352 (40%), Positives = 205/352 (58%), Gaps = 11/352 (3%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL-DLPK 83
G+GG L A + + G N + + +RT+ D R C+ +Y + +LP
Sbjct: 102 GKGGAGVTLTGAEKEKANKEFSRAGFNAYVCDRLPLNRTLGDRRHRSCRNAEYDVENLPT 161
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL-DQKLEDYIQRF-- 140
ASV+++F +E FS+L+RTV+S+I RTP + L EIILVDD+S ++ + +LE +I+R
Sbjct: 162 ASVVIIFTDELFSALLRTVYSVINRTPHRLLREIILVDDYSQIDEMANGRLERFIRRHFR 221
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
G V+LI +REGLIR R GA+ + G+V+VFLD+HCE +WL P++ I DR +
Sbjct: 222 PGFVKLITLPKREGLIRARLTGARAASGDVLVFLDSHCEATDHWLEPMVELIKKDRTTVV 281
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P+ID ID +T ++ D + G F W + PE K RK ++P +SPT A
Sbjct: 282 CPIIDVIDDKTLQYMGT-SSDFYQIGGFNWKGEFIWINTPEAWRKARKSKADPMRSPTMA 340
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+DR +F E G YD + WGGEN E+SF+IWMCGGS+ PCS +GH++R + PY
Sbjct: 341 GGLFAIDRKYFWESGSYDSEMEGWGGENLEMSFRIWMCGGSLVIAPCSHVGHIFRDYHPY 400
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
F D N R+ E W D +K YFY P + GDISE+
Sbjct: 401 KFPSNKD-----THGINTARLAEVWMD-NYKYYFYQNRPELRKISFGDISER 446
>gi|195402751|ref|XP_002059968.1| GJ14949 [Drosophila virilis]
gi|194140834|gb|EDW57305.1| GJ14949 [Drosophila virilis]
Length = 666
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 139/345 (40%), Positives = 199/345 (57%), Gaps = 16/345 (4%)
Query: 7 DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPD 66
D + EP K+G G G+ +P R N+ S+ I +RT+ D
Sbjct: 80 DYNINQFEP-----KQGEGADGRPVVIPPRDRFRMQRFFKLNSFNILASDRIPLNRTLKD 134
Query: 67 LRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
R EC+ Y LP SVI+VFHNE +S L+RT+ S+I R+P Q L+EIILVDD S +
Sbjct: 135 YRTNECRDKRYAHGLPSTSVIIVFHNEAWSVLLRTITSVINRSPRQLLKEIILVDDASDR 194
Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
+ L ++LE YI+ N RL R ER GL+ R GA+ +RG+V+ FLDAHCE WL
Sbjct: 195 SFLKRQLEAYIKVLNVPTRLYRMKERSGLVPARLMGAQHARGDVLTFLDAHCECSRGWLE 254
Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK--ENELPEREA 244
PLLA I R+++ PVID I + + +E +H+ G F W + ++ ++ + +
Sbjct: 255 PLLARIKESREVVICPVIDIISDDNFSYTKTFE--NHW-GAFNWQLSFRWFSSDRKRQTS 311
Query: 245 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
K K ++ P +P AGGLFA+DR +F E+G YD + +WGGEN E+SF+IW CGG IE
Sbjct: 312 VKPKDSTAPIATPGMAGGLFAIDRKYFYEMGAYDSEMRIWGGENVEMSFRIWQCGGRIEI 371
Query: 305 VPCSRIGHVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDE 348
PCS +GH++RS PY F G +++ ++T N R W D+
Sbjct: 372 SPCSHVGHIFRSSTPYTFPGGMSE-----VLTANLARAATVWMDD 411
>gi|22760242|dbj|BAC11118.1| unnamed protein product [Homo sapiens]
Length = 622
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL-- 364
CS +GHV+R+ P+ F K +I N R+ E W D +K FY R A +
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMTQ 455
Query: 365 --DMGDISEQ 372
GDISE+
Sbjct: 456 EKSFGDISER 465
>gi|345497732|ref|XP_001601595.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Nasonia vitripennis]
Length = 610
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 136/307 (44%), Positives = 194/307 (63%), Gaps = 11/307 (3%)
Query: 51 NMETSNHISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
N+ S+ I +RT+PD+R ++C +Y + DLP SVI+VFHNE +S+L+RTVHS+I R
Sbjct: 126 NLLASDRIPLNRTLPDVRKKKCITRYANLG-DLPSTSVIIVFHNEAWSTLLRTVHSVINR 184
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
+P + LEEIILVDD S + L + L++Y+ + N R++R+ +R GL+ R GA E++G
Sbjct: 185 SPRKLLEEIILVDDNSDRDFLRKPLDEYVAQLNVPTRVLRSDKRVGLVNARLMGANEAKG 244
Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
EV+ FLDAHCE WL PLL I +R + PVID I+ T+ + +E H+ G F
Sbjct: 245 EVLTFLDAHCECTAGWLEPLLEAISKNRTRVVSPVIDIINDDTFSYTRSFE--LHW-GAF 301
Query: 229 EWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
W + ++ L ++R+ N +P+K+P AGGLF+MDR +F ELG YD + +WGGE
Sbjct: 302 NWDLHFRWLMLNGALLRERRENIVDPFKTPAMAGGLFSMDREYFFELGSYDEHMRIWGGE 361
Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
N ELSF++W CGGS+E PCS +GH++R PY F D + + N RV W D
Sbjct: 362 NLELSFRVWQCGGSVEIAPCSHVGHIFRKSSPYTFPGGVDEI----LYGNLARVALVWMD 417
Query: 348 EKHKAYF 354
E K YF
Sbjct: 418 EWGKFYF 424
>gi|449493914|ref|XP_004175359.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 12 [Taeniopygia
guttata]
Length = 594
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 153/365 (41%), Positives = 219/365 (60%), Gaps = 24/365 (6%)
Query: 14 EPPLEPYKEGPGEGGKAYHL--PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
+P L+P GE G+A L A + + S+ + +N+ S+ IS R +P+
Sbjct: 78 KPALDP--GALGELGRAVRLELSPAEKRRQEESIRRHQINIYLSDRISLHRRLPERWHPL 135
Query: 72 C--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
C K +DY +LPK SV++ F+NE +S+L+RTVHS+++ +P LEEIILVDD+S K L
Sbjct: 136 CREKKYDY-YNLPKTSVVIAFYNEAWSTLLRTVHSVLETSPDILLEEIILVDDYSDKEHL 194
Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
+ LE+Y+ KVRLIR +REGL+R R GA ++G+++ FLD HCE WL PLL
Sbjct: 195 KETLENYVAGLR-KVRLIRANKREGLVRARLLGASVAKGDILTFLDCHCECHEGWLEPLL 253
Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
A I + + PVID ID+ T+E+ + EP G F+ +++ + PERE K+RK
Sbjct: 254 ARIAEEETAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDXRLVFTWHSTPEREQKRRK 310
Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
++ +SPT AGGLF++ + +F LG YD G+ VWGGEN E SF+IW CGGS+E PCS
Sbjct: 311 SKTDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCS 370
Query: 309 RIGHVYRSFMPYNFGK-LADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG 367
+GHV+ PY+ K LA+ V R E W DE +K +Y R P A G
Sbjct: 371 HVGHVFPKQAPYSRAKALANSV----------RAAEVWMDE-YKQLYYHRNPHARLEPYG 419
Query: 368 DISEQ 372
D++E+
Sbjct: 420 DVTER 424
>gi|158289989|ref|XP_311577.4| AGAP010367-PA [Anopheles gambiae str. PEST]
gi|157018424|gb|EAA07231.4| AGAP010367-PA [Anopheles gambiae str. PEST]
Length = 587
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 152/352 (43%), Positives = 207/352 (58%), Gaps = 29/352 (8%)
Query: 23 GPGEGGKAYHLP--EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
GPGE GK L EA + G N S+ IS +R+I DLR
Sbjct: 75 GPGEQGKPATLSPEEATSELRKELYYKNGFNALLSDKISINRSIADLR------------ 122
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
SV++ F+ E +S+L+RT++S++ R+P L+EII+VDD S+K L KLEDY+++
Sbjct: 123 --HPSVVVPFYEEHWSTLLRTIYSVLNRSPPHLLKEIIIVDDGSTKEFLHNKLEDYVKQN 180
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
KV+L+R ER GLI+ R GAK + G+V++FLD+H E G NWLPPLL PI + K
Sbjct: 181 LPKVKLVRQPERTGLIKARLAGAKIASGDVLIFLDSHTEAGYNWLPPLLEPIAENPKTCV 240
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P+ID ID QT++ V+ D RG+F+W YK + + R +EP+ SP A
Sbjct: 241 CPLIDVIDDQTFD---VHPQDEGGRGLFDWTFHYKRVVIKNED---RISPTEPFPSPVMA 294
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+ FF ELGGYD L +WG E +E+SFKIW CGG + PCSR GH+YR++ P+
Sbjct: 295 GGLFAIGADFFWELGGYDEELDIWGAEQYEISFKIWQCGGRMLDAPCSRFGHIYRTYSPF 354
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMF-LDMGDISE 371
+ D IT N+KRV E W DE +K Y Y R+P D GD+S+
Sbjct: 355 PNSRKYD-----FITRNHKRVAEIWMDE-YKQYIYDRDPERYAKTDAGDMSK 400
>gi|1934912|emb|CAA69875.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 578
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 155/376 (41%), Positives = 212/376 (56%), Gaps = 28/376 (7%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHI 58
RP++K +PP + GE GKA L E + + Y +N+ S+ I
Sbjct: 61 RPLYK--------KPPAD--SRALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRI 110
Query: 59 SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
S R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EI
Sbjct: 111 SLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEI 170
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
ILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FL H
Sbjct: 171 ILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLYCH 229
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKE 236
CE WL PLL I + PVID ID+ T+EF + EP G F+W + ++
Sbjct: 230 CECNSGWLEPLLERIGRYETAVVCPVIDTIDWNTFEFYMQIGEP---MIGGFDWRLTFQW 286
Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
+ +P++E +R +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W
Sbjct: 287 HSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVW 346
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
CGG +E PCS +GHV+ PY P N R E W DE +K +FY
Sbjct: 347 QCGGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYN 396
Query: 357 REPLAMFLDMGDISEQ 372
R P A GDISE+
Sbjct: 397 RNPPARKEAYGDISER 412
>gi|5834600|emb|CAA69876.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
gi|300470331|dbj|BAJ10977.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 6 [Homo sapiens]
Length = 622
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSIPKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
CS +GHV+R+ P+ F K +I N R+ E W D +K FY R +A
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMAQ 455
Query: 363 FLDMGDISEQ 372
GDISE+
Sbjct: 456 EKSFGDISER 465
>gi|189240187|ref|XP_975207.2| PREDICTED: similar to AGAP008229-PA [Tribolium castaneum]
Length = 575
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 133/312 (42%), Positives = 195/312 (62%), Gaps = 11/312 (3%)
Query: 51 NMETSNHISFDRTIPDLRMEECK--YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
N+ S+ I +R++PD R ++C + DYP PK S+I+VFHNE +S+L+RTV S+I R
Sbjct: 91 NLLASDRIPLNRSLPDFRRKKCATLFGDYPT-YPKTSIIIVFHNEAWSTLLRTVWSVINR 149
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
+P + LEEIILVDD S + L + L+DY+ +++R+ R GLI+ R +GA ++G
Sbjct: 150 SPPELLEEIILVDDSSERKFLKKPLDDYVANLPVPTKVLRSQARIGLIKARLKGALVAKG 209
Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
V+ FLDAHCE WL LL+ I DR + PVID I+ T+ + +E H+ G F
Sbjct: 210 PVLTFLDAHCECTTGWLEALLSVIKQDRTAVVCPVIDIINDDTFAYVKSFE--LHW-GAF 266
Query: 229 EWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
W + ++ L RE K RK + ++P+ +PT AGGLFA+DR +F E+G YD G+ +WGGE
Sbjct: 267 NWNLQFRWFTLGGRELKLRKNDATQPFNTPTMAGGLFAIDREYFFEMGAYDDGMNIWGGE 326
Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
N E+SF+IW CGG ++ PCSR+GH++R PY+F ++ + N RV W D
Sbjct: 327 NLEMSFRIWQCGGKVQIAPCSRVGHLFRKSSPYSFPGGINKT----LFSNLARVARVWMD 382
Query: 348 EKHKAYFYTREP 359
+ + YF EP
Sbjct: 383 DWARFYFKFNEP 394
>gi|344266859|ref|XP_003405496.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Loxodonta africana]
Length = 622
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 150/371 (40%), Positives = 218/371 (58%), Gaps = 24/371 (6%)
Query: 15 PPLEPYKEGPGEGGKAYH----LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRM 69
PP +P GPG GKA+ P+ + + ++ N S+ IS R + PD R
Sbjct: 106 PPQDP--NGPGADGKAFQKDKWTPQETQEK-EEGYKKHCFNAFASDRISLQRALGPDTRP 162
Query: 70 EEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ PA +L+EIILVDD S++
Sbjct: 163 PECLDQKFRRCP-QLPTTSVIIVFHNEAWSTLLRTVYSVLHTAPAIFLKEIILVDDASTE 221
Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
L ++L+ Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL
Sbjct: 222 EYLKEQLDQYVKQLQ-IVRVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLE 280
Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
PLLA I D ++ P I ID T+EF + V H RG F+W + + +P E +
Sbjct: 281 PLLARIAEDETVVVSPDIITIDLNTFEFSKPVQRGRVHSRGNFDWSLTFGWETVPLHEKQ 340
Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
+RK + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +
Sbjct: 341 RRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEII 400
Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL- 364
PCS +GHV+R+ P+ F K + +I N R+ E W D+ +K FY R A +
Sbjct: 401 PCSVVGHVFRTKSPHTFPKGIN-----VIARNQVRLAEVWMDD-YKEIFYRRNLQAAKMA 454
Query: 365 ---DMGDISEQ 372
GDISE+
Sbjct: 455 EEKSFGDISER 465
>gi|158300689|ref|XP_320549.4| AGAP011984-PA [Anopheles gambiae str. PEST]
gi|157013282|gb|EAA00339.4| AGAP011984-PA [Anopheles gambiae str. PEST]
Length = 585
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 149/357 (41%), Positives = 207/357 (57%), Gaps = 26/357 (7%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASL-GEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
E + GPGE G+ Y L A +A L E G + S+ I+ +R+ +
Sbjct: 70 ESKRTGPGEHGRPYKLSSEQDIALNAKLFKENGYSAVVSDMIALNRS------------E 117
Query: 77 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
Y +LP SVI++F+NE +S+L+RTV+S++ R+P L+EIILV+D S+K L L ++
Sbjct: 118 YLKELPTVSVIIIFYNEHWSALLRTVYSVLNRSPPALLKEIILVNDHSTKPFLWTPLREF 177
Query: 137 IQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
++ KVRL+ ER GLI R GA+E+RG+V++ LD+H EV NWLPPLL PI D
Sbjct: 178 VESELAPKVRLVDLPERSGLIVARMAGAREARGDVLIVLDSHTEVNTNWLPPLLEPIAED 237
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYK 255
+ P ID I + T+++RS D RG F+W YK L + ++P+
Sbjct: 238 YRTCVCPFIDVIAHDTFQYRS---QDEGKRGAFDWKFYYKRLPLLPGDLDD---PTKPFN 291
Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
SP AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIW CGG + PCSR+GHVYR
Sbjct: 292 SPVMAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRLVDAPCSRVGHVYR 351
Query: 316 SFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ P+ + + + N+KRV E W DE + + Y R P D GD+S Q
Sbjct: 352 GYAPFGNPRGVN-----FVVRNFKRVAEVWMDE-YSQFLYERNPQFAKTDPGDLSAQ 402
>gi|195467145|ref|XP_002076010.1| GK16099 [Drosophila willistoni]
gi|194172095|gb|EDW86996.1| GK16099 [Drosophila willistoni]
Length = 348
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 141/313 (45%), Positives = 195/313 (62%), Gaps = 13/313 (4%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ G GE G+ H+ + D G N S+ IS +R++PD+R EECK Y
Sbjct: 36 RTGMGEHGEPSHIDAQEKELEDKIYRMNGFNGLLSDRISINRSVPDVRREECKTRKYLAK 95
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-R 139
LP+ASVI +F+NE F++L+R+++S+I RTP + L++I+LVDD S L Q+L+DY+
Sbjct: 96 LPQASVIFIFYNEHFNTLLRSIYSVINRTPPELLKQIVLVDDGSDWEVLKQQLDDYVSLH 155
Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
F V ++RN ER GLI R GAK + GEV+VF D+H EV NWLPPLL PI D KI
Sbjct: 156 FPQLVHVVRNPERRGLIGARIAGAKVATGEVLVFFDSHIEVNYNWLPPLLEPIAIDSKIS 215
Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKKRKYNSEPYKSPT 258
T P++D I++ T+ + ++ RG F+W YK+ LPE K S PY++P
Sbjct: 216 TCPIVDSIEHSTFAYSGGHQ--EGSRGGFDWRFYYKQLPVLPEDSLDK----SLPYRNPV 269
Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
GGLFA++ FF +LGGYD L +WGGE +ELSFKIWMCGG + VPCSR+ H++R M
Sbjct: 270 MMGGLFAINTKFFWDLGGYDDELDIWGGEQYELSFKIWMCGGMLLDVPCSRVAHIFRGPM 329
Query: 319 -----PYNFGKLA 326
P N+ +A
Sbjct: 330 DARPNPRNYNFVA 342
>gi|268580247|ref|XP_002645106.1| Hypothetical protein CBG16794 [Caenorhabditis briggsae]
Length = 568
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 140/358 (39%), Positives = 215/358 (60%), Gaps = 13/358 (3%)
Query: 20 YKEGP-GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYP 78
+K P G G +P+ + ++ E N+ S IS +RT+PD R + C+
Sbjct: 64 FKYSPHGSNGDGVKIPDHLKNLEESRFSENNFNVVASEMISVNRTLPDYRSDACRISGGK 123
Query: 79 LD---LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
++ LP+AS+I+ FHNE +++++RT+HSI R+P +EEI+LVDD+S K L L+
Sbjct: 124 INTTELPRASIIITFHNEAWTTIIRTLHSISNRSPRHLIEEIVLVDDYSDKYWLKGPLDI 183
Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
Y+++F V + ER GLIR R GAK ++G +++FLD+H EV WL PL++ + D
Sbjct: 184 YVRQFEIPVHVTHLPERSGLIRARLTGAKIAKGPILLFLDSHIEVSEGWLEPLISRVADD 243
Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR-KYNSEPY 254
R + P+ID I + + F S D G F W + +K ++ + ++ +EP
Sbjct: 244 RTRIIAPIIDNISDEDFGF-STGRTD--LWGGFSWILSFKWFDMNGNDTQRLIAKKAEPI 300
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
++PT AGGLFA++R +F E+G YD G+ VWGGEN E+SF+IWMCGGS+E PCS +GHV+
Sbjct: 301 RTPTIAGGLFAINREYFYEMGAYDEGMEVWGGENVEISFRIWMCGGSMEIHPCSHVGHVF 360
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
R+ PY+F K + V I N R E W DE +K +F+ P A +++GD+ E+
Sbjct: 361 RTKTPYSFTKEVNFV----IRRNQARTAEVWMDE-YKEFFFKMVPSAQKMEIGDLQER 413
>gi|270011650|gb|EFA08098.1| hypothetical protein TcasGA2_TC005702 [Tribolium castaneum]
Length = 607
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 133/312 (42%), Positives = 195/312 (62%), Gaps = 11/312 (3%)
Query: 51 NMETSNHISFDRTIPDLRMEECK--YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
N+ S+ I +R++PD R ++C + DYP PK S+I+VFHNE +S+L+RTV S+I R
Sbjct: 123 NLLASDRIPLNRSLPDFRRKKCATLFGDYP-TYPKTSIIIVFHNEAWSTLLRTVWSVINR 181
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
+P + LEEIILVDD S + L + L+DY+ +++R+ R GLI+ R +GA ++G
Sbjct: 182 SPPELLEEIILVDDSSERKFLKKPLDDYVANLPVPTKVLRSQARIGLIKARLKGALVAKG 241
Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
V+ FLDAHCE WL LL+ I DR + PVID I+ T+ + +E H+ G F
Sbjct: 242 PVLTFLDAHCECTTGWLEALLSVIKQDRTAVVCPVIDIINDDTFAYVKSFE--LHW-GAF 298
Query: 229 EWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
W + ++ L RE K RK + ++P+ +PT AGGLFA+DR +F E+G YD G+ +WGGE
Sbjct: 299 NWNLQFRWFTLGGRELKLRKNDATQPFNTPTMAGGLFAIDREYFFEMGAYDDGMNIWGGE 358
Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
N E+SF+IW CGG ++ PCSR+GH++R PY+F ++ + N RV W D
Sbjct: 359 NLEMSFRIWQCGGKVQIAPCSRVGHLFRKSSPYSFPGGINKT----LFSNLARVARVWMD 414
Query: 348 EKHKAYFYTREP 359
+ + YF EP
Sbjct: 415 DWARFYFKFNEP 426
>gi|332030446|gb|EGI70134.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Acromyrmex
echinatior]
Length = 595
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 134/308 (43%), Positives = 197/308 (63%), Gaps = 13/308 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
N+ S+ I +R++PD+R ++C +Y + +LPK S+I+VFHNE +S+L+RTVHS+I R
Sbjct: 132 NLMASDKIPLNRSLPDVRKKKCISRYTNLG-NLPKTSIIIVFHNEAWSTLLRTVHSVINR 190
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
+P + LEEIILVDD S + L L+DY++ + R++R+ ER GLI+ R GA +++G
Sbjct: 191 SPKELLEEIILVDDNSEREFLKNSLDDYVKNLSVSTRVLRSNERIGLIKARLLGANDAKG 250
Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
EV+ FLDAHCE + WL PLL + + + PVID I+ T+ + +E H+ G F
Sbjct: 251 EVLTFLDAHCECTIGWLEPLLEAVGKNATRIVAPVIDIINDNTFSYTRSFE--LHW-GAF 307
Query: 229 EWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
W + ++ L R K+R+ N EP+++P AGGLF+M+R +F +LG YD + +WGGE
Sbjct: 308 NWDLHFRWLTLNGRLLKERRDNIVEPFRTPAMAGGLFSMNRDYFFKLGSYDDQMRIWGGE 367
Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWF 346
N ELSF+ W CGGSIE PCS +GH++R PY F G + D + G N RV W
Sbjct: 368 NLELSFRAWQCGGSIEIAPCSHVGHLFRKSSPYTFPGGVGDILYG-----NLARVALVWM 422
Query: 347 DEKHKAYF 354
D+ + YF
Sbjct: 423 DQWAEFYF 430
>gi|345304811|ref|XP_001505904.2| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Ornithorhynchus anatinus]
Length = 555
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 134/328 (40%), Positives = 197/328 (60%), Gaps = 17/328 (5%)
Query: 47 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
++ N S+ +S DR I D R C YP DLP S+++ FHNE S+L+RTV S++
Sbjct: 88 QHAFNQLESDKLSSDRAIRDTRHYRCTSAHYPSDLPVTSIVITFHNEARSTLLRTVKSVL 147
Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
RTPA + EIILVDDFS+ + D +L I KV+ + N +REGLIR+R RGA+ +
Sbjct: 148 NRTPANLVREIILVDDFSADPE-DCQLLTRIP----KVKCLHNNQREGLIRSRVRGAEVA 202
Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
+++ FLD+HCEV WL PLL + D + P+ID I + + + RG
Sbjct: 203 TADILTFLDSHCEVNSEWLQPLLQRVKEDYTRVVSPIIDVISLDNFAYLAA---SADLRG 259
Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F+W + +K ++P + R ++ ++P AGG+F +D+++F LG YD + +WGG
Sbjct: 260 GFDWSLHFKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGG 319
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIET 344
ENFELSF++WMCGGS+E VPCSR+GHV+R PY+F +G +TY N KR E
Sbjct: 320 ENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYDFP------EGNALTYIKNTKRAAEV 373
Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
W D+ +K Y+Y P A+ G ++E+
Sbjct: 374 WMDD-YKQYYYEARPSAIGKAFGSVAER 400
>gi|291231066|ref|XP_002735481.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Saccoglossus kowalevskii]
Length = 2434
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 135/344 (39%), Positives = 207/344 (60%), Gaps = 19/344 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY + + G + N S+ +S+DR IPD R CK D+ LP+ SVI+
Sbjct: 1943 KAY-ISKTVVQTGQDAYARNKFNQVESDKLSYDRDIPDTRNPLCKKLDWKTALPQTSVII 2001
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ R+P ++EIILVDD+S A+ ++LE KV+++R
Sbjct: 2002 TFHNEARSTLLRTVVSVLNRSPTSIIKEIILVDDYSDNAEDGKELEKI-----PKVKVLR 2056
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N +REGL+R+R RGA + G ++ FLD+HCE NW+ PL+ I + K + P+ID I+
Sbjct: 2057 NEKREGLMRSRVRGADYATGTILTFLDSHCECNQNWIEPLITKIQENNKAVVSPIIDVIN 2116
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY---KSPTHAGGLFA 265
+++ + +G F+W +++K + + E KRK S+P ++P AGGLFA
Sbjct: 2117 MDNFQYVAASA---DLKGGFDWNLVFKWDYMTPAERNKRK--SDPIAAIRTPMIAGGLFA 2171
Query: 266 MDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKL 325
+ +++F ELG YD + VWGGEN E+SF++W CGG++E +PCSR+GHV+R PY F
Sbjct: 2172 ISKSWFEELGKYDMMMDVWGGENLEISFRVWQCGGTLEIIPCSRVGHVFRKQHPYTFPGG 2231
Query: 326 ADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+ G + N +R E W DE +K Y+Y+ P + + G+I
Sbjct: 2232 S----GNVFAKNTRRAAEVWMDE-YKKYYYSAVPSSKNIAFGNI 2270
>gi|345491789|ref|XP_001607575.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Nasonia vitripennis]
Length = 566
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 137/333 (41%), Positives = 199/333 (59%), Gaps = 10/333 (3%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
G+ G+A +L ++ + G + +N+ SN I R I D+R CK Y LP
Sbjct: 63 GDFGEAAYLSDSEKQNGSLVYSKRAVNVVLSNKIPLQRRIRDMRDPLCKSVTYDTKLPTT 122
Query: 85 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGK 143
SV+++FHNE +S L+RTV+S+++ +P ++L+EIILVDD S++ +L+ L YI+ R K
Sbjct: 123 SVVIIFHNEAWSVLLRTVYSVLQESPPKFLKEIILVDDNSNEEELEDILAYYIETRLPKK 182
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
V+L+R +R+GLIR R GA+++ G+V+VFLDAHCEV WL PLL I + + +PV
Sbjct: 183 VKLLRLPKRQGLIRARLAGAQQATGDVLVFLDAHCEVTKGWLSPLLHRIKARPNAVLIPV 242
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 263
ID ID +T E++ H G F+W + + + + +P +PT AGGL
Sbjct: 243 IDVIDAKTLEYKLAARGSHMPIGGFKWTGDFTWINMEDSPKRTTASPIDPINTPTMAGGL 302
Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
FA+DR +F +G YD + WGGEN E+SF+IW CGGSIE VPCSR+GH++R F PY F
Sbjct: 303 FAIDRKYFWVIGSYDELMDGWGGENLEMSFRIWQCGGSIEIVPCSRVGHIFRDFFPYEFP 362
Query: 324 KLADRVKGPLITY--NYKRVIETWFDEKHKAYF 354
D TY N R W D+ + +F
Sbjct: 363 SSRD-------TYLINTARAAHVWMDDYKRLFF 388
>gi|410899503|ref|XP_003963236.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Takifugu rubripes]
Length = 618
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 149/370 (40%), Positives = 216/370 (58%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYH----LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRM 69
PP +P PG GKA+ PE D + + N S+ IS R++ D R
Sbjct: 100 PPQDP--GSPGADGKAFKKDQMSPEEETEKKDG-MTRHCFNQFASDRISLSRSLGEDTRP 156
Query: 70 EECKYWDYPL--DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC +P LP SVI+VFHNE +S+L+RTV+S++ +PA L+EIILVDD S
Sbjct: 157 RECVERKFPRCPALPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAVLLKEIILVDDASVAG 216
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L ++LE+++ +F VR++R ER+GLI R GA E++GEV+ FLDAHCE WL P
Sbjct: 217 HLKEQLEEFVLQFK-IVRVLRQPERKGLITARLLGASEAQGEVLTFLDAHCECFHGWLEP 275
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY-RGIFEWGMLYKENELPEREAKK 246
LLA I + + P I ID ++++F H + RG F+W + + ++PE K
Sbjct: 276 LLARIVEEPTAVVSPEITTIDLESFQFNKPAPSSHAFNRGNFDWSLTFGWEQIPEAARKL 335
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P K+PT AGGLF++ + +F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 336 RKDETCPVKTPTFAGGLFSILKTYFEHIGTYDDKMEIWGGENIEMSFRVWQCGGQLEIIP 395
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL-- 364
CS +GHV+R+ P+ F K + +IT N R+ E W D+ +K FY R A +
Sbjct: 396 CSVVGHVFRTKSPHTFPKGTE-----VITRNQVRLAEVWMDD-YKKIFYRRNKNAAKMAK 449
Query: 365 --DMGDISEQ 372
+ GDISE+
Sbjct: 450 ENNYGDISER 459
>gi|312082212|ref|XP_003143351.1| glycosyl transferase [Loa loa]
Length = 580
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 146/318 (45%), Positives = 194/318 (61%), Gaps = 25/318 (7%)
Query: 10 LGNLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
L N + P+ YK G PGEGGKA + A + + + G N N
Sbjct: 97 LFNRDSPI--YKSGDEHQPGEGGKAVIIDRNKLAFSEKRIYDDGFNKNAFN--------- 145
Query: 66 DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
+CK Y DLP SVI+ FHNE +S L+RTVHS+++RTP L EIILVDDFS
Sbjct: 146 -----QCKTEKYANDLPNTSVIICFHNEAWSVLLRTVHSVLERTPENLLAEIILVDDFSD 200
Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
A L LE Y+++F KVR++R +REGLIR R +GA S+G VI +LD+HCE W+
Sbjct: 201 MAHLKASLEIYMRQF-PKVRILRLEKREGLIRARIKGAAISKGSVITYLDSHCECLEGWM 259
Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-GIFEWGMLYKENELPEREA 244
PLL I + K + PVID ID T+E+ Y + G F+W + + + +PE++
Sbjct: 260 EPLLDRIKKNPKTVVCPVIDVIDDNTFEYH--YSKAYFTNVGGFDWSLQFNWHAIPEKDR 317
Query: 245 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
K R+ + +P KSPT AGGLF++DR FF +LG YDPGL +WGGEN ELSFK WMCGG +E
Sbjct: 318 KGRR-DIDPVKSPTMAGGLFSIDRTFFEKLGSYDPGLDIWGGENLELSFKTWMCGGILEI 376
Query: 305 VPCSRIGHVYRSFMPYNF 322
VPCS +GH++R PY +
Sbjct: 377 VPCSHVGHIFRKRSPYKW 394
>gi|153792142|ref|NP_001093363.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 16 [Xenopus laevis]
gi|148744516|gb|AAI42582.1| LOC100101309 protein [Xenopus laevis]
Length = 563
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 136/343 (39%), Positives = 198/343 (57%), Gaps = 17/343 (4%)
Query: 32 HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFH 91
+L + AG+ ++ N S+ +S +R I D R C Y DLP SVI+ FH
Sbjct: 81 YLSSKFIKAGEDPYRQHAFNQLESDKLSSERPIRDTRHYRCTSVHYDNDLPSTSVIITFH 140
Query: 92 NEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTE 151
NE S+L+RT+ S++ R+P ++EIILVDDFS+ D Q L KV+ +RN
Sbjct: 141 NEARSTLLRTIKSVLIRSPGNLIQEIILVDDFSTDPDDCQLLTKI-----PKVKCLRNNR 195
Query: 152 REGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQT 211
REGLIR+R RGA+ + V+ FLD+HCEV WL PLL + D + P+ID I
Sbjct: 196 REGLIRSRVRGAELAAAPVLTFLDSHCEVNNEWLQPLLQRVKDDHTRVVSPIIDVISLDN 255
Query: 212 WEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFF 271
+ + + RG F+W + +K ++P + R + ++P AGG+F +D+++F
Sbjct: 256 FAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSRTDPTSSIRTPVIAGGIFVIDKSWF 312
Query: 272 LELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG 331
+LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PY F G
Sbjct: 313 NQLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYEFP------DG 366
Query: 332 PLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+TY N KR +E W DE +K Y+Y P A+ G ++++
Sbjct: 367 NALTYIKNTKRTVEVWMDE-YKQYYYQARPSAIGKSYGSVADR 408
>gi|380016857|ref|XP_003692388.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like,
partial [Apis florea]
Length = 556
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 141/332 (42%), Positives = 199/332 (59%), Gaps = 14/332 (4%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
D Y N+ S++I R +PD R + C+ Y L AS+++ F+NE + +L+R+
Sbjct: 47 DEGYKNYSFNILVSDNIGLHRELPDTRHKLCEIQKYSSKLSNASIVICFYNEHYMTLLRS 106
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRS 160
+HSII RTP L EIILV+D+S L +K++ YI FNGKV+ + +REGLIR R
Sbjct: 107 LHSIIDRTPTYLLHEIILVNDWSDSKILHEKIKIYIANNFNGKVKYFKTEKREGLIRARI 166
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
GA+++ GE+++FLD+H EV W+ PLL+ I + I +PVID I+ T++ Y
Sbjct: 167 FGARKATGEILIFLDSHIEVNKQWIEPLLSRIVYSKTITAMPVIDIINPDTFQ----YTG 222
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F WG+ +K + +P + +P KSPT AGGLFAM+R +F +LG YD G
Sbjct: 223 SPLVRGGFNWGLHFKWDNVPIGTFVHDEDFVKPIKSPTMAGGLFAMNREYFTKLGEYDAG 282
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
+ +WGGEN E+SF+IWMCGGSIE +PCSR+GHV+R PY D + N R
Sbjct: 283 MDIWGGENLEISFRIWMCGGSIELIPCSRVGHVFRKRRPYGAYDQHDT-----MLKNSLR 337
Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
V W DE +K YF +D GDI+E+
Sbjct: 338 VAHVWLDE-YKDYFLQN---IKKIDYGDITER 365
>gi|52851353|dbj|BAD52069.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase [Mus musculus]
Length = 550
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 139/336 (41%), Positives = 194/336 (57%), Gaps = 19/336 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R + C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
RT+ S++ RTP ++EIILVDDFS+ ED Q KV+ +R+ ER+GL+R+
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRHNERQGLVRS 182
Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
R RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ +
Sbjct: 183 RMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---I 239
Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
E RG F+W + ++ +L + R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 240 ESASELRGGFDWSLHFQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYD 299
Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
+ +WGGENFE+SF++WMCGG +E +PCSR+GHV+R PY F G TY
Sbjct: 300 VDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIK 353
Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
N KR E W DE +K Y+Y P A+ G+I +
Sbjct: 354 NTKRTAEVWMDE-YKQYYYAARPFALERPFGNIENR 388
>gi|195488539|ref|XP_002092358.1| GE11714 [Drosophila yakuba]
gi|194178459|gb|EDW92070.1| GE11714 [Drosophila yakuba]
Length = 601
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 146/365 (40%), Positives = 207/365 (56%), Gaps = 15/365 (4%)
Query: 17 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
L+ K G GE G A HL A + GDA + +N E S +S++R++ D R C
Sbjct: 83 LQKQKAGLGEQGVAVHLSGAAKERGDAIYKKIALNEELSEQLSYNRSVGDHRNPLCAKQR 142
Query: 77 Y-PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
+ LP ASV+++F NE +S L+RTVHS + + L+EIILVDD S +L KL+
Sbjct: 143 FDAASLPTASVVIIFFNEPYSVLLRTVHSTLSTCNEKALKEIILVDDGSDNVELGAKLDY 202
Query: 136 YIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIY 193
Y++ GKV ++R R GLIR R GA+ + G+V++FLDAHCE + W PLL I
Sbjct: 203 YVRTRIPAGKVTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQRIK 262
Query: 194 SDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE- 252
R + VP+ID ID +++ + G F+W + LPERE ++++ +
Sbjct: 263 ESRTSVLVPIIDVIDANDFQYSTNGYKSFQVGG-FQWNGHFDWINLPEREKQRQRRECKH 321
Query: 253 -----PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 307
P SPT AGGLFA+DR +F E+G YD + WGGEN E+SF+IW CGG+IE +PC
Sbjct: 322 DREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETIPC 381
Query: 308 SRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG 367
SR+GH++R F PY F DR + N R+ W DE +F R L D+G
Sbjct: 382 SRVGHIFRDFHPYKFPN--DRDTHGI---NTARMALVWMDEYINIFFLNRPDLKFHADIG 436
Query: 368 DISEQ 372
D++ +
Sbjct: 437 DVTHR 441
>gi|149050681|gb|EDM02854.1| rCG61782, isoform CRA_a [Rattus norvegicus]
Length = 397
Score = 262 bits (669), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 136/325 (41%), Positives = 192/325 (59%), Gaps = 17/325 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R + C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP ++EIILVDDFS+ + ++L KV+ +RN+ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSNDPEDCKQLIKL-----PKVKCLRNSERQGLVRSR 183
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 MRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---IE 240
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 241 SASELRGGFDWSLHFQWEQLSVEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDV 300
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGG +E +PCSR+GHV+R PY F G TY N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354
Query: 338 YKRVIETWFDEKHKAYFYTREPLAM 362
KR E W DE +K Y+Y P A+
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFAL 378
>gi|260788889|ref|XP_002589481.1| hypothetical protein BRAFLDRAFT_125191 [Branchiostoma floridae]
gi|229274659|gb|EEN45492.1| hypothetical protein BRAFLDRAFT_125191 [Branchiostoma floridae]
Length = 488
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 142/344 (41%), Positives = 200/344 (58%), Gaps = 23/344 (6%)
Query: 28 GKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVI 87
GKA +P+ + N+ I+ +RT+PD+RME CK YP +LP+ SV+
Sbjct: 2 GKAVVIPKEKEKEKNEKFKINQFNLMACEMIALNRTLPDVRMEGCKSKTYPKELPRMSVV 61
Query: 88 LVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLI 147
+VFHNE + +L+R+V+SII RTP YLEEIILVDD S + V+L
Sbjct: 62 IVFHNEAWCTLLRSVNSIINRTPRPYLEEIILVDDASERGV--------------PVKLE 107
Query: 148 RNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGI 207
R +R GLIR R RG+ ++G VI FLDAH E W PLL I DR + P+ID I
Sbjct: 108 RMGKRSGLIRARLRGSGAAKGPVITFLDAHIECTEGWAEPLLTRIAEDRTTVVCPIIDVI 167
Query: 208 DYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAM 266
T+E+ + D Y G F W + ++ +P+RE +R + + P ++PT AGGLFA+
Sbjct: 168 SDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRGGDRTMPLRTPTMAGGLFAI 224
Query: 267 DRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLA 326
D+++F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +GHV+R PY F
Sbjct: 225 DKSYFEEIGTYDSGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKATPYTFPGGT 284
Query: 327 DRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
G +I N +R+ E W D K +FY P +D GD++
Sbjct: 285 ----GQIINKNNRRLAEVWMD-NFKDFFYIISPGVTKVDYGDVT 323
>gi|73996388|ref|XP_850161.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Canis lupus familiaris]
Length = 622
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 149/370 (40%), Positives = 215/370 (58%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE-AYRAAGDASLG--EYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+ + ++ + G ++ N S+ IS R + PD R
Sbjct: 106 PPQDP--NSPGADGKAFQKDKWTHQETQEKEEGYKKHCFNAFASDRISLQRALGPDTRPP 163
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P LP SV++VFHNE +S+L+RTV+S++ TPA L+EIILVDD S+
Sbjct: 164 ECVDQKFRRCP-PLPTTSVVIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTDE 222
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L ++LE Y+++ VR++R ER+GLI R GA ++ +V+ FLDAHCE WL P
Sbjct: 223 YLKEQLEQYVKKLQ-VVRVVRQEERKGLITARLLGASVAQAQVLTFLDAHCECFHGWLEP 281
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D ++ P I ID T+EF + V H RG F+W + + +P E ++
Sbjct: 282 LLARIAEDETVVVSPDIVTIDLNTFEFSKPVQRGRVHSRGNFDWSLTFGWEAIPAHEKQR 341
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
CS +GHV+R+ P+ F K +I N R+ E W D +K FY R +A
Sbjct: 402 CSVVGHVFRTKSPHTFPKGVS-----VIARNQVRLAEVWMD-NYKEIFYRRNMQAAKMAQ 455
Query: 363 FLDMGDISEQ 372
GDISE+
Sbjct: 456 EKSFGDISER 465
>gi|350426664|ref|XP_003494506.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 2 [Bombus impatiens]
Length = 637
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 158/360 (43%), Positives = 206/360 (57%), Gaps = 29/360 (8%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
G L P E PGE G+ LP E + D L N S+ IS RT+P
Sbjct: 85 GVLVAPREQDPSAPGEMGRPVILPTNLTAETKKLVDDGWLNN-AFNQYVSDLISVHRTLP 143
Query: 66 DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
D R CK Y DLP +VI+ FHNE +S L+RTVHS++ R+P ++EIILVDDFS
Sbjct: 144 DPRDPWCKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFS 203
Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
L ++LEDY+ + KV++IR +REGLIR R GA ++ V+ +LD+HCE W
Sbjct: 204 DMPHLQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGW 262
Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 237
L PLL I D + PVID ID T E+ H+R G F+W + + +
Sbjct: 263 LEPLLDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 314
Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
+PERE K+ K +EP SPT AGGLF++DRAFF LG YD G +WGGEN ELSFK WM
Sbjct: 315 AVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWM 374
Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
CGG++E VPCS +GH++R PY + R ++ N R+ E W DE K Y+Y R
Sbjct: 375 CGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKRNSIRLSEVWLDEYAK-YYYQR 428
>gi|340723544|ref|XP_003400149.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 3 [Bombus terrestris]
Length = 637
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 158/360 (43%), Positives = 206/360 (57%), Gaps = 29/360 (8%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
G L P E PGE G+ LP E + D L N S+ IS RT+P
Sbjct: 85 GVLVAPREQDPSAPGEMGRPVILPTNLTAETKKLVDDGWLNN-AFNQYVSDLISVHRTLP 143
Query: 66 DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
D R CK Y DLP +VI+ FHNE +S L+RTVHS++ R+P ++EIILVDDFS
Sbjct: 144 DPRDPWCKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFS 203
Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
L ++LEDY+ + KV++IR +REGLIR R GA ++ V+ +LD+HCE W
Sbjct: 204 DMPHLQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGW 262
Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 237
L PLL I D + PVID ID T E+ H+R G F+W + + +
Sbjct: 263 LEPLLDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 314
Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
+PERE K+ K +EP SPT AGGLF++DRAFF LG YD G +WGGEN ELSFK WM
Sbjct: 315 AVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWM 374
Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
CGG++E VPCS +GH++R PY + R ++ N R+ E W DE K Y+Y R
Sbjct: 375 CGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKRNSIRLSEVWLDEYAK-YYYQR 428
>gi|444515344|gb|ELV10843.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Tupaia chinensis]
Length = 614
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAY---HLPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P + PG GKA+ + + ++ N S+ IS R + PD R
Sbjct: 98 PPQDP--KSPGADGKAFQKNNWTPLETQEKEEGYKKHCFNAFASDRISLQRALGPDTRPP 155
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 156 ECVDQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTED 214
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L KLE Y++ V+++R ER+GLI R GAK ++ EV+ FLDAHCE WL P
Sbjct: 215 YLKDKLEQYVKELQ-VVKVVRQVERKGLITARLLGAKVAQAEVLTFLDAHCECFHGWLEP 273
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 274 LLARIAEDKTVVVSPDIVTIDLNTFEFSKPVQSGRVHSRGNFDWSLTFGWETLPPHEKQR 333
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 334 HKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 393
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
CS +GHV+R+ P+ F K + +I N R+ E W D +K FY R +A
Sbjct: 394 CSVVGHVFRTKSPHTFPKGIN-----VIARNQVRLAEVWMDS-YKQIFYRRNLQAAKMAQ 447
Query: 363 FLDMGDISEQ 372
GDISE+
Sbjct: 448 EKSFGDISER 457
>gi|195172682|ref|XP_002027125.1| GL20074 [Drosophila persimilis]
gi|194112938|gb|EDW34981.1| GL20074 [Drosophila persimilis]
Length = 597
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 151/370 (40%), Positives = 212/370 (57%), Gaps = 15/370 (4%)
Query: 12 NLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
+++ LE K G GE G + HL + GDA + +N E S +S++R++ D R
Sbjct: 74 SIQLDLEKQKIGLGEQGASVHLSGKAKERGDAIYKKIALNEELSEQLSYNRSVGDHRNPL 133
Query: 72 C--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
C +++D LP ASVI++F+NE +S L+RTVHS + Q L+EIILVDD S +L
Sbjct: 134 CLAQHFDSST-LPTASVIVIFYNEPYSVLLRTVHSTLITCNQQALKEIILVDDGSDNPEL 192
Query: 130 DQKLEDYIQRFN--GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
KL+ YI+ GKV ++R R GLIR R GA+ + G+V++FLDAHCE + W P
Sbjct: 193 GGKLDYYIRTRTPPGKVTVLRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNVGWCEP 252
Query: 188 LLAPIYSDRKIMTVPVIDGID-----YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
LL I R + VP+ID ID Y T ++S + G F+W L + + +R
Sbjct: 253 LLHRIKESRTSVLVPIIDVIDANDFQYSTNGYKSFQVGGFQWNGHFDWINLPEREKQRQR 312
Query: 243 EAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSI 302
K++ P SPT AGGLFAMDR +F E+G YD + WGGEN E+SF+IW CGG+I
Sbjct: 313 RECKQEREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTI 372
Query: 303 EWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM 362
E +PCSR+GH++R F PY F DR + N R+ W DE +F R L
Sbjct: 373 ETIPCSRVGHIFRDFHPYKFPN--DRDTHGI---NTARMALVWMDEFINIFFLNRPDLKF 427
Query: 363 FLDMGDISEQ 372
D+GD++ +
Sbjct: 428 HADIGDVTHR 437
>gi|195120520|ref|XP_002004772.1| GI19414 [Drosophila mojavensis]
gi|193909840|gb|EDW08707.1| GI19414 [Drosophila mojavensis]
Length = 604
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 149/358 (41%), Positives = 210/358 (58%), Gaps = 16/358 (4%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC--KYWDYPLDLP 82
G G A HL A +A G+ + +N E S +S++RT+ D R C + +D P LP
Sbjct: 92 GNKGVAVHLTGAAKARGERIYKKIALNEELSEQLSYNRTVGDHRNPLCLNQKYDDPSTLP 151
Query: 83 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFN 141
ASV+++F+NE +S L+RTVHS + + L+EIILVDD S A+L KL+ Y++ RF
Sbjct: 152 TASVVIIFYNEPYSVLVRTVHSTLNTCNEKSLKEIILVDDGSDNAELGGKLDYYVRTRFP 211
Query: 142 -GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
GKV ++R R GLIR R GA+ + G+V++FLDAHCE W PLL I R +
Sbjct: 212 PGKVTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEANEGWCEPLLQRIKESRTSVL 271
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR-KYNSEPYK---- 255
VP+ID ID + +++ + G F+W + LPERE +++ + S+P +
Sbjct: 272 VPIIDVIDAKDFQYSTNGYKSFQVGG-FQWSGHFDWVNLPEREKQRQLRECSQPREICPA 330
Query: 256 -SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
SPT AGGLFAMDR +F E+G YD + WGGEN E+SF+IW CGG+IE +PCSR+GH++
Sbjct: 331 YSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETIPCSRVGHIF 390
Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
R F PY F D G N R+ W DE +F R L D+GD++ +
Sbjct: 391 RDFHPYKFPNDRD-THG----INTARMALVWMDEYINVFFLNRPDLKFHPDIGDVTHR 443
>gi|349732170|ref|NP_001231847.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1-like [Sus
scrofa]
Length = 557
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 144/345 (41%), Positives = 199/345 (57%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY AA GE + N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAYLAAKQLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVIIT 130
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
REGLIR+R RGA + V+ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 186 DRREGLIRSRVRGADVAAAGVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 245
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + ++P ++P AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIAWTDPTKPIRTPVIAGGIFVIDKS 302
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PYNF
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400
>gi|148706467|gb|EDL38414.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14, isoform CRA_c [Mus
musculus]
Length = 429
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 136/325 (41%), Positives = 191/325 (58%), Gaps = 17/325 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R + C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP ++EIILVDDFS+ + ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSNDPEDCKQLIKL-----PKVKCLRNNERQGLVRSR 183
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 MRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---IE 240
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ +L + R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 241 SASELRGGFDWSLHFQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDV 300
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGG +E +PCSR+GHV+R PY F G TY N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354
Query: 338 YKRVIETWFDEKHKAYFYTREPLAM 362
KR E W DE +K Y+Y P A+
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFAL 378
>gi|332027983|gb|EGI68034.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Acromyrmex
echinatior]
Length = 597
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 145/352 (41%), Positives = 208/352 (59%), Gaps = 13/352 (3%)
Query: 25 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
G G+ +L + G+A+L + +N+ SN IS R +PD+R C Y LP A
Sbjct: 85 GNNGEPAYLYGREKILGEAALAKKALNVILSNKISLTRKLPDVRNPLCANVTYDKLLPSA 144
Query: 85 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGK 143
S+I++F+NE +S L+RTVHS++K +P L+EIILVDD S + +L +L+ Y+ R K
Sbjct: 145 SIIIIFYNEPWSVLLRTVHSVLKGSPPNLLKEIILVDDHSEEEELQGQLDYYLSTRLPAK 204
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
V+L+R R+GLIR R GAK + G+V+VFLDAHCEV +WL PLL I ++ + +P+
Sbjct: 205 VKLLRLPYRQGLIRARLHGAKNAVGDVLVFLDAHCEVIKDWLQPLLQRIKDNKNAVLMPI 264
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 263
ID I +T E+ E G F W + + + E + R P +SPT AGGL
Sbjct: 265 IDNISEETLEYFHDNEAFFFQVGGFTWSGHFTWITIQKHEVESRFSPISPTRSPTMAGGL 324
Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
FA++R +F E+G YD + WGGEN E+SF+IW CGG++E +PCSR+GH++R+F PY F
Sbjct: 325 FAINRKYFWEIGSYDDKMDGWGGENLEISFRIWQCGGTLEIIPCSRVGHIFRNFHPYKFP 384
Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLD----MGDISE 371
D G N R+ W DE + + R + F D +GDISE
Sbjct: 385 NDKD-THG----INTARLAFVWMDEYKRLFLLHR---SEFKDNPELIGDISE 428
>gi|125810093|ref|XP_001361353.1| GA20875 [Drosophila pseudoobscura pseudoobscura]
gi|54636528|gb|EAL25931.1| GA20875 [Drosophila pseudoobscura pseudoobscura]
Length = 597
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 151/370 (40%), Positives = 212/370 (57%), Gaps = 15/370 (4%)
Query: 12 NLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
+++ LE K G GE G + HL + GDA + +N E S +S++R++ D R
Sbjct: 74 SIQLDLEKQKIGLGEQGASVHLSGKAKERGDAIYKKIALNEELSEQLSYNRSVGDHRNPL 133
Query: 72 C--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
C +++D LP ASVI++F+NE +S L+RTVHS + Q L+EIILVDD S +L
Sbjct: 134 CLAQHFDSST-LPTASVIVIFYNEPYSVLLRTVHSTLITCNQQALKEIILVDDGSDNPEL 192
Query: 130 DQKLEDYIQRFN--GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
KL+ YI+ GKV ++R R GLIR R GA+ + G+V++FLDAHCE + W P
Sbjct: 193 GGKLDYYIRTRTPPGKVTVLRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNVGWCEP 252
Query: 188 LLAPIYSDRKIMTVPVIDGID-----YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
LL I R + VP+ID ID Y T ++S + G F+W L + + +R
Sbjct: 253 LLHRIKESRTSVLVPIIDVIDANDFQYSTNGYKSFQVGGFQWNGHFDWINLPEREKQRQR 312
Query: 243 EAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSI 302
K++ P SPT AGGLFAMDR +F E+G YD + WGGEN E+SF+IW CGG+I
Sbjct: 313 RECKQEREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTI 372
Query: 303 EWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM 362
E +PCSR+GH++R F PY F DR + N R+ W DE +F R L
Sbjct: 373 ETIPCSRVGHIFRDFHPYKFPN--DRDTHGI---NTARMALVWMDEFINIFFLNRPDLKF 427
Query: 363 FLDMGDISEQ 372
D+GD++ +
Sbjct: 428 HADIGDVTHR 437
>gi|350426661|ref|XP_003494505.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Bombus impatiens]
Length = 602
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 158/360 (43%), Positives = 206/360 (57%), Gaps = 29/360 (8%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
G L P E PGE G+ LP E + D L N S+ IS RT+P
Sbjct: 85 GVLVAPREQDPSAPGEMGRPVILPTNLTAETKKLVDDGWLNN-AFNQYVSDLISVHRTLP 143
Query: 66 DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
D R CK Y DLP +VI+ FHNE +S L+RTVHS++ R+P ++EIILVDDFS
Sbjct: 144 DPRDPWCKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFS 203
Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
L ++LEDY+ + KV++IR +REGLIR R GA ++ V+ +LD+HCE W
Sbjct: 204 DMPHLQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGW 262
Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 237
L PLL I D + PVID ID T E+ H+R G F+W + + +
Sbjct: 263 LEPLLDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 314
Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
+PERE K+ K +EP SPT AGGLF++DRAFF LG YD G +WGGEN ELSFK WM
Sbjct: 315 AVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWM 374
Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
CGG++E VPCS +GH++R PY + R ++ N R+ E W DE K Y+Y R
Sbjct: 375 CGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKRNSIRLSEVWLDEYAK-YYYQR 428
>gi|410910894|ref|XP_003968925.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Takifugu rubripes]
Length = 577
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 154/377 (40%), Positives = 223/377 (59%), Gaps = 30/377 (7%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHL--PEAYRAAGDASLGEYGMNMETSNHI 58
RPV++ +PPL+ PGE G+A L E + + SL ++ +N+ S+ +
Sbjct: 56 RPVYE--------KPPLD--WNAPGEMGRAVRLTLSEEEKRKEEESLQKHQINIYISDKV 105
Query: 59 SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
S R +P+ C+ Y LP SVI+ F+NEG+S+L+RTVHS+++ +P L+E+
Sbjct: 106 SLHRRLPERWNPLCRQLKYDYRSLPTTSVIIAFYNEGWSTLLRTVHSVLETSPDILLKEV 165
Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
+LVDD+S +A L + LE+YI KVRLIR T+REGL+R R GA + G+V+ FLD H
Sbjct: 166 VLVDDYSDRAHLKEPLENYISGLK-KVRLIRATKREGLVRARLLGASITTGDVLTFLDCH 224
Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKE 236
CE WL PLL I + + PVID ID+ +++ + EP G F+W +++
Sbjct: 225 CECHEGWLEPLLHRIKEEPSAVVCPVIDVIDWNNFQYLGNAGEPQ---IGGFDWRLVFTW 281
Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
+ +PE E K+RK ++ +SPT AGGLFA+ + +F LG YD G+ VWGGEN E SF+IW
Sbjct: 282 HSIPEYEQKRRKSPTDVIRSPTMAGGLFAVSKNYFHYLGTYDTGMEVWGGENLEFSFRIW 341
Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGK-LADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
CGGS+E PCS +GHV+ PY+ K LA+ V R E W DE +K +Y
Sbjct: 342 QCGGSLEVHPCSHVGHVFPKKAPYSRNKALANSV----------RAAEVWMDE-YKEIYY 390
Query: 356 TREPLAMFLDMGDISEQ 372
R P A GD++E+
Sbjct: 391 HRNPHARLEAYGDVTER 407
>gi|340723540|ref|XP_003400147.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Bombus terrestris]
gi|340723542|ref|XP_003400148.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 2 [Bombus terrestris]
Length = 602
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 158/360 (43%), Positives = 206/360 (57%), Gaps = 29/360 (8%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
G L P E PGE G+ LP E + D L N S+ IS RT+P
Sbjct: 85 GVLVAPREQDPSAPGEMGRPVILPTNLTAETKKLVDDGWLNN-AFNQYVSDLISVHRTLP 143
Query: 66 DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
D R CK Y DLP +VI+ FHNE +S L+RTVHS++ R+P ++EIILVDDFS
Sbjct: 144 DPRDPWCKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFS 203
Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
L ++LEDY+ + KV++IR +REGLIR R GA ++ V+ +LD+HCE W
Sbjct: 204 DMPHLQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGW 262
Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 237
L PLL I D + PVID ID T E+ H+R G F+W + + +
Sbjct: 263 LEPLLDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 314
Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
+PERE K+ K +EP SPT AGGLF++DRAFF LG YD G +WGGEN ELSFK WM
Sbjct: 315 AVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWM 374
Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
CGG++E VPCS +GH++R PY + R ++ N R+ E W DE K Y+Y R
Sbjct: 375 CGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKRNSIRLSEVWLDEYAK-YYYQR 428
>gi|348510947|ref|XP_003443006.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Oreochromis niloticus]
Length = 567
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 132/345 (38%), Positives = 201/345 (58%), Gaps = 22/345 (6%)
Query: 35 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
+AY AA LG+ + N++ S+ + +R I D R C Y DLP S+++
Sbjct: 86 KAYLAAKQLKLGDDPYKDHAFNLQESDRLGGERAIRDTRHYRCAALTYDTDLPSTSIVIT 145
Query: 90 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
FHNE S+L+RT+ S++ R+P ++EIIL+DDFSS + Q L KVR +RN
Sbjct: 146 FHNEARSTLLRTIKSVLMRSPPSLIQEIILIDDFSSDPEDCQLLAQI-----PKVRCLRN 200
Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
REGLIR+R RGA + ++ FLD+HCEV +WL P++ + D + P+ID I
Sbjct: 201 GRREGLIRSRVRGANMASASILTFLDSHCEVNTDWLQPMIQRVKEDHTRVVSPIIDVISL 260
Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+ + + RG F+W + +K ++P + R ++ ++P AGG+F MDR+
Sbjct: 261 DNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMARSDPTQAIRTPVIAGGIFVMDRS 317
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F LG YD + +WGGENFELSF++W+CGGS+E +PCSR+GHV+R PY+F
Sbjct: 318 WFNHLGQYDTHMDIWGGENFELSFRVWLCGGSLEILPCSRVGHVFRKRHPYDFP------ 371
Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N +R E W DE +K Y+Y+ P A G ++++
Sbjct: 372 EGNALTYIKNTRRAAEVWMDE-YKQYYYSARPSAQGKAFGSVTDR 415
>gi|363734723|ref|XP_003641443.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 isoform 2
[Gallus gallus]
Length = 557
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 140/346 (40%), Positives = 203/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y DLP S+I+
Sbjct: 73 KAY-LSSKLLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRCTSVRYDTDLPATSLII 131
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTP ++EIILVDDFSS + D +L I KV+ +R
Sbjct: 132 TFHNEARSALLRTVKSVLNRTPPNLIQEIILVDDFSSDPE-DCQLLTRIP----KVKCLR 186
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA+ + +++ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 187 NIRREGLIRSRVRGAEAATADILTFLDSHCEVNSEWLQPMLQRVKEDYTRVVSPIIDVIS 246
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R ++ ++P AGG+F +++
Sbjct: 247 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVINK 303
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PY+F
Sbjct: 304 SWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYDFP----- 358
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G I+++
Sbjct: 359 -EGNALTYIKNTKRTAEVWMDE-YKQYYYEARPSAIGKSYGSIADR 402
>gi|326920610|ref|XP_003206562.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Meleagris gallopavo]
Length = 509
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 139/346 (40%), Positives = 201/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y DLP S+I+
Sbjct: 25 KAY-LSSKLLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRCTSVRYDADLPATSLII 83
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTP ++EIILVDDFSS + Q L KV+ +R
Sbjct: 84 TFHNEARSALLRTVKSVLNRTPPNLIQEIILVDDFSSDPEDCQLLTKI-----PKVKCLR 138
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA+ + +++ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 139 NIRREGLIRSRVRGAEVATADILTFLDSHCEVNSEWLQPMLQRVKEDYTRVVSPIIDVIS 198
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R ++ ++P AGG+F +++
Sbjct: 199 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVINK 255
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PY+F
Sbjct: 256 SWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYDFP----- 310
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G I+++
Sbjct: 311 -EGNALTYIKNTKRTAEVWMDE-YKQYYYEARPSAIGKSYGSIADR 354
>gi|328699727|ref|XP_001944936.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Acyrthosiphon pisum]
Length = 581
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 137/343 (39%), Positives = 208/343 (60%), Gaps = 19/343 (5%)
Query: 36 AYRAAG-----DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVF 90
AY AAG D + N S+ + +R +PD R +C Y +DLP+ SVI+ F
Sbjct: 93 AYVAAGGLRHGDDAYSRNKFNQLASDSLRSNRPVPDTRNAKCLTKKYRIDLPQTSVIITF 152
Query: 91 HNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNT 150
HNE S+L+RTV S++ R+P ++EIILVDDFS + Q+L IQ KV+LIRN
Sbjct: 153 HNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDDSTDGQELSK-IQ----KVKLIRNE 207
Query: 151 EREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQ 210
+REGL+R+R RG++ + V+ FLD+H E +NWL PLL + D + P+ID I+
Sbjct: 208 KREGLMRSRVRGSEIATAPVLTFLDSHVECNVNWLEPLLDRVAEDPTRVVCPIIDVINMD 267
Query: 211 TWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
+++ RG F+W +++K L E A+++K + P ++P AGGLF MD+
Sbjct: 268 NFQY---IGASSELRGGFDWNLVFKWEYLSKEVRAQRQKDPTLPIRTPMIAGGLFVMDKD 324
Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
+F++LG YD + +WGGEN E+SF++W CGGS+E +PCSR+GHV+R PY F +
Sbjct: 325 YFVKLGTYDKEMNIWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYTFPGGS--- 381
Query: 330 KGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + +N +R E W D+ +K Y+Y PL+ + G+I+++
Sbjct: 382 -GNVFAHNTRRAAEVWMDQ-YKRYYYNAVPLSRIVPFGNIADR 422
>gi|328723398|ref|XP_001946977.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
isoform 1 [Acyrthosiphon pisum]
gi|328723400|ref|XP_003247833.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
isoform 2 [Acyrthosiphon pisum]
Length = 624
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 131/307 (42%), Positives = 196/307 (63%), Gaps = 11/307 (3%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRT 109
N+ S+ I +R++PD+R + C+ +D LP ++VI+VFHNE +S+LMRTV S+I R+
Sbjct: 130 NLMASDRIPLNRSLPDVRKKSCRLKKIDIDKLPSSTVIIVFHNEAWSTLMRTVQSVIDRS 189
Query: 110 PAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGE 169
P L EIILVDD S++ L+++L+DY+ + R+IR+ +R GLI+ R GA++++G+
Sbjct: 190 PKYLLNEIILVDDASTRKFLEKELDDYVAKLPVLTRIIRSPKRIGLIKARLMGARQAKGK 249
Query: 170 VIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFE 229
++VFLDAHCE L WL L++ + DRK + PVID I +T+ + +E H+ G F
Sbjct: 250 ILVFLDAHCECTLGWLEALVSRVAEDRKRVVCPVIDIISDETFAYVRSFE--LHW-GAFN 306
Query: 230 WGMLYK--ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
W + ++ P+ +R ++ +++P AGGLFAMD+++F ELGGYD + +WGGE
Sbjct: 307 WDLHFRWYTRTTPDIMKGQRDI-TQAFRTPAMAGGLFAMDKSYFFELGGYDERMEIWGGE 365
Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
N ELSF++W CGGSIE PCS +GHV+R PY F V + N RV W D
Sbjct: 366 NLELSFRVWQCGGSIEIAPCSHVGHVFRKSSPYTFPGGVSHV----LYTNLARVALVWMD 421
Query: 348 EKHKAYF 354
E + YF
Sbjct: 422 EWQEFYF 428
>gi|363734725|ref|XP_001231965.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 isoform 1
[Gallus gallus]
Length = 563
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 140/346 (40%), Positives = 203/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L AG+ ++ N S+ +S DR I D R C Y DLP S+I+
Sbjct: 79 KAY-LSSKLLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRCTSVRYDTDLPATSLII 137
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ RTP ++EIILVDDFSS + D +L I KV+ +R
Sbjct: 138 TFHNEARSALLRTVKSVLNRTPPNLIQEIILVDDFSSDPE-DCQLLTRIP----KVKCLR 192
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA+ + +++ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 193 NIRREGLIRSRVRGAEAATADILTFLDSHCEVNSEWLQPMLQRVKEDYTRVVSPIIDVIS 252
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R ++ ++P AGG+F +++
Sbjct: 253 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVINK 309
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PY+F
Sbjct: 310 SWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYDFP----- 364
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N KR E W DE +K Y+Y P A+ G I+++
Sbjct: 365 -EGNALTYIKNTKRTAEVWMDE-YKQYYYEARPSAIGKSYGSIADR 408
>gi|449497211|ref|XP_002190803.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Taeniopygia guttata]
Length = 669
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 201/323 (62%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR+IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++K++P
Sbjct: 203 NQVESDKLRMDRSIPDTRHDQCQRKQWRIDLPATSVVITFHNEARSALLRTVVSVLKKSP 262
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
+ ++EIILVDD+S+ D D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 263 SHLIKEIILVDDYSNDPD-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 317
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +WL PLL + D+ + P+ID I+ +++ +G F+W
Sbjct: 318 LTFLDSHCECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASA---DLKGGFDW 374
Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+++F ELG YD + VWGGEN
Sbjct: 375 NLVFKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENL 434
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 435 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 489
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 490 YKNFYYAAVPSARNVPYGNIQSR 512
>gi|118403595|ref|NP_001072369.1| polypeptide N-acetylgalactosaminyltransferase 14 [Xenopus
(Silurana) tropicalis]
gi|111305707|gb|AAI21473.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 [Xenopus (Silurana)
tropicalis]
Length = 555
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 143/328 (43%), Positives = 192/328 (58%), Gaps = 18/328 (5%)
Query: 48 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
Y N S I DR I D R C Y DLP SVI+ FHNE S+L+RT+ S++
Sbjct: 77 YAFNQRESERIPSDRAIKDTRHYRCTELHYQSDLPPTSVIITFHNEARSTLLRTIRSVLN 136
Query: 108 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTERE-GLIRTRSRGAKES 166
RTP + EI+LVDDFS D D +L + KVR +RN +RE GLIR+R RGA +
Sbjct: 137 RTPMHLIHEILLVDDFSDNLD-DCRLLSKLP----KVRCLRNEQREAGLIRSRVRGAGVA 191
Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
+ V+ FLD+HCEV +WLPPLL I D + PVID I+ T+ + + RG
Sbjct: 192 QAAVLTFLDSHCEVNKDWLPPLLHRIKEDPTRVVSPVIDIINLDTFAYIAA---SSDLRG 248
Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F+W + +K +L + KR +EP K+P AGGLF +++++F LG YD + +WGG
Sbjct: 249 GFDWSLHFKWEQLSAEQKAKRLDPTEPIKTPVIAGGLFVIEKSWFNHLGKYDTAMDIWGG 308
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIET 344
ENFE+SF++WMCGGS+E +PCSR+GHV+R PY F +G TY N KR E
Sbjct: 309 ENFEISFRVWMCGGSLEIIPCSRVGHVFRKKHPYVFP------EGNANTYIKNTKRTAEV 362
Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
W DE K ++Y P A GDI ++
Sbjct: 363 WMDE-FKNHYYAARPAAQGRPYGDIQKR 389
>gi|195550891|ref|XP_002076130.1| GD11982 [Drosophila simulans]
gi|194201779|gb|EDX15355.1| GD11982 [Drosophila simulans]
Length = 541
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 140/384 (36%), Positives = 213/384 (55%), Gaps = 43/384 (11%)
Query: 28 GKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVI 87
GK +P + E N+ S+ IS +R++ D+R E C+ Y LP S++
Sbjct: 2 GKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGCRRKHYASKLPTTSIV 61
Query: 88 LVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLI 147
+VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++LE+Y+ + K ++
Sbjct: 62 IVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQLEEYVAKLPVKTFVL 121
Query: 148 RNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGI 207
R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I +R+ + P+ID I
Sbjct: 122 RTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARIVQNRRTVVCPIIDVI 181
Query: 208 DYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAM 266
+T+E+ + D + G F W + ++ +P RE +R + + P ++PT AGGLF++
Sbjct: 182 SDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSI 238
Query: 267 DRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF-GKL 325
D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +GHV+R PY F G +
Sbjct: 239 DKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGV 298
Query: 326 ADRV-------------------------------------KGPLITYNYKRVIETWFDE 348
A V ++ +N R++E W D+
Sbjct: 299 AKIVLHNAARVWMCGGVLEIAPCSRVGHVFRKSTPYTFPGGTTEIVNHNNARLVEVWLDD 358
Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
K ++Y+ P A GD+S++
Sbjct: 359 -WKEFYYSFYPGARKASAGDVSDR 381
>gi|148706465|gb|EDL38412.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14, isoform CRA_a [Mus
musculus]
Length = 515
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 140/336 (41%), Positives = 193/336 (57%), Gaps = 22/336 (6%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R + C Y DLP S+I+ FHNE S+L+
Sbjct: 31 VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 90
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
RT+ S++ RTP ++EIILVDDFS+ ED Q KV+ +RN ER+GL+R+
Sbjct: 91 RTIRSVLNRTPMHLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNNERQGLVRS 144
Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLL---APIYSDRKIMTVPVIDGIDYQTWEFR 215
R RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ +
Sbjct: 145 RMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEVLQDYTRVVCPVIDIINLDTFNY- 203
Query: 216 SVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELG 275
E RG F+W + ++ +L + R +EP ++P AGGLF +D+A+F LG
Sbjct: 204 --IESASELRGGFDWSLHFQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLG 261
Query: 276 GYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLIT 335
YD + +WGGENFE+SF++WMCGG +E +PCSR+GHV+R PY F G T
Sbjct: 262 KYDVDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANT 315
Query: 336 Y--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
Y N KR E W DE +K Y+Y P A+ G+I
Sbjct: 316 YIKNTKRTAEVWMDE-YKQYYYAARPFALERPFGNI 350
>gi|113677422|ref|NP_001038460.1| polypeptide N-acetylgalactosaminyltransferase 14 [Danio rerio]
Length = 554
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 138/327 (42%), Positives = 189/327 (57%), Gaps = 23/327 (7%)
Query: 48 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
Y N S I +R + D R C Y DLP ++++ FHNE S+L+RTV S++
Sbjct: 79 YAFNQRESERIPSNRALRDTRHYRCTTLHYDPDLPSTTIVITFHNEARSTLLRTVRSVLN 138
Query: 108 RTPAQYLEEIILVDDFSSKAD---LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
RTP + EIILVDDFS + L KL KV+ +RN REGLIR+R RGA
Sbjct: 139 RTPVHLIHEIILVDDFSEDPNDCLLLTKLP--------KVKCLRNKHREGLIRSRVRGAD 190
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+ +++ FLD+HCEV +WLPPLL + D + PVID I+ T+ + +
Sbjct: 191 AAGAQILTFLDSHCEVNKDWLPPLLQRVKEDPTSVASPVIDIINMDTFAYVAA---SSDL 247
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RG F+W + +K +L + KR +EP K+P AGGLF +DR++F LG YD + +W
Sbjct: 248 RGGFDWSLHFKWEQLSAEKRAKRADPTEPIKTPIIAGGLFVIDRSWFNRLGKYDTAMDIW 307
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVI 342
GGENFE+SF++WMCGGS+E +PCSR+GHV+R PY F +G TY N +R
Sbjct: 308 GGENFEISFRVWMCGGSLEIIPCSRVGHVFRKKHPYIFP------EGNANTYIKNTRRTA 361
Query: 343 ETWFDEKHKAYFYTREPLAMFLDMGDI 369
E W DE K ++Y+ P A GDI
Sbjct: 362 EVWMDE-FKLFYYSARPAARGKSYGDI 387
>gi|71682529|gb|AAI00448.1| Galntl5 protein, partial [Mus musculus]
Length = 447
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 132/328 (40%), Positives = 198/328 (60%), Gaps = 11/328 (3%)
Query: 45 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
L YG+N S + +R +PD R + C+ YP +LP AS+I+ F+NE F++L+R V S
Sbjct: 94 LRRYGLNAIMSRRLGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSS 153
Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
++ +P LEEIILVDD S DL KL+ Y++ F GKV+LIRN +REGLIR++ GA
Sbjct: 154 VVNLSPQHLLEEIILVDDMSEFDDLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGAS 213
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+ G+++VFLD+HCEV WL PLL I D K++ P+ID I+ T ++ +
Sbjct: 214 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPIIDVINELTLDYMAA----PIV 269
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RG F+W + + + + E + S P +SP GG+FA++R +F ELG YD G+ +
Sbjct: 270 RGAFDWNLNLRWDNVFAYELDGPEGPSTPIRSPAMTGGIFAINRHYFNELGQYDNGMDIC 329
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
GGEN ELS +IWMCGG + +PCSR+G+ ++ + R ++ N RV+
Sbjct: 330 GGENVELSLRIWMCGGQLFILPCSRVGYNSKALSQHR------RANQSALSRNLLRVVHV 383
Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
W DE +K F+ + P ++ G+ISE+
Sbjct: 384 WLDE-YKGNFFLQRPSLTYVSCGNISER 410
>gi|148671133|gb|EDL03080.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5, isoform CRA_a
[Mus musculus]
Length = 490
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 133/328 (40%), Positives = 197/328 (60%), Gaps = 11/328 (3%)
Query: 45 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
L YG+N S + +R +PD R + C+ YP +LP AS+I+ F+NE F++L+R V S
Sbjct: 137 LRRYGLNAIMSRRLGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSS 196
Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
++ +P LEEIILVDD S DL KL+ Y++ F GKV+LIRN +REGLIR++ GA
Sbjct: 197 VVNLSPQHLLEEIILVDDMSEFDDLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGAS 256
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+ G+++VFLD+HCEV WL PLL I D K++ P+ID I+ T + Y
Sbjct: 257 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIV 312
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RG F+W + + + + E + S P +SP GG+FA++R +F ELG YD G+ +
Sbjct: 313 RGAFDWNLNLRWDNVFAYELDGPEGPSTPIRSPAMTGGIFAINRHYFNELGQYDNGMDIC 372
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
GGEN ELS +IWMCGG + +PCSR+G+ ++ + R ++ N RV+
Sbjct: 373 GGENVELSLRIWMCGGQLFILPCSRVGYNSKALSQHR------RANQSALSRNLLRVVHV 426
Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
W DE +K F+ + P ++ G+ISE+
Sbjct: 427 WLDE-YKGNFFLQRPSLTYVSCGNISER 453
>gi|12832954|dbj|BAB22325.1| unnamed protein product [Mus musculus]
Length = 429
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 135/325 (41%), Positives = 191/325 (58%), Gaps = 17/325 (5%)
Query: 40 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
GD Y N S IS +R +PD R + C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
RT+ S++ RTP ++EIILVDDFS+ + ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSNDPEDCKQLIKL-----PKVKCLRNNERQGLVRSR 183
Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 MRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---IE 240
Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
RG F+W + ++ ++ + R +EP ++P AGGLF +D+A+F LG YD
Sbjct: 241 SASELRGGFDWSLHFQWEQISLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDV 300
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF++WMCGG +E +PCSR+GHV+R PY F G TY N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354
Query: 338 YKRVIETWFDEKHKAYFYTREPLAM 362
KR E W DE +K Y+Y P A+
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFAL 378
>gi|326923175|ref|XP_003207815.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Meleagris gallopavo]
Length = 709
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 142/353 (40%), Positives = 209/353 (59%), Gaps = 10/353 (2%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
+ PG+ G +P+ + + E N+ S+ I DR I D R C DL
Sbjct: 208 QAPGQFGHPVAVPDDKQEEAKSRWKEGNFNVFLSDLIPVDRAIADTRPAGCLEQQVHDDL 267
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P ++I+ F +E +S+L+R+VHS++ R+P L+E+ILVDDFS+K L +KL+ Y+ +F
Sbjct: 268 PTTTIIMCFVDEVWSTLLRSVHSVLSRSPPHLLQELILVDDFSTKDYLKEKLDAYMSQFP 327
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
KV+++ ER GLIR R GA+ + G V+ FLD+H E + WL PLL + R +
Sbjct: 328 -KVKVLHLRERHGLIRARLAGAQMATGTVLTFLDSHVECNVGWLEPLLERVRLHRARVAC 386
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
PVI+ I + + +V D+ RGIF W M + ++P+ +K K ++ + P A
Sbjct: 387 PVIEVISDKDMSYMTV---DNFQRGIFTWPMNFGWKQIPQEVIEKNKLKETDIIRCPVMA 443
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLF++++ +F ELG YD GL VWGGEN ELSFK+WMCGG IE VPCSR+GH++R+ PY
Sbjct: 444 GGLFSVEKKYFFELGTYDSGLDVWGGENMELSFKVWMCGGEIEIVPCSRVGHIFRNDNPY 503
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDE-KHKAYFYTREPLAMFLDMGDISEQ 372
+F K DRV+ + N RV E W D K Y + L ++GD+S+Q
Sbjct: 504 SFPK--DRVR--TVERNLARVAEVWLDGYKELFYGHAYHLLQRRAELGDLSQQ 552
>gi|19922324|ref|NP_611043.1| GalNAc-T1, isoform A [Drosophila melanogaster]
gi|24653878|ref|NP_725472.1| GalNAc-T1, isoform B [Drosophila melanogaster]
gi|51315876|sp|Q6WV20.2|GALT1_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
Short=pp-GaNTase 1; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 1; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1
gi|10121393|gb|AAG13184.1|AF218236_1 polypeptide N-acetylgalactosaminyltransferase [Drosophila
melanogaster]
gi|7303062|gb|AAF58130.1| GalNAc-T1, isoform B [Drosophila melanogaster]
gi|21064373|gb|AAM29416.1| RE14585p [Drosophila melanogaster]
gi|21645385|gb|AAM70974.1| GalNAc-T1, isoform A [Drosophila melanogaster]
gi|220947986|gb|ACL86536.1| GalNAc-T1-PA [synthetic construct]
Length = 601
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 146/364 (40%), Positives = 207/364 (56%), Gaps = 13/364 (3%)
Query: 17 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
L+ K G GE G A HL A + GD + +N E S ++++R++ D R C
Sbjct: 83 LQKQKVGLGEQGVAVHLSGAAKERGDEIYKKIALNEELSEQLTYNRSVGDHRNPLCAKQR 142
Query: 77 YPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
+ D LP ASV+++F NE +S L+RTVHS + + L+EIILVDD S +L KL+
Sbjct: 143 FDSDSLPTASVVIIFFNEPYSVLLRTVHSTLSTCNEKALKEIILVDDGSDNVELGAKLDY 202
Query: 136 YIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIY 193
Y++ +GKV ++R R GLIR R GA+ + G+V++FLDAHCE + W PLL I
Sbjct: 203 YVRTRIPSGKVTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQRIK 262
Query: 194 SDRKIMTVPVIDGID-----YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
R + VP+ID ID Y T ++S + G F+W L + + +R K++
Sbjct: 263 ESRTSVLVPIIDVIDANDFQYSTNGYKSFQVGGFQWNGHFDWINLPEREKQRQRRECKQE 322
Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
P SPT AGGLFA+DR +F E+G YD + WGGEN E+SF+IW CGG+IE +PCS
Sbjct: 323 REICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETIPCS 382
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
R+GH++R F PY F DR + N R+ W DE +F R L D+GD
Sbjct: 383 RVGHIFRDFHPYKFPN--DRDTHGI---NTARMALVWMDEYINIFFLNRPDLKFHADIGD 437
Query: 369 ISEQ 372
++ +
Sbjct: 438 VTHR 441
>gi|113931290|ref|NP_001039091.1| polypeptide N-acetylgalactosaminyltransferase-like 1 [Xenopus
(Silurana) tropicalis]
gi|89268082|emb|CAJ83416.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Xenopus
(Silurana) tropicalis]
gi|111305589|gb|AAI21348.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Xenopus
(Silurana) tropicalis]
gi|134026192|gb|AAI35810.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Xenopus
(Silurana) tropicalis]
Length = 562
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 135/343 (39%), Positives = 198/343 (57%), Gaps = 17/343 (4%)
Query: 32 HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFH 91
+L + AG+ ++ N S+ +S +R I D R C + DLP SVI+ FH
Sbjct: 80 YLSSKFIKAGEDPYRQHAFNQLESDKLSSERPIRDTRHYRCTSVHHDNDLPSTSVIITFH 139
Query: 92 NEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTE 151
NE S+L+RT+ S++ R+P ++EIILVDDFS+ D Q L KV+ +RN
Sbjct: 140 NEARSTLLRTIKSVLIRSPGNLIQEIILVDDFSTDPDDCQLLTKI-----PKVKCLRNNR 194
Query: 152 REGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQT 211
REGLIR+R RGA+ + V+ FLD+HCEV WL PLL + D + P+ID I
Sbjct: 195 REGLIRSRVRGAELAAAPVLTFLDSHCEVNNEWLQPLLQRVKDDHTRVVSPIIDVISLDN 254
Query: 212 WEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFF 271
+ + + RG F+W + +K ++P + R + ++P AGG+F +D+++F
Sbjct: 255 FAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSRTDPTSSIRTPVIAGGIFVIDKSWF 311
Query: 272 LELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG 331
+LG YD + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R PY F G
Sbjct: 312 NQLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYEFP------DG 365
Query: 332 PLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+TY N KR +E W DE +K Y+Y P A+ G ++++
Sbjct: 366 NALTYIKNTKRTVEVWMDE-YKQYYYQARPSAIGKSYGSVADR 407
>gi|29437281|gb|AAH49554.1| Galntl5 protein, partial [Mus musculus]
Length = 434
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 133/328 (40%), Positives = 197/328 (60%), Gaps = 11/328 (3%)
Query: 45 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
L YG+N S + +R +PD R + C+ YP +LP AS+I+ F+NE F++L+R V S
Sbjct: 78 LRRYGLNAIMSRRLGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSS 137
Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
++ +P LEEIILVDD S DL KL+ Y++ F GKV+LIRN +REGLIR++ GA
Sbjct: 138 VVNLSPQHLLEEIILVDDMSEFDDLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGAS 197
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+ G+++VFLD+HCEV WL PLL I D K++ P+ID I+ T + Y
Sbjct: 198 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIV 253
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RG F+W + + + + E + S P +SP GG+FA++R +F ELG YD G+ +
Sbjct: 254 RGAFDWNLNLRWDNVFAYELDGPEGPSTPIRSPAMTGGIFAINRHYFNELGQYDNGMDIC 313
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
GGEN ELS +IWMCGG + +PCSR+G+ ++ + R ++ N RV+
Sbjct: 314 GGENVELSLRIWMCGGQLFILPCSRVGYNSKALSQHR------RANQSALSRNLLRVVHV 367
Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
W DE +K F+ + P ++ G+ISE+
Sbjct: 368 WLDE-YKGNFFLQRPSLTYVSCGNISER 394
>gi|12838270|dbj|BAB24147.1| unnamed protein product [Mus musculus]
Length = 424
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 133/328 (40%), Positives = 197/328 (60%), Gaps = 11/328 (3%)
Query: 45 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
L YG+N S + +R +PD R + C+ YP +LP AS+I+ F+NE F++L+R V S
Sbjct: 78 LRRYGLNAIMSRRLGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSS 137
Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
++ +P LEEIILVDD S DL KL+ Y++ F GKV+LIRN +REGLIR++ GA
Sbjct: 138 VVNLSPQHLLEEIILVDDMSEFDDLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGAS 197
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+ G+++VFLD+HCEV WL PLL I D K++ P+ID I+ T + Y
Sbjct: 198 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIV 253
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RG F+W + + + + E + S P +SP GG+FA++R +F ELG YD G+ +
Sbjct: 254 RGAFDWNLNLRWDNVFAYELDGPEGPSTPIRSPAMTGGIFAINRHYFNELGQYDNGMDIC 313
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
GGEN ELS +IWMCGG + +PCSR+G+ ++ + R ++ N RV+
Sbjct: 314 GGENVELSLRIWMCGGQLFILPCSRVGYNSKALSQHR------RANQSALSRNLLRVVHV 367
Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
W DE +K F+ + P ++ G+ISE+
Sbjct: 368 WLDE-YKGNFFLQRPSLTYVSCGNISER 394
>gi|254553456|ref|NP_080725.2| putative polypeptide N-acetylgalactosaminyltransferase-like protein
5 [Mus musculus]
gi|51316084|sp|Q9D4M9.2|GLTL5_MOUSE RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5;
AltName: Full=Polypeptide GalNAc transferase 15;
Short=GalNAc-T15; Short=pp-GaNTase 15; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 15;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 15
gi|148671134|gb|EDL03081.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5, isoform CRA_b
[Mus musculus]
gi|148877565|gb|AAI45758.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [Mus musculus]
Length = 431
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 133/328 (40%), Positives = 197/328 (60%), Gaps = 11/328 (3%)
Query: 45 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
L YG+N S + +R +PD R + C+ YP +LP AS+I+ F+NE F++L+R V S
Sbjct: 78 LRRYGLNAIMSRRLGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSS 137
Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
++ +P LEEIILVDD S DL KL+ Y++ F GKV+LIRN +REGLIR++ GA
Sbjct: 138 VVNLSPQHLLEEIILVDDMSEFDDLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGAS 197
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+ G+++VFLD+HCEV WL PLL I D K++ P+ID I+ T + Y
Sbjct: 198 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIV 253
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RG F+W + + + + E + S P +SP GG+FA++R +F ELG YD G+ +
Sbjct: 254 RGAFDWNLNLRWDNVFAYELDGPEGPSTPIRSPAMTGGIFAINRHYFNELGQYDNGMDIC 313
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
GGEN ELS +IWMCGG + +PCSR+G+ ++ + R ++ N RV+
Sbjct: 314 GGENVELSLRIWMCGGQLFILPCSRVGYNSKALSQHR------RANQSALSRNLLRVVHV 367
Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
W DE +K F+ + P ++ G+ISE+
Sbjct: 368 WLDE-YKGNFFLQRPSLTYVSCGNISER 394
>gi|195120313|ref|XP_002004673.1| GI20058 [Drosophila mojavensis]
gi|193909741|gb|EDW08608.1| GI20058 [Drosophila mojavensis]
Length = 668
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 137/347 (39%), Positives = 197/347 (56%), Gaps = 18/347 (5%)
Query: 7 DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPD 66
D + EP K+G G G+ +P R N+ S+ I +RT+ D
Sbjct: 80 DYNINQFEP-----KQGEGADGRPVIIPLRDRFRMQRFFKLNSFNLLASDRIPLNRTLKD 134
Query: 67 LRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
R EC+ Y ++P SVI+VFHNE +S L+RT+ S+I R+P L EIILVDD S +
Sbjct: 135 YRTNECREKRYTQNMPTTSVIIVFHNEAWSVLLRTITSVINRSPRHLLREIILVDDASDR 194
Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
+ L ++LE YI+ RL R ER GL+ R GA+ +RG+V+ FLDAHCE WL
Sbjct: 195 SFLKRQLEAYIEVLKVPTRLYRMKERSGLVPARLMGAQHARGDVLTFLDAHCECSRGWLE 254
Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK----ENELPER 242
PLLA I R ++ PVID I + + +E +H+ G F W + ++ + + +
Sbjct: 255 PLLARIKESRNVVICPVIDIISDDNFSYTKTFE--NHW-GAFNWQLSFRWFSSDRKTRQA 311
Query: 243 EAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSI 302
AK+ K ++ P +P AGGLFA+DR +F E+G YD + +WGGEN E+SF+IW CGG I
Sbjct: 312 IAKENKDSTAPIATPGMAGGLFAIDRKYFYEMGAYDRDMRIWGGENVEMSFRIWQCGGRI 371
Query: 303 EWVPCSRIGHVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDE 348
E PCS +GH++RS PY F G +++ ++T N R W D+
Sbjct: 372 EISPCSHVGHIFRSSTPYTFPGGMSE-----VLTSNLARAATVWMDD 413
>gi|403258971|ref|XP_003922013.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
2 [Saimiri boliviensis boliviensis]
Length = 967
Score = 260 bits (664), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 148/380 (38%), Positives = 215/380 (56%), Gaps = 39/380 (10%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR-------------- 68
PG+ G+ +P + E N+ S+ I DR I D R
Sbjct: 437 APGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGEQLLLPLFPCS 496
Query: 69 -------------MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
+ C +LP SVI+ F +E +S+L+R+VHS++ R+P ++
Sbjct: 497 HMTLAEIKTSLFLIHGCTEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIK 556
Query: 116 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
EI+LVDDFS+K L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD
Sbjct: 557 EILLVDDFSTKDYLKDNLDKYMSQF-PKVRILRLRERHGLIRARLAGAQNATGDVLTFLD 615
Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 235
+H E + WL PLL +Y RK + PVI+ I+ + + +V D+ RGIF W M +
Sbjct: 616 SHVECNVGWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFG 672
Query: 236 ENELP-EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
+P + AK R ++ + P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK
Sbjct: 673 WRTIPPDVIAKNRIKETDVIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFK 732
Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
+WMCGG IE +PCSR+GH++R+ PY+F K DR+K + N RV E W DE +K F
Sbjct: 733 VWMCGGEIEIIPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELF 787
Query: 355 YTR--EPLAMFLDMGDISEQ 372
Y + LD+G++++Q
Sbjct: 788 YGHGDHLINQGLDVGNLTQQ 807
>gi|358336356|dbj|GAA28182.2| polypeptide N-acetylgalactosaminyltransferase [Clonorchis sinensis]
Length = 592
Score = 260 bits (664), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 154/382 (40%), Positives = 211/382 (55%), Gaps = 20/382 (5%)
Query: 2 PVFKADGKLGNLEPPLEPYKE-----GPGEGGKAYHLPEAY-----RAAGDASLGEYGMN 51
PV +L L P P K GPGEG Y + + +A D + N
Sbjct: 62 PVLARPKELSGLSPSYPPPKSDQNSVGPGEGAVPYLVNRSALSVEEQAKYDKGFQDNAFN 121
Query: 52 METSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPA 111
S+ IS R IPD R CK + DLPK +VI+ FHNE +S+L+R+VHS++ +P
Sbjct: 122 QYASDRISVRRYIPDFRNGACKTQSFSSDLPKTAVIICFHNEAWSALLRSVHSVLDYSPK 181
Query: 112 QYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVI 171
+ L+EIILVDDFSS+ L + LE Y+Q+F V++IR REGLIR R G S EV+
Sbjct: 182 ELLQEIILVDDFSSRDYLKEPLEIYMQQF-PVVKIIRTKRREGLIRARMVGTNVSTAEVL 240
Query: 172 VFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWG 231
+LD+H E WL PLL I + + VPVI+ I+ Q ++ E G F+W
Sbjct: 241 TYLDSHIECTPGWLEPLLERIKASTSNVVVPVIEIINDQDLSMKATQEASVQVGG-FDWS 299
Query: 232 MLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFEL 291
+ + + P+R+ + P +SPT AGGLFA+ R FF LG YD + VWGGEN EL
Sbjct: 300 LTFTWHLPPKRDQIRLGAPYSPIRSPTMAGGLFAIHRDFFAYLGYYDEEMEVWGGENLEL 359
Query: 292 SFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHK 351
SFK WMCGG +E V CS +GH++RS PY++ + + I +N R+ ETW D+
Sbjct: 360 SFKTWMCGGQLETVVCSHVGHIFRSRSPYSW----ESKRTSPIKFNLVRLAETWLDDYKF 415
Query: 352 AYFYTREPLAMFL-DMGDISEQ 372
Y+ + L L D GDIS +
Sbjct: 416 LYY---DSLNFDLGDYGDISSR 434
>gi|395732382|ref|XP_002812541.2| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 5 [Pongo abelii]
Length = 967
Score = 260 bits (664), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 150/387 (38%), Positives = 218/387 (56%), Gaps = 41/387 (10%)
Query: 16 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR------- 68
P +P + PG+ G+ +P + E N+ S+ I DR I D R
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGGQLF 489
Query: 69 --------------------MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
+ C +LP SVI+ F +E +S+L+R+VHS++ R
Sbjct: 490 LPLFPYSHMTLAEIKTPLFLIHGCAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNR 549
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
+P ++EI+LVDDFS+K L L+ Y+ +F KVR++R ER GLIR R GA+ + G
Sbjct: 550 SPPHLIKEILLVDDFSTKDYLKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATG 608
Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
+V+ FLD+H E + WL PLL +Y RK + PVI+ I+ + + +V D+ RGIF
Sbjct: 609 DVLTFLDSHVECNVGWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIF 665
Query: 229 EWGMLYKENELP-EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
W M + +P + AK R ++ + P AGGLF++D+++F ELG YDPGL VWGGE
Sbjct: 666 VWPMNFGWRTIPPDVIAKNRIKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGE 725
Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
N ELSFK+WMCGG IE +PCSR+GH++R+ PY+F K DR+K + N RV E W D
Sbjct: 726 NMELSFKVWMCGGEIEIIPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLVRVAEVWLD 781
Query: 348 EKHKAYFYTR--EPLAMFLDMGDISEQ 372
E +K FY + LD G++++Q
Sbjct: 782 E-YKELFYGHGDHLIDQGLDAGNLTQQ 807
>gi|196007338|ref|XP_002113535.1| hypothetical protein TRIADDRAFT_27318 [Trichoplax adhaerens]
gi|190583939|gb|EDV24009.1| hypothetical protein TRIADDRAFT_27318 [Trichoplax adhaerens]
Length = 455
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 140/345 (40%), Positives = 204/345 (59%), Gaps = 15/345 (4%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY A + DA + N + + DR +PD R C+ +Y LP SVI+
Sbjct: 14 KAYIGATALKQGEDAYI-RNAFNQAECDKLPTDRGVPDTRDYSCRSLEYKHKLPTTSVII 72
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RT+ S++ R+P++ L+EIILVDDFS A+ D +L + KV+ +R
Sbjct: 73 TFHNEARSALLRTIRSVLNRSPSELLKEIILVDDFSDNAN-DGRLLKILP----KVKTLR 127
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N +REGLIR+R RGA ++G+V+ FLD+HCEV WL PLL+ + + I+ P+ID I
Sbjct: 128 NNKREGLIRSRVRGADLAKGDVLTFLDSHCEVNERWLEPLLSRVAQNETIVVSPIIDVIH 187
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK-YNSEPYKSPTHAGGLFAMD 267
T+ + +G F W + +K + + E +R + + P K+P AGGLF++
Sbjct: 188 MDTFNY---IGSSADLKGGFGWNLNFKWDSMTSEEQSQRAAHPTRPIKTPMIAGGLFSIS 244
Query: 268 RAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLAD 327
+ +F++ G YD G+ VWGGEN E+S +IWMCGGS+E VPCSR+GHV+R PY F
Sbjct: 245 KNWFIKSGKYDMGMDVWGGENLEISLRIWMCGGSLEIVPCSRVGHVFRKRHPYTFPGGG- 303
Query: 328 RVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
G + N +R E W D K ++Y REP A + GDIS++
Sbjct: 304 ---GFVFAKNTRRAAEAWMDGYAK-FYYKREPGARGVPYGDISDR 344
>gi|363731636|ref|XP_419581.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Gallus
gallus]
Length = 566
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 200/323 (61%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++K++P
Sbjct: 100 NQVESDKLRMDRNIPDTRHDQCQRKQWRIDLPATSVVITFHNEARSALLRTVVSVLKKSP 159
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
+ ++EIILVDD+S+ D D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 160 SHLIKEIILVDDYSNDPD-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 214
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +WL PLL + D+ + P+ID I+ +++ +G F+W
Sbjct: 215 LTFLDSHCECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 271
Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+++F ELG YD + VWGGEN
Sbjct: 272 NLVFKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENL 331
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 332 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 386
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 387 YKNFYYAAVPSARNVPYGNIQSR 409
>gi|332839183|ref|XP_001147578.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
5 [Pan troglodytes]
Length = 638
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 206/351 (58%), Gaps = 18/351 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
CS +GHV+R+ P+ F K +I N R+ E W D +K FY R
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRR 446
>gi|348518337|ref|XP_003446688.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Oreochromis niloticus]
Length = 598
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 138/331 (41%), Positives = 188/331 (56%), Gaps = 17/331 (5%)
Query: 41 GDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMR 100
GD Y N S I DR + D R C Y +LP S+I+ FHNE S+L+R
Sbjct: 116 GDDPYTLYAFNQRESERIPSDRALRDTRHYRCTTLHYDSELPSTSIIITFHNEARSTLLR 175
Query: 101 TVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRS 160
T+ S++ RTP + EIILVDDFS Q L KV+ RN +REGLIR+R
Sbjct: 176 TIKSVLNRTPVHLIYEIILVDDFSDDESDCQLLTKL-----PKVKCFRNNKREGLIRSRV 230
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
RG +R +V+ FLD+HCEV +WLPPLL I D + PVID I+ T+ + +
Sbjct: 231 RGTDAARAKVLTFLDSHCEVNKDWLPPLLQRIKEDPSRVVSPVIDIINMDTFAYVAA--- 287
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F+W + +K +L + +R ++P K+P AGGLF +DRA+F LG YD
Sbjct: 288 SADLRGGFDWSLHFKWEQLSPEQRARRTDPTQPIKTPIIAGGLFVIDRAWFNHLGKYDTA 347
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NY 338
+ +WGGENFE+SF++W CGGS+E +PCSR+GHV+R PY F +G TY N
Sbjct: 348 MDIWGGENFEISFRVWQCGGSLEILPCSRVGHVFRKKHPYVFP------EGNANTYIKNT 401
Query: 339 KRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+R E W D+ + ++Y+ P A GDI
Sbjct: 402 RRTAEVWMDD-FRLFYYSARPAARGKSYGDI 431
>gi|410916145|ref|XP_003971547.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Takifugu rubripes]
Length = 579
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 139/331 (41%), Positives = 188/331 (56%), Gaps = 17/331 (5%)
Query: 41 GDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMR 100
GD Y N S I +R + D R C Y DLP S+I+ FHNE S+L+R
Sbjct: 97 GDDPYTLYAFNQRESERIPSNRALRDTRHFRCATIRYDSDLPPTSIIITFHNEARSTLLR 156
Query: 101 TVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRS 160
TV S++ RTP + EIILVDDFS Q L KVR +RN +REGLIR+R
Sbjct: 157 TVRSVLNRTPVHLIHEIILVDDFSDDESDCQLLIKL-----PKVRCVRNPQREGLIRSRV 211
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
RGA ++ V+ FLD+HCEV +WLPPLL I D + PVID I+ T+ + +
Sbjct: 212 RGADSAKAAVLTFLDSHCEVNKDWLPPLLQRIKQDPTRVVSPVIDIINMDTFAYVAA--- 268
Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
RG F+W + +K +L + +R ++P K+P AGGLF +DR++F LG YD
Sbjct: 269 SADLRGGFDWSLHFKWEQLSPEQRARRTDPAQPIKTPIIAGGLFVIDRSWFNHLGKYDTA 328
Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NY 338
+ +WGGENFE+SF++W CGGS+E +PCSR+GHV+R PY F +G TY N
Sbjct: 329 MDIWGGENFEISFRVWQCGGSLEILPCSRVGHVFRKKHPYVFP------EGNANTYIKNT 382
Query: 339 KRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+R E W D+ ++Y+ P A GDI
Sbjct: 383 RRTAEVWMDD-FSLFYYSARPAARGKSYGDI 412
>gi|47228720|emb|CAG07452.1| unnamed protein product [Tetraodon nigroviridis]
Length = 611
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 147/365 (40%), Positives = 213/365 (58%), Gaps = 20/365 (5%)
Query: 15 PPLEPYKEGPGEGGKAYH----LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRM 69
PP +P PG GKA+ PE + + + N S+ IS R++ D R
Sbjct: 100 PPQDP--GSPGADGKAFQKDQMTPEEENEKKEG-MTRHCFNQFASDRISLSRSLGDDTRP 156
Query: 70 EEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ +PA L+EIILVDD S+
Sbjct: 157 PECVERKFLRCPA-LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASAA 215
Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
L ++LE ++ + VR++R ER+GLI R GA ++GEV+ FLDAHCE WL
Sbjct: 216 DHLKEQLEVFVHQLK-IVRVVRQPERKGLITARLLGASVAQGEVLTFLDAHCECFHGWLE 274
Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY-RGIFEWGMLYKENELPEREAK 245
PLLA I + + P I ID +T++F H Y RG F+WG+ + ++PE K
Sbjct: 275 PLLARIVEEPTAVVSPEITTIDLETFQFNKPVASSHAYNRGNFDWGLTFGWEQIPEAARK 334
Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
RK + P K+PT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +
Sbjct: 335 LRKDETYPVKTPTFAGGLFSILKSYFEHIGTYDDKMEIWGGENIEMSFRVWQCGGQLEII 394
Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLD 365
PCS +GHV+R+ P+ F K D +IT N R+ E W D+ +K FY R A +
Sbjct: 395 PCSVVGHVFRTKSPHTFPKGTD-----VITRNQVRLAEVWMDD-YKKIFYRRNRNAENMA 448
Query: 366 MGDIS 370
D++
Sbjct: 449 KEDLT 453
>gi|260836667|ref|XP_002613327.1| hypothetical protein BRAFLDRAFT_118726 [Branchiostoma floridae]
gi|229298712|gb|EEN69336.1| hypothetical protein BRAFLDRAFT_118726 [Branchiostoma floridae]
Length = 545
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 195/311 (62%), Gaps = 12/311 (3%)
Query: 67 LRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
R CK YP LP SVI+ F +E FS++MR+VHSII RTP L E+ILVDD S++
Sbjct: 83 CRQVRCKTKKYPEYLPPTSVIMCFTDEAFSAVMRSVHSIINRTPPHLLAEVILVDDNSTR 142
Query: 127 ADLDQKLEDYIQRFNG--KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
A+L L+DY++R G KV+++ +REGLIR R RGA+++ G V+ FLDAH E + W
Sbjct: 143 AELKGHLDDYVRRQVGWDKVKVVHLEKREGLIRCRLRGAEKAVGPVLTFLDAHIECNVGW 202
Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRS-VYEPDHHYRGIFEWGMLYKENELPERE 243
+ PLL I+ +R + +P+I+ ID +T+E+ V + RG F W + + +PE E
Sbjct: 203 VEPLLHRIWENRSNVVMPIIEAIDDKTFEYHGGVQSSRYAQRGGFSWELHFDWRVIPEYE 262
Query: 244 AKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSI 302
K+ K + + P +SPT AGGLF++D+++F ELG YD + WGGEN ELSFKIWMCGG++
Sbjct: 263 IKRWKGDETTPIRSPTMAGGLFSIDKSYFYELGTYDDKMDTWGGENLELSFKIWMCGGTL 322
Query: 303 EWVPCSRIGHVYRSFMPYNFGKLADRVKGP-LITYNYKRVIETWFDEKHKAYFYTREPLA 361
E PCS++GHV+RS PY+ GP N RV+E W D +K FY P
Sbjct: 323 EQPPCSKVGHVFRSSAPYS------NPSGPKTFIRNTLRVVEVWLDS-YKDLFYALNPHM 375
Query: 362 MFLDMGDISEQ 372
GD+SE+
Sbjct: 376 QGEPYGDVSER 386
>gi|47216191|emb|CAG01225.1| unnamed protein product [Tetraodon nigroviridis]
Length = 586
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 136/346 (39%), Positives = 202/346 (58%), Gaps = 18/346 (5%)
Query: 29 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
KAY L E G ++ N+ S+ + +R I D R C Y +LP S+I+
Sbjct: 110 KAY-LTEKLLKPGVDPYQDHAFNVLESDRVGSERAIRDTRHYRCASISYDPELPSTSIII 168
Query: 89 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
FHNE S+L+RTV S++ R+P ++EIIL+DDFSS + D +L +I KVR +R
Sbjct: 169 TFHNEARSTLLRTVKSVLMRSPPSLIQEIILIDDFSSDPE-DCQLLVHIP----KVRCLR 223
Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
N REGLIR+R RGA + ++ FLD+HCEV +WL P++ + D + P+ID I
Sbjct: 224 NVRREGLIRSRVRGANAASAPILTFLDSHCEVNTDWLQPMIQRVKEDHTRVVSPIIDVIS 283
Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
+ + + RG F+W + +K ++P + R ++P ++P AGG+F MD+
Sbjct: 284 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMARSDPTQPIRTPVIAGGIFVMDK 340
Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
++F LG YD + +WGGENFELSF++WMCGGS+E +PCSR+GHV+R PY F
Sbjct: 341 SWFNRLGQYDTHMDIWGGENFELSFRVWMCGGSLEILPCSRVGHVFRKRHPYEFP----- 395
Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+G +TY N +R E W DE +K Y+Y+ P A G I+++
Sbjct: 396 -EGNALTYIRNTRRAAEVWMDE-YKQYYYSARPSAQGKAFGSITDR 439
>gi|327274929|ref|XP_003222227.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2-like [Anolis carolinensis]
Length = 605
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 142/345 (41%), Positives = 196/345 (56%), Gaps = 19/345 (5%)
Query: 32 HLPEA--YRAAGDAS------LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
HL E ++ DAS L YG N S I R +P++R C + +LP
Sbjct: 139 HLAEEDEFQNQTDASEQTIDGLEIYGFNEALSKQIPLHRELPEVRHPLCLQQEPSPNLPT 198
Query: 84 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
ASV++ FH+E +S+L+RTVHS++ P +L+EIILVDD S++ L L +YI + G
Sbjct: 199 ASVVICFHDEAWSTLLRTVHSVLDTAPRDFLKEIILVDDLSTQEYLKSSLSEYISKLPG- 257
Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
V+LIR+ R G+I+ R GA + GEV+VF+D+HCE WL PLL + SDR + PV
Sbjct: 258 VKLIRSNRRLGVIQGRMLGAARATGEVVVFMDSHCECHNGWLEPLLERLASDRSRIVSPV 317
Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 263
ID ID++T+++ E RG+F+W + + L E E K R P +SP GG+
Sbjct: 318 IDVIDWKTFQYHHTMELQ---RGVFDWKLDFHWKPLTEHEKKVRPSPVSPIRSPAVPGGV 374
Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
A+ R F GGYD + + GGEN ELS K W+CGGS+E +PCSR+GHVYR+ MPYNF
Sbjct: 375 IAVHRHHFQNTGGYDSDMTLLGGENIELSIKAWLCGGSVEILPCSRVGHVYRTGMPYNFS 434
Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
I N R+ ETW D K FY + LA + +
Sbjct: 435 ------DEKAIERNKIRIAETWLD-SFKHLFYQHDRLACLISKAE 472
>gi|312374382|gb|EFR21947.1| hypothetical protein AND_15990 [Anopheles darlingi]
Length = 669
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 137/328 (41%), Positives = 197/328 (60%), Gaps = 18/328 (5%)
Query: 51 NMETSNHISFDRTIPDLRMEECK--YWDYPLD---LPKASVILVFHNEGFSSLMRTVHSI 105
N + S+ + +R +PD R C+ W LP SVI+ FHNE S+L+RTV S+
Sbjct: 196 NQQASDGLKSNRELPDTRNAMCRRTSWSSATSIESLPATSVIITFHNEARSTLLRTVVSV 255
Query: 106 IKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKE 165
+ R+P + + EIILVDDFS + Q+L IQ KVRLIRN +REGL+R+R GA
Sbjct: 256 LNRSPERLIHEIILVDDFSDFPEDGQELAK-IQ----KVRLIRNAKREGLVRSRVTGAAA 310
Query: 166 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 225
+ +V+ FLD+HCE ++WL PLLA + D + PVID I T+++ R
Sbjct: 311 ATAKVLTFLDSHCECNVHWLEPLLARVAEDPTRVVCPVIDVISMDTFQY---IGASADLR 367
Query: 226 GIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
G F+W +++K L E K+R+ + + P ++P AGGLF +DR++F +LG YD + +W
Sbjct: 368 GGFDWNLVFKWEYLSGAERKERQRDPTAPIRTPMIAGGLFVIDRSYFEKLGTYDTQMDIW 427
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
GGEN E+SF++W CGGS+E +PCSR+GHV+R PY F G + N +R E
Sbjct: 428 GGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYTFPGGG---SGNIFAKNTRRAAEV 484
Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
W DE +K Y+Y PLA + GDI ++
Sbjct: 485 WMDE-YKRYYYAAVPLATNIPFGDIEDR 511
>gi|307215388|gb|EFN90069.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Harpegnathos
saltator]
Length = 493
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 132/308 (42%), Positives = 199/308 (64%), Gaps = 13/308 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
N+ S+ I +R++PD+R ++C +Y + LPK S+I+VFHNE +S+L+RTVHS+I R
Sbjct: 11 NLMASDRIPLNRSLPDVRKKKCISRYANLG-KLPKTSIIIVFHNEAWSTLLRTVHSVIDR 69
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
+P + LEEIILVDD S + L L++Y+++ + +++R+TER GLI+ R GA +++G
Sbjct: 70 SPRELLEEIILVDDNSEREFLKNPLDEYVKKLSVPTKVLRSTERVGLIKARLLGASDAKG 129
Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
EV+ FLDAHCE + WL PLL + + + PVID I+ T+ + +E H+ G F
Sbjct: 130 EVLTFLDAHCECTVGWLEPLLEAVGKNATRIISPVIDIINDNTFSYTRSFE--LHW-GAF 186
Query: 229 EWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
W + ++ L R K+R+ + EP+++P AGGLF+M+R +F +LG YD + +WGGE
Sbjct: 187 NWDLHFRWLTLNGRLLKERRESIVEPFRTPAMAGGLFSMNRNYFFQLGSYDDQMRIWGGE 246
Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWF 346
N ELSF+ W CGGSIE PCS +GH++R PY F G + D + G L+ RV W
Sbjct: 247 NLELSFRAWQCGGSIEIAPCSHVGHLFRKSSPYTFPGGVGDILYGNLV-----RVASVWM 301
Query: 347 DEKHKAYF 354
D+ + YF
Sbjct: 302 DQWAEFYF 309
>gi|194755004|ref|XP_001959782.1| GF13042 [Drosophila ananassae]
gi|190621080|gb|EDV36604.1| GF13042 [Drosophila ananassae]
Length = 599
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 147/369 (39%), Positives = 210/369 (56%), Gaps = 13/369 (3%)
Query: 12 NLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
+++ L+ + G GE G A HL A + G+A + +N E S + ++R++ D R
Sbjct: 76 SIQLDLQKQRVGLGEQGVAVHLTGAAKERGEAIYKKIALNEELSEQLLYNRSVGDHRNPL 135
Query: 72 CKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
C + +D LP ASV+++F NE +S L+RTVHS + + L+EIILVDD S +L
Sbjct: 136 CAAERFDVDTLPTASVVIIFFNEPYSVLLRTVHSTLTTCNEKALKEIILVDDGSDNPELG 195
Query: 131 QKLEDYIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
KL+ YI+ GKV ++R R GLIR R GA+ + G+V++FLDAHCE + W PL
Sbjct: 196 GKLDYYIRTRIPAGKVTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNIGWCEPL 255
Query: 189 LAPIYSDRKIMTVPVIDGID-----YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPERE 243
L I R + VP+ID ID Y T ++S + G F+W L + + +R
Sbjct: 256 LQRIKESRTSVLVPIIDVIDANDFQYSTNGYKSFQVGGFQWNGHFDWINLPEREKQRQRR 315
Query: 244 AKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIE 303
K++ P SPT AGGLFAMDR +F E+G YD + WGGEN E+SF+IW CGG+IE
Sbjct: 316 ECKQQREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIE 375
Query: 304 WVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMF 363
+PCSR+GH++R F PY F DR + N R+ W DE +F R L
Sbjct: 376 TIPCSRVGHIFRDFHPYKFPN--DRDTHGI---NTARMALVWMDEYINIFFLNRPDLKFH 430
Query: 364 LDMGDISEQ 372
D+GD++ +
Sbjct: 431 ADIGDVTHR 439
>gi|194384516|dbj|BAG59418.1| unnamed protein product [Homo sapiens]
Length = 603
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 150/370 (40%), Positives = 213/370 (57%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 87 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 144
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 145 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 203
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE L P
Sbjct: 204 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGRLEP 262
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 263 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 322
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 323 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 382
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
CS +GHV+R+ P+ F K +I N R+ E W D +K FY R +A
Sbjct: 383 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMAQ 436
Query: 363 FLDMGDISEQ 372
GDISE+
Sbjct: 437 EKSFGDISER 446
>gi|313241234|emb|CBY33515.1| unnamed protein product [Oikopleura dioica]
Length = 603
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 145/372 (38%), Positives = 214/372 (57%), Gaps = 26/372 (6%)
Query: 15 PPLEPYKEG----PGEGGKAYHLPEAYRAAGDAS--LGEYGMNMETSNHISFDRTIPDLR 68
PP+ P G G+GGK+ L E + + + + + +N S IS RT+ + R
Sbjct: 73 PPVLPRPLGDAITEGQGGKSVKLTEEQKKSDEYKKIVDRFMVNHLASERISLHRTVGEHR 132
Query: 69 MEEC-----KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 123
++C K + Y LP SVI+ F+NEG+++L+RT++SI+ +P L+EIIL+DD
Sbjct: 133 HKQCVALANKGYRYD-QLPTTSVIVTFYNEGWTTLLRTIYSILHTSPEVLLKEIILIDDD 191
Query: 124 SSKAD---LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
S K + L ++LED + +VRLIR +REGL+R R GA+ + GEV+ FLD H E
Sbjct: 192 SDKVEFPRLGKELEDIVATM-PRVRLIRTKQREGLVRARLLGAELASGEVLTFLDCHIEC 250
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
WL PLL I D ++ VP+I I +Q + F G F+W + ++ + +P
Sbjct: 251 NNGWLEPLLQRIAEDDSVVAVPIISTIAWQDFAFHHSSNSIEPQIGGFDWRLTFQWHSIP 310
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
+ KRK +++P +PT AGGLFA+ R +F +G YD G+ VWGGEN E+SF++WMCGG
Sbjct: 311 DEIKAKRKADTDPVPTPTMAGGLFAVSRQYFRSIGSYDTGMEVWGGENLEMSFRVWMCGG 370
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
S+E +PCS +GHV+ PY T N R +E W D+ +K +FY R PL
Sbjct: 371 SLEIIPCSIVGHVFPKTAPYERKSF---------TPNTVRAVEVWLDD-YKRHFYARNPL 420
Query: 361 AMFLDMGDISEQ 372
+ GDISE+
Sbjct: 421 SKDEKYGDISER 432
>gi|198474479|ref|XP_002132699.1| GA25744 [Drosophila pseudoobscura pseudoobscura]
gi|198138409|gb|EDY70101.1| GA25744 [Drosophila pseudoobscura pseudoobscura]
Length = 635
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 151/367 (41%), Positives = 213/367 (58%), Gaps = 21/367 (5%)
Query: 18 EPYKEGPGEGGKAYHLPEA--YRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
+ K G GE G + + Y+ S+ + G N S+ IS +RTI D R CK
Sbjct: 85 DSLKTGLGEQGLRIAIEDTKEYQEMIAMSIKK-GFNSLLSDKISVNRTIADTRPLRCKSR 143
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
Y + LP SVI+VFHN S L+R +HSII RTP + L E+ILVDD S+ +L ++L+
Sbjct: 144 KYLVKLPNVSVIMVFHNTHLSVLLRAIHSIINRTPHELLHEVILVDDGSTAQELQEQLDK 203
Query: 136 YI-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
Y+ + F KV +IR +R G+ R G + G V+VF DA EV NWLPPLL P+
Sbjct: 204 YVNEHFGSKVSIIRQKKRTGMPAARVAGVNSANGTVMVFCDASIEVIYNWLPPLLEPMTL 263
Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
KI+T P++D ID + F+ + +RG F+W + N+LP + + K S+PY
Sbjct: 264 HYKIVTSPILDEIDNTDFSFK--WSDPLLWRGGFDWH--FNFNKLPVLQ-EDIKGESQPY 318
Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
++P G +FA+DR +FLELGGYD GL GGE +E+SFKIWMCGG + VPCSR+GH+
Sbjct: 319 RNPVMEGTVFAIDRKYFLELGGYDEGLDASGGEQYEMSFKIWMCGGMLLQVPCSRVGHI- 377
Query: 315 RSFMP-------YNFGKLADRVKGP--LITYNYKRVIETWFDEKHKAYFYTREPLAMFLD 365
+ P + G+L + KG +T NYKRV E W D +K Y Y R+P ++
Sbjct: 378 -AIDPKDAQDPTWQKGELLESTKGEYDTLTRNYKRVAEVWMD-GYKHYLYLRDPYKYHIN 435
Query: 366 MGDISEQ 372
G+++ Q
Sbjct: 436 AGNVTRQ 442
>gi|348575518|ref|XP_003473535.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Cavia porcellus]
Length = 531
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 132/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R E+C+ + +DLP SV++ FHNE S+L+RTV S++KR+P
Sbjct: 65 NQVESDKLRMDRAIPDTRHEQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 124
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 125 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 179
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 180 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 236
Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 237 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 296
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 297 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 351
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 352 YKNFYYAAVPSARNVPYGNIQSR 374
>gi|195148070|ref|XP_002014997.1| GL18654 [Drosophila persimilis]
gi|194106950|gb|EDW28993.1| GL18654 [Drosophila persimilis]
Length = 635
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 144/334 (43%), Positives = 200/334 (59%), Gaps = 18/334 (5%)
Query: 49 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
G N S+ IS +RTI D R CK Y + LP SVI+VFHN S L+R +HSII R
Sbjct: 117 GFNSLLSDKISVNRTIADTRPLRCKSRKYLVKLPNVSVIMVFHNTHLSVLLRAIHSIINR 176
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESR 167
TP + L E+ILVDD S+ +L ++L+ Y+ + F KV +IR +R G+ R G +
Sbjct: 177 TPHELLHEVILVDDGSTAQELQEQLDKYVNEHFGSKVSIIRQKKRTGMPAARVAGVNSAN 236
Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
G V+VF DA EV NWLPPLL P+ KI+T P++D ID + F+ + +RG
Sbjct: 237 GTVMVFCDASIEVIYNWLPPLLEPMTLHYKIVTSPILDEIDNTDFSFK--WSDPLLWRGG 294
Query: 228 FEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
F+W + N+LP + + K S+PY++P G +FA+DR +FLELGGYD GL GGE
Sbjct: 295 FDWH--FNFNKLPVLQ-EDIKGESQPYRNPVMEGTVFAIDRKYFLELGGYDEGLDASGGE 351
Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP-------YNFGKLADRVKGP--LITYNY 338
+E+SFKIWMCGG + VPCSR+GH+ + P + G+L + KG +T NY
Sbjct: 352 QYEMSFKIWMCGGMLLQVPCSRVGHI--AIDPKDAQDPTWQKGELLESTKGEYDTLTRNY 409
Query: 339 KRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
KRV E W D +K Y Y R+P ++ G+++ Q
Sbjct: 410 KRVAEVWMD-GYKHYLYLRDPYKYHINAGNVTRQ 442
>gi|256052108|ref|XP_002569620.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
Length = 573
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 145/375 (38%), Positives = 216/375 (57%), Gaps = 20/375 (5%)
Query: 3 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
+ + GK G L E G+ G+ L E +A + N+ SN I R
Sbjct: 51 IADSSGKFG-----LHDQSEKFGDMGRPVVLSEFLKAESKLTFHLNEFNLVVSNLIGTRR 105
Query: 63 TIPDLRMEECKYWDYPLD--LP-KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
+ D R C++ PLD LP K SVI+VFHNE +S+L+RTVHS++ RTP Q L EIIL
Sbjct: 106 NLDDFRHPSCRH-QIPLDKLLPFKTSVIIVFHNEAWSALLRTVHSVLDRTPVQLLHEIIL 164
Query: 120 VDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
VDD S+++ L +L++Y++ N VR+ R + R GLIR R GAK S G+ + FLDAHCE
Sbjct: 165 VDDASTQSHLGDQLKNYVKSLNKPVRIERMSSRSGLIRARLHGAKISTGKTLTFLDAHCE 224
Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 239
V + WL LL I ++K + P+ID I + T+E+ + D + G F+W + +
Sbjct: 225 VTIGWLETLLKHISENQKRIVCPIIDVISHDTFEY--LLGSDRTW-GTFDWQFNFHWETV 281
Query: 240 PEREAKK-RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
+RE + ++ P ++PT AGGLF + R +F E+G YD + +WGGEN ELSF++W C
Sbjct: 282 VDREIDRINDEHNVPLRTPTMAGGLFTITREYFYEIGAYDEDMEIWGGENIELSFRVWQC 341
Query: 299 GGSIEWVPCSRIGHVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
GG + PCSR+GHV+R PY + G ++ ++ N+ R W D+ + YF
Sbjct: 342 GGELLIDPCSRVGHVFRKSSPYTWPGGVSH-----ILHKNFVRTALVWLDQYSRFYFML- 395
Query: 358 EPLAMFLDMGDISEQ 372
P A+ +D GD++++
Sbjct: 396 NPSALSVDYGDVTKR 410
>gi|383857913|ref|XP_003704448.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Megachile rotundata]
Length = 638
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 156/360 (43%), Positives = 206/360 (57%), Gaps = 29/360 (8%)
Query: 11 GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
G L P E PGE G+ LP E + D L N S+ IS RT+P
Sbjct: 86 GVLVAPREQDSSAPGEMGRPVILPTNLTAETKKLVDDGWLNN-AFNQYVSDLISVHRTLP 144
Query: 66 DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
D R CK Y +LP +VI+ FHNE +S L+RTVHS++ R+P ++EIILVDD+S
Sbjct: 145 DPRDPWCKEPGRYLKELPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDYS 204
Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
L ++LEDY+ + KV++IR +REGLIR R GA ++ V+ +LD+HCE W
Sbjct: 205 DMPHLQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGW 263
Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 237
L PLL I D + PVID ID T E+ H+R G F+W + + +
Sbjct: 264 LEPLLDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 315
Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
+PERE K+ K +EP SPT AGGLF++DRAFF LG YD G +WGGEN ELSFK WM
Sbjct: 316 AVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFERLGTYDSGFDIWGGENLELSFKTWM 375
Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
CGG++E VPCS +GH++R PY + R ++ N R+ E W DE K Y+Y R
Sbjct: 376 CGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKRNSIRLSEVWLDEYAK-YYYQR 429
>gi|281348732|gb|EFB24316.1| hypothetical protein PANDA_010523 [Ailuropoda melanoleuca]
Length = 621
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 151/370 (40%), Positives = 210/370 (56%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+H + + ++ N S+ IS R + PD R
Sbjct: 106 PPQDP--NSPGADGKAFHKDKWTPMETQEKEEGYKKHCFNAFASDRISLQRALGPDTRPP 163
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ +PA L EIILVDD S+
Sbjct: 164 ECVDQKFRRCP-PLPATSVIIVFHNEAWSTLLRTVYSVLHTSPAILLREIILVDDASTDD 222
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L +LE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 YLKDQLEQYVKKLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I + + P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEEETAVVSPDIVTIDLNTFEFSKPVPSGRIHSRGNFDWSLTFGWEALPAHEKQR 341
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +A+F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
CS +GHV+R+ P+ F K +I N R+ E W D +K FY R +A
Sbjct: 402 CSVVGHVFRTKSPHTFPKGIS-----VIARNQVRLAEVWMD-SYKEIFYRRNMQAAKMAQ 455
Query: 363 FLDMGDISEQ 372
GDISE+
Sbjct: 456 EKSFGDISER 465
>gi|189236651|ref|XP_969621.2| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
castaneum]
gi|270005204|gb|EFA01652.1| hypothetical protein TcasGA2_TC007223 [Tribolium castaneum]
Length = 564
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 137/324 (42%), Positives = 201/324 (62%), Gaps = 16/324 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N E S+++ +R IPD R C+ + DLP SVI+ FHNE S+L+RTV S++ R+P
Sbjct: 100 NQEASDNLPSNREIPDTRNAMCRRKLWRTDLPPTSVIITFHNEARSTLLRTVVSVLNRSP 159
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDDFS + ++L IQ KVR++RN +REGL+R+R RGA + V
Sbjct: 160 EHLIKEIILVDDFSDNPEDGEELAK-IQ----KVRVLRNDKREGLMRSRVRGADAATASV 214
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +NWL PLL + D + PVID I T+++ RG F+W
Sbjct: 215 LTFLDSHCECNVNWLEPLLERVAEDPTRVVCPVIDVISMDTFQYIGA---SADLRGGFDW 271
Query: 231 GMLYKENEL--PEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
+++K L ERE+++R ++ ++P AGGLF +++A+F +LG YD + VWGGEN
Sbjct: 272 NLVFKWEYLGYAERESRQRD-PTQAIRTPMIAGGLFVINKAYFEKLGKYDMKMDVWGGEN 330
Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W D+
Sbjct: 331 LEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYTFPGGS----GNVFARNTRRAAEVWMDD 386
Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y PLA + GDISE+
Sbjct: 387 -YKHFYYAAVPLAKNIPFGDISER 409
>gi|301772392|ref|XP_002921627.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Ailuropoda melanoleuca]
Length = 622
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 151/370 (40%), Positives = 210/370 (56%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+H + + ++ N S+ IS R + PD R
Sbjct: 106 PPQDP--NSPGADGKAFHKDKWTPMETQEKEEGYKKHCFNAFASDRISLQRALGPDTRPP 163
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ +PA L EIILVDD S+
Sbjct: 164 ECVDQKFRRCP-PLPATSVIIVFHNEAWSTLLRTVYSVLHTSPAILLREIILVDDASTDD 222
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L +LE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 YLKDQLEQYVKKLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I + + P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEEETAVVSPDIVTIDLNTFEFSKPVPSGRIHSRGNFDWSLTFGWEALPAHEKQR 341
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +A+F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
CS +GHV+R+ P+ F K +I N R+ E W D +K FY R +A
Sbjct: 402 CSVVGHVFRTKSPHTFPKGIS-----VIARNQVRLAEVWMD-SYKEIFYRRNMQAAKMAQ 455
Query: 363 FLDMGDISEQ 372
GDISE+
Sbjct: 456 EKSFGDISER 465
>gi|149714568|ref|XP_001504374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Equus
caballus]
Length = 622
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 152/371 (40%), Positives = 211/371 (56%), Gaps = 24/371 (6%)
Query: 15 PPLEPYKEGPGEGGKAYH----LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRM 69
PP +P PG GKA+ P+ + + ++ N S+ IS R + PD R
Sbjct: 106 PPQDP--SSPGADGKAFQKDKWTPQETQEK-EEGYKKHCFNAFASDRISLQRALGPDTRP 162
Query: 70 EEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ TPA L EIILVDD S+
Sbjct: 163 PECVDQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLREIILVDDASTD 221
Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
L ++LE Y+++ VR++R ER GLI R GA ++ EV+ FLDAHCE WL
Sbjct: 222 EYLKEQLEQYVKQLQ-VVRVVRQKERTGLITARLLGASVAQAEVLTFLDAHCECFHGWLE 280
Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
PLLA I D + P I ID T+EF + V H RG F+W + + LP E +
Sbjct: 281 PLLARIAEDETAVVSPDIVTIDLNTFEFSKPVQRGRVHSRGNFDWSLSFGWEALPPHEKQ 340
Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
+RK + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +
Sbjct: 341 RRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEII 400
Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLA 361
PCS +GHV+R+ P+ F K +I N R+ E W D +K FY R +A
Sbjct: 401 PCSVVGHVFRTKSPHTFPKGIS-----VIARNQVRLAEVWMD-GYKEIFYRRNMQAAKMA 454
Query: 362 MFLDMGDISEQ 372
GDISE+
Sbjct: 455 QEKSFGDISER 465
>gi|313231736|emb|CBY08849.1| unnamed protein product [Oikopleura dioica]
Length = 603
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 145/372 (38%), Positives = 214/372 (57%), Gaps = 26/372 (6%)
Query: 15 PPLEPYKEG----PGEGGKAYHLPEAYRAAGDAS--LGEYGMNMETSNHISFDRTIPDLR 68
PP+ P G G+GGK+ L E + + + + + +N S IS RT+ + R
Sbjct: 73 PPVLPRPLGDAITEGQGGKSVKLTEEQKKSDEYKKIVDRFMVNHLASERISLHRTVGEHR 132
Query: 69 MEEC-----KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 123
++C K + Y LP SVI+ F+NEG+++L+RT++SI+ +P L+EIIL+DD
Sbjct: 133 HKQCVALANKGYRYD-QLPTTSVIVTFYNEGWTTLLRTIYSILHTSPEVLLKEIILIDDD 191
Query: 124 SSKAD---LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
S K + L ++LED + +VRLIR +REGL+R R GA+ + GEV+ FLD H E
Sbjct: 192 SDKVEFPRLGKELEDIVATM-PRVRLIRTKQREGLVRARLLGAELASGEVLTFLDCHIEC 250
Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
WL PLL I D ++ VP+I I +Q + F G F+W + ++ + +P
Sbjct: 251 NDGWLEPLLQRIAEDDSVVAVPIISTIAWQDFGFHHSSNSIEPQIGGFDWQLTFQWHSIP 310
Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
+ KRK +++P +PT AGGLFA+ R +F +G YD G+ VWGGEN E+SF++WMCGG
Sbjct: 311 DEIKAKRKADTDPVPTPTMAGGLFAVSRQYFRSIGSYDTGMEVWGGENLEMSFRVWMCGG 370
Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
S+E +PCS +GHV+ PY T N R +E W D+ +K +FY R PL
Sbjct: 371 SLEIIPCSIVGHVFPKTAPYERKSF---------TPNTVRAVEVWLDD-YKRHFYARNPL 420
Query: 361 AMFLDMGDISEQ 372
+ GDISE+
Sbjct: 421 SKDEKYGDISER 432
>gi|348513276|ref|XP_003444168.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Oreochromis niloticus]
Length = 575
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 152/364 (41%), Positives = 215/364 (59%), Gaps = 22/364 (6%)
Query: 14 EPPLEPYKEGPGEGGKAY--HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
+PPL+ E GE G+A +L E + + S+ + +N S+ IS R +P+
Sbjct: 59 KPPLD--LEAVGEMGRAVKLNLNEEEKRKEEESIKAHQINTYVSDKISLHRRLPERWNPL 116
Query: 72 CKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
CK Y LP SV++ F+NE +S+L+RTVHS+++ +P L+E++LVDD+S KA L
Sbjct: 117 CKELKYDYRSLPTTSVVIAFYNEAWSTLLRTVHSVLETSPDILLKEVVLVDDYSDKAHLK 176
Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
+ L+ YI N KVRLIR T+REGL+R R GA + GEV+ FLD HCE WL P+L
Sbjct: 177 EPLDKYISGLN-KVRLIRATKREGLVRARLLGASITTGEVLTFLDCHCECHEGWLEPVLH 235
Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRS-VYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
I + K + PVID ID+ T+++ EP G F+W +++ + +P+ E K+R+
Sbjct: 236 RIKEEPKAVVCPVIDVIDWNTFQYLGHAGEPQ---IGGFDWRLVFTWHSIPDYEQKRRRS 292
Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
+ +SPT AGGLFA+ + FF LG YD G+ VWGGEN E SF+IW CGGS+E PCS
Sbjct: 293 PVDVIRSPTMAGGLFAVRKDFFHYLGTYDTGMEVWGGENLEFSFRIWQCGGSLEVHPCSH 352
Query: 310 IGHVYRSFMPYNFGK-LADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
+GHV+ PY+ K LA+ V R E W DE K +Y R P A GD
Sbjct: 353 VGHVFPKKAPYSRSKALANSV----------RAAEVWLDE-FKEIYYHRNPHARLEAFGD 401
Query: 369 ISEQ 372
++E+
Sbjct: 402 VTER 405
>gi|18314429|gb|AAH22021.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [Homo sapiens]
gi|51105933|gb|EAL24517.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 15 [Homo sapiens]
gi|119574364|gb|EAW53979.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5, isoform CRA_c
[Homo sapiens]
gi|123979772|gb|ABM81715.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [synthetic
construct]
gi|123994539|gb|ABM84871.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [synthetic
construct]
Length = 443
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 134/328 (40%), Positives = 202/328 (61%), Gaps = 11/328 (3%)
Query: 45 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
L +YG N+ S + +R +PD R + YP LP AS+++ F+NE ++L +T+ S
Sbjct: 97 LLKYGFNVIISRSLGIEREVPDTRSKMRLQKHYPARLPTASIVICFYNEECNALFQTMSS 156
Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
+ TP +LEEIILVDD S DL +KL+ +++ F GKV++IRN +REGLIR R GA
Sbjct: 157 VTNLTPHYFLEEIILVDDMSKVDDLKEKLDYHLETFRGKVKIIRNKKREGLIRARLIGAS 216
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID ID +T E Y+P
Sbjct: 217 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPLV 272
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RG F+W + +K + + E + +++P +SP +GG+FA+ R +F E+G YD + W
Sbjct: 273 RGTFDWNLQFKWDNVFSYEMDGPEGSTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFW 332
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
G EN ELS +IWMCGG + +PCSR+GH+ + GK + + +T+NY R++
Sbjct: 333 GRENLELSLRIWMCGGQLFIIPCSRVGHISKK----QTGKPSTIISA--MTHNYLRLVHV 386
Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
W DE +K F+ R+P ++ G+I E+
Sbjct: 387 WLDE-YKEQFFLRKPGLKYVTYGNIRER 413
>gi|195171653|ref|XP_002026618.1| GL11821 [Drosophila persimilis]
gi|194111544|gb|EDW33587.1| GL11821 [Drosophila persimilis]
Length = 658
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 149/369 (40%), Positives = 205/369 (55%), Gaps = 29/369 (7%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSN 56
+P + D K ++PP + E GE GK LP + + A + N S+
Sbjct: 133 KPKLQDDTK-KVIDPPGN-FDENLGEMGKPVTLPKEMTDEMKKAVETGWTNNAFNQYVSD 190
Query: 57 HISFDRTIPDLRMEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
IS RT+PD R CK Y +LP VI+ FHNE ++ L+RTVHS++ R+P +
Sbjct: 191 LISVHRTLPDPRDAWCKDSAHYLSNLPATDVIICFHNEAWTVLLRTVHSVLDRSPEHLIG 250
Query: 116 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
IILVDD+S L +LEDY + KV++IR +REGLIR R GA+ ++ V+ +LD
Sbjct: 251 RIILVDDYSDMPHLKTQLEDYFAAYP-KVQIIRGKKREGLIRARLLGAQHAKAPVLTYLD 309
Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIF 228
+HCE WL PLL I + + PVID I T E+ HYR G F
Sbjct: 310 SHCECTEGWLEPLLDRIARNSTTVVCPVIDVISDDTLEY--------HYRDSSGVNVGGF 361
Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
+W + + + +PERE K+ +EP SPT AGGLF++DR +F LG YD G +WGGEN
Sbjct: 362 DWNLQFSWHAVPEREKKRHNSTAEPVYSPTMAGGLFSIDREYFNRLGTYDSGFDIWGGEN 421
Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
ELSFK WMCGG++E VPCS +GH++R PY + R ++ N R+ E W DE
Sbjct: 422 LELSFKTWMCGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLRKNSVRLAEVWMDE 476
Query: 349 KHKAYFYTR 357
+ Y+Y R
Sbjct: 477 -YSQYYYHR 484
>gi|167519663|ref|XP_001744171.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777257|gb|EDQ90874.1| predicted protein [Monosiga brevicollis MX1]
Length = 607
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 139/329 (42%), Positives = 194/329 (58%), Gaps = 23/329 (6%)
Query: 47 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
++ N++ S+ + DR +PD R + CK +YP +LP SVI VF+NE S L R++H ++
Sbjct: 125 QHCFNLKRSDSLPLDRPVPDHRDKRCKEIEYPHNLPTTSVIFVFYNEPLSPLYRSIHGVL 184
Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
RTP L EIILVDD S L + EDYI+ K +L+R +ER GL+ RS GA+ +
Sbjct: 185 DRTPEHLLHEIILVDDGSDADYLKKDFEDYIKLLP-KTKLVRKSERSGLMDARSYGAEVA 243
Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
G+ I FLDAH EV WL P++A I DRK + +P+ID ID ++ + RG
Sbjct: 244 TGDTITFLDAHIEVSKGWLEPMMARINEDRKHVVMPIIDSIDPDSFNY---------MRG 294
Query: 227 I-----FEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGL 281
F WGM K +R+ EP SP AGGLF+MDR +F +LGGYDPG+
Sbjct: 295 GLDILGFSWGMGQKSI------GSRRRTRVEPMPSPIMAGGLFSMDRKYFFDLGGYDPGM 348
Query: 282 LVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRV 341
++GGE E+SF+IW CGG++E +PCSR+GHV+R+ Y G++ V G +I N R
Sbjct: 349 KLYGGEELEISFRIWQCGGTLECIPCSRVGHVFRTGA-YWKGQVYT-VPGHVIVKNKLRA 406
Query: 342 IETWFDEKHKAYFYTREPLAMFLDMGDIS 370
E W DE + PL +D+GD+S
Sbjct: 407 AEVWMDEYKEVVQRVMPPLPRGMDLGDLS 435
>gi|195587296|ref|XP_002083401.1| GD13712 [Drosophila simulans]
gi|194195410|gb|EDX08986.1| GD13712 [Drosophila simulans]
Length = 631
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 156/344 (45%), Positives = 206/344 (59%), Gaps = 17/344 (4%)
Query: 23 GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
G GEGGKA L E+ R E G N S+ IS +R++PD+R C+ +Y L
Sbjct: 142 GLGEGGKASSLDDESQRDLEKRMSLENGFNALLSDSISVNRSLPDIRHPLCRKKEYVTKL 201
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P SVI++F+NE S LMR+VHS+I R+P + ++EIILVDD S + L ++LE YI
Sbjct: 202 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVDDHSDREYLGKELETYIAEHF 261
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
VR++R R GLI R+ GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 262 KWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 321
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTHA 260
P ID ID+ + +R+ D RG F+W YK LPE K+ ++P+KSP A
Sbjct: 322 PFIDVIDHSNFHYRAQ---DEGARGAFDWEFFYKRLPLLPE----DLKHPADPFKSPIMA 374
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG + PCSRIGH+YR P
Sbjct: 375 GGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRIGHIYRG--PR 432
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
N KG + NYKRV E K K++ + E +A L
Sbjct: 433 NHQ--PSPRKGDYLHKNYKRVAEL----KCKSFKWFMEEVAFDL 470
>gi|198461537|ref|XP_002139017.1| GA25136 [Drosophila pseudoobscura pseudoobscura]
gi|198137372|gb|EDY69575.1| GA25136 [Drosophila pseudoobscura pseudoobscura]
Length = 658
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 149/369 (40%), Positives = 205/369 (55%), Gaps = 29/369 (7%)
Query: 1 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSN 56
+P + D K ++PP + E GE GK LP + + A + N S+
Sbjct: 133 KPKLQDDTK-KVIDPPGN-FDENLGEMGKPVTLPKEMTDEMKKAVETGWTNNAFNQYVSD 190
Query: 57 HISFDRTIPDLRMEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
IS RT+PD R CK Y +LP VI+ FHNE ++ L+RTVHS++ R+P +
Sbjct: 191 LISVHRTLPDPRDAWCKDSAHYLSNLPATDVIICFHNEAWTVLLRTVHSVLDRSPEHLIG 250
Query: 116 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
IILVDD+S L +LEDY + KV++IR +REGLIR R GA+ ++ V+ +LD
Sbjct: 251 RIILVDDYSDMPHLKTQLEDYFAAYP-KVQIIRGKKREGLIRARLLGAQHAKAPVLTYLD 309
Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIF 228
+HCE WL PLL I + + PVID I T E+ HYR G F
Sbjct: 310 SHCECTEGWLEPLLDRIARNSTTVVCPVIDVISDDTLEY--------HYRDSSGVNVGGF 361
Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
+W + + + +PERE K+ +EP SPT AGGLF++DR +F LG YD G +WGGEN
Sbjct: 362 DWNLQFSWHAVPEREKKRHNSTAEPVYSPTMAGGLFSIDREYFNRLGTYDSGFDIWGGEN 421
Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
ELSFK WMCGG++E VPCS +GH++R PY + R ++ N R+ E W DE
Sbjct: 422 LELSFKTWMCGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLRKNSVRLAEVWMDE 476
Query: 349 KHKAYFYTR 357
+ Y+Y R
Sbjct: 477 -YSQYYYHR 484
>gi|34042906|gb|AAQ56699.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 601
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 145/364 (39%), Positives = 207/364 (56%), Gaps = 13/364 (3%)
Query: 17 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
L+ K G GE G A HL A + GD + +N E S ++++R++ D R C
Sbjct: 83 LQKQKVGLGEQGVAVHLSGAAKERGDEIYKKIALNEELSEQLTYNRSVGDHRNPLCAKQR 142
Query: 77 YPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
+ + LP ASV+++F NE +S L+RTVHS + + L+EIILVDD S +L KL+
Sbjct: 143 FDSESLPTASVVIIFFNEPYSVLLRTVHSTLSTCNEKALKEIILVDDGSDNVELGAKLDY 202
Query: 136 YIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIY 193
Y++ +GKV ++R R GLIR R GA+ + G+V++FLDAHCE + W PLL I
Sbjct: 203 YVRTRIPSGKVTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQRIK 262
Query: 194 SDRKIMTVPVIDGID-----YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
R + VP+ID ID Y T ++S + G F+W L + + +R K++
Sbjct: 263 ESRTSVLVPIIDVIDANDFQYSTNGYKSFQVGGFQWNGHFDWINLPEREKQRQRRECKQE 322
Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
P SPT AGGLFA+DR +F E+G YD + WGGEN E+SF+IW CGG+IE +PCS
Sbjct: 323 REICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETIPCS 382
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
R+GH++R F PY F DR + N R+ W DE +F R L D+GD
Sbjct: 383 RVGHIFRDFHPYKFPN--DRDTHGI---NTARMALVWMDEYINIFFLNRPDLKFHADIGD 437
Query: 369 ISEQ 372
++ +
Sbjct: 438 VTHR 441
>gi|12855129|dbj|BAB30220.1| unnamed protein product [Mus musculus]
Length = 431
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 132/328 (40%), Positives = 197/328 (60%), Gaps = 11/328 (3%)
Query: 45 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
L YG+N S + +R +PD R + C+ YP +LP AS+I+ F+NE F++L+R V S
Sbjct: 78 LRRYGLNAIMSRRLGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSS 137
Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
++ +P LEEIILVDD S DL KL+ Y++ F G+V+LIRN +REGLIR++ GA
Sbjct: 138 VVNLSPQHLLEEIILVDDMSEFDDLKDKLDYYLEIFRGEVKLIRNKKREGLIRSKMIGAS 197
Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
+ G+++VFLD+HCEV WL PLL I D K++ P+ID I+ T + Y
Sbjct: 198 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIV 253
Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
RG F+W + + + + E + S P +SP GG+FA++R +F ELG YD G+ +
Sbjct: 254 RGAFDWNLNLRWDNVFAYELDGPEGPSTPIRSPAMTGGIFAINRHYFNELGQYDNGMDIC 313
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
GGEN ELS +IWMCGG + +PCSR+G+ ++ + R ++ N RV+
Sbjct: 314 GGENVELSLRIWMCGGQLFILPCSRVGYNSKALSQHR------RANQSALSRNLLRVVHV 367
Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
W DE +K F+ + P ++ G+ISE+
Sbjct: 368 WLDE-YKGNFFLQRPSLTYVSCGNISER 394
>gi|156351115|ref|XP_001622369.1| hypothetical protein NEMVEDRAFT_v1g141560 [Nematostella vectensis]
gi|156208888|gb|EDO30269.1| predicted protein [Nematostella vectensis]
Length = 494
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 131/335 (39%), Positives = 197/335 (58%), Gaps = 18/335 (5%)
Query: 41 GDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMR 100
G+ + G+ N S+ I DR +PD R C+Y YP LP S+I+ FHNE S+L+R
Sbjct: 17 GEDAYGKNQFNQAISDKIGGDRDVPDTRHSHCRYEAYPSTLPATSIIITFHNEARSTLLR 76
Query: 101 TVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRS 160
TV SI+ +TP + EIILVDDFS A+ + + KV+++RN +R+GLIR+R
Sbjct: 77 TVKSILNKTPPNLVNEIILVDDFSDDAE-----DGLLLMGLPKVKVLRNNKRQGLIRSRV 131
Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
+G+ ++ +V+ FLD+HCE +WL PLL + ++K + P+ID I+ + +
Sbjct: 132 KGSDTAKSDVLTFLDSHCECNTDWLQPLLKRVVQNKKAVVSPIIDVINMDDFSYIGA--- 188
Query: 221 DHHYRGIFEWGMLYK-ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
+G F+W + +K +N PE++ +R P K+P AGGLF + +++F E+G YD
Sbjct: 189 SADIKGGFDWSLHFKWDNLTPEQKQSRRSTPIAPIKTPMIAGGLFVVTKSWFEEMGKYDT 248
Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
+ +WGGENFE+SF+ W CGGS+E +PCSR+GHV+R PY F G TY N
Sbjct: 249 MMDIWGGENFEISFRTWQCGGSMEIIPCSRVGHVFRKRHPYTF------PDGNANTYMKN 302
Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+R E W DE +K ++Y P+A G I +
Sbjct: 303 TRRTAEVWMDE-YKRFYYAARPMARSALYGSIKSR 336
>gi|158299131|ref|XP_319236.4| AGAP010078-PA [Anopheles gambiae str. PEST]
gi|157014221|gb|EAA14535.4| AGAP010078-PA [Anopheles gambiae str. PEST]
Length = 504
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 136/328 (41%), Positives = 196/328 (59%), Gaps = 18/328 (5%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYP-----LDLPKASVILVFHNEGFSSLMRTVHSI 105
N + S+ + +R +PD R C+ + LP SVI+ FHNE S+L+RTV S+
Sbjct: 33 NQQASDGLKSNRELPDTRNAMCRRSSWSDLSTIAHLPATSVIITFHNEARSTLLRTVVSV 92
Query: 106 IKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKE 165
+ R+P + + EIILVDD+S + Q+L IQ KVRLIRN++REGL+R+R GA
Sbjct: 93 LNRSPERLIHEIILVDDYSDFPEDGQELAK-IQ----KVRLIRNSKREGLVRSRVTGAAA 147
Query: 166 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 225
+ +V+ FLD+HCE +NWL PLLA + D + PVID I T+++ R
Sbjct: 148 ATAKVLTFLDSHCECNVNWLEPLLARVAEDPTRVVCPVIDVISMDTFQY---IGASADLR 204
Query: 226 GIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
G F+W +++K L E K R+ + + P ++P AGGLF +D+A+F LG YD + +W
Sbjct: 205 GGFDWNLVFKWEYLSNAERKARQRDPTAPIRTPMIAGGLFVIDKAYFERLGTYDTQMDIW 264
Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
GGEN E+SF++W CGGS+E +PCSR+GHV+R PY F G + N +R E
Sbjct: 265 GGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYTFPGGG---SGNIFAKNTRRAAEV 321
Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
W DE +K Y+Y PLA + GDI ++
Sbjct: 322 WMDE-YKKYYYAAVPLATNIPFGDIDDR 348
>gi|75832150|ref|NP_001015032.2| polypeptide N-acetylgalactosaminyltransferase 3 [Rattus norvegicus]
gi|74353669|gb|AAI01887.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Rattus
norvegicus]
gi|149022135|gb|EDL79029.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Rattus norvegicus]
Length = 633
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 156/388 (40%), Positives = 222/388 (57%), Gaps = 25/388 (6%)
Query: 1 RPVFKADGKLGNLEPPLE-PYKE--GPGEGGKAY---HLPEAYRAAGDASLGEYGMNMET 54
RP + L+P L+ P ++ PG GK + HL + + ++ N
Sbjct: 95 RPCLQGYYTAAELKPVLDRPPQDSNAPGASGKPFKITHLSPEEQKEKERGETKHCFNAFA 154
Query: 55 SNHISFDRTI-PDLRMEEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
S+ IS R + PD R EC K+ P LP SVI+VFHNE +S+L+RTVHS++ +P
Sbjct: 155 SDRISLHRDLGPDTRPPECIEQKFKRCP-PLPTTSVIIVFHNEAWSTLLRTVHSVLYSSP 213
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
A L+EIILVDD S L +KLE+YI++F+ V+++R ER+GLI R GA + E
Sbjct: 214 AILLKEIILVDDASVDDYLHEKLEEYIKQFS-IVKIVRQQERKGLITARLLGAAVATAET 272
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIF 228
+ FLDAHCE WL PLLA I + + P I ID T+EF S Y +H+ RG F
Sbjct: 273 LTFLDAHCECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHN-RGNF 331
Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
+W + + LP+ E ++RK + P K+PT AGGLF++ R +F +G YD + +WGGEN
Sbjct: 332 DWSLSFGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISRDYFEHIGSYDEEMEIWGGEN 391
Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
E+SF++W CGG +E +PCS +GHV+RS P+ F K +I N R+ E W DE
Sbjct: 392 IEMSFRVWQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQ-----VIARNQVRLAEVWMDE 446
Query: 349 KHKAYFYTREPLAMFL----DMGDISEQ 372
+K FY R A + GD+S++
Sbjct: 447 -YKEIFYRRNTDAAKIVKQKSFGDLSKR 473
>gi|345326650|ref|XP_003431069.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 4-like
[Ornithorhynchus anatinus]
Length = 580
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 151/361 (41%), Positives = 206/361 (57%), Gaps = 24/361 (6%)
Query: 19 PYKEGPGEGGKAYHLPEAYRAAGDASLGE------YGMNMETSNHISFDRTIPDLRMEEC 72
P PGE G+A L + GDA E Y +N+ S+ IS R I D RM EC
Sbjct: 71 PEPRAPGEWGEATRL----QLRGDAKKREEELVEKYAINIHLSDRISLHRRIRDRRMPEC 126
Query: 73 KYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
+ Y LP SV++ F+NE +S+L+RTVHS+++ +PA L+E+ILVDD S + L
Sbjct: 127 RAVTYDYRRLPTTSVVIAFYNEAWSTLLRTVHSVLETSPAVLLKEVILVDDLSDRPYLKA 186
Query: 132 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 191
+LE Y+ +VRL+R REGL+R R GA + GEV+ FLD HCE G WL PLL
Sbjct: 187 ELEKYVSALQ-RVRLVRTNRREGLVRARLIGATFATGEVLTFLDCHCECGPGWLEPLLER 245
Query: 192 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 251
I + + PVID ID+ T+EF + G F+W + ++ +PERE ++R+
Sbjct: 246 IGRNETAVVCPVIDTIDWNTFEF--YMQTGEPMIGGFDWRLTFQWQTVPERERRRRRSRI 303
Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
+P SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG++E +PCS +G
Sbjct: 304 DPIPSPTMAGGLFAVGKKYFEYLGTYDMGMEVWGGENLELSFRVWQCGGTLEILPCSHVG 363
Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
HV+ PY P N R E W D +K +FY R P A D+SE
Sbjct: 364 HVFPKRAPY---------ARPSFLRNTARAAEVWMD-GYKEHFYNRNPPARKESYWDLSE 413
Query: 372 Q 372
+
Sbjct: 414 R 414
>gi|443685595|gb|ELT89149.1| hypothetical protein CAPTEDRAFT_34275, partial [Capitella teleta]
Length = 358
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 193/316 (61%), Gaps = 9/316 (2%)
Query: 42 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
+ + +Y MN++ SN + DR+I D R EC + L K S+I+ F++E +S L+R
Sbjct: 1 ETNFDQYSMNVQLSNTVPLDRSILDTRNPECHVVQFSQQL-KVSIIVPFYDESWSMLLRM 59
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSR 161
+HS+I RTP LEEIIL+DD SS+ L L++Y + + K+R+IR+ REGL+R R
Sbjct: 60 LHSVIDRTPDALLEEIILIDDKSSRDYLKAPLDEYCKVLSPKIRIIRSEHREGLMRGRMV 119
Query: 162 GAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPD 221
GAKE++ + +VFLDAH E WL PLL I + + VP +D ID QT ++ S +
Sbjct: 120 GAKEAKADTLVFLDAHVECNEGWLDPLLQIIMDHPRAIAVPTMDNIDPQTIKYESW---N 176
Query: 222 HHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGL 281
H G F W M Y+ LP+ K ++P+ SPT G AM+R +F E+GG+D G+
Sbjct: 177 HVAYGGFTWNMEYQWKVLPDTLVNKLISKTQPFPSPTTIGCAMAMNRDYFFEIGGFDEGM 236
Query: 282 LVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLI-TYNYKR 340
+WGGEN E+SFK WMCG + PCSR+GH++R+ +PY F ++ G ++ NY+R
Sbjct: 237 FIWGGENLEISFKTWMCGEGLYISPCSRVGHLFRTILPYVF---PNQYGGGMVRQKNYQR 293
Query: 341 VIETWFDEKHKAYFYT 356
V E W DE +K FY
Sbjct: 294 VAEVWMDE-YKELFYA 308
>gi|345319818|ref|XP_001521442.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Ornithorhynchus anatinus]
Length = 628
Score = 258 bits (659), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 129/323 (39%), Positives = 198/323 (61%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR +PD R ++C+ + +DLP SV++ FHNE S+L+RTV S++K++P
Sbjct: 162 NQVESDKLRMDRAVPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVASVLKKSP 221
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ V
Sbjct: 222 PHLVKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQARV 276
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +WL PLL + D+ + P+ID I+ +++ +G F+W
Sbjct: 277 LTFLDSHCECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 333
Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+++F ELG YD + VWGGEN
Sbjct: 334 NLVFKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENL 393
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E VPCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 394 EISFRVWQCGGSLEIVPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 448
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 449 YKNFYYAAVPSARNVPYGNIQSR 471
>gi|410964449|ref|XP_003988767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Felis
catus]
Length = 622
Score = 258 bits (659), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 149/370 (40%), Positives = 211/370 (57%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYH---LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP +P PG GKA+ + ++ N S+ IS R + PD R
Sbjct: 106 PPQDP--NSPGADGKAFQKDKWTSLETQEKEEGYKKHCFNAFASDRISLQRALGPDTRPP 163
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ +PA L+EIILVDD S+
Sbjct: 164 ECVDQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASTDE 222
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L ++L+ Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 YLKEQLDQYVKKLQ-IVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I D ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDETVVVSPDIVTIDLNTFEFSKPVPRGRVHSRGNFDWSLTFGWEALPAHEKQR 341
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQMEIIP 401
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
CS +GHV+R+ P+ F K +I N R+ E W D +K FY R +A
Sbjct: 402 CSVVGHVFRTKSPHTFPKGIS-----VIARNQVRLAEVWMD-SYKEIFYRRNLQAAKMAQ 455
Query: 363 FLDMGDISEQ 372
GDISE+
Sbjct: 456 EKSFGDISER 465
>gi|195455372|ref|XP_002074693.1| GK23025 [Drosophila willistoni]
gi|194170778|gb|EDW85679.1| GK23025 [Drosophila willistoni]
Length = 599
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 146/370 (39%), Positives = 210/370 (56%), Gaps = 15/370 (4%)
Query: 12 NLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
+++ L + G G G A HL + + G+ + +N E S +S++RT+ D R
Sbjct: 75 SIQLDLAKQRPGLGNNGVAVHLTGSAKERGEKIYKKIALNEELSEQLSYNRTVGDHRNPL 134
Query: 72 CKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
C + + LP ASVI++F NE +S L+RTVHS + + L+EIILVDD S +L
Sbjct: 135 CASQRFDTNSLPSASVIIIFFNEPYSVLLRTVHSTLSTCNEKSLKEIILVDDGSDNVELG 194
Query: 131 QKLEDYIQ-RF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
KL+ YI+ RF GKV ++R R GLIR R GA+ + G+V++FLDAHCE + W PL
Sbjct: 195 GKLDHYIRTRFPAGKVTVLRLKNRLGLIRARLAGARMATGDVLIFLDAHCEGNVGWCEPL 254
Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
L I R + VP+ID ID +++ S G F+W + LPERE ++++
Sbjct: 255 LQRIKESRTSVLVPIIDVIDANDFQY-STNGYKAFQVGGFQWNGHFDWVNLPEREKQRQR 313
Query: 249 YNSE------PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSI 302
+ P SPT AGGLFA+DR +F E+G YD + WGGEN E+SF+IW CGG+I
Sbjct: 314 RECDQAREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTI 373
Query: 303 EWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM 362
E +PCSR+GH++R F PY F DR + N R+ W D+ +F R L
Sbjct: 374 ETIPCSRVGHIFRDFHPYKFPN--DRDTHGI---NTARMALVWMDDYINIFFLNRPDLKF 428
Query: 363 FLDMGDISEQ 372
D+GD++ +
Sbjct: 429 HADIGDVTHR 438
>gi|71896101|ref|NP_001026749.1| polypeptide N-acetylgalactosaminyltransferase 6 [Gallus gallus]
gi|60098353|emb|CAH65007.1| hypothetical protein RCJMB04_1b1 [Gallus gallus]
Length = 621
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 152/370 (41%), Positives = 213/370 (57%), Gaps = 26/370 (7%)
Query: 15 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYG-----MNMETSNHISFDRTI-PDLR 68
PP +P GPG GKA+ + A ++ E G N S+ IS R + PD R
Sbjct: 105 PPQDP--SGPGADGKAFK--KEQWTAEESKEKERGYEKHCFNAFASDRISLQRALGPDSR 160
Query: 69 MEEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
EC K+ P LP SV++VFHNE +S+L+RTV+S++ +PA L EIILVDD S+
Sbjct: 161 PPECIDQKFKRCP-PLPTTSVVIVFHNEAWSTLLRTVYSVLHASPAALLREIILVDDAST 219
Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
L +L+ Y+++ VR++R ER+GLI R GA + GEV+ FLDAHCE WL
Sbjct: 220 DEYLKDELDRYVKQLQ-IVRVVRQAERKGLITARLLGASVASGEVLTFLDAHCECFHGWL 278
Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREA 244
PLL+ I + + P I ID T+EF + V H RG F+W + + +P RE
Sbjct: 279 EPLLSRIAEEPTAVVSPDITTIDLNTFEFSKPVQYGKQHSRGNFDWSLTFGWEVVPPRER 338
Query: 245 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
++RK + P KSPT AGGLFA+ R++F +G YD + +WGGEN E+SF++W CGG +E
Sbjct: 339 QRRKDETVPIKSPTFAGGLFAISRSYFEHIGSYDDQMEIWGGENVEMSFRVWQCGGQLEI 398
Query: 305 VPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
+PCS +GHV+RS P+ F K +I+ N R+ E W D+ +K FY R A +
Sbjct: 399 IPCSVVGHVFRSKSPHTFPKGTQ-----VISRNQVRLAEVWMDD-YKEIFYRRNQQAAQM 452
Query: 365 ----DMGDIS 370
GDI+
Sbjct: 453 AREKTYGDIT 462
>gi|410975135|ref|XP_003993990.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Felis
catus]
Length = 653
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++K++P
Sbjct: 187 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 246
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 247 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 301
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 302 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 358
Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 359 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 418
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E VPCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 419 EISFRVWQCGGSLEIVPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 473
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 474 YKNFYYAAVPSARNVPYGNIQSR 496
>gi|351708624|gb|EHB11543.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Heterocephalus
glaber]
Length = 567
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++KR+P
Sbjct: 101 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 160
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 161 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 215
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 216 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 272
Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 273 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 332
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 333 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 387
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 388 YKNFYYAAVPSARNVPYGNIQSR 410
>gi|198415534|ref|XP_002121475.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
2, partial [Ciona intestinalis]
Length = 582
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 135/322 (41%), Positives = 192/322 (59%), Gaps = 17/322 (5%)
Query: 51 NMETSNHISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
N + S+ + DR +PD R C WD LP SVI+ FHNE S+L+RTV S++ R
Sbjct: 114 NQQASDKLKCDRPVPDTRNGLCSSNSWDLS-KLPATSVIVTFHNEARSTLLRTVVSVLNR 172
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
+P + EIILVDDFS A+ D +L I+ KVR++RN +REGL+R+R RGA +
Sbjct: 173 SPPSLVREIILVDDFSDNAE-DGQLLAQIE----KVRVLRNNQREGLMRSRIRGADAAAA 227
Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
V+ FLD+H E NWL PLL I DR + P+ID I+ +E+ RG F
Sbjct: 228 PVLTFLDSHVECNKNWLEPLLQRIADDRTAVVCPIIDVINMDNFEYIGASAD---LRGGF 284
Query: 229 EWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
+W +++K + + E + R N + P +P AGGLF+MD+++F +LG YD + VWGGE
Sbjct: 285 DWNLVFKWDYMSSEERRSRAGNPTAPISTPMIAGGLFSMDKSYFNQLGKYDTAMDVWGGE 344
Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
N E+SF++W CGG +E +PCSR+GHV+R PY F + G + T N +R E W D
Sbjct: 345 NLEISFRVWQCGGRLEIIPCSRVGHVFRKQHPYTFPGGS----GNVFTRNTRRAAEVWMD 400
Query: 348 EKHKAYFYTREPLAMFLDMGDI 369
+ +K Y+Y P A + G+I
Sbjct: 401 D-YKEYYYAAVPSAKLIPFGNI 421
>gi|195027660|ref|XP_001986700.1| GH20386 [Drosophila grimshawi]
gi|193902700|gb|EDW01567.1| GH20386 [Drosophila grimshawi]
Length = 666
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 137/345 (39%), Positives = 195/345 (56%), Gaps = 16/345 (4%)
Query: 7 DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPD 66
D + EP K+G G G+ +P R N+ S+ I +RT+ D
Sbjct: 80 DYNINQFEP-----KQGEGADGRPVIVPPRDRFRMQRFFKLNSFNILASDRIPLNRTLKD 134
Query: 67 LRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
R EC+ Y LP SVI+VFHNE +S L+RT+ S+I R+P L EIILVDD S++
Sbjct: 135 YRTGECRDKRYANSLPNTSVIIVFHNEAWSVLLRTITSVINRSPRHLLREIILVDDASNR 194
Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
+ L ++LE YIQ RL R ER GL+ R GA+ +RG+V+ FLDAHCE WL
Sbjct: 195 SFLKRQLEAYIQVLAVPTRLYRMKERSGLVPARLLGAQHARGDVLTFLDAHCECSRGWLE 254
Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK--ENELPEREA 244
PLLA I R+++ PVID I + + +E +H+ G F W + ++ ++ +
Sbjct: 255 PLLARIGESREVVICPVIDIISDDNFSYTKTFE--NHW-GAFNWQLSFRWFSSDRKRQTT 311
Query: 245 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
K ++ P +P AGGLFA+DR +F E+G YD + +WGGEN E+SF+IW CGG IE
Sbjct: 312 ANTKDSTAPIATPGMAGGLFAIDRKYFYEMGAYDSDMRIWGGENVEMSFRIWQCGGRIEI 371
Query: 305 VPCSRIGHVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDE 348
PCS +GH++RS PY F G +++ ++T N R W D+
Sbjct: 372 SPCSHVGHIFRSSTPYTFPGGMSE-----VLTANLARAATVWMDD 411
>gi|157107410|ref|XP_001649764.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108884050|gb|EAT48275.1| AAEL000639-PA [Aedes aegypti]
Length = 613
Score = 258 bits (658), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 149/356 (41%), Positives = 209/356 (58%), Gaps = 28/356 (7%)
Query: 18 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
E +EGPGE GK P D L E + N S + R Y
Sbjct: 103 EAEREGPGEHGK----PLKLEKLEDIKLNE---KLFKENGYSALSGVGKKR--------Y 147
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
+LP SVI++F+NE +S+L+RTV+S++ R+P+ L+EI+LV+D S+K L + L+D++
Sbjct: 148 LQELPTVSVIVIFYNEHWSTLLRTVYSVLNRSPSHLLKEIVLVNDHSTKEFLWEPLQDFV 207
Query: 138 Q-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 196
+ KV+LI R GLI R GAK + G+V++ LD+H EV +NWLPPL+ PI D
Sbjct: 208 RTELAPKVKLISLPVRSGLITARLTGAKAATGDVLIVLDSHTEVNVNWLPPLIEPIAEDY 267
Query: 197 KIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKS 256
+ P ID I + T+++R+ D RG F+W LYK LP R A+ +EP++S
Sbjct: 268 RTCVCPFIDVIAHDTFQYRA---QDEGKRGAFDWKFLYKR--LPLR-AQDMVDPTEPFES 321
Query: 257 PTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRS 316
P AGGLFA+ FF ELGGYD GL +WGGE +ELSFK+W CGG + PCSR+GHVYR
Sbjct: 322 PIMAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFKVWQCGGRMVDAPCSRVGHVYRG 381
Query: 317 FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ P+ + + +T N+KRV E W DE +K + Y R P D GD+++Q
Sbjct: 382 YAPFPNPRGTN-----FVTRNFKRVAEVWMDE-YKQFLYERNPQFDQTDAGDLTKQ 431
>gi|167526997|ref|XP_001747831.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773580|gb|EDQ87218.1| predicted protein [Monosiga brevicollis MX1]
Length = 658
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 137/322 (42%), Positives = 189/322 (58%), Gaps = 14/322 (4%)
Query: 55 SNHISFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 113
S+ + DR +PD+R CK +P +L KAS+I+ F NE +S+L+RTVHS++ R+PA
Sbjct: 188 SSLLPLDRPVPDVRPPACKAKQWPTANLLKASIIICFVNEAWSTLLRTVHSVLNRSPADL 247
Query: 114 LEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIV 172
+ EIIL+DD S A L KL +YI+ KV+ +R R GLIR R GA+ + G+V++
Sbjct: 248 VHEIILLDDSSDAAWLGDKLTNYIRDNLPDKVKYVRTQHRSGLIRARLVGAEHATGDVLL 307
Query: 173 FLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGM 232
FLD+HCE LNWL P++A I DR+ + PVID ID+ T E+ + D G F+W M
Sbjct: 308 FLDSHCEANLNWLEPIMALITEDRRTVVTPVIDSIDHHTMEYSKATQ-DVPAVGTFDWTM 366
Query: 233 LYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELS 292
+ R ++P SPT AGGLFAM++ +F ELG YD + WGGEN E+S
Sbjct: 367 DFNWKAGVRRAGADA---TDPVDSPTMAGGLFAMEKNYFYELGSYDEKMDGWGGENLEMS 423
Query: 293 FKIWMCGGSIEWVPCSRIGHVYRSFMPYNF--GKLADRVKGPLITYNYKRVIETWFDEKH 350
F+IW CGG + PCS +GH++R PY G + D N RV E W D +
Sbjct: 424 FRIWQCGGRLVTAPCSHVGHIFRDSHPYTVPGGSIHD-----TFLRNSMRVAEVWMDH-Y 477
Query: 351 KAYFYTREPLAMFLDMGDISEQ 372
K YF P +D GD+SE+
Sbjct: 478 KQYFLDTRPGQNIIDAGDVSER 499
>gi|56756104|gb|AAW26230.1| SJCHGC09400 protein [Schistosoma japonicum]
Length = 737
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 153/355 (43%), Positives = 213/355 (60%), Gaps = 13/355 (3%)
Query: 23 GPGEGGKAY-----HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
GPGEGG Y + A + D + N S+ IS R +PD R CK Y
Sbjct: 181 GPGEGGIPYTVNREDISPAEQVIFDKGWKDNAFNQLASDRISVRRYLPDYREGTCKDNKY 240
Query: 78 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
+LP AS+I+ FHNE +S L+R+VHS+I R+P+ L EIILVDDFS + L + LE+Y+
Sbjct: 241 SRNLPSASIIICFHNEAWSVLLRSVHSVIDRSPSYLLHEIILVDDFSDRPHLKEALEEYM 300
Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
+ N V+++R REGLIR R GA +S G+V+VFLD+H E WL PLL I +
Sbjct: 301 KMLN-VVKIVRTKRREGLIRARMLGAAQSSGKVLVFLDSHIECTTGWLEPLLDRIAYNSS 359
Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
I+ VPVI I+ +T ++ + P G F+W + + +E ER + P +SP
Sbjct: 360 IVVVPVITVINDKTLKY-DLPSPSRVQIGGFDWSLSFIWHEQTERHKNRPGAPYSPVQSP 418
Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
T AGGLFA+ R +F LG YDPG+ VWGGEN ELSFKIWMCGGS+E V CS++GH++R
Sbjct: 419 TMAGGLFAISREYFNHLGMYDPGMEVWGGENLELSFKIWMCGGSLEIVICSQVGHIFRDR 478
Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
PY + VK PL N R+ + W D+ +K +++ R M +D+G++SE+
Sbjct: 479 SPYIWDV---DVKDPL-KRNLLRLADVWLDD-YKRFYHARIGFEM-VDIGNVSER 527
>gi|348580113|ref|XP_003475823.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Cavia porcellus]
Length = 622
Score = 257 bits (657), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 149/371 (40%), Positives = 215/371 (57%), Gaps = 24/371 (6%)
Query: 15 PPLEPYKEGPGEGGKAYH----LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRM 69
PP +P PG GKA+ P+ + + ++ N S+ IS R + PD R
Sbjct: 106 PPQDP--NSPGADGKAFQKSDWTPQETQEK-EEGYKKHCFNAFASDRISLQRALGPDTRP 162
Query: 70 EEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ +PA L+EIILVDD S+
Sbjct: 163 SECIHQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTSPATLLKEIILVDDASTD 221
Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
L +LE Y+Q+ V+++R ER+GLI R GA ++ EV+ FLDAHCE WL
Sbjct: 222 EYLKDELERYVQQLQ-IVKVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLE 280
Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
PLLA I ++ + P I I+ T+EF + + E H RG F+W + + LP E +
Sbjct: 281 PLLARIAENKMAVVSPDIVTINLNTFEFSKPIPEGRIHSRGNFDWILTFGWEALPAHEKQ 340
Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
+RK + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +
Sbjct: 341 RRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEII 400
Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLA 361
PCS +GHV+R+ P+ F K +I N R+ E W D+ +K FY R +A
Sbjct: 401 PCSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDD-YKKIFYRRNLQAAKIA 454
Query: 362 MFLDMGDISEQ 372
GDISE+
Sbjct: 455 QEKSFGDISER 465
>gi|345798845|ref|XP_003434499.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Canis
lupus familiaris]
Length = 588
Score = 257 bits (657), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++K++P
Sbjct: 122 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 181
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 182 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 236
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 237 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 293
Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 294 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 353
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E VPCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 354 EISFRVWQCGGSLEIVPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 408
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 409 YKNFYYAAVPSARNVPYGNIQSR 431
>gi|443687046|gb|ELT90152.1| hypothetical protein CAPTEDRAFT_141956, partial [Capitella teleta]
Length = 351
Score = 257 bits (657), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 128/318 (40%), Positives = 199/318 (62%), Gaps = 11/318 (3%)
Query: 44 SLGEYGMNMETSNHI-SFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRT 101
+ G + N +S+ + +F +PD RME C Y L L K S+I++FHNE S+L+RT
Sbjct: 3 TTGYHSFNHSSSDLVGNFRHELPDFRMEGCHKKTYDLTTLGKTSIIIIFHNEARSTLLRT 62
Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSR 161
+H++++RTP L EI++VDD S+ A L + L+ Y+Q ++R+IR +R+GLIR R+R
Sbjct: 63 IHALLERTPILLLVEILIVDDASTHAWLKEPLDKYLQHL-PRIRIIRLKQRQGLIRARTR 121
Query: 162 GAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPD 221
GA+E++G+++ F DAH EVG WLPPLL I +RK++ P +D I +Q++E+ +
Sbjct: 122 GAEEAKGDILYFADAHTEVGEGWLPPLLQRIKENRKVLVFPEMDPIQHQSFEY---WRAG 178
Query: 222 HHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGL 281
Y G F W M +K P+ +R ++P SP G A++R +F E G YD +
Sbjct: 179 DEYHGAFYWHMEFKYKFAPKEILNRRSDPTQPVPSPVMVGCAHAIEREYFFETGAYDTDM 238
Query: 282 LVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRV 341
+WGGEN E +F++WMCGG +E +PCSR+GHV++ +PY+F + +I N R+
Sbjct: 239 EIWGGENIEHAFRLWMCGGRVEVIPCSRVGHVFKPRLPYSFTGDS----ASIIQRNLIRI 294
Query: 342 IETWFDEKHKAYFYTREP 359
ETW D+ +K +FY +P
Sbjct: 295 AETWMDD-YKKFFYATQP 311
>gi|291236246|ref|XP_002738051.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Saccoglossus kowalevskii]
Length = 321
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 127/274 (46%), Positives = 175/274 (63%), Gaps = 3/274 (1%)
Query: 21 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
+ GPGE G+ Y L + ++G N S+ +S +R +PD+R CK +Y +
Sbjct: 51 RTGPGEQGRPYILSPEEKKNEHQDFSKHGFNKHISDVLSVERALPDIRDPRCKTMEYLVK 110
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP S+++ FHNE S L RTVHSII R+P + L EIILVDDFS + + L DY+
Sbjct: 111 LPNTSIVIPFHNEALSVLKRTVHSIINRSPPELLHEIILVDDFSDHDECKEPLNDYMVTV 170
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
KVR+IR T+REGLIRTR GA + G+V+VFLD+HCE +NWLPPLL I +RK +
Sbjct: 171 -PKVRIIRATKREGLIRTRLLGASRATGQVLVFLDSHCEANVNWLPPLLESIALNRKCIA 229
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
P+ID I + + + + RG F+W + YK L E E K+RK+ +EP+++P A
Sbjct: 230 CPMIDVIGNNDYHYET--QAGDAMRGAFDWELFYKRIPLTEEELKRRKHAAEPFRTPIMA 287
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
GGLFA+DR +F E+GGYD GL +WGGE ++LSFK
Sbjct: 288 GGLFAVDRLYFNEIGGYDAGLEIWGGEQYDLSFK 321
>gi|339234661|ref|XP_003378885.1| putative RecF/RecN/SMC N domain protein [Trichinella spiralis]
gi|316978493|gb|EFV61475.1| putative RecF/RecN/SMC N domain protein [Trichinella spiralis]
Length = 1819
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 156/363 (42%), Positives = 205/363 (56%), Gaps = 30/363 (8%)
Query: 23 GPGEGGKAYHLPE--AYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
G GE G LP A +A D G + TS+ IS R I DLR +CK Y
Sbjct: 1299 GVGEHGNPVELPSSVAEKAEFDRLYKANGYSGWTSDKISLYRAIKDLRHVDCKRKSYLRL 1358
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR- 139
LP SVIL FHNE S L+RTV++I+ RTP + L E+ILV+D S+K +L+ LE ++QR
Sbjct: 1359 LPSTSVILPFHNEHLSVLLRTVYTIVYRTPPELLLEVILVNDASTKPELNDILERHVQRK 1418
Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
F V +IR +G R GA ++ G+V++F+DAH EVG NWLPPLL PI + +
Sbjct: 1419 FPNLVHVIR-AGSDG----RREGAAKASGQVLMFMDAHSEVGYNWLPPLLEPIKLHYRTV 1473
Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTH 259
T P ID ID T+ FR+ D RG F+W YK L K +EP++SP
Sbjct: 1474 TCPFIDVIDCDTFAFRA---QDEGARGSFDWKFHYKRLPLLN------KTGAEPFESPVM 1524
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGG FA+ + +F ELG YD L++WG E +ELSFK+W C G + +PCSRI H+YR
Sbjct: 1525 AGGYFAISKRWFDELGRYDDQLMIWGAEQYELSFKLWQCHGRMIDIPCSRIAHIYRC--K 1582
Query: 320 YNFGKLADRVK----------GPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
+ F L V G + NYKRV ETW DE +K Y Y R P +D GD+
Sbjct: 1583 FGFAALFSTVHRYAPFEDPGIGNFLERNYKRVAETWMDE-YKEYLYLRMPRLRNVDPGDL 1641
Query: 370 SEQ 372
++Q
Sbjct: 1642 TKQ 1644
>gi|328783898|ref|XP_003250361.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Apis
mellifera]
Length = 603
Score = 257 bits (656), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 136/326 (41%), Positives = 200/326 (61%), Gaps = 13/326 (3%)
Query: 51 NMETSNHISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
N+ S+ I +RT+PD+R + C +Y + +LPK S+I+VFHNE +S+L+RTV+S+I R
Sbjct: 122 NLMASDRIPLNRTLPDVRRKGCITRYMNLG-NLPKTSIIIVFHNEAWSTLLRTVYSVIDR 180
Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
+P Q LEEIILVDD S + L L+++I+ +++R+ +R GL+ R GA +++G
Sbjct: 181 SPIQLLEEIILVDDNSDRDFLKDALDEHIKNLQVSTKVLRSKKRIGLVNARLLGANKAKG 240
Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
EV+ FLDAHCE + WL PLL + +R + PVID I+ T+ + +E H+ G F
Sbjct: 241 EVLTFLDAHCECTVGWLEPLLEAVAKNRTRVVSPVIDIINDDTFSYTRSFE--LHW-GAF 297
Query: 229 EWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
W + ++ L R K+R+ N EP+++P AGGLF+M+R +F ELG YD + +WGGE
Sbjct: 298 NWDLHFRWLTLNGRLLKERRENIVEPFRTPAMAGGLFSMNRDYFFELGSYDNQMKIWGGE 357
Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWF 346
N ELSF++W CGGSIE PCS +GH++R PY F G + + + G N RV W
Sbjct: 358 NLELSFRVWQCGGSIEIAPCSHVGHLFRKSSPYTFPGGVGEILYG-----NLARVALVWM 412
Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
DE + YF A D I +
Sbjct: 413 DEWAEFYFKFNAEAARLRDKQTIRSR 438
>gi|327262637|ref|XP_003216130.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Anolis carolinensis]
Length = 500
Score = 257 bits (656), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 144/327 (44%), Positives = 191/327 (58%), Gaps = 17/327 (5%)
Query: 48 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
Y N S I DR I D R C Y DLP S+I+ FHNE S+L+RT+ S++
Sbjct: 53 YAFNQRESERIPSDRAIRDTRHHRCTTLHYRTDLPPTSIIITFHNEARSTLLRTIRSVLN 112
Query: 108 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 167
RTP + EIILVDDFS D + L KV+ +RN REGLIR+R RGA+ +
Sbjct: 113 RTPVHLVHEIILVDDFSDDPDDCRLLIKL-----PKVKCLRNRRREGLIRSRIRGAEMAE 167
Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
EV+ FLD+HCEV +WL PLL I D + PVID I+ T+ + + RG
Sbjct: 168 AEVLTFLDSHCEVNKDWLLPLLQRIKEDPSHVVSPVIDIINLDTFAYVAA---SSDLRGG 224
Query: 228 FEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
F+W + +K +L ++ KR +EP K+P AGGLF +D+A+F LG YD + +WGGE
Sbjct: 225 FDWSLHFKWEQLSPKQKAKRTDPTEPIKTPIIAGGLFVIDKAWFNHLGKYDAAMDIWGGE 284
Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIETW 345
NFE+SF++WMCGGS+E +PCSR+GHV+R PY F +G TY N KR E W
Sbjct: 285 NFEISFRVWMCGGSLEIIPCSRVGHVFRKKHPYVFP------EGNANTYIKNTKRTAEVW 338
Query: 346 FDEKHKAYFYTREPLAMFLDMGDISEQ 372
DE +K Y+Y P A G+I E+
Sbjct: 339 MDE-YKQYYYAARPAAQGRPYGEIPEE 364
>gi|170591418|ref|XP_001900467.1| Polypeptide N-acetylgalactosaminyltransferase [Brugia malayi]
gi|158592079|gb|EDP30681.1| Polypeptide N-acetylgalactosaminyltransferase, putative [Brugia
malayi]
Length = 575
Score = 257 bits (656), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 136/340 (40%), Positives = 205/340 (60%), Gaps = 16/340 (4%)
Query: 23 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLD 80
G GE G+ L E + + ++ S+ I+ +R++PD+R +C+ Y +
Sbjct: 28 GAGEDGRPVRLSEEDERLSEDTFVINQFSLVVSDRIALNRSLPDIRKHQCRTKTYLPSSE 87
Query: 81 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
LP SVI+V+HNE FS+LMRTV S+I+R+P + L+EIILVDDFS++ L +LE ++ +
Sbjct: 88 LPTTSVIIVYHNEAFSTLMRTVMSVIQRSPRENLKEIILVDDFSTRTFLKVELEKFVAQL 147
Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
++++IR ER GLIR R GA E+ G+V+ FLD+HCE W+ PLLA I +RK +
Sbjct: 148 GTRIKIIRANERVGLIRARLMGANEAEGDVLTFLDSHCECTKGWMEPLLARIKENRKAVV 207
Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
PVID I+ +T+ ++ E +RG F W + ++ LP K R + ++P SPT
Sbjct: 208 CPVIDIINDRTFAYQKSIEL---FRGGFNWNLQFRWYALPSEMIKSRSDDPTKPIISPTM 264
Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
AGGLF++DR +F E+G YD + +WGGEN E+S +++ E +PCS +GHV+R P
Sbjct: 265 AGGLFSIDRKYFEEIGTYDHEMDIWGGENIEISLRVF------EILPCSHVGHVFRRTSP 318
Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
++F R G ++ N RV E W DE K +FY P
Sbjct: 319 HDF---PGRKSGTILNSNLLRVAEVWMDE-WKFHFYRTAP 354
>gi|170051778|ref|XP_001861920.1| polypeptide N-acetylgalactosaminyltransferase 12 [Culex
quinquefasciatus]
gi|167872876|gb|EDS36259.1| polypeptide N-acetylgalactosaminyltransferase 12 [Culex
quinquefasciatus]
Length = 601
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 143/364 (39%), Positives = 207/364 (56%), Gaps = 13/364 (3%)
Query: 17 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
L + G G+ GK L R G+ L +N E S H+S++RT PD R CK
Sbjct: 81 LAKQERGLGDNGKGVELTGEAREIGEKQLATIALNEELSEHLSYNRTPPDERHPSCKRKS 140
Query: 77 YPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
Y + +LP SVI++F+NE +S L+RTVHS++ + L+EI+LVDD S+ +L KL+
Sbjct: 141 YDIENLPSTSVIIIFYNEPYSVLVRTVHSVLNTADERLLKEIVLVDDGSTNEELKGKLDY 200
Query: 136 YIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
Y++ R KV+++R R GLIR R GA+ ++ +V+VFLDAHCE WL PLL I
Sbjct: 201 YVRTRLPSKVKVLRQRNRVGLIRARLAGARFAKADVLVFLDAHCECMPQWLEPLLERIRE 260
Query: 195 DRKIMTVPVIDGID-----YQTWEFRSVYEPDHHYRGIFEW-GMLYKENELPEREAKKRK 248
R + VP+ID I+ Y T F + G F+W + +E E +RE ++
Sbjct: 261 SRTSVLVPIIDVIEAKNFFYSTNGFTDFQIGGFTWDGHFDWHDVTQREKERQKRECSEKD 320
Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
P SPT AGGLFA+ R +F E+G YD + WGGEN E+SF++W CGG++E +PCS
Sbjct: 321 VAICPTYSPTMAGGLFAISRDYFWEIGSYDEQMDGWGGENLEMSFRVWQCGGTLETIPCS 380
Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
RIGH++R F PY+F DR + N R+ W D+ + R L ++GD
Sbjct: 381 RIGHIFRDFHPYSFPN--DRDTHGI---NTVRMATVWMDDYIDLLYLNRPDLRDHPEVGD 435
Query: 369 ISEQ 372
++ +
Sbjct: 436 VTHR 439
>gi|380030377|ref|XP_003698825.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Apis florea]
Length = 595
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 135/324 (41%), Positives = 195/324 (60%), Gaps = 9/324 (2%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRT 109
N+ S+ I +RT+PD+R + C LD LPK S+I+VFHNE +S+L+RTV+S+I R+
Sbjct: 114 NLMASDRIPLNRTLPDVRRKGCISRYMNLDNLPKTSIIIVFHNEAWSTLLRTVYSVIDRS 173
Query: 110 PAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGE 169
P Q LEEIILVDD S + L L+++++ +++R+ +R GL+ R GA ++GE
Sbjct: 174 PRQLLEEIILVDDNSDRDFLKDTLDEHVKNLQVSTKVLRSRKRIGLVNARLLGANNAKGE 233
Query: 170 VIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFE 229
V+ FLDAHCE + WL PLL + +R + PVID I+ T+ + +E H+ G F
Sbjct: 234 VLTFLDAHCECTVGWLEPLLEAVAKNRTRVVSPVIDIINDDTFSYTRSFEL--HW-GAFN 290
Query: 230 WGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
W + ++ L R K+R+ N EP+++P AGGLF+M+R +F ELG YD + +WGGEN
Sbjct: 291 WDLHFRWLTLNGRLLKERRENIVEPFRTPAMAGGLFSMNRDYFFELGSYDNQMKIWGGEN 350
Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
ELSF++W CGGSIE PCS +GH++R PY F G ++ N RV W DE
Sbjct: 351 LELSFRVWQCGGSIEIAPCSHVGHLFRKSSPYTFPGGV----GEILYGNLARVALVWMDE 406
Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
+ YF A D I +
Sbjct: 407 WAEFYFKFNAEAARLRDKQTIRSR 430
>gi|47085989|ref|NP_998361.1| polypeptide N-acetylgalactosaminyltransferase 6 [Danio rerio]
gi|45501175|gb|AAH67340.1| Zgc:77836 [Danio rerio]
Length = 619
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 145/370 (39%), Positives = 214/370 (57%), Gaps = 22/370 (5%)
Query: 15 PPLEPYKEGPGEGGKAYH---LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
PP P + PG G + + + + + N S+ IS RT+ D R
Sbjct: 101 PPENP--QAPGADGVPFQYDRMTKEEEKEKQEGMTRHCFNQFASDRISLHRTLGDDTRPP 158
Query: 71 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ +PA +L+EII+VDD S+
Sbjct: 159 ECVDRKFRRCPA-LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAAFLKEIIMVDDASTAE 217
Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
L KLE+Y++ V+++R ER+GLI R GA ++ GE++ FLDAHCE WL P
Sbjct: 218 HLHGKLEEYVKALK-IVKVVRQPERKGLITARLLGASKAEGEILTFLDAHCECFHGWLEP 276
Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
LLA I + + P I ID T++F + V H RG F+W + + +P+ E K
Sbjct: 277 LLARIVEEPTAVVSPEITTIDLNTFQFHKPVATARAHNRGNFDWSLTFGWEGIPDYENAK 336
Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
RK + P K+PT AGGLF++ +A+F ++G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 337 RKDETYPVKTPTFAGGLFSISKAYFEKIGTYDDKMEIWGGENVEMSFRVWQCGGQLEIIP 396
Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL-- 364
CS +GHV+R+ P+ F K + +IT N R+ E W D+ +K FY R A +
Sbjct: 397 CSVVGHVFRTKSPHTFPKGTE-----VITRNQVRLAEVWMDD-YKLIFYRRSQSAAKMAK 450
Query: 365 --DMGDISEQ 372
GDIS++
Sbjct: 451 EKGFGDISDR 460
>gi|13938114|gb|AAH07172.1| Galnt2 protein, partial [Mus musculus]
Length = 526
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 197/323 (60%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++KR+P
Sbjct: 60 NQVESDKLHMDRGIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 119
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 120 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 174
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 175 LTFLDSHCECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 231
Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 232 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 291
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 292 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 346
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 347 YKHFYYAAVPSARNVPYGNIQSR 369
>gi|1575723|gb|AAB09579.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase-T3 [Mus
musculus]
Length = 633
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 149/363 (41%), Positives = 211/363 (58%), Gaps = 22/363 (6%)
Query: 23 GPGEGGKAY---HLPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRMEEC---KYW 75
PG GK + HL + + ++ N S+ IS R + PD R EC K+
Sbjct: 120 APGASGKPFKITHLSPEEQKEKERGETKHCFNAFASDRISLHRDLGPDTRPPECIEQKFK 179
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
P LP SVI+VFHNE +S+L+RTVHS++ +PA L+EIILVDD S L +KLE+
Sbjct: 180 RCP-PLPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDDYLHEKLEE 238
Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
YI++F+ V+++R ER+GLI R GA + E + FLDAHCE WL PLLA I +
Sbjct: 239 YIKQFS-IVKIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLLARIAEN 297
Query: 196 RKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEP 253
+ P I ID T+EF S Y +H+ RG F+W + + LP+ E ++RK + P
Sbjct: 298 YTAVVSPDIASIDLNTFEFNKPSPYGSNHN-RGNFDWSLSFGWESLPDHEKQRRKDETYP 356
Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
K+PT AGGLF++ + +F +G YD + +WGGEN E+SF++W CGG +E +PCS +GHV
Sbjct: 357 IKTPTFAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHV 416
Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL----DMGDI 369
+RS P+ F K +I N R+ E W DE +K FY R A + GD+
Sbjct: 417 FRSKSPHTFPKGTQ-----VIARNQVRLAEVWMDE-YKEIFYRRNTDAAKIVKQKSFGDL 470
Query: 370 SEQ 372
S++
Sbjct: 471 SKR 473
>gi|74195843|dbj|BAE30483.1| unnamed protein product [Mus musculus]
Length = 544
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 197/323 (60%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++KR+P
Sbjct: 78 NQVESDKLHMDRGIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 137
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 138 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 192
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 193 LTFLDSHCECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASA---DLKGGFDW 249
Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 250 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 309
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 310 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 364
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 365 YKHFYYAAVPSARNVPYGNIQSR 387
>gi|417402857|gb|JAA48260.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 571
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++K++P
Sbjct: 105 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 164
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ V
Sbjct: 165 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQARV 219
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 220 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 276
Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+++F ELG YD + VWGGEN
Sbjct: 277 NLVFKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENL 336
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 337 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 391
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 392 YKNFYYAAVPSARNVPYGNIQSR 414
>gi|224054950|ref|XP_002197786.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Taeniopygia guttata]
Length = 631
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 152/372 (40%), Positives = 214/372 (57%), Gaps = 21/372 (5%)
Query: 13 LEPPLEPYKEGPGEGGKAY---HLPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLR 68
L+ PL+ PG GKA+ +L + A ++ N S+ IS R + PD R
Sbjct: 109 LDRPLQD-PNAPGASGKAFKTINLNSEEQKEKQAGEEKHCFNAFASDRISLHRDLGPDTR 167
Query: 69 MEEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
EC K+ P LP S+I+VFHNE +S+L+RTVHS++ +PA L+EIILVDD S
Sbjct: 168 PPECIEQKFKRCP-PLPTTSIIIVFHNEAWSTLLRTVHSVMYTSPAILLKEIILVDDASV 226
Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
L KL++Y+++F V+++R ER+GLI R GA + GE + FLDAHCE WL
Sbjct: 227 DEYLHDKLDEYVKQFQ-IVKVVRQKERKGLITARLLGASVATGETLTFLDAHCECFYGWL 285
Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDH-HYRGIFEWGMLYKENELPEREA 244
PLLA I + + P I ID T+EF H H RG F+W + + LP+ E
Sbjct: 286 EPLLARIAENPVAVVSPDIASIDLNTFEFSKPSPYGHSHNRGNFDWSLSFGWESLPKHEN 345
Query: 245 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
K+RK + P ++PT AGGLF++ + +F +G YD + +WGGEN E+SF++W CGG +E
Sbjct: 346 KRRKDETYPIRTPTFAGGLFSISKDYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEI 405
Query: 305 VPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
+PCS +GHV+RS P+ F K +IT N R+ E W DE +K FY R A +
Sbjct: 406 MPCSVVGHVFRSKSPHTFPKGTQ-----VITRNQVRLAEVWMDE-YKEIFYRRNTEAAKI 459
Query: 365 ----DMGDISEQ 372
GDIS++
Sbjct: 460 VKQKTFGDISKR 471
>gi|162951828|ref|NP_056551.2| polypeptide N-acetylgalactosaminyltransferase 3 [Mus musculus]
gi|341941092|sp|P70419.3|GALT3_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
AltName: Full=Polypeptide GalNAc transferase 3;
Short=GalNAc-T3; Short=pp-GaNTase 3; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 3;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 3
gi|74183238|dbj|BAE22551.1| unnamed protein product [Mus musculus]
gi|148695061|gb|EDL27008.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Mus musculus]
Length = 633
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 149/363 (41%), Positives = 211/363 (58%), Gaps = 22/363 (6%)
Query: 23 GPGEGGKAY---HLPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRMEEC---KYW 75
PG GK + HL + + ++ N S+ IS R + PD R EC K+
Sbjct: 120 APGASGKPFKITHLSPEEQKEKERGETKHCFNAFASDRISLHRDLGPDTRPPECIEQKFK 179
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
P LP SVI+VFHNE +S+L+RTVHS++ +PA L+EIILVDD S L +KLE+
Sbjct: 180 RCP-PLPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDDYLHEKLEE 238
Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
YI++F+ V+++R ER+GLI R GA + E + FLDAHCE WL PLLA I +
Sbjct: 239 YIKQFS-IVKIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLLARIAEN 297
Query: 196 RKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEP 253
+ P I ID T+EF S Y +H+ RG F+W + + LP+ E ++RK + P
Sbjct: 298 YTAVVSPDIASIDLNTFEFNKPSPYGSNHN-RGNFDWSLSFGWESLPDHEKQRRKDETYP 356
Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
K+PT AGGLF++ + +F +G YD + +WGGEN E+SF++W CGG +E +PCS +GHV
Sbjct: 357 IKTPTFAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHV 416
Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL----DMGDI 369
+RS P+ F K +I N R+ E W DE +K FY R A + GD+
Sbjct: 417 FRSKSPHTFPKGTQ-----VIARNQVRLAEVWMDE-YKEIFYRRNTDAAKIVKQKSFGDL 470
Query: 370 SEQ 372
S++
Sbjct: 471 SKR 473
>gi|426220977|ref|XP_004004688.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Ovis
aries]
Length = 633
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 155/388 (39%), Positives = 222/388 (57%), Gaps = 25/388 (6%)
Query: 1 RPVFKADGKLGNLEPPLE-PYKE--GPGEGGKAY---HLPEAYRAAGDASLGEYGMNMET 54
RP + L+P L+ P ++ PG GKA+ +L + + ++ N
Sbjct: 95 RPCLQGYYTAAELKPVLDRPPQDSNAPGASGKAFKTTNLSAEEQKEKERGEAKHCFNAFA 154
Query: 55 SNHISFDRTI-PDLRMEEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
S+ IS R + PD R EC K+ P LP SVI+VFHNE +S+L+RTVHS++ +P
Sbjct: 155 SDRISLHRDLGPDTRPPECIEQKFKRCP-PLPTTSVIIVFHNEAWSTLLRTVHSVLYSSP 213
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
A L+EIILVDD S L KLE+YI++F+ V+++R ER+GLI R GA + E
Sbjct: 214 AILLKEIILVDDASVDEYLHDKLEEYIKQFS-IVKIVRQKERKGLITARLLGATVATAET 272
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIF 228
+ FLDAHCE WL PLLA I + + P I ID T+EF S Y +H+ RG F
Sbjct: 273 LTFLDAHCECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHN-RGNF 331
Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
+W + + LP+ E ++RK + P K+PT AGGLF++ + +F +G YD + +WGGEN
Sbjct: 332 DWSLSFGWETLPDHEKQRRKDETYPIKTPTFAGGLFSISKDYFEYIGTYDEEMEIWGGEN 391
Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
E+SF++W CGG +E +PCS +GHV+RS P+ F K +I N R+ E W DE
Sbjct: 392 IEMSFRVWQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQ-----VIARNQVRLAEVWMDE 446
Query: 349 KHKAYFYTREPLAMFL----DMGDISEQ 372
+K FY R A + GD+S++
Sbjct: 447 -YKEIFYRRNTDAAKIVKQKSFGDLSKR 473
>gi|149758073|ref|XP_001496259.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Equus
caballus]
Length = 539
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++K++P
Sbjct: 73 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 132
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 133 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 187
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 188 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 244
Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 245 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 304
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 305 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 359
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 360 YKNFYYAAVPSARNVPYGNIQSR 382
>gi|363736053|ref|XP_422169.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Gallus
gallus]
Length = 811
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 143/353 (40%), Positives = 211/353 (59%), Gaps = 10/353 (2%)
Query: 22 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
+ PG+ G +P+ + + E N+ S+ I DR I D R C DL
Sbjct: 308 QAPGQFGHPVAVPDDKQEEAKSRWKEGNFNVFLSDMIPVDRAIADTRPAGCLEQQVHNDL 367
Query: 82 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
P ++I+ F +E +S+L+R+VHS++ R+P L+E+ILVDDFS+K L +KL+ Y+ +F
Sbjct: 368 PTTTIIMCFVDEVWSTLLRSVHSVLSRSPPHLLQELILVDDFSTKDYLKEKLDAYMSQFP 427
Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
KV+++ ER GLIR R GA+ +RG V+ FLD+H E + WL PLL + R +
Sbjct: 428 -KVKVLHLRERHGLIRARLAGAQVARGTVLTFLDSHVECNVGWLEPLLERVRLRRARVAC 486
Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
PVI+ I + + +V D+ RGIF W M + ++P+ +K K ++ + P A
Sbjct: 487 PVIEVISDKDMSYMTV---DNFQRGIFTWPMNFGWKQIPQEVIEKNKLKETDIIRCPVMA 543
Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
GGLF++++ +F ELG YD GL VWGGEN ELSFK+WMCGG IE VPCSR+GH++R+ PY
Sbjct: 544 GGLFSIEKKYFFELGTYDSGLDVWGGENMELSFKVWMCGGEIEIVPCSRVGHIFRNDNPY 603
Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDE-KHKAYFYTREPLAMFLDMGDISEQ 372
+F K DRV+ + N RV E W D+ K Y + L ++GD+S+Q
Sbjct: 604 SFPK--DRVR--TVERNLARVAEVWLDDYKELFYGHAYHLLQRRAELGDLSQQ 652
>gi|148679819|gb|EDL11766.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 [Mus musculus]
Length = 548
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 197/323 (60%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++KR+P
Sbjct: 82 NQVESDKLHMDRGIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 141
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 142 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 196
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 197 LTFLDSHCECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 253
Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 254 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 313
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 314 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 368
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 369 YKHFYYAAVPSARNVPYGNIQSR 391
>gi|149043194|gb|EDL96726.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (predicted), isoform
CRA_a [Rattus norvegicus]
Length = 504
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 197/323 (60%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR+IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++KR+P
Sbjct: 38 NQVESDKLRMDRSIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 97
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 98 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 152
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 153 LTFLDSHCECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 209
Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 210 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 269
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 270 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 324
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
K ++Y P A + G+I +
Sbjct: 325 FKHFYYAAVPSARNVPYGNIQSR 347
>gi|31418564|gb|AAH53063.1| Galnt2 protein [Mus musculus]
Length = 536
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 197/323 (60%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++KR+P
Sbjct: 70 NQVESDKLHMDRGIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 129
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 130 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 184
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 185 LTFLDSHCECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASA---DLKGGFDW 241
Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 242 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 301
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 302 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 356
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 357 YKHFYYAAVPSARNVPYGNIQSR 379
>gi|326437922|gb|EGD83492.1| hypothetical protein PTSG_04099 [Salpingoeca sp. ATCC 50818]
Length = 699
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 140/347 (40%), Positives = 200/347 (57%), Gaps = 25/347 (7%)
Query: 34 PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNE 93
PE R + S+ + N S+ +S R IPD R C+ ++P DLP+A+VI+ F NE
Sbjct: 222 PEQVRKLEEESMKKNAFNEYRSSKLSLHRDIPDSRNPLCRQQEHPRDLPQATVIICFVNE 281
Query: 94 GFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTER 152
+S+L+RTV S++ RTP L+EI+LVDD S + L KLE ++ KV+L+R+ +R
Sbjct: 282 AWSTLLRTVWSVLDRTPPHLLKEILLVDDASDQEHLLDKLEVEVRDNLPDKVKLVRSPKR 341
Query: 153 EGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW 212
GLIR R GA+ + + +VFLD+HCE L WL PLLA + D+ + P ID I QT
Sbjct: 342 LGLIRARVLGAEHATADYMVFLDSHCEANLGWLEPLLAWMAKDKTRVVCPTIDRISAQTM 401
Query: 213 EFRSVYEPDHHYRGIFEWGM-------LYKENELPEREAKKRKYNSEPYKSPTHAGGLFA 265
++ RG F W + + + E P ++P KSPT AGGLF
Sbjct: 402 DY---VGGGASSRGTFHWTLDFTWEYAVRQHGETP----------ADPIKSPTMAGGLFG 448
Query: 266 MDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKL 325
++R +F ELG YD G+ WGGEN E+SF+IW CGGS+ +PCSR+GH++R + PY +
Sbjct: 449 INRDYFYELGTYDMGMDGWGGENLEMSFRIWQCGGSLHIIPCSRVGHIFRDWHPY---AI 505
Query: 326 ADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
+ N R+ E W DE +K FY +P A +D GD+SE+
Sbjct: 506 PNSTVNETFLKNSIRLAEVWMDE-YKDIFYDIKPSARSVDFGDVSER 551
>gi|74203117|dbj|BAE26246.1| unnamed protein product [Mus musculus]
Length = 618
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 197/323 (60%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++KR+P
Sbjct: 107 NQVESDKLHMDRGIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 166
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 167 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 221
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 222 LTFLDSHCECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 278
Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 279 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 338
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 339 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 393
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 394 YKHFYYAAVPSARNVPYGNIQSR 416
>gi|221043222|dbj|BAH13288.1| unnamed protein product [Homo sapiens]
Length = 533
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++K++P
Sbjct: 67 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 126
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 127 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 181
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 182 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 238
Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 239 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 298
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 299 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 353
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 354 YKNFYYAAVPSARNVPYGNIQSR 376
>gi|197246167|gb|AAI68926.1| Galnt2 protein [Rattus norvegicus]
Length = 569
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 197/323 (60%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR+IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++KR+P
Sbjct: 103 NQVESDKLRMDRSIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 162
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 163 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 217
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 218 LTFLDSHCECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 274
Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 275 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 334
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 335 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 389
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
K ++Y P A + G+I +
Sbjct: 390 FKHFYYAAVPSARNVPYGNIQSR 412
>gi|27696612|gb|AAH43331.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Mus musculus]
Length = 633
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 149/363 (41%), Positives = 211/363 (58%), Gaps = 22/363 (6%)
Query: 23 GPGEGGKAY---HLPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRMEEC---KYW 75
PG GK + HL + + ++ N S+ IS R + PD R EC K+
Sbjct: 120 APGASGKPFKITHLSPEEQKEKERGETKHCFNAFASDRISLHRDLGPDTRPPECIEQKFK 179
Query: 76 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
P LP SVI+VFHNE +S+L+RTVHS++ +PA L+EIILVDD S L +KLE+
Sbjct: 180 RCP-PLPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDDYLHEKLEE 238
Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
YI++F+ V+++R ER+GLI R GA + E + FLDAHCE WL PLLA I +
Sbjct: 239 YIKQFS-IVKIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLLARIAEN 297
Query: 196 RKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEP 253
+ P I ID T+EF S Y ++H RG F+W + + LP+ E ++RK + P
Sbjct: 298 YTAVVSPDIASIDLNTFEFNKPSPY-GNNHNRGNFDWSLSFGWESLPDHEKQRRKDETYP 356
Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
K+PT AGGLF++ + +F +G YD + +WGGEN E+SF++W CGG +E +PCS +GHV
Sbjct: 357 IKTPTFAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHV 416
Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL----DMGDI 369
+RS P+ F K +I N R+ E W DE +K FY R A + GD+
Sbjct: 417 FRSKSPHTFPKGTQ-----VIARNQVRLAEVWMDE-YKEIFYRRNTDAAKIVKQKSFGDL 470
Query: 370 SEQ 372
S++
Sbjct: 471 SKR 473
>gi|119590315|gb|EAW69909.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
CRA_b [Homo sapiens]
gi|119590316|gb|EAW69910.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
CRA_b [Homo sapiens]
Length = 533
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++K++P
Sbjct: 67 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 126
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 127 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 181
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 182 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 238
Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 239 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 298
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 299 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 353
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 354 YKNFYYAAVPSARNVPYGNIQSR 376
>gi|426334121|ref|XP_004028610.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Gorilla
gorilla gorilla]
Length = 533
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++K++P
Sbjct: 67 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 126
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 127 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 181
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 182 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 238
Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 239 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 298
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 299 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 353
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 354 YKNFYYAAVPSARNVPYGNIQSR 376
>gi|391346483|ref|XP_003747502.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Metaseiulus occidentalis]
Length = 514
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 142/327 (43%), Positives = 191/327 (58%), Gaps = 11/327 (3%)
Query: 47 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
+ N S+ IS +R++PD+R EC+ Y LP S+I+ FHNE +S L+RTVHSI+
Sbjct: 37 QNAFNSYVSDLISVNRSLPDMRHIECRDQVYSSKLPSTSIIVCFHNEAWSVLIRTVHSIL 96
Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
R+PA + +IILVDDFS L LE Y+ F KVR++R +REGLIR R GA S
Sbjct: 97 NRSPAHLIHDIILVDDFSDLQLLKDPLERYLSAFP-KVRIVRAEKREGLIRARLLGASHS 155
Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
V+ FLD+H E WL PLL I + + PVID I T E+ + D + G
Sbjct: 156 TAPVLTFLDSHVECTQGWLEPLLDRIAVNSTNVVSPVIDIIADDTLEYNAKESADVNVGG 215
Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
F+W + + + +PER K +P ++PT AGGLF++DR FF LG YDPG +WGG
Sbjct: 216 -FDWSLQFSWHSIPERILKSGYKRWQPVETPTMAGGLFSIDRKFFERLGMYDPGFDIWGG 274
Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
EN ELSFK WMCGG +E +PCS +GH++R PY + R ++ N R+ + W
Sbjct: 275 ENLELSFKTWMCGGRLEIIPCSHVGHIFRKRSPYKW-----RSGVNVLRRNSIRLAKVWM 329
Query: 347 DEKHKAYFYTREPLAMFL-DMGDISEQ 372
DE YF E L L D GDIS++
Sbjct: 330 DEYANYYF---ERLGNDLGDYGDISDR 353
>gi|332812183|ref|XP_001147638.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
4 [Pan troglodytes]
Length = 533
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++K++P
Sbjct: 67 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 126
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 127 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 181
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 182 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 238
Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 239 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 298
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 299 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 353
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 354 YKNFYYAAVPSARNVPYGNIQSR 376
>gi|46877109|ref|NP_644678.2| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Mus
musculus]
gi|51315867|sp|Q6PB93.1|GALT2_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
AltName: Full=Polypeptide GalNAc transferase 2;
Short=GalNAc-T2; Short=pp-GaNTase 2; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 2;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 2; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 2
soluble form
gi|37590571|gb|AAH59818.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 [Mus musculus]
Length = 570
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 197/323 (60%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++KR+P
Sbjct: 104 NQVESDKLHMDRGIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 163
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 164 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 218
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 219 LTFLDSHCECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 275
Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 276 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 335
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 336 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 390
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 391 YKHFYYAAVPSARNVPYGNIQSR 413
>gi|13650039|gb|AAK37548.1| polypeptide GalNAc transferase-T2 [Mus musculus]
Length = 570
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 197/323 (60%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++KR+P
Sbjct: 104 NQVESDKLHMDRGIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 163
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 164 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 218
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 219 LTFLDSHCECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 275
Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 276 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 335
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 336 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 390
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 391 YKHFYYAAVPSARNVPYGNIQSR 413
>gi|88192992|pdb|2FFU|A Chain A, Crystal Structure Of Human Ppgalnact-2 Complexed With Udp
And Ea2
gi|88192994|pdb|2FFV|A Chain A, Human Ppgalnact-2 Complexed With Manganese And Udp
gi|88192995|pdb|2FFV|B Chain B, Human Ppgalnact-2 Complexed With Manganese And Udp
Length = 501
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++K++P
Sbjct: 35 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 94
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 95 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 149
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 150 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 206
Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 207 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 266
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 267 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 321
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 322 YKNFYYAAVPSARNVPYGNIQSR 344
>gi|119590314|gb|EAW69908.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
CRA_a [Homo sapiens]
Length = 508
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++K++P
Sbjct: 67 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 126
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 127 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 181
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 182 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 238
Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 239 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 298
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 299 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 353
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 354 YKNFYYAAVPSARNVPYGNIQSR 376
>gi|380798879|gb|AFE71315.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor, partial
[Macaca mulatta]
Length = 554
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++K++P
Sbjct: 88 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 147
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 148 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 202
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 203 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 259
Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 260 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 319
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 320 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 374
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 375 YKNFYYAAVPSARNVPYGNIQSR 397
>gi|355559183|gb|EHH15963.1| hypothetical protein EGK_02147, partial [Macaca mulatta]
Length = 530
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++K++P
Sbjct: 64 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 123
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 124 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 178
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 179 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 235
Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 236 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 295
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 296 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 350
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 351 YKNFYYAAVPSARNVPYGNIQSR 373
>gi|402858708|ref|XP_003893834.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 2 [Papio anubis]
Length = 571
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++K++P
Sbjct: 105 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 164
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 165 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 219
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 220 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 276
Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 277 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 336
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 337 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 391
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 392 YKNFYYAAVPSARNVPYGNIQSR 414
>gi|390477336|ref|XP_003735278.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 2 [Callithrix jacchus]
Length = 571
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)
Query: 51 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
N S+ + DR IPD R ++C+ + +DLP SV++ FHNE S+L+RTV S++K++P
Sbjct: 105 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 164
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
++EIILVDD+S+ + D L I+ KVR++RN REGL+R+R RGA ++ +V
Sbjct: 165 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 219
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
+ FLD+HCE +WL PLL + DR + P+ID I+ +++ +G F+W
Sbjct: 220 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 276
Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
+++K + + + + R+ N P K+P AGGLF MD+ +F ELG YD + VWGGEN
Sbjct: 277 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 336
Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
E+SF++W CGGS+E +PCSR+GHV+R PY F + G + N +R E W DE
Sbjct: 337 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 391
Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
+K ++Y P A + G+I +
Sbjct: 392 YKNFYYAAVPSARNVPYGNIQSR 414
>gi|167536139|ref|XP_001749742.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771890|gb|EDQ85551.1| predicted protein [Monosiga brevicollis MX1]
Length = 1275
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 132/327 (40%), Positives = 188/327 (57%), Gaps = 10/327 (3%)
Query: 46 GEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSI 105
+ N S+ +S R +PD R +CK YP DLP A+VI+ F NE +S+L RTV S+
Sbjct: 214 ARFAFNEYRSSQLSLHRDVPDARPMQCKDVAYPPDLPAATVIICFVNEAWSALFRTVWSV 273
Query: 106 IKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKE 165
+ RTP L EIIL+DD S + L Q LE+ +QR KV+L+R+ R GLIR R GAK
Sbjct: 274 LDRTPENLLHEIILLDDASDASWLQQPLEEELQRLPAKVKLVRSPRRLGLIRARLLGAKH 333
Query: 166 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 225
+ + ++FLD+HCE + W+ PLLA + D + PVID I+ + R
Sbjct: 334 ATADYMIFLDSHCEANVGWIQPLLAWMAGDPSRVVTPVIDSINNNDMSYHGAGGAS--SR 391
Query: 226 GIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWG 285
G F W + + PE A+ ++P KSPT AGGLF ++R +F ++G YD G+ WG
Sbjct: 392 GTFHWTLDFSWEANPEPVAQV----TDPVKSPTMAGGLFGINRQYFYDVGSYDQGMDGWG 447
Query: 286 GENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETW 345
GEN E+SF++W CGGS+ +PCS +GH++R PY + + N R+ ETW
Sbjct: 448 GENLEMSFRVWQCGGSLHILPCSHVGHIFRDSHPYT---IPNSTINDTFLRNSIRLAETW 504
Query: 346 FDEKHKAYFYTREPLAMFLDMGDISEQ 372
D+ +K FY P A +D GD+ E+
Sbjct: 505 MDD-YKEIFYQIRPSARKVDHGDVGER 530
>gi|296204662|ref|XP_002749425.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Callithrix jacchus]
Length = 633
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 153/388 (39%), Positives = 222/388 (57%), Gaps = 25/388 (6%)
Query: 1 RPVFKADGKLGNLEPPLE-PYKE--GPGEGGKAY---HLPEAYRAAGDASLGEYGMNMET 54
RP + L+P L+ P ++ PG GKA+ +L + + ++ N
Sbjct: 95 RPCLQGYYTAAELKPVLDRPPQDSNAPGASGKAFKTTNLSIEEQKEKERGEAKHCFNAFA 154
Query: 55 SNHISFDRTI-PDLRMEEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
S+ +S R + PD R EC K+ P LP SVI+VFHNE +S+L+RTVHS++ +P
Sbjct: 155 SDRVSLHRDLGPDTRPPECIEQKFKRCP-PLPTTSVIIVFHNEAWSTLLRTVHSVLYSSP 213
Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
A L+EIILVDD S L KL++Y+++F+ V+++R ER+GLI R GA + E
Sbjct: 214 AVLLKEIILVDDASVDEYLHDKLDEYVKQFS-IVKIVRQRERKGLITARLLGASVATAET 272
Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIF 228
+ FLDAHCE WL PLLA I + + P I ID T+EF S Y HH RG F
Sbjct: 273 LTFLDAHCECFYGWLEPLLARIAENYTAVVSPDIASIDMNTFEFNKPSPY-GSHHNRGNF 331
Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
+W + + LP+ E ++RK + P K+PT AGGLF++ + +F +G YD + +WGGEN
Sbjct: 332 DWSLSFGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGEN 391
Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
E+SF++W CGG +E +PCS +GHV+RS P++F K +I N R+ E W DE
Sbjct: 392 IEMSFRVWQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQ-----VIARNQVRLAEVWMDE 446
Query: 349 KHKAYFYTREPLAMFL----DMGDISEQ 372
+K FY R A + GD+S++
Sbjct: 447 -YKEIFYRRNTDAAKIVKQKTFGDLSKR 473
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.141 0.443
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,661,254,994
Number of Sequences: 23463169
Number of extensions: 301466500
Number of successful extensions: 601944
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2183
Number of HSP's successfully gapped in prelim test: 2738
Number of HSP's that attempted gapping in prelim test: 593067
Number of HSP's gapped (non-prelim): 5446
length of query: 372
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 228
effective length of database: 8,980,499,031
effective search space: 2047553779068
effective search space used: 2047553779068
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)