BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy3380
(290 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|242008519|ref|XP_002425051.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212508700|gb|EEB12313.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 657
Score = 303 bits (776), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 173/362 (47%), Positives = 202/362 (55%), Gaps = 106/362 (29%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLLD +A++ + VV P+I I D T E F S +GGFDWNLQ
Sbjct: 276 CECTVGWLEPLLDRIAKDPTTVVCPVIDVIDDTTLEYNF-----RDSGGVNVGGFDWNLQ 330
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHA+PERE+KRHKN AEPVW+PTMAGGLF+IDK FFE++GTYDSGFDIWGGENLELSF
Sbjct: 331 FNWHAVPEREKKRHKNTAEPVWSPTMAGGLFAIDKNFFERIGTYDSGFDIWGGENLELSF 390
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F + Y SG ++
Sbjct: 391 K--------------------TW---MCGGTLEIVPCSHVGHIFRRRSPYKWRSGVNVLK 427
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ KGDFGDV++RKELR+ L CKSFKWYL
Sbjct: 428 RNSVRLAEVWLDDYAKYYYQRIGDDKGDFGDVSARKELRKRLNCKSFKWYLDNIYPELFI 487
Query: 211 --------EVSN-------------------------------------DWSGMCIDSAC 225
EV N +WSG C+DS C
Sbjct: 488 PGEAVAGGEVRNKGLGGKTCLDSPARKADLHKAVGLFPCHRQGGNQVSNNWSGQCLDSPC 547
Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
K DMHKPVGL+PCHKQGGNQ+WM+SK GEIRRDEACLDYAG DVILYPCHGSKGNQY+
Sbjct: 548 KSEDMHKPVGLWPCHKQGGNQYWMLSKAGEIRRDEACLDYAGQDVILYPCHGSKGNQYWH 607
Query: 286 YD 287
Y+
Sbjct: 608 YN 609
>gi|350426661|ref|XP_003494505.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Bombus impatiens]
Length = 602
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 165/329 (50%), Positives = 198/329 (60%), Gaps = 71/329 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +AR+ + VV P+I I D T E + S +GGFDWNLQ
Sbjct: 256 CECTEGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 310
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHA+PERE+KRHKN AEPVW+PTMAGGLFSID+AFF++LGTYDSGFDIWGGENLELSF
Sbjct: 311 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSF 370
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 371 K-TW----------------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 407
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ LS KG +GDV+ RK LR+ LGCKSFKWYL
Sbjct: 408 RNSIRLSEVWLDEYAKYYYQRIGHDKGKYGDVSERKALRKKLGCKSFKWYLDNVYPELFI 467
Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
EV N G C+DS + D+HKP GLYPCH+QGGNQ+WM+SK GEIRRDE
Sbjct: 468 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPAGLYPCHRQGGNQYWMLSKTGEIRRDE 527
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
+CLDY+G DVILYPCHGSKGNQ + Y+++
Sbjct: 528 SCLDYSGTDVILYPCHGSKGNQQWIYNHQ 556
>gi|91089275|ref|XP_970398.1| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
castaneum]
Length = 586
Score = 298 bits (763), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 165/328 (50%), Positives = 198/328 (60%), Gaps = 73/328 (22%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLLD +AR+ + VV P+I I D T E F S +GGFDWNLQ
Sbjct: 240 CECTTGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHF-----HDSGGVNVGGFDWNLQ 294
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHA+PE E+KRHKN AEPV++PTMAGGLFSIDK FFE+LGTYD+GFDIWGGENLELSF
Sbjct: 295 FNWHAVPEHEKKRHKNPAEPVYSPTMAGGLFSIDKKFFERLGTYDNGFDIWGGENLELSF 354
Query: 124 K-------------------------FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSID 158
K + W + R+ AE VW A
Sbjct: 355 KTWMCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLRRNSVRLAE-VWLDEYA------- 406
Query: 159 KAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-------- 210
K +++++G KGDFGD+TSRK LR LGCKSFKWYL
Sbjct: 407 KYYYQRIGNE----------------KGDFGDITSRKALREKLGCKSFKWYLDNIYPELF 450
Query: 211 ---------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD 259
E+ N G C+DS + +D+HKPVGLYPCH+QGGNQFWM SK GEIRRD
Sbjct: 451 IPGEAVASGEIRNLGIGGKTCLDSPARRSDLHKPVGLYPCHRQGGNQFWMYSKSGEIRRD 510
Query: 260 EACLDYAGGDVILYPCHGSKGNQYFEYD 287
EACLDY+G +VILYPCHGSKGNQ+++Y+
Sbjct: 511 EACLDYSGQEVILYPCHGSKGNQFWDYN 538
>gi|340723540|ref|XP_003400147.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Bombus terrestris]
gi|340723542|ref|XP_003400148.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 2 [Bombus terrestris]
Length = 602
Score = 298 bits (763), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 165/329 (50%), Positives = 198/329 (60%), Gaps = 71/329 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +AR+ + VV P+I I D T E + S +GGFDWNLQ
Sbjct: 256 CECTEGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 310
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHA+PERE+KRHKN AEPVW+PTMAGGLFSID+AFF++LGTYDSGFDIWGGENLELSF
Sbjct: 311 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSF 370
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 371 K-TW----------------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 407
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ LS KG +GDV+ RK LR+ LGCKSFKWYL
Sbjct: 408 RNSIRLSEVWLDEYAKYYYQRIGHDKGKYGDVSERKALRKKLGCKSFKWYLDNVYPELFI 467
Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
EV N G C+DS + D+HKP GLYPCH+QGGNQ+WM+SK GEIRRDE
Sbjct: 468 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPAGLYPCHRQGGNQYWMLSKTGEIRRDE 527
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
+CLDY+G DVILYPCHGSKGNQ + Y+++
Sbjct: 528 SCLDYSGTDVILYPCHGSKGNQQWIYNHQ 556
>gi|345484986|ref|XP_003425168.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 2 [Nasonia vitripennis]
Length = 610
Score = 297 bits (760), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 163/327 (49%), Positives = 196/327 (59%), Gaps = 69/327 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARN + VV P+I I D T E + S +GGFDWNLQ
Sbjct: 266 CECTEGWLEPLLDRIARNQTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 320
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHA+PERE+KRHKN AEPVW+PTMAGGLF+ID+ FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 321 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFAIDRLFFERLGTYDSGFDIWGGENLELSF 380
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 381 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 417
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ LS KG++GDV+ RK LR+NLGCKSFKWYL
Sbjct: 418 RNSIRLSEVWLDEYAKYYYQRIGHDKGNYGDVSDRKALRKNLGCKSFKWYLDNIYPELFI 477
Query: 211 --------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEAC 262
E+ + S +CIDS P D+H+ VG Y CH QGGNQ+WM+SK GEIRRDE+C
Sbjct: 478 PGEAVASGEIRHLASRLCIDSPGNPEDLHQAVGFYECHNQGGNQYWMLSKTGEIRRDESC 537
Query: 263 LDYAGGDVILYPCHGSKGNQYFEYDYK 289
LDY+G DVILYPCHGSKGNQ + Y+ +
Sbjct: 538 LDYSGTDVILYPCHGSKGNQQWTYNTQ 564
>gi|427779849|gb|JAA55376.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 683
Score = 295 bits (755), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 167/327 (51%), Positives = 197/327 (60%), Gaps = 71/327 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARNS+ VV P+I I D TFE + S +GGFDWNLQ
Sbjct: 330 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDSTFEYHY-----RDSGGVNVGGFDWNLQ 384
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+WHA+PERER+R K++ +PVW+PTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF
Sbjct: 385 FSWHAVPERERQRRKHSWDPVWSPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 444
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 445 K-TW----------------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLR 481
Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ +K GDFGDV++RK LR NL C+SF WY+
Sbjct: 482 RNSVRLAEVWLDEYKQYYYQRIGDDLGDFGDVSARKRLRDNLKCRSFDWYVRTIYPELFV 541
Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
EV N G C+DS +MHKPVG+YPCH QGGNQ+WM+SK GEIRRDE
Sbjct: 542 PGDAVASGEVRNKGQGGSSCLDSPSGRDNMHKPVGMYPCHGQGGNQYWMLSKEGEIRRDE 601
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYD 287
ACLDYAG DVILYPCHGSKGNQ + YD
Sbjct: 602 ACLDYAGSDVILYPCHGSKGNQLWIYD 628
>gi|427789023|gb|JAA59963.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 648
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 167/327 (51%), Positives = 197/327 (60%), Gaps = 71/327 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARNS+ VV P+I I D TFE + S +GGFDWNLQ
Sbjct: 295 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDSTFEYHY-----RDSGGVNVGGFDWNLQ 349
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+WHA+PERER+R K++ +PVW+PTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF
Sbjct: 350 FSWHAVPERERQRRKHSWDPVWSPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 409
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 410 K-TW----------------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLR 446
Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ +K GDFGDV++RK LR NL C+SF WY+
Sbjct: 447 RNSVRLAEVWLDEYKQYYYQRIGDDLGDFGDVSARKRLRDNLKCRSFDWYVRTIYPELFV 506
Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
EV N G C+DS +MHKPVG+YPCH QGGNQ+WM+SK GEIRRDE
Sbjct: 507 PGDAVASGEVRNKGQGGSSCLDSPSGRDNMHKPVGMYPCHGQGGNQYWMLSKEGEIRRDE 566
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYD 287
ACLDYAG DVILYPCHGSKGNQ + YD
Sbjct: 567 ACLDYAGSDVILYPCHGSKGNQLWIYD 593
>gi|328785249|ref|XP_393950.3| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Apis mellifera]
Length = 635
Score = 294 bits (752), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 171/362 (47%), Positives = 199/362 (54%), Gaps = 106/362 (29%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARN + VV P+I I D T E + S +GGFDWNLQ
Sbjct: 254 CECTEGWLEPLLDRIARNPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 308
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHA+PERE+KRHKN AEPVW+PTMAGGLFSID+AFFE+LGTYDSGFDIWGGENLELSF
Sbjct: 309 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFERLGTYDSGFDIWGGENLELSF 368
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 369 K-TW----------------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 405
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ LS KG +GDV+ RK LR+ LGCKSFKWYL
Sbjct: 406 RNSIRLSEVWLDEYAKYYYQRIGHDKGKYGDVSERKALRKRLGCKSFKWYLDNVYPELFI 465
Query: 211 --------EVSNDWSG-------------------------------------MCIDSAC 225
EV N G MCIDS
Sbjct: 466 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPAGLYPCHRQGGNQIRHLVSSMCIDSPG 525
Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
KP D+H+PVGLYPCH+QGGNQ+WM+SK GEIRRDE+CLDY+G DVILYPCHGSKGNQ +
Sbjct: 526 KPEDLHQPVGLYPCHRQGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWI 585
Query: 286 YD 287
Y+
Sbjct: 586 YN 587
>gi|340723544|ref|XP_003400149.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 3 [Bombus terrestris]
Length = 637
Score = 293 bits (751), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 170/364 (46%), Positives = 202/364 (55%), Gaps = 106/364 (29%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +AR+ + VV P+I I D T E + S +GGFDWNLQ
Sbjct: 256 CECTEGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 310
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHA+PERE+KRHKN AEPVW+PTMAGGLFSID+AFF++LGTYDSGFDIWGGENLELSF
Sbjct: 311 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSF 370
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 371 K-TW----------------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 407
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ LS KG +GDV+ RK LR+ LGCKSFKWYL
Sbjct: 408 RNSIRLSEVWLDEYAKYYYQRIGHDKGKYGDVSERKALRKKLGCKSFKWYLDNVYPELFI 467
Query: 211 --------EVSNDWSG-------------------------------------MCIDSAC 225
EV N G MCIDSA
Sbjct: 468 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPAGLYPCHRQGGNQIRHLVSSMCIDSAG 527
Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
KP D+H+PVGLYPCH+QGGNQ+WM+SK GEIRRDE+CLDY+G DVILYPCHGSKGNQ +
Sbjct: 528 KPEDLHQPVGLYPCHRQGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWI 587
Query: 286 YDYK 289
Y+++
Sbjct: 588 YNHQ 591
>gi|350426664|ref|XP_003494506.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 2 [Bombus impatiens]
Length = 637
Score = 293 bits (751), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 170/364 (46%), Positives = 202/364 (55%), Gaps = 106/364 (29%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +AR+ + VV P+I I D T E + S +GGFDWNLQ
Sbjct: 256 CECTEGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 310
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHA+PERE+KRHKN AEPVW+PTMAGGLFSID+AFF++LGTYDSGFDIWGGENLELSF
Sbjct: 311 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSF 370
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 371 K-TW----------------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 407
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ LS KG +GDV+ RK LR+ LGCKSFKWYL
Sbjct: 408 RNSIRLSEVWLDEYAKYYYQRIGHDKGKYGDVSERKALRKKLGCKSFKWYLDNVYPELFI 467
Query: 211 --------EVSNDWSG-------------------------------------MCIDSAC 225
EV N G MCIDSA
Sbjct: 468 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPAGLYPCHRQGGNQIRHLVSSMCIDSAG 527
Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
KP D+H+PVGLYPCH+QGGNQ+WM+SK GEIRRDE+CLDY+G DVILYPCHGSKGNQ +
Sbjct: 528 KPEDLHQPVGLYPCHRQGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWI 587
Query: 286 YDYK 289
Y+++
Sbjct: 588 YNHQ 591
>gi|157114750|ref|XP_001652403.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108883556|gb|EAT47781.1| AAEL001121-PA [Aedes aegypti]
Length = 647
Score = 293 bits (749), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 164/326 (50%), Positives = 195/326 (59%), Gaps = 71/326 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLLD +ARNS+ VV P+I I D+T E + S +GGFDWNLQ
Sbjct: 301 CECTTGWLEPLLDRIARNSTTVVCPVIDVIDDNTMEYHY-----RDSGGVNVGGFDWNLQ 355
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHA+P+RE+KRHK+ AEPV++PTMAGGLFSIDK FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 356 FNWHAVPDREKKRHKSTAEPVFSPTMAGGLFSIDKEFFERLGTYDSGFDIWGGENLELSF 415
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y +G ++
Sbjct: 416 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRTGVNVIK 452
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ KGD+GDV+ RK+LR NLGCK F+WYL
Sbjct: 453 RNSVRLAEVWLDEYAKYYYQRIGNDKGDYGDVSERKQLRENLGCKPFRWYLDNIFPELFI 512
Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
EV N G C+D+ ++ KPVGLYPCH QGGNQ+WM+SK GEIRRDE
Sbjct: 513 PGEAVASGEVRNMGYGNRTCLDAPGGKKNLRKPVGLYPCHNQGGNQYWMLSKTGEIRRDE 572
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
ACLDYAG DVILYPCHGSKGNQY+ Y
Sbjct: 573 ACLDYAGQDVILYPCHGSKGNQYWNY 598
>gi|380021258|ref|XP_003694487.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Apis florea]
Length = 537
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 171/362 (47%), Positives = 199/362 (54%), Gaps = 106/362 (29%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARN + VV P+I I D T E + S +GGFDWNLQ
Sbjct: 156 CECTEGWLEPLLDRIARNPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 210
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHA+PERE+KRHKN AEPVW+PTMAGGLFSID+AFFE+LGTYDSGFDIWGGENLELSF
Sbjct: 211 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFERLGTYDSGFDIWGGENLELSF 270
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 271 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 307
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ LS KG +GDV+ RK LR+ LGCKSFKWYL
Sbjct: 308 RNSIRLSEVWLDEYAKYYYQRIGHDKGKYGDVSERKALRKRLGCKSFKWYLDNVYPELFI 367
Query: 211 --------EVSNDWSG-------------------------------------MCIDSAC 225
EV N G MCIDS
Sbjct: 368 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPAGLYPCHRQGGNQIRHLVSSMCIDSPG 427
Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
KP D+H+PVGLYPCH+QGGNQ+WM+SK GEIRRDE+CLDY+G DVILYPCHGSKGNQ +
Sbjct: 428 KPEDLHQPVGLYPCHRQGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWI 487
Query: 286 YD 287
Y+
Sbjct: 488 YN 489
>gi|383857913|ref|XP_003704448.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Megachile rotundata]
Length = 638
Score = 291 bits (744), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 169/362 (46%), Positives = 200/362 (55%), Gaps = 106/362 (29%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +AR+ + VV P+I I D T E + S +GGFDWNLQ
Sbjct: 257 CECTEGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 311
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHA+PERE+KRHKN AEPVW+PTMAGGLFSID+AFFE+LGTYDSGFDIWGGENLELSF
Sbjct: 312 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFERLGTYDSGFDIWGGENLELSF 371
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 372 K-TW----------------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 408
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ LS KG++GDV+ RK LR+ LGCKSFKWYL
Sbjct: 409 RNSIRLSEVWLDEYAKYYYQRIGHDKGNYGDVSDRKALRKKLGCKSFKWYLDNVYPELFI 468
Query: 211 --------EVSNDWSG-------------------------------------MCIDSAC 225
EV N G +CIDS
Sbjct: 469 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPAGLYPCHRQGGNQIRHLVSSICIDSPG 528
Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
KP D+H+PVGLYPCH+QGGNQ+WM+SK GEIRRDE+CLDY+G DVILYPCHGSKGNQ +
Sbjct: 529 KPEDLHQPVGLYPCHRQGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWI 588
Query: 286 YD 287
Y+
Sbjct: 589 YN 590
>gi|328713087|ref|XP_001951943.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Acyrthosiphon pisum]
Length = 674
Score = 289 bits (740), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 168/362 (46%), Positives = 202/362 (55%), Gaps = 105/362 (29%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +AR +S VV P+I I D T E + + +GGFDWNLQ
Sbjct: 288 CECTEGWLEPLLDRIAREASTVVCPVIDVIDDSTLEFHY-----RDAGGVNVGGFDWNLQ 342
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWH +P++E+KRHKNAAEPVW+PTMAGGLF+IDK FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 343 FNWHVVPDKEKKRHKNAAEPVWSPTMAGGLFAIDKKFFERLGTYDSGFDIWGGENLELSF 402
Query: 124 K-------------------------FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSID 158
K + W +K AE VW A
Sbjct: 403 KTWMCGGTLEIVPCSHVGHIFRKRSPYKWRTGVNVLKKNSIRLAE-VWMDDYA------- 454
Query: 159 KAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-------- 210
K ++E++G D+ GD+GD+TSRK+LRR L CKSFKWYL
Sbjct: 455 KYYYERIGN-----DL-----------GDYGDITSRKDLRRKLKCKSFKWYLENIYPELF 498
Query: 211 ---------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHK------------------ 241
EV N G C+DS + TD++KP GLYPCHK
Sbjct: 499 IPGDAVASGEVRNLGYGNKTCLDSPARKTDLNKPAGLYPCHKMGGNQIKNIVSNMCVDSK 558
Query: 242 --------------QGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEYD 287
QGGNQ+WM+SK GEIRRDE+CLDYAG DVILYPCHGSKGNQY+ YD
Sbjct: 559 GDANKPVDLWQCHQQGGNQYWMLSKIGEIRRDESCLDYAGNDVILYPCHGSKGNQYWNYD 618
Query: 288 YK 289
+K
Sbjct: 619 HK 620
>gi|312379012|gb|EFR25425.1| hypothetical protein AND_09241 [Anopheles darlingi]
Length = 671
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 160/326 (49%), Positives = 194/326 (59%), Gaps = 71/326 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLLD +ARNS+ VV P+I I D+T E + S +GGFDWNLQ
Sbjct: 325 CECTTGWLEPLLDRIARNSTTVVCPVIDVIDDNTMEYHY-----RDSGGVNVGGFDWNLQ 379
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHA+PERE+++HK+AAEPVW+PTMAGGLF+ID+ FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 380 FNWHAVPEREKRKHKSAAEPVWSPTMAGGLFAIDRVFFERLGTYDSGFDIWGGENLELSF 439
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y +G ++
Sbjct: 440 K--------------------TW---MCGGSLEIIPCSHVGHIFRKRSPYKWRTGVNVIK 476
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ KGDFGDV+SRK+LR L CK F+WYL
Sbjct: 477 RNSVRLAEVWMDEYAQYYYQRIGNDKGDFGDVSSRKKLREELHCKPFRWYLDNIYPELFV 536
Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
EV N G C+D+ ++ K VGLYPCH QGGNQ+WM+SK GEIRRDE
Sbjct: 537 PGDAVASGEVRNMGYGNRTCLDAPAGKRNLRKAVGLYPCHNQGGNQYWMLSKTGEIRRDE 596
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
ACLDYAG DV+LYPCHGS+GNQY+ Y
Sbjct: 597 ACLDYAGDDVVLYPCHGSRGNQYWNY 622
>gi|332019618|gb|EGI60096.1| Putative polypeptide N-acetylgalactosaminyltransferase 9
[Acromyrmex echinatior]
Length = 566
Score = 287 bits (734), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 166/362 (45%), Positives = 200/362 (55%), Gaps = 106/362 (29%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +AR+ + VV P+I I D T E + S +GGFDWNLQ
Sbjct: 185 CECTEGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHW-----RDSSGVNVGGFDWNLQ 239
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHA+PERERKRHKN AEPVW+PTMAGGLFSID+AFFE++GTYDSGFDIWGGENLELSF
Sbjct: 240 FNWHAVPERERKRHKNPAEPVWSPTMAGGLFSIDRAFFERIGTYDSGFDIWGGENLELSF 299
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y +G ++
Sbjct: 300 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRNGVNVLK 336
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ LS KG++GD++ RK LR+ LGCKSFKWYL
Sbjct: 337 RNSIRLSEVWLDEYAKYYYQRIGHDKGNYGDISERKALRKKLGCKSFKWYLDNVYPELFI 396
Query: 211 --------EVSNDWSG-------------------------------------MCIDSAC 225
EV N G MCIDS+
Sbjct: 397 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPCGLYPCHRQGGNQIRQVTSGMCIDSSG 456
Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
K D+H+PVG+YPCH+QGGNQ+WM+SK GEIRRDE+CLDY+G DVILYPCHGSKGNQ +
Sbjct: 457 KIEDLHQPVGMYPCHRQGGNQYWMLSKTGEIRRDESCLDYSGSDVILYPCHGSKGNQQWI 516
Query: 286 YD 287
Y+
Sbjct: 517 YN 518
>gi|307172175|gb|EFN63700.1| Putative polypeptide N-acetylgalactosaminyltransferase 9
[Camponotus floridanus]
Length = 433
Score = 284 bits (727), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 167/361 (46%), Positives = 198/361 (54%), Gaps = 106/361 (29%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +AR+ + VV P+I I D T E + S +GGFDWNLQ
Sbjct: 52 CECTEGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 106
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHA+PERE+KRHKN AEPVW+PTMAGGLFSID+AFFE++GTYDSGFDIWGGENLELSF
Sbjct: 107 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFERIGTYDSGFDIWGGENLELSF 166
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 167 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 203
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ LS KG++GDV+ RK LR+ LGCKSFKWYL
Sbjct: 204 RNSIRLSEVWLDEYAKYYYQRIGHDKGNYGDVSERKTLRKKLGCKSFKWYLDNIYPELFI 263
Query: 211 --------EVSNDWSG-------------------------------------MCIDSAC 225
EV N G +CIDS
Sbjct: 264 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPCGLYPCHRQGGNQIRQIASGICIDSPG 323
Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
K D+H+PVGLYPCH+QGGNQ+WM+SK GEIRRDE+CLDY+G DVILYPCHGSKGNQ +
Sbjct: 324 KSEDLHQPVGLYPCHRQGGNQYWMLSKTGEIRRDESCLDYSGSDVILYPCHGSKGNQQWI 383
Query: 286 Y 286
Y
Sbjct: 384 Y 384
>gi|270011456|gb|EFA07904.1| hypothetical protein TcasGA2_TC005479 [Tribolium castaneum]
Length = 621
Score = 284 bits (726), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 168/362 (46%), Positives = 197/362 (54%), Gaps = 106/362 (29%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLLD +AR+ + VV P+I I D T E F S +GGFDWNLQ
Sbjct: 240 CECTTGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHF-----HDSGGVNVGGFDWNLQ 294
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHA+PE E+KRHKN AEPV++PTMAGGLFSIDK FFE+LGTYD+GFDIWGGENLELSF
Sbjct: 295 FNWHAVPEHEKKRHKNPAEPVYSPTMAGGLFSIDKKFFERLGTYDNGFDIWGGENLELSF 354
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 355 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLR 391
Query: 177 GENLELS-----------------FKGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ KGDFGD+TSRK LR LGCKSFKWYL
Sbjct: 392 RNSVRLAEVWLDEYAKYYYQRIGNEKGDFGDITSRKALREKLGCKSFKWYLDNIYPELFI 451
Query: 211 --------EVSNDWSG--MCID-----------------------------------SAC 225
E+ N G C+D S C
Sbjct: 452 PGEAVASGEIRNLGIGGKTCLDSPARRSDLHKPVGLYPCHRQGGNQISVLDRELCIDSPC 511
Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
KP D+H P+GL+PCHKQGGNQFWM SK GEIRRDEACLDY+G +VILYPCHGSKGNQ+++
Sbjct: 512 KPEDLHNPIGLWPCHKQGGNQFWMYSKSGEIRRDEACLDYSGQEVILYPCHGSKGNQFWD 571
Query: 286 YD 287
Y+
Sbjct: 572 YN 573
>gi|307203928|gb|EFN82835.1| Putative polypeptide N-acetylgalactosaminyltransferase 9
[Harpegnathos saltator]
Length = 482
Score = 283 bits (725), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 166/362 (45%), Positives = 198/362 (54%), Gaps = 106/362 (29%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +AR+ + VV P+I I D T E + S +GGFDWNLQ
Sbjct: 101 CECTEGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 155
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHA+PERE+KRHKN AEPVW+PTMAGGLFSID+ FFE++GTYDSGFDIWGGENLELSF
Sbjct: 156 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFSIDRVFFERIGTYDSGFDIWGGENLELSF 215
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 216 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 252
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ LS KG++GDV+ RK LR+ LGCKSFKWYL
Sbjct: 253 RNSIRLSEVWLDEYAKYYYQRIGHDKGNYGDVSERKTLRKKLGCKSFKWYLDNVYPELFI 312
Query: 211 --------EVSNDWSG-------------------------------------MCIDSAC 225
EV N G +CIDS
Sbjct: 313 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPCGLYPCHRQGGNQIRQVASGICIDSPG 372
Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
K D+H+PVGLYPCH+QGGNQ+WM+SK GEIRRDE+CLDY+G DVILYPCHGSKGNQ +
Sbjct: 373 KSEDLHQPVGLYPCHRQGGNQYWMLSKTGEIRRDESCLDYSGSDVILYPCHGSKGNQQWI 432
Query: 286 YD 287
Y+
Sbjct: 433 YN 434
>gi|345484988|ref|XP_001605337.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Nasonia vitripennis]
Length = 646
Score = 283 bits (725), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 166/364 (45%), Positives = 198/364 (54%), Gaps = 106/364 (29%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARN + VV P+I I D T E + S +GGFDWNLQ
Sbjct: 265 CECTEGWLEPLLDRIARNQTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 319
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHA+PERE+KRHKN AEPVW+PTMAGGLF+ID+ FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 320 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFAIDRLFFERLGTYDSGFDIWGGENLELSF 379
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 380 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 416
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ LS KG++GDV+ RK LR+NLGCKSFKWYL
Sbjct: 417 RNSIRLSEVWLDEYAKYYYQRIGHDKGNYGDVSDRKALRKNLGCKSFKWYLDNIYPELFI 476
Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHK------------------- 241
EV N G C+DS + D+HKP GLYPCH+
Sbjct: 477 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPAGLYPCHRQGGNQIRHLASRLCIDSPG 536
Query: 242 ----------------QGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
QGGNQ+WM+SK GEIRRDE+CLDY+G DVILYPCHGSKGNQ +
Sbjct: 537 NPEDLHQAVGFYECHNQGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWT 596
Query: 286 YDYK 289
Y+ +
Sbjct: 597 YNTQ 600
>gi|195124241|ref|XP_002006602.1| GI18492 [Drosophila mojavensis]
gi|193911670|gb|EDW10537.1| GI18492 [Drosophila mojavensis]
Length = 670
Score = 280 bits (716), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 158/326 (48%), Positives = 192/326 (58%), Gaps = 71/326 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARNS+ VV P+I I D T E + S +GGFDWNLQ
Sbjct: 324 CECAEGWLEPLLDRIARNSTTVVCPVIDVIDDTTLEFHY-----RDSSGVNVGGFDWNLQ 378
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+WHA+PERE+KRH + +EPV++PTMAGGLFSID+ FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 379 FSWHAVPEREKKRHNSTSEPVYSPTMAGGLFSIDRKFFERLGTYDSGFDIWGGENLELSF 438
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y +G ++
Sbjct: 439 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRTGVNVLK 475
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ KGDFGDV+ RK+LR +L CKSFKWYL
Sbjct: 476 KNSVRLAEVWMDDYAKYYYQRIGMDKGDFGDVSERKKLREDLQCKSFKWYLDNVYPELFI 535
Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
E+ N G C+DS + KPVGLYPCH+QGGNQ+WM SK GEIRRD+
Sbjct: 536 PGDAVANGEMRNLGYGGRTCLDSPSGKRYLKKPVGLYPCHRQGGNQYWMFSKTGEIRRDQ 595
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
ACLDYAG DVIL+ CHGSKGNQ++ Y
Sbjct: 596 ACLDYAGKDVILFGCHGSKGNQFWTY 621
>gi|195425498|ref|XP_002061038.1| GK10725 [Drosophila willistoni]
gi|194157123|gb|EDW72024.1| GK10725 [Drosophila willistoni]
Length = 644
Score = 277 bits (709), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 155/326 (47%), Positives = 192/326 (58%), Gaps = 71/326 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARNS+ VV P+I I DDT E + S +GGFDWNLQ
Sbjct: 298 CECTEGWLEPLLDRIARNSTTVVCPVIDVINDDTLEYHY-----RDSTGVNVGGFDWNLQ 352
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+WHA+PERE+KRH ++AEPV++PTMAGGLFSID+ FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 353 FSWHAVPEREKKRHNSSAEPVYSPTMAGGLFSIDRDFFERLGTYDSGFDIWGGENLELSF 412
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 413 K-TW----------------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLR 449
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ KGD+GDV+ RK+LR +L CKSF+WYL
Sbjct: 450 KNSVRLAEVWMDDYAQYYYHRIGNDKGDWGDVSDRKKLREDLQCKSFRWYLDNIYPELFI 509
Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
E+ N G C+D+ + K VG YPCH+QGGNQ+WM+SK GEIRRD+
Sbjct: 510 PGDAVAHGEIKNLGYGGRTCMDAPAGKKHLKKSVGTYPCHRQGGNQYWMLSKAGEIRRDD 569
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
+CLDYAG DV LY CHGSKGNQ++ Y
Sbjct: 570 SCLDYAGKDVTLYACHGSKGNQFWTY 595
>gi|357619954|gb|EHJ72323.1| putative UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Danaus plexippus]
Length = 533
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 156/361 (43%), Positives = 191/361 (52%), Gaps = 105/361 (29%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARN ++VV P+I I D+T E + S +GGFDWNLQ
Sbjct: 146 CECTEGWLEPLLDRIARNKTNVVCPVIDVIDDNTLEYHY-----RDSTSVNVGGFDWNLQ 200
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWH +P RER RHK+ AEPVW+PTMAGGLF+IDK FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 201 FNWHPVPARERARHKHTAEPVWSPTMAGGLFAIDKEFFERLGTYDSGFDIWGGENLELSF 260
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y +G ++
Sbjct: 261 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRTGVNVLK 297
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ KGD+GD++ RKELR L CKSF WYL
Sbjct: 298 KNSVRLAEVWLDDYSKYYYQRVGNDKGDYGDISGRKELREKLKCKSFDWYLKNIYPELFI 357
Query: 211 --------------------------------------------EVSNDWSGMCIDSACK 226
+++N S MC+DSA
Sbjct: 358 PGESVAHGEIRNIGFERTCLDSPTRKSDHHKPVGLYPCHRQGGNQIANPSSDMCVDSAAG 417
Query: 227 PTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
P DM KPV +PCH + GNQ+WM SK+GEIRRDE CLDY+G DV+LYPCHG+KGNQ + Y
Sbjct: 418 PEDMKKPVNPWPCHGEYGNQYWMYSKNGEIRRDETCLDYSGHDVVLYPCHGAKGNQLWLY 477
Query: 287 D 287
D
Sbjct: 478 D 478
>gi|198461537|ref|XP_002139017.1| GA25136 [Drosophila pseudoobscura pseudoobscura]
gi|198137372|gb|EDY69575.1| GA25136 [Drosophila pseudoobscura pseudoobscura]
Length = 658
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 155/326 (47%), Positives = 189/326 (57%), Gaps = 71/326 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARNS+ VV P+I I DDT E + S +GGFDWNLQ
Sbjct: 312 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDDTLEYHY-----RDSSGVNVGGFDWNLQ 366
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+WHA+PERE+KRH + AEPV++PTMAGGLFSID+ +F +LGTYDSGFDIWGGENLELSF
Sbjct: 367 FSWHAVPEREKKRHNSTAEPVYSPTMAGGLFSIDREYFNRLGTYDSGFDIWGGENLELSF 426
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 427 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLR 463
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ KGD+GDV+ RK+LR +L CKSFKWYL
Sbjct: 464 KNSVRLAEVWMDEYSQYYYHRIGNDKGDWGDVSDRKKLREDLQCKSFKWYLDNIYPELFI 523
Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
E+ N G C+DS K VGLYPCH+QGGNQ+WM+SK GEIRRD+
Sbjct: 524 PGDAVAHGEIRNLGYGGRTCLDSPTGKKHQKKAVGLYPCHRQGGNQYWMLSKVGEIRRDD 583
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
CLDYAG +VILY CHG KGNQ++ Y
Sbjct: 584 YCLDYAGKEVILYSCHGGKGNQFWTY 609
>gi|195171653|ref|XP_002026618.1| GL11821 [Drosophila persimilis]
gi|194111544|gb|EDW33587.1| GL11821 [Drosophila persimilis]
Length = 658
Score = 273 bits (699), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 155/326 (47%), Positives = 189/326 (57%), Gaps = 71/326 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARNS+ VV P+I I DDT E + S +GGFDWNLQ
Sbjct: 312 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDDTLEYHY-----RDSSGVNVGGFDWNLQ 366
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+WHA+PERE+KRH + AEPV++PTMAGGLFSID+ +F +LGTYDSGFDIWGGENLELSF
Sbjct: 367 FSWHAVPEREKKRHNSTAEPVYSPTMAGGLFSIDREYFNRLGTYDSGFDIWGGENLELSF 426
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 427 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLR 463
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ KGD+GDV+ RK+LR +L CKSFKWYL
Sbjct: 464 KNSVRLAEVWMDEYSQYYYHRIGNDKGDWGDVSDRKKLREDLQCKSFKWYLDNIYPELFI 523
Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
E+ N G C+DS K VGLYPCH+QGGNQ+WM+SK GEIRRD+
Sbjct: 524 PGDAVAHGEIRNLGYGGRTCLDSPTGKKHQKKAVGLYPCHRQGGNQYWMLSKVGEIRRDD 583
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
CLDYAG +VILY CHG KGNQ++ Y
Sbjct: 584 YCLDYAGKEVILYSCHGGKGNQFWTY 609
>gi|161077154|ref|NP_725603.2| CG30463, isoform B [Drosophila melanogaster]
gi|161077156|ref|NP_001097341.1| CG30463, isoform C [Drosophila melanogaster]
gi|157400365|gb|AAF57964.3| CG30463, isoform B [Drosophila melanogaster]
gi|157400366|gb|ABV53822.1| CG30463, isoform C [Drosophila melanogaster]
Length = 647
Score = 271 bits (693), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 151/324 (46%), Positives = 192/324 (59%), Gaps = 70/324 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARNS+ VV P+I I D+T E + S +GGFDWNLQ
Sbjct: 304 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHY-----RDSGGVNVGGFDWNLQ 358
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+WH +PERERKRH + AEPV++PTMAGGLFSID+ FF++LGTYDSGFDIWGGENLELSF
Sbjct: 359 FSWHPVPERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSF 418
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 419 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 455
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ KGD+GDV+ R++LR +L CKSFKWYL
Sbjct: 456 KNSVRLAEVWMDEYSQYYYHRIGNDKGDWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFI 515
Query: 211 --------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEAC 262
E++N +GMC+D+ K ++ PV +Y CH QGGNQ+WM+SK GEIRRD++C
Sbjct: 516 PGDSVAHGEIANVPNGMCLDAKEK-SEEETPVSIYECHGQGGNQYWMLSKAGEIRRDDSC 574
Query: 263 LDYAGGDVILYPCHGSKGNQYFEY 286
LDYAG DV L+ CHG KGNQ++ Y
Sbjct: 575 LDYAGKDVTLFGCHGGKGNQFWTY 598
>gi|195380503|ref|XP_002049010.1| GJ21354 [Drosophila virilis]
gi|194143807|gb|EDW60203.1| GJ21354 [Drosophila virilis]
Length = 693
Score = 269 bits (687), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 159/359 (44%), Positives = 194/359 (54%), Gaps = 104/359 (28%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARNS+ VV P+I I D T E + S +GGFDWNLQ
Sbjct: 314 CECAEGWLEPLLDRIARNSTTVVCPVIDVIDDTTLEFHY-----RDSSGVNVGGFDWNLQ 368
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+WHA+PERE++RH N AEPV++PTMAGGLFSID+ FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 369 FSWHAVPEREKRRHNNTAEPVYSPTMAGGLFSIDREFFERLGTYDSGFDIWGGENLELSF 428
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y +G ++
Sbjct: 429 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRTGVNVLK 465
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ KGD+GDV+ RK+LR +L CKSFKWYL
Sbjct: 466 KNSVRLAEVWMDDYSKYYLQRIGMDKGDYGDVSERKKLREDLQCKSFKWYLDNIYPELFI 525
Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGN--------------- 245
E+ N G C+DS +M KPVGLYPCHKQGGN
Sbjct: 526 PGDAVANGEIRNLGYGGRTCLDSPTGKRNMKKPVGLYPCHKQGGNQIKSINTDMCVDAPK 585
Query: 246 ------------------QFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
Q+WM+SK GEIRRD++CLDYAG DVIL+ CHGSKGNQ++ Y
Sbjct: 586 TGDESPVGVYPCHGQGGHQYWMLSKAGEIRRDQSCLDYAGKDVILFGCHGSKGNQFWTY 644
>gi|195584006|ref|XP_002081807.1| GD25523 [Drosophila simulans]
gi|194193816|gb|EDX07392.1| GD25523 [Drosophila simulans]
Length = 650
Score = 268 bits (685), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 151/326 (46%), Positives = 188/326 (57%), Gaps = 71/326 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARNS+ VV P+I I D+T E + S +GGFDWNLQ
Sbjct: 304 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHY-----RDSGGVNVGGFDWNLQ 358
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+WH +PERERKRH + AEPV++PTMAGGLFSID+ FF++LGTYDSGFDIWGGENLELSF
Sbjct: 359 FSWHPVPERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSF 418
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 419 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 455
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ KGD+GDV+ R++LR +L CKSFKWYL
Sbjct: 456 KNSVRLAEVWMDEYSQYYYHRIGNDKGDWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFI 515
Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
E+ N G C+D+ K VG YPCH+QGGNQ+WM+SK GEIRRD+
Sbjct: 516 PGDSVAHGEIRNLGYGGRTCLDAPAGKKHQKKAVGTYPCHRQGGNQYWMLSKAGEIRRDD 575
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
+CLDYAG DV L+ CHG KGNQ++ Y
Sbjct: 576 SCLDYAGKDVTLFGCHGGKGNQFWTY 601
>gi|24654219|ref|NP_725602.1| CG30463, isoform A [Drosophila melanogaster]
gi|161077158|ref|NP_001097342.1| CG30463, isoform D [Drosophila melanogaster]
gi|51316018|sp|Q8MRC9.2|GALT9_DROME RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase 9; Short=pp-GaNTase 9;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 9; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 9
gi|21627105|gb|AAF57966.2| CG30463, isoform A [Drosophila melanogaster]
gi|157400367|gb|ABV53823.1| CG30463, isoform D [Drosophila melanogaster]
Length = 650
Score = 268 bits (685), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 151/326 (46%), Positives = 188/326 (57%), Gaps = 71/326 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARNS+ VV P+I I D+T E + S +GGFDWNLQ
Sbjct: 304 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHY-----RDSGGVNVGGFDWNLQ 358
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+WH +PERERKRH + AEPV++PTMAGGLFSID+ FF++LGTYDSGFDIWGGENLELSF
Sbjct: 359 FSWHPVPERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSF 418
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 419 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 455
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ KGD+GDV+ R++LR +L CKSFKWYL
Sbjct: 456 KNSVRLAEVWMDEYSQYYYHRIGNDKGDWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFI 515
Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
E+ N G C+D+ K VG YPCH+QGGNQ+WM+SK GEIRRD+
Sbjct: 516 PGDSVAHGEIRNLGYGGRTCLDAPAGKKHQKKAVGTYPCHRQGGNQYWMLSKAGEIRRDD 575
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
+CLDYAG DV L+ CHG KGNQ++ Y
Sbjct: 576 SCLDYAGKDVTLFGCHGGKGNQFWTY 601
>gi|21464370|gb|AAM51988.1| RE10344p [Drosophila melanogaster]
Length = 650
Score = 267 bits (683), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 151/326 (46%), Positives = 188/326 (57%), Gaps = 71/326 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARNS+ VV P+I I D+T E + S +GGFDWNLQ
Sbjct: 304 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHY-----RDSGGVNVGGFDWNLQ 358
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+WH +PERERKRH + AEPV++PTMAGGLFSID+ FF++LGTYDSGFDIWGGENLELSF
Sbjct: 359 FSWHPVPERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSF 418
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 419 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVPK 455
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ KGD+GDV+ R++LR +L CKSFKWYL
Sbjct: 456 KNSVRLAEVWMDEYSQCYYHRIGNDKGDWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFI 515
Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
E+ N G C+D+ K VG YPCH+QGGNQ+WM+SK GEIRRD+
Sbjct: 516 PGDSVAHGEIRNLGYGGRTCLDAPAGKKHQKKAVGTYPCHRQGGNQYWMLSKAGEIRRDD 575
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
+CLDYAG DV L+ CHG KGNQ++ Y
Sbjct: 576 SCLDYAGKDVTLFGCHGGKGNQFWTY 601
>gi|195335001|ref|XP_002034165.1| GM20039 [Drosophila sechellia]
gi|194126135|gb|EDW48178.1| GM20039 [Drosophila sechellia]
Length = 650
Score = 266 bits (681), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 150/326 (46%), Positives = 188/326 (57%), Gaps = 71/326 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARNS+ VV P+I I D+T E + S +GGFDWNLQ
Sbjct: 304 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHY-----RDSGGVNVGGFDWNLQ 358
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+WH +PERERKRH + AEPV++PTMAGGLFSID+ FF++LGTYDSGFDIWGGENLELSF
Sbjct: 359 FSWHPVPERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSF 418
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 419 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 455
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ KG++GDV+ R++LR +L CKSFKWYL
Sbjct: 456 KNSVRLAEVWMDEYSQYYYHRIGNDKGNWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFI 515
Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
E+ N G C+D+ K VG YPCH+QGGNQ+WM+SK GEIRRD+
Sbjct: 516 PGDSVAHGEIRNLGYGGRTCLDAPAGKKHQKKAVGTYPCHRQGGNQYWMLSKAGEIRRDD 575
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
+CLDYAG DV L+ CHG KGNQ++ Y
Sbjct: 576 SCLDYAGKDVTLFGCHGGKGNQFWTY 601
>gi|3047195|gb|AAC13673.1| GLY5c [Caenorhabditis elegans]
Length = 624
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 147/326 (45%), Positives = 189/326 (57%), Gaps = 69/326 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + W++PLLD + R+ + VV P+I I D+TFE TS +GGFDW LQ
Sbjct: 271 CECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTS-----VGGFDWGLQ 325
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWH+IPER+RK +PV +PTMAGGLFSIDK +FEKLGTYD GFDIWGGENLELSF
Sbjct: 326 FNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGENLELSF 385
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K +W M GG I F K Y +G ++
Sbjct: 386 K--------------------IW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLK 422
Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ +K GDFGD++SRK+LR +LGCKSFKWYL
Sbjct: 423 RNSIRLAEVWLDDYKTYYYERINNQLGDFGDISSRKKLREDLGCKSFKWYLDNIYPELFV 482
Query: 211 --------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEAC 262
E+ N + C+DSA +K + YPCH+QGGNQ+WM+SK GEIRRDE+C
Sbjct: 483 PGESVAKGELRNAQTSQCLDSAVGEEVENKAITPYPCHEQGGNQYWMLSKDGEIRRDESC 542
Query: 263 LDYAGGDVILYPCHGSKGNQYFEYDY 288
+DYAG DV+++PCHG KGNQ + Y++
Sbjct: 543 VDYAGSDVMVFPCHGMKGNQEWRYNH 568
>gi|71993517|ref|NP_001022852.1| Protein GLY-5, isoform c [Caenorhabditis elegans]
gi|14530627|emb|CAC42369.1| Protein GLY-5, isoform c [Caenorhabditis elegans]
Length = 624
Score = 265 bits (678), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 147/326 (45%), Positives = 189/326 (57%), Gaps = 69/326 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + W++PLLD + R+ + VV P+I I D+TFE TS +GGFDW LQ
Sbjct: 271 CECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTS-----VGGFDWGLQ 325
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWH+IPER+RK +PV +PTMAGGLFSIDK +FEKLGTYD GFDIWGGENLELSF
Sbjct: 326 FNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSF 385
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K +W M GG I F K Y +G ++
Sbjct: 386 K--------------------IW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLK 422
Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ +K GDFGD++SRK+LR +LGCKSFKWYL
Sbjct: 423 RNSIRLAEVWLDDYKTYYYERINNQLGDFGDISSRKKLREDLGCKSFKWYLDNIYPELFV 482
Query: 211 --------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEAC 262
E+ N + C+DSA +K + YPCH+QGGNQ+WM+SK GEIRRDE+C
Sbjct: 483 PGESVAKGELRNAQTSQCLDSAVGEEVENKAITPYPCHEQGGNQYWMLSKDGEIRRDESC 542
Query: 263 LDYAGGDVILYPCHGSKGNQYFEYDY 288
+DYAG DV+++PCHG KGNQ + Y++
Sbjct: 543 VDYAGSDVMVFPCHGMKGNQEWRYNH 568
>gi|324507488|gb|ADY43175.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Ascaris suum]
Length = 632
Score = 264 bits (674), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 151/333 (45%), Positives = 189/333 (56%), Gaps = 76/333 (22%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + W++PLLD + RNSS VV P+I I D+TFE + T+ +GGFDW+LQ
Sbjct: 277 CECMEGWIEPLLDRIKRNSSTVVCPVIDVIDDETFEYHYSKAYFTN-----VGGFDWSLQ 331
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHAIPER+RK K +PV +PTMAGGLFSID+A+FEKLGTYD GFDIWGGENLELSF
Sbjct: 332 FNWHAIPERDRKNRKRHIDPVRSPTMAGGLFSIDRAYFEKLGTYDPGFDIWGGENLELSF 391
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K +W M GG I F K Y +G ++
Sbjct: 392 K--------------------IW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLK 428
Query: 177 GENLELS-----------------FKGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ GD+GDV+ RK LR L CKSFKWYL
Sbjct: 429 KNSVRLAEVWLDEYKVYYYERINNQTGDYGDVSDRKALRERLKCKSFKWYLDNIYPELFV 488
Query: 211 --------EVSN------DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI 256
EV N + C+DS D+HK V YPCH QGGNQ+WM+SK GEI
Sbjct: 489 PGDSVAKGEVRNYGYKEGGGAPQCLDSVVG-EDVHKDVTPYPCHGQGGNQYWMLSKDGEI 547
Query: 257 RRDEACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
RRDE+C+DYAG +V+++PCHG KGNQ + Y++K
Sbjct: 548 RRDESCIDYAGANVMIFPCHGMKGNQEWRYNHK 580
>gi|195057673|ref|XP_001995302.1| GH22705 [Drosophila grimshawi]
gi|193899508|gb|EDV98374.1| GH22705 [Drosophila grimshawi]
Length = 693
Score = 262 bits (669), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 155/360 (43%), Positives = 194/360 (53%), Gaps = 105/360 (29%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARNS+ VV P+I I D T E + S +GGFDWNLQ
Sbjct: 313 CECAEGWLEPLLDRIARNSTTVVCPVIDVIDDATLEFHY-----RDSSGVNVGGFDWNLQ 367
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+WH++PERE+KRH + +EPV++PTMAGGLFSID+ FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 368 FSWHSVPEREKKRHNSTSEPVYSPTMAGGLFSIDREFFERLGTYDSGFDIWGGENLELSF 427
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y +G ++
Sbjct: 428 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRTGVNVLK 464
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ KGDFGDV+ RK+LR +L CKSF+WYL
Sbjct: 465 KNSVRLAEVWMDDYSKYYYQRIGMDKGDFGDVSDRKKLREDLQCKSFQWYLDTIYPELFI 524
Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGN--------------- 245
E+ N G C+DS ++ K VGLYPCHKQGGN
Sbjct: 525 PGNAVANGEIRNLGYGGRTCLDSPSGKRNLKKAVGLYPCHKQGGNQIRNINTNMCLDAML 584
Query: 246 -------------------QFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
Q+WM+SK GEIRRD+ACLDYAG DVIL+ CHGS+GNQ+++Y
Sbjct: 585 KNEDESPVGVYECHGQGGHQYWMLSKAGEIRRDQACLDYAGKDVILFGCHGSRGNQFWQY 644
>gi|391346483|ref|XP_003747502.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Metaseiulus occidentalis]
Length = 514
Score = 261 bits (667), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 151/322 (46%), Positives = 189/322 (58%), Gaps = 58/322 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E + WL+PLLD +A NS++VVSP+I I DDT E S +GGFDW+LQ
Sbjct: 167 VECTQGWLEPLLDRIAVNSTNVVSPVIDIIADDTLEYN-----AKESADVNVGGFDWSLQ 221
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+WH+IPER K +PV TPTMAGGLFSID+ FFE+LG YD GFDIWGGENLELSF
Sbjct: 222 FSWHSIPERILKSGYKRWQPVETPTMAGGLFSIDRKFFERLGMYDPGFDIWGGENLELSF 281
Query: 124 KFNW---------------HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA---FFEKL 165
K W H +R + ++ + ++ +D+ +FE+L
Sbjct: 282 K-TWMCGGRLEIIPCSHVGHIFRKRSPYKWRSGVNVLRRNSIRLAKVWMDEYANYYFERL 340
Query: 166 GTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL--------------- 210
G D+ GD+GD++ R LR L C SFKWY+
Sbjct: 341 GN-----DL-----------GDYGDISDRIALRDKLKCHSFKWYIDEVYPELFVPGDAIG 384
Query: 211 --EVSNDWSG-MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG 267
E+ N SG MC+DS + +HK VGLYPCH QGGNQ+W+ SK+GEIRRDEACLDYAG
Sbjct: 385 SGEMRNLGSGGMCLDSPAGKSSLHKAVGLYPCHGQGGNQYWLYSKNGEIRRDEACLDYAG 444
Query: 268 GDVILYPCHGSKGNQYFEYDYK 289
DVILYPCHGSKGNQY+ YD +
Sbjct: 445 TDVILYPCHGSKGNQYWIYDQQ 466
>gi|268576200|ref|XP_002643080.1| C. briggsae CBR-GLY-5 protein [Caenorhabditis briggsae]
Length = 630
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 148/328 (45%), Positives = 188/328 (57%), Gaps = 71/328 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + W++PLLD + R+ + VV P+I I D+TFE TS +GGFDW LQ
Sbjct: 275 CECMEGWIEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTS-----VGGFDWGLQ 329
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWH+IPER+RK A +PV +PTMAGGLFSIDK +FEKLGTYD GFDIWGGENLELSF
Sbjct: 330 FNWHSIPERDRKNRTRAIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSF 389
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K +W M GG I F K Y +G ++
Sbjct: 390 K--------------------IW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLK 426
Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ +K GDFGDV++RK+LR +LGCKSFKWYL
Sbjct: 427 RNSIRLAEVWLDDYKTYYYERINNQLGDFGDVSARKKLRSDLGCKSFKWYLDNIFPELFV 486
Query: 211 --------EVSND--WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
EV N C+D + ++PVG Y CH QGGNQ+WM+SK GEIRRDE
Sbjct: 487 PGESVAKGEVRNSAVQPARCLDCMVGRHEKNRPVGTYQCHGQGGNQYWMLSKDGEIRRDE 546
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYDY 288
+C+DYAG DV+++PCHG KGNQ + Y++
Sbjct: 547 SCVDYAGSDVMVFPCHGMKGNQEWRYNH 574
>gi|3047193|gb|AAC13672.1| GLY5b [Caenorhabditis elegans]
Length = 626
Score = 260 bits (664), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 147/328 (44%), Positives = 187/328 (57%), Gaps = 71/328 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + W++PLLD + R+ + VV P+I I D+TFE TS +GGFDW LQ
Sbjct: 271 CECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTS-----VGGFDWGLQ 325
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWH+IPER+RK +PV +PTMAGGLFSIDK +FEKLGTYD GFDIWGGENLELSF
Sbjct: 326 FNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGENLELSF 385
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K +W M GG I F K Y +G ++
Sbjct: 386 K--------------------IW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLK 422
Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ +K GDFGD++SRK+LR +LGCKSFKWYL
Sbjct: 423 RNSIRLAEVWLDDYKTYYYERINNQLGDFGDISSRKKLREDLGCKSFKWYLDNIYPELFV 482
Query: 211 --------EVSND--WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
EV N C+D + ++PVG Y CH QGGNQ+WM+SK GEIRRDE
Sbjct: 483 PGESVAKGEVRNSAVQPARCLDCMVGRHEKNRPVGTYQCHGQGGNQYWMLSKDGEIRRDE 542
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYDY 288
+C+DYAG DV+++PCHG KGNQ + Y++
Sbjct: 543 SCVDYAGSDVMVFPCHGMKGNQEWRYNH 570
>gi|71993511|ref|NP_001022850.1| Protein GLY-5, isoform a [Caenorhabditis elegans]
gi|51316068|sp|Q95ZJ1.2|GALT5_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
Short=pp-GaNTase 5; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 5; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|5824785|emb|CAB54435.1| Protein GLY-5, isoform a [Caenorhabditis elegans]
Length = 626
Score = 260 bits (664), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 147/328 (44%), Positives = 187/328 (57%), Gaps = 71/328 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + W++PLLD + R+ + VV P+I I D+TFE TS +GGFDW LQ
Sbjct: 271 CECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTS-----VGGFDWGLQ 325
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWH+IPER+RK +PV +PTMAGGLFSIDK +FEKLGTYD GFDIWGGENLELSF
Sbjct: 326 FNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSF 385
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K +W M GG I F K Y +G ++
Sbjct: 386 K--------------------IW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLK 422
Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ +K GDFGD++SRK+LR +LGCKSFKWYL
Sbjct: 423 RNSIRLAEVWLDDYKTYYYERINNQLGDFGDISSRKKLREDLGCKSFKWYLDNIYPELFV 482
Query: 211 --------EVSND--WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
EV N C+D + ++PVG Y CH QGGNQ+WM+SK GEIRRDE
Sbjct: 483 PGESVAKGEVRNSAVQPARCLDCMVGRHEKNRPVGTYQCHGQGGNQYWMLSKDGEIRRDE 542
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYDY 288
+C+DYAG DV+++PCHG KGNQ + Y++
Sbjct: 543 SCVDYAGSDVMVFPCHGMKGNQEWRYNH 570
>gi|194756744|ref|XP_001960635.1| GF13455 [Drosophila ananassae]
gi|190621933|gb|EDV37457.1| GF13455 [Drosophila ananassae]
Length = 688
Score = 260 bits (664), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 155/362 (42%), Positives = 191/362 (52%), Gaps = 107/362 (29%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARNS+ VV P+I I DDT E + S +GGFDWNLQ
Sbjct: 306 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDDTLEYHY-----RDSSGVNVGGFDWNLQ 360
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+WH++PERERKRH N+AEPV++PTMAGGLF+ID+ FF++LGTYDSGFDIWGGENLELSF
Sbjct: 361 FSWHSVPERERKRHNNSAEPVYSPTMAGGLFAIDREFFDRLGTYDSGFDIWGGENLELSF 420
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 421 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLR 457
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ KGD+GDVT RK+LR +L CKSFKWYL
Sbjct: 458 KNSVRLAEVWMDDYAQYYYHRIGNDKGDWGDVTDRKKLRADLKCKSFKWYLDNIYPELFI 517
Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHK------------------- 241
E+ N G C+D+ K VG YPCH+
Sbjct: 518 PGDSVAHGEIRNLGYGGRTCLDAPSGKKHQKKAVGTYPCHRQGGNQIANLPTGMCLDAKE 577
Query: 242 -----------------QGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYF 284
QGGNQ+WM+SK GEIRRD++CLDYAG +V LYPCHG KGNQ++
Sbjct: 578 LSTEGDDTSVSIYECHGQGGNQYWMLSKTGEIRRDDSCLDYAGKEVTLYPCHGGKGNQFW 637
Query: 285 EY 286
Y
Sbjct: 638 SY 639
>gi|322792015|gb|EFZ16120.1| hypothetical protein SINV_06269 [Solenopsis invicta]
Length = 433
Score = 259 bits (662), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 149/309 (48%), Positives = 184/309 (59%), Gaps = 55/309 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +AR+ + VV P+I I D T E + S +GGFDWNLQ
Sbjct: 107 CECTEGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 161
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHA+PERERKRHKN AEPVW+PTMAGGLFSID+AFFE++GTYDSGFDIWGGENLELSF
Sbjct: 162 FNWHAVPERERKRHKNPAEPVWSPTMAGGLFSIDRAFFERIGTYDSGFDIWGGENLELSF 221
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 222 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 258
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGM 219
++ LS KG++GDV+ RK LR+ LGCKSFKWYL+ N + +
Sbjct: 259 RNSIRLSEVWLDEYAKYYYQRIGHDKGNYGDVSERKALRKKLGCKSFKWYLD--NVYPEL 316
Query: 220 CIDSACKPTDMHKPVGLYP-CHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGS 278
I + P +Y ++ +Q+WM+SK GEIRRDE+CLDY+G DVILYPCHGS
Sbjct: 317 FIPGEAVASGEASPCRIYRGINRDRLSQYWMLSKTGEIRRDESCLDYSGSDVILYPCHGS 376
Query: 279 KGNQYFEYD 287
KGNQ + Y+
Sbjct: 377 KGNQQWIYN 385
>gi|71993513|ref|NP_001022851.1| Protein GLY-5, isoform b [Caenorhabditis elegans]
gi|14530626|emb|CAC42368.1| Protein GLY-5, isoform b [Caenorhabditis elegans]
Length = 623
Score = 254 bits (649), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 149/328 (45%), Positives = 189/328 (57%), Gaps = 74/328 (22%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + W++PLLD + R+ + VV P+I I D+TFE TS +GGFDW LQ
Sbjct: 271 CECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTS-----VGGFDWGLQ 325
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWH+IPER+RK +PV +PTMAGGLFSIDK +FEKLGTYD GFDIWGGENLELSF
Sbjct: 326 FNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSF 385
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K +W M GG I F K Y +G ++
Sbjct: 386 K--------------------IW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLK 422
Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ +K GDFGD++SRK+LR +LGCKSFKWYL
Sbjct: 423 RNSIRLAEVWLDDYKTYYYERINNQLGDFGDISSRKKLREDLGCKSFKWYLDNIYPELFV 482
Query: 211 --------EVSNDW--SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
E+ N + CID KP+ K VG+Y CH QGGNQ+WM+SK GEIRRDE
Sbjct: 483 PGESVAKGEMRNAGGKNRQCIDY--KPSG-GKTVGMYQCHNQGGNQYWMLSKDGEIRRDE 539
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYDY 288
+C+DYAG DV+++PCHG KGNQ + Y++
Sbjct: 540 SCVDYAGSDVMVFPCHGMKGNQEWRYNH 567
>gi|3047191|gb|AAC13671.1| GLY5a [Caenorhabditis elegans]
Length = 623
Score = 254 bits (648), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 149/328 (45%), Positives = 189/328 (57%), Gaps = 74/328 (22%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + W++PLLD + R+ + VV P+I I D+TFE TS +GGFDW LQ
Sbjct: 271 CECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTS-----VGGFDWGLQ 325
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWH+IPER+RK +PV +PTMAGGLFSIDK +FEKLGTYD GFDIWGGENLELSF
Sbjct: 326 FNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGENLELSF 385
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K +W M GG I F K Y +G ++
Sbjct: 386 K--------------------IW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLK 422
Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ +K GDFGD++SRK+LR +LGCKSFKWYL
Sbjct: 423 RNSIRLAEVWLDDYKTYYYERINNQLGDFGDISSRKKLREDLGCKSFKWYLDNIYPELFV 482
Query: 211 --------EVSNDW--SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
E+ N + CID KP+ K VG+Y CH QGGNQ+WM+SK GEIRRDE
Sbjct: 483 PGESVAKGEMRNAGGKNRQCIDY--KPSG-GKTVGMYQCHNQGGNQYWMLSKDGEIRRDE 539
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYDY 288
+C+DYAG DV+++PCHG KGNQ + Y++
Sbjct: 540 SCVDYAGSDVMVFPCHGMKGNQEWRYNH 567
>gi|195488108|ref|XP_002092174.1| GE14045 [Drosophila yakuba]
gi|194178275|gb|EDW91886.1| GE14045 [Drosophila yakuba]
Length = 684
Score = 251 bits (642), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 152/360 (42%), Positives = 188/360 (52%), Gaps = 105/360 (29%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARNSS VV P+I I D+T E + S +GGFDWNLQ
Sbjct: 304 CECTEGWLEPLLDRIARNSSTVVCPVIDVINDETLEYHY-----RDSGGVNVGGFDWNLQ 358
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+WH +PERERKRH + AEPV++PTMAGGLFSID+ FF++LGTYDSGFDIWGGENLELSF
Sbjct: 359 FSWHPVPERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSF 418
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 419 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 455
Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ KGD+GDV+ R++LR +L CKSFKWYL
Sbjct: 456 KNSVRLAEVWMDEYSQYYYHRIGNDKGDWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFI 515
Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHK------------------- 241
E+ N G C+D+ K VG YPCH+
Sbjct: 516 PGDSVAHGEIRNLGYGGRTCLDAPAGKKHQKKAVGTYPCHRQGGNQIANMQHGMCLDAKE 575
Query: 242 ---------------QGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
QGGNQ+WM+SK GEIRRD++CLDYAG DV L+ CHG KGNQ++ Y
Sbjct: 576 KSEEETPVSIYECHGQGGNQYWMLSKAGEIRRDDSCLDYAGKDVTLFGCHGGKGNQFWTY 635
>gi|308485401|ref|XP_003104899.1| CRE-GLY-5 protein [Caenorhabditis remanei]
gi|308257220|gb|EFP01173.1| CRE-GLY-5 protein [Caenorhabditis remanei]
Length = 685
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 149/360 (41%), Positives = 191/360 (53%), Gaps = 102/360 (28%)
Query: 7 QKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNW 66
+KRW++PLLD + R+ + VV P+I I D+TFE TS +GGFDW LQFNW
Sbjct: 294 KKRWIEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTS-----VGGFDWGLQFNW 348
Query: 67 HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFN 126
H+IPER+RK A +PV +PTMAGGLFSIDK +FEKLGTYD GFDIWGGENLELSFK
Sbjct: 349 HSIPERDRKNRTRAIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSFKVR 408
Query: 127 WHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWGGEN 179
+ +W M GG I F K Y +G ++ +
Sbjct: 409 ----------------KCIW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLKRNS 449
Query: 180 LELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL------------ 210
+ L+ +K GDFGDV++RK+LR +LGCKSFKWYL
Sbjct: 450 IRLAEVWLDDYKTYYYERINNQLGDFGDVSARKKLRSDLGCKSFKWYLDNIYPELFVPGE 509
Query: 211 -----EVSND-------------------------------------WSGMCIDSACKPT 228
EV N + C+DSA
Sbjct: 510 SVAKGEVRNSAVQPARCLDCMVGRHEKNRPVGTYQCHGQGGNQLRNAQTSQCLDSAVGDE 569
Query: 229 DMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEYDY 288
+K + YPCH+QGGNQ+WM+SK GEIRRDE+C+DYAG DV+++PCHG KGNQ + Y++
Sbjct: 570 VENKAITPYPCHEQGGNQYWMLSKDGEIRRDESCVDYAGTDVMVFPCHGMKGNQEWRYNH 629
>gi|391342054|ref|XP_003745339.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Metaseiulus occidentalis]
Length = 641
Score = 249 bits (635), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 150/329 (45%), Positives = 183/329 (55%), Gaps = 77/329 (23%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLLD +A ++VV P+I I D TFE +P R + Y +GGFDWNLQ
Sbjct: 295 CECSTGWLEPLLDRIAEADTNVVCPVIDVISDSTFE--YPHRR--AGYTVNVGGFDWNLQ 350
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+WH++P+R++ K + V +PTMAGGLFSI KA+FEKLG YDSGFDIWG ENLELSF
Sbjct: 351 FSWHSLPQRDKDARKQSWSAVPSPTMAGGLFSISKAYFEKLGLYDSGFDIWGAENLELSF 410
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFD--- 173
K VW M GG I F K Y G +
Sbjct: 411 K--------------------VW---MCGGRLEIVPCSHVGHVFRKRSPYKWLKGVNVLK 447
Query: 174 --------IWGGENLELSFK------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
+W E + F GD+GD++ R ELRR+L CKSF WY+
Sbjct: 448 KNSVRLAKVWMDEYAQYYFDRIGPDLGDYGDISERVELRRSLNCKSFDWYVKNIYPDLFI 507
Query: 211 --------EVSNDWSGM----CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRR 258
EV N SG C+DSA +H V +YPCH QGGNQ+W+ SK GEIRR
Sbjct: 508 PGDAAASGEVRN--SGFERKWCLDSAAT---VHATVSVYPCHGQGGNQYWLFSKTGEIRR 562
Query: 259 DEACLDYAGGDVILYPCHGSKGNQYFEYD 287
DE CLDY+GGDV+LY CHGSKGNQY+ YD
Sbjct: 563 DELCLDYSGGDVVLYSCHGSKGNQYWRYD 591
>gi|443720284|gb|ELU10082.1| hypothetical protein CAPTEDRAFT_93071, partial [Capitella teleta]
Length = 518
Score = 245 bits (626), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 139/328 (42%), Positives = 182/328 (55%), Gaps = 73/328 (22%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLLD +++N S+VV+P+I I DDT + ++ + TS +GGFDWNLQ
Sbjct: 159 CECTMGWLEPLLDRISQNKSNVVTPVIDVINDDTIQYQYSSAKSTS-----VGGFDWNLQ 213
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWH IP+ E+KR K+ +PV +PTMAGGLFSI + +FE LGTYD G DIWGGENLELSF
Sbjct: 214 FNWHGIPDHEKKRRKSDVDPVRSPTMAGGLFSISREYFEYLGTYDPGMDIWGGENLELSF 273
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
+ +W M GG I F K Y +G ++
Sbjct: 274 R--------------------IW---MCGGSLDIAPCSHVGHIFRKRSPYSWKTGVNVVK 310
Query: 177 GENLELSFK-----------------GDFGDVTSRKELRRNLGCKSFKWYLE-------- 211
++ L+ GD+GDV++RK LR L CKSFKWYL+
Sbjct: 311 KNSIRLAEVWLDEFSKYYYERFNYDLGDYGDVSARKALRERLHCKSFKWYLDNIYPDLFI 370
Query: 212 -----VSNDWSGM--------CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRR 258
S + +G+ C+DSA +K + L+PCH GGNQ+WM+SK GEIRR
Sbjct: 371 PGESLASGEVNGVFNSQSQPACLDSAADKKAYNKAIKLWPCHNMGGNQYWMLSKSGEIRR 430
Query: 259 DEACLDYAGGDVILYPCHGSKGNQYFEY 286
DE C DYAG V++YPCH KGNQ + Y
Sbjct: 431 DEGCFDYAGQFVMIYPCHAMKGNQEWIY 458
>gi|393908333|gb|EFO20718.2| glycosyl transferase [Loa loa]
Length = 622
Score = 244 bits (624), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 146/326 (44%), Positives = 184/326 (56%), Gaps = 72/326 (22%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + W++PLLD + +N VV P+I I D+TFE + T+ +GGFDW+LQ
Sbjct: 271 CECLEGWMEPLLDRIKKNPKTVVCPVIDVIDDNTFEYHYSKAYFTN-----VGGFDWSLQ 325
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHAIPE++RK ++ +PV +PTMAGGLFSID+ FFEKLG+YD G DIWGGENLELSF
Sbjct: 326 FNWHAIPEKDRKGRRDI-DPVKSPTMAGGLFSIDRTFFEKLGSYDPGLDIWGGENLELSF 384
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG+ I F K Y SG ++
Sbjct: 385 K-TW----------------------MCGGILEIVPCSHVGHIFRKRSPYKWLSGVNVLK 421
Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ +K GDFGDV+SRK LR L CKSFKWYL
Sbjct: 422 RNSVRLAEVWMDEYKKYYYERINNNLGDFGDVSSRKALREKLQCKSFKWYLDNVYPELFV 481
Query: 211 --------EVSNDWSGM--CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
E+ N G C+D A GLY CHK+GGNQ+WM+SK GEIRRDE
Sbjct: 482 PGDAIGKGEIRNRGGGSKNCLDWASHGRQRSVNAGLYWCHKKGGNQYWMLSKDGEIRRDE 541
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
+C+DYAG DV++YPCHG KGNQ ++Y
Sbjct: 542 SCIDYAGVDVMVYPCHGMKGNQEWKY 567
>gi|312082212|ref|XP_003143351.1| glycosyl transferase [Loa loa]
Length = 580
Score = 241 bits (614), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 148/350 (42%), Positives = 187/350 (53%), Gaps = 98/350 (28%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + W++PLLD + +N VV P+I I D+TFE + T+ +GGFDW+LQ
Sbjct: 252 CECLEGWMEPLLDRIKKNPKTVVCPVIDVIDDNTFEYHYSKAYFTN-----VGGFDWSLQ 306
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHAIPE++RK ++ +PV +PTMAGGLFSID+ FFEKLG+YD G DIWGGENLELSF
Sbjct: 307 FNWHAIPEKDRKGRRDI-DPVKSPTMAGGLFSIDRTFFEKLGSYDPGLDIWGGENLELSF 365
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG+ I F K Y SG ++
Sbjct: 366 K-TW----------------------MCGGILEIVPCSHVGHIFRKRSPYKWLSGVNVLK 402
Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------------EVS 213
++ L+ +GDFGDV+SRK LR L CKSFKWYL EV+
Sbjct: 403 RNSVRLA-EGDFGDVSSRKALREKLQCKSFKWYLDNVYPELFVPGDAIGKGEIRNKGEVA 461
Query: 214 NDWSGMCIDSACKPTDMHKPV-------------------------------------GL 236
D C+DS D+ K V GL
Sbjct: 462 GDVVQHCLDSEVG-EDIQKVVIAFPCHRNGGNQIRNRGGGSKNCLDWASHGRQRSVNAGL 520
Query: 237 YPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
Y CHK+GGNQ+WM+SK GEIRRDE+C+DYAG DV++YPCHG KGNQ ++Y
Sbjct: 521 YWCHKKGGNQYWMLSKDGEIRRDESCIDYAGVDVMVYPCHGMKGNQEWKY 570
>gi|170572320|ref|XP_001892064.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158602953|gb|EDP39125.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 576
Score = 236 bits (603), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 148/366 (40%), Positives = 188/366 (51%), Gaps = 112/366 (30%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + W++PLLD + RN VV P+I I D+TFE + T+ +GGFDW+LQ
Sbjct: 185 CECLEGWVEPLLDRIKRNPKTVVCPVIDVIDDNTFEYHYSKAYFTN-----VGGFDWSLQ 239
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWHAIPE++RK ++ +PV +PTMAGGLFSID+ FFE+LG+YD G DIWGGENLELSF
Sbjct: 240 FNWHAIPEKDRKGRRDI-DPVKSPTMAGGLFSIDRTFFEELGSYDPGLDIWGGENLELSF 298
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K +W M GG+ I F K Y SG ++
Sbjct: 299 K--------------------IW---MCGGILEIVPCSHVGHIFRKRSPYKWRSGVNVLK 335
Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
++ L+ +K GDFGDV+SRK LR+ L CKSFKWYL
Sbjct: 336 RNSVRLAEVWMDEYKKYYYERINNNLGDFGDVSSRKALRKKLQCKSFKWYLDNVYPELFV 395
Query: 211 --------------EVSNDWSGMCIDS--------------------------------- 223
EV+ D C+DS
Sbjct: 396 PGDAIGKGEIRNKGEVAGDVVQHCLDSEVGEDIQKVVIAYPCHKSGGNQIRNRGGRSKNC 455
Query: 224 ---ACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKG 280
A VGLY CHK+GGNQ+WM+SK GEIRRDE+C+DYAG DV++YPCHG KG
Sbjct: 456 LDWASHGRQRSANVGLYWCHKKGGNQYWMLSKDGEIRRDESCIDYAGADVMVYPCHGMKG 515
Query: 281 NQYFEY 286
NQ ++Y
Sbjct: 516 NQEWKY 521
>gi|339239855|ref|XP_003375853.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
spiralis]
gi|316975462|gb|EFV58902.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
spiralis]
Length = 625
Score = 236 bits (602), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 137/312 (43%), Positives = 175/312 (56%), Gaps = 55/312 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL PLL + N S+VV P+I I DDTF+ +T+ +GGFDWNLQ
Sbjct: 298 CECLEGWLPPLLSRIKENWSNVVCPVIDVIDDDTFKYHCGKSWMTN-----VGGFDWNLQ 352
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWH IPER RK + PV +PTMAGGLFSIDK +F+ LGTYD GFDIWGGENLELSF
Sbjct: 353 FNWHPIPERVRKSRSDPTAPVESPTMAGGLFSIDKQYFQHLGTYDPGFDIWGGENLELSF 412
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K VW M GG I F K Y G ++
Sbjct: 413 K--------------------VW---MCGGKLEIVPCSHVGHIFRKRSPYKWRPGVNVVK 449
Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------------VSNDWSGM---- 219
+ L+ +G+FGDV+ R LR+ L C SF+WY++ + M
Sbjct: 450 RNTVRLA-EGEFGDVSDRIALRQRLNCSSFEWYIKNVYPELFVPGNSIAKGEIRCMGQNK 508
Query: 220 --CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHG 277
C+D A + +KP+ +YPCH +GGNQ+WM+S GEIRRDE+C+DYAG V L CHG
Sbjct: 509 RHCLDFASGRKEHNKPISMYPCHGEGGNQYWMLSPTGEIRRDESCVDYAGQKVFLSGCHG 568
Query: 278 SKGNQYFEYDYK 289
KGNQ ++Y++K
Sbjct: 569 LKGNQEWKYNFK 580
>gi|341889853|gb|EGT45788.1| hypothetical protein CAEBREN_10062 [Caenorhabditis brenneri]
Length = 597
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 135/309 (43%), Positives = 172/309 (55%), Gaps = 70/309 (22%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + W++PLLD + R+ + VV P+I I D+TFE TS +GGFDW LQ
Sbjct: 279 CECMEGWIEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTS-----VGGFDWGLQ 333
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWH+IPER+RK A +PV +PTMAGGLFSIDK +FEKLGTYD GFDIWGGENLELSF
Sbjct: 334 FNWHSIPERDRKNRTRAIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSF 393
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K +W M GG I F K Y +G ++
Sbjct: 394 K--------------------IW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLK 430
Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGM 219
++ L+ +K GDFGDV++RK+LR +LGCKSFKWYL
Sbjct: 431 RNSIRLAEVWLDDYKTYYYERINNQLGDFGDVSARKKLRSDLGCKSFKWYL--------- 481
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSK 279
D P P ++WM+SK GEIRRDE+C+DYAG DV+++PCHG K
Sbjct: 482 ---------DNIYPELFVPGESVAKGEYWMLSKDGEIRRDESCVDYAGSDVMVFPCHGMK 532
Query: 280 GNQYFEYDY 288
GNQ + Y++
Sbjct: 533 GNQEWRYNH 541
>gi|405967231|gb|EKC32417.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
Length = 570
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 135/331 (40%), Positives = 177/331 (53%), Gaps = 61/331 (18%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL+PLLD +A + HVV P + NI DDT E R S+ +G FDW L F W +
Sbjct: 196 WLEPLLDRIAEDKRHVVYPQMPNIKDDTLEFR-----AFSARNIQVGRFDWQLIFRWMEL 250
Query: 70 PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW-- 127
PE K K+ P +PTMAGGLFSI + +F +LGTYD G DIWGGENLELSF+ W
Sbjct: 251 PEYINKTRKSFISPTRSPTMAGGLFSISREYFTELGTYDPGMDIWGGENLELSFRV-WMC 309
Query: 128 -------------HAIPERERKRHKNAAEPVWTPTMAGGLFSID--KAFFEKLGTYDSG- 171
H +R + + V ++ +D K ++ + YD G
Sbjct: 310 GGTLEIIPCSHVGHIFRKRSPYKWRTGVNVVKKNSIRLAEVWMDEYKNYYYERFNYDLGD 369
Query: 172 ------------------FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL--- 210
FD W +N+ GD+GDVT RK+LR L C SF W++
Sbjct: 370 YGDVTDRKKLRERLQCHSFD-WFVKNVYPDLFGDYGDVTDRKKLRERLQCHSFDWFVKNV 428
Query: 211 --------------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI 256
E+ + MCIDSA + HKPV ++PCH QGGNQ+WM+SK+GEI
Sbjct: 429 YPDLFVPGEAIASGEIRSKAKPMCIDSAVDNHNYHKPVNMWPCHNQGGNQYWMLSKNGEI 488
Query: 257 RRDEACLDYAGGD-VILYPCHGSKGNQYFEY 286
RRD+ CLDY+GG+ VI+YPCHG KGNQ ++Y
Sbjct: 489 RRDDGCLDYSGGESVIVYPCHGQKGNQEWQY 519
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 64/209 (30%), Positives = 94/209 (44%), Gaps = 40/209 (19%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL+PLLD +A + VV P+I NI +T + S ++ +GGFDW+L F W +
Sbjct: 104 WLEPLLDRIATDRKKVVCPVIDNILAETLYFQ-------SLNQYSVGGFDWSLVFRWKSA 156
Query: 70 PERERKRHKNAAEPVWTPTMAGGLFSID-------------KAFFEKLGTYDSG------ 110
R + + +A + D + +++
Sbjct: 157 KPHNRYYNSQNKTSLRAIRLARTIARSDSGGKAARGNPGWLEPLLDRIAEDKRHVVYPQM 216
Query: 111 ---------FDIWGGENLEL-----SFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFS 156
F + N+++ F W +PE K K+ P +PTMAGGLFS
Sbjct: 217 PNIKDDTLEFRAFSARNIQVGRFDWQLIFRWMELPEYINKTRKSFISPTRSPTMAGGLFS 276
Query: 157 IDKAFFEKLGTYDSGFDIWGGENLELSFK 185
I + +F +LGTYD G DIWGGENLELSF+
Sbjct: 277 ISREYFTELGTYDPGMDIWGGENLELSFR 305
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 47/82 (57%), Gaps = 7/82 (8%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELR-FPPGRLTSSYKFFIGGFDWNL 62
C++ WL+PLL +A + HVV+P+I NI DDT + F P ++ +G FDW+L
Sbjct: 33 CKLCIGWLEPLLGRIAEDKRHVVAPVIGNINDDTLQFAWFNPDQI------HVGKFDWDL 86
Query: 63 QFNWHAIPERERKRHKNAAEPV 84
FNW IP + + + EP+
Sbjct: 87 TFNWMPIPSYVKDKMNSWLEPL 108
>gi|312083982|ref|XP_003144087.1| polypeptide N-acetylgalactosaminyltransferase 5 [Loa loa]
gi|307760750|gb|EFO19984.1| polypeptide N-acetylgalactosaminyltransferase 5 [Loa loa]
Length = 682
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 133/329 (40%), Positives = 179/329 (54%), Gaps = 71/329 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE RWL+PLLD +A+NS++VV+P+I DT L L+S + +GGF+W L
Sbjct: 326 CECMNRWLEPLLDRIAQNSTNVVTPVI-----DTINLETLQYHLSSHRRLSVGGFNWGLV 380
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNWH +P+R+ + K+ +P+ +PTMAGGLFSID+ +FEKLG YD GFDIWG ENLE+SF
Sbjct: 381 FNWHILPDRDYQAMKSRIDPIPSPTMAGGLFSIDRGYFEKLGGYDPGFDIWGSENLEISF 440
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K +W M GG + F K Y G ++
Sbjct: 441 K--------------------IW---MCGGRLEVVPCSHVGHIFRKKSPYKWRKGINVLQ 477
Query: 177 GENLELS------FKG-----------DFGDVTSRKELRRNLGCKSFKWYLE-------- 211
N+ L+ +K DFGDV+ RK+LR +L C SFKWYL+
Sbjct: 478 RNNIRLAEVWLDDYKEIYYNRINHKLVDFGDVSERKKLREHLKCHSFKWYLDNVFPDLFL 537
Query: 212 -----VSNDWSGM-----CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA 261
S + + C+D ++ V YPCH QGGNQFWM+SK GEIRRDE
Sbjct: 538 PSEAIASGEIRNLGNQKYCVDHDVGRNAVNDSVIPYPCHLQGGNQFWMLSKSGEIRRDEY 597
Query: 262 CLDYAG-GDVILYPCHGSKGNQYFEYDYK 289
C+DY G G + Y CHGSKGNQ ++Y+++
Sbjct: 598 CIDYTGRGSPVTYECHGSKGNQLWDYNHE 626
>gi|194882445|ref|XP_001975321.1| GG22251 [Drosophila erecta]
gi|190658508|gb|EDV55721.1| GG22251 [Drosophila erecta]
Length = 721
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 148/399 (37%), Positives = 189/399 (47%), Gaps = 145/399 (36%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARNS+ VV P+I I D+T E + S +GGFDWNLQ
Sbjct: 303 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHY-----RDSGGVNVGGFDWNLQ 357
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+WH +PERERKRH + AEPV++PTMAGGLFSID+ FF++LGTYDSGFDIWGGENLELSF
Sbjct: 358 FSWHPVPERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSF 417
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
K W M GG I F K Y SG ++
Sbjct: 418 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 454
Query: 177 GENLELSF-----------------KGDFGDVTSRKELR--------------------- 198
++ L+ KGD+GDV+ R++LR
Sbjct: 455 KNSVRLAEVWMDEYSQYYYHRIGNDKGDWGDVSDRRKLRTDLKCKSFKWYLDNIYPELFI 514
Query: 199 ----------RNLG-----C-----------KSFKWYL-------EVSNDWSGMCIDSAC 225
RNLG C K+ Y +++N GMC+D+
Sbjct: 515 PGDSVAHGEIRNLGYGGRTCLDAPAGKKHQKKAVGTYPCHRQGGNQIANVPKGMCLDAKE 574
Query: 226 KPTDMHKPVGLYPCHKQGGNQ--------------------------------------F 247
K ++ PV +Y CH QGGNQ +
Sbjct: 575 K-SEEETPVSVYECHGQGGNQVSASMSTSSELRKAGGGDSESLIPGFSISLLYGFIFQSY 633
Query: 248 WMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
WM+SK GEIRRD++CLDYAG DV L+ CHG KGNQ++ Y
Sbjct: 634 WMLSKAGEIRRDDSCLDYAGKDVTLFGCHGGKGNQFWTY 672
>gi|443704818|gb|ELU01679.1| hypothetical protein CAPTEDRAFT_140956 [Capitella teleta]
Length = 550
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 126/325 (38%), Positives = 176/325 (54%), Gaps = 60/325 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+P+LD + ++ SHVV+P+I I D T F P S F +GGFDW +
Sbjct: 194 CECTPGWLEPMLDRIGQDWSHVVTPIIDVIDDKTLMYNFNP----LSRGFSVGGFDWAMG 249
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WHA+P E++R K ++P +PTMAGGLF+ID+ +F +G+YD G +IWGGENLE+SF
Sbjct: 250 FTWHALPNHEKERRKKISDPARSPTMAGGLFAIDREYFYHIGSYDPGMEIWGGENLEMSF 309
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ +P RKR+ N S F + + + +
Sbjct: 310 RIWMCGGTLETLPCSHVGHIFRKRNPN--------------HSAKHGNFVQRNSVRTA-E 354
Query: 174 IWGGENLELSFK------GDFGDVTSRKELRRNLGCKSFKWYLE-------VSND----- 215
+W E L + GDFGDV+ R+ LR L CKSFKWYL+ V +D
Sbjct: 355 VWMDEYKYLYYDRIGNHIGDFGDVSDRRALREELKCKSFKWYLDTIYPTLFVPSDAEASG 414
Query: 216 ----------WSGMCIDSA---CKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEAC 262
S +C+DSA + + K V +PCH QGGNQ WM+S++GEIR+D+ C
Sbjct: 415 EVRCKAHFPKVSQVCLDSADIDPETSANGKEVQTWPCHGQGGNQMWMLSQNGEIRKDKGC 474
Query: 263 LDYAGGDVILYPCHGSKGNQYFEYD 287
LDY G + +YPCH SKG Q ++Y+
Sbjct: 475 LDYNDGKLRIYPCHSSKGPQDWKYN 499
>gi|358332242|dbj|GAA50924.1| polypeptide N-acetylgalactosaminyltransferase [Clonorchis sinensis]
Length = 403
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 126/313 (40%), Positives = 171/313 (54%), Gaps = 47/313 (15%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
E K WL+PLLD + + ++VV P+I I D T L++ R S +GGFDW+L F
Sbjct: 61 ECTKGWLEPLLDRIRESETNVVVPIIEVISDKT--LQYNNARAESVQ---VGGFDWSLIF 115
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
+WH+ P+R+++R P+ TPTMAGGLF+I +AFF++LG YD G ++WGGENLELSFK
Sbjct: 116 HWHSPPKRDKERPGAPYSPLRTPTMAGGLFAISRAFFKRLGYYDEGMEVWGGENLELSFK 175
Query: 125 FNWHAIPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
W + E R R E +T + + +A LG + +
Sbjct: 176 V-WMCGGQLETIICSHIGHIFRSRSPYKWESKFTSPLRRNTARLAEAV---LGPFAKFYH 231
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDW 216
G S K DFGDV+ RK + L C SF WYL ++ ++
Sbjct: 232 SQSG-----SRKIDFGDVSERKAILERLKCHSFDWYLKNVYPEFFVPTDSVAHGDIESEA 286
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG---GDVILY 273
CIDS K D VG++PCH++GGNQ+W+MSK GEIRRD C D AG G V L+
Sbjct: 287 GPHCIDSPLK-GDGKVIVGMWPCHREGGNQYWLMSKLGEIRRDNKCWD-AGIEVGRVALF 344
Query: 274 PCHGSKGNQYFEY 286
CHG +GNQ+F Y
Sbjct: 345 DCHGVRGNQHFVY 357
>gi|198415713|ref|XP_002128877.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
1 [Ciona intestinalis]
Length = 573
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 117/312 (37%), Positives = 169/312 (54%), Gaps = 42/312 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A++ + VV P+I I D+TFE GGF+W L
Sbjct: 225 CECTEGWLEPLLSEIAKDRTTVVCPIIDVISDETFEFMV-------GSDMTYGGFNWKLN 277
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV +PTMAGGLFSIDK++FE+LGTYD+G DIWGGENLE+S
Sbjct: 278 FRWYPVPQREMDRRKGDRTLPVRSPTMAGGLFSIDKSYFEELGTYDAGMDIWGGENLEIS 337
Query: 123 FKFNWHA-----IPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDI 174
F+ W I H A P P G + + + + + ++ + F I
Sbjct: 338 FRI-WQCGGTLLIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLAEVWMDSFKNFFYI 396
Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWS 217
L K ++GD++ R LR L CKSFKWYL E+ N+
Sbjct: 397 ITPGVL----KQEYGDISERVRLREKLQCKSFKWYLENIYPDSQIPGEYYSLGEIRNEEG 452
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPC 275
G+C+D+ + + VG++ CH+ GGNQ W + + E+R D+ CLD + GG +++ C
Sbjct: 453 GLCLDTMGRKEN--DKVGIFNCHEMGGNQVWAYTGNQELRCDDICLDASKVGGPIMMVKC 510
Query: 276 HGSKGNQYFEYD 287
H +GNQ +EYD
Sbjct: 511 HHMRGNQLWEYD 522
>gi|256071383|ref|XP_002572020.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
Length = 697
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 122/311 (39%), Positives = 159/311 (51%), Gaps = 50/311 (16%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL+PLLD +A NSS VV P+I+ I D T ++ F + +GGFDW+L F WH
Sbjct: 351 WLEPLLDRIAYNSSIVVVPVISTINDKTLKMNF-----LKADNVQVGGFDWSLTFRWHEQ 405
Query: 70 PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA 129
ER+R R PV +PTMAGGLF+I + +F LG YDSG +IWGGENLELSFK W
Sbjct: 406 TERDRNRSGAPYSPVRSPTMAGGLFAISREYFSHLGKYDSGMEIWGGENLELSFKV-WMC 464
Query: 130 IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD-------SGFDIWGGE---- 178
++ G +F + + D D+W +
Sbjct: 465 ----------GGILETVVCSLVGHIFRGRSPYKWNVNVKDPLKRNLLRLADVWLDDYKRF 514
Query: 179 -NLELSFKG-DFGDVTSRKELRRNLGCKSFKWYLE--------VSNDWSGMCIDSACKPT 228
+ FK DFGDV+ RK LR L C+SF WYL S + I+SA P
Sbjct: 515 YYARIGFKTIDFGDVSERKALREKLKCRSFDWYLTNIYPELFIPSKALASGDIESAAGPH 574
Query: 229 DMHKP-----------VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD--VILYPC 275
+ P + ++PCHKQGGNQFW++S + EIRRDE C D + + LY C
Sbjct: 575 CLDSPTPRNGDKKRTVIKIWPCHKQGGNQFWLLSPNNEIRRDEYCFDSGMKNHTIGLYRC 634
Query: 276 HGSKGNQYFEY 286
HG+KGNQ F Y
Sbjct: 635 HGAKGNQKFTY 645
>gi|350645519|emb|CCD59759.1| n-acetylgalactosaminyltransferase, putative [Schistosoma mansoni]
Length = 654
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 122/311 (39%), Positives = 159/311 (51%), Gaps = 50/311 (16%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL+PLLD +A NSS VV P+I+ I D T ++ F + +GGFDW+L F WH
Sbjct: 351 WLEPLLDRIAYNSSIVVVPVISTINDKTLKMNF-----LKADNVQVGGFDWSLTFRWHEQ 405
Query: 70 PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA 129
ER+R R PV +PTMAGGLF+I + +F LG YDSG +IWGGENLELSFK W
Sbjct: 406 TERDRNRSGAPYSPVRSPTMAGGLFAISREYFSHLGKYDSGMEIWGGENLELSFKV-WMC 464
Query: 130 IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD-------SGFDIWGGE---- 178
++ G +F + + D D+W +
Sbjct: 465 ----------GGILETVVCSLVGHIFRGRSPYKWNVNVKDPLKRNLLRLADVWLDDYKRF 514
Query: 179 -NLELSFKG-DFGDVTSRKELRRNLGCKSFKWYLE--------VSNDWSGMCIDSACKPT 228
+ FK DFGDV+ RK LR L C+SF WYL S + I+SA P
Sbjct: 515 YYARIGFKTIDFGDVSERKALREKLKCRSFDWYLTNIYPELFIPSKALASGDIESAAGPH 574
Query: 229 DMHKP-----------VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD--VILYPC 275
+ P + ++PCHKQGGNQFW++S + EIRRDE C D + + LY C
Sbjct: 575 CLDSPTPRNGDKKRTVIKIWPCHKQGGNQFWLLSPNNEIRRDEYCFDSGMKNHTIGLYRC 634
Query: 276 HGSKGNQYFEY 286
HG+KGNQ F Y
Sbjct: 635 HGAKGNQKFTY 645
>gi|344268426|ref|XP_003406061.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Loxodonta africana]
Length = 560
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 119/315 (37%), Positives = 164/315 (52%), Gaps = 44/315 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKDDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD---IWGGEN 179
F+ +++ E E K V + A + ++ + D +W G N
Sbjct: 324 FRT--YSLMELESK--NTVPYSVMSCHEAHAVVYVNSRALTHVINKKQQEDWQEVWDGMN 379
Query: 180 LELSF--------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSN 214
L+ F K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 380 LKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRN 439
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVIL 272
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 440 VETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIM 497
Query: 273 YPCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 498 LKCHHMRGNQLWEYD 512
>gi|260788889|ref|XP_002589481.1| hypothetical protein BRAFLDRAFT_125191 [Branchiostoma floridae]
gi|229274659|gb|EEN45492.1| hypothetical protein BRAFLDRAFT_125191 [Branchiostoma floridae]
Length = 488
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 125/316 (39%), Positives = 167/316 (52%), Gaps = 48/316 (15%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
E + W +PLL +A + + VV P+I I DDTFE + GGF+W L F
Sbjct: 139 ECTEGWAEPLLTRIAEDRTTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLNF 191
Query: 65 NWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
W+ +P+RE +R + P+ TPTMAGGLF+IDK++FE++GTYDSG DIWGGENLE+SF
Sbjct: 192 RWYPVPQREMDRRGGDRTMPLRTPTMAGGLFAIDKSYFEEIGTYDSGMDIWGGENLEISF 251
Query: 124 KFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENL 180
+ W E H TP T GG I +L ++W +N
Sbjct: 252 RI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVW-MDNF 303
Query: 181 ELSF--------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ F K D+GDVT RKELR L CK FKWYL E+ N
Sbjct: 304 KDFFYIISPGVTKVDYGDVTGRKELRDKLNCKPFKWYLENIYPDSQIPTSYHSLGEIRNV 363
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
S CID+ + + + VG++ CH GGNQ + +K E+R D+ CLD + GG V+L+
Sbjct: 364 DSNQCIDNMARKEN--EKVGIFSCHGMGGNQVFSYTKEKELRTDDLCLDVSKPGGPVMLF 421
Query: 274 PCHGSKGNQYFEYDYK 289
CH GNQ +EYD K
Sbjct: 422 KCHHLGGNQLWEYDEK 437
>gi|390336582|ref|XP_001187912.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 490
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 122/317 (38%), Positives = 163/317 (51%), Gaps = 50/317 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL+P+L +A + + V P+I I DDTF+ + +GGF W+L
Sbjct: 152 CEVTEGWLEPMLARIAEDRTTSVCPVIDVISDDTFQYQH-------GNDPQMGGFGWSLF 204
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W +P+RE+ R K + EPV TMAGGLF+IDK++FE+LG YD GF+IWGGENLELS
Sbjct: 205 FKWFPVPKREQIRRKGDPTEPVRVSTMAGGLFAIDKSYFEELGQYDPGFNIWGGENLELS 264
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
FK IP P P + +K E +W
Sbjct: 265 FKLWMCGGKLEFIPCSHVGHVFRKKSPYHFPPGTNYVNKNNKRLAE----------VWLD 314
Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVS 213
E + K D GD++ R LR++L CKSFKWYL EV
Sbjct: 315 EYKNFYYRISPSVAKTDPGDISDRLNLRKSLSCKSFKWYLENIYPESSWPVNYQFMGEVR 374
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA-GGDVIL 272
N + +C+D+ K + VGLY CH QGGNQ W +K+ E+R D+ CLD A GG V++
Sbjct: 375 NTEAHVCLDTMMK--EAGNKVGLYGCHGQGGNQIWAFTKNNELRHDDLCLDVARGGPVMM 432
Query: 273 YPCHGSKGNQYFEYDYK 289
CH GNQ++ YD K
Sbjct: 433 LSCHMQGGNQHWNYDEK 449
>gi|405966388|gb|EKC31681.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
Length = 815
Score = 189 bits (481), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 89/158 (56%), Positives = 112/158 (70%), Gaps = 18/158 (11%)
Query: 147 TPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSF 206
+PTMA GLFSI + +F +LGTYD G DIWGGENLELSF+GD+G VT RK+LR L C SF
Sbjct: 607 SPTMARGLFSISREYFTELGTYDPGIDIWGGENLELSFRGDYGHVTDRKKLRERLQCHSF 666
Query: 207 KWYL-----------------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWM 249
W++ E+ + MCIDSA + HKPV ++PCH QGGNQ+WM
Sbjct: 667 DWFVKNVYPDLFVPGEAIASGEIRSKAKPMCIDSAVDNHNYHKPVNMWPCHNQGGNQYWM 726
Query: 250 MSKHGEIRRDEACLDYAGGD-VILYPCHGSKGNQYFEY 286
+SK+GEIRRD+ CLDY+GG+ VI+YPCHG KGNQ ++Y
Sbjct: 727 LSKNGEIRRDDGCLDYSGGESVIVYPCHGQKGNQEWQY 764
>gi|313227425|emb|CBY22572.1| unnamed protein product [Oikopleura dioica]
Length = 588
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 123/313 (39%), Positives = 165/313 (52%), Gaps = 42/313 (13%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
E WL+PLL + ++ ++V+ P+I I DDTFE LT S GGF+W L F
Sbjct: 241 EASPGWLEPLLYEIKKDRTNVICPIIDVISDDTFEF------LTGS-DLTYGGFNWKLNF 293
Query: 65 NWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
W+ +P+RE +R + + P+ TPTMAGGLFSIDK++F ++G+YDSG DIWGGENLE+SF
Sbjct: 294 RWYPVPQREVDRRGGDRSLPMQTPTMAGGLFSIDKSYFYEIGSYDSGMDIWGGENLEMSF 353
Query: 124 KFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGT-----YDSGFDIW 175
+ W H TP T GG I +L Y F I
Sbjct: 354 RI-WMCGGTVLIATCSHVGHVFRKATPYTFPGGTSQIINKNNRRLAEVWMDDYKKFFYIV 412
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSG 218
K +GDV+ RK LR +L CKSF+WYL E+ N +
Sbjct: 413 N----PTVMKHKYGDVSDRKTLRNDLQCKSFQWYLDNVYPDAQIPRRYKVLGEIKNTGAN 468
Query: 219 MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCH 276
+C+D+ + + K VG Y CH QGGNQ + + EIR D+ CLD A G V++ CH
Sbjct: 469 ICLDTMGRKEN--KKVGCYSCHGQGGNQVFSFTMDNEIRIDDLCLDVANSKGPVMMVKCH 526
Query: 277 GSKGNQYFEYDYK 289
KGNQY+EY+ K
Sbjct: 527 HQKGNQYWEYNIK 539
>gi|405959954|gb|EKC25926.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
Length = 569
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 112/304 (36%), Positives = 166/304 (54%), Gaps = 29/304 (9%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
E + W +PL+D +ARN S V++P+I I +TF+ F T+ +GGFDW+L F
Sbjct: 220 ECAEGWFEPLIDPIARNWSTVMTPVIDVIDKETFQYGFQAASATN-----VGGFDWSLMF 274
Query: 65 NWHAIPERERKRHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
WH +PE E+KR +N PV +PTMAGGLF+I + +FE +GTYD G DIWGGENLELSF
Sbjct: 275 TWHFVPETEQKRRQNKHYLPVRSPTMAGGLFAISRKYFEHIGTYDEGMDIWGGENLELSF 334
Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-TYDSGFDIWGGENL 180
+ W H P G ++ K ++ + F + +++
Sbjct: 335 RI-WMCGGTLLTAPCSHVGHVFRHTPPYSFGPKKNVVKNNLVRMAEVWLDDFKYYYYQHI 393
Query: 181 ELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDS 223
+ G++GDV++R+ LR NL C SF WYL E+ + +C++S
Sbjct: 394 NYTL-GNYGDVSARRALRANLQCHSFDWYLVNVYPELLIPAEALYSGEIRSKAEPLCLES 452
Query: 224 ACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD-EACLDYAGGDVILYPCHGSKGNQ 282
+ ++KP+ ++ CH Q GNQ+W+ ++ GEIR D C+D AG V + CHG GNQ
Sbjct: 453 PYRFGKINKPLTVFHCHGQKGNQYWLYTQKGEIRHDLYGCMDDAGSTVYVNSCHGLGGNQ 512
Query: 283 YFEY 286
+ Y
Sbjct: 513 KWTY 516
>gi|348526962|ref|XP_003450988.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Oreochromis niloticus]
Length = 557
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 161/314 (51%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + ++ VV P+I I DDTFE + GGF+W L
Sbjct: 210 CECTTGWLEPLLARIKQDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 262
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 263 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 322
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 323 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 375
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD+TSR LR+ L CK F WYL E+ N
Sbjct: 376 KNFFYIISPGVTKVDYGDITSRTALRQKLQCKPFSWYLENIYPDSQIPRHYYSLGEIRNV 435
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V++
Sbjct: 436 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVMML 493
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 494 KCHHLKGNQLWEYD 507
>gi|112418488|gb|AAI21876.1| galnt13 protein [Xenopus (Silurana) tropicalis]
Length = 483
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 120/317 (37%), Positives = 161/317 (50%), Gaps = 46/317 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 138 CECTIGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 190
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSIDK +FE+LGTYDSG DIWGGENLE+S
Sbjct: 191 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEELGTYDSGMDIWGGENLEMS 250
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W +
Sbjct: 251 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDDF 303
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL C F WYL E+ N
Sbjct: 304 KDFFYIISPGVVKVDYGDVSERKALRENLKCNPFSWYLETVYPDSQIPRRYFSLGEIRNV 363
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 364 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 421
Query: 274 PCHGSKGNQYFEYDYKY 290
CH +GNQ +EYD ++
Sbjct: 422 KCHHMRGNQLWEYDAEH 438
>gi|26337335|dbj|BAC32353.1| unnamed protein product [Mus musculus]
Length = 556
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFKCHGMGGNQVFSYTADKEIRTDDLCLDVSRLSGPVIML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|62859717|ref|NP_001017277.1| polypeptide N-acetylgalactosaminyltransferase 13 [Xenopus
(Silurana) tropicalis]
gi|89267464|emb|CAJ81616.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13)
[Xenopus (Silurana) tropicalis]
Length = 498
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 120/314 (38%), Positives = 159/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 138 CECTIGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 190
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSIDK +FE+LGTYDSG DIWGGENLE+S
Sbjct: 191 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEELGTYDSGMDIWGGENLEMS 250
Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W +
Sbjct: 251 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDDF 303
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL C F WYL E+ N
Sbjct: 304 KDFFYIISPGVVKVDYGDVSERKALRENLKCNPFSWYLETVYPDSQIPRRYFSLGEIRNV 363
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 364 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 421
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 422 KCHHMRGNQLWEYD 435
>gi|327281385|ref|XP_003225429.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 2 [Anolis carolinensis]
Length = 557
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 120/314 (38%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTLGWLEPLLARIKEDRKIVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 324
Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 325 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDVT RK LR NL CK F WYL E+ N
Sbjct: 378 KDFFYIISPGVVKVDYGDVTVRKALRDNLKCKPFSWYLENVYPDSQIPRRYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 438 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 495
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 496 KCHHMRGNQLWEYD 509
>gi|326670471|ref|XP_002663357.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Danio rerio]
Length = 556
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PL+ + + VV P+I I D+TFE + GGF+W L
Sbjct: 211 CECTTGWLEPLMARIKEDRRAVVCPIIDVISDETFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYDSG DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRTYFEEIGTYDSGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP + GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYSFPGGTGQVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + + D+GDV+SRK LR +L CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVRVDYGDVSSRKALRESLKCKPFSWYLENVYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG + CH GGNQ + + EIR D+ CLD + G V++
Sbjct: 437 ETNQCVDNMGRKEN--EKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDASRLNGPVVML 494
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ FEYD
Sbjct: 495 KCHHMKGNQMFEYD 508
>gi|327281383|ref|XP_003225428.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 1 [Anolis carolinensis]
Length = 556
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 120/314 (38%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKEDRKIVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDVT RK LR NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVTVRKALRDNLKCKPFSWYLENVYPDSQIPRRYFSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|431894826|gb|ELK04619.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Pteropus alecto]
Length = 519
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 174 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 226
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 227 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 286
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 287 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 339
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 340 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 399
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 400 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 457
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 458 KCHHMRGNQLWEYD 471
>gi|40018588|ref|NP_954537.1| polypeptide N-acetylgalactosaminyltransferase 13 [Rattus
norvegicus]
gi|51315705|sp|Q6UE39.1|GLT13_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
AltName: Full=Polypeptide GalNAc transferase 13;
Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 13;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 13
gi|34577141|gb|AAQ75749.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Rattus norvegicus]
gi|149047803|gb|EDM00419.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a
[Rattus norvegicus]
gi|149047804|gb|EDM00420.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a
[Rattus norvegicus]
gi|149047805|gb|EDM00421.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a
[Rattus norvegicus]
Length = 556
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLSGPVIML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|281347645|gb|EFB23229.1| hypothetical protein PANDA_007284 [Ailuropoda melanoleuca]
Length = 516
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 166 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 218
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 219 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 278
Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 279 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 331
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 332 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 391
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 392 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLSGPVIML 449
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 450 KCHHMRGNQLWEYD 463
>gi|403258987|ref|XP_003922020.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Saimiri boliviensis boliviensis]
Length = 556
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLQCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|332251760|ref|XP_003275017.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Nomascus leucogenys]
Length = 556
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|149639572|ref|XP_001511824.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Ornithorhynchus anatinus]
Length = 556
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 161/314 (51%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTFGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR+NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKALRQNLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|116003987|ref|NP_001070354.1| polypeptide N-acetylgalactosaminyltransferase 13 [Bos taurus]
gi|115304963|gb|AAI23663.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Bos
taurus]
gi|296490573|tpg|DAA32686.1| TPA: polypeptide N-acetylgalactosaminyltransferase 13 [Bos taurus]
Length = 556
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|296204781|ref|XP_002749478.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Callithrix jacchus]
Length = 556
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRTYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKILRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|76677928|ref|NP_766618.2| polypeptide N-acetylgalactosaminyltransferase 13 [Mus musculus]
gi|51315989|sp|Q8CF93.1|GLT13_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
AltName: Full=Polypeptide GalNAc transferase 13;
Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 13;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 13
gi|27531011|dbj|BAC54546.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Mus musculus]
gi|124297181|gb|AAI31652.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Mus musculus]
gi|124297498|gb|AAI31653.1| Galnt13 protein [Mus musculus]
gi|148694972|gb|EDL26919.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
musculus]
gi|148694973|gb|EDL26920.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
musculus]
gi|148694975|gb|EDL26922.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
musculus]
Length = 556
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLSGPVIML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|291391573|ref|XP_002712184.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Oryctolagus cuniculus]
Length = 557
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 324
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 325 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 378 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 438 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 495
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 496 KCHHMRGNQLWEYD 509
>gi|15620895|dbj|BAB67811.1| KIAA1918 protein [Homo sapiens]
Length = 516
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 171 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 223
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 224 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 283
Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 284 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 336
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 337 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 396
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 397 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 454
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 455 KCHHMRGNQLWEYD 468
>gi|27530993|dbj|BAC54545.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Homo sapiens]
gi|193785960|dbj|BAG54747.1| unnamed protein product [Homo sapiens]
Length = 556
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|74004307|ref|XP_855648.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
3 [Canis lupus familiaris]
Length = 556
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLSGPVIML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|145309313|ref|NP_443149.2| polypeptide N-acetylgalactosaminyltransferase 13 [Homo sapiens]
gi|114581261|ref|XP_515839.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Pan troglodytes]
gi|297668636|ref|XP_002812536.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Pongo abelii]
gi|297668638|ref|XP_002812537.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Pongo abelii]
gi|397525640|ref|XP_003832767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Pan
paniscus]
gi|116242497|sp|Q8IUC8.2|GLT13_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
AltName: Full=Polypeptide GalNAc transferase 13;
Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 13;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 13
gi|51490969|emb|CAD44533.2| polypeptide N-acetylgalactosaminyltransferase 13 [Homo sapiens]
gi|71680339|gb|AAI01032.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
sapiens]
gi|71681791|gb|AAI01034.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
sapiens]
gi|115528820|gb|AAI01035.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
sapiens]
gi|119631869|gb|EAX11464.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
isoform CRA_a [Homo sapiens]
gi|119631870|gb|EAX11465.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
isoform CRA_a [Homo sapiens]
gi|380783281|gb|AFE63516.1| polypeptide N-acetylgalactosaminyltransferase 13 [Macaca mulatta]
Length = 556
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|301766697|ref|XP_002918769.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 1 [Ailuropoda melanoleuca]
Length = 556
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLSGPVIML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|426221079|ref|XP_004004739.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Ovis
aries]
Length = 556
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|327281387|ref|XP_003225430.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 3 [Anolis carolinensis]
Length = 498
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 120/314 (38%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 138 CECTLGWLEPLLARIKEDRKIVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 190
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 191 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 250
Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 251 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 303
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDVT RK LR NL CK F WYL E+ N
Sbjct: 304 KDFFYIISPGVVKVDYGDVTVRKALRDNLKCKPFSWYLENVYPDSQIPRRYFSLGEIRNV 363
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 364 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 421
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 422 KCHHMRGNQLWEYD 435
>gi|115528959|gb|AAI01033.1| GALNT13 protein [Homo sapiens]
gi|355564904|gb|EHH21393.1| hypothetical protein EGK_04446 [Macaca mulatta]
Length = 561
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|332251762|ref|XP_003275018.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Nomascus leucogenys]
Length = 557
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 324
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 325 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 378 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 438 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 495
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 496 KCHHMRGNQLWEYD 509
>gi|390464496|ref|XP_003733230.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Callithrix jacchus]
Length = 561
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRTYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKILRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|354486376|ref|XP_003505357.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Cricetulus griseus]
Length = 497
Score = 187 bits (475), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 152 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 204
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 205 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 264
Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 265 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 317
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 318 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 377
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 378 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 435
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 436 KCHHMRGNQLWEYD 449
>gi|301766699|ref|XP_002918770.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 2 [Ailuropoda melanoleuca]
Length = 557
Score = 187 bits (475), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 324
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 325 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 378 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 438 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLSGPVIML 495
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 496 KCHHMRGNQLWEYD 509
>gi|402888363|ref|XP_003907534.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13,
partial [Papio anubis]
Length = 444
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 145 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 197
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 198 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 257
Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 258 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 310
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 311 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 370
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 371 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 428
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 429 KCHHMRGNQLWEYD 442
>gi|297264099|ref|XP_002798960.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Macaca mulatta]
Length = 375
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 30 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 82
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 83 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 142
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 143 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 195
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 196 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 255
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 256 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 313
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 314 KCHHMRGNQLWEYD 327
>gi|410968681|ref|XP_003990830.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Felis
catus]
Length = 546
Score = 187 bits (474), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 201 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 253
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 254 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 313
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 314 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 366
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 367 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 426
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 427 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLSGPVIML 484
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 485 KCHHMRGNQLWEYD 498
>gi|387017208|gb|AFJ50722.1| Polypeptide N-acetylgalactosaminyltransferase 13-like [Crotalus
adamanteus]
Length = 556
Score = 187 bits (474), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTTGWLEPLLARIKEDRKIVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKALRENLKCKPFSWYLEYVYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ C+D + G VI+
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCMDVSRLNGPVIML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|148223895|ref|NP_001086128.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13)
[Xenopus laevis]
gi|49258003|gb|AAH74234.1| MGC83963 protein [Xenopus laevis]
Length = 556
Score = 187 bits (474), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 120/317 (37%), Positives = 161/317 (50%), Gaps = 46/317 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTFGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSIDK +FE+LGTYDSG DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKKYFEELGTYDSGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W +
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDDF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL C F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSERKALRENLKCNPFSWYLETVYPDSQIPRRYFSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494
Query: 274 PCHGSKGNQYFEYDYKY 290
CH +GNQ +EYD ++
Sbjct: 495 KCHHMRGNQLWEYDAEH 511
>gi|432932497|ref|XP_004081768.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 3 [Oryzias latipes]
Length = 558
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + + + VV P+I I D+TFE + GGF+W L
Sbjct: 213 CECTEGWLEPLLARIKEDRTAVVCPIIDVISDETFEY-------MAGSDMTYGGFNWKLN 265
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSIDK +FE++G+YD G DIWGGENLE+S
Sbjct: 266 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKMYFEEIGSYDPGMDIWGGENLEMS 325
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP + GG + +L ++W E
Sbjct: 326 FRI-WQCGGSLEIVTCSHVGHVFRKATPYSFPGGTGQVINKNNRRLA------EVWMDEF 378
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + + D+GDV+SRK LR L CK F WYL E+ N
Sbjct: 379 KDFFYIISPGVMRVDYGDVSSRKALREALKCKPFAWYLENIYPDSQIPRRYYSLGEIRNV 438
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG + CH GGNQ + + EIR D+ CLD + G V++
Sbjct: 439 ETNQCVDNMGRKEN--EKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVVML 496
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ FEYD
Sbjct: 497 KCHHMKGNQMFEYD 510
>gi|432932493|ref|XP_004081766.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 1 [Oryzias latipes]
Length = 557
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + + + VV P+I I D+TFE + GGF+W L
Sbjct: 212 CECTEGWLEPLLARIKEDRTAVVCPIIDVISDETFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSIDK +FE++G+YD G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKMYFEEIGSYDPGMDIWGGENLEMS 324
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP + GG + +L ++W E
Sbjct: 325 FRI-WQCGGSLEIVTCSHVGHVFRKATPYSFPGGTGQVINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + + D+GDV+SRK LR L CK F WYL E+ N
Sbjct: 378 KDFFYIISPGVMRVDYGDVSSRKALREALKCKPFAWYLENIYPDSQIPRRYYSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG + CH GGNQ + + EIR D+ CLD + G V++
Sbjct: 438 ETNQCVDNMGRKEN--EKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVVML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ FEYD
Sbjct: 496 KCHHMKGNQMFEYD 509
>gi|432932495|ref|XP_004081767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 2 [Oryzias latipes]
Length = 556
Score = 186 bits (473), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + + + VV P+I I D+TFE + GGF+W L
Sbjct: 211 CECTEGWLEPLLARIKEDRTAVVCPIIDVISDETFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSIDK +FE++G+YD G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKMYFEEIGSYDPGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP + GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYSFPGGTGQVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + + D+GDV+SRK LR L CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVMRVDYGDVSSRKALREALKCKPFAWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG + CH GGNQ + + EIR D+ CLD + G V++
Sbjct: 437 ETNQCVDNMGRKEN--EKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVVML 494
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ FEYD
Sbjct: 495 KCHHMKGNQMFEYD 508
>gi|395846602|ref|XP_003795992.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Otolemur garnettii]
Length = 556
Score = 186 bits (472), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ ++YD
Sbjct: 495 KCHHMRGNQLWDYD 508
>gi|432908535|ref|XP_004077909.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Oryzias latipes]
Length = 557
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 161/314 (51%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + ++ VV P+I I DDTFE + GGF+W L
Sbjct: 210 CECTLGWLEPLLTRIKQDKRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 262
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 263 FRWYPVPQREMDRRKGDRTIPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 322
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 323 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 375
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD+++R LR+ L CK F WYL E+ N
Sbjct: 376 KNFFYIISPGVTKVDYGDISTRTSLRQKLQCKPFSWYLENIYPDSQIPRHYYSLGEIRNV 435
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V++
Sbjct: 436 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVMML 493
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 494 KCHHLKGNQLWEYD 507
>gi|326674972|ref|XP_687472.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
2 [Danio rerio]
Length = 557
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 210 CECTTGWLEPLLSRIKLDKKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 262
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 263 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 322
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 323 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 375
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD+++R LR+ L CK F WYL E+ N
Sbjct: 376 KNFFYIISPGVTKVDYGDISTRTSLRQRLQCKPFSWYLENVYPDSQIPRHYYSLGEIRNV 435
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V++
Sbjct: 436 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVMML 493
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 494 KCHHLKGNQLWEYD 507
>gi|395846604|ref|XP_003795993.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Otolemur garnettii]
Length = 558
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 213 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 265
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 266 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 325
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 326 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 378
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 379 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 438
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 439 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 496
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ ++YD
Sbjct: 497 KCHHMRGNQLWDYD 510
>gi|33440465|gb|AAH56215.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Mus musculus]
Length = 559
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTAGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LRR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|56554527|pdb|1XHB|A Chain A, The Crystal Structure Of Udp-Galnac: Polypeptide Alpha-N-
Acetylgalactosaminyltransferase-T1
Length = 472
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 125 CECTAGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 177
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 178 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 237
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 238 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 290
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LRR L CK F WYL E+ N
Sbjct: 291 KNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 350
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 351 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 408
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 409 KCHHLKGNQLWEYD 422
>gi|237874259|ref|NP_038842.3| polypeptide N-acetylgalactosaminyltransferase 1 [Mus musculus]
gi|237874270|ref|NP_001153876.1| polypeptide N-acetylgalactosaminyltransferase 1 [Mus musculus]
gi|13878613|sp|O08912.1|GALT1_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|2149049|gb|AAB58477.1| polypeptide GalNAc transferase-T1 [Mus musculus]
gi|60552620|gb|AAH90962.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Mus musculus]
Length = 559
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTAGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LRR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|291230380|ref|XP_002735141.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 510
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 125/330 (37%), Positives = 165/330 (50%), Gaps = 61/330 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A + ++VV P+I I D TFE + S +GGFDW L
Sbjct: 144 CECTRGWLEPLLARIAEDKTNVVCPVINIISDTTFEF------INGSDATQVGGFDWRLI 197
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
FNWH +P RE +R K + PV +PTMAGGLFSI K FF +LGTYD GFD+WG ENLELS
Sbjct: 198 FNWHVVPHRELQRIKFDRTSPVRSPTMAGGLFSIHKEFFTRLGTYDPGFDVWGAENLELS 257
Query: 123 FKFNWHAIPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFE--KLGTYD 169
FK W E RKR + P M + + + + K Y+
Sbjct: 258 FK-TWMCGGTLEFVPCSHVGHVFRKRSPHRFPPTTHNVMQRNNRRLAEVWLDEYKYLYYN 316
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------V 212
+ +I K D GD++ R LR L CKSFKWYLE +
Sbjct: 317 AHPEI---------LKTDPGDISERLALRERLQCKSFKWYLENVYPENVFPIHFYGVVTI 367
Query: 213 SNDWSGMCID-----------SACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA 261
+ SG C+D + TD + V L+ CH G Q ++ +K EIR ++
Sbjct: 368 KHIISGNCLDYGNLKMRGKQPTKAGKTDSGQKVELWKCHG-GPVQTFIYTKAKEIRLEKE 426
Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEYDYK 289
CLDY+ G + LYPCHG GNQ + Y+ K
Sbjct: 427 CLDYSAITGSLTLYPCHGQGGNQVWGYNKK 456
>gi|148664577|gb|EDK96993.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1, isoform CRA_a [Mus
musculus]
gi|148664578|gb|EDK96994.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1, isoform CRA_a [Mus
musculus]
Length = 400
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 53 CECTAGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 105
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 106 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 165
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 166 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 218
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LRR L CK F WYL E+ N
Sbjct: 219 KNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 278
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 279 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 336
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 337 KCHHLKGNQLWEYD 350
>gi|224045872|ref|XP_002187347.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
[Taeniopygia guttata]
Length = 559
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKADRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LRR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|126326410|ref|XP_001373038.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Monodelphis domestica]
Length = 556
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DD FE T+ GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKESRKTVVCPIIDLISDDNFEY-------TAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++G YD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGAYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKALRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 437 ETNQCLDNMGRKDN--EKVGMFNCHGMGGNQVFSYTAEKEIRTDDFCLDVSRLSGPVIML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|57530428|ref|NP_001006381.1| polypeptide N-acetylgalactosaminyltransferase 1 [Gallus gallus]
gi|326917238|ref|XP_003204908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Meleagris gallopavo]
gi|53133506|emb|CAG32082.1| hypothetical protein RCJMB04_17f16 [Gallus gallus]
Length = 559
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKADRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LRR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|449278148|gb|EMC86104.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Columba livia]
Length = 553
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 162/314 (51%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + S K + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKADRRTVVCPIIDVISDDTFEY------MAGSDKTY-GGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LRR L C+ F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRRKLQCRPFSWYLENVYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|351714454|gb|EHB17373.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Heterocephalus
glaber]
Length = 559
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + ++ VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKQDRRTVVCPIICVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|417402739|gb|JAA48205.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 559
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + ++ VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKQDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GDV SR LR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDVASRIGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|358336356|dbj|GAA28182.2| polypeptide N-acetylgalactosaminyltransferase [Clonorchis sinensis]
Length = 592
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 112/307 (36%), Positives = 153/307 (49%), Gaps = 44/307 (14%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL+PLL+ + ++S+VV P+I I D ++ T +GGFDW+L F WH
Sbjct: 253 WLEPLLERIKASTSNVVVPVIEIINDQDLSMK-----ATQEASVQVGGFDWSLTFTWHLP 307
Query: 70 PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA 129
P+R++ R P+ +PTMAGGLF+I + FF LG YD ++WGGENLELSFK W
Sbjct: 308 PKRDQIRLGAPYSPIRSPTMAGGLFAIHRDFFAYLGYYDEEMEVWGGENLELSFK-TWMC 366
Query: 130 IPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ E R R + E T + L + + + + D F +
Sbjct: 367 GGQLETVVCSHVGHIFRSRSPYSWESKRTSPIKFNLVRLAETWLD-----DYKFLYYDSL 421
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGMCI 221
N +L GD+GD++SRK +R CKSF+WYL ++ N S CI
Sbjct: 422 NFDL---GDYGDISSRKAIRERNNCKSFQWYLDTIYPELFLPTRALASGDIENMVSPHCI 478
Query: 222 DSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD--YAGGDVILYPCHGSK 279
D V LYPCH+Q GNQ W + EIRR +AC D G V L+ CHG
Sbjct: 479 DGVFNDQKTDNLVKLYPCHRQKGNQLWFYTNKNEIRRHDACFDGNVKPGHVGLFSCHGLG 538
Query: 280 GNQYFEY 286
G Q+FEY
Sbjct: 539 GTQFFEY 545
>gi|410897068|ref|XP_003962021.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Takifugu rubripes]
Length = 556
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 115/312 (36%), Positives = 158/312 (50%), Gaps = 42/312 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + + VV P+I I D+TFE + GGF+W L
Sbjct: 211 CECTVGWLEPLLARIKEDRTAVVCPIIDVISDETFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSIDK +FE++G+YD G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEEIGSYDPGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERERKRHKNA------AEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDI 174
F+ W E + A P P G + + + + + + F I
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYSFPGGTGQVINKNNRRLAEVWMDDFKDFFYI 382
Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWS 217
+ + D+GDV+SRK LR L CK F WYL E+ N +
Sbjct: 383 ISPGVMRV----DYGDVSSRKGLRDALRCKPFSWYLENIYPDSQIPRRYYSLGEIRNVET 438
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPC 275
C+D+ + + + VG + CH GGNQ + + EIR D+ CLD + G V++ C
Sbjct: 439 NQCVDNMGRKEN--EKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVLMLKC 496
Query: 276 HGSKGNQYFEYD 287
H KGNQ FEYD
Sbjct: 497 HHMKGNQMFEYD 508
>gi|296222514|ref|XP_002757211.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Callithrix jacchus]
gi|403265072|ref|XP_003924779.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Saimiri
boliviensis boliviensis]
Length = 559
Score = 183 bits (464), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|395749824|ref|XP_002828218.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Pongo abelii]
Length = 612
Score = 183 bits (464), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|348519902|ref|XP_003447468.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Oreochromis niloticus]
Length = 556
Score = 183 bits (464), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 114/312 (36%), Positives = 158/312 (50%), Gaps = 42/312 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + + VV P+I I D+TFE + GGF+W L
Sbjct: 211 CECTVGWLEPLLARIKEDRTAVVCPIIDVISDETFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSIDK +FE++G+YD G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEEIGSYDPGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERERKRHKNA------AEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDI 174
F+ W E + A P P G + + + + + + F I
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYSFPGGTGQVINKNNRRLAEVWMDDFKDFFYI 382
Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWS 217
+ + ++GDV+SRK LR L CK F WYL E+ N +
Sbjct: 383 ISPGVMRV----EYGDVSSRKALREALKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVET 438
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPC 275
C+D+ + + + VG + CH GGNQ + + EIR D+ CLD + G V++ C
Sbjct: 439 NQCMDNMGRKEN--EKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVVMLKC 496
Query: 276 HGSKGNQYFEYD 287
H KGNQ FEYD
Sbjct: 497 HHMKGNQMFEYD 508
>gi|13242273|ref|NP_077349.1| polypeptide N-acetylgalactosaminyltransferase 1 [Rattus norvegicus]
gi|1709559|sp|Q10473.1|GALT1_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|1141792|gb|AAC52511.1| polypeptide GalNAc transferase [Rattus norvegicus]
gi|149017082|gb|EDL76133.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Rattus norvegicus]
gi|1587757|prf||2207253A UDP-GalNAc polypeptide N-acetylgalactosaminyltransferase
Length = 559
Score = 183 bits (464), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|402902957|ref|XP_003914352.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Papio
anubis]
Length = 559
Score = 183 bits (464), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|73961264|ref|XP_537284.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Canis lupus familiaris]
gi|301764431|ref|XP_002917637.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Ailuropoda melanoleuca]
gi|281348455|gb|EFB24039.1| hypothetical protein PANDA_005970 [Ailuropoda melanoleuca]
Length = 559
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|13124891|ref|NP_065207.2| polypeptide N-acetylgalactosaminyltransferase 1 [Homo sapiens]
gi|386780838|ref|NP_001247531.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|332225596|ref|XP_003261968.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Nomascus leucogenys]
gi|332849764|ref|XP_001135802.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Pan troglodytes]
gi|397520346|ref|XP_003830280.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Pan
paniscus]
gi|426385782|ref|XP_004059381.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Gorilla
gorilla gorilla]
gi|1709558|sp|Q10472.1|GALT1_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|971459|emb|CAA59380.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase [Homo
sapiens]
gi|119621764|gb|EAX01359.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1), isoform
CRA_a [Homo sapiens]
gi|119621765|gb|EAX01360.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1), isoform
CRA_a [Homo sapiens]
gi|261861328|dbj|BAI47186.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [synthetic
construct]
gi|355701910|gb|EHH29263.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|355754989|gb|EHH58856.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Macaca
fascicularis]
gi|380784241|gb|AFE63996.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|383411871|gb|AFH29149.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|384942418|gb|AFI34814.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|410258728|gb|JAA17331.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
troglodytes]
gi|410292416|gb|JAA24808.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
troglodytes]
gi|410338657|gb|JAA38275.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
troglodytes]
Length = 559
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|326923136|ref|XP_003207797.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Meleagris gallopavo]
Length = 556
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTRGWLEPLLARIREDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++G+YD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGSYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRV-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV++RK LR L CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSARKALREALKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G V +
Sbjct: 437 DTNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVTML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|444723970|gb|ELW64593.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Tupaia chinensis]
Length = 591
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 244 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 296
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 297 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 356
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 357 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 409
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L CK F WYL E+ N
Sbjct: 410 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 469
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 470 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 527
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 528 KCHHLKGNQLWEYD 541
>gi|1582794|prf||2119305A UDP-GalNAc/polypeptide N-acetylgalactosaminyltransferase
Length = 559
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLD 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|348576706|ref|XP_003474127.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Cavia porcellus]
Length = 559
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRIGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|327275061|ref|XP_003222292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Anolis carolinensis]
Length = 559
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKADRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|431896245|gb|ELK05661.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Pteropus alecto]
Length = 559
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 157/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD+ SR LR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDIASRLGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|1136285|gb|AAC50327.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 559
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L CK F WYL E+ +
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRKE 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|335775065|gb|AEH58447.1| polypeptide N-acetylgalactosaminyltransferase 1-like protein [Equus
caballus]
Length = 453
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 106 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 158
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 159 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 218
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 219 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 271
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L C+ F WYL E+ N
Sbjct: 272 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNV 331
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 332 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 389
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 390 KCHHLKGNQLWEYD 403
>gi|344269062|ref|XP_003406374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Loxodonta africana]
Length = 559
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L C+ F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|440911421|gb|ELR61095.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Bos grunniens
mutus]
Length = 564
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 217 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 269
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 270 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 329
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 330 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 382
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L C+ F WYL E+ N
Sbjct: 383 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNV 442
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 443 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 500
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 501 KCHHLKGNQLWEYD 514
>gi|426253597|ref|XP_004020479.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Ovis
aries]
Length = 559
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L C+ F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|405956426|gb|EKC23041.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
Length = 203
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 87/152 (57%), Positives = 111/152 (73%), Gaps = 15/152 (9%)
Query: 150 MAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWY 209
MAGGLFSI + +F +LGTYD G DIWGGENLELSF+GD+GDVT+RK+LR L C SF W+
Sbjct: 1 MAGGLFSISREYFTELGTYDPGMDIWGGENLELSFRGDYGDVTNRKKLRERLQCYSFDWF 60
Query: 210 --------------LEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGE 255
+++ + MCIDSA + HK V ++PCH QGGNQ+WM+SK+GE
Sbjct: 61 VKNVYPDPFVPVEAIDLESKAKPMCIDSAVDNHNYHKLVNMWPCHNQGGNQYWMLSKNGE 120
Query: 256 IRRDEACLDYAGGD-VILYPCHGSKGNQYFEY 286
IRRD+ CLDY+GG+ VI+YPCHG KGNQ ++Y
Sbjct: 121 IRRDDGCLDYSGGESVIVYPCHGQKGNQEWQY 152
>gi|410977586|ref|XP_003995186.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Felis
catus]
Length = 559
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L C+ F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|350586068|ref|XP_003482105.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Sus scrofa]
Length = 559
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L C+ F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|29135331|ref|NP_803485.1| polypeptide N-acetylgalactosaminyltransferase 1 precursor [Bos
taurus]
gi|1171989|sp|Q07537.1|GALT1_BOVIN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|289412|gb|AAA30532.1| UDP-GalNAc:polypeptide, N-acetylgalactosaminyltransferase [Bos
taurus]
gi|296473855|tpg|DAA15970.1| TPA: polypeptide N-acetylgalactosaminyltransferase 1 [Bos taurus]
Length = 559
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L C+ F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|304259|gb|AAA68489.1| UDP-GalNAc:polypeptide, N-acetylgalactosaminyltransferase, partial
[Bos taurus]
Length = 519
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 172 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 224
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 225 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 284
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 285 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 337
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L C+ F WYL E+ N
Sbjct: 338 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNV 397
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 398 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 455
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 456 KCHHLKGNQLWEYD 469
>gi|149720888|ref|XP_001496819.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Equus caballus]
Length = 559
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L C+ F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|13878612|sp|Q29121.1|GALT1_PIG RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|1339955|dbj|BAA12800.1| N-acetylgalactosaminyl transferase [Sus sp.]
Length = 559
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L C+ F WYL E+ N
Sbjct: 378 KTFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYSSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|126320794|ref|XP_001362869.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
[Monodelphis domestica]
Length = 559
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKVDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRHYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD+++R LR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISTRVGLRHKLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|116284114|gb|AAH38440.1| GALNT1 protein [Homo sapiens]
Length = 499
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 157/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 152 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 204
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID +F+++GTYD+G DIWGGENLE+S
Sbjct: 205 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDIDYFQEIGTYDAGMDIWGGENLEIS 264
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 265 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 317
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L CK F WYL E+ N
Sbjct: 318 KNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 377
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 378 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 435
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 436 KCHHLKGNQLWEYD 449
>gi|395510712|ref|XP_003759616.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
[Sarcophilus harrisii]
Length = 559
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKVDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRHYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD+++R LR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISTRVGLRHKLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|149412842|ref|XP_001510290.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Ornithorhynchus anatinus]
Length = 559
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKFDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|345308178|ref|XP_003428667.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
2 [Ornithorhynchus anatinus]
Length = 558
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTVGWLEPLLARIKFDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 323
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 324 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L CK F WYL E+ N
Sbjct: 377 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 437 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 494
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 495 KCHHLKGNQLWEYD 508
>gi|118093951|ref|XP_422165.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Gallus
gallus]
Length = 556
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 160/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTRGWLEPLLARIWEDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++G+YD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGSYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRV-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV++RK LR L CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSARKALREALKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G V +
Sbjct: 437 DTNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVTML 494
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508
>gi|147900163|ref|NP_001083410.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Xenopus
laevis]
gi|38014522|gb|AAH60419.1| MGC68664 protein [Xenopus laevis]
Length = 559
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 115/314 (36%), Positives = 157/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARINHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R + + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRRGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD+ +R LR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDIATRVGLRHKLQCKPFSWYLENVYPDSQIPRHYYSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTASKEIRTDDLCLDVSKLNGPVIML 495
Query: 274 PCHGSKGNQYFEYD 287
CH +GNQ +EYD
Sbjct: 496 KCHHLRGNQLWEYD 509
>gi|291243604|ref|XP_002741691.1| PREDICTED: Polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 565
Score = 180 bits (456), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 114/316 (36%), Positives = 164/316 (51%), Gaps = 48/316 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PL+ +A + S VV P+I I D+TFE + GGF+W L
Sbjct: 218 CECTQGWLEPLMARIAEDRSRVVCPIIDVISDETFEFH-------AGSDMTYGGFNWKLN 270
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+++P+RE R K + P+ TPTMAGGLF+I K +FE++GTYD+G DIWGGENLE+S
Sbjct: 271 FRWYSVPKREMDRRKGDRTIPLNTPTMAGGLFAIHKDYFEEIGTYDAGMDIWGGENLEMS 330
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP + GG +I +L ++W +
Sbjct: 331 FRI-WMCGGTLEIVTCSHVGHVFRKTTPYSFPGGTGAIINKNNRRLA------EVWMDDY 383
Query: 180 LEL-------SFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
S K ++GDVT+RK+LR L CKSFKWYL E+ N
Sbjct: 384 KTFFYKISPGSKKSEYGDVTNRKQLRDKLQCKSFKWYLENIYPESQFMMDYNMIGEIRNM 443
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG----GDVI 271
+ C+D+ + + VG+Y CH QGGNQ + +K E++ D+ CLD + D++
Sbjct: 444 ETKQCLDNMGRKEN--NKVGIYACHGQGGNQIFAWTKKKELKHDDLCLDASRQSGFNDIM 501
Query: 272 LYPCHGSKGNQYFEYD 287
CH GNQ + ++
Sbjct: 502 QLRCHNQGGNQEWSFN 517
>gi|196000745|ref|XP_002110240.1| hypothetical protein TRIADDRAFT_22839 [Trichoplax adhaerens]
gi|190586191|gb|EDV26244.1| hypothetical protein TRIADDRAFT_22839 [Trichoplax adhaerens]
Length = 481
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 111/310 (35%), Positives = 153/310 (49%), Gaps = 39/310 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL+PLL + N S+VV P I I + F + G G F+WNL
Sbjct: 173 CEVVDGWLEPLLARIHENRSNVVCPEIDVISFENFGYSYASG--------IRGVFNWNLH 224
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E++R K+ +P+ +PTMAGGLF+I K +FE +G YD DIWGGENLE+SF
Sbjct: 225 FRWRTLPAVEQQRRKSVIDPIRSPTMAGGLFAIHKKYFEDIGLYDDEMDIWGGENLEMSF 284
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ N IP ++P P AG + + ++ D+ DI+
Sbjct: 285 RIWQCGGNLEIIPCSHVGHVFRKSQPYTFPKGAGETLNKNLQRVAEVWM-DNYKDIFYNR 343
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVSNDWSGMC 220
L + +GD++ R ELR+ L CKSF WYL E+ N +G C
Sbjct: 344 FPNLR-QHSYGDISKRIELRKKLKCKSFDWYLKNVFTDVQYPDMIFLAKGELRNPSTGYC 402
Query: 221 IDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD----YAGGDVILYPCH 276
+DS + +G+YPCH QGGNQ + E+ DE CLD G V + PCH
Sbjct: 403 LDSMGNKE--YADIGIYPCHGQGGNQLLTYTIRKELEMDEVCLDALSRRVAGTVKMAPCH 460
Query: 277 GSKGNQYFEY 286
KG Q +E+
Sbjct: 461 RKKGTQLWEH 470
>gi|47225457|emb|CAG11940.1| unnamed protein product [Tetraodon nigroviridis]
Length = 534
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 106/305 (34%), Positives = 150/305 (49%), Gaps = 77/305 (25%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + ++ VV P+I I DDTFE + GGF+W L
Sbjct: 236 CECTTGWLEPLLARIKKDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 288
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 289 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 348
Query: 123 FKFN-WHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
F+ W + + G+ +
Sbjct: 349 FRLQMWFVV------------------CVCVGVTKV------------------------ 366
Query: 182 LSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSA 224
D+GD++SR LR+ L CK F WYL E+ N + C+D+
Sbjct: 367 -----DYGDISSRTTLRQKLQCKPFSWYLENIYPDSQIPRHYYSLGEIRNVETNQCLDNM 421
Query: 225 CKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPCHGSKGNQ 282
+ + + VG++ CH GGNQ + + + EIR D+ CLD + G V++ CH KGNQ
Sbjct: 422 ARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVMMLKCHHLKGNQ 479
Query: 283 YFEYD 287
+EYD
Sbjct: 480 LWEYD 484
>gi|158259585|dbj|BAF85751.1| unnamed protein product [Homo sapiens]
Length = 559
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 157/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ +D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V +
Sbjct: 438 ETNQFLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509
>gi|405975554|gb|EKC40113.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Crassostrea gigas]
Length = 624
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 114/317 (35%), Positives = 164/317 (51%), Gaps = 52/317 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + ++ + VV P+I I DD+FE +T S GGF+W L
Sbjct: 271 CECTEGWLEPLLYEIHKDRTAVVCPIIDVIGDDSFEY------ITGS-DMTWGGFNWKLN 323
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE +R + + P TPTMAGGLFSID+ +F ++G+YD G DIWGGENLE+S
Sbjct: 324 FRWYPVPQRELDRRGGDRSNPTKTPTMAGGLFSIDRDYFYEVGSYDEGMDIWGGENLEMS 383
Query: 123 FKF------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
F+ + R + + W GG+ I +++ ++W
Sbjct: 384 FRVWMCGGKVYIVTCSRVGHVFRKTSPYSW----PGGVARIINHNTQRI------VEVWM 433
Query: 177 GENLELSFK-------GDFGDVTSRKELRRNLGCKSFKWYL-----------------EV 212
E + +K +GDV+ RK LR L CKSFKWYL E+
Sbjct: 434 DEYKDFFYKINPGVRSTSYGDVSERKALREKLHCKSFKWYLQNVYPESQMPVEYHALGEI 493
Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDV 270
N +G CIDS + + + VG+ CH GGNQ + +K ++ D+ CLD + G V
Sbjct: 494 RNKATGQCIDSMGRKSG--EKVGMVQCHGMGGNQIFSYTKKQALQTDDVCLDVSSLHGPV 551
Query: 271 ILYPCHGSKGNQYFEYD 287
L+ CHG GNQ +EYD
Sbjct: 552 KLFQCHGLGGNQKWEYD 568
>gi|449667968|ref|XP_002168066.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Hydra magnipapillata]
Length = 548
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 113/320 (35%), Positives = 168/320 (52%), Gaps = 51/320 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + W++PLL + + +VV P+I I D +L + L + +GGF W+L
Sbjct: 239 CEATEGWVEPLLFRIKEDKRNVVCPVIEVI--DAVDLSYKKTELDRITQ--VGGFTWDLF 294
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNW I E E++ + +P+ +PTMAGGLF+IDK++F ++G+YD+ +IWGGENLE+SF
Sbjct: 295 FNWKEITEDEKRLRADGTQPLKSPTMAGGLFAIDKSYFYEIGSYDNQMEIWGGENLEMSF 354
Query: 124 KF-----NWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
+ IP R + P P G+ F +L ++W
Sbjct: 355 RIWMCGGKLEIIPCSRVGHIFRKENSPYSFPN---GVSKTLAKNFNRLA------EVWMD 405
Query: 178 ENLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYL------------------E 211
E EL ++ +GD++ R ELR+ LGCKSFKWY+ E
Sbjct: 406 EYKELYYRRKPPEDKLVKYGDISERVELRKKLGCKSFKWYIDNVIPDMIGADPNPPAHGE 465
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGE-IRRDEACLDYA---- 266
V N S MC+DS + + + ++PCH+ GGNQF+++SK GE I DE+CLDY+
Sbjct: 466 VRNVASNMCLDSMGNKGNRAQ-IKVFPCHRLGGNQFFVLSKRGEIIHNDESCLDYSLENE 524
Query: 267 GGDVILYPCHGSKGNQYFEY 286
V ++ CHG GNQ + Y
Sbjct: 525 ENKVDMWNCHGLGGNQEWIY 544
>gi|449685123|ref|XP_002167708.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like,
partial [Hydra magnipapillata]
Length = 411
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 113/319 (35%), Positives = 167/319 (52%), Gaps = 48/319 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE W++PLL + S+VV P I +I DT E R S GGF W+L
Sbjct: 51 CETTPGWIEPLLARINEAKSNVVVPTIESIDADTLEYR------ASDNPEQRGGFSWDLM 104
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
++W++IPE E+ ++ ++P+ TPTMAGGLF+IDK++F ++G+YD DIWGGENLELSF
Sbjct: 105 YDWNSIPENEKHLRQSPSDPIRTPTMAGGLFAIDKSYFFEMGSYDQEMDIWGGENLELSF 164
Query: 124 KFNWHA---IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-TYDSGFDIWGGEN 179
+ W I R + V +P ++ K E L + ++W E
Sbjct: 165 RI-WMCGGRIEILPCSRVGHIFRKVTSP------YTFPKGVTETLSKNLNRLAEVWMDEY 217
Query: 180 LELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL------------------EVSN 214
E ++ ++G++T R ELR+ L CKSFKWY+ E+ N
Sbjct: 218 KEYYYRSRPLFRGKEYGNITQRLELRQKLQCKSFKWYMENIYSDMEIPDLYPPAEGEIRN 277
Query: 215 DWSGMCIDS-ACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR-RDEACLDYA----GG 268
S +CIDS ++ VGLYPCH +GG Q + +S GEI +D+ CLD A G
Sbjct: 278 GASNLCIDSMGVVKENVKHQVGLYPCHGEGGAQHFQLSLKGEIIFQDKFCLDVAVASPGA 337
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ + CH +GNQ ++++
Sbjct: 338 FIEFFKCHKQRGNQLWQHN 356
>gi|328723396|ref|XP_001946856.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 1 [Acyrthosiphon pisum]
Length = 615
Score = 177 bits (448), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 120/318 (37%), Positives = 159/318 (50%), Gaps = 50/318 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + N VV P+I I DDTFE ++ GGF+W L
Sbjct: 266 CECADGWLEPLLARIVLNRKTVVCPVIDVISDDTFEY-------VTASDMTWGGFNWKLN 318
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE +R+++ P+ TPTMAGGLFSIDK +F +LG+YD G DIWGGENLE+S
Sbjct: 319 FRWYRVPQREMTRRNQDRTAPLRTPTMAGGLFSIDKDYFYQLGSYDEGMDIWGGENLEMS 378
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
F+ W E H TP T GG I +L + D W
Sbjct: 379 FRI-WMCGGTLEISPCSHVGHVFRKSTPYTFPGGTSHIVNHNNARLA--EVWMDEWKHFY 435
Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVS 213
G N+E+ GDV+ R LR L CKSF+WYL E+
Sbjct: 436 YAINPGASNVEV------GDVSERLALREKLKCKSFRWYLENIYPESQMPLDYYYLGEIK 489
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
N S C+D+ + + + VG+ CH GGNQ + +K +I D+ CLD + G V
Sbjct: 490 NVDSQQCLDTMSRKSG--EKVGMSYCHGLGGNQVFAYTKRSQIMSDDNCLDASNIVGPVS 547
Query: 272 LYPCHGSKGNQYFEYDYK 289
L CHG +GNQ + YD K
Sbjct: 548 LIRCHGLEGNQAWVYDSK 565
>gi|328723394|ref|XP_003247832.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 2 [Acyrthosiphon pisum]
Length = 615
Score = 176 bits (447), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 117/320 (36%), Positives = 156/320 (48%), Gaps = 54/320 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + N VV P+I I DDTFE ++ GGF+W L
Sbjct: 266 CECADGWLEPLLARIVLNRKTVVCPVIDVISDDTFEY-------VTASDMTWGGFNWKLN 318
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE +R+++ P+ TPTMAGGLFSIDK +F +LG+YD G DIWGGENLE+S
Sbjct: 319 FRWYRVPQREMTRRNQDRTAPLRTPTMAGGLFSIDKDYFYQLGSYDEGMDIWGGENLEMS 378
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIW-- 175
F+ IP P P GG+ I + D W
Sbjct: 379 FRVWQCGGTLEIIPCSHVGHVFRDKSPYSFP---GGVSKI--VLHNAARVAEVWMDEWRD 433
Query: 176 -------GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------E 211
G N+E+ GDV+ R LR L CKSF+WYL E
Sbjct: 434 FYYAMNPGASNVEV------GDVSERLALREKLKCKSFRWYLENIYPESQMPLDYYYLGE 487
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GD 269
+ N S C+D+ + + + VG+ CH GGNQ + +K +I D+ CLD + G
Sbjct: 488 IKNVDSQQCLDTMSRKSG--EKVGMSYCHGLGGNQVFAYTKRSQIMSDDNCLDASNIVGP 545
Query: 270 VILYPCHGSKGNQYFEYDYK 289
V L CHG +GNQ + YD K
Sbjct: 546 VSLIRCHGLEGNQAWVYDSK 565
>gi|321456141|gb|EFX67256.1| hypothetical protein DAPPUDRAFT_218737 [Daphnia pulex]
Length = 639
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 115/312 (36%), Positives = 164/312 (52%), Gaps = 38/312 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A N VV P+I I D++FE ++ GGF+W L
Sbjct: 286 CECTEGWLEPLLARVAENRKIVVCPIIDVISDESFEY-------VTASDMTWGGFNWKLN 338
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE +R+ + +P+ TPTMAGGLFSIDK +FE++GTYD G DIWGGENLE+S
Sbjct: 339 FRWYRVPQREMDRRNGDRTQPLRTPTMAGGLFSIDKDYFEEIGTYDEGMDIWGGENLEMS 398
Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E E H +P + GG+ I ++ + D W
Sbjct: 399 FRV-WQCGGELEIIPCSHVGHVFRDKSPYSFPGGVAKIVNKNAARVA--EVWMDRWKDFF 455
Query: 180 LEL---SFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
E+ + + GDV+SR+ LR+ L CKSF+WYL E+ N +
Sbjct: 456 YEMNPGARSVEVGDVSSRRSLRKKLQCKSFRWYLENVYPESQMPLDYFFLGEIRNAETQT 515
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD--VILYPCHG 277
C+D+ + + VG+ CH GGNQ + +K +I D+ CLD G D V L CHG
Sbjct: 516 CLDTMGRKGGEN--VGISYCHGLGGNQVFAYTKRQQIMSDDNCLDATGTDGIVKLIRCHG 573
Query: 278 SKGNQYFEYDYK 289
GNQ + Y+ +
Sbjct: 574 MGGNQAWLYEAQ 585
>gi|156373014|ref|XP_001629329.1| predicted protein [Nematostella vectensis]
gi|156216327|gb|EDO37266.1| predicted protein [Nematostella vectensis]
Length = 499
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 116/319 (36%), Positives = 159/319 (49%), Gaps = 44/319 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A + +VV P+I I D F + S GGF W+L
Sbjct: 157 CEATPGWLEPLLVRIAEDRRNVVCPVIEVINADDFRYQ------ASDVIHERGGFTWDLF 210
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W AIPE E+KR K+ + + +PTMAGGLF+I K +F LG+YDS +IWGGENLE+SF
Sbjct: 211 FTWKAIPEAEKKRRKDETDYIRSPTMAGGLFAIHKKYFYDLGSYDSKMEIWGGENLEMSF 270
Query: 124 KF-----NWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY--DSGFDIW 175
+ +P R + P P G + F +L D D +
Sbjct: 271 RIWMCGGQLEIVPCSRVGHVFRKYTSPYKFPK---GTTTTLARNFNRLAEVWMDEYKDHY 327
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL--------------------EVSND 215
+ E D GD++ R LR+ LGCKSFKWYL ++ N
Sbjct: 328 YRKKTEEERNVDIGDISDRVALRKRLGCKSFKWYLDNIYPDMTNKLPPKSYLYSHQIRNK 387
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR-RDEACLDYAGGD----V 270
S +C+D+ + K VGLY CH GGNQF+ ++K EI D+ CLD GD V
Sbjct: 388 ESSLCLDTLGEKN--IKRVGLYTCHGMGGNQFFTLTKSNEILFNDDKCLDSPNGDPGSYV 445
Query: 271 ILYPCHGSKGNQYFEYDYK 289
+ CHG KGNQ ++++ +
Sbjct: 446 EMITCHGLKGNQEWKHNKR 464
>gi|358332241|dbj|GAA27774.2| polypeptide N-acetylgalactosaminyltransferase [Clonorchis sinensis]
Length = 584
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 117/318 (36%), Positives = 160/318 (50%), Gaps = 56/318 (17%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
E K WL+PLLD + +N S VVSP+I I DDTF + P L+ + +GGFDW++ +
Sbjct: 231 ECNKGWLEPLLDCIQKNQSTVVSPVIDRINDDTFA--YEPLLLS---QIQVGGFDWDMTY 285
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
NWH P+R+ +R P+ PT+AGGLFS+ + FF LG YD D+WGGENLELSFK
Sbjct: 286 NWHVPPKRDLERPGAPFTPIRAPTIAGGLFSVHRDFFAYLGYYDPQMDVWGGENLELSFK 345
Query: 125 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS-------GFDIWGG 177
W V + G +F + K T D+ ++W
Sbjct: 346 -TWMC----------GGTLQVHPCSHVGHVFRTKSPYSAKNNTGDTLRHNLVRLAEVWMD 394
Query: 178 ENL-----ELSFK-GDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSN 214
E SFK GD+GDV+ RK LR L C+SFKWYL ++ +
Sbjct: 395 EYKGYFYERFSFKLGDYGDVSERKALRERLKCRSFKWYLNNVFPELFVPSNSLANGDIES 454
Query: 215 DWSGMCIDSACKPTDMHKP----VGLYPCHKQGGNQFWMMSKHGEIRRDEAC--LDYAGG 268
+C+D++ D H+P + YPCH+ GGNQ W + EIRRD C +D A G
Sbjct: 455 FKMAICLDAS---ADDHQPELHLLRGYPCHRLGGNQLWYWTPDKEIRRDNRCWSVDEASG 511
Query: 269 DVILYPCHGSKGNQYFEY 286
+ + C G+ Q F Y
Sbjct: 512 FIGMAKCGGTD-KQKFNY 528
>gi|195114266|ref|XP_002001688.1| GI16986 [Drosophila mojavensis]
gi|193912263|gb|EDW11130.1| GI16986 [Drosophila mojavensis]
Length = 633
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 118/310 (38%), Positives = 156/310 (50%), Gaps = 38/310 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + +N VV P+I I DDTFE +T+S + GGF+W L
Sbjct: 286 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDDTFEY------ITASDSTW-GGFNWKLN 338
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R N P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 339 FRWYRVPQREMARRNNDRTAPLRTPTMAGGLFSIDKEYFYEIGSYDEGMDIWGGENLEMS 398
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W I E H +P T GG+ I + D W
Sbjct: 399 FRI-WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKI--VLHNAARVAEVWLDEWRDFY 455
Query: 180 LELSF---KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
+S K GDV+ RK LR L CKSF+WYL E+ N +
Sbjct: 456 YAMSTGARKASAGDVSDRKALRERLQCKSFRWYLENVYPESLMPLDYYYLGEIRNAETET 515
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
C+D+ + ++ VG CH GGNQ + +K +I D+ CLD A G V + CH
Sbjct: 516 CLDTMGR--KYNEKVGSSYCHGLGGNQVFAYTKRQQIMSDDLCLDAASSNGPVNMVRCHN 573
Query: 278 SKGNQYFEYD 287
GNQ + YD
Sbjct: 574 MGGNQEWVYD 583
>gi|195035019|ref|XP_001989024.1| GH11491 [Drosophila grimshawi]
gi|193905024|gb|EDW03891.1| GH11491 [Drosophila grimshawi]
Length = 621
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 117/310 (37%), Positives = 157/310 (50%), Gaps = 38/310 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + +N VV P+I I D+TFE +T+S + GGF+W L
Sbjct: 274 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 326
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R N P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 327 FRWYRVPQREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 386
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W I E H +P T GG+ I + D W
Sbjct: 387 FRI-WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKI--VLHNAARVAEVWLDEWRDFY 443
Query: 180 LELSF---KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
+S K GDV+ RK LR L CKSF+WYL E+ N +
Sbjct: 444 YAMSTGARKASAGDVSDRKSLRDRLQCKSFRWYLENVYPESLMPLDYYYLGEIRNSETET 503
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
C+D+ + ++ VG+ CH GGNQ + +K +I D+ CLD A G V + CH
Sbjct: 504 CLDTMGRK--YNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDAASSNGPVNMVRCHN 561
Query: 278 SKGNQYFEYD 287
GNQ + YD
Sbjct: 562 MGGNQEWVYD 571
>gi|157135226|ref|XP_001663438.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108870268|gb|EAT34493.1| AAEL013274-PA [Aedes aegypti]
Length = 592
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 115/310 (37%), Positives = 164/310 (52%), Gaps = 38/310 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + + VV P+I I D+TFE +T+S + + GGF+W L
Sbjct: 236 CECTEGWLEPLLARIVLDRKTVVCPIIDVISDETFEY------VTASDQTW-GGFNWKLN 288
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P RE ++R+ + P+ TPTMAGGLFSID+ +F ++G+YD G DIWGGENLE+S
Sbjct: 289 FRWYRVPAREMQRRNHDRTAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMS 348
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W I E H +P T GG+ +I + D W
Sbjct: 349 FRI-WQCGGILEIAPCSHVGHVFRDKSPYTFPGGVANI--VLKNAARVAEVWLDEWKEFY 405
Query: 180 LELS---FKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
++S K GDV+ RKELR L CKSF+WYL E+ N +G
Sbjct: 406 YQMSPGARKASAGDVSERKELRERLKCKSFRWYLENIYPESQMPLDYYFLGEIRNVETGN 465
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
C+D+ + ++ + +G CH GGNQ + +K ++ D+ CLD + G V L CHG
Sbjct: 466 CLDTMGRKSN--EKIGSSYCHGLGGNQVFAYTKRHQVMSDDNCLDASNALGPVNLVRCHG 523
Query: 278 SKGNQYFEYD 287
GNQ + YD
Sbjct: 524 MGGNQEWVYD 533
>gi|405967230|gb|EKC32416.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
Length = 347
Score = 173 bits (438), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 99/231 (42%), Positives = 129/231 (55%), Gaps = 41/231 (17%)
Query: 86 TPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPV 145
+PTMA GLFSI + +F KLGTYD G DIWGGENLELSF+ W E +
Sbjct: 77 SPTMARGLFSISREYFTKLGTYDPGMDIWGGENLELSFRV-WMCCGTLE----------I 125
Query: 146 WTPTMAGGLFSIDKAFFEKLGTYDSG------FDIWGGENLELSFK------GDFGDVTS 193
+ G +F F + G ++W E ++ GD+GDVT
Sbjct: 126 IPCSHVGHIFRKRSLFKCRTGVNVVKKNSIRLAEVWMDEYKNYYYERFNYDLGDYGDVTD 185
Query: 194 RKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHKPVGL 236
RK+LR L C SF W++ E+ + MCIDSA + HKPV +
Sbjct: 186 RKKLRERLQCHSFDWFVKNVYPDLFVPGEAIASGEIRSKAKPMCIDSAVDNHNYHKPVNM 245
Query: 237 YPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD-VILYPCHGSKGNQYFEY 286
+PCH QGGNQ+WM+SK+GEIRRD+ CLDY+GG+ VI+YPCHG KGNQ ++Y
Sbjct: 246 WPCHNQGGNQYWMLSKNGEIRRDDGCLDYSGGESVIVYPCHGQKGNQEWQY 296
>gi|357624971|gb|EHJ75544.1| hypothetical protein KGM_17358 [Danaus plexippus]
Length = 626
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 115/310 (37%), Positives = 157/310 (50%), Gaps = 34/310 (10%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + + S VV P+I I D TFE + GGF+W L
Sbjct: 276 CECTEGWLEPLLSRIVEDRSTVVCPIIDVISDTTFEY-------IQASDMTWGGFNWKLN 328
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +PERE ++R + P+ TPTMAGGLF+ID+ +F K+G+YD G DIWGGENLE+S
Sbjct: 329 FRWYRVPEREMQRRGGDRTAPLRTPTMAGGLFAIDREYFYKIGSYDEGMDIWGGENLEMS 388
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W + E H +P + GG+ ++ + D WG
Sbjct: 389 FRV-WQCGGVLEIVPCSHVGHVFRDKSPYSFPGGVQAV--VLKNAARVAEVWMDEWGEFY 445
Query: 180 LEL---SFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCID------------SA 224
+ + GDV+ RK LR L CKSF+WYLE S M +D S
Sbjct: 446 YAMNPGALNVPVGDVSERKALRERLKCKSFRWYLENIYPESQMPLDYYYLGEIRNAETSN 505
Query: 225 CKPT---DMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPCHGSK 279
C T +P+G+ CH GGNQ + +K +I D+ CLD A G + L CHG +
Sbjct: 506 CLDTLGGKAGQPLGMGYCHGMGGNQVFAYTKRKQIMSDDNCLDAAHPRGPIKLIRCHGMR 565
Query: 280 GNQYFEYDYK 289
GNQ + YD K
Sbjct: 566 GNQEWTYDTK 575
>gi|157113705|ref|XP_001652065.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108877647|gb|EAT41872.1| AAEL006558-PA [Aedes aegypti]
Length = 368
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 166/314 (52%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + + VV P+I I D+TFE +T+S + + GGF+W L
Sbjct: 12 CECTEGWLEPLLARIVLDRKTVVCPIIDVISDETFEY------VTASDQTW-GGFNWKLN 64
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P RE ++R+ + P+ TPTMAGGLFSID+ +F ++G+YD G DIWGGENLE+S
Sbjct: 65 FRWYRVPAREMQRRNHDRTAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMS 124
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W I E H +P T GG+ +I ++ ++W E
Sbjct: 125 FRI-WQCGGILEIAPCSHVGHVFRDKSPYTFPGGVANIVLKNAARVA------EVWLDEW 177
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
E + K GDV+ RKELR L CKSF+WYL E+ N
Sbjct: 178 KEFYYQMSPGARKASAGDVSERKELRERLKCKSFRWYLENIYPESQMPLDYYFLGEIRNV 237
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY--AGGDVILY 273
+G C+D+ + ++ + +G CH GGNQ + +K ++ D+ CLD A G V L
Sbjct: 238 ETGNCLDTMGRKSN--EKIGSSYCHGLGGNQVFAYTKRHQVMSDDNCLDASNALGPVNLV 295
Query: 274 PCHGSKGNQYFEYD 287
CHG GNQ + YD
Sbjct: 296 RCHGMGGNQEWVYD 309
>gi|125985507|ref|XP_001356517.1| GA16368 [Drosophila pseudoobscura pseudoobscura]
gi|54644841|gb|EAL33581.1| GA16368 [Drosophila pseudoobscura pseudoobscura]
Length = 630
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 117/310 (37%), Positives = 157/310 (50%), Gaps = 38/310 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + +N VV P+I I D+TFE +T+S + GGF+W L
Sbjct: 283 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 335
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P RE R N P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 336 FRWYRVPSREMSRRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 395
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W I E H +P T GG+ I + D W
Sbjct: 396 FRI-WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKI--VLHNAARVAEVWLDEWRDFY 452
Query: 180 LELSF---KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
+S K GDV+ RK+LR L CKSF+WYL E+ N +
Sbjct: 453 YAMSTGARKASAGDVSDRKDLRDRLKCKSFRWYLENVYPESLMPLDYYYLGEIRNAETET 512
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
C+D+ + ++ VG+ CH GGNQ + +K +I D+ CLD A G V + CH
Sbjct: 513 CLDTMGRK--YNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDAASSNGPVNMVRCHN 570
Query: 278 SKGNQYFEYD 287
GNQ + YD
Sbjct: 571 MGGNQEWVYD 580
>gi|195147490|ref|XP_002014712.1| GL18803 [Drosophila persimilis]
gi|194106665|gb|EDW28708.1| GL18803 [Drosophila persimilis]
Length = 630
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 117/310 (37%), Positives = 157/310 (50%), Gaps = 38/310 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + +N VV P+I I D+TFE +T+S + GGF+W L
Sbjct: 283 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 335
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P RE R N P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 336 FRWYRVPSREMSRRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 395
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W I E H +P T GG+ I + D W
Sbjct: 396 FRI-WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKI--VLHNAARVAEVWLDEWRDFY 452
Query: 180 LELSF---KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
+S K GDV+ RK+LR L CKSF+WYL E+ N +
Sbjct: 453 YAMSTGARKASAGDVSDRKDLRDRLKCKSFRWYLENVYPESLMPLDYYYLGEIRNAETET 512
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
C+D+ + ++ VG+ CH GGNQ + +K +I D+ CLD A G V + CH
Sbjct: 513 CLDTMGRK--YNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDAASSNGPVNMVRCHN 570
Query: 278 SKGNQYFEYD 287
GNQ + YD
Sbjct: 571 MGGNQEWVYD 580
>gi|161077160|ref|NP_001097343.1| CG30463, isoform E [Drosophila melanogaster]
gi|157400368|gb|ABV53824.1| CG30463, isoform E [Drosophila melanogaster]
Length = 264
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 105/239 (43%), Positives = 133/239 (55%), Gaps = 65/239 (27%)
Query: 89 MAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTP 148
MAGGLFSID+ FF++LGTYDSGFDIWGGENLELSFK W
Sbjct: 1 MAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFK--------------------TW-- 38
Query: 149 TMAGGLFSIDKA-----FFEKLGTYD--SGFDIWGGENLELSF----------------- 184
M GG I F K Y SG ++ ++ L+
Sbjct: 39 -MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLKKNSVRLAEVWMDEYSQYYYHRIGND 97
Query: 185 KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKP 227
KGD+GDV+ R++LR +L CKSFKWYL E++N +GMC+D A +
Sbjct: 98 KGDWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFIPGDSVAHGEIANVPNGMCLD-AKEK 156
Query: 228 TDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
++ PV +Y CH QGGNQ+WM+SK GEIRRD++CLDYAG DV L+ CHG KGNQ++ Y
Sbjct: 157 SEEETPVSIYECHGQGGNQYWMLSKAGEIRRDDSCLDYAGKDVTLFGCHGGKGNQFWTY 215
>gi|242001786|ref|XP_002435536.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215498872|gb|EEC08366.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 460
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A + + VV P+I I D+TFE S+ GGF+W L
Sbjct: 113 CECTQNWLEPLLARIAEDRTRVVCPVIDVISDETFEY-------ISASDLTWGGFNWKLN 165
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE +R + PV TPTMAGGLF+IDK +F +LG YD G DIWGGENLELS
Sbjct: 166 FRWYRVPQRELDRRGGDRTLPVRTPTMAGGLFAIDKDYFVELGKYDEGMDIWGGENLELS 225
Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E E H TP T GG I +L ++W E
Sbjct: 226 FRI-WMCGGELEIVPCSHVGHVFRKSTPYTFPGGTSKIVNHNNARLA------EVWLDEW 278
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
E F D GD++ R+ LR+ L C SF+WYL E+ +
Sbjct: 279 KEFYFAINPAAKNVDKGDLSHRRNLRKKLKCNSFRWYLENIYPESHMPLDYYHLGEIKHA 338
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILY 273
S +C+D+ + + + V + CH QGGNQ + +K +I D+ CLD + G V L
Sbjct: 339 DSPVCLDTFGRKSGEN--VAVSTCHGQGGNQVFAYTKRQQIMSDDNCLDASSPRGPVKLL 396
Query: 274 PCHGSKGNQYFEYD 287
CHG GNQ + YD
Sbjct: 397 RCHGMGGNQLWIYD 410
>gi|405966386|gb|EKC31679.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
Length = 206
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 84/155 (54%), Positives = 106/155 (68%), Gaps = 18/155 (11%)
Query: 150 MAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWY 209
MAGGLFSI + +F + GTYD G DIWGGE LELSF+ D+G VT RK+L L C SF W+
Sbjct: 1 MAGGLFSISREYFTEPGTYDPGMDIWGGEKLELSFRVDYGVVTDRKKLLERLQCHSFDWF 60
Query: 210 L-----------------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK 252
+ E+ + MCIDSA + HKPV ++PCH QGGNQ+WM+SK
Sbjct: 61 VKNVYPDLFVPGEAIASGEIRSKAKPMCIDSAVDNHNYHKPVNMWPCHNQGGNQYWMLSK 120
Query: 253 HGEIRRDEACLDYAGGD-VILYPCHGSKGNQYFEY 286
+GEIRRD+ CLDY+GG+ VI+YPCHG KGNQ ++Y
Sbjct: 121 NGEIRRDDGCLDYSGGESVIVYPCHGQKGNQEWQY 155
>gi|196001819|ref|XP_002110777.1| hypothetical protein TRIADDRAFT_22201 [Trichoplax adhaerens]
gi|190586728|gb|EDV26781.1| hypothetical protein TRIADDRAFT_22201 [Trichoplax adhaerens]
Length = 518
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 111/307 (36%), Positives = 150/307 (48%), Gaps = 40/307 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + +N + VV P I I D+TFE + G + G F+WNL
Sbjct: 168 CEANVGWLEPLLYRIMQNRTIVVCPEIDVISDETFEYTYSSGNVR-------GSFNWNLN 220
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W A+PE E KR + + +PTMAGGLF+I +F+ +G YD +IWGGENLELSF
Sbjct: 221 FRWKAVPEYENKRRAARTDGIRSPTMAGGLFTIHSQYFKDIGLYDKQMEIWGGENLELSF 280
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ IP ++P P G S K + G+ + +
Sbjct: 281 RIWQCGGQLEIIPCSHVGHVFRKSQPYSFPKGTGETLS--KNLQRVAEVWMDGYKRYFYK 338
Query: 179 NLELSFKGD-FGDVTSRKELRRNLGCKSFKWYL------------------EVSNDWSGM 219
+ KG FGD++ R ELR+ L CK+F WY+ E+ N SG
Sbjct: 339 R-QPHLKGHPFGDISKRLELRKKLKCKNFDWYIKNVVPEIFLPNSSIIARGELRNPASGD 397
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD----VILYPC 275
CIDS H +G+Y CHKQ GNQ+ + +K+ EI D+ C DYA V + C
Sbjct: 398 CIDSLG--AGEHAYIGIYKCHKQMGNQYLVYTKNEEIIVDDNCFDYANSQPSSKVKMLDC 455
Query: 276 HGSKGNQ 282
H KGNQ
Sbjct: 456 HSMKGNQ 462
>gi|194856530|ref|XP_001968770.1| GG24317 [Drosophila erecta]
gi|190660637|gb|EDV57829.1| GG24317 [Drosophila erecta]
Length = 630
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 117/310 (37%), Positives = 156/310 (50%), Gaps = 38/310 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + +N VV P+I I D+TFE +T+S + GGF+W L
Sbjct: 283 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 335
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P RE R N P+ TPTMAGGLFSIDK +F +LG+YD G DIWGGENLE+S
Sbjct: 336 FRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYELGSYDEGMDIWGGENLEMS 395
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W I E H +P T GG+ I + D W
Sbjct: 396 FRI-WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKI--VLHNAARVAEVWLDEWRDFY 452
Query: 180 LELSF---KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
+S K GDV+ RK LR L CKSF+WYL E+ N +
Sbjct: 453 YSMSTGARKASAGDVSDRKALRDRLKCKSFRWYLENVYPESLMPLDYYYLGEIRNAETET 512
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
C+D+ + ++ VG+ CH GGNQ + +K +I D+ CLD + G V + CH
Sbjct: 513 CLDTMGRK--YNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDASSSNGPVNMVRCHN 570
Query: 278 SKGNQYFEYD 287
GNQ + YD
Sbjct: 571 MGGNQEWVYD 580
>gi|91088223|ref|XP_973543.1| PREDICTED: similar to polypeptide GalNAc transferase 5 CG31651-PA
[Tribolium castaneum]
gi|270011823|gb|EFA08271.1| hypothetical protein TcasGA2_TC005902 [Tribolium castaneum]
Length = 602
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 112/316 (35%), Positives = 157/316 (49%), Gaps = 50/316 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + ++ VV P+I I D+TFE ++ GGF+W L
Sbjct: 249 CECTEGWLEPLLARIVQDRKTVVCPIIDVISDETFEY-------ITASDMTWGGFNWKLN 301
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE +R N P+ TPTMAGGLFSIDK +F +LG+YD G DIWGGENLE+S
Sbjct: 302 FRWYRVPQREMERRNNDRTAPLRTPTMAGGLFSIDKEYFYELGSYDEGMDIWGGENLEMS 361
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ IP P T GG+ I L ++W
Sbjct: 362 FRVWQCGGKLEIIPCSHVGHVFRDKSPY---TFPGGVSKI------VLHNAARVAEVWMD 412
Query: 178 ENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
E + + + GDV++R+ELR L CKSF+WYLE +
Sbjct: 413 EWRDFYYAMNPGARSVPVGDVSARRELRERLKCKSFRWYLENVYPESQMPLEYYYLGDIR 472
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
N + C+D+ + + + +G+ CH GGNQ + +K +I D+ CLD + G V
Sbjct: 473 NVETKNCLDTMGRKSGEN--LGMTYCHNLGGNQVFAYTKRQQIMSDDNCLDASNKKGPVK 530
Query: 272 LYPCHGSKGNQYFEYD 287
L CHG GNQ + YD
Sbjct: 531 LVRCHGMGGNQAWAYD 546
>gi|47226346|emb|CAG09314.1| unnamed protein product [Tetraodon nigroviridis]
Length = 632
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 119/338 (35%), Positives = 161/338 (47%), Gaps = 65/338 (19%)
Query: 4 CEVQKRWLQPLLD----------------VLARNS-------SHVVSPLIANICDDTFEL 40
CE WL+PLL V R S + VV P+I I D+TFE
Sbjct: 211 CECTVGWLEPLLARIKEDRWDCNTALCVCVFERPSFRCFLFRTAVVCPIIDVISDETFEY 270
Query: 41 RFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKA 99
+ GGF+W L F W+ +P+RE R K + PV TPTMAGGLFSIDK
Sbjct: 271 -------MAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKT 323
Query: 100 FFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNA------AEPVWTPTMAGG 153
+FE++G+YD G DIWGGENLE+SF+ W E + A P P G
Sbjct: 324 YFEEIGSYDPGMDIWGGENLEMSFRI-WQCGGSLEIVTCSHVGHVFRKATPYSFPGGTGQ 382
Query: 154 LFSIDKAFFEK--LGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL- 210
+ + + + + + F I + + D+GDV+SRK LR L CK F WYL
Sbjct: 383 VINKNNRRLAEVWMDDFKDFFYIISPGVMRV----DYGDVSSRKGLRDALHCKPFSWYLE 438
Query: 211 ----------------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHG 254
E+ N + C+D+ + + + VG + CH GGNQ + +
Sbjct: 439 NIYPDSQIPRRYYSLGEIRNVETNQCVDNMGRKEN--EKVGFFNCHGMGGNQVFSYTADK 496
Query: 255 EIRRDEACLDYA--GGDVILYPCHGSKGNQYFEYDYKY 290
EIR D+ CLD + G V++ CH KGNQ FEYD +Y
Sbjct: 497 EIRTDDLCLDVSRLNGPVLMLKCHHMKGNQMFEYDAEY 534
>gi|195386582|ref|XP_002051983.1| GJ24116 [Drosophila virilis]
gi|194148440|gb|EDW64138.1| GJ24116 [Drosophila virilis]
Length = 632
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 117/310 (37%), Positives = 156/310 (50%), Gaps = 38/310 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + +N VV P+I I D+TFE +T+S + GGF+W L
Sbjct: 285 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 337
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R N P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 338 FRWYRVPQREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 397
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W I E H +P T GG+ I + D W
Sbjct: 398 FRI-WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKI--VLHNAARVAEVWLDEWRDFY 454
Query: 180 LELSF---KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
+S K GDV+ RK LR L CKSF+WYL E+ N +
Sbjct: 455 YAMSTGARKASAGDVSDRKALRDRLQCKSFRWYLENVYPESLMPLDYYYLGEIRNAETET 514
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPCHG 277
C+D+ + ++ VG CH GGNQ + +K +I D+ CLD A G V + CH
Sbjct: 515 CLDTMGRK--YNEKVGSSYCHGLGGNQVFAYTKRQQIMSDDLCLDAASSSGPVNMVRCHN 572
Query: 278 SKGNQYFEYD 287
GNQ + YD
Sbjct: 573 MGGNQEWVYD 582
>gi|308481980|ref|XP_003103194.1| CRE-GLY-3 protein [Caenorhabditis remanei]
gi|308260299|gb|EFP04252.1| CRE-GLY-3 protein [Caenorhabditis remanei]
Length = 615
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 125/335 (37%), Positives = 165/335 (49%), Gaps = 83/335 (24%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV WL+PL+ +A + VV+P+I I DDTFE +T+S + GGF+W+L
Sbjct: 268 VEVTDGWLEPLVTRVAEDRKRVVAPIIDVISDDTFEY------VTASETTW-GGFNWHLN 320
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+A+P+RE +R + + P+ TPT+AGGLF+IDK FF +G+YD G +WGGENLE+S
Sbjct: 321 FRWYAVPKRELNRRGADRSMPIQTPTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEIS 380
Query: 123 FKFNWHAIPERE-----------RKR-------------HKNAAEP--VWTPTMAGGLFS 156
F+ W E RK+ H NAA VW
Sbjct: 381 FRV-WMCGGSLEIHPCSRVGHVFRKQTPYTFPGGTAKVIHHNAARTAEVWMDEY------ 433
Query: 157 IDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE----- 211
KAFF K+ + N+E GDVT RK+LR L CKSFKWYLE
Sbjct: 434 --KAFFYKM--------VPAARNVEA------GDVTERKKLRETLQCKSFKWYLENIYPE 477
Query: 212 ------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD 259
+ N ++ CID+ K D P GL CH GGNQ W ++ GEIR D
Sbjct: 478 APLPADFKSLGAIVNRFTEKCIDTNGK-KDGQSP-GLQGCHGSGGNQAWSLTGKGEIRSD 535
Query: 260 EACLDYA-----GGDVILYPCHGSKGN--QYFEYD 287
+ CL G ++ L C SK N FE+D
Sbjct: 536 DLCLSSGHVYQIGSELKLERCSVSKINIKHVFEFD 570
>gi|170043866|ref|XP_001849590.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
gi|167867153|gb|EDS30536.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
Length = 600
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 114/310 (36%), Positives = 163/310 (52%), Gaps = 38/310 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + + VV P+I I D+TFE +T+S + + GGF+W L
Sbjct: 241 CECTEGWLEPLLARIVLDRKTVVCPIIDVISDETFEY------VTASDQTW-GGFNWKLN 293
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P RE ++R+ + P+ TPTMAGGLFSID+ +F ++G+YD G DIWGGENLE+S
Sbjct: 294 FRWYRVPSREMQRRNHDRTAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMS 353
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W I E H +P T GG+ +I + D W
Sbjct: 354 FRI-WQCGGILEIAPCSHVGHVFRDKSPYTFPGGVANI--VLKNAARVAEVWLDEWKEFY 410
Query: 180 LELS---FKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
++S K GDV+ R+ LR L CKSF+WYL E+ N+ S
Sbjct: 411 YQMSPGARKASAGDVSERRALREKLKCKSFRWYLENIYPESQMPLDYYFLGEIRNEESQN 470
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
C+D+ + ++ + +G CH GGNQ + +K +I D+ CLD + G V L CHG
Sbjct: 471 CLDTMGRKSN--EKIGSSYCHGLGGNQVFAYTKRHQIMSDDNCLDASNALGPVNLVRCHG 528
Query: 278 SKGNQYFEYD 287
GNQ + YD
Sbjct: 529 MGGNQEWVYD 538
>gi|34042969|gb|AAQ56702.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 617
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 116/310 (37%), Positives = 156/310 (50%), Gaps = 38/310 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + +N VV P+I I D+TFE +T+S + GGF+W L
Sbjct: 270 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 322
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P RE R N P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 323 FRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 382
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W I E H +P T GG+ I + D W
Sbjct: 383 FRI-WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKI--VLHNAARVAEVWLDEWRDFY 439
Query: 180 LELSF---KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
+S K GDV+ RK LR L CKSF+WYL E+ N +
Sbjct: 440 YSMSTGARKASAGDVSDRKALRDRLKCKSFRWYLENVYPESLMPLDYYYLGEIRNAETET 499
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
C+D+ + ++ VG+ CH GGNQ + +K +I D+ CLD + G V + CH
Sbjct: 500 CLDTMGRK--YNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDASSSNGPVNMVRCHN 557
Query: 278 SKGNQYFEYD 287
GNQ + YD
Sbjct: 558 MGGNQEWVYD 567
>gi|350402581|ref|XP_003486533.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 3 [Bombus impatiens]
Length = 607
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 116/309 (37%), Positives = 157/309 (50%), Gaps = 35/309 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A + + VV P+I I DDTFE P +T GGF+W L
Sbjct: 258 CECTEGWLEPLLSRIAEDRTTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 310
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ + +RE +R + P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 311 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMS 370
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTY--DSGFDIWGG 177
F+ W E H +P T GG+ + ++ D D +
Sbjct: 371 FRV-WQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKVVLHNAARVAEVWMDEWRDFYYA 429
Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSNDWSGMC 220
N E + GDV+ R +LR L CKSF+WYLE V N + C
Sbjct: 430 MNPEGARNVAVGDVSERIKLRERLKCKSFRWYLENIYPESPMPLDYYYLGDVQNVETQSC 489
Query: 221 IDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHGS 278
+D+ + T + VG+ CH GGNQ + +K +I D+ CLD A G V + CHG
Sbjct: 490 LDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVKIVRCHGM 547
Query: 279 KGNQYFEYD 287
GNQ + Y+
Sbjct: 548 GGNQAWVYN 556
>gi|24581865|ref|NP_608906.2| polypeptide GalNAc transferase 5, isoform A [Drosophila
melanogaster]
gi|195342664|ref|XP_002037920.1| GM18035 [Drosophila sechellia]
gi|51315874|sp|Q6WV17.2|GALT5_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
Short=pp-GaNTase 5; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 5; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|22945641|gb|AAF52218.2| polypeptide GalNAc transferase 5, isoform A [Drosophila
melanogaster]
gi|194132770|gb|EDW54338.1| GM18035 [Drosophila sechellia]
Length = 630
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 116/310 (37%), Positives = 156/310 (50%), Gaps = 38/310 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + +N VV P+I I D+TFE +T+S + GGF+W L
Sbjct: 283 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 335
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P RE R N P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 336 FRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 395
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W I E H +P T GG+ I + D W
Sbjct: 396 FRI-WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKI--VLHNAARVAEVWLDEWRDFY 452
Query: 180 LELSF---KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
+S K GDV+ RK LR L CKSF+WYL E+ N +
Sbjct: 453 YSMSTGARKASAGDVSDRKALRDRLKCKSFRWYLENVYPESLMPLDYYYLGEIRNAETET 512
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
C+D+ + ++ VG+ CH GGNQ + +K +I D+ CLD + G V + CH
Sbjct: 513 CLDTMGRK--YNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDASSSNGPVNMVRCHN 570
Query: 278 SKGNQYFEYD 287
GNQ + YD
Sbjct: 571 MGGNQEWVYD 580
>gi|16648224|gb|AAL25377.1| GH23657p [Drosophila melanogaster]
Length = 536
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 116/310 (37%), Positives = 156/310 (50%), Gaps = 38/310 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + +N VV P+I I D+TFE +T+S + GGF+W L
Sbjct: 189 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 241
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P RE R N P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 242 FRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 301
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W I E H +P T GG+ I + D W
Sbjct: 302 FRI-WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKI--VLHNAARVAEVWLDEWRDFY 358
Query: 180 LELSF---KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
+S K GDV+ RK LR L CKSF+WYL E+ N +
Sbjct: 359 YSMSTGARKASAGDVSDRKALRDRLKCKSFRWYLENVYPESLMPLDYYYLGEIRNAETET 418
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
C+D+ + ++ VG+ CH GGNQ + +K +I D+ CLD + G V + CH
Sbjct: 419 CLDTMGR--KYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDASSSNGPVNMVRCHN 476
Query: 278 SKGNQYFEYD 287
GNQ + YD
Sbjct: 477 MGGNQEWVYD 486
>gi|242011902|ref|XP_002426682.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212510853|gb|EEB13944.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 605
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 114/315 (36%), Positives = 159/315 (50%), Gaps = 50/315 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + + VV P+I I D+TFE +T+S + GGF+W L
Sbjct: 254 CECTEGWLEPLLARITEDRKTVVCPIIDVISDETFEY------ITASDTTW-GGFNWRLN 306
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R N P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 307 FRWYRVPKREMDRRNNDKTVPIRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 366
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ +P P T GG+ I L + ++W
Sbjct: 367 FRVWQCGGTLEIVPCSHVGHVFRDKSPY---TFPGGVSQI------VLHNANRVAEVWMD 417
Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVS 213
E + + K + GD+TSR +LR +L CKSF+WYL ++
Sbjct: 418 EWRDFYYAMNPGAKKIEVGDITSRLKLREDLKCKSFRWYLTNIYPESTMPLDYYFLGDIK 477
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
N + C+D+ + + + VG+ CH GGNQ + +K +I D+ CLD A G V
Sbjct: 478 NVETEQCLDTMGRKSGEN--VGMSYCHGYGGNQVFSYTKRHQITADDNCLDAASVRGPVK 535
Query: 272 LYPCHGSKGNQYFEY 286
L CHG GNQ ++Y
Sbjct: 536 LVRCHGMGGNQEWKY 550
>gi|307204529|gb|EFN83209.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Harpegnathos
saltator]
Length = 605
Score = 170 bits (431), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 118/316 (37%), Positives = 156/316 (49%), Gaps = 50/316 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A + VV P+I I DDTFE P +T GGF+W L
Sbjct: 257 CECTEGWLEPLLSRIANDRHTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 309
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ + +RE +R+ + P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 310 FRWYRVAQREMDRRNSDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 369
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
F+ W E H +P T GG+ I + D W
Sbjct: 370 FRV-WQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKI--VLHNAARVAEVWMDEWRDFY 426
Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
G N+ D GDV+ R +LR L CKSF+WYLE V
Sbjct: 427 YAMNPGARNV------DVGDVSERVKLRERLKCKSFRWYLENIYPESPMPLDYYYLGDVK 480
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
N + C+D+ + T + VG+ CH GGNQ + +K +I D+ CLD A G V
Sbjct: 481 NVEAQTCLDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVK 538
Query: 272 LYPCHGSKGNQYFEYD 287
+ CHG GNQ + Y+
Sbjct: 539 IVRCHGMGGNQAWVYN 554
>gi|443703000|gb|ELU00789.1| hypothetical protein CAPTEDRAFT_190622 [Capitella teleta]
Length = 507
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 112/318 (35%), Positives = 166/318 (52%), Gaps = 41/318 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +WL+PL+ + + S ++ P+I I D + + S +GGF W+L
Sbjct: 154 CECNVQWLEPLVARIKESRSALLCPMIDVI--DAKAMSYNGIGAGS-----VGGFWWSLH 206
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+W +P+RERKR K++ E + +PTMAGGLF+ D+ +F ++G YD G D+WGGENLE+SF
Sbjct: 207 FSWRPLPQRERKRRKSSVETIRSPTMAGGLFAADRKYFFEIGGYDPGMDVWGGENLEISF 266
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSGFDIWGG 177
+ +P ++ P P K E + Y F
Sbjct: 267 RVWMCGGTLEFVPCSRVGHIFRSSHPYTFPGNKDTHGLNSKRLAEVWMDGYKRLFYHHRR 326
Query: 178 ENLELS--FKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWS 217
+ L ++ F D GD + R +LRR+L CKSFKWYLE V N S
Sbjct: 327 DLLVINPQFNADAGDFSDRLQLRRDLKCKSFKWYLENVYPEKFIPDENVIAYGMVRNPSS 386
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGG---NQFWMMSKHGEIRRDEACLDYAGGD---VI 271
+C+D+ K M +GLY C QGG NQ + +S+ E+RR+E+C+D GG+ V
Sbjct: 387 NLCLDTLSKDEKMVFNLGLYGC--QGGVSSNQLFSLSQSNELRREESCMDSVGGEGSPVK 444
Query: 272 LYPCHGSKGNQYFEYDYK 289
L PCHGS+G+Q + Y+ +
Sbjct: 445 LMPCHGSRGHQEWTYNLE 462
>gi|56756104|gb|AAW26230.1| SJCHGC09400 protein [Schistosoma japonicum]
Length = 737
Score = 170 bits (430), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 119/315 (37%), Positives = 154/315 (48%), Gaps = 56/315 (17%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL+PLLD +A NSS VV P+I I D T + P S + IGGFDW+L F WH
Sbjct: 346 WLEPLLDRIAYNSSIVVVPVITVINDKTLKYDLP-----SPSRVQIGGFDWSLSFIWHEQ 400
Query: 70 PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA 129
ER + R PV +PTMAGGLF+I + +F LG YD G ++WGGENLELSFK W
Sbjct: 401 TERHKNRPGAPYSPVQSPTMAGGLFAISREYFNHLGMYDPGMEVWGGENLELSFKI-WMC 459
Query: 130 IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD-------SGFDIWGGE---- 178
+ + + G +F + + D D+W +
Sbjct: 460 ----------GGSLEIVICSQVGHIFRDRSPYIWDVDVKDPLKRNLLRLADVWLDDYKRF 509
Query: 179 -NLELSFKG-DFGDVTSRKELRRNLGCKSFKWYLE--------VSNDWSGMCIDSACKPT 228
+ + F+ D G+V+ RK LR L C SF WYL S + I+SA P
Sbjct: 510 YHARIGFEMVDIGNVSERKALREKLKCHSFDWYLTNIYPELFVPSKALASGDIESAAGPH 569
Query: 229 DMHKP-----------VGLYPCHKQGGNQFWMMSKHGEIRRDEACLD-----YAGGDVIL 272
+ P + PCHKQGGNQFW++S EIRRD+ C D Y+ G L
Sbjct: 570 CLDAPLPSENDSSSVIIKTRPCHKQGGNQFWLLSSENEIRRDDYCFDSGIQKYSIG---L 626
Query: 273 YPCHGSKGNQYFEYD 287
Y CHGS GNQ F Y+
Sbjct: 627 YHCHGSHGNQEFTYE 641
>gi|449676829|ref|XP_002167311.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Hydra magnipapillata]
Length = 603
Score = 170 bits (430), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 105/313 (33%), Positives = 152/313 (48%), Gaps = 44/313 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE W +PLL + + +VV P+I I + F P F G F W L+
Sbjct: 260 CECTLGWAEPLLAKIKEDRQNVVMPVIDEISETNFNYNAVPE------PFQRGVFKWRLE 313
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP E +R K+ ++ + TP MAGGLFSI++ +F ++G+YD+G DIWGGEN+E+SF
Sbjct: 314 FTWRPIPSYEEQRRKHESDGIKTPVMAGGLFSINRDYFYEMGSYDTGMDIWGGENIEISF 373
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ + +P P P GG + ++ D+W E
Sbjct: 374 RIWMCGGSIEMLPCSRVGHVFRPRFPYSFPNRRGGDGDVVSRNLMRVA------DVWMDE 427
Query: 179 ------NLELSFK-GDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
N+ K DVT+R +LR L CKSF+WYL E+
Sbjct: 428 YAKHFYNIRFDLKRKKHDDVTARVKLRSKLQCKSFQWYLENVYPELEIPDDKFLAAGEIR 487
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILY 273
N SG+C+D+ K PVGLY CH QGGNQ++ + GEI+ ++ C+D+ G D+ +
Sbjct: 488 NPESGICLDTLGKQEG--APVGLYACHGQGGNQYYTYNNKGEIKAEDNCMDFNGHDLYIR 545
Query: 274 PCHGSKGNQYFEY 286
C G NQ + Y
Sbjct: 546 ECDGLGLNQKWTY 558
>gi|332025155|gb|EGI65335.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Acromyrmex
echinatior]
Length = 605
Score = 170 bits (430), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 117/315 (37%), Positives = 154/315 (48%), Gaps = 50/315 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A + VV P+I I DDTFE S+ GGF+W L
Sbjct: 257 CECTEGWLEPLLSRIANDRHTVVCPIIDVISDDTFEY-------ISASDMTWGGFNWKLN 309
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ + +RE +R+ + P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 310 FRWYRVAQREMDRRNSDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 369
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
F+ W E H +P T GG+ I + D W
Sbjct: 370 FRV-WQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKI--VLHNAARVAEVWMDEWRDFY 426
Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
G N+ D GDV+ R +LR L CKSF+WYLE V
Sbjct: 427 YAMNPGARNV------DVGDVSERIKLRERLKCKSFRWYLENIYPESPMPLDYYYLGDVK 480
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
N + C+D+ + T + VG+ CH GGNQ + +K +I D+ CLD A G V
Sbjct: 481 NIETQTCLDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAANPQGPVK 538
Query: 272 LYPCHGSKGNQYFEY 286
+ CHG GNQ + Y
Sbjct: 539 IVRCHGMGGNQAWVY 553
>gi|148694974|gb|EDL26921.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_b [Mus
musculus]
Length = 594
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 119/350 (34%), Positives = 160/350 (45%), Gaps = 82/350 (23%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 213 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 265
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 266 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 325
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 326 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 378
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 379 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 438
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGN------------------------------ 245
+ C+D+ + + + VG++ CH GGN
Sbjct: 439 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVHDLCLSAPSLGVGAEECCSNHPLYGLVY 496
Query: 246 ------QFWMMSKHGEIRRDEACLDYA--GGDVILYPCHGSKGNQYFEYD 287
Q + + EIR D+ CLD + G VI+ CH +GNQ +EYD
Sbjct: 497 TPTINEQVFSYTADKEIRTDDLCLDVSRLSGPVIMLKCHHMRGNQLWEYD 546
>gi|291238116|ref|XP_002738977.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 561
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 109/319 (34%), Positives = 156/319 (48%), Gaps = 53/319 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE K WL+PL+ +A + + VVSP+I +I D+TFE P + GGF+W L
Sbjct: 210 CECTKGWLEPLIARIAEDRTRVVSPVIDSISDETFEYNSVP-------ELGCGGFNWRLN 262
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ + +RE+KR K +A P+ TPTMAGGLFSI K +F ++GTYD G DIWGGENLE+S
Sbjct: 263 FRWYPMSKREKKRRKGDATIPINTPTMAGGLFSIHKEYFYRIGTYDEGMDIWGGENLEMS 322
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ +P P T GG+ ++ +L ++W
Sbjct: 323 FRIWMCGGTLEIVPCSHVGHVFRGKSPY---TFPGGVATVVHNNNRRLA------EVWMD 373
Query: 178 ENLELSFK-------GDFGDVTSRKELRRNLGCKSFKWYL------------------EV 212
E +K ++GD+ RK+LR L C SF+WYL EV
Sbjct: 374 EYKSFYYKTVPNARNAEYGDIEDRKQLREKLQCNSFRWYLENIFPDSQFLLDNYFRFCEV 433
Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG----G 268
N + C+D+ + L CH QGG+Q + SK E++ D+ CLD +
Sbjct: 434 RNMETKQCLDNMGQKE--KSKAALSRCHGQGGHQIYAWSKLNELKHDDLCLDASAPSGFK 491
Query: 269 DVILYPCHGSKGNQYFEYD 287
DV C+ G Q + Y+
Sbjct: 492 DVEQSRCNSHGGTQEWRYN 510
>gi|74215848|dbj|BAE28617.1| unnamed protein product [Mus musculus]
Length = 330
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 110/293 (37%), Positives = 149/293 (50%), Gaps = 46/293 (15%)
Query: 25 VVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERERKRHK-NAAEP 83
VV P+I I DDTFE + GGF+W L F W+ +P+RE R K + P
Sbjct: 4 VVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 56
Query: 84 VWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWH--AIPERERKRHKNA 141
V TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+SF+ W E H
Sbjct: 57 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRI-WQCGGTLEIVTCSHVGH 115
Query: 142 AEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF-------KGDFGDVTS 193
TP T GG I +L ++W E + K D+G+++S
Sbjct: 116 VFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEFKNFFYIISPGVTKVDYGNISS 169
Query: 194 RKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHKPVGL 236
R LRR L CK F WYL E+ N + C+D+ + + + VG+
Sbjct: 170 RLGLRRKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKEN--EKVGI 227
Query: 237 YPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPCHGSKGNQYFEYD 287
+ CH GGNQ + + + EIR D+ CLD + G V + CH KGNQ +EYD
Sbjct: 228 FNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYD 280
>gi|26332527|dbj|BAC29981.1| unnamed protein product [Mus musculus]
Length = 592
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 119/350 (34%), Positives = 160/350 (45%), Gaps = 82/350 (23%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG + +L ++W E
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ + K D+GDV+ RK LR NL CK F WYL E+ N
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGN------------------------------ 245
+ C+D+ + + + VG++ CH GGN
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVHDLCLSAPSLGVGAEECCSNHPLYGLVY 494
Query: 246 ------QFWMMSKHGEIRRDEACLDYA--GGDVILYPCHGSKGNQYFEYD 287
Q + + EIR D+ CLD + G VI+ CH +GNQ +EYD
Sbjct: 495 TPTINEQVFSYTADKEIRTDDLCLDVSRLSGPVIMLKCHHMRGNQLWEYD 544
>gi|307189895|gb|EFN74139.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Camponotus
floridanus]
Length = 608
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 118/316 (37%), Positives = 155/316 (49%), Gaps = 50/316 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A + VV P+I I DDTFE P +T GGF+W L
Sbjct: 260 CECTEGWLEPLLSRIANDRHTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 312
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ + +RE +R+ + P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 313 FRWYRVAQREMDRRNGDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 372
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
F+ W E H +P T GG+ I + D W
Sbjct: 373 FRV-WQCGGTLEISSCSHVGHVFRDKSPYTFPGGVSKI--VLHNAARVAEVWMDEWRDFY 429
Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
G N+ D GDV+ R +LR L CKSF+WYLE V
Sbjct: 430 YAMNPGARNV------DVGDVSERIKLRERLKCKSFRWYLENIYPESPMPLDYYYLGDVK 483
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
N C+D+ + T + VG+ CH GGNQ + +K +I D+ CLD A G V
Sbjct: 484 NVEMQTCLDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVK 541
Query: 272 LYPCHGSKGNQYFEYD 287
+ CHG GNQ + Y+
Sbjct: 542 IVRCHGMGGNQAWVYN 557
>gi|350402574|ref|XP_003486532.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 2 [Bombus impatiens]
Length = 606
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 119/316 (37%), Positives = 157/316 (49%), Gaps = 50/316 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A + + VV P+I I DDTFE P +T GGF+W L
Sbjct: 258 CECTEGWLEPLLSRIAEDRTTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 310
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ + +RE +R + P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 311 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMS 370
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
F+ W E H TP T GG I +L + D W
Sbjct: 371 FRI-WMCGGTLEIATCSHVGHVFRKSTPYTFPGGTSKIVNHNNARLA--EVWLDQWKYFY 427
Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
G N+ + GDV+ R +LR L CKSF+WYLE V
Sbjct: 428 YNINPGARNVAV------GDVSERIKLRERLKCKSFRWYLENIYPESPMPLDYYYLGDVQ 481
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
N + C+D+ + T + VG+ CH GGNQ + +K +I D+ CLD A G V
Sbjct: 482 NVETQSCLDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVK 539
Query: 272 LYPCHGSKGNQYFEYD 287
+ CHG GNQ + Y+
Sbjct: 540 IVRCHGMGGNQAWVYN 555
>gi|391343213|ref|XP_003745907.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Metaseiulus occidentalis]
Length = 583
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A +++ VV P+I I D+ F P T GGF+W L
Sbjct: 232 CECTEGWLEPLLARIAEDNTRVVCPVIDVISDENFAY-VPASDQT------WGGFNWKLN 284
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE +R + PV TPTMAGGLF++DKA+FEKLG YD G DIWGGENLE+S
Sbjct: 285 FRWYRVPQRENDRRGGDRTLPVRTPTMAGGLFAMDKAYFEKLGKYDEGMDIWGGENLEMS 344
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L D+W E
Sbjct: 345 FRI-WMCGGTLEIVTCSHVGHVFRKSTPYTFPGGTGKIVNHNNARLA------DVWLDEW 397
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ F K D GD + R +LR++L CKSF+WYL E+ N
Sbjct: 398 KDFYFAINPVAKKVDRGDTSGRHKLRQDLQCKSFRWYLENIYPESHMPLDYYHLGEIKNA 457
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+C+D+ K + +G CH GGNQ + +K +I D++CLD + G V L+
Sbjct: 458 DGNLCLDTYGKKSGDVLYMG--KCHGLGGNQVFAYTKRQQIMADDSCLDASSPSGPVKLF 515
Query: 274 PCHGSKGNQYFEYD 287
CH GNQ + YD
Sbjct: 516 RCHNMGGNQMWTYD 529
>gi|383865231|ref|XP_003708078.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Megachile rotundata]
Length = 605
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 118/316 (37%), Positives = 156/316 (49%), Gaps = 50/316 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A N S VV P+I I DDTFE P +T GGF+W L
Sbjct: 257 CECTEGWLEPLLARIAENRSTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 309
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ + +RE +R + P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 310 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 369
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
F+ W E H +P T GG+ + + D W
Sbjct: 370 FRV-WQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKV--VLHNAARVAEVWMDEWRDFY 426
Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
G N+ + GDV+ R +LR L CKSF+WYLE V
Sbjct: 427 YAMNPGARNVAV------GDVSERIKLRERLKCKSFRWYLENIYPESPMPLDYYYLGDVQ 480
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
N + C+D+ + T + VG+ CH GGNQ + +K +I D+ CLD A G V
Sbjct: 481 NIDTQTCLDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVK 538
Query: 272 LYPCHGSKGNQYFEYD 287
+ CHG GNQ + Y+
Sbjct: 539 IVRCHGMGGNQAWVYN 554
>gi|341900678|gb|EGT56613.1| CBN-GLY-3 protein [Caenorhabditis brenneri]
Length = 613
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 120/335 (35%), Positives = 165/335 (49%), Gaps = 83/335 (24%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV + WL+PL+ +A + VV+P+I I DDTFE +T+S + GGF+W+L
Sbjct: 267 VEVTEGWLEPLISRVAEDRKRVVAPIIDVISDDTFEY------VTASETTW-GGFNWHLN 319
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+++P+RE +R + + P+ TPT+AGGLF+IDK FF +G+YD G +WGGENLE+S
Sbjct: 320 FRWYSVPKRELNRRGSDRSMPIQTPTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEIS 379
Query: 123 FKFNWHAIPERE-----------RKR-------------HKNAAEP--VWTPTMAGGLFS 156
F+ W E RK+ H NAA VW
Sbjct: 380 FRV-WMCGGSLEIHPCSRVGHVFRKQTPYTFPGGTAKVIHHNAARTAEVWMDEY------ 432
Query: 157 IDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE----- 211
KAFF K+ + N+E GDVT RK+LR L CKSFKWYLE
Sbjct: 433 --KAFFYKM--------VPAARNVEA------GDVTERKKLRETLQCKSFKWYLENIYPE 476
Query: 212 ------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD 259
+ N ++ C+D+ K +P G+ CH GGNQ W ++ GEIR D
Sbjct: 477 APLPADFRSLGAIVNRFTEKCVDTNGKKDG--QPPGMQACHGAGGNQAWSLTGKGEIRSD 534
Query: 260 EACLDYA-----GGDVILYPCHGSKGN--QYFEYD 287
+ CL G ++ L C SK N F +D
Sbjct: 535 DLCLSSGHVYQIGSELKLERCSVSKINPKHVFTFD 569
>gi|268575444|ref|XP_002642701.1| C. briggsae CBR-GLY-3 protein [Caenorhabditis briggsae]
Length = 611
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 123/337 (36%), Positives = 166/337 (49%), Gaps = 83/337 (24%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV WL+PL+ +A + VV+P+I I DDTFE +T+S + GGF+W+L
Sbjct: 267 VEVTDGWLEPLVHRVAEDRKRVVAPIIDVISDDTFEY------VTASETTW-GGFNWHLN 319
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+A+P+RE +R + + P+ TPT+AGGLF+IDK FF +G+YD G +WGGENLE+S
Sbjct: 320 FRWYAVPKRELNRRGSDRSMPIQTPTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEIS 379
Query: 123 FKFNWHAIPERE-----------RKR-------------HKNAAEP--VWTPTMAGGLFS 156
F+ W E RK+ H NAA VW
Sbjct: 380 FRV-WMCGGSLEIHPCSRVGHVFRKQTPYTFPGGTAKVIHHNAARTAEVWMDEY------ 432
Query: 157 IDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE----- 211
KAFF K+ + +N+E GDVT RK+LR L CKSFKWYLE
Sbjct: 433 --KAFFYKM--------VPAAKNVEA------GDVTDRKKLRETLQCKSFKWYLENIYPE 476
Query: 212 ------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD 259
+ N ++ CID+ K D P G+ CH GGNQ W ++ GEIR D
Sbjct: 477 APLPADFRSLGSIVNRFTEKCIDTNGK-KDGQAP-GMQACHGAGGNQAWSLTGKGEIRSD 534
Query: 260 EACLDYA-----GGDVILYPCHGSKGN--QYFEYDYK 289
+ CL G ++ L C SK N F +D +
Sbjct: 535 DLCLSSGHVYQIGSELKLERCSVSKLNPKHIFAFDAQ 571
>gi|158293352|ref|XP_314708.4| AGAP008613-PA [Anopheles gambiae str. PEST]
gi|157016664|gb|EAA10180.4| AGAP008613-PA [Anopheles gambiae str. PEST]
Length = 596
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 113/310 (36%), Positives = 162/310 (52%), Gaps = 38/310 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + + VV P+I I D+TFE +T+S + + GGF+W L
Sbjct: 239 CECTEGWLEPLLARIVLDRKTVVCPIIDVISDETFEY------VTASDQTW-GGFNWKLN 291
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P RE ++R+ + P+ TPTMAGGLFSID+ +F ++G+YD G DIWGGENLE+S
Sbjct: 292 FRWYRVPAREMQRRNHDRTAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMS 351
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W I E H +P T GG+ +I + D W
Sbjct: 352 FRI-WQCGGILEISPCSHVGHVFRDKSPYTFPGGVANI--VLKNAARVAEVWLDEWKEFY 408
Query: 180 LELS---FKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
++S K GDV+ R+ LR L CKSF+WYL E+ N +
Sbjct: 409 YQMSPGARKASAGDVSERRALRERLKCKSFRWYLENIYPESQMPLDYYFLGEIRNVKTHN 468
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
C+D+ + ++ + +G CH GGNQ + +K +I D+ CLD + G V L CHG
Sbjct: 469 CLDTMGRKSN--EKIGSSYCHGLGGNQVFAYTKRHQIMSDDNCLDASNALGPVNLVRCHG 526
Query: 278 SKGNQYFEYD 287
GNQ + YD
Sbjct: 527 MGGNQEWIYD 536
>gi|116007284|ref|NP_001036338.1| polypeptide GalNAc transferase 5, isoform B [Drosophila
melanogaster]
gi|113194958|gb|ABI31292.1| polypeptide GalNAc transferase 5, isoform B [Drosophila
melanogaster]
Length = 630
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 117/317 (36%), Positives = 158/317 (49%), Gaps = 52/317 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + +N VV P+I I D+TFE +T+S + GGF+W L
Sbjct: 283 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 335
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P RE R N P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 336 FRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 395
Query: 123 FKFNWHA-----IPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
F+ W I R H TP T GG I +L ++W
Sbjct: 396 FRV-WMCGGVLEIAPCSRVGHVFRKS---TPYTFPGGTTEIVNHNNARL------VEVWL 445
Query: 177 GENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EV 212
+ E + K GDV+ RK LR L CKSF+WYL E+
Sbjct: 446 DDWKEFYYSFYPGARKASAGDVSDRKALRDRLKCKSFRWYLENVYPESLMPLDYYYLGEI 505
Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDV 270
N + C+D+ + ++ VG+ CH GGNQ + +K +I D+ CLD + G V
Sbjct: 506 RNAETETCLDTMGRK--YNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDASSSNGPV 563
Query: 271 ILYPCHGSKGNQYFEYD 287
+ CH GNQ + YD
Sbjct: 564 NMVRCHNMGGNQEWVYD 580
>gi|170592315|ref|XP_001900914.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Brugia malayi]
gi|158591609|gb|EDP30214.1| Polypeptide N-acetylgalactosaminyltransferase 3, putative [Brugia
malayi]
Length = 584
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 107/309 (34%), Positives = 154/309 (49%), Gaps = 34/309 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV + WL+PLLD ++ + VV+P+I I D+ FE ++ GGF+W+L
Sbjct: 242 VEVTEGWLEPLLDRVSTDRKRVVAPIIDVISDENFEY-------ITASDVTWGGFNWHLN 294
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P RE +R+ + + P+ TPT+AGGLF+ID+ FF +G+YD G +IWGGENLE+S
Sbjct: 295 FRWYPVPMREMERRNHDRSVPLQTPTIAGGLFAIDRQFFYDIGSYDEGMEIWGGENLEIS 354
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + P P P + + A ++ D DI+ G
Sbjct: 355 FRVWMCGGSLEIHPCSRVGHVFRKHTPYSFPGGTARVIHHNAARTAEVWM-DEYKDIFYG 413
Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSNDWSGMC 220
+ + D GD+T RK LR NL CKSF+WYLE V N C
Sbjct: 414 -MVPAAKNVDVGDLTERKILRENLQCKSFRWYLETIYPESPIPIDFFSLGQVQNMGVMEC 472
Query: 221 IDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKG 280
+D+A + + PCH +GGNQ W + GEIR DE CL + V + C GS
Sbjct: 473 LDTAGRSAG--DSPAMLPCHGKGGNQLWTYTGKGEIRSDELCLAFTTKGVSMEKCTGSVP 530
Query: 281 NQYFEYDYK 289
+DY+
Sbjct: 531 LSKMIFDYE 539
>gi|443683126|gb|ELT87494.1| hypothetical protein CAPTEDRAFT_198873 [Capitella teleta]
Length = 495
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 106/316 (33%), Positives = 155/316 (49%), Gaps = 50/316 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + +N VV P+I I D+TFE + GGF+W L
Sbjct: 153 CECTEGWLEPLLFEIHKNRKSVVCPIIDVISDETFEY-------ITGSDMTWGGFNWKLN 205
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE +R + + P+ +PTMAGGL +I++ +F ++G+YD G DIWGGENLE+S
Sbjct: 206 FRWYPVPQREVERRGGDRSLPLRSPTMAGGLLAIERDYFYEIGSYDDGMDIWGGENLEMS 265
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + A P P G + + + A ++W
Sbjct: 266 FRIWMCGGTLLIVTCSHVGHVFRKATPYTFPGGTGRIINHNNARLA---------EVWMD 316
Query: 178 ENLELSFK-------GDFGDVTSRKELRRNLGCKSFKWYL-----------------EVS 213
E +K D+GD++ R +LR L CKSF+WYL E+
Sbjct: 317 EWRSFYYKINPGVKQTDYGDLSPRIQLREKLECKSFRWYLQNIYPESQMPLDYYSLGEIR 376
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
N + C+DS + + VG+ CH GGNQ + SK + D+ CLD + G V
Sbjct: 377 NKETNQCLDSMGRKAG--EKVGIVGCHGMGGNQIFSYSKKKAFQTDDLCLDVSALTGPVK 434
Query: 272 LYPCHGSKGNQYFEYD 287
LY CHG GNQ +E+D
Sbjct: 435 LYQCHGLGGNQLWEHD 450
>gi|390347277|ref|XP_780324.3| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 580
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 105/295 (35%), Positives = 152/295 (51%), Gaps = 48/295 (16%)
Query: 24 HVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERER-KRHKNAAE 82
+VV P+I I DD F + GGF+W LQF W+ +P+RE +R +
Sbjct: 252 NVVCPIIDVISDDNFAFH-------TGSDMTYGGFNWKLQFRWYPVPQREADRRGGDRTI 304
Query: 83 PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWH--AIPERERKRHKN 140
P+ +PTMAGGLFSIDK +FE++GTYD+G D+WGGENLE+SF+ W E H
Sbjct: 305 PLRSPTMAGGLFSIDKTYFEEIGTYDAGMDVWGGENLEISFRI-WMCGGTLEIVTCSHVG 363
Query: 141 AAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF-------KGDFGDVT 192
TP T GG I ++L ++W + + K +FGDV+
Sbjct: 364 HVFRKSTPYTFPGGTGRIINRNNQRLA------EVWMDDFRHFYYRISPGVRKTEFGDVS 417
Query: 193 SRKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHKPVG 235
RK+LR L C +F+WYL E+ N + C+D+ + + + VG
Sbjct: 418 QRKKLRDRLKCHTFEWYLENIYPESQFRLDFKTIGEIRNIETHKCLDNMGRKEN--EKVG 475
Query: 236 LYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGG----DVILYPCHGSKGNQYFEY 286
++ CH QGGNQ + ++K EI+ D+ CLD + DV++ CHG GNQ + Y
Sbjct: 476 IFSCHGQGGNQIFALTKQNEIKHDDLCLDASANSHYKDVVMIKCHGKHGNQEWLY 530
>gi|344237432|gb|EGV93535.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Cricetulus
griseus]
Length = 413
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 109/293 (37%), Positives = 148/293 (50%), Gaps = 46/293 (15%)
Query: 25 VVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERERKRHK-NAAEP 83
VV P+I I DDTFE + GGF+W L F W+ +P+RE R K + P
Sbjct: 87 VVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 139
Query: 84 VWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWH--AIPERERKRHKNA 141
V TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+SF+ W E H
Sbjct: 140 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRI-WQCGGTLEIVTCSHVGH 198
Query: 142 AEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF-------KGDFGDVTS 193
TP T GG I +L ++W E + K D+G+++S
Sbjct: 199 VFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEFKNFFYIISPGFTKVDYGEISS 252
Query: 194 RKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHKPVGL 236
R LR L CK F WYL E+ N + C+D+ + + + VG+
Sbjct: 253 RLGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKEN--EKVGI 310
Query: 237 YPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPCHGSKGNQYFEYD 287
+ CH GGNQ + + + EIR D+ CLD + G V + CH KGNQ +EYD
Sbjct: 311 FNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYD 363
>gi|380030098|ref|XP_003698695.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Apis florea]
Length = 605
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 116/316 (36%), Positives = 157/316 (49%), Gaps = 50/316 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A + + VV P+I I DDTFE P +T GGF+W L
Sbjct: 257 CECTEGWLEPLLSRIAEDRTTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 309
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ + +RE +R + P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 310 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 369
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
F+ W E H +P T GG+ + + D W
Sbjct: 370 FRV-WQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKV--VLHNAARVAEVWMDEWRDFY 426
Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
G N+ + GDV+ R +LR+ L CKSF+WYLE V
Sbjct: 427 YAMNPGARNVAV------GDVSERIKLRQRLKCKSFRWYLENIYPESPMPLDYYYLGDVQ 480
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
N + C+D+ + T + VG+ CH GGNQ + +K +I D+ CLD A G V
Sbjct: 481 NVDTQTCLDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVK 538
Query: 272 LYPCHGSKGNQYFEYD 287
+ CHG GNQ + Y+
Sbjct: 539 IVRCHGMGGNQAWVYN 554
>gi|402592820|gb|EJW86747.1| hypothetical protein WUBG_02341 [Wuchereria bancrofti]
Length = 584
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 112/330 (33%), Positives = 157/330 (47%), Gaps = 76/330 (23%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV + WL+PLLD ++ + VV+P+I I D+ FE +T+S + GGF+W+L
Sbjct: 242 VEVTEGWLEPLLDRVSTDRKRVVAPIIDVISDENFEY------ITASDVTW-GGFNWHLN 294
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P RE +R+ + + P+ TPT+AGGLF+ID+ FF +G+YD G ++WGGENLE+S
Sbjct: 295 FRWYPVPMREMERRNHDRSVPLQTPTIAGGLFAIDRQFFYDIGSYDEGMEVWGGENLEIS 354
Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD-------- 169
F+ VW M GG I F K Y
Sbjct: 355 FR--------------------VW---MCGGSLEIHPCSRVGHVFRKHTPYSFPGGTARV 391
Query: 170 ------SGFDIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE----- 211
++W E ++ + D GD+T RK LR NL CKSF+WYLE
Sbjct: 392 IHHNTARTAEVWMDEYKDIFYSMVPAARNVDVGDLTERKILRENLQCKSFRWYLETIYPE 451
Query: 212 ------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD 259
V N C+D+A + + PCH QGGNQ W + GEIR D
Sbjct: 452 SPIPIDFFSLGQVQNMGVMECLDTAGRSAG--DSPAMLPCHGQGGNQLWTYTGKGEIRSD 509
Query: 260 EACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
E CL + V + C GS +DY+
Sbjct: 510 ELCLAFTTKGVGMEKCIGSVPLSKMIFDYE 539
>gi|48143331|ref|XP_397422.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Apis mellifera]
Length = 606
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 116/316 (36%), Positives = 156/316 (49%), Gaps = 50/316 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A + + VV P+I I DDTFE P +T GGF+W L
Sbjct: 258 CECTEGWLEPLLSRIAEDRTTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 310
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ + +RE +R + P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 311 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 370
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
F+ W E H +P T GG+ + + D W
Sbjct: 371 FRV-WQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKV--VLHNAARVAEVWMDEWRDFY 427
Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
G N+ + GDV+ R +LR L CKSF+WYLE V
Sbjct: 428 YAMNPGARNVAV------GDVSERIKLRERLKCKSFRWYLENIYPESPMPLDYYYLGDVQ 481
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
N + C+D+ + T + VG+ CH GGNQ + +K +I D+ CLD A G V
Sbjct: 482 NVDTQTCLDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVK 539
Query: 272 LYPCHGSKGNQYFEYD 287
+ CHG GNQ + Y+
Sbjct: 540 IVRCHGMGGNQAWVYN 555
>gi|350402571|ref|XP_003486531.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 1 [Bombus impatiens]
Length = 606
Score = 166 bits (421), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 116/316 (36%), Positives = 156/316 (49%), Gaps = 50/316 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A + + VV P+I I DDTFE P +T GGF+W L
Sbjct: 258 CECTEGWLEPLLSRIAEDRTTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 310
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ + +RE +R + P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 311 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMS 370
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
F+ W E H +P T GG+ + + D W
Sbjct: 371 FRV-WQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKV--VLHNAARVAEVWMDEWRDFY 427
Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
G N+ + GDV+ R +LR L CKSF+WYLE V
Sbjct: 428 YAMNPGARNVAV------GDVSERIKLRERLKCKSFRWYLENIYPESPMPLDYYYLGDVQ 481
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
N + C+D+ + T + VG+ CH GGNQ + +K +I D+ CLD A G V
Sbjct: 482 NVETQSCLDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVK 539
Query: 272 LYPCHGSKGNQYFEYD 287
+ CHG GNQ + Y+
Sbjct: 540 IVRCHGMGGNQAWVYN 555
>gi|340712006|ref|XP_003394556.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 1 [Bombus terrestris]
gi|340712008|ref|XP_003394557.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 2 [Bombus terrestris]
Length = 606
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 115/314 (36%), Positives = 157/314 (50%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A + + VV P+I I DDTFE P +T GGF+W L
Sbjct: 258 CECTEGWLEPLLSRIAEDRTTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 310
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ + +RE +R + P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 311 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMS 370
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H +P T GG+ + L ++W E
Sbjct: 371 FRV-WQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKV------VLHNAARVAEVWMDEW 423
Query: 180 LELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-----------------VSND 215
+ + + GDV+ R +LR L CKSF+WYLE V N
Sbjct: 424 RDFYYAMNPGARSVAVGDVSERIKLRERLKCKSFRWYLENIYPESPMPLDYFYLGDVQNV 483
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILY 273
+ C+D+ + T + VG+ CH GGNQ + +K +I D+ CLD A G V +
Sbjct: 484 ETQSCLDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVKIV 541
Query: 274 PCHGSKGNQYFEYD 287
CHG GNQ + Y+
Sbjct: 542 RCHGMGGNQAWVYN 555
>gi|427796213|gb|JAA63558.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Rhipicephalus pulchellus]
Length = 621
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 112/307 (36%), Positives = 153/307 (49%), Gaps = 32/307 (10%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A + + VV P+I I D+TFE S+ GGF+W L
Sbjct: 274 CECTQHWLEPLLARIAEDRTRVVCPVIDVISDETFEY-------ISASDMTWGGFNWKLN 326
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE +R + P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLELS
Sbjct: 327 FRWYRVPQREVERRGGDRTLPIRTPTMAGGLFSIDKDYFNELGKYDEGMDIWGGENLELS 386
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ +P + P P + + + A ++ D D +
Sbjct: 387 FRIWMCGGELEIVPCSHVGHVFRKSTPYSFPGGTSRIVNHNNARLAEVW-LDEWKDFYFA 445
Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCID------------SAC 225
N + D GD++ RK+LR L C +F+WYLE S M +D S C
Sbjct: 446 IN-PAAKNVDKGDLSYRKQLRTKLKCNTFRWYLENIYPESHMPLDYYHLGEIKHADTSDC 504
Query: 226 KPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHGSKG 280
T K V + CH GGNQ + +K +I D+ CLD + G V L CHG G
Sbjct: 505 LDTFGRKSGENVAVSKCHGMGGNQVFAYTKRQQIMSDDNCLDASSPRGPVKLLRCHGMGG 564
Query: 281 NQYFEYD 287
NQ + Y+
Sbjct: 565 NQLWIYN 571
>gi|147907290|ref|NP_001085038.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Xenopus
laevis]
gi|47506925|gb|AAH71009.1| MGC81150 protein [Xenopus laevis]
Length = 582
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 109/283 (38%), Positives = 142/283 (50%), Gaps = 46/283 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + N + VV P+I I +TFE G + IGGFDW L
Sbjct: 234 CECVTGWLEPLLERIGENETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 287
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WHA+PE+ER+R K+ +P+ +PTMAGGLF++ K +FE LGTYD G ++WGGENLELSF
Sbjct: 288 FQWHAVPEKERQRRKSRIDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMEVWGGENLELSF 347
Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E E H P P L ++W E
Sbjct: 348 RV-WQCGGTLEIEPCSHVGHVFPKKAPYARPNF----------LQNTARAAEVWMDGYKE 396
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
L + K ++GD++ RK LR L CKSF WYL+ + D W G M
Sbjct: 397 LFYNRNPPAQKENYGDISERKLLRERLQCKSFDWYLKKVFPELHIPEDRPGWHGAVRSMG 456
Query: 221 IDSACKPTDM--HKPVG----LYPCHKQGGNQFWMMSKHGEIR 257
I S C + H P G L+ CH QGGNQF+ + EIR
Sbjct: 457 ISSECLDYNAPEHNPTGAHLSLFGCHGQGGNQFFEYTTKREIR 499
>gi|17553814|ref|NP_498722.1| Protein GLY-3 [Caenorhabditis elegans]
gi|21264486|sp|P34678.2|GALT3_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
AltName: Full=GalNAc-T1; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 3; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 3; Short=pp-GaNTase 3
gi|3047187|gb|AAC13669.1| GLY3 [Caenorhabditis elegans]
gi|351020565|emb|CCD62541.1| Protein GLY-3 [Caenorhabditis elegans]
Length = 612
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 119/327 (36%), Positives = 161/327 (49%), Gaps = 81/327 (24%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV WL+PL+ +A + VV+P+I I DDTFE +T+S + GGF+W+L
Sbjct: 266 VEVTDGWLEPLVSRVAEDRKRVVAPIIDVISDDTFEY------VTASETTW-GGFNWHLN 318
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+A+P+RE +R + + P+ TPT+AGGLF+IDK FF +G+YD G +WGGENLE+S
Sbjct: 319 FRWYAVPKRELNRRGSDRSMPIQTPTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEIS 378
Query: 123 FKFNWHAIPERE-----------RKR-------------HKNAAEP--VWTPTMAGGLFS 156
F+ W E RK+ H NAA VW
Sbjct: 379 FRV-WMCGGSLEIHPCSRVGHVFRKQTPYTFPGGTAKVIHHNAARTAEVWMDEY------ 431
Query: 157 IDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE----- 211
KAFF K+ + N+E GDV+ RK+LR L CKSFKWYLE
Sbjct: 432 --KAFFYKM--------VPAARNVEA------GDVSERKKLRETLQCKSFKWYLENIYPE 475
Query: 212 ------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD 259
+ N ++ C+D+ K D P G+ CH GGNQ W ++ GEIR D
Sbjct: 476 APLPADFRSLGAIVNRFTEKCVDTNGK-KDGQAP-GIQACHGAGGNQAWSLTGKGEIRSD 533
Query: 260 EACLDYA-----GGDVILYPCHGSKGN 281
+ CL G ++ L C SK N
Sbjct: 534 DLCLSSGHVYQIGSELKLERCSVSKIN 560
>gi|196001849|ref|XP_002110792.1| hypothetical protein TRIADDRAFT_22976 [Trichoplax adhaerens]
gi|190586743|gb|EDV26796.1| hypothetical protein TRIADDRAFT_22976 [Trichoplax adhaerens]
Length = 515
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 105/319 (32%), Positives = 152/319 (47%), Gaps = 53/319 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + + S VV P I I D+ F ++ P L G F+W+L
Sbjct: 165 CEANTGWLEPLLERIYNDRSTVVCPEIDVISDENFAYQYGPSGLMR------GIFNWDLH 218
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W A+ E+KR ++ +PV TPTMAGGLF+I++ +F+++GTYD DIWGGENLE+SF
Sbjct: 219 FRWRAVSTEEQKRRQSPIDPVRTPTMAGGLFAINRDYFKEIGTYDEEMDIWGGENLEISF 278
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF-DIWGG 177
+ +P ++P P K + LG ++W
Sbjct: 279 RIWQCGGTLEIVPCSHVGHVFRKSQPYGFP----------KGVVDTLGKNSQRVAEVWMD 328
Query: 178 ENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE------------------V 212
E ++ +GD++ R E+R+ L CKSFKWYLE V
Sbjct: 329 GYKEFFYQRQPHLRGHAYGDISKRLEIRKKLKCKSFKWYLENIYTDAVLPNESVIAKGKV 388
Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GG 268
N S MC+DS +P + +GL PC + E+ + CLD + G
Sbjct: 389 RNPASNMCLDSLSRPKLSY--IGLSPCTLSAMTMIISFTVRQELVVQDICLDVSDYNPGT 446
Query: 269 DVILYPCHGSKGNQYFEYD 287
V LY CHG KGNQ + ++
Sbjct: 447 KVQLYECHGMKGNQLWMHE 465
>gi|358341053|dbj|GAA48824.1| polypeptide N-acetylgalactosaminyltransferase [Clonorchis sinensis]
Length = 424
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 112/309 (36%), Positives = 158/309 (51%), Gaps = 36/309 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A +S VVSP I I + TFE F PG + G FDW L
Sbjct: 67 CEATTGWLEPLLHQIALDSHRVVSPSIDVIQESTFE--FVPGAPNT-----WGYFDWRLS 119
Query: 64 FNWHAIPERERKR-HKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F+W ERER R H + P+ TPTMAGGLFSI KAFFE+LGTYD G +WGGEN+E+S
Sbjct: 120 FHWGQATERERARTHGDPNIPLRTPTMAGGLFSISKAFFEELGTYDEGMVVWGGENVEMS 179
Query: 123 FKFNWHAIPE---RERKRHKNAAEPVWTPTMAGGLFSI-DKAFFEKLGTYDSGFDIWGGE 178
+ W E R + V + GG+ + + + ++ +
Sbjct: 180 LRV-WQCGGELLILPCSRVGHVFRKVSPYSWPGGVSHVLSRNAMRTALVWMDDHKLFYLK 238
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGMCI 221
+ + D+GD++ R+ LR+ L CKSF+WYL E+ ++ SG+C+
Sbjct: 239 SSPDAVHTDYGDISERQALRKRLRCKSFRWYLENVDVESVFPVDFHGIGEIRHESSGLCL 298
Query: 222 DSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVILYPC-HG 277
D+ + H PVGL CH QGGNQ ++ + GEI+ + C+ D ++ PC
Sbjct: 299 DTLGQ--KQHGPVGLSSCHGQGGNQLFVWTTKGEIQAEVGCVSPTDDGDTPLLFKPCLRL 356
Query: 278 SKGNQYFEY 286
G Q F+Y
Sbjct: 357 DTGPQLFDY 365
>gi|327282475|ref|XP_003225968.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Anolis carolinensis]
Length = 583
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 109/285 (38%), Positives = 143/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N S ++ P+I I +TFE PG + IGGFDW L
Sbjct: 235 CECVPGWLEPLLQRVAENESVIICPVIDTIDWNTFEFYMQPG------EPMIGGFDWRLT 288
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER+R K+ +P+ +PTMAGGLF++ K +FE LGTYD G D+WGGENLELSF
Sbjct: 289 FQWHSVPDYERQRRKSKVDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMDVWGGENLELSF 348
Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W I E H P P L ++W + E
Sbjct: 349 RV-WQCGGILEIHPCSHVGHVFPKRAPYARPNF----------LQNTARAAEVWMDDYKE 397
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
+ K +FGD++ RK LR+ L C +F WYL+ V D W G M
Sbjct: 398 HFYNRNPPARKENFGDLSERKLLRKKLQCNNFDWYLKNIFPNLHVPEDRPGWHGAIRSMG 457
Query: 221 IDSAC--------KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR 257
I S C PT H V L+ CH QGGNQF+ + + EIR
Sbjct: 458 ISSECLDYNSPEHNPTGAH--VSLFGCHGQGGNQFFEYTVNQEIR 500
>gi|395823173|ref|XP_003804166.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 1 [Otolemur garnettii]
Length = 539
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 109/299 (36%), Positives = 149/299 (49%), Gaps = 52/299 (17%)
Query: 25 VVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERERKRHK-NAAEP 83
VV P+I I DDTFE + GGF+W L F W+ +P+RE R K + P
Sbjct: 207 VVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 259
Query: 84 VWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA-----IPERERKRH 138
V TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+SF+ W I
Sbjct: 260 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRI-WQCGGTLEIVTCSXXXX 318
Query: 139 KNAAEPVW---TP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF-------KGD 187
V+ TP T GG I +L ++W E + K D
Sbjct: 319 XXXVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEFKNFFYIISPGVTKVD 372
Query: 188 FGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDM 230
+GD++SR LR L C+ F WYL E+ N + C+D+ + +
Sbjct: 373 YGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKEN- 431
Query: 231 HKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPCHGSKGNQYFEYD 287
+ VG++ CH GGNQ + + + EIR D+ CLD + G V + CH KGNQ +EYD
Sbjct: 432 -EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYD 489
>gi|355753170|gb|EHH57216.1| Polypeptide N-acetylgalactosaminyltransferase 12, partial [Macaca
fascicularis]
Length = 542
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 114/298 (38%), Positives = 153/298 (51%), Gaps = 48/298 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + S VV P+I I +TFE L +S + IGGFDW L
Sbjct: 200 CECHEGWLEPLLQRIHEEESAVVCPVIDVIDWNTFEY------LGNSGEPQIGGFDWRLV 253
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH +PERER R ++ + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 254 FTWHTVPERERIRMRSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 313
Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W + E H P P +S +KA L ++W E E
Sbjct: 314 RI-WQCGGVLETHPCSHVGHVFPKQAP------YSRNKA----LANSVRAAEVWMDEFKE 362
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSGM----CIDS 223
L + + FGDVT RK+LR L CK FKW+LE V D G C D
Sbjct: 363 LYYHRNPHARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHVPEDRPGFFGMYCFDY 422
Query: 224 ACKPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR----RDEACLDY-AGGDVIL 272
P D ++ VG LY CH G NQF+ + EIR + E C+ AG D+++
Sbjct: 423 --NPPDENQIVGHQVILYVCHGMGHNQFFEYTSQKEIRYNTHQPEGCIAVEAGMDILI 478
>gi|156397426|ref|XP_001637892.1| predicted protein [Nematostella vectensis]
gi|156225008|gb|EDO45829.1| predicted protein [Nematostella vectensis]
Length = 513
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 113/320 (35%), Positives = 153/320 (47%), Gaps = 52/320 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE W +PLL +A + +VV P I I DTF + G + + GGF W+L
Sbjct: 163 CEATPGWAEPLLARIAADRRNVVCPAIEVINADTFAYQ---GSTNADQR---GGFSWDLF 216
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP E+K + ++P+ TPTMAGGLFSI + +F +G+YD DIWGGENLELSF
Sbjct: 217 FKWKGIPPEEQKLRNDDSDPIRTPTMAGGLFSIHRQYFFDIGSYDEEMDIWGGENLELSF 276
Query: 124 KFNWHA-----IPERERKRH--KNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
+ W I R H + P P G+ F +L ++W
Sbjct: 277 RV-WMCGGRLEIVTCSRVGHVFRKYTSPYKFP---DGVERTLTKNFNRLA------EVWM 326
Query: 177 GENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL------------------E 211
E +L + D+GD++ R ELR+ L CKSFKWY+ E
Sbjct: 327 DEYKDLYYNKKPQAKNSDYGDISKRLELRKRLKCKSFKWYINNIYPDVQMPELDPPARGE 386
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD----YAG 267
V N S C+DS + + VG+Y CH QGGNQ I +E C D + G
Sbjct: 387 VRNPSSNQCLDSLGAKPEHNARVGIYTCHGQGGNQVSKYMPRELIFEEENCFDVSKTHPG 446
Query: 268 GDVILYPCHGSKGNQYFEYD 287
V L CHG +GNQ +++D
Sbjct: 447 APVELMKCHGMRGNQEWKHD 466
>gi|196001853|ref|XP_002110794.1| hypothetical protein TRIADDRAFT_23130 [Trichoplax adhaerens]
gi|190586745|gb|EDV26798.1| hypothetical protein TRIADDRAFT_23130 [Trichoplax adhaerens]
Length = 536
Score = 164 bits (415), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 107/311 (34%), Positives = 150/311 (48%), Gaps = 38/311 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL+PLLD + +N S VV P I I D TF+ R S G F+W+++
Sbjct: 185 CEVTIGWLEPLLDRVHQNRSVVVCPEIDVIDDKTFQYR------AGSSGDIRGVFNWDMK 238
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W P +E+KR N +PTMAGGLF+ID+ +F+++G YDS DIWGGENLELS
Sbjct: 239 FRWRLTPSQEQKRRNNYNVLFARSPTMAGGLFAIDRQYFQEIGLYDSQMDIWGGENLELS 298
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ +P P P AG +I+K + G+ +
Sbjct: 299 FRIWQCGGQLEIMPCSHVGHVFRNVIPYKFPKDAG--LTINKNSVRTAEVWMDGYKEFVY 356
Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGM 219
+ FG++T R ELR+ L CKSFKWYL+ V N S M
Sbjct: 357 QRQPYMRNIHFGNITERLELRKKLQCKSFKWYLDHVFTDVILPNESAIAKGKVRNPESEM 416
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY----AGGDVILYPC 275
C+++ +P H +GL PC +G ++ E+ DE C D +GG + L C
Sbjct: 417 CLNTLGRPK--HAFLGLSPCAHEGKTMIISLTVLNELAMDEVCFDVSDHQSGGKITLLDC 474
Query: 276 HGSKGNQYFEY 286
H GNQ++ +
Sbjct: 475 HSMGGNQFWSH 485
>gi|345492127|ref|XP_001602037.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Nasonia vitripennis]
Length = 635
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 117/321 (36%), Positives = 154/321 (47%), Gaps = 53/321 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A + VV P+I I DDTFE ++ GGF+W L
Sbjct: 280 CECTEGWLEPLLARIAHDKKTVVCPIIDVISDDTFEY-------ITASDMTWGGFNWKLN 332
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ + +RE +R+ + P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 333 FRWYRVAQREMDRRNGDRTAPLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMS 392
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
F+ W I E H +P T GG+ I + D W
Sbjct: 393 FRV-WQCGGILEISPCSHVGHVFRDKSPYTFPGGVSKI--VLHNAARVAEVWMDEWRDFY 449
Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCID----SACK 226
G N+ + GDV+ R +LR L CKSF+WYLE S M +D K
Sbjct: 450 YAMNPGARNVPV------GDVSERVKLREQLKCKSFRWYLENIYPESPMPLDYYYLGDIK 503
Query: 227 PTDMHKP------------------VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG- 267
D + P VG+ CH GGNQ + +K +I D+ CLD A
Sbjct: 504 NADPNNPEKVQNYCLDTMGRRTGENVGMSYCHGLGGNQIFAYTKRQQIMSDDMCLDAASP 563
Query: 268 -GDVILYPCHGSKGNQYFEYD 287
G V + CHG GNQ + Y+
Sbjct: 564 QGPVKIVRCHGMGGNQAWIYN 584
>gi|118404432|ref|NP_001072705.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Xenopus
(Silurana) tropicalis]
gi|115313486|gb|AAI24052.1| polypeptide N-acetylgalactosaminyltransferase 4 [Xenopus (Silurana)
tropicalis]
gi|134026084|gb|AAI35912.1| polypeptide N-acetylgalactosaminyltransferase 4 [Xenopus (Silurana)
tropicalis]
Length = 582
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 110/284 (38%), Positives = 147/284 (51%), Gaps = 48/284 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + N + VV P+I I +TFE G + IGGFDW L
Sbjct: 234 CECISGWLEPLLQRIGENETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 287
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WHA+PE+ER+R K+ +P+ +PTMAGGLF++ K +FE LGTYD G ++WGGENLELSF
Sbjct: 288 FQWHAVPEKERQRRKSRIDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMEVWGGENLELSF 347
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK---LGTYDSGFDIWGGENL 180
+ W E EP + G +F KA + + L ++W
Sbjct: 348 RV-WQCGGTLE-------IEPC---SHVGHVFP-KKAPYARPNFLQNTARAAEVWMDGYK 395
Query: 181 ELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----M 219
EL + K ++GD++ RK LR L CKSF WYL+ + D W G M
Sbjct: 396 ELFYNRNPPARKENYGDISERKLLRERLQCKSFDWYLKNVFPDLHIPEDRPGWHGAVRSM 455
Query: 220 CIDSACKPTDM--HKPVG----LYPCHKQGGNQFWMMSKHGEIR 257
I + C + H P G L+ CH QGGNQF+ + EIR
Sbjct: 456 GISNECLDYNAPDHNPTGAHLSLFGCHGQGGNQFFEYTTMREIR 499
>gi|291220820|ref|XP_002730422.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Saccoglossus kowalevskii]
Length = 1082
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 103/319 (32%), Positives = 154/319 (48%), Gaps = 57/319 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL+PL++ + R+SS + P+I I D+F P GG +W LQ
Sbjct: 739 CEVNYNWLEPLIERIYRDSSTIACPVIDIIDPDSFAYSASP--------LVRGGVNWGLQ 790
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E R + EP+ +P MAGGLF++D+ +FE +G+YD IWGGE+LELSF
Sbjct: 791 FKWKNVPPVELLRRNSEIEPIKSPIMAGGLFAVDRNYFEHIGSYDKDMQIWGGEHLELSF 850
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS--GFDIWG 176
+ +P + P T+ GG+ E + T++S ++W
Sbjct: 851 RIWQCGGTLEIVPCSRVGHIFRKSHPY---TIPGGM--------ENVFTHNSIRVAEVWM 899
Query: 177 GENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL------------------E 211
+ + +GD++ R +L+ L CK FKWYL E
Sbjct: 900 DDYKRFFYATRPDAQGKTYGDLSERLKLKSRLKCKDFKWYLDNVYPELSVPNENAYAWGE 959
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDV- 270
N S +C+D+ + + +PVGLY CH GGNQ + +K GE+R +E CLD + V
Sbjct: 960 CQNAASNVCLDTLMR--EAGQPVGLYICHGGGGNQVFSYTKLGEVRHEELCLDVSTKKVG 1017
Query: 271 ---ILYPCHGSKGNQYFEY 286
+ CH GNQ +E+
Sbjct: 1018 ETPVFEQCHALGGNQMWEH 1036
>gi|355689595|gb|AER98885.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 [Mustela putorius
furo]
Length = 452
Score = 163 bits (412), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 110/303 (36%), Positives = 153/303 (50%), Gaps = 51/303 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ ++ + + VV P+I I +TFE G + IGGFDW L
Sbjct: 104 CECNSGWLEPLLERISYDETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 157
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R K+ +P+ +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 158 FQWHSVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 217
Query: 124 KFNWHA--IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S +KA L ++W E E
Sbjct: 218 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRNKA----LANCVRAAEVWMDEFKE 266
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMCIDSA 224
L + + FGDVT RK+LR L CK F+W+LE V D + GM +
Sbjct: 267 LYYHRNPHARLEPFGDVTERKQLRAKLQCKDFRWFLENVYPELHVPEDRPGFFGMLQNKG 326
Query: 225 CK-------PTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR----RDEACLDYAGGD 269
K P + ++ +G LY CH G NQF+ + EIR + EAC+ G
Sbjct: 327 LKDYCFDYNPPNENQIMGHQVLLYLCHGMGQNQFFEYTSQKEIRYNTHQPEACIAVEAGT 386
Query: 270 VIL 272
IL
Sbjct: 387 DIL 389
>gi|326436254|gb|EGD81824.1| hypothetical protein PTSG_02538 [Salpingoeca sp. ATCC 50818]
Length = 604
Score = 163 bits (412), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 112/321 (34%), Positives = 160/321 (49%), Gaps = 59/321 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + + + VV+P+I NI TF P +T G F W+L
Sbjct: 249 CECNVGWLEPLLERIYLDRTTVVTPVIDNIDKKTFAYTGSPTVITR------GIFTWSLT 302
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+W +P E+K+ K+ P+ +PTMAGGLFS+D+ +F ++G+YD G D+WGGENLE+SF
Sbjct: 303 FSWLDLPWFEQKKRKDPIAPLPSPTMAGGLFSMDREYFFEIGSYDMGMDVWGGENLEISF 362
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ IP P P +G + +I+K + ++W E
Sbjct: 363 RIWQCGGTLEFIPCSRVGHVYRDFHPYKFP--SGAVQTINKNL-------NRVAEVWMDE 413
Query: 179 NLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------------------V 212
EL + GD++ R ELR+ L CK FKWYL+ V
Sbjct: 414 YKELYYGVRPHHRAIGTGDISDRLELRKKLNCKPFKWYLDNVFPDMMVPLPENLLGKGAV 473
Query: 213 SNDWSGMCIDS-ACKPTDMHKPVGLYPCH--KQGGNQFWMMSKHGEIRRD----EACLDY 265
N + MC+DS + + DM GLYPC K F+ +K+GEIRR+ CLD+
Sbjct: 474 KNAATNMCLDSLSSREVDMK--AGLYPCANGKSENQMFYFTTKYGEIRREGTFGARCLDF 531
Query: 266 AGG----DVILYPCHGSKGNQ 282
AGG + +Y CH KGNQ
Sbjct: 532 AGGKPGSTLSMYGCHLMKGNQ 552
>gi|195433228|ref|XP_002064617.1| GK23729 [Drosophila willistoni]
gi|194160702|gb|EDW75603.1| GK23729 [Drosophila willistoni]
Length = 677
Score = 163 bits (412), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 118/352 (33%), Positives = 160/352 (45%), Gaps = 77/352 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + +N VV P+I I D+TFE +T+S + GGF+W L
Sbjct: 285 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 337
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P RE R N P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 338 FRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 397
Query: 123 FKF----------------------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA 160
F+ + + P K + A V M GG+ I
Sbjct: 398 FRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVAEVWMCGGILEIAPC 457
Query: 161 -----FFEKLGTYD---SGFDIWGGENLEL------------------SFKGDFGDVTSR 194
F K Y +I N L + K GDV+ R
Sbjct: 458 SRVGHVFRKSTPYTFPGGTTEIVNHNNARLVEVWLDDWKEFYYSFYPGARKASAGDVSDR 517
Query: 195 KELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHKPVGLY 237
K LR L CKSF+WYL E+ N + C+D+ + ++ VG+
Sbjct: 518 KNLRERLKCKSFRWYLENVYPESLMPLDYYYLGEIRNSETETCLDTMGR--KYNEKVGIS 575
Query: 238 PCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHGSKGNQYFEYD 287
CH GGNQ + +K +I D+ CLD + G V + CH GNQ + YD
Sbjct: 576 YCHGLGGNQVFAYTKRQQIMSDDLCLDASSSNGPVNMVRCHNMGGNQEWVYD 627
>gi|449507774|ref|XP_004186276.1| PREDICTED: LOW QUALITY PROTEIN:
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
partial [Taeniopygia guttata]
Length = 402
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/322 (34%), Positives = 155/322 (48%), Gaps = 56/322 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 51 CECTLGWLEPLLSRIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 103
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE +R K + PV TPTMAGGLFSID+++FE++GTYD+G DIWGGENLE+S
Sbjct: 104 FRWYPVPQREMERRKGDRTLPVRTPTMAGGLFSIDRSYFEEIGTYDAGMDIWGGENLEMS 163
Query: 123 FKFNWHAIPERERKRHKNA------AEPVWTPTMAGGLFSID------------KAFFEK 164
F+ W E + A P P G + + + K FF
Sbjct: 164 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDDFKDFFYI 222
Query: 165 LGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-------------- 210
+ F +W L +G V L+ + C+ F WYL
Sbjct: 223 ISPGAPRF-VWDKRIL-------YGIVPWCGTLKIRMKCQPFSWYLENVYPDSQIPRRYY 274
Query: 211 ---EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA- 266
E+ N + C+D+ + + + VG + CH GGNQ + + EIR D+ CLD +
Sbjct: 275 SLGEIRNVETNQCLDNMGRKEN--EKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSR 332
Query: 267 -GGDVILYPCHGSKGNQYFEYD 287
G V++ CH +GNQ +EYD
Sbjct: 333 LNGPVLMLKCHHLRGNQLWEYD 354
>gi|195550891|ref|XP_002076130.1| GD11982 [Drosophila simulans]
gi|194201779|gb|EDX15355.1| GD11982 [Drosophila simulans]
Length = 541
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 119/352 (33%), Positives = 161/352 (45%), Gaps = 80/352 (22%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + +N VV P+I I D+TFE +T+S + GGF+W L
Sbjct: 152 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 204
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P RE R N P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 205 FRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 264
Query: 123 FKF----------------------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA 160
F+ + + P K + A VW M GG+ I
Sbjct: 265 FRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVW---MCGGVLEIAPC 321
Query: 161 -----FFEKLGTYD---SGFDIWGGENLEL------------------SFKGDFGDVTSR 194
F K Y +I N L + K GDV+ R
Sbjct: 322 SRVGHVFRKSTPYTFPGGTTEIVNHNNARLVEVWLDDWKEFYYSFYPGARKASAGDVSDR 381
Query: 195 KELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHKPVGLY 237
K LR L CKSF+WYL E+ N + C+D+ + ++ VG+
Sbjct: 382 KALRDRLKCKSFRWYLENVYPESLMPLDYYYLGEIRNAETETCLDTMGR--KYNEKVGIS 439
Query: 238 PCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHGSKGNQYFEYD 287
CH GGNQ + +K +I D+ CLD + G V + CH GNQ + YD
Sbjct: 440 YCHGLGGNQVFAYTKRQQIMSDDLCLDASSSNGPVNMVRCHNMGGNQEWVYD 491
>gi|326917280|ref|XP_003204928.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Meleagris gallopavo]
Length = 528
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 114/305 (37%), Positives = 149/305 (48%), Gaps = 55/305 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A S VV P+I I +TFE L ++ + IGGFDW L
Sbjct: 176 CECHEGWLEPLLERIAEEESAVVCPVIDVIDWNTFEY------LGNAGEPQIGGFDWRLV 229
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH PERE+KR K+ + + +PTMAGGLFS+ K +F+ LG+YD+G ++WGGENLE SF
Sbjct: 230 FTWHTTPEREQKRRKSKIDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSF 289
Query: 124 KFNWHAIPERERK--RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S KA L ++W E E
Sbjct: 290 RI-WQCGGSLEIHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDEYKE 338
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSG--------- 218
L + + +GDVT R+ LR L CK FKW+LE V D G
Sbjct: 339 LYYHRNPHARLEPYGDVTERRLLREKLKCKDFKWFLENVYPELHVPEDRPGFFGMLKNRG 398
Query: 219 ---MCIDSACKPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEI----RRDEACLDYAG 267
C D P + H+ G LYPCH G NQF+ + H EI R+ EAC
Sbjct: 399 MANFCFDY--NPPNEHEITGHRVILYPCHGMGQNQFFEYTSHNEIRYNTRQPEACAAVIA 456
Query: 268 GDVIL 272
G L
Sbjct: 457 GTEYL 461
>gi|363730612|ref|XP_419065.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Gallus
gallus]
Length = 590
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 117/325 (36%), Positives = 157/325 (48%), Gaps = 61/325 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A S VV P+I I +TFE L ++ + IGGFDW L
Sbjct: 238 CECHEGWLEPLLERIAEEESAVVCPVIDVIDWNTFEY------LGNAGEPQIGGFDWRLV 291
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH PERE+KR K+ + + +PTMAGGLFS+ K +F+ LG+YD+G ++WGGENLE SF
Sbjct: 292 FTWHTTPEREQKRRKSKIDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSF 351
Query: 124 KFNWHAIPERERK--RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S KA L ++W E E
Sbjct: 352 RI-WQCGGSLEIHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDEYKE 400
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSG--------- 218
L + + +GDV+ R+ LR L CK FKW+LE V D G
Sbjct: 401 LYYHRNPHARLEPYGDVSERRLLREKLKCKDFKWFLENVYPELHVPEDRPGFFGMLKNRG 460
Query: 219 ---MCIDSACKPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEI----RRDEAC----- 262
C D P++ H+ G LYPCH G NQF+ + H EI R+ EAC
Sbjct: 461 MANFCFDY--NPSNEHEITGHRVILYPCHGMGQNQFFEYTSHNEIRYNTRQPEACAAVIA 518
Query: 263 -LDYAGGDVILYPCHGSKGNQYFEY 286
DY ++ H NQ F +
Sbjct: 519 GTDYLTMNLCQENIHRVPENQKFAF 543
>gi|291230378|ref|XP_002735140.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 621
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 117/347 (33%), Positives = 161/347 (46%), Gaps = 93/347 (26%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE K WL+PLLD +A N S VV P+I I D +F + ++ IGGFDWN+
Sbjct: 255 CECSKGWLEPLLDRIAANRSTVVCPVINQIDDRSFAF------VNATEVSHIGGFDWNII 308
Query: 64 FNWHAIPERERKR-HKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
FNW+ IP+ E+ R + +EPV +PTMAGGLFSIDK++FE+LG+YD F+ WGGEN+ELS
Sbjct: 309 FNWYNIPQSEKDRIGGDKSEPVRSPTMAGGLFSIDKSYFEELGSYDPEFEFWGGENIELS 368
Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTY---DSGFDI 174
K +W M GG+ F K + ++ +++
Sbjct: 369 LK--------------------IW---MCGGILEFVPCSHVGHVFRKHNPHKYKNTTYNV 405
Query: 175 WGGENLEL------------------SFKGDFGDVTSRKELRRNLGCKSFKWYL------ 210
G N L + K D GD++ R +LR+NL CKSF+W+L
Sbjct: 406 VGRNNRRLAEVWLDEYKYLFYANQPETMKIDPGDISQRVQLRKNLQCKSFRWFLQNIYPD 465
Query: 211 -----------EVSNDWSGMCID---------------SACKPTDMHKPVGLYPCHKQGG 244
++ N SG C+D A T V L+PCH G
Sbjct: 466 SHYNFAFVGVGQLKNVASGACLDFGKAAGHGGKEFKGKDATNVTS--NTVELWPCH-DGK 522
Query: 245 NQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHGSKGNQYFEYDYK 289
Q ++ + E R CLDY LY CHG NQ + +D K
Sbjct: 523 IQLFIRTDKKEFRYIHMCLDYNVQFSFPFLYECHGQGANQQWIHDLK 569
>gi|195114158|ref|XP_002001634.1| GI15842 [Drosophila mojavensis]
gi|193912209|gb|EDW11076.1| GI15842 [Drosophila mojavensis]
Length = 628
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 113/321 (35%), Positives = 150/321 (46%), Gaps = 60/321 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E ++WL+PLL+ + + + VV P+I I D F+ L GGFDWNL
Sbjct: 288 VECNEQWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 340
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ER RH + + TP +AGGLF IDKA+F KLG YD D+WGGENLE+S
Sbjct: 341 FKWEYLSPAERAARHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 400
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ +
Sbjct: 401 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFAKNTR---------RAA 446
Query: 173 DIWGGE-------NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW-------- 216
++W E + L+ FG++ R L+ L CK FKWYLE V D
Sbjct: 447 EVWMDEYKQHYYNAVPLAKNIPFGNIDDRLALKEKLQCKPFKWYLEHVYPDLQTPDPQDV 506
Query: 217 ------SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA---- 266
+ C+D+ D VGL+PCH GGNQ W SK GEI+ DE CL
Sbjct: 507 GQFRQDATECLDTMGHIVD--GTVGLFPCHNTGGNQEWTFSKRGEIKHDELCLTLVQFAR 564
Query: 267 GGDVILYPCHGSKGNQYFEYD 287
G VIL PC S+ ++ D
Sbjct: 565 GSQVILKPCDESENQRWVMKD 585
>gi|194761562|ref|XP_001962998.1| GF15722 [Drosophila ananassae]
gi|190616695|gb|EDV32219.1| GF15722 [Drosophila ananassae]
Length = 675
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 118/352 (33%), Positives = 160/352 (45%), Gaps = 77/352 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + +N VV P+I I D+TFE +T+S + GGF+W L
Sbjct: 283 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 335
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P RE R N P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 336 FRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 395
Query: 123 FKF----------------------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA 160
F+ + + P K + A V M GG+ I
Sbjct: 396 FRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVAEVWMCGGVLEIAPC 455
Query: 161 -----FFEKLGTYD---SGFDIWGGENLEL------------------SFKGDFGDVTSR 194
F K Y +I N L + K GDV+ R
Sbjct: 456 SRVGHVFRKSTPYTFPGGTTEIVNHNNARLVEVWLDDWKEFYYSFYPGARKASAGDVSDR 515
Query: 195 KELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHKPVGLY 237
K LR L CKSF+WYL E+ N + C+D+ + ++ VG+
Sbjct: 516 KALRERLKCKSFRWYLENVYPESLMPLDYYYLGEIRNAETETCLDTMGR--KYNEKVGIS 573
Query: 238 PCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHGSKGNQYFEYD 287
CH GGNQ + +K +I D+ CLD + G V + CH GNQ + YD
Sbjct: 574 YCHGLGGNQVFAYTKRQQIMSDDLCLDASSSNGPVNMVRCHNMGGNQEWVYD 625
>gi|156392174|ref|XP_001635924.1| predicted protein [Nematostella vectensis]
gi|156223022|gb|EDO43861.1| predicted protein [Nematostella vectensis]
Length = 415
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 107/275 (38%), Positives = 136/275 (49%), Gaps = 46/275 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE K WL+PL +A NSS+VV P+I I D TF P F G F W L+
Sbjct: 153 CECSKGWLEPLAAKIAENSSNVVMPVIDEISDTTFYYHAVPE------PFHRGVFRWRLE 206
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P+ E +R K+ A+ + TP MAGGLFSIDK +FEK+GTYD+G DIWGGENLE+SF
Sbjct: 207 FGWKPVPQYEMERRKDEADGIRTPVMAGGLFSIDKNYFEKIGTYDTGMDIWGGENLEISF 266
Query: 124 KFNWH---AIPERERKRHKNAAEPVWT---PTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
+ W AI R + P + P G + ++ D+W
Sbjct: 267 RI-WMCGGAIEMLPCSRVGHVFRPRFPYSFPARPGHNTDVVSNNLMRVA------DVWMD 319
Query: 178 E------NLELSFK-GDFGDVTSRKELRRNLGCKSFKWYL------------------EV 212
E N+ K DV+ R LR L CK+FKWYL +V
Sbjct: 320 EYKKHFYNIRFDLKRKQHDDVSQRLALREKLKCKNFKWYLDNVYPELEVPDTNFAASGQV 379
Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQF 247
N S MC+D+ K D P+GLY CH QGGNQ
Sbjct: 380 RNPSSDMCLDTLGKKDDT--PLGLYQCHGQGGNQV 412
>gi|281341254|gb|EFB16838.1| hypothetical protein PANDA_002911 [Ailuropoda melanoleuca]
Length = 496
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 114/304 (37%), Positives = 153/304 (50%), Gaps = 52/304 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + S VV P+I I +TFE PG IGGFDW L
Sbjct: 147 CECHEGWLEPLLQRIHEEESAVVCPVIDVIDWNTFEYLGNPGEPQ------IGGFDWRLV 200
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH +PERER R ++ + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 201 FTWHVVPERERMRMRSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 260
Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S +KA L ++W E E
Sbjct: 261 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRNKA----LANSVRAAEVWMDEFKE 309
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMCIDSA 224
L + + FGDVT RK+LR L CK F+W+LE V D + GM +
Sbjct: 310 LYYHRNPHARLEPFGDVTERKQLRARLQCKDFRWFLENVYPELHVPEDRPGFFGMLQNKG 369
Query: 225 CK-------PTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR----RDEACLDY-AGG 268
K P + ++ VG LY CH G NQF+ + EIR + EAC+ AG
Sbjct: 370 LKDYCFDYNPPNENQIVGHQVLLYLCHGLGQNQFFEYTSQEEIRYNTHQPEACIAVEAGK 429
Query: 269 DVIL 272
DV++
Sbjct: 430 DVLI 433
>gi|301758254|ref|XP_002914993.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Ailuropoda melanoleuca]
Length = 540
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 114/304 (37%), Positives = 153/304 (50%), Gaps = 52/304 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + S VV P+I I +TFE PG IGGFDW L
Sbjct: 191 CECHEGWLEPLLQRIHEEESAVVCPVIDVIDWNTFEYLGNPGEPQ------IGGFDWRLV 244
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH +PERER R ++ + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 245 FTWHVVPERERMRMRSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 304
Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S +KA L ++W E E
Sbjct: 305 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRNKA----LANSVRAAEVWMDEFKE 353
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMCIDSA 224
L + + FGDVT RK+LR L CK F+W+LE V D + GM +
Sbjct: 354 LYYHRNPHARLEPFGDVTERKQLRARLQCKDFRWFLENVYPELHVPEDRPGFFGMLQNKG 413
Query: 225 CK-------PTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR----RDEACLDY-AGG 268
K P + ++ VG LY CH G NQF+ + EIR + EAC+ AG
Sbjct: 414 LKDYCFDYNPPNENQIVGHQVLLYLCHGLGQNQFFEYTSQEEIRYNTHQPEACIAVEAGK 473
Query: 269 DVIL 272
DV++
Sbjct: 474 DVLI 477
>gi|195472767|ref|XP_002088670.1| GE18697 [Drosophila yakuba]
gi|194174771|gb|EDW88382.1| GE18697 [Drosophila yakuba]
Length = 675
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 118/352 (33%), Positives = 160/352 (45%), Gaps = 77/352 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + +N VV P+I I D+TFE +T+S + GGF+W L
Sbjct: 283 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 335
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P RE R N P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 336 FRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 395
Query: 123 FKF----------------------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA 160
F+ + + P K + A V M GG+ I
Sbjct: 396 FRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVAEVWMCGGVLEIAPC 455
Query: 161 -----FFEKLGTYD---SGFDIWGGENLEL------------------SFKGDFGDVTSR 194
F K Y +I N L + K GDV+ R
Sbjct: 456 SRVGHVFRKSTPYTFPGGTTEIVNHNNARLVEVWLDDWKEFYYSFYPGARKASAGDVSDR 515
Query: 195 KELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHKPVGLY 237
K LR L CKSF+WYL E+ N + C+D+ + ++ VG+
Sbjct: 516 KALRDRLKCKSFRWYLENVYPESLMPLDYYYLGEIRNAETETCLDTMGR--KYNEKVGIS 573
Query: 238 PCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHGSKGNQYFEYD 287
CH GGNQ + +K +I D+ CLD + G V + CH GNQ + YD
Sbjct: 574 YCHGLGGNQVFAYTKRQQIMSDDLCLDASSSNGPVNMVRCHNMGGNQEWVYD 625
>gi|312075557|ref|XP_003140470.1| Gly-3 protein [Loa loa]
gi|307764367|gb|EFO23601.1| Gly-3 protein [Loa loa]
Length = 584
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 100/304 (32%), Positives = 151/304 (49%), Gaps = 52/304 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV + WL+PLLD ++ + VV+P+I I D+ FE ++ GGF+W+L
Sbjct: 242 VEVTEGWLEPLLDRVSVDRKRVVAPIIDVISDENFEY-------ITASDITWGGFNWHLN 294
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P RE +R+ + + P+ TPT+AGGLF+ID+ FF +G+YD G ++WGGENLE+S
Sbjct: 295 FRWYPVPMREMERRNHDRSVPLQTPTIAGGLFAIDRQFFYDIGSYDEGMEVWGGENLEIS 354
Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD-------SGFDIW 175
F+ W + + + G +F + GT + ++W
Sbjct: 355 FRV-WMC----------GGSLEIHPCSRVGHVFRKHTPYSFPGGTANVIHRNAARTAEVW 403
Query: 176 GGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE----------------- 211
E ++ +K D GD+T RK LR NL CKSF+WYLE
Sbjct: 404 MDEYKDIFYKMVPAAKNVDIGDLTERKVLRENLQCKSFRWYLETIYPESPIPIDFLSLGQ 463
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVI 271
+ N C+D+A + + PCH +GGNQ W + GEIR DE CL + +
Sbjct: 464 IQNMGVVGCLDTAGRSAG--DSPAILPCHGKGGNQLWAYTGKGEIRADELCLAFTVKGIS 521
Query: 272 LYPC 275
+ C
Sbjct: 522 MEKC 525
>gi|312377724|gb|EFR24483.1| hypothetical protein AND_10876 [Anopheles darlingi]
Length = 594
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 124/356 (34%), Positives = 165/356 (46%), Gaps = 85/356 (23%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + + VV P+I I D+TFE +T+S + + GGF+W L
Sbjct: 246 CECTEGWLEPLLARIVLDRKTVVCPIIDVISDETFEY------VTASDQTW-GGFNWKLN 298
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P RE ++R+ + P+ TPTMAGGLFSID+ +F ++G+YD G DIWGGENLE+S
Sbjct: 299 FRWYRVPAREMQRRNHDRTAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMS 358
Query: 123 FKFNWH--AIPERERKRH----------------------KNAAE--PVWTPTMAGGLFS 156
F+ W I E H KNAA VW M GG
Sbjct: 359 FRI-WQCGGILEIAPCSHVGHVFRDKSPYTFPGGVANIVLKNAARVAEVW---MCGGTLE 414
Query: 157 IDKA-----FFEKLGTYD---SGFDIWGGENLEL------------------SFKGDFGD 190
I F K Y I N L + K GD
Sbjct: 415 IAPCSRVGHVFRKSTPYSFPGGTSQIVNKNNARLAEVWLDGWSEFYYNINPGARKASAGD 474
Query: 191 VTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHKP 233
V+ R+ELR L CKSF+WYL E+ N S C+D+ + + +
Sbjct: 475 VSERRELRERLKCKSFRWYLENIYPESQMPLDYYFLGEIRNVESQNCLDTMGRKAN--EK 532
Query: 234 VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY--AGGDVILYPCHGSKGNQYFEYD 287
+G CH GGNQ + +K +I D+ CLD A G V L CHG GNQ + YD
Sbjct: 533 IGSSYCHGLGGNQVFAYTKRHQIMSDDNCLDASNALGPVNLVRCHGMAGNQEWIYD 588
>gi|189236651|ref|XP_969621.2| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
castaneum]
gi|270005204|gb|EFA01652.1| hypothetical protein TcasGA2_TC007223 [Tribolium castaneum]
Length = 564
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 112/311 (36%), Positives = 154/311 (49%), Gaps = 47/311 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A + + VV P+I I DTF+ L GGFDWNL
Sbjct: 222 CECNVNWLEPLLERVAEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWNLV 274
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ER+ R ++ + + TP +AGGLF I+KA+FEKLG YD D+WGGENLE+S
Sbjct: 275 FKWEYLGYAERESRQRDPTQAIRTPMIAGGLFVINKAYFEKLGKYDMKMDVWGGENLEIS 334
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D
Sbjct: 335 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKH 389
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------VSNDWS 217
+ + L+ FGD++ R ELRRNL CK FKWYL+ V
Sbjct: 390 FYYAA--VPLAKNIPFGDISERLELRRNLQCKPFKWYLQHVYPELAIPQATSAHVGELRQ 447
Query: 218 GM-CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGG-DVIL 272
GM C+D+ D V LY CH GGNQ W ++ G I+ + CL DY G V++
Sbjct: 448 GMYCLDTMGHLID--GTVALYQCHHTGGNQEWGLTSGGLIKHHDLCLTLDDYMKGVQVVM 505
Query: 273 YPCHGSKGNQY 283
C GS ++
Sbjct: 506 RICDGSDSQKW 516
>gi|449276238|gb|EMC84873.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Columba livia]
Length = 522
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 106/285 (37%), Positives = 142/285 (49%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A N + +V P+I I TFE + + IGGFDW L
Sbjct: 174 CECVSGWLEPLLERIAENETVIVCPVIDTIDWKTFEYYM------QTAEPMIGGFDWRLT 227
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R K+ +P+ +PTMAGGLF++ K +FE LGTYD+G D+WGGENLELSF
Sbjct: 228 FQWHSVPKHERLRRKSETDPIRSPTMAGGLFAVSKKYFEYLGTYDTGMDVWGGENLELSF 287
Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W + E H P P L ++W E E
Sbjct: 288 RV-WQCGGMLEIHPCSHVGHVFPKRAPYARPNF----------LQNTARAAEVWMDEYKE 336
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGM----- 219
+ K ++GD++ RK LR L CKSF WYL+ V D W G
Sbjct: 337 HFYNRNPSARKENYGDLSERKILRERLKCKSFNWYLKNIFAELHVPEDRPGWHGAIRSAG 396
Query: 220 ----CIDSAC---KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR 257
C+D A PT H + L+ CH QGGNQF+ + + EIR
Sbjct: 397 IASECLDYALPENHPTGAH--LSLFGCHGQGGNQFFEYTSNKEIR 439
>gi|326911650|ref|XP_003202170.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Meleagris gallopavo]
Length = 579
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 106/285 (37%), Positives = 140/285 (49%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A N + V+ P+I I +TFE S + IGGFDW L
Sbjct: 231 CECVSGWLEPLLERIAENETVVICPVIDTIDWNTFEYYM------QSAEPMIGGFDWRLT 284
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R K+ +P+ +PTMAGGLF++ K +FE LGTYD+G D+WGGENLELSF
Sbjct: 285 FQWHSVPKHERLRRKSETDPIRSPTMAGGLFAVSKKYFEYLGTYDTGMDVWGGENLELSF 344
Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W + E H P P L ++W E E
Sbjct: 345 RV-WQCGGMLEIHPCSHVGHVFPKRAPYARPNF----------LQNTARAAEVWMDEYKE 393
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
+ K ++GD++ RK LR L CKSF WYL V D W G
Sbjct: 394 HFYNRNPPARKENYGDISERKLLRERLKCKSFNWYLRNVFSELHVPEDRPGWHGAVRSVG 453
Query: 221 IDSAC--------KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR 257
I S C PT H + L+ CH QGGNQF+ + + E R
Sbjct: 454 ISSECLDYVLPEHNPTGAH--LSLFGCHGQGGNQFFEYTSNKEFR 496
>gi|291382916|ref|XP_002708201.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Oryctolagus cuniculus]
Length = 476
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 117/318 (36%), Positives = 157/318 (49%), Gaps = 54/318 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + S VV P+I I +TFE PG IGGFDW L
Sbjct: 127 CECHEGWLEPLLHRIHEKESAVVCPVIDVIDWNTFEYLGNPGEPQ------IGGFDWRLV 180
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH +PERER R ++ + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 181 FTWHVVPERERLRMRSPIDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 240
Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S +KA L ++W E E
Sbjct: 241 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRNKA----LANSVRAAEVWMDEFKE 289
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMCIDSA 224
L + + FGDVT R++LR L CK FKW+LE V D + GM +
Sbjct: 290 LYYHRNPRARLEPFGDVTERRQLRAKLQCKDFKWFLETVYPELHVPEDRPGFFGMLQNKG 349
Query: 225 CK-------PTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR----RDEACLDY-AGG 268
K P D ++ G LY CH G NQF+ + EIR + E C+ A
Sbjct: 350 LKNFCFDYNPPDENQITGHQVILYTCHGMGQNQFFEYTSQMEIRYNTHQPEGCVAVEADK 409
Query: 269 DV-ILYPCHGSK-GNQYF 284
DV +++PC + NQ F
Sbjct: 410 DVLVMHPCQDTTPENQKF 427
>gi|410905319|ref|XP_003966139.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Takifugu rubripes]
Length = 557
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 106/314 (33%), Positives = 151/314 (48%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + ++ VV P+I I DDTFE + GGF+W L
Sbjct: 210 CECTTGWLEPLLARIKKDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 262
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV AGG + +F+++GTYD+G DIWGGENLE+S
Sbjct: 263 FRWYPVPQREMDRRKGDRTLPVRWVRCAGGXXXXXRDYFQEIGTYDAGMDIWGGENLEIS 322
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 323 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 375
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD+ +R LR+ L CK F WYL E+ N
Sbjct: 376 KNFFYIISPGVTKVDYGDIATRTALRQKLQCKPFSWYLESIYPDSQIPRHYYSLGEIRNV 435
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD + G V++
Sbjct: 436 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVMML 493
Query: 274 PCHGSKGNQYFEYD 287
CH KGNQ ++YD
Sbjct: 494 KCHHLKGNQLWDYD 507
>gi|198422185|ref|XP_002121130.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
4 [Ciona intestinalis]
Length = 582
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 103/299 (34%), Positives = 146/299 (48%), Gaps = 47/299 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ + + S +V P+I I +TFE + ++ IGGFDW L
Sbjct: 236 CECVEGWLEPLLERIMEDESVIVVPVIDTIDWNTFEYYY------GGHEPQIGGFDWRLT 289
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH IP+ ERKR K+ +P+ +PTMAGGLF++ K +F ++GTYD+G +IWGGENLELSF
Sbjct: 290 FQWHTIPDHERKRRKSPVDPIRSPTMAGGLFAVSKRYFTRIGTYDAGMEIWGGENLELSF 349
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ IP P P + + + Y F I
Sbjct: 350 RTWMCGGKLETIPCSHVGHVFPKQSPYPRPKFLTNTLRAAEVWMDD---YKRHFYIRNPP 406
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND------------WSGM 219
+ K ++GD+++RK+LR +L C FKWYL+ V D S
Sbjct: 407 ----ASKENYGDISARKDLRNSLQCHDFKWYLDNVYPDLHVPEDRPGYYGAFRNSGMSSF 462
Query: 220 CIDSACKPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR---RDEACLDYAGGDVI 271
C+D A H P G ++ CH QGGNQF+ + E+R E C+ D I
Sbjct: 463 CLDYA---PPQHNPTGGRVSIFGCHGQGGNQFFEYTSKREVRFNSEKEMCMSAVEDDTI 518
>gi|442756891|gb|JAA70604.1| Putative polypeptide n-acetylgalactosaminyltransferase [Ixodes
ricinus]
Length = 582
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 114/314 (36%), Positives = 155/314 (49%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A + + VV P+I I D+TFE S+ GGF+W L
Sbjct: 235 CECTQNWLEPLLARIAEDRTRVVCPVIDVISDETFEY-------ISASDLTWGGFNWKLN 287
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F + +P+RE +R + PV TPTMAGGLF+IDK +F +LG YD G DIWGGENLELS
Sbjct: 288 FRGYRVPQRELDRRGGDRTLPVRTPTMAGGLFAIDKDYFVELGKYDEGMDIWGGENLELS 347
Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E E H TP T GG I +L ++W E
Sbjct: 348 FRI-WMCGGELEIVPCSHVGHVFRKSTPYTFPGGTSKIVNHNNARLA------EVWLDEW 400
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
E F D GD++ R+ LR+ L C SF+WYL E+ +
Sbjct: 401 KEFYFAINPAAKNVDKGDLSHRRNLRKKLKCNSFRWYLENIYPESHMPLDYYHLGEIKHA 460
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILY 273
S +C+D+ + + + V + CH NQ + +K +I D+ CLD + G V L
Sbjct: 461 DSPVCLDTFGRKSGEN--VAVSTCHGXXXNQVFAYTKRQQIMSDDNCLDASSPRGPVKLL 518
Query: 274 PCHGSKGNQYFEYD 287
CHG GNQ + YD
Sbjct: 519 RCHGMGGNQLWIYD 532
>gi|345317797|ref|XP_001520970.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Ornithorhynchus anatinus]
Length = 467
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 113/309 (36%), Positives = 158/309 (51%), Gaps = 53/309 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ + S VV P+I I +TFE L ++ + IGGFDW L
Sbjct: 118 CECHEGWLEPLLERIREEESAVVCPVIDVIDWNTFEY------LGNAGEPQIGGFDWRLV 171
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH IPERE+KR ++ + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 172 FTWHPIPEREQKRRRSKVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 231
Query: 124 KFNWHAIPERERK--RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S KA L ++W E
Sbjct: 232 RI-WQCGGSLEIHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDGYKE 280
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
L + + +GDVT+R++LR L C+ FKW+LE V D + GM
Sbjct: 281 LYYHRNPHARLEPYGDVTARRDLRSKLKCRDFKWFLENVYPELHVPEDRPGYFGMLKNKG 340
Query: 221 IDSAC---KPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIRRD----EAC--LDYAG 267
+++ C P D ++ G LYPCH G NQF+ + H EIR + EAC +D
Sbjct: 341 MENHCFDYNPPDENEVTGQRLILYPCHGMGQNQFFEYTSHHEIRYNTRHPEACAAVDVGT 400
Query: 268 GDVILYPCH 276
V +Y C
Sbjct: 401 DYVTMYLCQ 409
>gi|432934600|ref|XP_004081948.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 3-like [Oryzias
latipes]
Length = 600
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 107/320 (33%), Positives = 158/320 (49%), Gaps = 43/320 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A+N + VVSP I+ I +TFE P + + G FDW L
Sbjct: 250 CECFNGWLEPLLARIAQNYTAVVSPDISTIDLNTFEFMKPSPYGQNHNR---GNFDWGLS 306
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +F ++G+YD +IWGGEN+E+SF
Sbjct: 307 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFYQIGSYDEEMEIWGGENIEMSF 366
Query: 124 KF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
+ IP R + H P T +A + + + + Y
Sbjct: 367 RVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTQVIARNQVRLAEVWMDD---YKEI 420
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
F + +++ + FGD++ RK+LR L CK+F WYL+ V
Sbjct: 421 FYRRNQQAAQIAKEETFGDISKRKDLRERLQCKNFSWYLKNIYPEIFMPDLNPLLFGSVK 480
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDV 270
N C+D A + + K + +YPCH GGNQ++ S H E+R + E CL AGG V
Sbjct: 481 NVGKASCLD-AGENNEGGKELIMYPCHGLGGNQYFEYSTHREVRHNIQKELCLHGAGGVV 539
Query: 271 ILYPCHGSKGNQYFEYDYKY 290
L C N + + K+
Sbjct: 540 KLEECQYKGRNTFVGAEQKW 559
>gi|432110716|gb|ELK34193.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Myotis davidii]
Length = 466
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 114/303 (37%), Positives = 149/303 (49%), Gaps = 52/303 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + S VV P+I I +TFE L +S + IGGFDW L
Sbjct: 116 CECHEGWLEPLLQRIQEEESAVVCPVIDVIDWNTFEY------LGNSGEPQIGGFDWRLV 169
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH +PERER R ++ + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 170 FTWHVVPERERMRMRSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 229
Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S KA L ++W E E
Sbjct: 230 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRKKA----LANSVRAAEVWMDEFKE 278
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMCIDSA 224
L + + FGDVT RK+LR L CK FKW+LE V D + GM +
Sbjct: 279 LYYHRNPHARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHVPEDRPGFFGMLQNKG 338
Query: 225 CK-------PTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR----RDEACLDY-AGG 268
K P H G LY CH G NQF+ + EIR + EAC+ AG
Sbjct: 339 LKDYCFDYNPPSEHDLTGHQVLLYLCHGMGQNQFFEHTSQNEIRYNTHQPEACIAVEAGA 398
Query: 269 DVI 271
D +
Sbjct: 399 DTL 401
>gi|312374382|gb|EFR21947.1| hypothetical protein AND_15990 [Anopheles darlingi]
Length = 669
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 114/319 (35%), Positives = 150/319 (47%), Gaps = 61/319 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A + + VV P+I I DTF+ L GGFDWNL
Sbjct: 323 CECNVHWLEPLLARVAEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWNLV 375
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ERK R ++ P+ TP +AGGLF ID+++FEKLGTYD+ DIWGGENLE+S
Sbjct: 376 FKWEYLSGAERKERQRDPTAPIRTPMIAGGLFVIDRSYFEKLGTYDTQMDIWGGENLEIS 435
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P GG +I F K
Sbjct: 436 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFP--GGGSGNI----FAK--NTRRAA 482
Query: 173 DIWGGE-------NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV------------- 212
++W E + L+ FGD+ R LR L CK F+WYLE
Sbjct: 483 EVWMDEYKRYYYAAVPLATNIPFGDIEDRLRLREELQCKPFRWYLENVYPQLSVPERRNN 542
Query: 213 -SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG---- 267
S C+DS VGLY CH GGNQ W++++ GE++ + CL
Sbjct: 543 GSIRQGAFCLDSLGNVAGA--IVGLYSCHGNGGNQNWILNRKGEVKHHDLCLTLIKFSVN 600
Query: 268 ---GDVILYPCHGSKGNQY 283
VI+ C GS+ Q+
Sbjct: 601 ARYNSVIMKYCDGSENQQW 619
>gi|348519859|ref|XP_003447447.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Oreochromis niloticus]
Length = 624
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 107/306 (34%), Positives = 150/306 (49%), Gaps = 43/306 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP I I +TFE P + + G FDW+L
Sbjct: 274 CECFNGWLEPLLARIAENYTAVVSPDITTIDLNTFEFMKPSPYGQNHNR---GNFDWSLS 330
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +F ++G+YD +IWGGEN+E+SF
Sbjct: 331 FGWESLPDHEKRRRKDETYPIKTPTFAGGLFSISKEYFYRIGSYDEEMEIWGGENIEMSF 390
Query: 124 KF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
+ IP R + H P T +A + + + + Y
Sbjct: 391 RVWQCGGQLEIIPCSIVGHVFRTKSPH---TFPKGTQVIARNQVRLAEVWMDD---YKEI 444
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
F + +++ G FGD++ R ELR L CKSF WYL+ V
Sbjct: 445 FYRRNQQAAQIAKDGAFGDISKRVELREKLQCKSFSWYLQNVYPEVFMPDLNPLRFGSVK 504
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDV 270
N C+D A + + K + +YPCH GGNQ++ S H EIR + E CL A G V
Sbjct: 505 NVGKDSCLD-AGENNEGGKQLIMYPCHGLGGNQYFEYSTHHEIRHNIQKELCLHGAEGAV 563
Query: 271 ILYPCH 276
L C
Sbjct: 564 KLEDCQ 569
>gi|167523942|ref|XP_001746307.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775069|gb|EDQ88694.1| predicted protein [Monosiga brevicollis MX1]
Length = 2376
Score = 160 bits (404), Expect = 9e-37, Method: Composition-based stats.
Identities = 102/294 (34%), Positives = 138/294 (46%), Gaps = 61/294 (20%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
EV + W +PLL + + HVV+P+I I D F P GGFDW L F
Sbjct: 1124 EVNRDWAEPLLQRINEDPLHVVTPIIDVISDSNFRYSASP--------VVRGGFDWGLTF 1175
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
W ++P ++ A P+ +PTMAGGLF++ + F +LGTYD G DIWG ENLE+SF+
Sbjct: 1176 KWKSVPRSQQSSDPTA--PIASPTMAGGLFAMKRTTFYELGTYDLGMDIWGAENLEMSFR 1233
Query: 125 FNWHAIPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
W E RK H P P G + + +L +
Sbjct: 1234 I-WQCGARLEIMPCSRVGHVFRKHH-----PYSFPGGGSGHVFLRNSL--RLA------E 1279
Query: 174 IWGGENLEL--SFKG------DFGDVTSRKELRRNLGCKSFKWYL--------------- 210
+W E E S KG D GD++ R++LR +L CK FKWYL
Sbjct: 1280 VWMDEYAEFFKSRKGSAARKIDIGDISERQKLREDLHCKPFKWYLDNVYPELRVPDPNPV 1339
Query: 211 -EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL 263
E G C+DSA K + V LY CH GGNQ W +S +GE+ ++AC+
Sbjct: 1340 GEGQVQSGGFCLDSAGK--SVGHAVALYRCHGLGGNQLWTLSHNGELAHEDACV 1391
>gi|410978730|ref|XP_003995741.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12,
partial [Felis catus]
Length = 469
Score = 159 bits (403), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 112/306 (36%), Positives = 154/306 (50%), Gaps = 56/306 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ + S VV P+I I +TFE L ++ + IGGFDW L
Sbjct: 120 CECHEGWLEPLLERIHEEESAVVCPVIDVIDWNTFEY------LGNAGEPQIGGFDWRLV 173
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH +PERER R ++ + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 174 FTWHVVPERERTRMRSPIDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 233
Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S +KA L ++W E E
Sbjct: 234 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRNKA----LANSVRAAEVWMDEFKE 282
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSGM-------- 219
L + + FGDVT RK+LR L CK F+W+LE V D G
Sbjct: 283 LYYHRNPHARLEPFGDVTERKQLRAKLQCKDFRWFLENVYPELHVPEDRPGFFGMLQNKG 342
Query: 220 ----CIDSACKPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR----RDEACLDY-A 266
C D P + ++ VG LY CH G NQF+ + EIR + EAC+ A
Sbjct: 343 LRDYCFDY--NPPNENQIVGHQVLLYHCHGMGQNQFFEYTSRNEIRYNTHQPEACIAVDA 400
Query: 267 GGDVIL 272
G D+++
Sbjct: 401 GMDILI 406
>gi|405966385|gb|EKC31678.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
Length = 1019
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 82/175 (46%), Positives = 107/175 (61%), Gaps = 35/175 (20%)
Query: 147 TPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELR-------- 198
+PTMAGGLFSI + +F +LGTY G DIWGGENLELSF+ +V + +R
Sbjct: 794 SPTMAGGLFSISREYFTELGTYHLGMDIWGGENLELSFRRTGVNVVKKNSIRLAKVWMDE 853
Query: 199 -RNL--------GCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHK 232
+N C +F W++ E+ + MCIDSA + HK
Sbjct: 854 YKNYYYERFNYDLCHNFDWFVKNVYPDLFVPGEAIASGEILSKAKPMCIDSAVDNRNYHK 913
Query: 233 PVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD-VILYPCHGSKGNQYFEY 286
PV ++PCH QGGNQFWM+SK+GEIRRD+ CLDY+GG+ VI+YPCHG KGNQ ++Y
Sbjct: 914 PVNMWPCHNQGGNQFWMLSKNGEIRRDDGCLDYSGGESVIVYPCHGQKGNQEWQY 968
>gi|158299131|ref|XP_319236.4| AGAP010078-PA [Anopheles gambiae str. PEST]
gi|157014221|gb|EAA14535.4| AGAP010078-PA [Anopheles gambiae str. PEST]
Length = 504
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 112/323 (34%), Positives = 150/323 (46%), Gaps = 61/323 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A + + VV P+I I DTF+ L GGFDWNL
Sbjct: 160 CECNVNWLEPLLARVAEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWNLV 212
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ERK R ++ P+ TP +AGGLF IDKA+FE+LGTYD+ DIWGGENLE+S
Sbjct: 213 FKWEYLSNAERKARQRDPTAPIRTPMIAGGLFVIDKAYFERLGTYDTQMDIWGGENLEIS 272
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P G F K
Sbjct: 273 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGGSG------NIFAK--NTRRAA 319
Query: 173 DIWGGE-------NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGM------ 219
++W E + L+ FGD+ R +LR+ L CK F+WYLE G+
Sbjct: 320 EVWMDEYKKYYYAAVPLATNIPFGDIDDRLQLRKELQCKPFRWYLEHVYPQLGIPERRNN 379
Query: 220 --------CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG---- 267
C+DS VGLY CH GGNQ W++++ GE++ + CL
Sbjct: 380 GSIRQGVYCLDSLGNVAG--AVVGLYSCHGNGGNQNWILNRKGELKHHDLCLTLVKFTIS 437
Query: 268 ---GDVILYPCHGSKGNQYFEYD 287
V++ C S+ Q+ D
Sbjct: 438 ARYNSVLMKYCDDSENQQWHLKD 460
>gi|338721407|ref|XP_001494570.3| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 4 [Equus caballus]
Length = 703
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 104/285 (36%), Positives = 146/285 (51%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ ++++ + VV P+I I +TFE G + IGGFDW L
Sbjct: 355 CECNSGWLEPLLERISKDETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 408
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R K+ +P+ +PTMAGGLF++ K +FE LGTYD+G ++WGGENLELSF
Sbjct: 409 FQWHSVPKHERDRRKSRIDPISSPTMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSF 468
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 469 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 517
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
+ K +GD++ RK LR+ L CKSF WYL+ V D W G M
Sbjct: 518 HFYNRNPPARKEAYGDISERKLLRKRLKCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSMG 577
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + EIR
Sbjct: 578 IPSEC--LDYNAPDNNPTGANLSLFGCHGQGGNQFFEYTSKKEIR 620
>gi|350584684|ref|XP_003481802.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
1 [Sus scrofa]
gi|350596113|ref|XP_003360781.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Sus scrofa]
Length = 582
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 103/285 (36%), Positives = 146/285 (51%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A + + +V P+I I +TFE G + IGGFDW L
Sbjct: 234 CECNTGWLEPLLERIAEDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 287
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R K+ +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 288 FQWHSVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 347
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 348 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 396
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
+ K +GD++ RK LR LGCKSF WYL+ V D W G +
Sbjct: 397 HFYNRNPPARKEAYGDISERKLLRERLGCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSIG 456
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 457 ISSEC--LDYNSPENNPTGANLSLFGCHGQGGNQFFEYTSNREIR 499
>gi|66507571|ref|XP_394527.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Apis mellifera]
gi|380015445|ref|XP_003691712.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Apis florea]
Length = 571
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 110/311 (35%), Positives = 155/311 (49%), Gaps = 47/311 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A + + VV P+I I DTF+ L GGFDW+L
Sbjct: 229 CECNADWLEPLLERVAEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWSLV 281
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + + ER+ R K+ + + TP +AGGLF I+KA+FEKLG YD+ D+WGGENLE+S
Sbjct: 282 FKWEYLSQTERQARQKDPTQAIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEIS 341
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D +
Sbjct: 342 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYSFPGGSGNVFARNTRRAAEVWMDD--Y 394
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------------SNDW 216
+ + L+ +G++ R EL+R L CK F WYL+ S
Sbjct: 395 KQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNVYPELVIPTSEGGPGGSLKQ 454
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD---YAGGDVILY 273
C+DS D + VGLYPCH GGNQ W ++K G IR CL YA G +L
Sbjct: 455 GSACLDSMGHLLDGN--VGLYPCHDTGGNQEWGLTKDGLIRHHGLCLTLPVYAKGTTLLM 512
Query: 274 P-CHGSKGNQY 283
C GS+ ++
Sbjct: 513 QICDGSENQKW 523
>gi|449683613|ref|XP_002154358.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Hydra magnipapillata]
Length = 641
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 113/322 (35%), Positives = 153/322 (47%), Gaps = 64/322 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRF---PPGRLTSSYKFFIGGFDW 60
CE W +PLL +A SS+VV P+I I DT + P R GGF W
Sbjct: 282 CETTPGWAEPLLARIAEKSSNVVVPIIEVINADTLQYAAAANPDQR---------GGFSW 332
Query: 61 NLQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 120
+L + W IP E+ K+ + + TPTMAGGLF+ID+ +F +GTYD DIWGGENLE
Sbjct: 333 DLFYKWKPIPLDEQHLRKSPIDVIRTPTMAGGLFAIDRKYFYDMGTYDEEMDIWGGENLE 392
Query: 121 LSFKF-----NWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDI 174
+SF+ IP R + P P ++K + L ++
Sbjct: 393 MSFRIWMCGGRIDIIPCSRVGHIFRKFTSPYKFPD------GVEKTLSKNLNRL---AEV 443
Query: 175 WGGENLELSFK-------GDFGDVTSRKELRRNLGCKSFKWYL----------------- 210
W E EL ++ D+GD++ R LR L CKSFKWY+
Sbjct: 444 WLDEYKELYYQKRPQSKGKDYGDISQRLALRNKLNCKSFKWYIENIYPDVQLPDLYPPAR 503
Query: 211 -EVSNDWSGMCIDSACKPTDMH----KPVGLYPCHKQGGNQFWMMSKHGEIRRDEA-CLD 264
E+ N S C+DS DM K +G++PCH QGGNQ ++ S+ GEI DE CLD
Sbjct: 504 GEIKNPASSYCLDSM---GDMKGNNVKKLGIFPCHGQGGNQNFVFSRKGEIVFDEEYCLD 560
Query: 265 YA----GGDVILYPCHGSKGNQ 282
+ G + + CH GNQ
Sbjct: 561 VSSSKPGVLIDIMKCHNFGGNQ 582
>gi|449493914|ref|XP_004175359.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 12 [Taeniopygia
guttata]
Length = 594
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 110/323 (34%), Positives = 155/323 (47%), Gaps = 61/323 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A + VV P+I I +TFE L ++ + IGGFD L
Sbjct: 242 CECHEGWLEPLLARIAEEETAVVCPVIDVIDWNTFEY------LGNAGEPQIGGFDXRLV 295
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH+ PERE+KR K+ + + +PTMAGGLFS+ K +F+ LG+YD+G ++WGGENLE SF
Sbjct: 296 FTWHSTPEREQKRRKSKTDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSF 355
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAF--FEKLGTYDSGFDIWGGENLE 181
+ W + + + G +F + + L ++W E +
Sbjct: 356 RI-WQC----------GGSLEIHPCSHVGHVFPKQAPYSRAKALANSVRAAEVWMDEYKQ 404
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSG--------- 218
L + + +GDVT R+ LR L CK FKW+LE V D G
Sbjct: 405 LYYHRNPHARLEPYGDVTERRLLREKLKCKDFKWFLENVYPELHVPEDRPGFFGMLKNRG 464
Query: 219 ---MCIDSACKPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIRRDE------ACLDY 265
C D PT+ H+ G LYPCH G NQF+ + H EIR + A +D
Sbjct: 465 MENFCFDY--NPTNEHQITGQRVILYPCHGMGQNQFFEYTSHNEIRYNTRQPEVCAAVDS 522
Query: 266 AGGDVILYPC----HGSKGNQYF 284
+ +Y C H NQ F
Sbjct: 523 GTDYLTMYLCQENAHSVPENQKF 545
>gi|350584686|ref|XP_003481803.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
2 [Sus scrofa]
Length = 578
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 103/285 (36%), Positives = 146/285 (51%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A + + +V P+I I +TFE G + IGGFDW L
Sbjct: 230 CECNTGWLEPLLERIAEDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R K+ +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
+ K +GD++ RK LR LGCKSF WYL+ V D W G +
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLGCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSIG 452
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 453 ISSEC--LDYNSPENNPTGANLSLFGCHGQGGNQFFEYTSNREIR 495
>gi|359320847|ref|XP_532008.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Canis
lupus familiaris]
Length = 578
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 112/304 (36%), Positives = 154/304 (50%), Gaps = 52/304 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + S VV P+I I +TFE L + + IGGFDW L
Sbjct: 229 CECHEGWLEPLLQRIHEEESAVVCPVIDVIDWNTFEY------LGNPREPQIGGFDWRLV 282
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH +PERER R ++ + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 283 FTWHVVPERERMRMRSPIDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 342
Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S +KA L ++W + E
Sbjct: 343 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRNKA----LANSVRAAEVWMDDFKE 391
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMCIDSA 224
L + + FGDVT RK+LR L CK F+W+LE V D + GM +
Sbjct: 392 LYYHRNPHARLEPFGDVTERKQLRAKLQCKDFRWFLENVYPELHVPEDRPGFFGMLQNKG 451
Query: 225 CK-------PTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR----RDEACLDY-AGG 268
K P + ++ VG LY CH G NQF+ + EIR + EAC+ AG
Sbjct: 452 LKDYCFDYNPPNENQVVGYQVLLYICHGMGQNQFFEYTSQNEIRYNTHQPEACIAVDAGT 511
Query: 269 DVIL 272
DV++
Sbjct: 512 DVLV 515
>gi|90078941|dbj|BAE89150.1| unnamed protein product [Macaca fascicularis]
Length = 311
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 100/262 (38%), Positives = 136/262 (51%), Gaps = 39/262 (14%)
Query: 56 GGFDWNLQFNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIW 114
GGF+W L F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIW
Sbjct: 9 GGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIW 68
Query: 115 GGENLELSFKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSG 171
GGENLE+SF+ W E H TP T GG I +L
Sbjct: 69 GGENLEISFRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA----- 122
Query: 172 FDIWGGENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-------------- 210
++W E + K D+GD++SR LR L CK F WYL
Sbjct: 123 -EVWMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYF 181
Query: 211 ---EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA- 266
E+ N + C+D+ + + + VG++ CH GGNQ + + + EIR D+ CLD +
Sbjct: 182 SLGEIRNVETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 239
Query: 267 -GGDVILYPCHGSKGNQYFEYD 287
G V + CH KGNQ +EYD
Sbjct: 240 LNGPVTMLKCHHLKGNQLWEYD 261
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/73 (57%), Positives = 58/73 (79%), Gaps = 3/73 (4%)
Query: 114 WGGENLELSFKFNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
+GG N +L+F+ W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G
Sbjct: 8 YGGFNWKLNFR--WYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGM 65
Query: 173 DIWGGENLELSFK 185
DIWGGENLE+SF+
Sbjct: 66 DIWGGENLEISFR 78
>gi|291389706|ref|XP_002711427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Oryctolagus cuniculus]
Length = 579
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 103/285 (36%), Positives = 145/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + R+ + VV P+I I +TFE G + IGGFDW L
Sbjct: 231 CECNSGWLEPLLERIERDETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 284
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R K+ +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 285 FQWHSVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 344
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W + E
Sbjct: 345 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDDYKE 393
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
+ K D+GD++ RK LR L CKSF WYL+ V D W G
Sbjct: 394 HFYNRNPPARKEDYGDISERKLLRERLKCKSFDWYLKNVFSSLHVPEDRPGWHGAIRSKG 453
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 454 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 496
>gi|431909863|gb|ELK12965.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Pteropus alecto]
Length = 543
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 108/303 (35%), Positives = 148/303 (48%), Gaps = 51/303 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + S VV P+I I +TFE L +S + IGGFDW L
Sbjct: 193 CECHEGWLEPLLQRIHEEESAVVCPVIDVIDWNTFEY------LGNSGEPHIGGFDWRLV 246
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH +P RER R ++ + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 247 FTWHVVPTRERMRMRSPIDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 306
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS--GFDIWGGENLE 181
+ W + + G +F + K +S ++W E E
Sbjct: 307 RI-WQC----------GGTLEIHPCSHVGHVFPKQAPYSRKKALANSVRAAEVWMDEFKE 355
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMCIDSA 224
L + + FGDVT R++LR L CK FKW+LE V D + GM +
Sbjct: 356 LYYHRNPHARLEPFGDVTERRQLRAKLQCKDFKWFLETVYPELHVPEDRPGFFGMLQNRG 415
Query: 225 CK-------PTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR----RDEACLDYAGGD 269
K P + H G LY CH G NQF+ + EIR + EAC+ G
Sbjct: 416 LKDYCFDYNPPNEHDITGHQVLLYLCHGMGQNQFFEYTSQREIRYNTHQPEACIAVEAGT 475
Query: 270 VIL 272
IL
Sbjct: 476 DIL 478
>gi|340712798|ref|XP_003394942.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Bombus terrestris]
Length = 571
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 109/311 (35%), Positives = 156/311 (50%), Gaps = 47/311 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A + + VV P+I I DTF+ L GGFDW+L
Sbjct: 229 CECNADWLEPLLERVAEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWSLV 281
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + + ER+ R K+ + + TP +AGGLF I+KA+FEKLG YD+ D+WGGENLE+S
Sbjct: 282 FKWEYLSQTERQARQKDPTQAIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEIS 341
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D +
Sbjct: 342 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYSFPGGSGNVFARNTRRAAEVWMDD--Y 394
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------------SNDW 216
+ + L+ +G++ R EL+R L CK F WYL+ S
Sbjct: 395 KQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNVYPELVIPTSEGGPGGSLKQ 454
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD---YAGGDVILY 273
C+DS D + VGLYPCH GGNQ W ++K G I+ + CL YA G +L
Sbjct: 455 GTACLDSMGHLLDGN--VGLYPCHDTGGNQEWGLTKDGLIKHHDLCLTLPMYAKGTTLLM 512
Query: 274 P-CHGSKGNQY 283
C GS+ ++
Sbjct: 513 QICDGSENQKW 523
>gi|350409232|ref|XP_003488663.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Bombus impatiens]
Length = 571
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 109/311 (35%), Positives = 156/311 (50%), Gaps = 47/311 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A + + VV P+I I DTF+ L GGFDW+L
Sbjct: 229 CECNADWLEPLLERVAEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWSLV 281
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + + ER+ R K+ + + TP +AGGLF I+KA+FEKLG YD+ D+WGGENLE+S
Sbjct: 282 FKWEYLSQTERQARQKDPTQAIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEIS 341
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D +
Sbjct: 342 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYSFPGGSGNVFARNTRRAAEVWMDD--Y 394
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------------SNDW 216
+ + L+ +G++ R EL+R L CK F WYL+ S
Sbjct: 395 KQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNVYPELVIPTSEGGPGGSLKQ 454
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD---YAGGDVILY 273
C+DS D + VGLYPCH GGNQ W ++K G I+ + CL YA G +L
Sbjct: 455 GTACLDSMGHLLDGN--VGLYPCHDTGGNQEWGLTKDGLIKHHDLCLTLPVYAKGTTLLM 512
Query: 274 P-CHGSKGNQY 283
C GS+ ++
Sbjct: 513 QICDGSENQKW 523
>gi|332020473|gb|EGI60888.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Acromyrmex
echinatior]
Length = 442
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 109/311 (35%), Positives = 156/311 (50%), Gaps = 47/311 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A + + VV P+I I DTF+ L GGFDW+L
Sbjct: 100 CECNADWLEPLLERVAEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWSLV 152
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + + ER+ R K+ + + TP +AGGLF I+KA+FEKLG YD+ D+WGGENLE+S
Sbjct: 153 FKWEYLSQTERQARQKDPTQAIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEIS 212
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D +
Sbjct: 213 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYSFPGGSGNVFARNTRRAAEVWMDD--Y 265
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------------SNDW 216
+ + L+ +G++ R EL+R L CK F WYL+ S
Sbjct: 266 KQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNVYPELVIPTSEGGPGGSLKQ 325
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD---YAGGDVILY 273
C+DS D + VGLYPCH GGNQ W ++K G I+ + CL YA G +L
Sbjct: 326 GTACLDSMGHLLDGN--VGLYPCHDTGGNQEWGLTKDGLIKHHDLCLTLPVYAKGTTLLM 383
Query: 274 P-CHGSKGNQY 283
C GS+ ++
Sbjct: 384 QICDGSENQKW 394
>gi|326670821|ref|XP_003199296.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Danio rerio]
Length = 435
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 104/301 (34%), Positives = 146/301 (48%), Gaps = 43/301 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A NSS VVSP I I +TFE P + G FDW L
Sbjct: 84 CECFHGWLEPLLARIAENSSAVVSPDITTIDLNTFEFMKPSPYGQHHNR---GNFDWGLS 140
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P+ ER+R K+ P+ TPT AGGLFSI + +F +G+YD +IWGGEN+E+SF
Sbjct: 141 FGWETLPDHERRRRKDETYPIKTPTFAGGLFSISRDYFYHIGSYDEEMEIWGGENIEMSF 200
Query: 124 KF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
+ IP R + H P T +A + + + + Y
Sbjct: 201 RVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTQVIARNQVRLAEVWMDD---YKEI 254
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
F + +++ + FGDV+ R +LR L CKSF WYL+ +
Sbjct: 255 FYRRNQQAAQIAKEHSFGDVSRRVDLRERLQCKSFSWYLKNVYPEVFMPDLNPLQFGAIR 314
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDV 270
N C+D + + KP+ +YPCH GGNQ++ S H EIR + E CLD G +
Sbjct: 315 NMGKEACLDVG-ESNEGGKPLIMYPCHGMGGNQYFEYSTHHEIRHNIQKELCLDGTDGAM 373
Query: 271 I 271
+
Sbjct: 374 V 374
>gi|449275388|gb|EMC84260.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Columba livia]
Length = 632
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 109/312 (34%), Positives = 154/312 (49%), Gaps = 40/312 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N VVSP IA+I +TFE P + G FDW+L
Sbjct: 279 CECFYGWLEPLLARIAENPVAVVSPDIASIDLNTFEFTKPS---PYGHGHNRGNFDWSLS 335
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E KR K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 336 FGWESLPKHENKRRKDETYPIRTPTFAGGLFSISKDYFEHIGSYDEEMEIWGGENIEMSF 395
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T + + + + ++ Y F
Sbjct: 396 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVITRNQVRLAEVWMDE---YKEIFY 451
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
E ++ + FGD++ R +LR+ L CK+F WYL + N
Sbjct: 452 RRNTEAAKIVKQKTFGDISKRLDLRQRLQCKNFTWYLSNVYPEAYVPDLNPLFSGYLKNT 511
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+ MC+D + KP+ +Y CH GGNQ++ S H EIR + E CL + G V L
Sbjct: 512 GNRMCLDVG-ENNHGGKPLIMYSCHGLGGNQYFEYSAHHEIRHNIQKELCLHASKGPVQL 570
Query: 273 YPCHGSKGNQYF 284
CH KG + F
Sbjct: 571 RECH-YKGQKTF 581
>gi|395824312|ref|XP_003785413.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Otolemur garnettii]
Length = 508
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 110/304 (36%), Positives = 148/304 (48%), Gaps = 57/304 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + S VV P+I I +TFE L +S + IGGFDW L
Sbjct: 158 CECHEGWLEPLLQRIHEEESAVVCPVIDVIDWNTFEY------LGNSGEPQIGGFDWRLV 211
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH +PERER+R K+ + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 212 FTWHTVPERERQRMKSPIDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 271
Query: 124 KFNWHAIPERERK--RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S +KA L ++W + E
Sbjct: 272 RI-WQCGGSLETHPCSHVGHVFPKQAP------YSRNKA----LANSVRAAEVWMDDYKE 320
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPV 234
L + + FGDVT R++LR L CK FKW+LE ++H P
Sbjct: 321 LYYHRNPRARLEPFGDVTERRQLREKLQCKDFKWFLETVF-------------PELHVP- 366
Query: 235 GLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD--------VILYPCHGSKGNQYFEY 286
+ F M+ G + C DY D VILY CHG NQ+FEY
Sbjct: 367 ------EDRPGFFGMLQNKG---LKKYCFDYNPPDENQVAGRQVILYLCHGLGQNQFFEY 417
Query: 287 DYKY 290
+Y
Sbjct: 418 TSQY 421
>gi|395820104|ref|XP_003783415.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
[Otolemur garnettii]
Length = 582
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 103/285 (36%), Positives = 145/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + R+ + VV P+I I +TFE G + IGGFDW L
Sbjct: 234 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 287
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R K+ +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 288 FQWHSVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 347
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 348 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 396
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
+ K +GD++ RK LR+ L CKSF WYL+ V D W G
Sbjct: 397 HFYNRNPPARKETYGDISERKLLRQRLRCKSFDWYLKTVFPNLHVPEDRPGWHGAIRSSG 456
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 457 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 499
>gi|410910794|ref|XP_003968875.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Takifugu rubripes]
Length = 583
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 104/300 (34%), Positives = 152/300 (50%), Gaps = 55/300 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE W++PLL+ + NSS +V P+I I +TFE + + IGGFDW L
Sbjct: 236 CECVPGWIEPLLERIGENSSTIVCPVIDTIDWNTFEF------YMQTEEPMIGGFDWRLT 289
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++PERERKR K+ +P+ +PTMAGGLF+++K FFE LGTYD G ++WGGENLELSF
Sbjct: 290 FQWHSVPERERKRRKSPVDPIRSPTMAGGLFAVNKNFFEYLGTYDMGMEVWGGENLELSF 349
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK---LGTYDSGFDIWGGENL 180
+ W + + + G +F KA + + L ++W
Sbjct: 350 RV-WQC----------GGSLEIHPCSHVGHVFP-KKAPYARPNFLQNTVRAAEVWMDSYK 397
Query: 181 ELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSG-------M 219
+ + K +GD++ R LR L C+SF WYL+ + D +G +
Sbjct: 398 QHFYNRNPPARKETYGDISGRLLLRDKLKCQSFNWYLKNIYPDLHIPEDRAGWHGAVRHL 457
Query: 220 CIDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGG 268
I+S C D + P + L+ CH QGGNQ++ + EIR + E C + G
Sbjct: 458 GINSEC--LDYNAPEHSVTGAHLSLFGCHGQGGNQYFEYTSQKEIRFNTVTELCAEVVEG 515
>gi|383847543|ref|XP_003699412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Megachile rotundata]
Length = 571
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 108/311 (34%), Positives = 156/311 (50%), Gaps = 47/311 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A + + VV P+I I DTF+ + GGFDW+L
Sbjct: 229 CECNADWLEPLLERVAEDPTRVVCPVIDVISMDTFQY-------IGASADLRGGFDWSLV 281
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + + ER R K+ + + TP +AGGLF I+KA+FEKLG YD+ D+WGGENLE+S
Sbjct: 282 FKWEYLSQSERLARQKDPTQAIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEIS 341
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D +
Sbjct: 342 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYSFPGGSGNVFARNTRRAAEVWMDD--Y 394
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSG------- 218
+ + L+ +G++ R EL+R L CK F WYL+ + G
Sbjct: 395 KQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNVYPELVIPTSEGGPGGSLKQ 454
Query: 219 --MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD---YAGGDVILY 273
C+DS D + VGLYPCH GGNQ W ++K G I+ + CL YA G +L
Sbjct: 455 GPACLDSMGHLLDGN--VGLYPCHDTGGNQEWGLTKDGLIKHHDLCLTLPVYAKGTTLLM 512
Query: 274 P-CHGSKGNQY 283
C GS+ ++
Sbjct: 513 QICDGSENQKW 523
>gi|198474621|ref|XP_001356764.2| GA16973 [Drosophila pseudoobscura pseudoobscura]
gi|198138471|gb|EAL33829.2| GA16973 [Drosophila pseudoobscura pseudoobscura]
Length = 639
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 109/311 (35%), Positives = 148/311 (47%), Gaps = 46/311 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E ++WL+PLL+ + + S VV P+I I D F+ L GGFDWNL
Sbjct: 299 VECNEKWLEPLLERVREDPSRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 351
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ER RH + + TP +AGGLF IDKA+F KLG YD D+WGGENLE+S
Sbjct: 352 FKWEYLSPAERSVRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 411
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D
Sbjct: 412 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 466
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW--------------S 217
+ + L+ FG++ R L+ L CK FKWYLE V D S
Sbjct: 467 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEVGQFRQDS 524
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
C+D+ D VG++PCH GGNQ W SK GEI+ D+ CL G V+L
Sbjct: 525 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAYSKRGEIKHDDLCLTLVQFARGSQVVLK 582
Query: 274 PCHGSKGNQYF 284
C S+ ++
Sbjct: 583 ACDESENQRWI 593
>gi|195148230|ref|XP_002015077.1| GL19517 [Drosophila persimilis]
gi|194107030|gb|EDW29073.1| GL19517 [Drosophila persimilis]
Length = 638
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 109/311 (35%), Positives = 148/311 (47%), Gaps = 46/311 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E ++WL+PLL+ + + S VV P+I I D F+ L GGFDWNL
Sbjct: 298 VECNEKWLEPLLERVREDPSRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 350
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ER RH + + TP +AGGLF IDKA+F KLG YD D+WGGENLE+S
Sbjct: 351 FKWEYLSPAERSVRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 410
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D
Sbjct: 411 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 465
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW--------------S 217
+ + L+ FG++ R L+ L CK FKWYLE V D S
Sbjct: 466 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEVGQFRQDS 523
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
C+D+ D VG++PCH GGNQ W SK GEI+ D+ CL G V+L
Sbjct: 524 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAYSKRGEIKHDDLCLTLVQFARGSQVVLK 581
Query: 274 PCHGSKGNQYF 284
C S+ ++
Sbjct: 582 ACDESENQRWI 592
>gi|326922813|ref|XP_003207639.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 3-like [Meleagris
gallopavo]
Length = 632
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 109/314 (34%), Positives = 154/314 (49%), Gaps = 40/314 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A NS VVSP IA+I +TFE P + G FDW+L
Sbjct: 279 CECFYGWLEPLLARIAENSVAVVSPDIASIDLNTFEFSKPS---PYGHNHNRGNFDWSLS 335
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E KR K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 336 FGWESLPKYENKRRKDETYPIRTPTFAGGLFSISKKYFEHIGSYDDEMEIWGGENIEMSF 395
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T + + + + ++ Y F
Sbjct: 396 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVITRNQVRLAEVWMDE---YKEIFY 451
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
E ++ + FGD++ R LR+ L CK+F WYL + N
Sbjct: 452 RRNTEAAKIVKQKTFGDISKRLNLRQRLQCKNFTWYLNNVYPEVYVPDLNPLFSGYLKNI 511
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+ MC+D + KP+ +Y CH GGNQ++ S H EIR + E CL + G V L
Sbjct: 512 GNHMCLDVG-ENNHGGKPLIMYSCHGLGGNQYFEYSAHHEIRHNIQKELCLHASKGPVQL 570
Query: 273 YPCHGSKGNQYFEY 286
C KG + F +
Sbjct: 571 REC-SYKGQKIFAF 583
>gi|194761420|ref|XP_001962927.1| GF15680 [Drosophila ananassae]
gi|190616624|gb|EDV32148.1| GF15680 [Drosophila ananassae]
Length = 630
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 108/311 (34%), Positives = 147/311 (47%), Gaps = 46/311 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E +RWL+PLL+ + + + VV P+I I D F+ L GGFDWNL
Sbjct: 290 VECNERWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 342
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ER RH + + TP +AGGLF IDKA+F KLG YD D+WGGENLE+S
Sbjct: 343 FKWEYLSPSERAMRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 402
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D
Sbjct: 403 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 457
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDWSG------------- 218
+ + L+ FG++ R L+ L CK FKWYLE V D
Sbjct: 458 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEIGQFRQDG 515
Query: 219 -MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
C+D+ D VG++PCH GGNQ W SK GEI+ D+ CL G V+L
Sbjct: 516 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAFSKRGEIKHDDLCLTLVQFARGSQVVLK 573
Query: 274 PCHGSKGNQYF 284
C S+ ++
Sbjct: 574 ACDESENQRWI 584
>gi|432882423|ref|XP_004074023.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Oryzias latipes]
Length = 584
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 107/295 (36%), Positives = 145/295 (49%), Gaps = 37/295 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE W++PLL+ +A N+S +V P+I I ++FE G + IGGFDW L
Sbjct: 236 CECVPGWIEPLLERIAENASTIVCPVIDTIDWNSFEFYMQTG------EPMIGGFDWRLT 289
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++PE ERKR K+ +P +PTMAGGLF++ K +FE LGTYD G ++WGGENLELSF
Sbjct: 290 FQWHSVPESERKRRKSRTDPFRSPTMAGGLFAVSKVYFEYLGTYDMGMEVWGGENLELSF 349
Query: 124 KFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENL 180
+ W E H P P L + +A + +Y F
Sbjct: 350 RV-WQCGGSLEIHPCSHVGHVFPKKAPYARPNFLQNTVRAAEVWMDSYKHHF----YNRN 404
Query: 181 ELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSGMC----IDSACK 226
+ K ++GD+T R +LR L C SF WYL E W G I S C
Sbjct: 405 PPAKKENYGDITERLQLRERLKCNSFDWYLKNIYPELHVPEDREGWHGAIRSSGIQSECL 464
Query: 227 PTDM--HKPVG----LYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+ H P G L+ CH QGGNQ++ + EIR + E C + G +
Sbjct: 465 DYNAPDHNPTGAHLSLFGCHGQGGNQYFEYTSQKEIRFNSVTELCAEVLDGQTSI 519
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 75/248 (30%), Positives = 101/248 (40%), Gaps = 99/248 (39%)
Query: 109 SGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY 168
+G + GG + L+F+ WH++PE ERKR K+ +P +PTMAGGLF++ K +FE LGTY
Sbjct: 276 TGEPMIGGFDWRLTFQ--WHSVPESERKRRKSRTDPFRSPTMAGGLFAVSKVYFEYLGTY 333
Query: 169 DSG------------FDIWG-GENLEL--------------------------------- 182
D G F +W G +LE+
Sbjct: 334 DMGMEVWGGENLELSFRVWQCGGSLEIHPCSHVGHVFPKKAPYARPNFLQNTVRAAEVWM 393
Query: 183 -------------SFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTD 229
+ K ++GD+T R +LR L C SF WYL+
Sbjct: 394 DSYKHHFYNRNPPAKKENYGDITERLQLRERLKCNSFDWYLK------------------ 435
Query: 230 MHKPVGLYP-CHKQGGNQFWMMSKHGEIRR---DEACLDY-------AGGDVILYPCHGS 278
+YP H + W HG IR CLDY G + L+ CHG
Sbjct: 436 -----NIYPELHVPEDREGW----HGAIRSSGIQSECLDYNAPDHNPTGAHLSLFGCHGQ 486
Query: 279 KGNQYFEY 286
GNQYFEY
Sbjct: 487 GGNQYFEY 494
>gi|118093614|ref|XP_422023.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Gallus
gallus]
Length = 632
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 106/314 (33%), Positives = 153/314 (48%), Gaps = 40/314 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A NS VVSP IA+I +TFE P + G FDW+L
Sbjct: 279 CECFYGWLEPLLARIAENSVAVVSPDIASIDLNTFEFSKPS---PYGHNHNRGNFDWSLS 335
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E KR K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 336 FGWESLPKYENKRRKDETYPIRTPTFAGGLFSISKEYFEHIGSYDDEMEIWGGENIEMSF 395
Query: 124 KFNWH----------AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W ++ + P T + + + + ++ Y F
Sbjct: 396 RV-WQCGGLLEIMPCSVVGHVFRSKSPHTFPKGTQVITRNQVRLAEVWMDE---YKEIFY 451
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
E ++ + FGD++ R +LR+ L CK+F WYL + N
Sbjct: 452 RRNTEAAKIVKQKTFGDISKRLDLRQRLQCKNFTWYLNNVYPEVYVPDLNPLFSGYLKNV 511
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+ MC+D + KP+ +Y CH GGNQ++ S H EIR + E CL + G V L
Sbjct: 512 GNHMCLDVG-ENNHGGKPLIMYSCHGLGGNQYFEYSAHHEIRHNIQKELCLHASKGPVQL 570
Query: 273 YPCHGSKGNQYFEY 286
C KG + F +
Sbjct: 571 REC-SYKGQKIFAF 583
>gi|417411769|gb|JAA52311.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 582
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 145/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ ++ + + ++ P+I I +TFE G + IGGFDW L
Sbjct: 234 CECNSGWLEPLLERISEDETVIICPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 287
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R K+ +P+ +PTMAGGLF++ K +FE LGTYD+G ++WGGENLELSF
Sbjct: 288 FQWHSVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSF 347
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 348 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 396
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
+ K +GD++ RK LR L CKSF WYL+ V D W G M
Sbjct: 397 HFYNRNPPARKEAYGDISERKLLRERLKCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSMG 456
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 457 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 499
>gi|351709330|gb|EHB12249.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Heterocephalus
glaber]
Length = 582
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 146/285 (51%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + R+ + VV P+I I +TFE G + IGGFDW L
Sbjct: 234 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 287
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P++ER R + +P+ +PTMAGGLF++ K +FE LGTYD+G ++WGGENLELSF
Sbjct: 288 FQWHSVPKQERDRRTSRIDPIRSPTMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSF 347
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W + E
Sbjct: 348 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDDYKE 396
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
+ K +GD++ RK LR+ L CKSF WYL+ V D W G +
Sbjct: 397 HFYNRNPPARKEAYGDISERKLLRKQLRCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSLG 456
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 457 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 499
>gi|281346614|gb|EFB22198.1| hypothetical protein PANDA_015357 [Ailuropoda melanoleuca]
Length = 491
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 146/285 (51%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ ++++ + VV P+I I +TFE G + IGGFDW L
Sbjct: 169 CECNSGWLEPLLERISKDETTVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 222
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R K+ +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 223 FQWHSVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 282
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 283 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 331
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
+ K +GD++ RK LR L C+SF WYL+ V D W G M
Sbjct: 332 HFYNRNPPARKEAYGDISERKLLRERLKCQSFDWYLKNVFSNLHVPEDRPGWHGAVRSMG 391
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 392 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 434
>gi|322785490|gb|EFZ12159.1| hypothetical protein SINV_06585 [Solenopsis invicta]
Length = 466
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 112/328 (34%), Positives = 163/328 (49%), Gaps = 57/328 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFE-----LRFPPGRLTSSYKFFI--- 55
CE WL+PLL+ +A + + VV P+I I DTF+ LR R++ + + I
Sbjct: 100 CECNADWLEPLLERVAEDPTRVVCPVIDVISMDTFQYIEICLRCNLKRISETRRDKILFR 159
Query: 56 ---------GGFDWNLQFNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLG 105
GGFDW+L F W + + ER+ R K+ + + TP +AGGLF I+KA+FEKLG
Sbjct: 160 FLGASADLRGGFDWSLVFKWEYLSQGERQARQKDPTQSIRTPMIAGGLFVINKAYFEKLG 219
Query: 106 TYDSGFDIWGGENLELSFKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLF 155
YD+ D+WGGENLE+SF+ + IP RKRH P P +G +F
Sbjct: 220 KYDTQMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRH-----PYSFPGGSGNVF 274
Query: 156 SIDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV--- 212
+ + ++ D + + + L+ +G++ R EL+R L CK F WYL+
Sbjct: 275 ARNTRRAAEVWMDD--YKQFYYNAVPLARNIPYGNIQDRMELKRRLHCKPFSWYLKNVYP 332
Query: 213 -------------SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD 259
S C+DS D + VGLYPCH GGNQ W ++K G I+
Sbjct: 333 ELVIPTSEGGPGGSLKQGTACLDSMGHLLDGN--VGLYPCHDTGGNQEWGLTKDGLIKHH 390
Query: 260 EACLD---YAGGDVILYP-CHGSKGNQY 283
+ CL YA G +L C GS+ ++
Sbjct: 391 DLCLTLPVYAKGTTLLMQICDGSENQKW 418
>gi|328699727|ref|XP_001944936.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Acyrthosiphon pisum]
Length = 581
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 107/317 (33%), Positives = 156/317 (49%), Gaps = 47/317 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E WL+PLLD +A + + VV P+I I D F+ L GGFDWNL
Sbjct: 235 VECNVNWLEPLLDRVAEDPTRVVCPIIDVINMDNFQYIGASSELR-------GGFDWNLV 287
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + + R +R K+ P+ TP +AGGLF +DK +F KLGTYD +IWGGENLE+S
Sbjct: 288 FKWEYLSKEVRAQRQKDPTLPIRTPMIAGGLFVMDKDYFVKLGTYDKEMNIWGGENLEIS 347
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ + +
Sbjct: 348 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFAHNTRRAAEV--WMDQY 400
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE----------VSNDWSG---- 218
+ + LS FG++ R L++NLGCK FKWYL+ +++ G
Sbjct: 401 KRYYYNAVPLSRIVPFGNIADRLALKKNLGCKPFKWYLDNVYPELKLPATVDEFVGSIRQ 460
Query: 219 --MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGD-VIL 272
MC+D+ + K G++PCH GGNQ W + G I+ D CL DY+ +I+
Sbjct: 461 GYMCLDTL--ENQVGKTAGIFPCHDYGGNQEWTFTIGGSIKHDMMCLSPTDYSSMSLIIM 518
Query: 273 YPCHGSKGNQYFEYDYK 289
PC + F+ + K
Sbjct: 519 KPCDSTTDEWKFDENTK 535
>gi|307214182|gb|EFN89299.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Harpegnathos
saltator]
Length = 442
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 108/311 (34%), Positives = 156/311 (50%), Gaps = 47/311 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE W++PLL+ +A + + VV P+I I DTF+ L GGFDW+L
Sbjct: 100 CECNADWIEPLLERVAEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWSLV 152
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + + ER+ R K+ + + TP +AGGLF I+KA+FEKLG YD+ D+WGGENLE+S
Sbjct: 153 FKWEYLSQIERQARQKDPTQAIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEIS 212
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D +
Sbjct: 213 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYSFPGGSGNVFARNTRRAAEVWMDD--Y 265
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------------SNDW 216
+ + L+ +G++ R EL+R L CK F WYL+ S
Sbjct: 266 KQFYYNAVPLARNIPYGNIQDRMELKRRLHCKPFSWYLKNVYPELVIPTSEGGPGGSLKQ 325
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD---YAGGDVILY 273
C+DS D + VGLYPCH GGNQ W ++K G I+ + CL YA G +L
Sbjct: 326 GTACLDSMGHLLDGN--VGLYPCHDTGGNQEWGLTKDGLIKHHDLCLTLPVYAKGTTLLM 383
Query: 274 P-CHGSKGNQY 283
C GS+ ++
Sbjct: 384 QICDGSENQKW 394
>gi|301780762|ref|XP_002925798.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Ailuropoda melanoleuca]
Length = 578
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 146/285 (51%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ ++++ + VV P+I I +TFE G + IGGFDW L
Sbjct: 230 CECNSGWLEPLLERISKDETTVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R K+ +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
+ K +GD++ RK LR L C+SF WYL+ V D W G M
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLKCQSFDWYLKNVFSNLHVPEDRPGWHGAVRSMG 452
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495
>gi|395519661|ref|XP_003763961.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Sarcophilus harrisii]
Length = 631
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 108/317 (34%), Positives = 152/317 (47%), Gaps = 40/317 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P Y G FDW+L
Sbjct: 278 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPS---PYGYNHNRGNFDWSLS 334
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++PE ER+R K+ P+ TPT AGGLFSI K +FE +GTYD IWGGEN+E+SF
Sbjct: 335 FGWESLPEHERQRRKDETYPIRTPTFAGGLFSISKEYFEYIGTYDEEMKIWGGENIEMSF 394
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ + F
Sbjct: 395 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---FKEIFY 450
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
E ++ + FGD++ R E++ L CK+F WYL + N
Sbjct: 451 RRNTEAAKIVKQKTFGDISKRLEIKHRLQCKNFTWYLNNVYPEIYVPDLNPVISGYIQNK 510
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR---RDEACLDYAGGDVIL 272
+C+D + KP+ +Y CH GGNQ++ S EIR + E CL G V L
Sbjct: 511 GRHLCLDVG-ENNLGGKPLIMYTCHGLGGNQYFEYSAQHEIRHSIQQELCLHAVQGPVQL 569
Query: 273 YPCHGSKGNQYFEYDYK 289
C KG + D +
Sbjct: 570 NTC-SYKGQKTLTIDVQ 585
>gi|345488662|ref|XP_003425959.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Nasonia vitripennis]
Length = 572
Score = 156 bits (395), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 107/311 (34%), Positives = 156/311 (50%), Gaps = 47/311 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A + S VV P+I I D F+ + GGFDW+L
Sbjct: 230 CECNADWLEPLLERVAEDPSRVVCPVIDVISMDNFQY-------IGASADLRGGFDWSLV 282
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + + ER+ R K+ + + TP +AGGLF I+KA+FEKLG YD+ D+WGGENLE+S
Sbjct: 283 FKWEYLSQSERQARQKDPTQAIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEIS 342
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D +
Sbjct: 343 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYSFPGGSGNVFARNTRRAAEVWMDD--Y 395
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSG------- 218
+ + L+ +G++ R EL+R L CK F WYL+ + G
Sbjct: 396 KQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKHVYPELIIPTSEGGPGGSLKQ 455
Query: 219 --MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD---YA-GGDVIL 272
C+DS D + VGLYPCH GGNQ W M+ G I+ + CL YA G +++
Sbjct: 456 GTACLDSMGHLLDGN--VGLYPCHDTGGNQEWGMTNDGLIKHHDLCLTLPVYAKGTSLLM 513
Query: 273 YPCHGSKGNQY 283
C GS+ ++
Sbjct: 514 QICDGSENQKW 524
>gi|195386226|ref|XP_002051805.1| GJ10330 [Drosophila virilis]
gi|194148262|gb|EDW63960.1| GJ10330 [Drosophila virilis]
Length = 631
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 108/314 (34%), Positives = 148/314 (47%), Gaps = 46/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E ++WL+PLL+ + + + VV P+I I D F+ L GGFDWNL
Sbjct: 291 VECNEQWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 343
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ER RH + + TP +AGGLF IDKA+F KLG YD D+WGGENLE+S
Sbjct: 344 FKWEYLSPTERAARHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 403
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D
Sbjct: 404 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 458
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDWSG------------- 218
+ + L+ FG++ R L+ L CK FKWYLE V D
Sbjct: 459 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPEPQEVGQFRQDT 516
Query: 219 -MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
C+D+ D VGL+PCH GGNQ W SK GEI+ D+ CL G V+L
Sbjct: 517 TECLDTMGHVID--GTVGLFPCHNTGGNQEWAYSKRGEIKHDDLCLTLVQFARGSQVVLK 574
Query: 274 PCHGSKGNQYFEYD 287
C ++ ++ D
Sbjct: 575 SCDDTENQRWIMRD 588
>gi|7657112|ref|NP_056552.1| polypeptide N-acetylgalactosaminyltransferase 4 [Mus musculus]
gi|51315802|sp|O08832.1|GALT4_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
AltName: Full=Polypeptide GalNAc transferase 4;
Short=GalNAc-T4; Short=pp-GaNTase 4; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 4;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 4
gi|2121220|gb|AAB58301.1| polypeptide GalNAc transferase-T4 [Mus musculus]
gi|26329157|dbj|BAC28317.1| unnamed protein product [Mus musculus]
gi|34786032|gb|AAH57882.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 [Mus musculus]
gi|74140684|dbj|BAE31844.1| unnamed protein product [Mus musculus]
gi|74195122|dbj|BAE28303.1| unnamed protein product [Mus musculus]
gi|148689697|gb|EDL21644.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 [Mus musculus]
Length = 578
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 145/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ ++R+ + +V P+I I +TFE G + IGGFDW L
Sbjct: 230 CECNTGWLEPLLERISRDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R + +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKHERDRRTSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
+ K +GD++ RK LR L CKSF WYL+ V D W G M
Sbjct: 393 HFYNRNPPARKEAYGDLSERKLLRERLKCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSMG 452
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 453 ISSEC--LDYNAPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495
>gi|77736615|ref|NP_001020224.2| polypeptide N-acetylgalactosaminyltransferase 4 [Rattus norvegicus]
gi|76780269|gb|AAI05819.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Rattus
norvegicus]
gi|149067086|gb|EDM16819.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Rattus
norvegicus]
Length = 578
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 101/285 (35%), Positives = 145/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ ++R+ + +V P+I I +TFE G + IGGFDW L
Sbjct: 230 CECNTGWLEPLLERISRDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R + +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKHERDRRTSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W + E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDDYKE 392
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
+ K +GD++ RK LR L CKSF WYL+ V D W G M
Sbjct: 393 HFYNRNPPARKETYGDISERKLLRERLQCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSMG 452
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 453 ISSEC--LDYNAPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495
>gi|410965222|ref|XP_003989149.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Felis
catus]
Length = 582
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 101/285 (35%), Positives = 145/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + ++ + +V P+I I +TFE G + IGGFDW L
Sbjct: 234 CECNSGWLEPLLERIGKDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 287
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R K+ +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 288 FQWHSVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 347
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W + E
Sbjct: 348 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDQYKE 396
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
+ K +GD++ RK LR L CKSF WYL+ V D W G M
Sbjct: 397 HFYNRNPPARKEAYGDISERKLLRERLKCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSMG 456
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 457 ISSEC--LDYNSPDSNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 499
>gi|68392893|ref|XP_688194.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Danio
rerio]
Length = 578
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 111/293 (37%), Positives = 143/293 (48%), Gaps = 54/293 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + S VV P+I I +TF+ PG IGGFDW L
Sbjct: 226 CECHEGWLEPLLQRIKEEPSAVVCPVIDVIDWNTFQYLGNPGEPQ------IGGFDWRLV 279
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH+IPE E+KR A + V +PTMAGGLF+++K +F LGTYD+G ++WGGENLE SF
Sbjct: 280 FTWHSIPEHEQKRRSAATDVVRSPTMAGGLFAVNKKYFLYLGTYDTGMEVWGGENLEFSF 339
Query: 124 KFNWHAIPERERK--RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S +KA L ++W + E
Sbjct: 340 RI-WQCGGSLEIHPCSHVGHVFPKKAP------YSRNKA----LANSVRAAEVWMDDFKE 388
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYL-------EVSNDWSGM-------- 219
+ + +GDVT R++LR L CK F+W+L +V D GM
Sbjct: 389 VYYHRSPHARLEAYGDVTDRRKLRMRLRCKDFRWFLDNIYPDIQVPEDKPGMFGMLKNKG 448
Query: 220 ----CIDSACKPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR---RDEA 261
C D P D HK G LYPCH G NQF+ S EIR RD A
Sbjct: 449 MTNYCFDY--NPPDEHKIAGHRVILYPCHGMGQNQFFEYSTLQEIRYNTRDPA 499
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 75/229 (32%), Positives = 100/229 (43%), Gaps = 90/229 (39%)
Query: 125 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG------------F 172
F WH+IPE E+KR A + V +PTMAGGLF+++K +F LGTYD+G F
Sbjct: 280 FTWHSIPEHEQKRRSAATDVVRSPTMAGGLFAVNKKYFLYLGTYDTGMEVWGGENLEFSF 339
Query: 173 DIWG-GENLEL----------------------------------SFKG----------- 186
IW G +LE+ FK
Sbjct: 340 RIWQCGGSLEIHPCSHVGHVFPKKAPYSRNKALANSVRAAEVWMDDFKEVYYHRSPHARL 399
Query: 187 -DFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGN 245
+GDVT R++LR L CK F+W+L+ N + + + P D KP G++ K G
Sbjct: 400 EAYGDVTDRRKLRMRLRCKDFRWFLD--NIYPDIQV-----PED--KP-GMFGMLKNKG- 448
Query: 246 QFWMMSKHGEIRRDEACLDY--------AGGDVILYPCHGSKGNQYFEY 286
M+ + C DY AG VILYPCHG NQ+FEY
Sbjct: 449 ----MTNY--------CFDYNPPDEHKIAGHRVILYPCHGMGQNQFFEY 485
>gi|195032291|ref|XP_001988471.1| GH11183 [Drosophila grimshawi]
gi|193904471|gb|EDW03338.1| GH11183 [Drosophila grimshawi]
Length = 640
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 107/310 (34%), Positives = 147/310 (47%), Gaps = 46/310 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E ++WL+PLL+ + + + VV P+I I D F+ L GGFDWNL
Sbjct: 300 VECNEQWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 352
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ER RH + + TP +AGGLF IDKA+F KLG YD D+WGGENLE+S
Sbjct: 353 FKWEYLSASERTARHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 412
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D
Sbjct: 413 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 467
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDWSG------------- 218
+ + L+ FG++ R L+ L CK FKWYLE V D
Sbjct: 468 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEVGQFRQDM 525
Query: 219 -MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
C+D+ D VGL+PCH GGNQ W SK GEI+ D+ CL G V+L
Sbjct: 526 TECLDTMGHLVD--GTVGLFPCHNTGGNQEWAYSKRGEIKHDDLCLTLVQFSRGSQVVLK 583
Query: 274 PCHGSKGNQY 283
C ++ ++
Sbjct: 584 SCDDTENQRW 593
>gi|224054950|ref|XP_002197786.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Taeniopygia guttata]
Length = 631
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 109/312 (34%), Positives = 154/312 (49%), Gaps = 40/312 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N VVSP IA+I +TFE P S + G FDW+L
Sbjct: 278 CECFYGWLEPLLARIAENPVAVVSPDIASIDLNTFEFSKPSPYGHSHNR---GNFDWSLS 334
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E KR K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 335 FGWESLPKHENKRRKDETYPIRTPTFAGGLFSISKDYFEYIGSYDEEMEIWGGENIEMSF 394
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T + + + + ++ Y F
Sbjct: 395 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVITRNQVRLAEVWMDE---YKEIFY 450
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
E ++ + FGD++ R +LR+ L CK+F WYL + N
Sbjct: 451 RRNTEAAKIVKQKTFGDISKRIDLRQRLQCKNFTWYLSNVYPEAYVPDLNPLFSGYLKNI 510
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+ MC+D + KP+ +Y CH GGNQ++ S H EIR + E CL + G V L
Sbjct: 511 GNRMCLDVG-ENNHGGKPLIMYSCHGLGGNQYFEYSAHHEIRHNIQKELCLHASKGPVQL 569
Query: 273 YPCHGSKGNQYF 284
C KG + F
Sbjct: 570 REC-TYKGQKTF 580
>gi|195342262|ref|XP_002037720.1| GM18147 [Drosophila sechellia]
gi|194132570|gb|EDW54138.1| GM18147 [Drosophila sechellia]
Length = 606
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 107/311 (34%), Positives = 147/311 (47%), Gaps = 46/311 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E + WL+PLL+ + + + VV P+I I D F+ L GGFDWNL
Sbjct: 266 VECNEMWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 318
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ER RH + + TP +AGGLF IDKA+F KLG YD D+WGGENLE+S
Sbjct: 319 FKWEYLSPSERAMRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 378
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D
Sbjct: 379 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 433
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW--------------S 217
+ + L+ FG++ R L+ L CK FKWYLE V D S
Sbjct: 434 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEVGQFRQDS 491
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
C+D+ D VG++PCH GGNQ W +K GEI+ D+ CL G V+L
Sbjct: 492 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCLTLVTFARGSQVVLK 549
Query: 274 PCHGSKGNQYF 284
C S+ ++
Sbjct: 550 ACDDSENQRWI 560
>gi|195471053|ref|XP_002087820.1| GE14879 [Drosophila yakuba]
gi|194173921|gb|EDW87532.1| GE14879 [Drosophila yakuba]
Length = 634
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 107/311 (34%), Positives = 147/311 (47%), Gaps = 46/311 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E + WL+PLL+ + + + VV P+I I D F+ L GGFDWNL
Sbjct: 294 VECNEMWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 346
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ER RH + + TP +AGGLF IDKA+F KLG YD D+WGGENLE+S
Sbjct: 347 FKWEYLSPSERAMRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 406
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D
Sbjct: 407 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 461
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW--------------S 217
+ + L+ FG++ R L+ L CK FKWYLE V D S
Sbjct: 462 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEVGQFRQDS 519
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
C+D+ D VG++PCH GGNQ W +K GEI+ D+ CL G V+L
Sbjct: 520 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCLTLVTFARGSQVVLK 577
Query: 274 PCHGSKGNQYF 284
C S+ ++
Sbjct: 578 ACDDSENQRWI 588
>gi|62484229|ref|NP_608773.2| polypeptide GalNAc transferase 2, isoform A [Drosophila
melanogaster]
gi|320594323|ref|NP_995625.2| polypeptide GalNAc transferase 2, isoform B [Drosophila
melanogaster]
gi|195576320|ref|XP_002078024.1| GD22759 [Drosophila simulans]
gi|51315875|sp|Q6WV19.2|GALT2_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
Short=pp-GaNTase 2; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 2; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 2
gi|61678274|gb|AAF51113.3| polypeptide GalNAc transferase 2, isoform A [Drosophila
melanogaster]
gi|194190033|gb|EDX03609.1| GD22759 [Drosophila simulans]
gi|318068299|gb|AAS64620.2| polypeptide GalNAc transferase 2, isoform B [Drosophila
melanogaster]
Length = 633
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 107/311 (34%), Positives = 147/311 (47%), Gaps = 46/311 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E + WL+PLL+ + + + VV P+I I D F+ L GGFDWNL
Sbjct: 293 VECNEMWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 345
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ER RH + + TP +AGGLF IDKA+F KLG YD D+WGGENLE+S
Sbjct: 346 FKWEYLSPSERAMRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 405
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D
Sbjct: 406 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 460
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW--------------S 217
+ + L+ FG++ R L+ L CK FKWYLE V D S
Sbjct: 461 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEVGQFRQDS 518
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
C+D+ D VG++PCH GGNQ W +K GEI+ D+ CL G V+L
Sbjct: 519 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCLTLVTFARGSQVVLK 576
Query: 274 PCHGSKGNQYF 284
C S+ ++
Sbjct: 577 ACDDSENQRWI 587
>gi|194855488|ref|XP_001968556.1| GG24441 [Drosophila erecta]
gi|190660423|gb|EDV57615.1| GG24441 [Drosophila erecta]
Length = 631
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 107/311 (34%), Positives = 147/311 (47%), Gaps = 46/311 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E + WL+PLL+ + + + VV P+I I D F+ L GGFDWNL
Sbjct: 291 VECNEMWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 343
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ER RH + + TP +AGGLF IDKA+F KLG YD D+WGGENLE+S
Sbjct: 344 FKWEYLSPSERAMRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 403
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D
Sbjct: 404 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 458
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW--------------S 217
+ + L+ FG++ R L+ L CK FKWYLE V D S
Sbjct: 459 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEVGQFRQDS 516
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
C+D+ D VG++PCH GGNQ W +K GEI+ D+ CL G V+L
Sbjct: 517 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCLTLVTFARGSQVVLK 574
Query: 274 PCHGSKGNQYF 284
C S+ ++
Sbjct: 575 ACDDSENQRWI 585
>gi|332839987|ref|XP_003313889.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pan
troglodytes]
gi|397505857|ref|XP_003823459.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pan
paniscus]
gi|410207422|gb|JAA00930.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252142|gb|JAA14038.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252144|gb|JAA14039.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252146|gb|JAA14040.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252148|gb|JAA14041.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252150|gb|JAA14042.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410289758|gb|JAA23479.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410355493|gb|JAA44350.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410355495|gb|JAA44351.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
Length = 578
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + R+ + VV P+I I +TFE G + IGGFDW L
Sbjct: 230 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P++ER R + +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
+ K +GD++ RK LR L CKSF WYL+ V D W G
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495
>gi|33589464|gb|AAQ22499.1| RE02655p [Drosophila melanogaster]
Length = 633
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 107/311 (34%), Positives = 147/311 (47%), Gaps = 46/311 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E + WL+PLL+ + + + VV P+I I D F+ L GGFDWNL
Sbjct: 293 VECNEMWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 345
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ER RH + + TP +AGGLF IDKA+F KLG YD D+WGGENLE+S
Sbjct: 346 FKWEYLSPSERAMRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 405
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D
Sbjct: 406 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 460
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW--------------S 217
+ + L+ FG++ R L+ L CK FKWYLE V D S
Sbjct: 461 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEVGQFRQDS 518
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
C+D+ D VG++PCH GGNQ W +K GEI+ D+ CL G V+L
Sbjct: 519 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCLTLVTFARGSQVVLK 576
Query: 274 PCHGSKGNQYF 284
C S+ ++
Sbjct: 577 ACDDSENQRWI 587
>gi|34042922|gb|AAQ56700.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 615
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 107/311 (34%), Positives = 147/311 (47%), Gaps = 46/311 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E + WL+PLL+ + + + VV P+I I D F+ L GGFDWNL
Sbjct: 275 VECNEMWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 327
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ER RH + + TP +AGGLF IDKA+F KLG YD D+WGGENLE+S
Sbjct: 328 FKWEYLSPSERAMRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 387
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D
Sbjct: 388 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 442
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW--------------S 217
+ + L+ FG++ R L+ L CK FKWYLE V D S
Sbjct: 443 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEVGQFRQDS 500
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
C+D+ D VG++PCH GGNQ W +K GEI+ D+ CL G V+L
Sbjct: 501 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCLTLVTFARGSQVVLK 558
Query: 274 PCHGSKGNQYF 284
C S+ ++
Sbjct: 559 ACDDSENQRWI 569
>gi|348585735|ref|XP_003478626.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Cavia porcellus]
Length = 568
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 108/322 (33%), Positives = 153/322 (47%), Gaps = 50/322 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 123 FKFNWHAIPERERKRHKNA------AEPVWTPTMAGGLFSID------------KAFFEK 164
F+ W E + A P P G + + + K FF
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYI 382
Query: 165 LGTYDSGF-----DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------- 211
+ F D++ GE + KE R L +++ Y+
Sbjct: 383 ISPAKCNFLTRDLDVFMGETDSDIVGTKY--TYKLKEERFVLSHRNYSPYIPSQGNMAKE 440
Query: 212 ----VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA- 266
+ N + C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD +
Sbjct: 441 KQSMIRNVETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSR 498
Query: 267 -GGDVILYPCHGSKGNQYFEYD 287
G VI+ CH +GNQ +EYD
Sbjct: 499 LNGPVIMLKCHHMRGNQLWEYD 520
>gi|327262105|ref|XP_003215866.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Anolis carolinensis]
Length = 575
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 108/320 (33%), Positives = 152/320 (47%), Gaps = 61/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +RWL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 231 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 283
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 284 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 343
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RK+H P P +G +F+ +
Sbjct: 344 FRVWQCGGSLEIIPCSRVGHVFRKQH-----PYTFPGGSGTVFARNTR---------RAA 389
Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV------------- 212
++W E + +G++ SR EL++ L CK FKWYLE
Sbjct: 390 EVWMDEYKNFYYAAVPSARNVPYGNIQSRLELKKRLNCKPFKWYLENVYPELRVPDHQDI 449
Query: 213 ---SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYA 266
+ C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A
Sbjct: 450 AFGALQQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRA 507
Query: 267 GGDVI-LYPCHGSKGNQYFE 285
G +I L C + G Q +E
Sbjct: 508 PGSLIKLQGCRENDGRQKWE 527
>gi|291290949|ref|NP_001167507.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Xenopus
laevis]
gi|83405263|gb|AAI10707.1| Unknown (protein for MGC:130697) [Xenopus laevis]
Length = 622
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 108/310 (34%), Positives = 153/310 (49%), Gaps = 49/310 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I ++FE P G+ S G FDW+
Sbjct: 270 CECFHGWLEPLLSRIAEDYTAVVSPDITTIDLNSFEFAKPVQYGKTHSR-----GNFDWS 324
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W AIPE E+ R KN P+ TPT AGGLFSI KA+FE +G+YD +IWGGEN+E+
Sbjct: 325 LTFGWEAIPEAEKLRRKNETYPIKTPTFAGGLFSISKAYFEHIGSYDEDMEIWGGENVEM 384
Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H + P T ++ + + + + Y
Sbjct: 385 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---SFPKGTQVISRNQVRLAEVWMDD---YK 438
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
+ + ++ + FGDV+ R +L+ +L CK+F WYLE
Sbjct: 439 IIYYRRNDQAAKMVKEKSFGDVSKRLKLKADLHCKNFTWYLENIYPELFVPDRDPTYSGA 498
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CL--DYA 266
V N+ + C+D + KP+ +YPCH GGNQ++ S H E+R + A CL Y
Sbjct: 499 VKNEGAQKCLDVG-ENNHGGKPLIMYPCHGMGGNQYFEYSTHKELRHNIAKQLCLRSKYG 557
Query: 267 GGDVILYPCH 276
G V L C
Sbjct: 558 PGQVELGECQ 567
>gi|189053556|dbj|BAG35722.1| unnamed protein product [Homo sapiens]
Length = 578
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + R+ + VV P+I I +TFE G + IGGFDW L
Sbjct: 230 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P++ER R + +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
+ K +GD++ RK LR L CKSF WYL+ V D W G
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495
>gi|195435185|ref|XP_002065582.1| GK14594 [Drosophila willistoni]
gi|194161667|gb|EDW76568.1| GK14594 [Drosophila willistoni]
Length = 635
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 107/310 (34%), Positives = 147/310 (47%), Gaps = 46/310 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E ++WL+PLL+ + + + VV P+I I D F+ L GGFDWNL
Sbjct: 295 VECNEQWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 347
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ER RH + + TP +AGGLF IDKA+F KLG YD D+WGGENLE+S
Sbjct: 348 FKWEYLSPAERSVRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 407
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D
Sbjct: 408 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 462
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDWSG------------- 218
+ + L+ FG++ R L+ L CK FKWYLE V D
Sbjct: 463 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPEPQEIGQFRQDG 520
Query: 219 -MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
C+D+ D VG++PCH GGNQ W SK GEI+ D+ CL G V+L
Sbjct: 521 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAYSKRGEIKHDDLCLTLVQFSRGSQVVLK 578
Query: 274 PCHGSKGNQY 283
C S+ ++
Sbjct: 579 SCDDSENQRW 588
>gi|47226381|emb|CAG09349.1| unnamed protein product [Tetraodon nigroviridis]
Length = 631
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 105/306 (34%), Positives = 150/306 (49%), Gaps = 43/306 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A+N + VVSP I I +TFE P + + G FDW+L
Sbjct: 255 CECFNGWLEPLLARIAKNRTAVVSPDITTIDLNTFEFMKPSPYGQNHNR---GNFDWSLA 311
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E+KR K+ P+ TPT AGGLFSI K +F ++G+YD +IWGGEN+E+SF
Sbjct: 312 FGWESLPDHEKKRRKDETYPIKTPTFAGGLFSISKDYFYQIGSYDKHMEIWGGENIEMSF 371
Query: 124 KF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
+ IP R + H + P T ++ + + + + Y
Sbjct: 372 RVWQCGGQLEIIPCSIVGHVFRTKSPH---SFPKGTQVISRNQVRLAEVWMDD---YKEI 425
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
F + +L+ FGD++ R + R L CKSF WYL+ V
Sbjct: 426 FYRRNQQAAQLARDKAFGDISERLDFRVRLRCKSFSWYLKNIYPEAFIPDLNPLSFGSVK 485
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDV 270
N C+D A + + K + +YPCH GGNQ++ S H EIR + E CL A G V
Sbjct: 486 NVGKDSCLD-AGENNEGGKKLIMYPCHGLGGNQYFEYSTHHEIRHNIQKELCLHGAAGAV 544
Query: 271 ILYPCH 276
L C
Sbjct: 545 RLEECQ 550
>gi|432098371|gb|ELK28171.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Myotis davidii]
Length = 633
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 107/303 (35%), Positives = 154/303 (50%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDMNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W A+P+ ER+R K+ P+ TPT AGGLFSI K +FE +GTYD +IWGGEN+E+SF
Sbjct: 337 FGWEALPDHERQRRKDETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSG----- 218
+ ++ + FGD++ R E++ L CK+F WYL +++ SG
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSF 512
Query: 219 ---MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ LY CH GGNQ++ S EIR + E CL A G V L
Sbjct: 513 GQPLCLDVG-ENNQGSKPLILYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAPGPVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KTC 574
>gi|148356242|ref|NP_001038243.2| polypeptide N-acetylgalactosaminyltransferase 4 precursor [Danio
rerio]
gi|60416047|gb|AAH90692.1| WD repeat domain 51B, like [Danio rerio]
gi|182890540|gb|AAI64662.1| Wdr51bl protein [Danio rerio]
Length = 582
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 106/291 (36%), Positives = 147/291 (50%), Gaps = 37/291 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE W++PLL+ +A N + ++ P+I I +TFE + + +GGFDW L
Sbjct: 235 CECVPGWIEPLLERIAENETTIICPVIDTIDWNTFEFYM------QTEEPMVGGFDWRLT 288
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WHA+PE +RK K+ +P+ +PTMAGGLF++ KA+FE LGTYD G ++WGGENLELSF
Sbjct: 289 FQWHAVPEIDRKIRKSRIDPIRSPTMAGGLFAVSKAYFEYLGTYDMGMEVWGGENLELSF 348
Query: 124 KFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENL 180
+ W E H P P + L + +A + TY F
Sbjct: 349 RV-WQCGGSLEIHPCSHVGHVFPKKAPYARSNFLQNTVRAAEVWMDTYKQHF----YNRN 403
Query: 181 ELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC----IDSACK 226
+ K +GD++ R LR L CKSF+WYL+ V D W G I S C
Sbjct: 404 PPARKESYGDISERIVLRNRLQCKSFEWYLQNVYPGLHVPEDRPGWHGAVRSAGIHSECL 463
Query: 227 PTDM--HKPVG----LYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGG 268
+ H P G L+ CH QGGNQ++ + EIR + E C + G
Sbjct: 464 DYNAPDHNPTGAHLSLFGCHGQGGNQYFEYTSQREIRFNSVTELCAEVQDG 514
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 71/233 (30%), Positives = 96/233 (41%), Gaps = 95/233 (40%)
Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG----------- 171
F WHA+PE +RK K+ +P+ +PTMAGGLF++ KA+FE LGTYD G
Sbjct: 287 LTFQWHAVPEIDRKIRKSRIDPIRSPTMAGGLFAVSKAYFEYLGTYDMGMEVWGGENLEL 346
Query: 172 -FDIWG-GENLEL----------------------------------------------S 183
F +W G +LE+ +
Sbjct: 347 SFRVWQCGGSLEIHPCSHVGHVFPKKAPYARSNFLQNTVRAAEVWMDTYKQHFYNRNPPA 406
Query: 184 FKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
K +GD++ R LR L CKSF+WYL+ N + G+ + P + G
Sbjct: 407 RKESYGDISERIVLRNRLQCKSFEWYLQ--NVYPGLHV----------------PEDRPG 448
Query: 244 GNQFWMMSKHGEIRR---DEACLDY-------AGGDVILYPCHGSKGNQYFEY 286
W HG +R CLDY G + L+ CHG GNQYFEY
Sbjct: 449 ----W----HGAVRSAGIHSECLDYNAPDHNPTGAHLSLFGCHGQGGNQYFEY 493
>gi|426373643|ref|XP_004053705.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Gorilla
gorilla gorilla]
Length = 578
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + R+ + VV P+I I +TFE G + IGGFDW L
Sbjct: 230 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P++ER R + +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLRNTARAAEVWMDEYKE 392
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
+ K +GD++ RK LR L CKSF WYL+ V D W G
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495
>gi|348513278|ref|XP_003444169.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
[Oreochromis niloticus]
Length = 584
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 103/277 (37%), Positives = 140/277 (50%), Gaps = 34/277 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE W++PLL+ ++ N+S +V P+I I +TFE + + IGGFDW L
Sbjct: 236 CECVPGWIEPLLERISENASTIVCPVIDTIDWNTFEF------YMQTDEPMIGGFDWRLT 289
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++PE ERKR K+ +P+ +PTMAGGLF++ KA+FE LGTYD G D+WGGENLELSF
Sbjct: 290 FQWHSVPEMERKRRKSRIDPIRSPTMAGGLFAVSKAYFEYLGTYDMGMDVWGGENLELSF 349
Query: 124 KFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENL 180
+ W E H P P L + +A + +Y F
Sbjct: 350 RV-WQCGGSLEIHPCSHVGHVFPKKAPYARPNFLQNTVRAAEVWMDSYKKHF----YNRN 404
Query: 181 ELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSGMC----IDSACK 226
+ K +G+++ R LR L C SF+WYL E W G I S C
Sbjct: 405 PPARKEKYGNISERLLLREKLKCNSFEWYLKNIYPELHVPEDREGWHGAVRSSGIHSECL 464
Query: 227 PTDM--HKPVG----LYPCHKQGGNQFWMMSKHGEIR 257
+ H P G L+ CH QGGNQ++ + EIR
Sbjct: 465 DYNAPEHSPTGSQLSLFGCHGQGGNQYFEYTSQKEIR 501
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 69/234 (29%), Positives = 93/234 (39%), Gaps = 97/234 (41%)
Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG----------- 171
F WH++PE ERKR K+ +P+ +PTMAGGLF++ KA+FE LGTYD G
Sbjct: 288 LTFQWHSVPEMERKRRKSRIDPIRSPTMAGGLFAVSKAYFEYLGTYDMGMDVWGGENLEL 347
Query: 172 -FDIWG-GENLEL----------------------------------------------S 183
F +W G +LE+ +
Sbjct: 348 SFRVWQCGGSLEIHPCSHVGHVFPKKAPYARPNFLQNTVRAAEVWMDSYKKHFYNRNPPA 407
Query: 184 FKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYP-CHKQ 242
K +G+++ R LR L C SF+WYL+ +YP H
Sbjct: 408 RKEKYGNISERLLLREKLKCNSFEWYLK-----------------------NIYPELHVP 444
Query: 243 GGNQFWMMSKHGEIRRD---EACLDY-------AGGDVILYPCHGSKGNQYFEY 286
+ W HG +R CLDY G + L+ CHG GNQYFEY
Sbjct: 445 EDREGW----HGAVRSSGIHSECLDYNAPEHSPTGSQLSLFGCHGQGGNQYFEY 494
>gi|315221121|ref|NP_001186710.1| POC1B-GALNT4 protein isoform 1 [Homo sapiens]
Length = 575
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + R+ + VV P+I I +TFE G + IGGFDW L
Sbjct: 227 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIG------EPMIGGFDWRLT 280
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P++ER R + +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 281 FQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 340
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 341 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 389
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
+ K +GD++ RK LR L CKSF WYL+ V D W G
Sbjct: 390 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 449
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 450 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 492
>gi|334348070|ref|XP_001368069.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Monodelphis domestica]
Length = 708
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 102/294 (34%), Positives = 150/294 (51%), Gaps = 53/294 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ + ++ S ++ P+I I +TF+ G + IGGFDW+L
Sbjct: 360 CECNQGWLEPLLERIGQDESVIICPVIDTIDWNTFDFYMQEG------EPVIGGFDWHLT 413
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +PE ER+R ++ +P+ +P MAGGLF++ K +FE LGTYD+G ++WGGENLELSF
Sbjct: 414 FQWQPVPEHERRRWQSRTDPIKSPVMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSF 473
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS--GFDIWGGENLE 181
+ W A + + G +F + ++ ++W + E
Sbjct: 474 RV-WQC----------GGALEIHPCSHVGHVFPKRAPYARPNFRQNTVRAAEVWMDDYKE 522
Query: 182 -------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
L+ K +GDV+ RK LR+ L CKSF WYL+ V D W G
Sbjct: 523 HFYNRNPLARKESYGDVSERKLLRKRLNCKSFDWYLKTVFPALRVPEDRPGWHGAIRSVG 582
Query: 221 IDSAC--------KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR---RDEACL 263
I S C PT+ H + L+ CH QGGNQF+ + E+R + E CL
Sbjct: 583 ISSECLDYKTPERDPTEAH--LSLFGCHGQGGNQFFEYTLKKELRFSVQTELCL 634
>gi|348513276|ref|XP_003444168.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Oreochromis niloticus]
Length = 575
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 113/306 (36%), Positives = 151/306 (49%), Gaps = 57/306 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+P+L + VV P+I I +TF+ L + + IGGFDW L
Sbjct: 223 CECHEGWLEPVLHRIKEEPKAVVCPVIDVIDWNTFQY------LGHAGEPQIGGFDWRLV 276
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH+IP+ E+KR ++ + + +PTMAGGLF++ K FF LGTYD+G ++WGGENLE SF
Sbjct: 277 FTWHSIPDYEQKRRRSPVDVIRSPTMAGGLFAVRKDFFHYLGTYDTGMEVWGGENLEFSF 336
Query: 124 KFNWHAIPERERK--RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S KA L ++W E E
Sbjct: 337 RI-WQCGGSLEVHPCSHVGHVFPKKAP------YSRSKA----LANSVRAAEVWLDEFKE 385
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSGM-------- 219
+ + + FGDVT R+ LR LGCKSFKWYL+ V +D GM
Sbjct: 386 IYYHRNPHARLEAFGDVTERRMLREKLGCKSFKWYLDNIYPDIHVPHDRPGMFGMLKNRG 445
Query: 220 ----CIDSACKPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEI----RRDEACLDYAG 267
C D PTD + VG LY CH G NQF+ S +GEI R C+ AG
Sbjct: 446 KTNYCFDY--NPTDENVVVGQRVILYLCHGMGQNQFFEYSVNGEICYNTREPAGCI--AG 501
Query: 268 GDVILY 273
++ Y
Sbjct: 502 DNISTY 507
>gi|440896822|gb|ELR48646.1| Polypeptide N-acetylgalactosaminyltransferase 4, partial [Bos
grunniens mutus]
Length = 566
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + ++ + V+ P+I I +TFE G + IGGFDW L
Sbjct: 218 CECNTGWLEPLLERIRKDETVVICPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 271
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R K+ EP +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 272 FQWHSVPKHERDRRKSRIEPFRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 331
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 332 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 380
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
+ K +GD++ RK LR L CKSF WYL+ V D W G +
Sbjct: 381 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFSTLHVPEDRPGWHGAIRSIG 440
Query: 221 IDSAC--------KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR 257
I S C PT + + L+ CH QGGNQF+ + + EIR
Sbjct: 441 ISSECLDYNAPDNNPTSAN--LSLFGCHGQGGNQFFEYTSNKEIR 483
>gi|46877109|ref|NP_644678.2| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Mus
musculus]
gi|51315867|sp|Q6PB93.1|GALT2_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
AltName: Full=Polypeptide GalNAc transferase 2;
Short=GalNAc-T2; Short=pp-GaNTase 2; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 2;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 2; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 2
soluble form
gi|37590571|gb|AAH59818.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 [Mus musculus]
Length = 570
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 148/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +RWL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 226 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 278
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 279 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 338
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 339 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 389
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ LGCK FKWYL+ +
Sbjct: 390 EYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 449
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D + G +I
Sbjct: 450 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLI 507
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 508 RLQGCRENDSRQKWE 522
>gi|13650039|gb|AAK37548.1| polypeptide GalNAc transferase-T2 [Mus musculus]
Length = 570
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 148/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +RWL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 226 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 278
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 279 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 338
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 339 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 389
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ LGCK FKWYL+ +
Sbjct: 390 EYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 449
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D + G +I
Sbjct: 450 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLI 507
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 508 RLQGCRENNSKQKWE 522
>gi|332221068|ref|XP_003259680.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
1 [Nomascus leucogenys]
Length = 578
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 101/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + R+ + +V P+I I +TFE G + IGGFDW L
Sbjct: 230 CECNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P++ER R + +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
+ K +GD++ RK LR L CKSF WYL+ V D W G
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIHSRG 452
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495
>gi|62148928|dbj|BAD93348.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase-4 [Rattus
norvegicus]
Length = 578
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 101/283 (35%), Positives = 145/283 (51%), Gaps = 46/283 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ ++R+ + +V P+I I +TFE G + IGGFDW L
Sbjct: 230 CECNTGWLEPLLERISRDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R + +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKHERDRRTSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +FS + L ++W + E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFSKRAPYARPNFLQNTAREAEVWMDDYKE 392
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
+ K + D++ RK LR L CKSF WYL+ V D W G M
Sbjct: 393 HFYNRNPPARKETYDDISERKLLRERLQCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSMG 452
Query: 221 IDSACKPTDM--HKPVG----LYPCHKQGGNQFWMMSKHGEIR 257
I S C + + P G L+ CH QGGNQF+ + + EIR
Sbjct: 453 ISSECLDYNAPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495
>gi|22137798|gb|AAH36390.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Homo
sapiens]
gi|123981562|gb|ABM82610.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
gi|123996387|gb|ABM85795.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
gi|124000643|gb|ABM87830.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
gi|157928222|gb|ABW03407.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
Length = 578
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + R+ + VV P+I I +TFE G + IGGFDW L
Sbjct: 230 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIG------EPMIGGFDWRLT 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P++ER R + +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
+ K +GD++ RK LR L CKSF WYL+ V D W G
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495
>gi|13938114|gb|AAH07172.1| Galnt2 protein, partial [Mus musculus]
Length = 526
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 103/315 (32%), Positives = 148/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +RWL+PLL+ +A + + VVSP+I I D F+ + GGFDWNL
Sbjct: 182 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 234
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 235 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 294
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 295 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 345
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ LGCK FKWYL+ +
Sbjct: 346 EYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 405
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D + G +I
Sbjct: 406 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLI 463
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 464 RLQGCRENDSRQKWE 478
>gi|34452725|ref|NP_003765.2| polypeptide N-acetylgalactosaminyltransferase 4 [Homo sapiens]
gi|338817878|sp|Q8N4A0.2|GALT4_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
AltName: Full=Polypeptide GalNAc transferase 4;
Short=GalNAc-T4; Short=pp-GaNTase 4; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 4;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 4
gi|119617834|gb|EAW97428.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Homo
sapiens]
Length = 578
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + R+ + VV P+I I +TFE G + IGGFDW L
Sbjct: 230 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIG------EPMIGGFDWRLT 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P++ER R + +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
+ K +GD++ RK LR L CKSF WYL+ V D W G
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495
>gi|395515411|ref|XP_003761898.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Sarcophilus harrisii]
Length = 590
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 108/309 (34%), Positives = 155/309 (50%), Gaps = 53/309 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + S VV P+I I +TFE L +S IGGFDW L
Sbjct: 241 CECHDGWLEPLLERIHEEESAVVCPVIDVIDWNTFEY------LGNSGDPQIGGFDWRLV 294
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++PE+E+KR ++ + + +PTMAGGLF+++K +FE LG+YD+G ++WGGENLE SF
Sbjct: 295 FTWHSVPEKEQKRRRSKIDVIRSPTMAGGLFAVNKRYFEYLGSYDTGMEVWGGENLEFSF 354
Query: 124 KFNWHAIPERERK--RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S KA L ++W E E
Sbjct: 355 RI-WQCGGSLEIHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDEFKE 403
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMCIDSA 224
+ + K +GD+T RKELR L CK F+W+LE + D + GM ++
Sbjct: 404 IYYHRNMHARKEPYGDITERKELRDKLKCKDFRWFLENVYPELHIPEDRPGYFGMLVNRG 463
Query: 225 CKPT--DMHKP---------VGLYPCHKQGGNQFWMMSKHGEI----RRDEAC--LDYAG 267
D + P V LY CH G NQF+ + H E+ R+ EAC +D
Sbjct: 464 MADYCFDYNPPSESEITGNQVILYLCHGMGQNQFFEYTSHNELRYNTRQPEACAAVDVGT 523
Query: 268 GDVILYPCH 276
+ ++ C+
Sbjct: 524 DHLTMHLCY 532
>gi|426224267|ref|XP_004006295.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Ovis
aries]
Length = 582
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + ++ + V+ P+I I +TFE G + IGGFDW L
Sbjct: 234 CECNTGWLEPLLERIHKDETVVICPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 287
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R K+ EP +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 288 FQWHSVPKHERDRRKSRIEPFRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 347
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 348 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 396
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
+ K +GD++ RK LR L CKSF WYL+ V D W G +
Sbjct: 397 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFSTLHVPEDRPGWHGAIRSIG 456
Query: 221 IDSAC--------KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR 257
I S C PT + + L+ CH QGGNQF+ + + EIR
Sbjct: 457 ISSECLDYNAPDNNPTSAN--LSLFGCHGQGGNQFFEYTSNKEIR 499
>gi|157074156|ref|NP_001096791.1| polypeptide N-acetylgalactosaminyltransferase 4 [Bos taurus]
gi|154426082|gb|AAI51594.1| GALNT4 protein [Bos taurus]
gi|296487968|tpg|DAA30081.1| TPA: polypeptide N-acetylgalactosaminyltransferase 4 [Bos taurus]
Length = 578
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + ++ + V+ P+I I +TFE G + IGGFDW L
Sbjct: 230 CECNTGWLEPLLERIRKDETVVICPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R K+ EP +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKHERDRRKSRIEPFRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
+ K +GD++ RK LR L CKSF WYL+ V D W G +
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFSTLHVPEDRPGWHGAIRSIG 452
Query: 221 IDSAC--------KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR 257
I S C PT + + L+ CH QGGNQF+ + + EIR
Sbjct: 453 ISSECLDYNAPDNNPTSAN--LSLFGCHGQGGNQFFEYTSNKEIR 495
>gi|31418564|gb|AAH53063.1| Galnt2 protein [Mus musculus]
Length = 536
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 103/315 (32%), Positives = 148/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +RWL+PLL+ +A + + VVSP+I I D F+ + GGFDWNL
Sbjct: 192 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 244
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 245 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 304
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 305 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 355
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ LGCK FKWYL+ +
Sbjct: 356 EYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 415
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D + G +I
Sbjct: 416 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLI 473
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 474 RLQGCRENDSRQKWE 488
>gi|149043194|gb|EDL96726.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (predicted), isoform
CRA_a [Rattus norvegicus]
Length = 504
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 103/315 (32%), Positives = 148/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +RWL+PLL+ +A + + VVSP+I I D F+ + GGFDWNL
Sbjct: 160 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 212
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 213 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 272
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 273 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 323
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ LGCK FKWYL+ +
Sbjct: 324 EFKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 383
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D + G +I
Sbjct: 384 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLI 441
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 442 RLQGCRENDSRQKWE 456
>gi|300797173|ref|NP_001180032.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Bos
taurus]
gi|296472282|tpg|DAA14397.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Bos
taurus]
Length = 571
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 105/315 (33%), Positives = 147/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +RWL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 227 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLNCKPFKWYLENVYPELRVPDHQDIAFGAL 450
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 451 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 508
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523
>gi|74195843|dbj|BAE30483.1| unnamed protein product [Mus musculus]
Length = 544
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 103/315 (32%), Positives = 148/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +RWL+PLL+ +A + + VVSP+I I D F+ + GGFDWNL
Sbjct: 200 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 252
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 253 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 312
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 313 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 363
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ LGCK FKWYL+ +
Sbjct: 364 EYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 423
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D + G +I
Sbjct: 424 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLI 481
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 482 RLQGCRENDSRQKWE 496
>gi|197246167|gb|AAI68926.1| Galnt2 protein [Rattus norvegicus]
Length = 569
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 148/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +RWL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 225 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 277
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 278 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 337
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 338 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 388
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ LGCK FKWYL+ +
Sbjct: 389 EFKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 448
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D + G +I
Sbjct: 449 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLI 506
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 507 RLQGCRENDSRQKWE 521
>gi|402887191|ref|XP_003906986.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Papio
anubis]
Length = 578
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 101/285 (35%), Positives = 143/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + R+ + +V P+I I +TFE G + IGGFDW L
Sbjct: 230 CECNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R + +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKHERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
+ K +GD++ RK LR L CKSF WYL+ V D W G
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSKG 452
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495
>gi|417403505|gb|JAA48553.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 633
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 105/303 (34%), Positives = 150/303 (49%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P + + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDMNTFEFNKPSPYGINHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W A+P+ ER+R K+ P+ TPT AGGLFSI K +FE +GTYD +IWGGEN+E+SF
Sbjct: 337 FGWEALPDHERQRRKDETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E++ L CK+F WYL + +
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPEAYVPDLNPVISGYIKSV 512
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ LY CH GGNQ++ S EIR + E CL A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571
Query: 273 YPC 275
C
Sbjct: 572 NAC 574
>gi|148679819|gb|EDL11766.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 [Mus musculus]
Length = 548
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 148/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +RWL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 204 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 256
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 257 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 316
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 317 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 367
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ LGCK FKWYL+ +
Sbjct: 368 EYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 427
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D + G +I
Sbjct: 428 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLI 485
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 486 RLQGCRENDSRQKWE 500
>gi|194225536|ref|XP_001494993.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Equus
caballus]
Length = 460
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 109/300 (36%), Positives = 144/300 (48%), Gaps = 57/300 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + S VV P+I I +TFE L +S + IGGFDW L
Sbjct: 110 CECHEGWLEPLLQRIHEEESAVVCPVIDVIDWNTFEY------LGNSGEPQIGGFDWRLV 163
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH +PERER R ++ + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 164 FTWHVVPERERLRMRSPTDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 223
Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S KA L ++W E
Sbjct: 224 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDGYKE 272
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPV 234
L + + FGDVT RK+LR L CK F+W+LE N + ++H P
Sbjct: 273 LYYHRNPHARLEPFGDVTERKQLREKLRCKDFRWFLE--NVYP-----------ELHVPE 319
Query: 235 GLYPCHKQGGNQFWMMSKHGEIRRDEACLDY--------AGGDVILYPCHGSKGNQYFEY 286
C F M+ G + C DY G V LY CHG NQ+FEY
Sbjct: 320 DRPGC-------FGMLQNKG---LKDYCFDYNPPNENQITGHQVTLYLCHGMGQNQFFEY 369
>gi|350593559|ref|XP_003133495.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Sus
scrofa]
Length = 633
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 104/303 (34%), Positives = 152/303 (50%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +FE +GTYD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKDYFEYIGTYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E++R L CK+F WYL + +
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKRRLQCKNFTWYLNNIYPEAYVPDLNPVISGYIKSV 512
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ LY CH GGNQ++ S EIR + E CL A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSVQHEIRHNIQKELCLHAAQGVVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KTC 574
>gi|296212534|ref|XP_002752871.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
[Callithrix jacchus]
Length = 578
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 101/285 (35%), Positives = 142/285 (49%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + R+ + +V P+I I +TFE G + IGGFDW L
Sbjct: 230 CECNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R + +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKHERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
+ K +GD++ RK LR L CKSF WYL+ V D W G
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLKCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSKKEIR 495
>gi|344278311|ref|XP_003410938.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Loxodonta africana]
Length = 572
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 147/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +RWL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 228 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 280
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK++FE+LG YD D+WGGENLE+S
Sbjct: 281 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENLEIS 340
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 341 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 391
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 392 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 451
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D G +I
Sbjct: 452 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRTPGSLI 509
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 510 KLQGCRENDSRQKWE 524
>gi|324520233|gb|ADY47590.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Ascaris suum]
Length = 267
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 95/232 (40%), Positives = 125/232 (53%), Gaps = 42/232 (18%)
Query: 89 MAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTP 148
MAGGLF+ID+ +FEKLGTYD GFDIWGGENLE+SFK W E +
Sbjct: 1 MAGGLFAIDRQYFEKLGTYDPGFDIWGGENLEISFKI-WMCGGRLE----------IVPC 49
Query: 149 TMAGGLFSIDKAFFEKLGTYDSG------FDIWGGENLELSFK------GDFGDVTSRKE 196
+ G +F + + G ++W E E+ ++ G++GDV+ RK
Sbjct: 50 SHVGHVFRKKSPYKWRTGVNVLQRNNVRLAEVWLDEYKEIYYERINHKLGEYGDVSERKR 109
Query: 197 LRRNLGCKSFKWYL-----------------EVSNDWS-GMCIDSACKPTDMHKPVGLYP 238
LR L C SFKWYL E+ N + C+D ++ V YP
Sbjct: 110 LRERLKCHSFKWYLDNVFPDLFIPSKAIGKGEIRNRGNPKFCVDHEVGRNVVNDAVIPYP 169
Query: 239 CHKQGGNQFWMMSKHGEIRRDEACLDYAG-GDVILYPCHGSKGNQYFEYDYK 289
CH GGNQFW++SK GEIRRDE C+DY G G V+ Y CHGSKGNQ +EY+++
Sbjct: 170 CHLMGGNQFWLLSKEGEIRRDEYCIDYPGRGSVVTYECHGSKGNQLWEYNHE 221
>gi|405975887|gb|EKC40420.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
Length = 653
Score = 154 bits (388), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 108/308 (35%), Positives = 145/308 (47%), Gaps = 74/308 (24%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL+PLL +A N + VV+P+I I D + L + + F I N+ FNW +
Sbjct: 343 WLEPLLARVAENHTRVVAPVIDMISDRS--LACGGNEIGNLGTFEIA----NMGFNWLTL 396
Query: 70 PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA 129
+ E+ +H +EP TPT+AGGLFSI++A+F K+GTYD G DIWGGENLE+SF+
Sbjct: 397 NKTEKAKH-GQSEPWKTPTIAGGLFSINRAYFTKMGTYDHGMDIWGGENLEISFR----- 450
Query: 130 IPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYDSG------------- 171
VW M GG I F + Y G
Sbjct: 451 ---------------VW---MCGGSLEIHPCSHVAHLFRSMSPYKWGKSFRDILRKNAVR 492
Query: 172 -FDIWGGENLELSFK------GDFGDVTSRKELRRNLGCKSFKWYL-------------- 210
++W E + ++ GD+GDV+ RK+LR LGCKSF WYL
Sbjct: 493 TAEVWMDEYKHIYYERLNYDLGDYGDVSERKDLRNRLGCKSFGWYLKTMLPDMKLPETAL 552
Query: 211 ---EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG 267
EV N GMC+D+ T PCH QGGNQF+ + +G I RD ACL
Sbjct: 553 YSGEVRNMEKGMCLDTM--GTTAGNKFQAIPCHHQGGNQFFRFTVNGHIERDSACLSDQD 610
Query: 268 GDVILYPC 275
G ++ C
Sbjct: 611 GSLLYVLC 618
>gi|74203117|dbj|BAE26246.1| unnamed protein product [Mus musculus]
Length = 618
Score = 154 bits (388), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 103/315 (32%), Positives = 148/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +RWL+PLL+ +A + + VVSP+I I D F+ + GGFDWNL
Sbjct: 229 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 281
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 282 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 341
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 342 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 392
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ LGCK FKWYL+ +
Sbjct: 393 EYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 452
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D + G +I
Sbjct: 453 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLI 510
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 511 RLQGCRENDSRQKWE 525
>gi|440891991|gb|ELR45390.1| Polypeptide N-acetylgalactosaminyltransferase 2, partial [Bos
grunniens mutus]
Length = 530
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 147/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +RWL+PLL+ +A + + VVSP+I I D F+ + GGFDWNL
Sbjct: 186 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 238
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 239 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 298
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 299 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 349
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 350 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLNCKPFKWYLENVYPELRVPDHQDIAFGAL 409
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 410 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 467
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 468 KLQGCRENDSRQKWE 482
>gi|410897032|ref|XP_003962003.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Takifugu rubripes]
Length = 624
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 106/306 (34%), Positives = 148/306 (48%), Gaps = 43/306 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N S VVSP I I +TFE P + + G FDW+L
Sbjct: 274 CECFNGWLEPLLARIAENHSAVVSPDITTIDLNTFEFVKPSPYGQNHNR---GNFDWSLA 330
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +F ++G+YD +IWGGEN+E+SF
Sbjct: 331 FGWESLPDHEKRRRKDETYPIKTPTFAGGLFSISKDYFYQIGSYDKHMEIWGGENIEMSF 390
Query: 124 KF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
+ IP R + H + P T ++ + + + + Y
Sbjct: 391 RVWQCGGQLEIIPCSIVGHVFRTKSPH---SFPKGTQVISRNQVRLAEVWMDD---YKEI 444
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
F + +L FGD++ R +LR L CKSF WYL+ V
Sbjct: 445 FYRRNQQAAQLVRDKAFGDISQRMDLRARLKCKSFSWYLKNIYPEAFIPDLNPLGFGSVK 504
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDV 270
N C+D A + + K V +YPCH GGNQ++ S EIR + E CL A G V
Sbjct: 505 NVGKDSCLD-AGENNEGGKRVIMYPCHGLGGNQYFEYSTRHEIRHNIQKELCLHGAAGAV 563
Query: 271 ILYPCH 276
L C
Sbjct: 564 KLEECQ 569
>gi|149639508|ref|XP_001513185.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Ornithorhynchus anatinus]
Length = 634
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 106/303 (34%), Positives = 154/303 (50%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P + + G FDW+L
Sbjct: 281 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFSKPSPYGNNHNR---GNFDWSLS 337
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++PE E++R K+ P+ TPT AGGLFSI K +FE +GTYD +IWGGEN+E+SF
Sbjct: 338 FGWESLPEHEKQRRKDETYPIRTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSF 397
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ + F
Sbjct: 398 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---FKEIFY 453
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSG----- 218
E ++ + FGD++ R ELR L CK+F WYL +++ SG
Sbjct: 454 RRNTEAAKIVKQKAFGDLSKRLELRDRLQCKNFTWYLNTIYPEVYVPDLNPVLSGYIKSV 513
Query: 219 ---MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ +Y CH GGNQ++ S+ EIR + E CL + G V L
Sbjct: 514 GRHVCLDVG-ENNQGTKPLIMYTCHGLGGNQYFEYSEQHEIRHNIQKELCLHASHGPVQL 572
Query: 273 YPC 275
C
Sbjct: 573 KAC 575
>gi|427789065|gb|JAA59984.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 626
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 104/321 (32%), Positives = 159/321 (49%), Gaps = 44/321 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+P+++++ ++ + VV P+I I D T + TSS + IGGF+W +
Sbjct: 259 CEATDHWLEPMVELIKKDRTTVVCPIIDVIDDKTLQYMG-----TSSDFYQIGGFNWKGE 313
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W PE RK K+ A+P+ +PTMAGGLF+ID+ +F + G+YDS + WGGENLE+SF
Sbjct: 314 FIWINTPEAWRKARKSKADPMRSPTMAGGLFAIDRKYFWESGSYDSEMEGWGGENLEMSF 373
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ + P P P+ I+ A ++ + + + +
Sbjct: 374 RIWMCGGSLVIAPCSHVGHIFRDYHPYKFPSNK-DTHGINTARLAEV--WMDNYKYYFYQ 430
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGMC 220
N K FGD++ RK LR L CKSFKWYL+ N +GMC
Sbjct: 431 NRPELRKISFGDISERKALRNKLQCKSFKWYLDNVYPNKFVPSEKVFAFGNARNPNTGMC 490
Query: 221 IDSACKPTDMHKPVGLYPCHK---QGGNQFWMMSKHGEIRRDEACL---------DYAGG 268
+DS D +P+G+YPCHK GGNQ + EIR++++C D
Sbjct: 491 LDSMSHNYDNTEPLGIYPCHKDTNSGGNQLVSYTWRHEIRKEDSCAELSSEPEKSDKTAR 550
Query: 269 DVILYPC-HGSKGNQYFEYDY 288
V++ PC G++ + +D+
Sbjct: 551 KVMMAPCGEGAESEERQRWDH 571
>gi|363731636|ref|XP_419581.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Gallus
gallus]
Length = 566
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 107/320 (33%), Positives = 151/320 (47%), Gaps = 61/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 222 CECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 274
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK++FE+LG YD D+WGGENLE+S
Sbjct: 275 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENLEIS 334
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RK+H P P +G +F+ +
Sbjct: 335 FRVWQCGGSLEIIPCSRVGHVFRKQH-----PYTFPGGSGTVFARNTR---------RAA 380
Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV------------- 212
++W E + +G++ SR ELR+ L CK FKWYLE
Sbjct: 381 EVWMDEYKNFYYAAVPSARNVPYGNIQSRMELRKRLSCKPFKWYLENVYPELRVPDHQDI 440
Query: 213 ---SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYA 266
+ C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A
Sbjct: 441 AFGALQQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRA 498
Query: 267 GGDVI-LYPCHGSKGNQYFE 285
G +I L C + Q +E
Sbjct: 499 PGSLIKLQGCRENDSRQKWE 518
>gi|410968769|ref|XP_003990872.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 3 [Felis catus]
Length = 633
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 104/303 (34%), Positives = 151/303 (49%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ ER+R K+ P+ TPT AGGLFSI K +FE +GTYD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHERQRRKDETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E++ L CK+F WYL + +
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNTIYPEAYVPDLNPVISGYIKSI 512
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ LY CH GGNQ++ S EIR + E CL A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQREIRHNIQKELCLHAAQGLVQL 571
Query: 273 YPC 275
C
Sbjct: 572 RAC 574
>gi|74004468|ref|XP_535940.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
1 [Canis lupus familiaris]
Length = 632
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 104/303 (34%), Positives = 151/303 (49%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 279 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 335
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ ER+R K+ P+ TPT AGGLFSI K +FE +GTYD +IWGGEN+E+SF
Sbjct: 336 FGWESLPDHERQRRKDETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSF 395
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 396 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 451
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E++ L CK+F WYL + +
Sbjct: 452 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNTIYPEAYVPDLNPVISGYIKSI 511
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ LY CH GGNQ++ S EIR + E CL A G V L
Sbjct: 512 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 570
Query: 273 YPC 275
C
Sbjct: 571 RAC 573
>gi|1934912|emb|CAA69875.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 578
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 143/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + R + VV P+I I +TFE G + IGGFDW L
Sbjct: 230 CECNSGWLEPLLERIGRYETAVVCPVIDTIDWNTFEFYMQIG------EPMIGGFDWRLT 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P++ER R + +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
+ K +GD++ RK LR L CKSF WYL+ V D W G
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495
>gi|301783121|ref|XP_002926975.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Ailuropoda melanoleuca]
gi|281344477|gb|EFB20061.1| hypothetical protein PANDA_016676 [Ailuropoda melanoleuca]
Length = 632
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 104/303 (34%), Positives = 151/303 (49%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 279 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 335
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ ER+R K+ P+ TPT AGGLFSI K +FE +GTYD +IWGGEN+E+SF
Sbjct: 336 FGWESLPDHERQRRKDETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSF 395
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 396 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 451
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E++ L CK+F WYL + +
Sbjct: 452 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNTIYPEAYVPDLNPVISGYIKSV 511
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ LY CH GGNQ++ S EIR + E CL A G V L
Sbjct: 512 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQHEIRHNIQRELCLHAAQGLVQL 570
Query: 273 YPC 275
C
Sbjct: 571 RAC 573
>gi|417402857|gb|JAA48260.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 571
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 147/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 227 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK++FE+LG YD D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENLEIS 339
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 450
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 451 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 508
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523
>gi|297692565|ref|XP_002823614.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pongo
abelii]
Length = 578
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 100/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + R+ + +V P+I I +TFE G + IGGFDW L
Sbjct: 230 CECNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+++R R + +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKQKRDRQISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
+ K +GD++ RK LR L CKSF WYL+ V D W G
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495
>gi|291391661|ref|XP_002712292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Oryctolagus cuniculus]
Length = 633
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 104/303 (34%), Positives = 155/303 (51%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDMNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSG----- 218
+ ++ + FGD++ R E++ L CK+F WYL E++ SG
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKNRLQCKNFTWYLNTVYPEVYVPELNPVISGYIKTV 512
Query: 219 ---MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ LY CH GGNQ++ S EIR + E CL A G++ L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQNEIRHNIQKELCLHAAPGNLQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|301608341|ref|XP_002933751.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Xenopus (Silurana) tropicalis]
Length = 586
Score = 153 bits (387), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 102/304 (33%), Positives = 145/304 (47%), Gaps = 36/304 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFP--PGRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P G++ S G FDW+
Sbjct: 233 CECFHGWLEPLLSRVAEDHTAVVSPDITAINYNTFEFGKPVQQGKMNSR-----GNFDWS 287
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L FNW AIP + K+ K+ P+ TPT AGGLFSI KA+FE +G+YD +IWGGEN+E+
Sbjct: 288 LAFNWEAIPAADEKQRKDETYPIKTPTFAGGLFSISKAYFEHIGSYDEEMEIWGGENVEM 347
Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSGFDIW 175
SF+ IP P P + E + Y +
Sbjct: 348 SFRVWQCGGKLEIIPCSVVGHVFRTKSPHTFPKGTQVILRNQVRLAEVWMDDYKVLYYRR 407
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWS 217
+ +++ + FGD++ R +L+ +L CK+F WYLE + N+ +
Sbjct: 408 NEQAAKIAKEKSFGDISKRLKLKADLQCKNFTWYLENIYPEMFVPDRDPTYYGAIKNEGT 467
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR---RDEACL--DYAGGDVIL 272
CID + +YPCH GGNQ++ S H E+R + + CL Y G V L
Sbjct: 468 QNCIDVGENNNYGSQLPIMYPCHGMGGNQYFEYSTHKELRHNLKTQLCLCSKYEPGPVKL 527
Query: 273 YPCH 276
C
Sbjct: 528 VDCQ 531
>gi|395836156|ref|XP_003791031.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Otolemur garnettii]
Length = 571
Score = 153 bits (387), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 105/315 (33%), Positives = 147/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 227 CECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 450
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ TD VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 451 QQGTNCLDTLGHFTD--GVVGVYECHNAGGNQEWALTKEKAVKHIDLCLTVVDRAPGALI 508
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523
>gi|403272081|ref|XP_003927917.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Saimiri
boliviensis boliviensis]
Length = 578
Score = 153 bits (387), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 101/285 (35%), Positives = 142/285 (49%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + R+ + +V P+I I +TFE G + IGGFDW L
Sbjct: 230 CECNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R + +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKYERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
+ K +GD++ RK LR L CKSF WYL+ V D W G
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLKCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSKKEIR 495
>gi|307183874|gb|EFN70488.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Camponotus
floridanus]
Length = 451
Score = 153 bits (386), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 107/320 (33%), Positives = 154/320 (48%), Gaps = 56/320 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A + + VV P+I I DTF+ L GGFDW+L
Sbjct: 100 CECNADWLEPLLERVAEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWSLV 152
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + + ER+ R K+ + + TP +AGGLF I+KA+FEKLG YD+ D+WGGENL +
Sbjct: 153 FKWEYLSQAERQARQKDPTQAIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLGIV 212
Query: 123 FKFNWHAIPERE-------------------RKRHKNAAEPVWTPTMAGGLFSIDKAFFE 163
+F+ I R RKRH P P +G +F+ +
Sbjct: 213 IQFHVQKISFRVWQCGGSLEIIPCSRVGHVFRKRH-----PYSFPGGSGNVFARNTRRAA 267
Query: 164 KLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------- 212
++ D + + + L+ +G++ R EL+R L CK F WYL+
Sbjct: 268 EVWMDD--YKQFYYNAVPLARNIPYGNIQDRMELKRRLHCKPFSWYLKNVYPELVIPTSE 325
Query: 213 -----SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD--- 264
S C+DS D + VGLYPCH GGNQ W ++K G I+ + CL
Sbjct: 326 GGPGGSLKQGTACLDSMGHLLDGN--VGLYPCHNTGGNQEWGLTKDGLIKHHDLCLTLPV 383
Query: 265 YAGGDVILYP-CHGSKGNQY 283
YA G +L C GS+ ++
Sbjct: 384 YAKGTTLLMQICDGSENQKW 403
>gi|170038563|ref|XP_001847118.1| polypeptide N-acetylgalactosaminyltransferase 5 [Culex
quinquefasciatus]
gi|167882317|gb|EDS45700.1| polypeptide N-acetylgalactosaminyltransferase 5 [Culex
quinquefasciatus]
Length = 531
Score = 153 bits (386), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 79/156 (50%), Positives = 91/156 (58%), Gaps = 54/156 (34%)
Query: 185 KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSN------------- 214
KGD+GDV+SRK+LR LGCKSF+WYL EV N
Sbjct: 327 KGDYGDVSSRKQLREELGCKSFRWYLDNIFPELFIPGEAVASGEVRNMGYGNRTCLDAPG 386
Query: 215 ------------------------DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMM 250
WSG+CIDSA KP DMH P+G++PCH+ GGNQ+WM+
Sbjct: 387 GKKNLRKPVGLYPCHNQGGNQVANPWSGLCIDSAAKPEDMHTPLGIWPCHQAGGNQYWML 446
Query: 251 SKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
SK GEIRRDEACLDYAG DVILYPCHGSKGNQY+ Y
Sbjct: 447 SKTGEIRRDEACLDYAGQDVILYPCHGSKGNQYWNY 482
>gi|327270185|ref|XP_003219870.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Anolis carolinensis]
Length = 592
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 101/295 (34%), Positives = 145/295 (49%), Gaps = 55/295 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ + S VV P+I I +TFE L ++ + IGGFDW L
Sbjct: 240 CECHEEWLEPLLERIKEEPSAVVCPVIDVIDWNTFEY------LGNAGEPQIGGFDWRLV 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH +PERE+K+ ++ + + +PTMAGGLF+++K +F LG+YD+G ++WGGENLE SF
Sbjct: 294 FTWHVVPEREQKQRRSKTDVIRSPTMAGGLFAVNKNYFSYLGSYDTGMEVWGGENLEFSF 353
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAF--FEKLGTYDSGFDIWGGENLE 181
+ W + + + G +F + + L ++W E
Sbjct: 354 RI-WQC----------GGSLEIHPCSHVGHVFPKQAPYSRAKALANSVRAAEVWMDSYKE 402
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSG--------- 218
L + + +GDVT R+ LR L CK FKWYL+ V D G
Sbjct: 403 LYYHRNPHARMEPYGDVTERRLLREKLKCKDFKWYLDNIYPELHVPEDRLGYFGMLKNKG 462
Query: 219 ---MCIDSACKPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIRRD----EAC 262
C D P + H G LYPCH G NQF+ + + EIR + EAC
Sbjct: 463 MANFCFDY--NPPNEHDITGHVVILYPCHGMGQNQFFEYTSYHEIRYNTRHPEAC 515
>gi|327290100|ref|XP_003229762.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Anolis carolinensis]
Length = 634
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 102/300 (34%), Positives = 149/300 (49%), Gaps = 42/300 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N+++VVSP I++I +TFE P S + G FDW+L
Sbjct: 281 CECFYGWLEPLLARIAENNTYVVSPDISSIDLNTFEFSKPSPYGQSHNR---GNFDWSLS 337
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++PE E K+ K+ P+ TPT AGGLFSI K +F +G+YD +IWGGEN+E+SF
Sbjct: 338 FGWESLPEHESKKRKDETYPIKTPTFAGGLFSISKDYFYNIGSYDEEMEIWGGENIEMSF 397
Query: 124 KF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
+ IP R + H + P T + + + + ++ Y +
Sbjct: 398 RVWQCGGQLEIIPCSVVGHVFRSKSPH---SFPKGTQVITRNQVRLAEVWMDE---YKNI 451
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKP-TDM 230
F E ++ + FGD++ R EL++ L CK FKWYL SN + + P +
Sbjct: 452 FYRRNTEAAKIVKQQTFGDISKRHELKQRLQCKDFKWYL--SNVYPEAYVPDLNPPLSGF 509
Query: 231 HKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEYDYKY 290
K VG C G N ++ G +I+Y CHG GNQYFEY ++
Sbjct: 510 LKNVGRRACLDVGEN------------------NHGGKPLIMYTCHGLGGNQYFEYSARH 551
>gi|449497211|ref|XP_002190803.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Taeniopygia guttata]
Length = 669
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 106/320 (33%), Positives = 151/320 (47%), Gaps = 61/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ + GGFDWNL
Sbjct: 325 CECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 377
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK++FE+LG YD D+WGGENLE+S
Sbjct: 378 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENLEIS 437
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RK+H P P +G +F+ +
Sbjct: 438 FRVWQCGGSLEIIPCSRVGHVFRKQH-----PYTFPGGSGTVFARNTR---------RAA 483
Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV------------- 212
++W E + +G++ SR ELR+ L CK FKWYLE
Sbjct: 484 EVWMDEYKNFYYAAVPSARNVPYGNIQSRMELRKRLSCKPFKWYLENVYPELRVPDHQDI 543
Query: 213 ---SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYA 266
+ C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A
Sbjct: 544 AFGALQQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRA 601
Query: 267 GGDVI-LYPCHGSKGNQYFE 285
G +I L C + Q +E
Sbjct: 602 PGSLIKLQGCRENDSRQKWE 621
>gi|345798845|ref|XP_003434499.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Canis
lupus familiaris]
Length = 588
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 106/320 (33%), Positives = 150/320 (46%), Gaps = 61/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ + GGFDWNL
Sbjct: 244 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 296
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 297 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 356
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + +P RK+H P P +G +F+ +
Sbjct: 357 FRVWQCGGSLEIVPCSRVGHVFRKQH-----PYTFPGGSGTVFARNTR---------RAA 402
Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV------------- 212
++W E + +G++ SR ELR+ L CK FKWYLE
Sbjct: 403 EVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDI 462
Query: 213 ---SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYA 266
+ C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A
Sbjct: 463 AFGALQQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRA 520
Query: 267 GGDVI-LYPCHGSKGNQYFE 285
G VI L C + Q +E
Sbjct: 521 PGSVIKLQGCRENDTRQKWE 540
>gi|426226648|ref|XP_004007451.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 6 [Ovis aries]
Length = 792
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 104/314 (33%), Positives = 156/314 (49%), Gaps = 37/314 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 442 CECFHGWLEPLLARIAEDETVVVSPNIVTIDLNTFEFSKPVQRGRVQSR-----GNFDWS 496
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P RE++R K+ P+ +PT AGGLFSI KA+FE +GTYD+ +IWGGEN+E+
Sbjct: 497 LTFGWEVLPAREKQRRKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEM 556
Query: 122 SFKFNWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGF-DIWGG 177
SF+ W + E T T G+ I + + G+ +I+
Sbjct: 557 SFRV-WQCGGQLEIIPCSVVGHVFRTKSPHTFPKGINVIARNQVRLAEVWMDGYKEIFYR 615
Query: 178 ENL---ELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDW 216
NL +++ + FGD++ R +LR L C++F W+L+ + N
Sbjct: 616 RNLQAAQMAREKSFGDISERLQLRERLNCRNFSWFLDNIYPEMFVPDLKPTFFGALKNLG 675
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGGDVILY 273
C+D + + KP+ LY CH GGNQ++ + ++R + A CL + G + L
Sbjct: 676 VDHCLDVG-ENNNGGKPLILYACHGLGGNQYFEYTTQRDLRHNIAKQLCLHASAGTLGLR 734
Query: 274 PCHGSKGNQYFEYD 287
CH + N D
Sbjct: 735 SCHFTGKNSQVPKD 748
>gi|431895640|gb|ELK05066.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Pteropus alecto]
Length = 367
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 104/313 (33%), Positives = 145/313 (46%), Gaps = 47/313 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 23 CECNDHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 75
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 76 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 135
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 136 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 186
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-----------VSNDWSGM 219
E + +G++ SR ELR+ L CK FKWYLE + +
Sbjct: 187 EYKNFYYAAVPSARNVPYGNIQSRLELRKTLACKPFKWYLENVYPELRVPDHQDIAFGAL 246
Query: 220 CIDSACKPTDMH---KPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI-L 272
+ C T H VG+Y CH GGNQ W ++K ++ + CL D A G +I L
Sbjct: 247 QQGTNCLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLIKL 306
Query: 273 YPCHGSKGNQYFE 285
C + Q +E
Sbjct: 307 QGCRENDSRQKWE 319
>gi|302565702|ref|NP_001181690.1| polypeptide N-acetylgalactosaminyltransferase 4 [Macaca mulatta]
gi|380817542|gb|AFE80645.1| polypeptide N-acetylgalactosaminyltransferase 4 [Macaca mulatta]
Length = 578
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 100/285 (35%), Positives = 143/285 (50%), Gaps = 50/285 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + R+ + +V P+I I +TFE G + IGGFDW L
Sbjct: 230 CECNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P+ ER R + +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKHERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + E + + G +F + L ++W E E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARVAEVWMDEYKE 392
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
+ K +GD++ RK +R L CKSF WYL+ V D W G
Sbjct: 393 HFYNRNPPARKEAYGDISERKLIRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452
Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
I S C D + P + L+ CH QGGNQF+ + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495
>gi|354468855|ref|XP_003496866.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Cricetulus griseus]
gi|344247257|gb|EGW03361.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Cricetulus
griseus]
Length = 535
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 102/313 (32%), Positives = 147/313 (46%), Gaps = 47/313 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +RWL+PLL+ +A + + VVSP+I I D F+ + GGFDWNL
Sbjct: 191 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 243
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 244 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 303
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 304 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 354
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-----------VSNDWSGM 219
E + +G++ SR ELR+ L CK FKWYL+ + +
Sbjct: 355 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLDNVYPELRVPDHQDIAFGAL 414
Query: 220 CIDSACKPTDMH---KPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI-L 272
+ C T H VG+Y CH GGNQ W ++K ++ + CL D + G +I L
Sbjct: 415 QQGTNCLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLIRL 474
Query: 273 YPCHGSKGNQYFE 285
C + Q +E
Sbjct: 475 QGCRENDSRQKWE 487
>gi|392347955|ref|XP_232988.5| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Rattus norvegicus]
Length = 579
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 107/304 (35%), Positives = 148/304 (48%), Gaps = 53/304 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + S VV P+I I +TFE L +S + IGGFDW L
Sbjct: 226 CECHEGWLEPLLQRIHEKESAVVCPVIDVIDWNTFEY------LGNSGEPQIGGFDWRLV 279
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH +P+RERK ++ + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 280 FTWHVVPQRERKLMRSPIDVIRSPTMAGGLFAVSKRYFEYLGSYDTGMEVWGGENLEFSF 339
Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S KA L ++W + E
Sbjct: 340 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDDFKE 388
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLEV-------------------SND 215
L + + FGDVT RK+LR L CK FKW+L+ +
Sbjct: 389 LYYHRNPQARLEPFGDVTERKKLRAKLQCKDFKWFLDTVYPELHVPEDRPGFFGMLENRG 448
Query: 216 WSGMCID---SACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI----RRDEACLDYAGG 268
G C+D + + H+ V LY CH G NQF+ + EI R+ EAC+ G
Sbjct: 449 LRGYCLDYNPPSENNVEGHQ-VLLYLCHGMGQNQFFEYTSRQEIRYNTRQPEACIAVEEG 507
Query: 269 DVIL 272
+L
Sbjct: 508 KDVL 511
>gi|355767580|gb|EHH62635.1| hypothetical protein EGM_21033, partial [Macaca fascicularis]
Length = 453
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 109 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 161
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 162 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 221
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 222 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 272
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 273 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 332
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 333 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 390
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 391 KLQGCRENDSRQKWE 405
>gi|441612314|ref|XP_004088076.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Nomascus leucogenys]
Length = 570
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 226 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 278
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 279 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 338
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 339 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 389
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 390 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 449
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 450 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 507
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 508 KLQGCRENDSRQKWE 522
>gi|109476381|ref|XP_001066416.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Rattus norvegicus]
Length = 576
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 107/304 (35%), Positives = 148/304 (48%), Gaps = 53/304 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + S VV P+I I +TFE L +S + IGGFDW L
Sbjct: 226 CECHEGWLEPLLQRIHEKESAVVCPVIDVIDWNTFEY------LGNSGEPQIGGFDWRLV 279
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH +P+RERK ++ + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 280 FTWHVVPQRERKLMRSPIDVIRSPTMAGGLFAVSKRYFEYLGSYDTGMEVWGGENLEFSF 339
Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S KA L ++W + E
Sbjct: 340 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDDFKE 388
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLEV-------------------SND 215
L + + FGDVT RK+LR L CK FKW+L+ +
Sbjct: 389 LYYHRNPQARLEPFGDVTERKKLRAKLQCKDFKWFLDTVYPELHVPEDRPGFFGMLENRG 448
Query: 216 WSGMCID---SACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI----RRDEACLDYAGG 268
G C+D + + H+ V LY CH G NQF+ + EI R+ EAC+ G
Sbjct: 449 LRGYCLDYNPPSENNVEGHQ-VLLYLCHGMGQNQFFEYTSRQEIRYNTRQPEACIAVEEG 507
Query: 269 DVIL 272
+L
Sbjct: 508 KDVL 511
>gi|4758412|ref|NP_004472.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Homo
sapiens]
gi|51315838|sp|Q10471.1|GALT2_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
AltName: Full=Polypeptide GalNAc transferase 2;
Short=GalNAc-T2; Short=pp-GaNTase 2; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 2;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 2; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 2
soluble form
gi|971461|emb|CAA59381.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase [Homo
sapiens]
gi|26996816|gb|AAH41120.1| GALNT2 protein [Homo sapiens]
gi|119590317|gb|EAW69911.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
CRA_c [Homo sapiens]
gi|239740418|gb|ACS13744.1| polypeptide N-acetylgalactosaminyltransferase 2 [Homo sapiens]
gi|307686451|dbj|BAJ21156.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 [synthetic
construct]
Length = 571
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 227 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 450
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 451 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 508
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523
>gi|158261119|dbj|BAF82737.1| unnamed protein product [Homo sapiens]
Length = 571
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 227 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 450
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 451 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 508
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523
>gi|149730677|ref|XP_001496099.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Equus
caballus]
Length = 633
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 103/303 (33%), Positives = 150/303 (49%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDMNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ ER+R K+ P+ TPT AGGLFSI K +FE +GTYD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHERQRRKDETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R ++ L CK+F WYL + +
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFAIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSF 512
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ LY CH GGNQ++ S EIR + E CL A G V L
Sbjct: 513 GQSLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|148878418|gb|AAI46056.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Bos
taurus]
gi|296487792|tpg|DAA29905.1| TPA: polypeptide N-acetylgalactosaminyltransferase 6 [Bos taurus]
Length = 622
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 103/316 (32%), Positives = 154/316 (48%), Gaps = 41/316 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDETVVVSPNIVTIDLNTFEFSKPVQRGRIQSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P RE++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWEVLPAREKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF-DIW 175
SF+ IP P T G+ I + + G+ +I+
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSP---HTFPKGINVIARNQVRLAEVWMDGYKEIF 443
Query: 176 GGENL---ELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSN 214
NL +++ + FGD++ R +LR L C +F W+L+ + N
Sbjct: 444 YRRNLQAAQMAREKSFGDISERLQLRERLNCHNFSWFLDNVYPEMFVPDLKPTFFGALKN 503
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGGDVI 271
C+D + + KP+ LY CH GGNQ++ + ++R + A CL + G +
Sbjct: 504 LGVDHCLDVG-ENNNGGKPLILYTCHGLGGNQYFEYTTQRDLRHNIAKQLCLHASAGTLG 562
Query: 272 LYPCHGSKGNQYFEYD 287
L CH + N D
Sbjct: 563 LRSCHFTGKNSQVPKD 578
>gi|27370010|ref|NP_766281.1| polypeptide N-acetylgalactosaminyltransferase 12 [Mus musculus]
gi|51315979|sp|Q8BGT9.1|GLT12_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 12;
AltName: Full=Polypeptide GalNAc transferase 12;
Short=GalNAc-T12; Short=pp-GaNTase 12; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 12;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 12
gi|26329325|dbj|BAC28401.1| unnamed protein product [Mus musculus]
gi|26334957|dbj|BAC31179.1| unnamed protein product [Mus musculus]
gi|33991661|gb|AAH56425.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 [Mus musculus]
gi|52851351|dbj|BAD52068.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase [Mus musculus]
gi|74140287|dbj|BAE33836.1| unnamed protein product [Mus musculus]
Length = 576
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 107/304 (35%), Positives = 146/304 (48%), Gaps = 53/304 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + S VV P+I I +TFE L +S + IGGFDW L
Sbjct: 226 CECHEGWLEPLLQRIHEKESAVVCPVIDVIDWNTFEY------LGNSGEPQIGGFDWRLV 279
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH +P+RER+ ++ + + +PTMAGGLF++ K +F+ LG+YD+G ++WGGENLE SF
Sbjct: 280 FTWHVVPQRERQSMRSPIDVIRSPTMAGGLFAVSKRYFDYLGSYDTGMEVWGGENLEFSF 339
Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S KA L ++W E E
Sbjct: 340 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDEFKE 388
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLEV-------------------SND 215
L + + FGDVT RK+LR L CK FKW+L+ +
Sbjct: 389 LYYHRNPQARLEPFGDVTERKKLRAKLQCKDFKWFLDTVYPELHVPEDRPGFFGMLQNRG 448
Query: 216 WSGMCIDSACKPTDMH---KPVGLYPCHKQGGNQFWMMSKHGEI----RRDEACLDYAGG 268
G C+D P + H V LY CH G NQF+ + EI R+ EAC+ G
Sbjct: 449 LRGYCLDYN-PPNENHVEGHQVLLYLCHGMGQNQFFEYTTRKEIRYNTRQPEACITVEDG 507
Query: 269 DVIL 272
L
Sbjct: 508 KDTL 511
>gi|410342331|gb|JAA40112.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
gi|410342333|gb|JAA40113.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
Length = 576
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 232 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 284
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 285 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 344
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 345 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 395
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 396 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 455
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 456 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 513
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 514 KLQGCRENDSRQKWE 528
>gi|332812181|ref|XP_003308857.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Pan
troglodytes]
gi|410227516|gb|JAA10977.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
gi|410264536|gb|JAA20234.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
gi|410296424|gb|JAA26812.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
Length = 576
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 232 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 284
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 285 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 344
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 345 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 395
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 396 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 455
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 456 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 513
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 514 KLQGCRENDSRQKWE 528
>gi|386780726|ref|NP_001248284.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Macaca
mulatta]
gi|384941838|gb|AFI34524.1| polypeptide N-acetylgalactosaminyltransferase 2 [Macaca mulatta]
gi|387540526|gb|AFJ70890.1| polypeptide N-acetylgalactosaminyltransferase 2 [Macaca mulatta]
Length = 571
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 227 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 450
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 451 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 508
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523
>gi|291402210|ref|XP_002717436.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Oryctolagus cuniculus]
Length = 571
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 227 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 450
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 451 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 508
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523
>gi|390477336|ref|XP_003735278.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 2 [Callithrix jacchus]
Length = 571
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 105/317 (33%), Positives = 148/317 (46%), Gaps = 55/317 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 227 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGM----------- 219
E + +G++ SR ELR+ L CK FKWYLE N + +
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLE--NVYPELRVPDHQDIALG 448
Query: 220 -------CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGD 269
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G
Sbjct: 449 XLQQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGS 506
Query: 270 VI-LYPCHGSKGNQYFE 285
+I L C + Q +E
Sbjct: 507 LIKLQGCRENDSRQKWE 523
>gi|321476751|gb|EFX87711.1| hypothetical protein DAPPUDRAFT_306553 [Daphnia pulex]
Length = 626
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 103/289 (35%), Positives = 146/289 (50%), Gaps = 38/289 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + V+ P+I I D T E S F IG F W+
Sbjct: 273 CEATLGWLEPLLQRIKEDKRAVLVPIIDVIDDKTLEYYH-----GSPESFQIGSFTWSGH 327
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP+RE KR + P +PTMAGGLF+ID+ +F LG+YD G D+WGGENLE+SF
Sbjct: 328 FTWMDIPKREIKRRGSRVGPTNSPTMAGGLFAIDRQYFWDLGSYDEGMDVWGGENLEMSF 387
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWG 176
+ + IP R H + +T I+ A + + Y F +
Sbjct: 388 RIWMCGGSLETIP-CSRVGHIFRSFHPYTFPGNKDTHGINTARVVEVWMDDYKELFYMHR 446
Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSG 218
G+ + D GD ++RK+LR++L CKSFKWYLE V ND G
Sbjct: 447 GDLKTI----DIGDTSARKKLRKDLKCKSFKWYLENVLPDKFIMTEHSLGYGRVMNDAFG 502
Query: 219 --MCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLD 264
+C+D+ + D +G YPCH Q +Q + +SK G++RR+E+C +
Sbjct: 503 KQLCLDNLQRNEDQPYNLGQYPCHAQMAMSQVFALSKLGQLRREESCAE 551
>gi|402858708|ref|XP_003893834.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 2 [Papio anubis]
Length = 571
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 227 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 450
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 451 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDXCLTVVDRAPGSLI 508
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523
>gi|426220977|ref|XP_004004688.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Ovis
aries]
Length = 633
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 103/303 (33%), Positives = 150/303 (49%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P+ E++R K+ P+ TPT AGGLFSI K +FE +GTYD +IWGGEN+E+SF
Sbjct: 337 FGWETLPDHEKQRRKDETYPIKTPTFAGGLFSISKDYFEYIGTYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E++ L CK+F WYL + +
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ LY CH GGNQ++ S EIR + E CL A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQREIRHNIQKELCLHAAQGVVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|88192992|pdb|2FFU|A Chain A, Crystal Structure Of Human Ppgalnact-2 Complexed With Udp
And Ea2
gi|88192994|pdb|2FFV|A Chain A, Human Ppgalnact-2 Complexed With Manganese And Udp
gi|88192995|pdb|2FFV|B Chain B, Human Ppgalnact-2 Complexed With Manganese And Udp
Length = 501
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 157 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 209
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 210 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 269
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 270 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 320
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 321 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 380
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 381 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 438
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 439 KLQGCRENDSRQKWE 453
>gi|332265853|ref|XP_003281928.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
2 [Nomascus leucogenys]
Length = 571
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 227 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 450
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 451 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 508
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523
>gi|126307024|ref|XP_001369295.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Monodelphis domestica]
Length = 571
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 227 CECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYSFPGGSGTVFARNTR---------RAAEVWMD 390
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLNCKPFKWYLENVYPELRVPDHQDIAFGAL 450
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 451 QQGNNCLDTLGHFAD--GVVGVYECHNSGGNQEWALTKDKSVKHMDLCLTVVDRAPGSLI 508
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523
>gi|350592744|ref|XP_001927809.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Sus
scrofa]
Length = 571
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 227 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 450
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 451 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 508
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523
>gi|62751482|ref|NP_001015534.1| polypeptide N-acetylgalactosaminyltransferase 6 [Bos taurus]
gi|75057892|sp|Q5EA41.1|GALT6_BOVIN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
AltName: Full=Polypeptide GalNAc transferase 6;
Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 6;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6
gi|59857821|gb|AAX08745.1| polypeptide N-acetylgalactosaminyltransferase 6 [Bos taurus]
Length = 622
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 103/316 (32%), Positives = 154/316 (48%), Gaps = 41/316 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDETVVVSPNIVTIDLNTFEFSKPVQRGRVQSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P RE++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWEVLPAREKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF-DIW 175
SF+ IP P T G+ I + + G+ +I+
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSP---HTFPKGINVIARNQVRLAEVWMDGYKEIF 443
Query: 176 GGENL---ELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSN 214
NL +++ + FGD++ R +LR L C +F W+L+ + N
Sbjct: 444 YRRNLQAAQMAREKSFGDISERLQLRERLNCHNFSWFLDNVYPEMFVPDLKPTFFGALKN 503
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGGDVI 271
C+D + + KP+ LY CH GGNQ++ + ++R + A CL + G +
Sbjct: 504 LGVDHCLDVG-ENNNGGKPLILYTCHGLGGNQYFEYTTQRDLRHNIAKQLCLHASAGTLG 562
Query: 272 LYPCHGSKGNQYFEYD 287
L CH + N D
Sbjct: 563 LRSCHFTGKNSQVPKD 578
>gi|380798879|gb|AFE71315.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor, partial
[Macaca mulatta]
Length = 554
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 210 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 262
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 263 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 322
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 323 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 373
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 374 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 433
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 434 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 491
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 492 KLQGCRENDSRQKWE 506
>gi|431894865|gb|ELK04658.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Pteropus alecto]
Length = 633
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 106/303 (34%), Positives = 152/303 (50%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P + + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDMNTFEFNKPSPYGNNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ ER+R K+ P+ TPT AGGLFSI K +FE +GTYD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHERQRRKDETYPIKTPTFAGGLFSISKEYFEYIGTYDDEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + + Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDD---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSG----- 218
+ ++ + FGD++ R E++ L CK+F WYL +++ SG
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSF 512
Query: 219 ---MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ LY CH GGNQ++ S EIR + E CL A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSVQHEIRHNIQKELCLHAAQGLVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|332812183|ref|XP_001147638.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
4 [Pan troglodytes]
Length = 533
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 189 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 241
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 242 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 301
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 302 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 352
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 353 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 412
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 413 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 470
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 471 KLQGCRENDSRQKWE 485
>gi|312370886|gb|EFR19191.1| hypothetical protein AND_22918 [Anopheles darlingi]
Length = 1204
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 102/305 (33%), Positives = 149/305 (48%), Gaps = 37/305 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL+ L+ +A+ + + P I I +DT L +S +++ G FDW L
Sbjct: 222 CEVIEGWLEALVAHVAQRETMIAIPAIDWIHEDTLALN-----AQNSVRYY-GSFDWGLN 275
Query: 64 FNWHAIPER--ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
F W +R + N A P TPTMAGGLF+I ++FFE+LG YD G I+GGEN+EL
Sbjct: 276 FQWRVRADRIMQPAMAGNPAAPYDTPTMAGGLFTIHRSFFERLGWYDEGMQIYGGENMEL 335
Query: 122 SFKFNWHA-----IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY-----DSG 171
SFK W I R H + ++ G + + + D
Sbjct: 336 SFK-AWMCGGSMQIVGCSRVAHIQKRGHPYLRQLSDGFALVRRNSIRVAEVWLDEYADYF 394
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------VSNDWSGM 219
++ +GG + +G FG++T R ELR+ L CK F+WYLE V+
Sbjct: 395 YETFGGR----ARRGSFGNLTERHELRQRLACKPFRWYLETVFPEQFDPSKAVARGEIRF 450
Query: 220 CIDSACKPTDMHKP--VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHG 277
D+ P + P + L CH GG+Q W ++ GE+ R++ CLDY G + + CHG
Sbjct: 451 ADDAKATPLCLDWPSLLSLVTCHGYGGHQLWYLTAKGEVTREDHCLDYDGELLSVVRCHG 510
Query: 278 SKGNQ 282
GNQ
Sbjct: 511 LGGNQ 515
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 102/321 (31%), Positives = 147/321 (45%), Gaps = 56/321 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVS-PLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNL 62
CE WL+ LDV+AR+ H ++ P I I + G +++ + G W L
Sbjct: 849 CECMVGWLEGQLDVVARDPRHTIALPTIDWIDEKNL------GLVSNKAPVYYGAMGWGL 902
Query: 63 QFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W +R K +N EP TP MAGGLF+I + FE LG YD D++GGEN+ELS
Sbjct: 903 DFQWRGRWDRVNK-PENKLEPFSTPVMAGGLFTIHRKLFEWLGWYDQQLDVYGGENIELS 961
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSG-FDI 174
K +P + P + +A + + + L Y + +D+
Sbjct: 962 LKAWMCGGQLLTVPCSRVAHIQKTGHP-YLLGLAKDVARTNSVRVAEVWLDQYAAVLYDL 1020
Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV------------------SNDW 216
+GG ++GDFGDVT RK+LRR L CKSF+WYLE N+
Sbjct: 1021 FGGPQ----YRGDFGDVTERKQLRRALHCKSFRWYLETVFPELAPALDKRPGHGRFENEA 1076
Query: 217 SGM------CI---DSACKPTDMHKPVGLYPCHK-QGGNQFWMMSKHGEIRRDEACLDYA 266
M C+ SA PT + PC Q W+ + GE+ + CLDY
Sbjct: 1077 LSMEGQPKHCLTAQSSAGLPT-------MEPCQAGSDARQHWLHNLFGELSNENRCLDYD 1129
Query: 267 GGDVILYPCHGSKGNQYFEYD 287
G + +Y CH ++GNQ + Y+
Sbjct: 1130 GSALRVYACHKARGNQEWRYN 1150
>gi|149758073|ref|XP_001496259.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Equus
caballus]
Length = 539
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 195 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 247
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 248 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 307
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 308 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 358
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 359 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 418
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 419 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 476
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 477 KLQGCRENDSRQKWE 491
>gi|344268030|ref|XP_003405867.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Loxodonta africana]
Length = 633
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/303 (34%), Positives = 154/303 (50%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +FE +GTYD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSG----- 218
+ ++ + FGD++ R E++ L CK+F WYL +++ SG
Sbjct: 453 RRNTDAAKIVRQKSFGDLSKRFEIKHRLQCKNFTWYLNSVYPEVYVPDLNPVISGYIKSF 512
Query: 219 ---MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ +Y CH GGNQ++ S EIR + E CL A G V L
Sbjct: 513 GQHLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAPGPVQL 571
Query: 273 YPC 275
C
Sbjct: 572 RTC 574
>gi|119590314|gb|EAW69908.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
CRA_a [Homo sapiens]
Length = 508
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 189 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 241
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 242 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 301
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 302 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 352
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 353 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 412
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 413 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 470
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 471 KLQGCRENDSRQKWE 485
>gi|395531657|ref|XP_003767891.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Sarcophilus harrisii]
Length = 542
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 107/320 (33%), Positives = 150/320 (46%), Gaps = 61/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 198 CECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 250
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 251 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 310
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RK+H P P +G +F+ +
Sbjct: 311 FRVWQCGGSLEIIPCSRVGHVFRKQH-----PYSFPGGSGTVFARNTR---------RAA 356
Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV------------- 212
++W E + +G++ SR ELR+ L CK FKWYLE
Sbjct: 357 EVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDI 416
Query: 213 ---SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYA 266
+ C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A
Sbjct: 417 AFGALQQGNNCLDTLGHFAD--GVVGVYECHNSGGNQEWALTKDKSVKHMDLCLTVVDRA 474
Query: 267 GGDVI-LYPCHGSKGNQYFE 285
G +I L C + Q +E
Sbjct: 475 PGSLIKLQGCRENDSRQKWE 494
>gi|397508104|ref|XP_003824510.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Pan
paniscus]
Length = 533
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 189 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 241
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 242 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 301
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 302 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 352
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 353 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 412
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 413 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 470
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 471 KLQGCRENDSRQKWE 485
>gi|348575518|ref|XP_003473535.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Cavia porcellus]
Length = 531
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 101/315 (32%), Positives = 147/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ + GGFDWNL
Sbjct: 187 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 239
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 240 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 299
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 300 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 350
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ LGC+ F+WYLE +
Sbjct: 351 EYKNFYYAAVPSARNVPYGNIQSRLELRKRLGCRPFQWYLENVYPELRVPDHQDIAFGAL 410
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G ++
Sbjct: 411 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGALV 468
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 469 KLQGCRENDSRQKWE 483
>gi|221043222|dbj|BAH13288.1| unnamed protein product [Homo sapiens]
Length = 533
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 189 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 241
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 242 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 301
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 302 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 352
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 353 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 412
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 413 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 470
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 471 KLQGCRENDSRQKWE 485
>gi|426334121|ref|XP_004028610.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Gorilla
gorilla gorilla]
Length = 533
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 189 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 241
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 242 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 301
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 302 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 352
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 353 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 412
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 413 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 470
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 471 KLQGCRENDSRQKWE 485
>gi|296490594|tpg|DAA32707.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Bos
taurus]
gi|440907905|gb|ELR57989.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Bos grunniens
mutus]
Length = 633
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 106/303 (34%), Positives = 153/303 (50%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P+ E++R K+ P+ TPT AGGLFSI K +FE +GTYD +IWGGEN+E+SF
Sbjct: 337 FGWETLPDHEKQRRKDETYPIKTPTFAGGLFSISKDYFEYIGTYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSGMCIDS 223
+ ++ + FGD++ R E++ L CK+F WYL +++ SG I S
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGY-IKS 511
Query: 224 ACKPTDMH--------KPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+P + KP+ LY CH GGNQ++ S EIR + E CL A G V L
Sbjct: 512 VGRPLCLDVGENNQGGKPLILYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAALGAVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|119590315|gb|EAW69909.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
CRA_b [Homo sapiens]
gi|119590316|gb|EAW69910.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
CRA_b [Homo sapiens]
Length = 533
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 189 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 241
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 242 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 301
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 302 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 352
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 353 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 412
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 413 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 470
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 471 KLQGCRENDSRQKWE 485
>gi|351708624|gb|EHB11543.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Heterocephalus
glaber]
Length = 567
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 102/315 (32%), Positives = 147/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 223 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 275
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 276 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 335
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 336 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 386
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ LGC+ F+WYLE +
Sbjct: 387 EYKNFYYAAVPSARNVPYGNIQSRLELRKRLGCQPFQWYLENVYPELRVPDHQDIAFGAL 446
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G ++
Sbjct: 447 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGALV 504
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 505 KLQGCRENDSRQKWE 519
>gi|355559183|gb|EHH15963.1| hypothetical protein EGK_02147, partial [Macaca mulatta]
Length = 530
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 186 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 238
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 239 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 298
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 299 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 349
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 350 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 409
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 410 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 467
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 468 KLQGCRENDSRQKWE 482
>gi|300797404|ref|NP_001179787.1| polypeptide N-acetylgalactosaminyltransferase 3 [Bos taurus]
Length = 633
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 106/303 (34%), Positives = 153/303 (50%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P+ E++R K+ P+ TPT AGGLFSI K +FE +GTYD +IWGGEN+E+SF
Sbjct: 337 FGWETLPDHEKQRRKDETYPIKTPTFAGGLFSISKDYFEYIGTYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSGMCIDS 223
+ ++ + FGD++ R E++ L CK+F WYL +++ SG I S
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGY-IKS 511
Query: 224 ACKPTDMH--------KPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+P + KP+ LY CH GGNQ++ S EIR + E CL A G V L
Sbjct: 512 VGRPLCLDVGENNQGGKPLILYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAALGAVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|332265851|ref|XP_003281927.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
1 [Nomascus leucogenys]
Length = 556
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 103/315 (32%), Positives = 146/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ + GGFDWNL
Sbjct: 212 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 264
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 265 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 324
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 325 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 375
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 376 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 435
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 436 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 493
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 494 KLQGCRENDSRQKWE 508
>gi|417403183|gb|JAA48410.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 599
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFP--PGRLTSSYKFFIGGFDWN 61
CE WL+PLL + + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 264 CECFHGWLEPLLARITEDETAVVSPDIVTIDLNTFEFSKPVQKGRVHSR-----GNFDWS 318
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P ER+R K+ +P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 319 LTFGWETLPAHERQRRKDETDPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 378
Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + ++ Y
Sbjct: 379 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTNVIARNQVRLAEVWMDE---YK 432
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F + +++ + FGD++ R +LR L C++F WYL
Sbjct: 433 EIFYRRNIQAAKMAREKSFGDISERLQLREQLHCRNFSWYLHNIYPEMFVPDLKPTFYGA 492
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N C+D + KP+ +YPCH GGNQ++ + ++R + A CL + G
Sbjct: 493 IKNLGIDQCLDVG-ENNRGGKPLIMYPCHSLGGNQYFEYTTQRDLRHNIAKQLCLHASAG 551
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ L C + N D
Sbjct: 552 TLGLRGCQFTVKNSQVPKD 570
>gi|417412000|gb|JAA52417.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 624
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFP--PGRLTSSYKFFIGGFDWN 61
CE WL+PLL + + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 274 CECFHGWLEPLLARITEDETAVVSPDIVTIDLNTFEFSKPVQKGRVHSR-----GNFDWS 328
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P ER+R K+ +P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 329 LTFGWETLPAHERQRRKDETDPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 388
Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + ++ Y
Sbjct: 389 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTNVIARNQVRLAEVWMDE---YK 442
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F + +++ + FGD++ R +LR L C++F WYL
Sbjct: 443 EIFYRRNIQAAKMAREKSFGDISERLQLREQLHCRNFSWYLHNIYPEMFVPDLKPTFYGA 502
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N C+D + KP+ +YPCH GGNQ++ + ++R + A CL + G
Sbjct: 503 IKNLGIDQCLDVG-ENNRGGKPLIMYPCHSLGGNQYFEYTTQRDLRHNIAKQLCLHASAG 561
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ L C + N D
Sbjct: 562 TLGLRGCQFTVKNSQVPKD 580
>gi|334330196|ref|XP_003341314.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 3-like [Monodelphis
domestica]
Length = 631
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 104/315 (33%), Positives = 154/315 (48%), Gaps = 40/315 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I TFE P ++ + G FDW+L
Sbjct: 278 CECFYGWLEPLLSRIAENYTAVVSPDIASIDLTTFEFSKPSPYGSNHNR---GNFDWSLS 334
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +FE +GTYD IWGGEN+E+SF
Sbjct: 335 FGWESLPDHEKQRRKDETYPIRTPTFAGGLFSISKKYFEYIGTYDEEMKIWGGENIEMSF 394
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ + F
Sbjct: 395 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---FKEIFY 450
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
E ++ + +GD++ R ++R L CK+F WYL + N
Sbjct: 451 RRNTEAAKIVKQKAYGDISKRLDIRHRLQCKNFTWYLNNIYPEIYVPDLNPVISGYIQNI 510
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR---RDEACLDYAGGDVIL 272
+C+D + KP+ +Y CH GGNQ++ S+ EIR + E CL G V +
Sbjct: 511 GRHLCLDVG-ENNQGGKPLIMYTCHFLGGNQYFEXSEQHEIRHSIQKELCLHALQGPVQM 569
Query: 273 YPCHGSKGNQYFEYD 287
C KG + F D
Sbjct: 570 KAC-SYKGQKTFTVD 583
>gi|26338209|dbj|BAC32790.1| unnamed protein product [Mus musculus]
Length = 570
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 103/314 (32%), Positives = 147/314 (46%), Gaps = 51/314 (16%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
E +RWL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL F
Sbjct: 227 ECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLVF 279
Query: 65 NW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+SF
Sbjct: 280 KWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISF 339
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ + IP P P +G +F+ + ++W E
Sbjct: 340 RVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMDE 390
Query: 179 NLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SND 215
+ +G++ SR ELR+ LGCK FKWYL+ +
Sbjct: 391 YKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGALQ 450
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI- 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D + G +I
Sbjct: 451 QGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLIR 508
Query: 272 LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 509 LQGCRENDSRQKWE 522
>gi|301772392|ref|XP_002921627.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Ailuropoda melanoleuca]
Length = 622
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 100/296 (33%), Positives = 144/296 (48%), Gaps = 36/296 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELR--FPPGRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEEETAVVSPDIVTIDLNTFEFSKPVPSGRIHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W A+P E++R K+ P+ +PT AGGLFSI KA+FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWEALPAHEKQRRKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSGFDIW 175
SF+ IP P P + E + +Y F
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPHTFPKGISVIARNQVRLAEVWMDSYKEIFYRR 446
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPT--DMHKP 233
+ +++ + FGD++ R +LR L C++F W+L +N + M + KPT +
Sbjct: 447 NMQAAKMAQEKSFGDISERLKLREQLHCRNFSWFL--TNIYPEMFVPD-LKPTFYGAIRN 503
Query: 234 VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
+G+ C G N ++ G +I+Y CHG GNQYFEY +
Sbjct: 504 LGINQCLDVGEN------------------NHGGKPLIMYTCHGLGGNQYFEYTTR 541
>gi|281348732|gb|EFB24316.1| hypothetical protein PANDA_010523 [Ailuropoda melanoleuca]
Length = 621
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 100/296 (33%), Positives = 144/296 (48%), Gaps = 36/296 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELR--FPPGRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEEETAVVSPDIVTIDLNTFEFSKPVPSGRIHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W A+P E++R K+ P+ +PT AGGLFSI KA+FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWEALPAHEKQRRKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSGFDIW 175
SF+ IP P P + E + +Y F
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPHTFPKGISVIARNQVRLAEVWMDSYKEIFYRR 446
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPT--DMHKP 233
+ +++ + FGD++ R +LR L C++F W+L +N + M + KPT +
Sbjct: 447 NMQAAKMAQEKSFGDISERLKLREQLHCRNFSWFL--TNIYPEMFVPD-LKPTFYGAIRN 503
Query: 234 VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
+G+ C G N ++ G +I+Y CHG GNQYFEY +
Sbjct: 504 LGINQCLDVGEN------------------NHGGKPLIMYTCHGLGGNQYFEYTTR 541
>gi|410975135|ref|XP_003993990.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Felis
catus]
Length = 653
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 105/318 (33%), Positives = 149/318 (46%), Gaps = 57/318 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ + GGFDWNL
Sbjct: 309 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 361
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 362 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 421
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + +P RK+H P P +G +F+ +
Sbjct: 422 FRVWQCGGSLEIVPCSRVGHVFRKQH-----PYTFPGGSGTVFARNTR---------RAA 467
Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-----------VSN 214
++W E + +G++ SR ELR+ L CK FKWYLE
Sbjct: 468 EVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDI 527
Query: 215 DWSGMCIDSACKPTDMH---KPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGG 268
+ + + C T H VG+Y CH GGNQ W ++K ++ + CL D G
Sbjct: 528 AFGALQQGTNCLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRTPG 587
Query: 269 DVI-LYPCHGSKGNQYFE 285
VI L C + Q +E
Sbjct: 588 SVIKLQGCRENDSRQKWE 605
>gi|1575723|gb|AAB09579.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase-T3 [Mus
musculus]
Length = 633
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E+++ L CK+F WYL + +
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYLNTIYPEAYVPDLNPVISGYIKSV 512
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ LY CH GGNQ++ S EIR + E CL G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQREIRHNIQKELCLHATQGVVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|403300209|ref|XP_003940844.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Saimiri
boliviensis boliviensis]
Length = 724
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 103/315 (32%), Positives = 145/315 (46%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ + GGFDWNL
Sbjct: 380 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 432
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE LG YD D+WGGENLE+S
Sbjct: 433 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEALGKYDMMMDVWGGENLEIS 492
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 493 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 543
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 544 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 603
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
C+D+ D VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 604 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 661
Query: 272 -LYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 662 KLQGCRENDSRQKWE 676
>gi|405973911|gb|EKC38600.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Crassostrea gigas]
Length = 581
Score = 150 bits (380), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 103/306 (33%), Positives = 144/306 (47%), Gaps = 51/306 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLLD + + + VVSP+I I D FE L GGFDWNL
Sbjct: 236 CECNVGWLEPLLDRIKGDRTRVVSPIIDVINMDNFEYIGASADLK-------GGFDWNLV 288
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE KR N +P+ TP +AGGLFSI+K +FE+LG YD D+WGGENLE+S
Sbjct: 289 FKWDYMTPEERNKRAGNPIQPIRTPMIAGGLFSIEKKWFEELGKYDRNMDVWGGENLEIS 348
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 349 FRVWQCHGSLEIIPCSRVGHVFRKQHPYTFPGGSGNVFARNTR---------RAAEVWMD 399
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +FGD++ R +LR+ L CK FKW+LE S
Sbjct: 400 NYKEFYYAAVPSAKMVNFGDISERMDLRKRLSCKPFKWFLEHVYPELKVPGHQDQAFGSI 459
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG---GDVI 271
C+D+ D +G++PCH GGNQ + ++K G IR + C+ G G V+
Sbjct: 460 QQDNNCMDTLGNFAD--GILGIFPCHFAGGNQEFSLTKEGFIRHLDLCVTLTGSMPGTVV 517
Query: 272 -LYPCH 276
L+ C
Sbjct: 518 KLFQCQ 523
>gi|162951828|ref|NP_056551.2| polypeptide N-acetylgalactosaminyltransferase 3 [Mus musculus]
gi|341941092|sp|P70419.3|GALT3_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
AltName: Full=Polypeptide GalNAc transferase 3;
Short=GalNAc-T3; Short=pp-GaNTase 3; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 3;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 3
gi|74183238|dbj|BAE22551.1| unnamed protein product [Mus musculus]
gi|148695061|gb|EDL27008.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Mus musculus]
Length = 633
Score = 150 bits (380), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E+++ L CK+F WYL + +
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYLNTIYPEAYVPDLNPVISGYIKSV 512
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ LY CH GGNQ++ S EIR + E CL G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQREIRHNIQKELCLHATQGVVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|345319818|ref|XP_001521442.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Ornithorhynchus anatinus]
Length = 628
Score = 150 bits (380), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 103/318 (32%), Positives = 148/318 (46%), Gaps = 57/318 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ + GGFDWNL
Sbjct: 284 CECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 336
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK++FE+LG YD D+WGGENLE+S
Sbjct: 337 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENLEIS 396
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + +P RK+H P P +G +F+ +
Sbjct: 397 FRVWQCGGSLEIVPCSRVGHVFRKQH-----PYTFPGGSGTVFARNTR---------RAA 442
Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-----------VSN 214
++W E + +G++ SR ELR+ L CK FKWYLE
Sbjct: 443 EVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKRLSCKPFKWYLENVYPELRVPDHQDI 502
Query: 215 DWSGMCIDSACKPTDMH---KPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----G 267
+ + + C T H VG+Y CH GGNQ W ++K ++ + CL G
Sbjct: 503 AFGALQQGTNCLDTLGHFADGVVGVYECHNAGGNQEWALTKDRSVKHMDLCLTVVERTPG 562
Query: 268 GDVILYPCHGSKGNQYFE 285
V L C + Q +E
Sbjct: 563 ALVKLQGCRENDSRQKWE 580
>gi|157128332|ref|XP_001661405.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108872614|gb|EAT36839.1| AAEL011095-PA [Aedes aegypti]
Length = 573
Score = 150 bits (380), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 105/286 (36%), Positives = 141/286 (49%), Gaps = 42/286 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + + VV P+I I DTF+ L GGFDWNL
Sbjct: 231 CECNVDWLEPLLIRVKEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWNLV 283
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ER +R K+ P+ TP +AGGLF IDK +FEKLG YD+ DIWGGENLE+S
Sbjct: 284 FKWEYLSTAERHERQKDPTTPIRTPMIAGGLFVIDKVYFEKLGKYDTQMDIWGGENLEIS 343
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSG 171
F+ + IP RKRH P P +G +F+ + ++ D
Sbjct: 344 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGGSGNIFAKNTRRAAEVWMDD-- 396
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSA------- 224
+ + + L+ FGD+ R EL+ L CK FKWYL +N + + I
Sbjct: 397 YKQYYYAAVPLAKNIPFGDIEERMELKERLQCKPFKWYL--ANVYPQLTIPEQQTKGSLR 454
Query: 225 ----CKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACL 263
C T H VGLY CH GGNQ W ++K G+I+ + CL
Sbjct: 455 QGPYCMDTLGHLVDGIVGLYQCHDSGGNQDWAITKKGQIKHLDLCL 500
>gi|449666442|ref|XP_002161887.2| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 6-like [Hydra
magnipapillata]
Length = 591
Score = 150 bits (380), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 109/322 (33%), Positives = 146/322 (45%), Gaps = 62/322 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL L N V P I I FE G G F W L
Sbjct: 230 CEASFGWLEPLLARLQENPKLAVVPDIEVISFKNFEYSSEKGSYNR------GIFSWELM 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD---------SGFDIW 114
FNW +P RE+ R K ++P+ +PTMAGGLF++++ +F + G YD W
Sbjct: 284 FNWGPLPPREKMRRKYESDPIKSPTMAGGLFAMNRKYFFESGAYDRQNILGRXXXXLTYW 343
Query: 115 GGENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
GGEN+E+SF+ IP P +P + SI A
Sbjct: 344 GGENVEMSFRLWMCGEGIEIIPCSRVGHVFRERAPYKSPDGSTDHNSIRVA--------- 394
Query: 170 SGFDIWGGENLEL--SFKGDF-----GDVTSRKELRRNLGCKSFKWYLE----------- 211
++W E E+ SF+ + GDV+ RK+LR +L CKSFKWYL+
Sbjct: 395 ---EVWMDEFKEIFYSFRANLKPEQGGDVSERKKLREDLKCKSFKWYLQNIIPELEIPDK 451
Query: 212 -------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD 264
V N + C+D+ + KP GLYPCHK G NQ+++ +K EI D CLD
Sbjct: 452 YPYGRGDVKNLGTLSCLDTLAQNNQGGKP-GLYPCHKMGTNQYFIFTKKFEIWHDGLCLD 510
Query: 265 YAGGD----VILYPCHGSKGNQ 282
+ D V L+PCH GNQ
Sbjct: 511 LSDSDLNAKVKLWPCHKQGGNQ 532
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 43/73 (58%), Gaps = 4/73 (5%)
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD--EACLDYAGGDVILYPC 275
G+C+D +D++ V L+PCHKQGGNQ W +K G I + + CL+ G +++ C
Sbjct: 506 GLCLD--LSDSDLNAKVKLWPCHKQGGNQKWKHTKSGLIMHESRKKCLEGQGDQILIRAC 563
Query: 276 HGSKGNQYFEYDY 288
+ NQ + +++
Sbjct: 564 DTNNANQRWLFEH 576
>gi|170038567|ref|XP_001847120.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
gi|167882319|gb|EDS45702.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
Length = 494
Score = 150 bits (380), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 98/313 (31%), Positives = 146/313 (46%), Gaps = 48/313 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL+ LDV+A + + P I I ++T L + + + G FDW +
Sbjct: 156 CEVIVGWLEAQLDVVAADPQTIAIPSIDWIHEETMALN------AQNSQLYFGSFDWTVN 209
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + E++ K +N P TP MAGGLF+I++ FFE LG YD GF +G EN+ELSF
Sbjct: 210 FQWKSRAEKKVK-PENPVAPFDTPVMAGGLFTINRTFFEHLGWYDEGFQTYGAENMELSF 268
Query: 124 KF----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
K + I R H + + GG ++ + ++W E
Sbjct: 269 KTWMCGGFMKIVPCSRVAHIQKRGHPYLASSPGGFNAVKRNTVRLA-------EVWLDEY 321
Query: 180 LELSF--------KGDFGDVTSRKELRRNLGCKSFKWYLEV---------------SNDW 216
E + +GDFGDV+SRK+LR L C+ F+WY+E
Sbjct: 322 AEYYYESFGGRKNRGDFGDVSSRKKLRARLNCRPFRWYMETVFPEQFDPSKAVGRGQFRI 381
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCH 276
G C+D K + + CH GG+Q W + GEI R++ C+D+ + + CH
Sbjct: 382 GGGCLDWPTK-------LSVIGCHGLGGHQLWFFTADGEITREDHCMDFDSKKLEMIRCH 434
Query: 277 GSKGNQYFEYDYK 289
KGNQ + ++ K
Sbjct: 435 KQKGNQMWVFEEK 447
>gi|324507788|gb|ADY43296.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Ascaris suum]
Length = 580
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 105/310 (33%), Positives = 143/310 (46%), Gaps = 50/310 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +WL+PLL + N VV+P+I I DTF L GGF+WNL
Sbjct: 232 CECNVQWLEPLLARVKENPHAVVAPIIDVINMDTFNYVAASADLR-------GGFEWNLV 284
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + + R RH + P+ TP +AGGLF I K +FE LGTYD D+WGGENLELS
Sbjct: 285 FKWEYLSGKLRDDRHSHPTLPIKTPVIAGGLFMIRKDWFETLGTYDPDMDVWGGENLELS 344
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F + ++W
Sbjct: 345 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGNVFQKNTR---------RAAEVWLD 395
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE----------------VSN 214
+ L K DFGD++ R +L+ L CK+F WYL+ ++
Sbjct: 396 DYKMLYLKQVPSARFVDFGDISERLKLKEQLHCKNFTWYLKEVYPELKIPEREDGLYLTF 455
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACL-DYAGGDV 270
+G+CIDS K T H PVG+Y CH GGNQ W+ K ++ + C+ D G V
Sbjct: 456 KQAGLCIDSLGKQT-AHSPVGVYSCHGTGGNQEWVFDKQKGTLKNPFTKLCMSDSDIGVV 514
Query: 271 ILYPCHGSKG 280
L C + G
Sbjct: 515 SLQKCETADG 524
>gi|357624672|gb|EHJ75362.1| hypothetical protein KGM_04161 [Danaus plexippus]
Length = 771
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 103/326 (31%), Positives = 148/326 (45%), Gaps = 63/326 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL+PLL ++ V++PLI I TFEL ++ +F +GGF +
Sbjct: 414 CEVNVDWLRPLLQRISHKRDAVLTPLIDVIDQSTFELE-------AAQQFQVGGFTFMGH 466
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +PERE++R + P W+PTMAGGLF+I++ ++ +LG YD WGGENLE+SF
Sbjct: 467 FTWIEVPEREKRRRGSDIAPTWSPTMAGGLFAINRQYYWELGAYDEQMAGWGGENLEMSF 526
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP--TMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
+ +P A P P T G+ + A ++W
Sbjct: 527 RIWQCGGTLETVPCSRVGHVFRAFHPYGLPAHTDTHGINTARMA------------EVWM 574
Query: 177 GENLELSF--------KGDFGDVTSRKELRRNLGCKSFKWYLE----------------- 211
E EL + GDVT RK LR L CKSF+WYL+
Sbjct: 575 DEYAELFYLNRPDLRKSPKIGDVTHRKILREKLKCKSFQWYLDNIYKEKFVPVRDVFGYG 634
Query: 212 -VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDY---- 265
N S MC+D+ + + +GLYPCH + Q +S GE+R +E C +
Sbjct: 635 RFMNPSSAMCLDTLQREGEA-TALGLYPCHSRLEPTQHLALSLAGELRDEEKCAEVQSPV 693
Query: 266 -----AGGDVILYPCHGSKGNQYFEY 286
V++ CHG Q++ Y
Sbjct: 694 GSNENVSRRVLMVTCHGKHRGQHWRY 719
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/199 (32%), Positives = 88/199 (44%), Gaps = 32/199 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFE--------------LRFPPGRLTS 49
CEVQ+ WL+PLL + VV P+I I F R R+
Sbjct: 344 CEVQEDWLRPLLQRIRDFPHAVVVPIIDVIESSNFYYSVQDPVIFQGLILARISGARIAR 403
Query: 50 SYKFFIGGFDWNLQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD- 108
+ +W P +R HK A V TP + ID++ FE
Sbjct: 404 GDVLIFLDSHCEVNVDWLR-PLLQRISHKRDA--VLTPLID----VIDQSTFELEAAQQF 456
Query: 109 --SGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG 166
GF G F W +PERE++R + P W+PTMAGGLF+I++ ++ +LG
Sbjct: 457 QVGGFTFMG--------HFTWIEVPEREKRRRGSDIAPTWSPTMAGGLFAINRQYYWELG 508
Query: 167 TYDSGFDIWGGENLELSFK 185
YD WGGENLE+SF+
Sbjct: 509 AYDEQMAGWGGENLEMSFR 527
>gi|348580113|ref|XP_003475823.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Cavia porcellus]
Length = 622
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 102/313 (32%), Positives = 148/313 (47%), Gaps = 47/313 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELR--FPPGRLTSSYKFFIGGFDWN 61
CE WL+PLL +A N VVSP I I +TFE P GR+ S G FDW
Sbjct: 272 CECFHGWLEPLLARIAENKMAVVSPDIVTINLNTFEFSKPIPEGRIHSR-----GNFDWI 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W A+P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWEALPAHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + + Y
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMDD---YK 440
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F + +++ + FGD++ R +LR L C +F W+L
Sbjct: 441 KIFYRRNLQAAKIAQEKSFGDISERLQLRERLHCHNFSWFLSNIYPEMFVPDLSPTFYGA 500
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N C+D + KP+ +Y CH GGNQ++ + E+R + A CL G
Sbjct: 501 IKNLGINQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRELRHNVAKQLCLHARAG 559
Query: 269 DVILYPCHGSKGN 281
+ L CH + N
Sbjct: 560 TLGLRACHFTGKN 572
>gi|313241234|emb|CBY33515.1| unnamed protein product [Oikopleura dioica]
Length = 603
Score = 150 bits (379), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 100/312 (32%), Positives = 150/312 (48%), Gaps = 50/312 (16%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
E WL+PLL +A + S V P+I+ I F ++S + IGGFDW L F
Sbjct: 249 ECNNGWLEPLLQRIAEDDSVVAVPIISTIAWQDFAFHHS----SNSIEPQIGGFDWRLTF 304
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
WH+IP+ + + K +PV TPTMAGGLF++ + +F +G+YD+G ++WGGENLE+SF+
Sbjct: 305 QWHSIPDEIKAKRKADTDPVPTPTMAGGLFAVSRQYFRSIGSYDTGMEVWGGENLEMSFR 364
Query: 125 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS--GFDIWGGE---- 178
W + + ++ G +F + K T ++ ++W +
Sbjct: 365 V-WMC----------GGSLEIIPCSIVGHVFPKTAPYERKSFTPNTVRAVEVWLDDYKRH 413
Query: 179 ---NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSGM--------- 219
LS +GD++ R LR L CKSF+WYLE V D G
Sbjct: 414 FYARNPLSKDEKYGDISERVNLRNGLECKSFQWYLENIYPDLPVPEDTPGQFGALHNKGS 473
Query: 220 ---CIDSACKPTDM-HKPVGLYPCHKQGGNQFWMMSKHGEIR---RDEACL---DYAGGD 269
C+D D+ H VG + CH QGGNQF+ + G +R + E C+ D G+
Sbjct: 474 PSRCLDYNPPENDLTHGVVGTFGCHGQGGNQFFEFNSKGHLRYTSQFELCIAKKDDNSGE 533
Query: 270 VILYPCHGSKGN 281
+ C+G N
Sbjct: 534 IAAVMCNGKNVN 545
>gi|27696612|gb|AAH43331.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Mus musculus]
Length = 633
Score = 150 bits (379), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 150/303 (49%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P + + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGNNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E+++ L CK+F WYL + +
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYLNTIYPEAYVPDLNPVISGYIKSV 512
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ LY CH GGNQ++ S EIR + E CL G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQREIRHNIQKELCLHATQGVVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|62148926|dbj|BAD93347.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase-3 [Rattus
norvegicus]
Length = 633
Score = 150 bits (379), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 100/303 (33%), Positives = 152/303 (50%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI + +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISRDYFEHIGSYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R+K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRNKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E+++ L CK+F WYL + +
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYLNTIYPEVYVPDLNPVISGYIKSV 512
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ LY CH GGNQ++ S EIR + E CL G V L
Sbjct: 513 GQPLCLDVG-ENNQGDKPLILYTCHGLGGNQYFEYSAQREIRHNIQKELCLHATQGVVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|405966237|gb|EKC31544.1| Putative polypeptide N-acetylgalactosaminyltransferase 9, partial
[Crassostrea gigas]
Length = 513
Score = 150 bits (378), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 99/324 (30%), Positives = 154/324 (47%), Gaps = 67/324 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE K WL+PLL+ +A + +V P +I DTFE + +S F+GGFD++L
Sbjct: 145 CECTKGWLEPLLNEIADDYRNVAIPFTDSIDADTFEYKG-----SSLNYVFVGGFDFDLH 199
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P+RE+ R + +P+W+PT G +I K FF++LG YD+ IWGGENLELSF
Sbjct: 200 FAWRVMPDREQNRRRLLTDPIWSPTHLGCCLAISKRFFDELGRYDNELQIWGGENLELSF 259
Query: 124 K-------------------------FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSID 158
K ++W R R+ VW +
Sbjct: 260 KTWMCGGKMKIIPCSHVGHVFRHKMPYSWGKDGYRTFIRNSLRVAEVW-------MDQYK 312
Query: 159 KAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-------- 210
+ +++++ Y S +I D G+++SRK +R+ L CK F WYL
Sbjct: 313 EVYYDRI--YYSQNEI------------DIGNISSRKAIRQRLHCKPFDWYLKNVYPELY 358
Query: 211 -----EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY 265
+ + + MCIDS + KP+ C G +Q W ++ IRRDE CL +
Sbjct: 359 IPRDCKATGQINNMCIDSYTGGSFYGKPISARECIHLGTSQHWTWTRENTIRRDEGCLVF 418
Query: 266 AG-GDVILYPCHGSKGNQYFEYDY 288
G V++ PC + ++Y +++Y
Sbjct: 419 DGISRVLMGPC--ATLSKYLQWEY 440
>gi|321477075|gb|EFX88034.1| hypothetical protein DAPPUDRAFT_305669 [Daphnia pulex]
Length = 553
Score = 150 bits (378), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 104/313 (33%), Positives = 146/313 (46%), Gaps = 56/313 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + + + +V P+I I D+F+ L GGFDWNL
Sbjct: 206 CECNEGWLEPLLARVVEDRTRIVCPVIDVIAMDSFQYIAASTELR-------GGFDWNLV 258
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W +P E+ R + P+ TP +AGGLF ID+ +F+KLG+YD DIWGGENLE+S
Sbjct: 259 FKWELLPAEEKANRKTDPTIPIRTPMIAGGLFVIDRQYFQKLGSYDLQMDIWGGENLEIS 318
Query: 123 FKFNWHAIPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
F+ W E RK+H P P +G +F+ + ++ D
Sbjct: 319 FR-TWQCGGRLEIVPCSRVGHVFRKQH-----PYSFPGGSGTIFARNTRRAAEVWMDD-- 370
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------VSNDWSG-- 218
+ + + ++ FG++T R LR +L CK FKWY+E D SG
Sbjct: 371 YKKYYFAAVPMARTVTFGNITDRLALRNSLNCKPFKWYVENVYPELLKHLPTVRDPSGTN 430
Query: 219 --------MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL-----DY 265
+C D+ + H +GLY CH GGNQ W +G +R CL Y
Sbjct: 431 SGAIKYKSLCFDTYGRGAGSH--IGLYACHMTGGNQAWTY-LNGRLRHGSWCLAPPTPAY 487
Query: 266 AGGDVILYPCHGS 278
G VI PC S
Sbjct: 488 VGAQVITLPCSSS 500
>gi|397507787|ref|XP_003824367.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Pan
paniscus]
Length = 633
Score = 150 bits (378), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E++ L CK+F WYL + +
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ +Y CH GGNQ++ S EIR + E CL A G V L
Sbjct: 513 GQSLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KGC 574
>gi|170046214|ref|XP_001850669.1| polypeptide N-acetylgalactosaminyltransferase 2 [Culex
quinquefasciatus]
gi|167869055|gb|EDS32438.1| polypeptide N-acetylgalactosaminyltransferase 2 [Culex
quinquefasciatus]
Length = 576
Score = 150 bits (378), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 104/286 (36%), Positives = 142/286 (49%), Gaps = 42/286 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + + VV P+I I DTF+ L GGFDWNL
Sbjct: 234 CECNVDWLEPLLVRVQEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWNLV 286
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ER +R K+ P+ TP +AGGLF IDKA+FEKLG YD+ DIWGGENLE+S
Sbjct: 287 FKWEYLSNAERHERQKDPTTPIRTPMIAGGLFVIDKAYFEKLGKYDTQMDIWGGENLEIS 346
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSG 171
F+ + IP RKRH P P +G +F+ + ++ D
Sbjct: 347 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGGSGNIFAKNTRRAAEVWMDD-- 399
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV--------------SNDWS 217
+ + + L+ FG++ R +L+ L CK+FKWYL+ S
Sbjct: 400 YKQYYYAAVPLAKNIPFGNIDERLQLKEQLECKNFKWYLDNVYPQLTIPEQQTKGSLRQG 459
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL 263
CID+ D VGLY CH GGNQ W ++K G+I+ + CL
Sbjct: 460 PYCIDTLGHLVD--GIVGLYHCHNSGGNQDWAITKSGQIKHLDLCL 503
>gi|432112638|gb|ELK35354.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Myotis davidii]
Length = 416
Score = 150 bits (378), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 101/319 (31%), Positives = 152/319 (47%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFP--PGRLTSSYKFFIGGFDWN 61
CE WL+PLL + + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 66 CECFHGWLEPLLARITEDETAVVSPDIVTIDLNTFEFSKPVQKGRVHSR-----GNFDWS 120
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 121 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 180
Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + + +Y
Sbjct: 181 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTNVIARNQVRLAEVWMD---SYK 234
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F E +++ + FGD++ R +LR L C++F W+L
Sbjct: 235 EIFYRRNMEAAKMAQEKTFGDISERLQLREQLHCRNFSWFLHNIYPELFIPDLKPTFYGA 294
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N C+D K KP+ +Y CH GGNQ++ + ++R + A CL + G
Sbjct: 295 IKNLGINQCLDVGEK-NHGGKPLIMYACHGLGGNQYFEYTTQRDLRHNIAKQLCLHASAG 353
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ L CH + N D
Sbjct: 354 TLGLRSCHFTGKNSQVPKD 372
>gi|332234083|ref|XP_003266237.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Nomascus leucogenys]
Length = 633
Score = 150 bits (378), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E++ L CK+F WYL + +
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ +Y CH GGNQ++ S EIR + E CL A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|402888519|ref|XP_003907606.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Papio
anubis]
Length = 633
Score = 150 bits (378), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E++ L CK+F WYL + +
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ +Y CH GGNQ++ S EIR + E CL A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|75832150|ref|NP_001015032.2| polypeptide N-acetylgalactosaminyltransferase 3 [Rattus norvegicus]
gi|74353669|gb|AAI01887.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Rattus
norvegicus]
gi|149022135|gb|EDL79029.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Rattus norvegicus]
Length = 633
Score = 150 bits (378), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 100/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI + +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISRDYFEHIGSYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E+++ L CK+F WYL + +
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYLNTIYPEVYVPDLNPVISGYIKSV 512
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ LY CH GGNQ++ S EIR + E CL G V L
Sbjct: 513 GQPLCLDVG-ENNQGDKPLILYTCHGLGGNQYFEYSAQREIRHNIQKELCLHATQGVVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|297668747|ref|XP_002812581.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
1 [Pongo abelii]
gi|297668749|ref|XP_002812582.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
2 [Pongo abelii]
gi|297668751|ref|XP_002812583.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
3 [Pongo abelii]
Length = 633
Score = 150 bits (378), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E++ L CK+F WYL + +
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ +Y CH GGNQ++ S EIR + E CL A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|109099998|ref|XP_001096023.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
1 [Macaca mulatta]
gi|297264195|ref|XP_002798936.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
2 [Macaca mulatta]
gi|355564937|gb|EHH21426.1| hypothetical protein EGK_04492 [Macaca mulatta]
gi|355750584|gb|EHH54911.1| hypothetical protein EGM_04018 [Macaca fascicularis]
Length = 633
Score = 150 bits (378), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E++ L CK+F WYL + +
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ +Y CH GGNQ++ S EIR + E CL A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|189066640|dbj|BAG36187.1| unnamed protein product [Homo sapiens]
Length = 633
Score = 150 bits (378), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 105/303 (34%), Positives = 154/303 (50%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E +R K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEEQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSGMCIDS 223
+ ++ + FGD++ R E++ L CK+F WYL +++ SG I S
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGY-IKS 511
Query: 224 ACKPTDMH--------KPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
A +P + KP+ +Y CH GGNQ++ S EIR + E CL A G V L
Sbjct: 512 AGQPLCLDVGENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|426372562|ref|XP_004053192.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Gorilla
gorilla gorilla]
Length = 622
Score = 150 bits (378), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 101/319 (31%), Positives = 154/319 (48%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIPE-------RERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + + +Y
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F + +++ + FGD++ R +LR L C +F WYL
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 500
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N + C+D + KP+ +Y CH GGNQ++ + ++R + A CL + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ L CH + N + D
Sbjct: 560 ALGLGSCHFTGKNSHVPKD 578
>gi|339244173|ref|XP_003378012.1| polypeptide N-acetylgalactosaminyltransferase 3 [Trichinella
spiralis]
gi|316973116|gb|EFV56743.1| polypeptide N-acetylgalactosaminyltransferase 3 [Trichinella
spiralis]
Length = 670
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 105/320 (32%), Positives = 150/320 (46%), Gaps = 53/320 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV WL+PLL ++ + + VV+P+I I DDTF+ ++ + GGF W +
Sbjct: 227 VEVTDGWLEPLLSRISEDRTRVVAPVIDVISDDTFQY-------VTAAESTWGGFSWTMN 279
Query: 64 FNWHAIPERERKRH-KNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ RE+KR KN P+ TPT+AGGLFSID+ +F +G YD G IWGGENLE+S
Sbjct: 280 FRWYQASAREQKRRGKNKTTPIRTPTIAGGLFSIDRKYFFDIGAYDEGMRIWGGENLEIS 339
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG ++ G ++W E
Sbjct: 340 FRV-WMCGGTLEINPCSHVGHVFRKQTPYTFEGGTSNV------IYGNARRTAEVWMDEY 392
Query: 180 LELSFK-------GDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSND 215
E +K G+++ R LR+ LGCKSFKWYL+ + N+
Sbjct: 393 KEFYYKMTPSAMFAPLGNISDRIALRKRLGCKSFKWYLKNIYPESNIPPTYYSIGYIKNE 452
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQ-----FWMMSKHGEIRRDEACLDY---AG 267
+ +C+D+ + L CH GGNQ W + IR DE CL A
Sbjct: 453 KNDLCLDTMGRKASGSP--ALLTCHNSGGNQVLFMKVWSYTGTLNIRADELCLQASRKAD 510
Query: 268 GDVILYPCHGSKGNQYFEYD 287
+ L C+ + +Q ++YD
Sbjct: 511 SPIFLQQCNNDE-SQIWDYD 529
>gi|196001847|ref|XP_002110791.1| hypothetical protein TRIADDRAFT_22565 [Trichoplax adhaerens]
gi|190586742|gb|EDV26795.1| hypothetical protein TRIADDRAFT_22565 [Trichoplax adhaerens]
Length = 556
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 97/313 (30%), Positives = 153/313 (48%), Gaps = 51/313 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL+PLL+ + ++ + VV P I +I + F ++ P + G F+W+L
Sbjct: 206 CEVTIGWLEPLLNRIHQDRTTVVCPEIDSIDLNNFAYKYGPSGV------LRGTFNWDLS 259
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W P ER R +A +P+ +PTMAGGLF+ID+ +F +LGTYD G +IWG EN+ELSF
Sbjct: 260 FKWSIAPTSERLRRTSATDPMRSPTMAGGLFAIDREYFLELGTYDRGLEIWGAENMELSF 319
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K IP +P T + L SI ++++ ++W +
Sbjct: 320 KVWQCGGKLEIIPCSHVGHVFREVQPYDT---SVSLHSIANKNYQRVA------EVWMDD 370
Query: 179 NLELSFK-------GDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
+ ++ FGD++ +LR+ L C+SF+WYL+ V
Sbjct: 371 YKKFFYQRHPYLTDQSFGDISENLKLRQRLKCRSFRWYLQNVFTDVILPNETAIATGKVR 430
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGD 269
N S MC+D+ + ++ +GL PC+ Q + + EI ++ACLD + G
Sbjct: 431 NPISNMCLDTFGRTSNTF--LGLSPCNIQRDTMLFAYTSRKEISWNDACLDASFIMPGFK 488
Query: 270 VILYPCHGSKGNQ 282
+ + CH GNQ
Sbjct: 489 IQMAECHRIGGNQ 501
>gi|153266878|ref|NP_004473.2| polypeptide N-acetylgalactosaminyltransferase 3 [Homo sapiens]
gi|209572629|sp|Q14435.2|GALT3_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
AltName: Full=Polypeptide GalNAc transferase 3;
Short=GalNAc-T3; Short=pp-GaNTase 3; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 3;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 3
gi|62822129|gb|AAY14678.1| unknown [Homo sapiens]
gi|109731077|gb|AAI13568.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Homo
sapiens]
gi|109731742|gb|AAI13566.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Homo
sapiens]
gi|119631729|gb|EAX11324.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3), isoform
CRA_b [Homo sapiens]
gi|313883200|gb|ADR83086.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3)
[synthetic construct]
Length = 633
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E++ L CK+F WYL + +
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ +Y CH GGNQ++ S EIR + E CL A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|109096689|ref|XP_001083664.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Macaca
mulatta]
Length = 641
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 102/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIPE-------RERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + + +Y
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F + +++ + FGD++ R +LR L C SF WYL
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHSFSWYLHNVYPEMFVPDLTPTFYGA 500
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N + C+D + KP+ +Y CH GGNQ++ + ++R + A CL + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ L CH + N D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578
>gi|296204662|ref|XP_002749425.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Callithrix jacchus]
Length = 633
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 103/303 (33%), Positives = 153/303 (50%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P + + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDMNTFEFNKPSPYGSHHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSG----- 218
+ ++ + FGD++ R E++ L CK+F WYL +++ SG
Sbjct: 453 RRNTDAAKIVKQKTFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512
Query: 219 ---MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ +Y CH GGNQ++ S EIR + E CL A G V L
Sbjct: 513 GHPLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|313231736|emb|CBY08849.1| unnamed protein product [Oikopleura dioica]
Length = 603
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 99/307 (32%), Positives = 149/307 (48%), Gaps = 50/307 (16%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL+PLL +A + S V P+I+ I F ++S + IGGFDW L F WH+I
Sbjct: 254 WLEPLLQRIAEDDSVVAVPIISTIAWQDFGFHHS----SNSIEPQIGGFDWQLTFQWHSI 309
Query: 70 PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA 129
P+ + + K +PV TPTMAGGLF++ + +F +G+YD+G ++WGGENLE+SF+ W
Sbjct: 310 PDEIKAKRKADTDPVPTPTMAGGLFAVSRQYFRSIGSYDTGMEVWGGENLEMSFRV-WMC 368
Query: 130 IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS--GFDIWGGE-------NL 180
+ + ++ G +F + K T ++ ++W +
Sbjct: 369 ----------GGSLEIIPCSIVGHVFPKTAPYERKSFTPNTVRAVEVWLDDYKRHFYARN 418
Query: 181 ELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSGM------------CI 221
LS +GD++ R LR L CKSF+WYLE V D G C+
Sbjct: 419 PLSKDEKYGDISERVNLRNGLECKSFQWYLENIYPDLPVPEDTPGQFGALHNKGSPSRCL 478
Query: 222 DSACKPTDM-HKPVGLYPCHKQGGNQFWMMSKHGEIR---RDEACL---DYAGGDVILYP 274
D D+ H VG + CH QGGNQF+ + G +R + E C+ D G++
Sbjct: 479 DYNPPENDLTHGVVGTFGCHGQGGNQFFEFNSKGHLRYTSQFELCIAKKDDNSGEIAAVM 538
Query: 275 CHGSKGN 281
C+G N
Sbjct: 539 CNGKNVN 545
>gi|354487360|ref|XP_003505841.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Cricetulus griseus]
Length = 633
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 102/302 (33%), Positives = 149/302 (49%), Gaps = 37/302 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P + + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGNNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI + +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISREYFEHIGSYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVS---------NDWSGMCIDSA 224
+ ++ + FGD++ R E+++ L CK+F WYL N I S
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYLNTVYPEVYVPDLNPVISGYIKSV 512
Query: 225 CKPTDMH--------KPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVILY 273
+P + KP+ LY CH GGNQ++ S EIR + E CL G V L
Sbjct: 513 GQPLCLDVGENNQGGKPLILYTCHGLGGNQYFEYSAQREIRHNIQKELCLHATQGVVQLK 572
Query: 274 PC 275
C
Sbjct: 573 AC 574
>gi|355564239|gb|EHH20739.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Macaca mulatta]
gi|355762987|gb|EHH62101.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Macaca
fascicularis]
gi|380809242|gb|AFE76496.1| polypeptide N-acetylgalactosaminyltransferase 6 [Macaca mulatta]
Length = 622
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 102/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + + +Y
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F + +++ + FGD++ R +LR L C SF WYL
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHSFSWYLHNVYPEMFVPDLTPTFYGA 500
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N + C+D + KP+ +Y CH GGNQ++ + ++R + A CL + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ L CH + N D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578
>gi|1617312|emb|CAA63371.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
(GalNAc-T3) [Homo sapiens]
Length = 633
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E++ L CK+F WYL + +
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLRCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ +Y CH GGNQ++ S EIR + E CL A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|402886019|ref|XP_003906439.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Papio anubis]
gi|402886021|ref|XP_003906440.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Papio anubis]
Length = 622
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 102/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + + +Y
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F + +++ + FGD++ R +LR L C SF WYL
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHSFSWYLHNVYPEMFVPDLTPTFYGA 500
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N + C+D + KP+ +Y CH GGNQ++ + ++R + A CL + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ L CH + N D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578
>gi|256052108|ref|XP_002569620.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
Length = 573
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 106/318 (33%), Positives = 150/318 (47%), Gaps = 53/318 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL+ LL ++ N +V P+I I DTFE R G FDW
Sbjct: 223 CEVTIGWLETLLKHISENQKRIVCPIIDVISHDTFEYLLGSDRTW-------GTFDWQFN 275
Query: 64 FNWHAIPERERKRHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F+W + +RE R + P+ TPTMAGGLF+I + +F ++G YD +IWGGEN+ELS
Sbjct: 276 FHWETVVDREIDRINDEHNVPLRTPTMAGGLFTITREYFYEIGAYDEDMEIWGGENIELS 335
Query: 123 FKFNWHA-----IPERERKRH--KNAAEPVWTPTMAGGLFSIDKAFFEK-----LGTYDS 170
F+ W I R H + ++ W GG+ I F + L Y
Sbjct: 336 FRV-WQCGGELLIDPCSRVGHVFRKSSPYTW----PGGVSHILHKNFVRTALVWLDQYSR 390
Query: 171 GFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDW------- 216
+ + L + D+GDVT RK+LR+ L CKSF+WYLE + D
Sbjct: 391 FYFMLNPSALSV----DYGDVTKRKKLRQQLNCKSFRWYLEHIYPESSIPIDVIRLGEIR 446
Query: 217 --SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD------YAGG 268
SG C+DS + + VG+ CH QGGNQ + +++ G IR C+D G
Sbjct: 447 HKSGQCLDSLGHK--LGETVGVTHCHGQGGNQVFAITESGTIRVHAGCMDGGSSKSVGTG 504
Query: 269 DVILYPCHGSKGNQYFEY 286
++ C +Q FE+
Sbjct: 505 ILVFKKCEKDSISQKFEF 522
>gi|156353877|ref|XP_001623135.1| predicted protein [Nematostella vectensis]
gi|156209801|gb|EDO31035.1| predicted protein [Nematostella vectensis]
Length = 454
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 102/306 (33%), Positives = 135/306 (44%), Gaps = 39/306 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE K WL PLL+ +A N V P I I TF+ + + G F+W
Sbjct: 155 CECNKGWLPPLLERIALNRRTAVCPTIDFIDHKTFQYK-------PMDPYIRGTFNWRFD 207
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ A+ E + ++ + V +P MAGGLF+I++ FF +LG YD G IWGGE E+SF
Sbjct: 208 YKERAVRPEEMAKRRDPTQEVKSPVMAGGLFAINREFFSELGQYDPGMFIWGGEQYEISF 267
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K IP P P L + + + Y W +
Sbjct: 268 KLWQCGGQLENIPCSRVGHVYRHHVPYTYPKHDATLVNFRRVAEVWMDEYKD----WLYD 323
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE----------------VSNDWSGMCID 222
D+GD++ R LR+ L CKSFKWYLE V N MC+D
Sbjct: 324 KRPEIKSVDYGDISDRIALRKRLKCKSFKWYLENVANDTVKTKLCACFQVRNQGKNMCLD 383
Query: 223 SACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD----YAGGDVILYPCHGS 278
S + D H VGL CH GGNQ + + E+R DE C D + G V +PCH
Sbjct: 384 SMGR-KDGH--VGLASCHNMGGNQAFQYTYIRELRTDETCFDVHESFPGAKVHFFPCHEM 440
Query: 279 KGNQYF 284
KGNQ F
Sbjct: 441 KGNQEF 446
>gi|410964449|ref|XP_003988767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Felis
catus]
Length = 622
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 102/316 (32%), Positives = 153/316 (48%), Gaps = 41/316 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELR--FPPGRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDETVVVSPDIVTIDLNTFEFSKPVPRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W A+P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWEALPAHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY-DSGFDIW 175
SF+ IP P T G+ I + + DS +I+
Sbjct: 387 SFRVWQCGGQMEIIPCSVVGHVFRTKSP---HTFPKGISVIARNQVRLAEVWMDSYKEIF 443
Query: 176 GGENLE---LSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSN 214
NL+ ++ + FGD++ R +L+ L C++F W+L + N
Sbjct: 444 YRRNLQAAKMAQEKSFGDISERLQLKERLHCRNFSWFLHNIYPEMFVPDLKPTFYGAIRN 503
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGGDVI 271
C+D + KP+ +Y CH GGNQ++ + ++R + A CL + G +
Sbjct: 504 LGVDQCLDVG-ENNHGGKPLIMYTCHGLGGNQYFEYTTQRDLRHNIAKQLCLHASAGTLG 562
Query: 272 LYPCHGSKGNQYFEYD 287
L CH + N D
Sbjct: 563 LRSCHFTGQNSQVPKD 578
>gi|395844920|ref|XP_003795196.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Otolemur garnettii]
Length = 633
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 105/303 (34%), Positives = 154/303 (50%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P + + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGGNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P++E++R K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDQEKQRRKDETYPIKTPTFAGGLFSISKKYFEYIGSYDDEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSGMCIDS 223
+ ++ + FGD++ R E++ L CK+F WYL +++ SG I S
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGY-IKS 511
Query: 224 ACKPTDMH--------KPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
KP + KP+ +Y CH GGNQ++ S EIR + E CL A G V L
Sbjct: 512 IGKPLCLDVGENNQGGKPLIMYTCHGLGGNQYFEYSSLREIRHNIQKELCLHAAKGPVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|114581503|ref|XP_515871.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Pan
troglodytes]
gi|410331347|gb|JAA34620.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Pan
troglodytes]
Length = 633
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E++ L CK+F WYL + +
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ +Y CH GGNQ++ S EIR + E CL A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KGC 574
>gi|410899503|ref|XP_003963236.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Takifugu rubripes]
Length = 618
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 99/287 (34%), Positives = 133/287 (46%), Gaps = 31/287 (10%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VVSP I I ++F+ P SS+ F G FDW+L
Sbjct: 266 CECFHGWLEPLLARIVEEPTAVVSPEITTIDLESFQFNKPA---PSSHAFNRGNFDWSLT 322
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IPE RK K+ PV TPT AGGLFSI K +FE +GTYD +IWGGEN+E+SF
Sbjct: 323 FGWEQIPEAARKLRKDETCPVKTPTFAGGLFSILKTYFEHIGTYDDKMEIWGGENIEMSF 382
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSGFDIWGG 177
+ IP P P + E + Y F
Sbjct: 383 RVWQCGGQLEIIPCSVVGHVFRTKSPHTFPKGTEVITRNQVRLAEVWMDDYKKIFYRRNK 442
Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGM 219
+++ + ++GD++ R LR L CK+F WYL + N S
Sbjct: 443 NAAKMAKENNYGDISERLNLRERLHCKNFSWYLNTVYPEAFVPDLTPDRFGAIKNQGSKT 502
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACL 263
C+D + KPV +Y CH GGNQ++ S H E+R + E CL
Sbjct: 503 CLDVG-ENNLGGKPVMMYTCHNMGGNQYFEYSSHKELRHNIGKELCL 548
>gi|332206188|ref|XP_003252173.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Nomascus leucogenys]
Length = 622
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + + +Y
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F + +++ + FGD++ R +LR L C +F WYL
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 500
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N + C+D + KP+ +Y CH GGNQ++ + ++R + A CL + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ L CH + N D
Sbjct: 560 ALGLRSCHFTGKNSQVPKD 578
>gi|89365963|gb|AAI14506.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
sapiens]
Length = 622
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + + +Y
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F + +++ + FGD++ R +LR L C +F WYL
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERPQLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 500
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N + C+D + KP+ +Y CH GGNQ++ + ++R + A CL + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ L CH + N D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578
>gi|357629476|gb|EHJ78219.1| hypothetical protein KGM_03405 [Danaus plexippus]
Length = 353
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 103/313 (32%), Positives = 145/313 (46%), Gaps = 59/313 (18%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
E WL+PLL + + + VV P+I I DTF+ L GGFDWNL F
Sbjct: 24 ECNVHWLEPLLQRIKEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWNLVF 76
Query: 65 NWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
W + + ER R + + + TP +AGGLFS+D+ +F KLG YD D+WGGENLE+SF
Sbjct: 77 KWEYLSQAERGARLSDPTQVIRTPMIAGGLFSMDRKYFSKLGKYDMKMDVWGGENLEISF 136
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P +G +F+ + +
Sbjct: 137 RVWQCGGSLEIVPCSRVGHVFRKRH-----PYSFPGGSGAVFARNTR---------RAAE 182
Query: 174 IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE--------------V 212
+W + EL ++ DFGD++ R +R+ L CK F+WYLE +
Sbjct: 183 VWMDDYKELYYRSQPLAKQVDFGDISERVSIRQRLHCKPFRWYLEHVYPELRVPTFGNSI 242
Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
+ C+D+ D V +YPCH GGNQ W +G IR CL + D
Sbjct: 243 AIKQGPRCLDTMGHQVD--GTVAMYPCHNTGGNQEWSFD-NGLIRHQSLCLGLSQEDSVT 299
Query: 270 VILYPCHGSKGNQ 282
V+L C S NQ
Sbjct: 300 VVLAVCDPSDHNQ 312
>gi|260823684|ref|XP_002606210.1| hypothetical protein BRAFLDRAFT_246892 [Branchiostoma floridae]
gi|229291550|gb|EEN62220.1| hypothetical protein BRAFLDRAFT_246892 [Branchiostoma floridae]
Length = 595
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 103/331 (31%), Positives = 151/331 (45%), Gaps = 64/331 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K+WL+PLL +A + + VV P+I I DTFE P GGF+W L
Sbjct: 237 CEVSKQWLEPLLARIAEDRTRVVCPIIDIINSDTFEYTASP--------LVRGGFNWGLH 288
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P++ + AA P+ +PTMAGGLF+ID+ +F++LG YD G DIWGGENLE+SF
Sbjct: 289 FKWDQVPQQLLQGPDGAAAPINSPTMAGGLFAIDREYFDELGRYDEGMDIWGGENLEISF 348
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RKR + P TM+ + + D D
Sbjct: 349 RIWMCGGTLEIIPCSRVGHVFRKR-RPYGSPNGEDTMSKNSLRMAHVWM------DEYKD 401
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
+ E+ + +GD++ R +LR L C SFKWYL+
Sbjct: 402 QYFSLRPEMKTR-TYGDISDRLKLREKLNCHSFKWYLDNIYPELFVPGGDKLKQVGVGQL 460
Query: 212 -----------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR-RD 259
+ + SG+C+ S P + V + C + NQ W ++ E++
Sbjct: 461 PPRPKVIKKGHIKHLDSGLCLISQNGPNEKGSLVVVSECLSEDKNQVWYLTDQDELQLTG 520
Query: 260 EACLDYAGGDVILYP----CHGSKGNQYFEY 286
CLD D +P CHG+ G Q +++
Sbjct: 521 LLCLDVNENDPKSFPRIMKCHGTSGGQQWKF 551
>gi|403258871|ref|XP_003921965.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Saimiri
boliviensis boliviensis]
Length = 633
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 149/303 (49%), Gaps = 39/303 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P + + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSHHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P+ E++R K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWETLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
+ ++ + FGD++ R E++ L CK+F WYL + +
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ +Y CH GGNQ++ S EIR + E CL A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571
Query: 273 YPC 275
C
Sbjct: 572 KAC 574
>gi|348585909|ref|XP_003478713.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Cavia porcellus]
Length = 633
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 105/307 (34%), Positives = 152/307 (49%), Gaps = 38/307 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP IA+I +TFE P T+ + G FDW+L
Sbjct: 280 CECFYGWLEPLLARIADNYTAVVSPDIASIDLNTFEFNKPSPYGTNHNR---GNFDWSLS 336
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ E++R K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T +A + + + ++ Y F
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVS---------NDWSGMCIDSA 224
E ++ + FGD++ R +R+ L CK+F WYL N I S
Sbjct: 453 RRNTEAAKIVKQKTFGDLSKRFAIRKRLQCKNFTWYLNTVYPEVYVPDLNPVISGYIKSV 512
Query: 225 CKPTDMH--------KPVGLYPCHKQGGNQFWMMSKHGEIR---RDEACLDYAGGDVILY 273
+P + KP+ LY CH GGNQ++ S EIR + E CL +A D++
Sbjct: 513 GQPLCLDVGENNQGGKPLILYTCHGLGGNQYFEYSAQHEIRHSIQKELCL-HATSDLLQL 571
Query: 274 PCHGSKG 280
KG
Sbjct: 572 KACAYKG 578
>gi|410210024|gb|JAA02231.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
troglodytes]
gi|410247040|gb|JAA11487.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
troglodytes]
gi|410351197|gb|JAA42202.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
troglodytes]
Length = 622
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIPE-------RERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + + +Y
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F + +++ + FGD++ R +LR L C +F WYL
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLMPTFYGA 500
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N + C+D + KP+ +Y CH GGNQ++ + ++R + A CL + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ L CH + N D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578
>gi|149714568|ref|XP_001504374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Equus
caballus]
Length = 622
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 101/316 (31%), Positives = 153/316 (48%), Gaps = 41/316 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDETAVVSPDIVTIDLNTFEFSKPVQRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W A+P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LSFGWEALPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF-DIW 175
SF+ IP P T G+ I + + G+ +I+
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSP---HTFPKGISVIARNQVRLAEVWMDGYKEIF 443
Query: 176 GGENLE---LSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSN 214
N++ ++ + FGD++ R +LR L C +F W+L+ + N
Sbjct: 444 YRRNMQAAKMAQEKSFGDISERLQLRERLHCHNFSWFLQNIYPEMFVPDLKPTFYGAIKN 503
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGGDVI 271
C+D + KP+ +Y CH GGNQ++ + ++R + A CL + G +
Sbjct: 504 LGIDHCLDVG-ENNHGGKPLIMYTCHGLGGNQYFEYTTQRDLRHNIAKQLCLHASAGTLG 562
Query: 272 LYPCHGSKGNQYFEYD 287
L CH + N D
Sbjct: 563 LRSCHFTGKNSQVPKD 578
>gi|397479051|ref|XP_003810846.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Pan paniscus]
gi|397479053|ref|XP_003810847.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Pan paniscus]
Length = 622
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + + +Y
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F + +++ + FGD++ R +LR L C +F WYL
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 500
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N + C+D + KP+ +Y CH GGNQ++ + ++R + A CL + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ L CH + N D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578
>gi|115298684|ref|NP_009141.2| polypeptide N-acetylgalactosaminyltransferase 6 [Homo sapiens]
gi|51316028|sp|Q8NCL4.2|GALT6_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
AltName: Full=Polypeptide GalNAc transferase 6;
Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 6;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6
gi|37572269|gb|AAH35822.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
sapiens]
gi|119578594|gb|EAW58190.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
sapiens]
gi|123980642|gb|ABM82150.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6)
[synthetic construct]
gi|123995463|gb|ABM85333.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6)
[synthetic construct]
Length = 622
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + + +Y
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F + +++ + FGD++ R +LR L C +F WYL
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 500
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N + C+D + KP+ +Y CH GGNQ++ + ++R + A CL + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ L CH + N D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578
>gi|297691860|ref|XP_002823292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Pongo abelii]
gi|395744294|ref|XP_002823293.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
3 [Pongo abelii]
Length = 622
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + + +Y
Sbjct: 387 SFRVWQCGGQMEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F + +++ + FGD++ R +LR L C +F WYL
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 500
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N + C+D + KP+ +Y CH GGNQ++ + ++R + A CL + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ L CH + N D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578
>gi|444515344|gb|ELV10843.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Tupaia chinensis]
Length = 614
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 104/296 (35%), Positives = 147/296 (49%), Gaps = 42/296 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFP--PGRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 264 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFSKPVQSGRVHSR-----GNFDWS 318
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++RHK+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 319 LTFGWETLPPHEKQRHKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 378
Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY-DSGFDIW 175
SF+ IP P T G+ I + + DS I+
Sbjct: 379 SFRVWQCGGQLEIIPCSVVGHVFRTKSP---HTFPKGINVIARNQVRLAEVWMDSYKQIF 435
Query: 176 GGENLE---LSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPT--DM 230
NL+ ++ + FGD++ R +LR L C++F W+L N + M + KPT
Sbjct: 436 YRRNLQAAKMAQEKSFGDISERLKLRELLHCRNFSWFLH--NVYPEMFVPD-LKPTFYGA 492
Query: 231 HKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
K +G+ C G N ++ G +I+Y CHG GNQYFEY
Sbjct: 493 IKNLGINQCLDVGEN------------------NHGGKPLIMYACHGLGGNQYFEY 530
>gi|22760242|dbj|BAC11118.1| unnamed protein product [Homo sapiens]
Length = 622
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIPE-------RERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + + +Y
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F + +++ + FGD++ R +LR L C +F WYL
Sbjct: 441 KIFYRRNLQAAKMTQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 500
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N + C+D + KP+ +Y CH GGNQ++ + ++R + A CL + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKRLCLHVSKG 559
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ L CH + N D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578
>gi|149032012|gb|EDL86924.1| rCG50623 [Rattus norvegicus]
Length = 431
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 97/306 (31%), Positives = 148/306 (48%), Gaps = 43/306 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A + + VVSP I I +TF+ P R + + G FDW+L
Sbjct: 81 CECFHGWLEPLLARIAEDKTAVVSPDIVTIDLNTFQFSKPMRRGKAHSR---GNFDWSLT 137
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +PE E++R K+ P+ +PT AGGLFSI KA+FE +GTYD+ +IWGGEN+E+SF
Sbjct: 138 FGWEMLPEHEKQRRKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSF 197
Query: 124 KF-----NWHAIPE-------RERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
+ IP R + H P T +A + + + + Y
Sbjct: 198 RVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMDD---YKKI 251
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
F + +++ + +FGDV+ R LR L C +F WYL +
Sbjct: 252 FYRRNLQAAKMAKENNFGDVSERLRLREQLHCHNFSWYLHNVYPEMFVPDLNPTFSGAIK 311
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDV 270
N + C+D + KP+ +Y CH GGNQ++ + ++R + + CL +G +
Sbjct: 312 NLGTSQCLDVG-ENNRGGKPLIMYVCHNLGGNQYFEYTSQRDLRHNIGKQLCLHASGSTL 370
Query: 271 ILYPCH 276
L C
Sbjct: 371 GLRNCQ 376
>gi|73996388|ref|XP_850161.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Canis lupus familiaris]
Length = 622
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 99/313 (31%), Positives = 146/313 (46%), Gaps = 35/313 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDETVVVSPDIVTIDLNTFEFSKPVQRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W AIP E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWEAIPAHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSGFDIW 175
SF+ IP P P + E + Y F
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPHTFPKGVSVIARNQVRLAEVWMDNYKEIFYRR 446
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWS 217
+ +++ + FGD++ R +LR L C +F W+L + N
Sbjct: 447 NMQAAKMAQEKSFGDISERLKLREQLHCHNFSWFLHNIYPEMFVPDLKPTLYGAIRNLGI 506
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVILYP 274
C+D + KP+ +Y CH GGNQ++ + ++R + + CL + G + L
Sbjct: 507 NQCLDVG-ENNHGGKPLIMYTCHGLGGNQYFEYTTQRDLRHNISKQLCLHASAGTLGLRS 565
Query: 275 CHGSKGNQYFEYD 287
CH + N D
Sbjct: 566 CHFTGKNSQVPKD 578
>gi|348534088|ref|XP_003454535.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Oreochromis niloticus]
Length = 559
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 100/315 (31%), Positives = 143/315 (45%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 215 CECNDHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 267
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + E+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 268 FKWDYMTQEQRRARQGNPIAPIKTPMIAGGLFVMDKEYFEQLGKYDMMMDVWGGENLEIS 327
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 328 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 378
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR E+++ L CK FKWYLE +
Sbjct: 379 EYKNFYYAAVPSARNVPYGNIQSRLEMKKRLNCKPFKWYLENVYPELRVPDHQDIAFGAL 438
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY----AGGDV 270
G C+D+ D VG+Y CH GGNQ W ++K ++ + CL AG +
Sbjct: 439 QQGGNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRTAGSLI 496
Query: 271 ILYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 497 KLQGCRENDSRQKWE 511
>gi|291190646|ref|NP_001167159.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Salmo salar]
gi|223648406|gb|ACN10961.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Salmo salar]
Length = 560
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 99/315 (31%), Positives = 144/315 (45%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 216 CECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 268
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + E+ R R N P+ TP +AGGLF +DK +FE LG YD D+WGGENLE+S
Sbjct: 269 FKWDYMTVEQRRVRQGNPTAPIKTPMIAGGLFVMDKDYFELLGKYDMMMDVWGGENLEIS 328
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 329 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 379
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR E+++ LGC+ FKWYLE +
Sbjct: 380 EFKNFYYAAVPSARNVPYGNIQSRMEMKKRLGCQPFKWYLENVYPELRVPDHQDIAFGAL 439
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY----AGGDV 270
G C+D+ D VG+Y CH GGNQ W ++K ++ + CL AG +
Sbjct: 440 QQGGNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRTAGSQI 497
Query: 271 ILYPCHGSKGNQYFE 285
+ C + Q +E
Sbjct: 498 KMQGCRENDSRQKWE 512
>gi|157820305|ref|NP_001099666.1| polypeptide N-acetylgalactosaminyltransferase 2 [Rattus norvegicus]
gi|149043195|gb|EDL96727.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (predicted), isoform
CRA_b [Rattus norvegicus]
Length = 473
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 102/303 (33%), Positives = 140/303 (46%), Gaps = 58/303 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +RWL+PLL+ +A + + VVSP+I I D F+ + GGFDWNL
Sbjct: 160 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 212
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 213 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 272
Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 182
F+ VW G L I + + + GG
Sbjct: 273 FR--------------------VW--QCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTV- 309
Query: 183 SFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------------SNDWSGMCIDSACK 226
F + SR ELR+ LGCK FKWYL+ + C+D+
Sbjct: 310 -----FARIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGALQQGTNCLDTLGH 364
Query: 227 PTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI-LYPCHGSKGNQ 282
D VG+Y CH GGNQ W ++K ++ + CL D + G +I L C + Q
Sbjct: 365 FAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLIRLQGCRENDSRQ 422
Query: 283 YFE 285
+E
Sbjct: 423 KWE 425
>gi|432852860|ref|XP_004067421.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Oryzias latipes]
Length = 556
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 100/315 (31%), Positives = 143/315 (45%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 212 CECNDHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 264
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + E+ R R N P+ TP +AGGLF +DK +FE LG YD D+WGGENLE+S
Sbjct: 265 FKWDYMTLEQRRARQGNPIAPIKTPMIAGGLFVMDKEYFELLGKYDMMMDVWGGENLEIS 324
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 325 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 375
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR E+++ LGCK FKWYL+ +
Sbjct: 376 EYKNFYYAAVPSARNVPYGNIQSRLEMKKRLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 435
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY----AGGDV 270
G C+D+ D VG+Y CH GGNQ W ++K ++ + CL AG +
Sbjct: 436 QQGGNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRTAGSLI 493
Query: 271 ILYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 494 KLQGCRENDSRQKWE 508
>gi|5834600|emb|CAA69876.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
gi|300470331|dbj|BAJ10977.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 6 [Homo sapiens]
Length = 622
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSIPKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIPE-------RERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + + +Y
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F + +++ + FGD++ R +LR L C +F WYL
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 500
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N + C+D + KP+ +Y CH GGNQ++ + ++R + A CL + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ L CH + N D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578
>gi|148672125|gb|EDL04072.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 [Mus musculus]
Length = 436
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 96/306 (31%), Positives = 147/306 (48%), Gaps = 43/306 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A + + VVSP I I +TF+ P R + + G FDW+L
Sbjct: 86 CECFHGWLEPLLARIAEDKTAVVSPDIVTIDLNTFQFSRPVQRGKAHSR---GNFDWSLT 142
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +PE E++R K+ P+ +PT AGGLFSI KA+FE +GTYD+ +IWGGEN+E+SF
Sbjct: 143 FGWEMLPEHEKQRRKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSF 202
Query: 124 KF-----NWHAIPE-------RERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
+ IP R + H P T +A + + + + Y
Sbjct: 203 RVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMDD---YKKI 256
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
F + ++ + +FGD++ R LR L C +F WYL +
Sbjct: 257 FYRRNLQAAKMVQENNFGDISERLRLREQLRCHNFSWYLHNVYPEMFVPDLNPTFYGAIK 316
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDV 270
N + C+D + KP+ +Y CH GGNQ++ + ++R + + CL +G +
Sbjct: 317 NLGTNQCLDVG-ENNRGGKPLIMYVCHNLGGNQYFEYTSQRDLRHNIGKQLCLHASGSTL 375
Query: 271 ILYPCH 276
L C
Sbjct: 376 GLRSCQ 381
>gi|190358441|ref|NP_001121823.1| polypeptide N-acetylgalactosaminyltransferase 2 [Danio rerio]
Length = 559
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 101/310 (32%), Positives = 148/310 (47%), Gaps = 41/310 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 215 CECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 267
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + E+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 268 FKWDYMTLEQRRARQGNPIAPIKTPMIAGGLFVMDKDYFEELGKYDMMMDVWGGENLEIS 327
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++ D F +
Sbjct: 328 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDD--FKNFYY 385
Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGM------------------ 219
+ + +G++ SR E+++ LGCK FKWYLE N + +
Sbjct: 386 AAVPSARNVPYGNIQSRLEMKKRLGCKPFKWYLE--NVYPELRVPDHQDIAFGALQQGQN 443
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY----AGGDVILYPC 275
C+D+ D VG+Y CH GGNQ W ++K ++ + CL AG + L C
Sbjct: 444 CLDTLGHFAD--GVVGVYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRTAGSQIKLQGC 501
Query: 276 HGSKGNQYFE 285
+ Q +E
Sbjct: 502 RENDTRQKWE 511
>gi|260836667|ref|XP_002613327.1| hypothetical protein BRAFLDRAFT_118726 [Branchiostoma floridae]
gi|229298712|gb|EEN69336.1| hypothetical protein BRAFLDRAFT_118726 [Branchiostoma floridae]
Length = 545
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 107/294 (36%), Positives = 140/294 (47%), Gaps = 38/294 (12%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
W++PLL + N S+VV P+I I D TFE G + SS GGF W L F+W I
Sbjct: 202 WVEPLLHRIWENRSNVVMPIIEAIDDKTFEYH---GGVQSSRYAQRGGFSWELHFDWRVI 258
Query: 70 PERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF--- 125
PE E KR K + P+ +PTMAGGLFSIDK++F +LGTYD D WGGENLELSFK
Sbjct: 259 PEYEIKRWKGDETTPIRSPTMAGGLFSIDKSYFYELGTYDDKMDTWGGENLELSFKIWMC 318
Query: 126 --NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
P + ++ P P+ E DS D++ N +
Sbjct: 319 GGTLEQPPCSKVGHVFRSSAPYSNPSGPKTFIRNTLRVVEVW--LDSYKDLFYALNPHMQ 376
Query: 184 FKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVSNDWSGMCIDSAC 225
+ +GDV+ RK +R L CKSF W+L E+ N C+D+
Sbjct: 377 GE-PYGDVSERKRIRERLQCKSFDWFLENIFPELPIPDKNVQGRGELKNLGGNKCMDTMG 435
Query: 226 KPTDMHKP-VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD---VILYPC 275
+ H P GLY CH GGNQ + + I E CL + + LYPC
Sbjct: 436 E----HAPYTGLYSCHGMGGNQVFSYTWKNVISYQERCLAVSRNKPDRISLYPC 485
>gi|196007338|ref|XP_002113535.1| hypothetical protein TRIADDRAFT_27318 [Trichoplax adhaerens]
gi|190583939|gb|EDV24009.1| hypothetical protein TRIADDRAFT_27318 [Trichoplax adhaerens]
Length = 455
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 110/313 (35%), Positives = 152/313 (48%), Gaps = 49/313 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV +RWL+PLL +A+N + VVSP+I I DTF S GGF WNL
Sbjct: 157 CEVNERWLEPLLSRVAQNETIVVSPIIDVIHMDTFNY-------IGSSADLKGGFGWNLN 209
Query: 64 FNWHAI-PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W ++ E + +R + P+ TP +AGGLFSI K +F K G YD G D+WGGENLE+S
Sbjct: 210 FKWDSMTSEEQSQRAAHPTRPIKTPMIAGGLFSISKNWFIKSGKYDMGMDVWGGENLEIS 269
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
+ + +P RKRH P P GG F K + G+
Sbjct: 270 LRIWMCGGSLEIVPCSRVGHVFRKRH-----PYTFP--GGGGFVFAKNTRRAAEAWMDGY 322
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSND------W 216
+ + + +GD++ R +LR L C+SFKWY+ E ND
Sbjct: 323 AKFYYKREPGARGVPYGDISDRLKLREKLKCRSFKWYMRNVYPELNVPEGVNDKFGELRQ 382
Query: 217 SGMCIDS-ACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD--EACLDY-AGGDVIL 272
G C+DS KP D V +PCH GGNQ W M+K +IR + + CL + G+++
Sbjct: 383 GGKCLDSIGGKPGDR---VSTFPCHGGGGNQAWDMTKD-KIRNNFIQRCLTISSSGEIVA 438
Query: 273 YPCHGSKGNQYFE 285
PC Q ++
Sbjct: 439 DPCEDDNEKQIWQ 451
>gi|403296667|ref|XP_003939220.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Saimiri boliviensis boliviensis]
gi|403296669|ref|XP_003939221.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Saimiri boliviensis boliviensis]
Length = 622
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 101/319 (31%), Positives = 154/319 (48%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + + +Y
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTNVIARNQVRLAEVWMD---SYK 440
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F + +++ + FGD++ R +LR L C +F WYL
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLKLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 500
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N + C+D + KP+ +Y CH GGNQ++ + ++R + A CL + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHASKG 559
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ L+ CH + N D
Sbjct: 560 ALGLWNCHFTGKNSQVPKD 578
>gi|348518337|ref|XP_003446688.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Oreochromis niloticus]
Length = 598
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 103/316 (32%), Positives = 146/316 (46%), Gaps = 48/316 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WL PLL + + S VVSP+I I DTF L GGFDW+L
Sbjct: 248 CEVNKDWLPPLLQRIKEDPSRVVSPVIDIINMDTFAYVAASADLR-------GGFDWSLH 300
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + +R R + +P+ TP +AGGLF ID+A+F LG YD+ DIWGGEN E+SF
Sbjct: 301 FKWEQLSPEQRARRTDPTQPIKTPIIAGGLFVIDRAWFNHLGKYDTAMDIWGGENFEISF 360
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P G + K + F
Sbjct: 361 RVWQCGGSLEILPCSRVGHVFRKKH-----PYVFP--EGNANTYIKNTRRTAEVWMDDFR 413
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------SNDWSGM---- 219
++ + +GD+ SR ELR+ L CKSFKWYL+ S+ SG+
Sbjct: 414 LFYYSARPAARGKSYGDIRSRVELRKKLNCKSFKWYLDNVYPELKVPDDSDSQSGVIKQR 473
Query: 220 --CIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAGGD 269
C++S + L PC G NQ W+ + +IR+ + CL +
Sbjct: 474 QNCLESRKVEGQEMPVLTLAPCTGTEGVPAINQEWVYTHGQQIRQQQHCLSVSTTFPASQ 533
Query: 270 VILYPCHGSKGNQYFE 285
V+L PC+ + G Q ++
Sbjct: 534 VLLLPCNMADGKQRWQ 549
>gi|285026454|ref|NP_001165534.1| polypeptide N-acetylgalactosaminyltransferase 6 [Rattus norvegicus]
Length = 622
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 97/306 (31%), Positives = 148/306 (48%), Gaps = 43/306 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A + + VVSP I I +TF+ P R + + G FDW+L
Sbjct: 272 CECFHGWLEPLLARIAEDKTAVVSPDIVTIDLNTFQFSKPMRRGKAHSR---GNFDWSLT 328
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +PE E++R K+ P+ +PT AGGLFSI KA+FE +GTYD+ +IWGGEN+E+SF
Sbjct: 329 FGWEMLPEHEKQRRKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSF 388
Query: 124 KF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
+ IP R + H P T +A + + + + Y
Sbjct: 389 RVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMDD---YKKI 442
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
F + +++ + +FGDV+ R LR L C +F WYL +
Sbjct: 443 FYRRNLQAAKMAKENNFGDVSERLRLREQLHCHNFSWYLHNVYPEMFVPDLNPTFSGAIK 502
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDV 270
N + C+D + KP+ +Y CH GGNQ++ + ++R + + CL +G +
Sbjct: 503 NLGTSQCLDVG-ENNRGGKPLIMYVCHNLGGNQYFEYTSQRDLRHNIGKQLCLHASGSTL 561
Query: 271 ILYPCH 276
L C
Sbjct: 562 GLRNCQ 567
>gi|240120031|ref|NP_766039.2| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|240120034|ref|NP_001155239.1| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|240120036|ref|NP_001155240.1| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|51315988|sp|Q8C7U7.1|GALT6_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
AltName: Full=Polypeptide GalNAc transferase 6;
Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 6;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6
gi|26339910|dbj|BAC33618.1| unnamed protein product [Mus musculus]
gi|74196150|dbj|BAE32989.1| unnamed protein product [Mus musculus]
gi|74198297|dbj|BAE35316.1| unnamed protein product [Mus musculus]
gi|111601267|gb|AAI19325.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|111601271|gb|AAI19327.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 [Mus musculus]
Length = 622
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 96/306 (31%), Positives = 147/306 (48%), Gaps = 43/306 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A + + VVSP I I +TF+ P R + + G FDW+L
Sbjct: 272 CECFHGWLEPLLARIAEDKTAVVSPDIVTIDLNTFQFSRPVQRGKAHSR---GNFDWSLT 328
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +PE E++R K+ P+ +PT AGGLFSI KA+FE +GTYD+ +IWGGEN+E+SF
Sbjct: 329 FGWEMLPEHEKQRRKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSF 388
Query: 124 KF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
+ IP R + H P T +A + + + + Y
Sbjct: 389 RVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMDD---YKKI 442
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
F + ++ + +FGD++ R LR L C +F WYL +
Sbjct: 443 FYRRNLQAAKMVQENNFGDISERLRLREQLRCHNFSWYLHNVYPEMFVPDLNPTFYGAIK 502
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDV 270
N + C+D + KP+ +Y CH GGNQ++ + ++R + + CL +G +
Sbjct: 503 NLGTNQCLDVG-ENNRGGKPLIMYVCHNLGGNQYFEYTSQRDLRHNIGKQLCLHASGSTL 561
Query: 271 ILYPCH 276
L C
Sbjct: 562 GLRSCQ 567
>gi|167526997|ref|XP_001747831.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773580|gb|EDQ87218.1| predicted protein [Monosiga brevicollis MX1]
Length = 658
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 112/320 (35%), Positives = 148/320 (46%), Gaps = 55/320 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+P++ ++ + VV+P+I +I T E + + +G FDW +
Sbjct: 313 CEANLNWLEPIMALITEDRRTVVTPVIDSIDHHTMEYSKATQDVPA-----VGTFDWTMD 367
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNW A R+ +A +PV +PTMAGGLF+++K +F +LG+YD D WGGENLE+SF
Sbjct: 368 FNWKA---GVRRAGADATDPVDSPTMAGGLFAMEKNYFYELGSYDEKMDGWGGENLEMSF 424
Query: 124 KFNWH------AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--------LGTYD 169
+ W P + P P GG SI F + Y
Sbjct: 425 RI-WQCGGRLVTAPCSHVGHIFRDSHPYTVP---GG--SIHDTFLRNSMRVAEVWMDHYK 478
Query: 170 SGF-DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDWSGMCIDSACKP 227
F D G+N+ D GDV+ RKELR+ L C FKWYL V D +
Sbjct: 479 QYFLDTRPGQNI-----IDAGDVSERKELRQRLQCHDFKWYLNTVLPDLFIPDANHIQHQ 533
Query: 228 TDMHKP---------------VGLYPCHKQGGNQFWMMSKHGEIR-RDEACLDYAGGD-- 269
+H P G+YPCH QG NQ WM S EIR D CLD G
Sbjct: 534 GTLHTPDNICVDKMGQRNGGVAGVYPCHGQGTNQAWMYSITNEIRTHDSLCLDAWGSTLP 593
Query: 270 --VILYPCHGSKGNQYFEYD 287
V L CHG +GNQ + YD
Sbjct: 594 SPVHLGRCHGMRGNQEWRYD 613
>gi|296211689|ref|XP_002752525.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Callithrix jacchus]
Length = 622
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 103/324 (31%), Positives = 153/324 (47%), Gaps = 57/324 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFP--PGRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPIQRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGT-------------- 167
SF+ W + E + ++ G +F GT
Sbjct: 387 SFRV-WQCGGQLE----------IIPCSVVGHVFRTKSPHTFPKGTNVIARNQVRLAEVW 435
Query: 168 YDSGFDIWGGENLE---LSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------- 211
DS I+ NL+ ++ + FGD++ R +LR L C +F WYL
Sbjct: 436 MDSFKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLTP 495
Query: 212 -----VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CL 263
+ N + C+D + KP+ +Y CH GGNQ++ + ++R + A CL
Sbjct: 496 TFYGAIKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCL 554
Query: 264 DYAGGDVILYPCHGSKGNQYFEYD 287
+ G + L CH + N D
Sbjct: 555 HASNGALGLRNCHFTGKNSQVPKD 578
>gi|426256000|ref|XP_004021634.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Ovis
aries]
Length = 674
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 100/300 (33%), Positives = 141/300 (47%), Gaps = 48/300 (16%)
Query: 4 CEVQKRWLQPLLDVLARNS--SHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWN 61
CE +RWL+PLL+ +A S + VVSP+I I D F+ + GGFDWN
Sbjct: 328 CECNERWLEPLLERVAEGSDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWN 380
Query: 62 LQFNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 120
L F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE
Sbjct: 381 LVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLE 440
Query: 121 LSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIW 175
+SF+ + +P P P +G +F+ + ++W
Sbjct: 441 ISFRVWQCGGSLEIVPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVW 491
Query: 176 GGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-----------VSNDWS 217
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 492 MDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLNCKPFKWYLENVYPELRVPDHQDIAFG 551
Query: 218 GMCIDSACKPTDMH---KPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
+ + C T H VG+Y CH GGNQ W ++K ++ + CL D A G +I
Sbjct: 552 ALQQGTNCLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 611
>gi|291225677|ref|XP_002732827.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Saccoglossus kowalevskii]
Length = 633
Score = 147 bits (371), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 104/328 (31%), Positives = 147/328 (44%), Gaps = 65/328 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV +WL+PLL+ + + VV P+I I DTFE + P GGF+W L
Sbjct: 274 CEVSTQWLEPLLERIKFDPHTVVCPIIDIINADTFEYQQSP--------LVRGGFNWGLH 325
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP + K ++ +PV +PTMAGGLF++D+ +F +LG YD G DIWGGENLE+SF
Sbjct: 326 FKWDTIPSSQFKGKEDYIKPVRSPTMAGGLFAMDRKYFHELGEYDDGMDIWGGENLEISF 385
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RKR + P TM+ + + ++ Y +
Sbjct: 386 RIWQCGGTLEIIPCSRVGHVFRKR-RPYGSPNGEDTMSKNSLRVAHVWMDE---YKEHYF 441
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
+N D+GD++SR LR L C+SFKWYLE
Sbjct: 442 ELKKDNR----NKDYGDISSRLALRERLQCQSFKWYLENVYPEIRLPNQKVSYPVDVERR 497
Query: 212 ------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGE-IRR 258
+ + +G+C+ S T V L+ C + W S E + +
Sbjct: 498 QPVKAEIIKRGQIVHLLTGLCLTSENDFTQKGTLVVLHDCSDKDKQMIWSQSTSHEFLLK 557
Query: 259 DEACLDYAGGDVILYP----CHGSKGNQ 282
D CLD D +P CHGS G+Q
Sbjct: 558 DSLCLDTPETDSKAFPRLMKCHGSGGSQ 585
>gi|157118275|ref|XP_001653147.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108875773|gb|EAT39998.1| AAEL008252-PA [Aedes aegypti]
Length = 648
Score = 147 bits (371), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 103/316 (32%), Positives = 151/316 (47%), Gaps = 48/316 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL+PLL+ +ARN + + P + I DT L +L G FDW
Sbjct: 301 CEVG--WLEPLLNQVARNPTAIAIPSMDWIDGDTMTLDPQVSQL------IYGKFDWMGN 352
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +R + + K+ EP +P M GGLF+I++ F LG YD F+ +G E+LELSF
Sbjct: 353 FQWGLRRDRRQPQAKHPMEPFDSPVMPGGLFAINRTLFAHLGWYDEQFETYGAEHLELSF 412
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPT------MAGGLFSIDKAFFEKLGTYDSGF 172
K + +P + P T T + L + + + ++ Y +
Sbjct: 413 KTWMCGGSMQIVPCSRVAHVQKPNHPYITKTSGSEDVIKRNLVRMAEVWMDEYALY--YY 470
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSND 215
+ +GG + +GDFGDV+SRK+LR++L CKSF+WYLE N
Sbjct: 471 ETFGGPDK----RGDFGDVSSRKQLRQHLNCKSFRWYLENVFPEQFDPSRAVGRGEFRNG 526
Query: 216 WSGM--CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILY 273
+G C+D G+ CH +G +Q W ++ GEI R + CLDY G + +
Sbjct: 527 ENGTDRCLDWPLA----RNQCGVTSCHGRGRHQMWYFTREGEITRKDHCLDYDGKTLEMN 582
Query: 274 PCHGSKGNQYFEYDYK 289
CH GNQ +EY K
Sbjct: 583 RCHQMGGNQLWEYAEK 598
>gi|26324460|dbj|BAC25984.1| unnamed protein product [Mus musculus]
Length = 622
Score = 147 bits (370), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 97/307 (31%), Positives = 148/307 (48%), Gaps = 45/307 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A + + VVSP I I +TF+ P R + + G FDW+L
Sbjct: 272 CECFHGWLEPLLARIAEDKTAVVSPDIVTIDLNTFQFSRPVQRGKAHSR---GNFDWSLT 328
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +PE E++R K+ P+ +PT AGGLFSI KA+FE +GTYD+ +IWGGEN+E+SF
Sbjct: 329 FGWEMLPEHEKQRRKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSF 388
Query: 124 KFNWHA------IP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS 170
+ W IP R + H P T +A + + + + Y
Sbjct: 389 RV-WQCGGQLGIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMDD---YKK 441
Query: 171 GFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------V 212
F + ++ + +FGD++ R LR L C +F WYL +
Sbjct: 442 IFYRRNLQAAKMVQENNFGDISERLRLREQLRCHNFSWYLHNVYPEMFVPDLNPTFYGAI 501
Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGD 269
N + C+D + KP+ +Y CH GGNQ++ + ++R + + CL +G
Sbjct: 502 KNLGTNQCLDVG-ENNRGGKPLIMYVCHNLGGNQYFEYTSQRDLRHNIGKQLCLHASGST 560
Query: 270 VILYPCH 276
+ L C
Sbjct: 561 LGLRSCQ 567
>gi|170587206|ref|XP_001898369.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158594195|gb|EDP32781.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 582
Score = 147 bits (370), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 103/310 (33%), Positives = 147/310 (47%), Gaps = 50/310 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + N VV+P+I I DTF+ L GGF+WNL
Sbjct: 236 CECNVNWLEPLLARVKENHRAVVAPVIDIIDKDTFKYVAASADLR-------GGFEWNLI 288
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + + R RH P+ TP +AGGLF I K +FEKLGTYD D+WGGENLELS
Sbjct: 289 FKWEYLLGKLRDDRHAQPTAPIRTPVIAGGLFMIQKDWFEKLGTYDEQMDVWGGENLELS 348
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P G +F + ++W G
Sbjct: 349 FRVWLCGGSLEIIPCSRVGHVFRKQHPYTFPGGNGNVFQKNTR---------RAAEVWLG 399
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE----------------VSN 214
+ L + +FGD+T+R +L++ L CK F WYL+ ++
Sbjct: 400 DYKYLYLRKVPSARYVNFGDITARLDLKKRLRCKDFDWYLKEIYPELAIPSKEQGRYLTF 459
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMS-KHGEIRRDEACL---DYAGGDV 270
+CIDS + T + VG+Y CH GGNQ W+++ K G ++ + L D G +
Sbjct: 460 RQGNVCIDSLGRHTALSS-VGIYRCHGTGGNQEWVLNDKFGVLKSPYSNLCITDDEKGTL 518
Query: 271 ILYPCHGSKG 280
IL+ C+ ++G
Sbjct: 519 ILHYCNMTRG 528
>gi|395834931|ref|XP_003790440.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Otolemur garnettii]
gi|395834933|ref|XP_003790441.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Otolemur garnettii]
Length = 622
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 99/319 (31%), Positives = 152/319 (47%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELR--FPPGRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFSKPIPRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWETLPTHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + + +Y
Sbjct: 387 SFRVWQCGGQMEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F + +++ + FGD++ R +LR L C++F W+L
Sbjct: 441 MIFYRRNQQAAKMAQEKSFGDISERLQLRERLHCRNFSWFLNNVYPEMFVPDLMPTFYGA 500
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N C+D + KP+ +Y CH GGNQ++ + ++R + A CL +
Sbjct: 501 IKNLGINQCLDVG-ENNHGEKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHASVD 559
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ L CH + N D
Sbjct: 560 TLGLRSCHFTGKNSQVPKD 578
>gi|242024227|ref|XP_002432530.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212517982|gb|EEB19792.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 603
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 102/309 (33%), Positives = 143/309 (46%), Gaps = 46/309 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E WL+PLL+ + + + VV P+I I DTF+ L GGFDWNL
Sbjct: 259 VECNVNWLEPLLERVVEDKTRVVCPIIDVISMDTFQYIGASADLR-------GGFDWNLV 311
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + +R +R ++ + TP +AGGLF ID+ +F+ LG YD D+WGGENLE+S
Sbjct: 312 FKWEYLTLDQRLRRQQDPTRAIKTPMIAGGLFVIDRLYFDTLGKYDMQMDVWGGENLEIS 371
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P +G +F+ + ++ D
Sbjct: 372 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKK 426
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPT---- 228
+ L S FG++ R EL+R L CKSFKWYLE N + + I + P
Sbjct: 427 YYYAAVPLAKSIP--FGNIDDRLELKRKLHCKSFKWYLE--NVYPELSIPHSTSPAFGSI 482
Query: 229 ------------DMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVILY 273
+ + VGLY CH GGNQ W M I+ + CL +Y G ++L
Sbjct: 483 RQRQLCLDTLGHSIEQTVGLYVCHDTGGNQEWGMEDDSYIKHHDLCLTIPNYVPGALVLM 542
Query: 274 PCHGSKGNQ 282
NQ
Sbjct: 543 RLCEDADNQ 551
>gi|47217176|emb|CAG11012.1| unnamed protein product [Tetraodon nigroviridis]
Length = 598
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 142/312 (45%), Gaps = 51/312 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A + + VVSP+I I D F+ + GGFDWNL
Sbjct: 253 CECNAHWLEPLLERVAEDKTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 305
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ++ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 306 FKWDYMTLDQRRARQGNPIAPIKTPMIAGGLFVMDKEYFEQLGKYDMMMDVWGGENLEIS 365
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 366 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 416
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR EL++ +GCK FKWYLE +
Sbjct: 417 EYKNFYYAAVPSARNVPYGNIQSRLELKKRVGCKPFKWYLENVYPELRVPDHQDIAFGAL 476
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY----AGGDV 270
G C+D+ D VG+Y CH GGNQ W ++K ++ + CL AG +
Sbjct: 477 QQGGNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRTAGSLI 534
Query: 271 ILYPCHGSKGNQ 282
L C + Q
Sbjct: 535 KLQGCRENDSRQ 546
>gi|47085989|ref|NP_998361.1| polypeptide N-acetylgalactosaminyltransferase 6 [Danio rerio]
gi|45501175|gb|AAH67340.1| Zgc:77836 [Danio rerio]
Length = 619
Score = 146 bits (369), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 95/290 (32%), Positives = 137/290 (47%), Gaps = 31/290 (10%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VVSP I I +TF+ P + ++ G FDW+L
Sbjct: 267 CECFHGWLEPLLARIVEEPTAVVSPEITTIDLNTFQFHKP---VATARAHNRGNFDWSLT 323
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP+ E + K+ PV TPT AGGLFSI KA+FEK+GTYD +IWGGEN+E+SF
Sbjct: 324 FGWEGIPDYENAKRKDETYPVKTPTFAGGLFSISKAYFEKIGTYDDKMEIWGGENVEMSF 383
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSGFDIWGG 177
+ IP P P + E + Y F
Sbjct: 384 RVWQCGGQLEIIPCSVVGHVFRTKSPHTFPKGTEVITRNQVRLAEVWMDDYKLIFYRRSQ 443
Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGM 219
+++ + FGD++ R +LR +L CK+F WYL + N +
Sbjct: 444 SAAKMAKEKGFGDISDRLKLREDLQCKNFSWYLSNVYPEAFVPDLSPVKFGALKNRGAQQ 503
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYA 266
C+D + + KPV +Y CH GGNQ++ + H E+R + + CL +
Sbjct: 504 CLDVG-ESNNGGKPVIMYTCHNMGGNQYFEYTSHKELRHNIGKQLCLQAS 552
>gi|301614636|ref|XP_002936794.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Xenopus (Silurana) tropicalis]
Length = 625
Score = 146 bits (369), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 97/304 (31%), Positives = 146/304 (48%), Gaps = 39/304 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A N + VVSP I I +TF+ P + + G FDW L
Sbjct: 272 CECYYGWLEPLLASIAENYTSVVSPDITGIDLNTFQFSNPSPYGNNHNR---GNFDWTLS 328
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P E+ R K+ P+ TPT AGGLFSI KA+FE +G+YD +IWGGEN+E+SF
Sbjct: 329 FGWESLPSSEKTRRKDETYPIKTPTFAGGLFSISKAYFEHIGSYDEQMEIWGGENIEMSF 388
Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W + E R K+ P T + + + + + L F
Sbjct: 389 RV-WQCGGQLEILPCSVVGHVFRSKSPHTFPKGTQVIVRNQVRLAEVWMDDLKEI---FY 444
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVSND 215
E + ++GD++ R +LR L CK+F WYL ++ N
Sbjct: 445 RRNREAANIVKSKEYGDLSKRLDLRHRLQCKNFTWYLNNIYPEMYVPERHPLIHGDLKNV 504
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
+C+D + KP+ +Y CH GGNQ++ + EIR + E CL + +++
Sbjct: 505 GRDLCLDVGGE-NHGDKPLIMYSCHGLGGNQYFEYTSKHEIRHNIQKELCLRPSHSSLVI 563
Query: 273 YPCH 276
PC+
Sbjct: 564 KPCN 567
>gi|291389167|ref|XP_002711235.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Oryctolagus cuniculus]
Length = 622
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 99/308 (32%), Positives = 148/308 (48%), Gaps = 47/308 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFTGWLEPLLARIAEDETVVVSPDIVTIDLNTFEFSKPVQRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W A+P E +R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWEAVPAHENRRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + + Y
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTNVIARNQVRLAEVWMD---NYK 440
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F + +++ + FGD++ R +LR L C +F W+L
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWFLHNVYPEMFVPDLNPTFYGA 500
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N G C+D + KP+ +Y CH GGNQ++ + ++R + A CL +
Sbjct: 501 IKNLGLGQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQKDLRHNIAKQLCLHASAS 559
Query: 269 DVILYPCH 276
+ L CH
Sbjct: 560 TLGLRGCH 567
>gi|410912128|ref|XP_003969542.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Takifugu rubripes]
Length = 558
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 99/315 (31%), Positives = 143/315 (45%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 214 CECNDHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 266
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ++ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 267 FKWDYMTLDQRRARQGNPIAPIKTPMIAGGLFVMDKEYFEQLGKYDMMMDVWGGENLEIS 326
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 327 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 377
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR EL++ +GCK FKWYLE +
Sbjct: 378 EYKNFYYAAVPSARNVPYGNIQSRLELKKRVGCKPFKWYLENVYPELRVPDHQDIAFGAL 437
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY----AGGDV 270
G C+D+ D VG+Y CH GGNQ W ++K ++ + CL A +
Sbjct: 438 QQGGNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRTASSLI 495
Query: 271 ILYPCHGSKGNQYFE 285
L C + Q +E
Sbjct: 496 KLQGCRENDSRQKWE 510
>gi|5834643|emb|CAB55352.1| N-acetylgalactosaminyltransferase T-6 [Mus musculus]
Length = 623
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 93/300 (31%), Positives = 146/300 (48%), Gaps = 43/300 (14%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL+PLL +A + + VVSP I I +TF+ P R + + G FDW+L F W +
Sbjct: 279 WLEPLLARIAEDKTPVVSPDIVTIDLNTFQFSRPVQRGKAHSR---GNFDWSLTFGWEML 335
Query: 70 PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF---- 125
P+ E++R K+ P+ +PT AGGLFSI KA+FE +GTYD+ +IWGGEN+E+SF+
Sbjct: 336 PQHEKQRRKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCG 395
Query: 126 -NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
IP R + H P T +A + + + + Y F
Sbjct: 396 GQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMDD---YKKIFYRRNL 449
Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGM 219
+ ++ + +FGD++ R +LR L C +F WYL + N +
Sbjct: 450 QAAKMVQENNFGDISERLQLREQLRCHNFSWYLHNVYPEMFVPDLNPTFYGAIKNLGTNQ 509
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVILYPCH 276
C+D + KP+ +Y CH GGNQ++ + ++R + + CL +G + L C
Sbjct: 510 CLDVG-ENNRGGKPLIMYVCHNLGGNQYFEYTSQRDLRHNIGKQLCLHASGSTLSLRSCQ 568
>gi|391348383|ref|XP_003748427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Metaseiulus occidentalis]
Length = 648
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 102/318 (32%), Positives = 148/318 (46%), Gaps = 58/318 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + R+ VV P+I I D T + G +F IGGF+W +
Sbjct: 290 CETTPGWLEPLLEPIRRDRRAVVCPVIDIIDDKTLQYVAAEGD-----RFQIGGFNWKGE 344
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+WH IP RK + AEP+ +PTMAGGLF+I++ +F + G+YD D WGGENLE+SF
Sbjct: 345 FSWHNIPAAWRKNRTSIAEPMRSPTMAGGLFAINREYFWESGSYDEEMDGWGGENLEMSF 404
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS-------GFDIWG 176
+ W V P G D ++ D+ ++W
Sbjct: 405 RI-WQC-----------GGHIVIAPCSHVGHIFRDYHPYKFPKGKDTNAINTKRAVEVWM 452
Query: 177 GE--------NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE----------------- 211
E EL+ K GD+++RK R CKSFKWYL+
Sbjct: 453 DEFKKYFYQTRPELT-KMKVGDISARKAFREKNRCKSFKWYLDNVYPHKYLMEEHSQGFG 511
Query: 212 -VSNDWSGMCIDSACKPTDMHKPVGLYPCH---KQGGNQFWMMSKHGEIRRDEACLDYAG 267
+ N + MC+D+ K D +G++ CH ++ NQ +S+ GE+RRD+ C +
Sbjct: 512 IIRNPHTNMCLDTYGKSEDEISDLGVFECHPIPEEATNQLLSLSRKGELRRDDVCAKVSW 571
Query: 268 GDVILYPCHGSKGNQYFE 285
D P +KG E
Sbjct: 572 VD----PFRRTKGKIVME 585
>gi|348521382|ref|XP_003448205.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Oreochromis niloticus]
Length = 620
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 96/303 (31%), Positives = 147/303 (48%), Gaps = 53/303 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VVSP I++I ++F+ P + ++ + G FDW+L
Sbjct: 268 CECFHGWLEPLLARIVEEPTAVVSPEISSIDLNSFQFHKP---VATNRAYNRGNFDWSLT 324
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W AIPE ++ K+ PV TPT AGGLF+I K +FE +GTYD +IWGGEN+E+SF
Sbjct: 325 FGWEAIPEDAKRLRKDETYPVKTPTFAGGLFAISKKYFEHIGTYDDQMEIWGGENVEMSF 384
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG------FDIWGG 177
+ W + E + ++ G +F GT ++W
Sbjct: 385 RV-WQCGGQLE----------IIPCSVVGHVFRTKSPHTFPKGTEVITRNQVRLAEVWMD 433
Query: 178 ENLELSFKGD-----------FGDVTSRKELRRNLGCKSFKWYL---------------- 210
+ ++ ++ + FGD+++R LR L CK+F WYL
Sbjct: 434 DYKKIYYRRNKNAAIMASEHRFGDISARLNLRERLHCKNFSWYLNTVYPEIFIPDLNPEK 493
Query: 211 --EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDY 265
+ N S MC+D A + KP+ +Y CH GGNQ++ + H E+R + + CL
Sbjct: 494 SGSIKNLGSNMCLD-AGENNQGGKPLIMYHCHNMGGNQYFEYTSHKELRHNIGKQLCLHA 552
Query: 266 AGG 268
A G
Sbjct: 553 AVG 555
>gi|390347269|ref|XP_781402.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Strongylocentrotus purpuratus]
Length = 749
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 92/277 (33%), Positives = 134/277 (48%), Gaps = 36/277 (12%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL+PLL + + ++VV P I I +FE S IG F+W ++F W+ I
Sbjct: 411 WLEPLLQRIHDDPTNVVCPAIDAIDATSFEY-------AGSGATIIGAFNWEMKFTWNGI 463
Query: 70 PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF---- 125
PE E +R + + P+ +P MAGGLFSIDK FF ++GTYD GFDIWG ENLELSFK
Sbjct: 464 PEYEARRRDDESWPIRSPAMAGGLFSIDKDFFYRIGTYDPGFDIWGAENLELSFKIWMCG 523
Query: 126 -NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 184
+ IP +P P F + + + DI+ +L
Sbjct: 524 GSLEIIPCSRVAHIFRKQQPYKFPDGNVKTFMRNTMRLVAVWVDEPYRDIFYSLKPQL-M 582
Query: 185 KGDFGDVTSRKELRRNLGCKSFKWYL------------------EVSNDWSGMCIDSACK 226
++GDV+ R +LR L C F+WYL +V N + MC+DS K
Sbjct: 583 GQEYGDVSDRIKLREELKCHDFQWYLDNVYPALKVPDTKVRARGDVRNAATSMCLDSMGK 642
Query: 227 PTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL 263
+G++PCH +G NQ + ++ +++ CL
Sbjct: 643 GV-----LGMFPCHGEGNNQAFTLTWDDQLKHKNKCL 674
>gi|390333619|ref|XP_785951.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 756
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 102/315 (32%), Positives = 151/315 (47%), Gaps = 50/315 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A + S+VV+P+I I + L + T + IG FDW+L
Sbjct: 404 CEASHGWLEPLLARIAEDRSNVVTPVIDVI--NAQNLAYEADNQTPA----IGVFDWSLT 457
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W +I R+ K + P+ +PTMAGGLF+ID+++F + G YDSGF+IWG ENLE+S
Sbjct: 458 FRWQSIQRRDLPLLKHDPTHPIPSPTMAGGLFAIDRSYFIETGMYDSGFEIWGAENLEIS 517
Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS-----GFDIWGG 177
FK W E + + G +F + L + S ++W
Sbjct: 518 FK-TWMCGGRIE----------ILPCSHVGHIFRKHAPYSNTLTDFISYNNKRLAEVWLD 566
Query: 178 ENLEL-------SFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVS 213
E + K + G+ T R ELR LGC+SF+WYL EV
Sbjct: 567 GYKEFFYFMSPSALKVNAGNYTDRVELRDRLGCRSFQWYLENVFPEGGWPGRNKIYGEVR 626
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY--AGGDVI 271
+ + C+D+ + T + +P+ + C NQ WM ++ EI+ CLDY +
Sbjct: 627 HTATNWCLDTGGRTTPITEPMVAHRC-DNNVNQIWMYTEEQEIKHSSLCLDYDVTTMTLT 685
Query: 272 LYPCHGSKGNQYFEY 286
L CH GNQ ++Y
Sbjct: 686 LMGCHQMGGNQLWDY 700
>gi|341894191|gb|EGT50126.1| CBN-GLY-4 protein [Caenorhabditis brenneri]
Length = 584
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 103/299 (34%), Positives = 145/299 (48%), Gaps = 60/299 (20%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
E ++WL+PLL +A N VV+P+I I D F L GGFDW L F
Sbjct: 240 ECNQKWLEPLLARIAENPKAVVAPIIDVINVDNFNYVGASADLR-------GGFDWTLVF 292
Query: 65 NWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
W + E+ R +RHKN P+ +PTMAGGLF+I K +FE+LGTYD ++WGGENLE+SF
Sbjct: 293 RWEFMNEQLRTERHKNPTAPIKSPTMAGGLFAISKEWFEELGTYDLDMEVWGGENLEMSF 352
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P +G +F + +
Sbjct: 353 RVWQCGGSLEILPCSRVGHVFRKKH-----PYTFPGGSGNVFQKNTR---------RAAE 398
Query: 174 IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWY-------LEVSNDWSGM 219
+W E + K +FGD+T R +R L CKSFKWY LEV +G
Sbjct: 399 VWMDEYKAIYLKNVPSARFVNFGDITDRLAIRDRLQCKSFKWYLDTVYPQLEVPKKAAGK 458
Query: 220 ---------CIDSACKPTDMHKPVGLYPCHKQGGNQFWM---MSKHGEIRRDEACLDYA 266
C+DS + + + GL+ CH GGNQ W+ ++K + + CLD+A
Sbjct: 459 SVQVKMGHHCLDSMARKEN--EAPGLFACHGTGGNQEWVFDHLTKTFKNAITQLCLDFA 515
>gi|291231066|ref|XP_002735481.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Saccoglossus kowalevskii]
Length = 2434
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 103/320 (32%), Positives = 147/320 (45%), Gaps = 61/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + W++PL+ + N+ VVSP+I I D F+ L GGFDWNL
Sbjct: 2086 CECNQNWIEPLITKIQENNKAVVSPIIDVINMDNFQYVAASADLK-------GGFDWNLV 2138
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + P KR + + TP +AGGLF+I K++FE+LG YD D+WGGENLE+S
Sbjct: 2139 FKWDYMTPAERNKRKSDPIAAIRTPMIAGGLFAISKSWFEELGKYDMMMDVWGGENLEIS 2198
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ IP RK+H P P +G +F+ +
Sbjct: 2199 FRVWQCGGTLEIIPCSRVGHVFRKQH-----PYTFPGGSGNVFAKNTR---------RAA 2244
Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV------------- 212
++W E + + FG++ SR +LR+ L CKSF WYLE
Sbjct: 2245 EVWMDEYKKYYYSAVPSSKNIAFGNIQSRLDLRKKLQCKSFGWYLENVYPELRIPDKKDI 2304
Query: 213 ---SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY---- 265
+ +C+D+ TD +GLY CH GGNQ + ++K IR + CL
Sbjct: 2305 AFGALQQGHLCMDTLGHFTD--GTLGLYECHNTGGNQEFALTKDKAIRHQDLCLTVMDHR 2362
Query: 266 AGGDVILYPCHGSKGNQYFE 285
G + L+ C S NQ +E
Sbjct: 2363 PSGVIKLHGCSESNLNQKWE 2382
>gi|443721252|gb|ELU10645.1| hypothetical protein CAPTEDRAFT_228331 [Capitella teleta]
Length = 512
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 95/294 (32%), Positives = 143/294 (48%), Gaps = 57/294 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A + + VVSP+I I D F+ L GGF+WNL
Sbjct: 168 CECNVHWLEPLLERVAEDPTRVVSPIIDVINMDNFQYVGASSNLK-------GGFNWNLV 220
Query: 64 FNWHAI-PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W ++ PE +R N P+ TP +AGGLF IDK FE++G YD D+WGGENLE+S
Sbjct: 221 FKWDSLTPEEVTQRRGNPTAPIKTPMIAGGLFVIDKERFEEIGKYDMMMDVWGGENLEIS 280
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RK+H P P +G +F+ +
Sbjct: 281 FRVWQCHGSLEIIPCSRVGHVFRKQH-----PYTFPGGSGNVFARNTR---------RAA 326
Query: 173 DIWGGE-------NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS- 217
++W E + + +FGD+ SR ELR L C+ F W+L+ V ++
Sbjct: 327 EVWMDEYKSYYYAEVPSAKSVNFGDIRSRLELREKLKCRPFSWFLQNVYPSLIVPSEQDV 386
Query: 218 --------GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL 263
MC+D+ + VG++ CH GGNQ W+++K+ +I+ + C+
Sbjct: 387 QFGYIQQGSMCVDTLSNA--LGGKVGMFQCHNTGGNQEWVLTKNQKIKHLDLCI 438
>gi|194384516|dbj|BAG59418.1| unnamed protein product [Homo sapiens]
Length = 603
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 100/319 (31%), Positives = 152/319 (47%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE L+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 253 CECFHGRLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 307
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 308 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 367
Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
SF+ IP R + H P T +A + + + + +Y
Sbjct: 368 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 421
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
F + +++ + FGD++ R +LR L C +F WYL
Sbjct: 422 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 481
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
+ N + C+D + KP+ +Y CH GGNQ++ + ++R + A CL + G
Sbjct: 482 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 540
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ L CH + N D
Sbjct: 541 ALGLGSCHFTGKNSQVPKD 559
>gi|390364218|ref|XP_793815.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like,
partial [Strongylocentrotus purpuratus]
Length = 531
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 100/305 (32%), Positives = 146/305 (47%), Gaps = 50/305 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV WL+PLL LA + + VV P++ I DTF P L GGF+W +
Sbjct: 192 VEVMIGWLEPLLARLASDRTIVVMPVVDEINKDTFNYNVVPEPLQR------GGFNWRFE 245
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ W IP +++ K A P+ +P M GGL ++D++FF +LG +D G ++WGGENLE S
Sbjct: 246 YRWKPIPNYDKRPSKVA--PIKSPAMPGGLLTMDRSFFLELGGFDLGMEVWGGENLETSL 303
Query: 124 KF-----NWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
K + IP R +++ + P G +D + ++W
Sbjct: 304 KIWMCGGSIEIIPCSRVGHVYRDTS-----PYSFLGQNPLDIVEHNAMRV----VEVWTD 354
Query: 178 EN-------LELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EV 212
E+ L + DFGDV+ RK+LR +L C F WYL +
Sbjct: 355 EHKHHFYDRLPMLKNRDFGDVSKRKKLRESLNCYDFNWYLTNVYPELYVPSSSSVLRQTI 414
Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY--AGGDV 270
+N S +CIDS + K + + CH GGN+++ +K GEIR DE CL+ G V
Sbjct: 415 NNKGSKLCIDSNDQNGQAGKNLIGWHCHNLGGNEYFEETKAGEIRNDELCLEANSVGTHV 474
Query: 271 ILYPC 275
IL PC
Sbjct: 475 ILNPC 479
>gi|344266859|ref|XP_003405496.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Loxodonta africana]
Length = 622
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 104/318 (32%), Positives = 153/318 (48%), Gaps = 45/318 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDETVVVSPDIITIDLNTFEFSKPVQRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWETVPLHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY-DSGFDIW 175
SF+ IP P T G+ I + + D +I+
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSP---HTFPKGINVIARNQVRLAEVWMDDYKEIF 443
Query: 176 GGENLE---LSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPT---- 228
NL+ ++ + FGD++ R +L+ L C +F W+L N + M + KPT
Sbjct: 444 YRRNLQAAKMAEEKSFGDISERLKLKEQLHCHNFSWFLH--NVYPEMFVPD-LKPTFYGA 500
Query: 229 --------------DMH--KPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGGD 269
+ H KP+ +YPCH GGNQ++ + ++R + A CL G
Sbjct: 501 IKSLGTDHCLDVGENNHGGKPLIMYPCHSLGGNQYFEYTTQRDLRHNIAKQLCLHANAGT 560
Query: 270 VILYPCHGSKGNQYFEYD 287
+ L CH + N D
Sbjct: 561 LGLRSCHFTGKNSQVPKD 578
>gi|311246104|ref|XP_003122084.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Sus
scrofa]
Length = 541
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 89/218 (40%), Positives = 116/218 (53%), Gaps = 26/218 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + S VV P+I I +TFE + +S + IGGFDW L
Sbjct: 229 CECHEGWLEPLLQRIHEKESAVVCPVIDVIDWNTFEY------MGNSREPQIGGFDWRLV 282
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH +PERER R K+ + + +PTMAGGLF++ K +FE LG YD+G ++WGGENLE SF
Sbjct: 283 FTWHVVPERERLRMKSPIDVIRSPTMAGGLFAVSKKYFEYLGAYDTGMEVWGGENLEFSF 342
Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S KA L ++W E E
Sbjct: 343 RI-WQCGGTLEIHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDEFKE 391
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLEV 212
L + + FGDVT RK+LR L CK FKW+LE
Sbjct: 392 LYYHRNPHARLEPFGDVTERKQLRAKLQCKDFKWFLET 429
>gi|260841393|ref|XP_002613900.1| hypothetical protein BRAFLDRAFT_208719 [Branchiostoma floridae]
gi|229299290|gb|EEN69909.1| hypothetical protein BRAFLDRAFT_208719 [Branchiostoma floridae]
Length = 442
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 91/276 (32%), Positives = 139/276 (50%), Gaps = 45/276 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWN-L 62
CE WL+P L+ +ARN + V ++ NI DTF+ F + T +GG ++ L
Sbjct: 179 CECMHGWLEPQLETIARNYTTVPISVLDNILHDTFQYTFMDLQSTQ-----MGGINFKEL 233
Query: 63 QFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W IPE ER+R K+ +P+ +PTMAGG+FSI+K +FE LG YD+G ++WGGEN+E+S
Sbjct: 234 TFIWEPIPEHERRRQKSPVDPIRSPTMAGGIFSINKKYFEYLGAYDTGMEVWGGENIEMS 293
Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 182
F+ W + V+ PT +S A+ + + ++W + E+
Sbjct: 294 FRI-WQCGGTIVVLPCSHVGH-VFRPTSP---YSTGDAWKKLVHNNRRMAEVWMDDYKEI 348
Query: 183 SF-------KGDFGDVTSRKELRRNLGCKSFKWYL------------------------E 211
+ K D GDVT RK LR+ L C+ F WYL
Sbjct: 349 YYRKHPEYRKYDMGDVTQRKLLRKGLHCRDFSWYLSHVFPTLYVPDIRPIAHGQVSHVTS 408
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQF 247
+S++ +G+C+D +P G++PCH +GG Q
Sbjct: 409 ISSEQTGLCLDVI---KAGKEPAGVFPCHGKGGTQV 441
>gi|195425502|ref|XP_002061040.1| GK10658 [Drosophila willistoni]
gi|194157125|gb|EDW72026.1| GK10658 [Drosophila willistoni]
Length = 489
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 91/216 (42%), Positives = 115/216 (53%), Gaps = 20/216 (9%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +ARN + V SP I I TF+ + +G FDWNL+
Sbjct: 268 CECTEGWLEPLLDRIARNRNTVASPTIDMIDPKTFQYNY------DGANDVLGVFDWNLE 321
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP RE KR + AEP+ TPT+AGGLF+ID FF +GTYD GF+IWGG+NLELSF
Sbjct: 322 FYWIPIPLRELKRRNHFAEPIQTPTIAGGLFAIDLEFFRSVGTYDPGFNIWGGDNLELSF 381
Query: 124 KFNW------HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIW 175
K W IP P P+ + + A + L Y +
Sbjct: 382 K-TWMCGGILEIIPCSHVGHIFRDDSPYEWPSSRAMMVESNLARLAEVWLDDYAKYYYER 440
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
G N L+ DV+ RK+LR LGCKSFKWYL+
Sbjct: 441 SGGNKSLA-----TDVSDRKKLREKLGCKSFKWYLD 471
>gi|157114758|ref|XP_001652407.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108883560|gb|EAT47785.1| AAEL001146-PA [Aedes aegypti]
Length = 552
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 105/308 (34%), Positives = 148/308 (48%), Gaps = 39/308 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+P LD +ARN + V P I + D L F + + + G DW LQ
Sbjct: 211 CECTTGWLEPQLDRVARNPTTVAIPTIDWV--DEHNLAF----IANRSHIYYGACDWGLQ 264
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +R+ K +N EP TP MAGGLFSI+K FF +G YD G I+GGEN+ELS
Sbjct: 265 FGWRGRWDRKVK-PENKLEPFPTPIMAGGLFSINKTFFAHIGWYDEGLGIYGGENVELSL 323
Query: 124 KF-----NWHAIPERERKRHKNAAEP----VWTPTMAGGLFSIDKAFFEKLGTYDSGFDI 174
K IP + A P V T + G + + + ++ +D+
Sbjct: 324 KAWMCGGRLETIPCSRVGHIQKAGHPYLDGVKTDWVRVGSVRVAEVWMDQYA--QVVYDM 381
Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVS----NDWSGMCIDSACKPTDM 230
+GG F+G+FGDV+ RK+LR +L CKSFKWYLE + D + K T++
Sbjct: 382 FGGP----EFRGNFGDVSDRKKLRESLNCKSFKWYLENAFPELEDPVSYGVGHG-KFTNL 436
Query: 231 HKPVGLYPCHKQGGNQF------------WMMSKHGEIRRDEACLDYAGGDVILYPCHGS 278
P +++ G F W+ + GEI CLDY G + ++ CH
Sbjct: 437 GVGKNFCPRYRKAGYTFRMEPCTDDDYQHWVHNMLGEISTSNVCLDYDGITLYMFECHKG 496
Query: 279 KGNQYFEY 286
+GNQ + Y
Sbjct: 497 QGNQKWRY 504
>gi|301608339|ref|XP_002933739.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Xenopus (Silurana) tropicalis]
Length = 622
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 102/302 (33%), Positives = 144/302 (47%), Gaps = 33/302 (10%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A + + VVSP I I ++FE P + + G FDW+L
Sbjct: 270 CECFHGWLEPLLSRIAEDHTAVVSPDIPIIDLNSFEFHKPVQYGKTHNR---GNFDWSLT 326
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W AIP E++R K+ P+ TPT AGGLFSI KA+FE +G+YD +IWGGENLE+SF
Sbjct: 327 FGWEAIPAAEKERRKDETYPIKTPTFAGGLFSISKAYFEHIGSYDEEMEIWGGENLEMSF 386
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSGFDIWGG 177
+ IP P P +F E + Y +
Sbjct: 387 RVWQCGGQLEIIPCSVVGHVFRTKSPHTFPKGTQVIFRNLVRLAEVWMDDYKLLYYQRNE 446
Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGM 219
+ ++ + FGD++ R +L+ +L CK+F WYLE V N+ S
Sbjct: 447 QAAKMVREKSFGDISKRLKLKADLQCKNFTWYLENIYPEMFVPDRDPTYYGKVKNEGSQN 506
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CL--DYAGGDVILYP 274
C+D+ K KP+ + C+ GG Q++ S H E+R + A CL Y G V L
Sbjct: 507 CLDAGEK-NHGGKPLIMNLCNGMGGTQYFEYSTHKELRHNIAKQLCLRSKYVPGPVELGE 565
Query: 275 CH 276
C
Sbjct: 566 CQ 567
>gi|351697576|gb|EHB00495.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Heterocephalus
glaber]
Length = 622
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 99/298 (33%), Positives = 145/298 (48%), Gaps = 46/298 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELR--FPPGRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + VVSP I I DTFE P GR+ S G FDW+
Sbjct: 272 CECFYGWLEPLLARIAEDQVAVVSPDIVTINLDTFEFSKPIPGGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P +E++R ++ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWETLPAQEKQRREDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKFNWHAIPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS 170
SF+ W E R R + P T ++ + + + + Y
Sbjct: 387 SFRV-WQCGGRLEIAPCSVVGHVFRSRSPHTF-PKGTSVISRNQVRLAEVWMDD---YKK 441
Query: 171 GFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPT-- 228
F + +++ + FGD++ R +LR L C++F W+L N + M + PT
Sbjct: 442 IFYRRNLQAAKIAQEKSFGDISERLQLREQLHCRNFSWFLH--NIYPEMFVPD-LNPTFY 498
Query: 229 DMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
K +G+ C G N + G +I+Y CHG GNQYFEY
Sbjct: 499 GAIKNLGINQCLDVGEN------------------NRGGKPLIMYSCHGLGGNQYFEY 538
>gi|113677422|ref|NP_001038460.1| polypeptide N-acetylgalactosaminyltransferase 14 [Danio rerio]
Length = 554
Score = 144 bits (362), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 98/316 (31%), Positives = 147/316 (46%), Gaps = 48/316 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WL PLL + + + V SP+I I DTF ++ GGFDW+L
Sbjct: 204 CEVNKDWLPPLLQRVKEDPTSVASPVIDIINMDTFAY-------VAASSDLRGGFDWSLH 256
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + +R + + EP+ TP +AGGLF ID+++F +LG YD+ DIWGGEN E+SF
Sbjct: 257 FKWEQLSAEKRAKRADPTEPIKTPIIAGGLFVIDRSWFNRLGKYDTAMDIWGGENFEISF 316
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + IP RK+H P P G + K + F
Sbjct: 317 RVWMCGGSLEIIPCSRVGHVFRKKH-----PYIFP--EGNANTYIKNTRRTAEVWMDEFK 369
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------SNDWSGM---- 219
++ + +GD+ R+ELR++L CKSFKWYL+ S+ SG+
Sbjct: 370 LFYYSARPAARGKSYGDIHGRQELRKSLNCKSFKWYLDNVYPELKVPDDSDAKSGVIRQR 429
Query: 220 --CIDSACKPTDMHKPVGLYPC----HKQGGNQFWMMSKHGEIRRDEACLD----YAGGD 269
C++S + L PC NQ W+ + +IR+ + CL +
Sbjct: 430 QNCLESRVVEGQDLPVLTLAPCIITKETPAANQEWIYTHGQQIRQQQYCLSVSTTFPASQ 489
Query: 270 VILYPCHGSKGNQYFE 285
++L PC+ S G Q ++
Sbjct: 490 ILLMPCNISDGKQRWQ 505
>gi|260836359|ref|XP_002613173.1| hypothetical protein BRAFLDRAFT_114107 [Branchiostoma floridae]
gi|229298558|gb|EEN69182.1| hypothetical protein BRAFLDRAFT_114107 [Branchiostoma floridae]
Length = 539
Score = 144 bits (362), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 143/315 (45%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+P+L+ + + + VV P+I I D F+ L GGFDWNL
Sbjct: 194 CECNQHWLEPMLERVMEDRTRVVCPIIDVINMDNFQYVGASADLR-------GGFDWNLV 246
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + +R R + P+ TP +AGGLF IDK++F++LG YD D+WGGENLE+S
Sbjct: 247 FKWDYMTANQRNARRSDPIAPIRTPMIAGGLFMIDKSWFDELGKYDMMMDVWGGENLEIS 306
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 307 FRVWQCQGSLEIIPCSRVGHVFRKQHPYTFPGGSGNVFTRNTR---------RAAEVWMD 357
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E E + FG++ SR ELR+ L CK F WYLE +
Sbjct: 358 EYKEYYYAAVPSARNVPFGNIQSRLELRKKLSCKPFAWYLEHVYPELRIPDKKDVAFGAL 417
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG---GDVI 271
+C+D+ D VG+Y CH GGNQ W ++K IR + CL G+++
Sbjct: 418 QQGTLCMDTLGHFAD--GTVGVYECHGSGGNQEWALTKDKSIRHSDLCLTVVNQNPGELL 475
Query: 272 -LYPCHGSKGNQYFE 285
L+ C Q +E
Sbjct: 476 KLHGCQEKNTKQKWE 490
>gi|427789289|gb|JAA60096.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 526
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 94/269 (34%), Positives = 137/269 (50%), Gaps = 34/269 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+P+++++ ++ + VV P+I I D T + TSS + IGGF+W +
Sbjct: 259 CEATDHWLEPMVELIKKDRTTVVCPIIDVIDDKTLQYMG-----TSSDFYQIGGFNWKGE 313
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W PE RK K+ A+P+ +PTMAGGLF+ID+ +F + G+YDS + WGGENLE+SF
Sbjct: 314 FIWINTPEAWRKARKSKADPMRSPTMAGGLFAIDRKYFWESGSYDSEMEGWGGENLEMSF 373
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ + P P P+ I+ A ++ + + + +
Sbjct: 374 RIWMCGGSLVIAPCSHVGHIFRDYHPYKFPS-NKDTHGINTARLAEV--WMDNYKYYFYQ 430
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGMC 220
N K FGD++ RK LR L CKSFKWYL+ N +GMC
Sbjct: 431 NRPELRKISFGDISERKALRNKLQCKSFKWYLDNVYPNKFVPSEKVFAFGNARNPNTGMC 490
Query: 221 IDSACKPTDMHKPVGLYPCHK---QGGNQ 246
+DS D +P+G+YPCHK GGNQ
Sbjct: 491 LDSMSHNYDNTEPLGIYPCHKDTNSGGNQ 519
>gi|390361781|ref|XP_790897.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Strongylocentrotus purpuratus]
Length = 521
Score = 143 bits (361), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 104/322 (32%), Positives = 150/322 (46%), Gaps = 48/322 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLL+ +A N +V P+I I ++ F G + G FDW L
Sbjct: 154 CEANYNWLPPLLERIALNRRRIVCPMIDVISNEDFHYESQAGDVMR------GAFDWELY 207
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFF-EKLGTYDSGFDIWGGENLELS 122
+ I E E KR + ++P TP MAGGLF++D+ +F E+LG YD G +IWGGE +LS
Sbjct: 208 YKRIPISEAENKRRSHESDPFRTPIMAGGLFAVDRKYFMEELGGYDEGLEIWGGEQYDLS 267
Query: 123 FKFNWHAIPERER---KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
FK W E E R + + T+ GG I+K + + D WG
Sbjct: 268 FKV-WMCGGEMEEIPCSRVGHIYRKFMSYTVPGGAGVINKNLLRVVEVW---MDEWGKYF 323
Query: 180 LELS--FKG-DFGDVTSRKELRRNLGCKSFKWYL----------------------EVSN 214
E KG D+GD++ + LR L CK+F W+L +++
Sbjct: 324 YERRPYLKGQDYGDISKQLALRERLQCKNFTWFLTEVAPDILQYYPPVEPEGGAKGHITH 383
Query: 215 DWSGMCIDSACKPTDMHKPVGLYP-CHKQGGNQFWMMSKHGEIR----RDEACLDYA--- 266
+G C+ + D + P KQGG+QFW ++ H + R + C+D+
Sbjct: 384 TSTGKCLTLSQGGKDELRVQECNPRSMKQGGSQFWELTWHDDFRPSSKSRKQCVDFPYGR 443
Query: 267 -GGDVILYPCHGSKGNQYFEYD 287
G + ILYPCH GNQ + YD
Sbjct: 444 EGAEPILYPCHHGGGNQLWVYD 465
>gi|196001851|ref|XP_002110793.1| hypothetical protein TRIADDRAFT_11844 [Trichoplax adhaerens]
gi|190586744|gb|EDV26797.1| hypothetical protein TRIADDRAFT_11844, partial [Trichoplax
adhaerens]
Length = 490
Score = 143 bits (360), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 111/318 (34%), Positives = 154/318 (48%), Gaps = 65/318 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL+PLL+ + N + VV P I I D TF+ +F P L G F+W L
Sbjct: 156 CEVTTGWLEPLLERIYLNETTVVCPEIDVIDDRTFQYQFGPPALMR------GVFNWQLY 209
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP E KR K+ +PVW+PTMAGGLF+I K FF++LGTYD FD+WGGEN+E+SF
Sbjct: 210 FRWALIPPEEHKRRKSPIDPVWSPTMAGGLFAISKKFFKRLGTYDDQFDVWGGENMEISF 269
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG------TYDSGFDIWGG 177
K W + E + + G +F ++ + K G ++W
Sbjct: 270 K-AWLCGGKLE----------IVPCSRVGHVFRHNQPY--KFGGNFLSRNSQRVAEVWLD 316
Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE------------------V 212
+ E + K +FG++ R EL++ L CK FKWYL+ +
Sbjct: 317 DYKEFFYQVQPHLRKEEFGNIAERLELKKKLKCKPFKWYLQNIYTDVVLPNESSIAKGKL 376
Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMM----SKHGEIRRDEACLDYA-- 266
N S MC+D+ K + + + +YPC W M S E+ E CLD +
Sbjct: 377 KNPASNMCLDTMGKTANAY--MSIYPCANS-----WTMEMSYSILEELVVSELCLDVSDN 429
Query: 267 --GGDVILYPCHGSKGNQ 282
G + LY CHG GNQ
Sbjct: 430 KDGARIQLYDCHGQGGNQ 447
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 32/60 (53%), Gaps = 7/60 (11%)
Query: 234 VGLYPCHKQGGNQFWMMSKHGEIRRDEA--CLDYAGGDV-ILYPCHGSKGNQYFEYDYKY 290
+ LY CH QGGNQ W+ H +IR CLD G+ ++ PC G +Q + +D Y
Sbjct: 435 IQLYDCHGQGGNQLWL---HKKIRHPNTGKCLDRGSGNTPVMKPCSGGV-SQMWSFDTYY 490
>gi|355689583|gb|AER98881.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Mustela putorius
furo]
Length = 461
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 95/265 (35%), Positives = 129/265 (48%), Gaps = 44/265 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
+ K D+GD++SR LR L CK F WYL E+ N
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCH 240
+ C+D+ + + + VG++ CH
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCH 460
>gi|156351115|ref|XP_001622369.1| hypothetical protein NEMVEDRAFT_v1g141560 [Nematostella vectensis]
gi|156208888|gb|EDO30269.1| predicted protein [Nematostella vectensis]
Length = 494
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 102/317 (32%), Positives = 142/317 (44%), Gaps = 58/317 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WLQPLL + +N VVSP+I I D F + GGFDW+L
Sbjct: 149 CECNTDWLQPLLKRVVQNKKAVVSPIIDVINMDDFSY-------IGASADIKGGFDWSLH 201
Query: 64 FNWHAI-PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+++ R P+ TP +AGGLF + K++FE++G YD+ DIWGGEN E+S
Sbjct: 202 FKWDNLTPEQKQSRRSTPIAPIKTPMIAGGLFVVTKSWFEEMGKYDTMMDIWGGENFEIS 261
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RKRH P P + +
Sbjct: 262 FRTWQCGGSMEIIPCSRVGHVFRKRH-----PYTFPDGNANTY---------MKNTRRTA 307
Query: 173 DIWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLEV---------SNDW 216
++W E + +G + SRKELR+ L CK FKWYL+ S D
Sbjct: 308 EVWMDEYKRFYYAARPMARSALYGSIKSRKELRKRLQCKPFKWYLQNVYPELQIPDSQDV 367
Query: 217 S-------GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY-AGG 268
S C+D+ + VG++ CH Q GNQ W ++K +R + CL +GG
Sbjct: 368 SFGELKQGKSCLDTL--GSQAGGSVGMFDCHGQAGNQEWALTKKSTVRHLDLCLTLGSGG 425
Query: 269 DVILYPCHGSKGNQYFE 285
V L C Q +E
Sbjct: 426 AVTLEGCRDGDPKQIWE 442
>gi|432865221|ref|XP_004070476.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Oryzias latipes]
Length = 621
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 94/288 (32%), Positives = 132/288 (45%), Gaps = 31/288 (10%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VVSP I I + F P + ++ + G FDW+L
Sbjct: 269 CECFHGWLEPLLARIVEEPTAVVSPEITTIDLNNFNFNKP---IATNRAYNRGNFDWSLT 325
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W AIPE R+ K+ PV TPT AGGLFSI K +FE +GTYD +IWGGEN+E+SF
Sbjct: 326 FGWEAIPEEARRLRKDETYPVKTPTFAGGLFSISKKYFEHIGTYDDKMEIWGGENVEMSF 385
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSGFDIWGG 177
+ IP P P + E + Y +
Sbjct: 386 RVWQCGGQLEIIPCSVVGHVFRTKSPHTFPKGTEVITRNQVRLAEVWMDDYKKIYYRRNK 445
Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGM 219
++ + +GD++ R +LR +L CK+F WYL + N S
Sbjct: 446 NAAIMAQEKKYGDISDRLKLREDLHCKNFSWYLNTIYPEIFVPDLTPEKFGAIKNLGSDT 505
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLD 264
C+D + KPV +Y CH GGNQ++ S H E+R + + CL
Sbjct: 506 CLDVG-ENNQGGKPVIMYMCHNMGGNQYFEYSSHKELRHNIGKQLCLQ 552
>gi|449271781|gb|EMC82021.1| Polypeptide N-acetylgalactosaminyltransferase 12, partial [Columba
livia]
Length = 314
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 88/217 (40%), Positives = 118/217 (54%), Gaps = 26/217 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A S VV P+I I +TFE L ++ + IGGFDW L
Sbjct: 107 CECHEGWLEPLLARIAEEESAVVCPVIDVIDWNTFEY------LGNAGEPQIGGFDWRLV 160
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH+ PERE+KR K+ + + +PTMAGGLFS+ K +F+ LG+YD+G ++WGGENLE SF
Sbjct: 161 FTWHSTPEREQKRRKSKTDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSF 220
Query: 124 KFNWHAIPERERK--RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ W E H P P +S KA L ++W E E
Sbjct: 221 RI-WQCGGSLEIHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDEYKE 269
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE 211
L + + +GDVT R+ LR L CK FKW+LE
Sbjct: 270 LYYHRNPHARLEPYGDVTERRLLREKLKCKDFKWFLE 306
>gi|410916145|ref|XP_003971547.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Takifugu rubripes]
Length = 579
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 100/316 (31%), Positives = 149/316 (47%), Gaps = 48/316 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WL PLL + ++ + VVSP+I I DTF ++ GGFDW+L
Sbjct: 229 CEVNKDWLPPLLQRIKQDPTRVVSPVIDIINMDTFAY-------VAASADLRGGFDWSLH 281
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + +R R + A+P+ TP +AGGLF ID+++F LG YD+ DIWGGEN E+SF
Sbjct: 282 FKWEQLSPEQRARRTDPAQPIKTPIIAGGLFVIDRSWFNHLGKYDTAMDIWGGENFEISF 341
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P G + K + F
Sbjct: 342 RVWQCGGSLEILPCSRVGHVFRKKH-----PYVFP--EGNANTYIKNTRRTAEVWMDDFS 394
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------SNDWSGM---- 219
++ + +GD+ R ELR+ L CK+FKWYL+ S+ SG+
Sbjct: 395 LFYYSARPAARGKSYGDIRGRLELRKKLKCKTFKWYLDNVYPELKVPDDSDSKSGVIKQR 454
Query: 220 --CIDSACKPTDMHKPVGLYPCH-KQGGN---QFWMMSKHGEIRRDEACLD----YAGGD 269
C++S + L PC QG N Q W+ + +IR+ + CL +
Sbjct: 455 QNCLESQRVEGQELPVLTLAPCVGSQGVNAIKQEWVYTHGQQIRQQQHCLSLSTTFPASQ 514
Query: 270 VILYPCHGSKGNQYFE 285
V+L PC+ + G Q ++
Sbjct: 515 VLLLPCNMADGKQRWQ 530
>gi|341878756|gb|EGT34691.1| CBN-GLY-9 protein [Caenorhabditis brenneri]
Length = 579
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 96/316 (30%), Positives = 147/316 (46%), Gaps = 51/316 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+P++ ++ + +V P+I +I D T + +GGF W L
Sbjct: 230 CEANHGWLEPIVQRISDERTAIVCPMIDSISDSTLAYH-------GDWSLSVGGFSWALH 282
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IPE E+KR K + + +PTMAGGL + ++ +F ++G YD DIWGGENLE+SF
Sbjct: 283 FTWEGIPEDEQKRRKKPTDYIRSPTMAGGLLAANREYFFEVGGYDEEMDIWGGENLEISF 342
Query: 124 KFNWHA------IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
+ NW IP A P + T + ++L ++W
Sbjct: 343 R-NWMCGGSIEFIPCSHVGHIFRAGHP-YNMTGRNNNKDVHGTNSKRLA------EVWMD 394
Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE------------------V 212
+ L + D GD+TSR ELR+ L CKSFKW+L+ +
Sbjct: 395 DYKRLYYMHREDLRTKDVGDLTSRHELRKRLNCKSFKWFLDNIAKGKFIMDEDVVAYGAL 454
Query: 213 SNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGN-QFWMMSKHGEIRRDEACLDYAGGD 269
SG MC D+ + M + +G++ C +G + Q +S+ G +RR+ C G+
Sbjct: 455 HTVVSGTRMCTDTLQRDEKMSQLLGVFHCQGKGSSPQLMSLSREGNLRRENTCASEENGN 514
Query: 270 VILYPCHGSKGNQYFE 285
V + C SK Q+ E
Sbjct: 515 VRMKTC--SKKAQFNE 528
>gi|324510655|gb|ADY44456.1| N-acetylgalactosaminyltransferase 9 [Ascaris suum]
Length = 577
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 91/290 (31%), Positives = 142/290 (48%), Gaps = 44/290 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + + V+ P+I I +T + + +GGF W+L
Sbjct: 228 CEANEGWLEPLLARIKEKRTAVLCPIIDYISAETMQYS------GDANVNAVGGFWWSLH 281
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +I + ER R K+A EPV +PTMAGGL + ++ +F ++G YD G DIWGGENLE+SF
Sbjct: 282 FRWDSIGKAERDRRKSAIEPVRSPTMAGGLLAANREYFLEVGGYDPGMDIWGGENLEISF 341
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ + IP A P + T GG + ++L ++W +
Sbjct: 342 RVWMCGGSIEFIPCSHVGHIFRAGHP-YNMTGPGGNLDVHGTNSKRLA------EVWMDD 394
Query: 179 NLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
L + D GD++ RK LR+ L CKSFKWYL+ +
Sbjct: 395 YKRLYYLHRPDLKTKDVGDLSERKALRKKLKCKSFKWYLDNVIPHKFIPDEGVVGYGALR 454
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGN-QFWMMSKHGEIRRDEAC 262
N SG+C+D+ + +G++ C G + Q + ++K G++RR+ C
Sbjct: 455 NPNSGLCLDTLQRDEKSTITLGIFACQTGGSSAQVFSLTKSGQLRREITC 504
>gi|390349674|ref|XP_003727260.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Strongylocentrotus purpuratus]
Length = 379
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 100/307 (32%), Positives = 144/307 (46%), Gaps = 52/307 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV WL+PLL LA + + VV P++ I DTF P L GGF+W +
Sbjct: 33 VEVMIGWLEPLLARLASDRTIVVMPVVDEINKDTFNYNVVPEPLQR------GGFNWRFE 86
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ W IP +++ K A P+ +P M GGL ++D++FF +LG +D G ++WGGENLE S
Sbjct: 87 YRWKPIPNYDKRPSKVA--PIKSPAMPGGLLTMDRSFFLELGGFDLGMEVWGGENLETSL 144
Query: 124 KF-----NWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
K + IP R +++ +P G +D + ++W
Sbjct: 145 KIWMCGGSIEIIPCSRVGHVYRDT-----SPYSFLGQNPLDIVEHNAMRV----VEVWTD 195
Query: 178 EN-------LELSFKGDFGDVTSRKELRRNLGCKSFKWYL-------------------- 210
E+ L + DFGDV+ RK+LR +L C F WYL
Sbjct: 196 EHKYHFYDRLPMLKNRDFGDVSKRKKLRESLNCYDFNWYLANVYPELYVPSSSSVLRQTI 255
Query: 211 EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY--AGG 268
N S +CIDS + K + + CH GGN+++ +K GEIR DE CL+ G
Sbjct: 256 NFQNKGSKLCIDSNDQNGQAGKNLIGWHCHNLGGNEYFEETKAGEIRNDELCLEANSVGT 315
Query: 269 DVILYPC 275
VIL PC
Sbjct: 316 HVILNPC 322
>gi|443720685|gb|ELU10336.1| hypothetical protein CAPTEDRAFT_176696 [Capitella teleta]
Length = 587
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 98/335 (29%), Positives = 151/335 (45%), Gaps = 60/335 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV W+QPLL + N V P+I I DTF P GGF+W L
Sbjct: 217 CEVNVEWIQPLLSHIHGNHKRVAVPIIDIIDQDTFRYESSP--------LVRGGFNWGLF 268
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ W IPE ++ ++ +P+ TPTMAGGLF++++ +F LG YD+G D+WGGENLE+SF
Sbjct: 269 YRWDQIPESLLRKQEDYVKPIKTPTMAGGLFAMNRKYFNDLGRYDTGMDVWGGENLEISF 328
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ + H +P P +P G+ +I K + + + +
Sbjct: 329 RVWQCGGSMHILPCSRVGHIFRKRRPYGSPV---GVDTITKNSLRVAHVWMDEYIKYFFQ 385
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSG------------- 218
+ + ++GDV+ RK LR L C+SFKW+L+ + +D G
Sbjct: 386 VRKTADHAEYGDVSDRKALRNELQCQSFKWFLDNVYPEQTLPSDKEGGGLIAKGHNLIKK 445
Query: 219 ----------------MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR-RDEA 261
+C+ ++ P D + L PC+ Q W + G +R
Sbjct: 446 DPEVIRKAHLKHFSSTLCVVASRSPYDKKSLLELKPCNPNNKQQVWHETFEGSMRLMGVL 505
Query: 262 CLDY---AGGDVILYP----CHGSKGNQYFEYDYK 289
CLD+ +GG YP CH S G+Q + + +K
Sbjct: 506 CLDFVDDSGGGNSPYPMLSKCHFSGGSQQWSWLHK 540
>gi|291243600|ref|XP_002741689.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Saccoglossus kowalevskii]
Length = 524
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 95/287 (33%), Positives = 138/287 (48%), Gaps = 41/287 (14%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL+P+L + + +VV+P+I I F ++ GGF W +QF W I
Sbjct: 178 WLEPMLQRIKEDRRNVVAPMIDGIDATKFSY--------AASNLIRGGFSWEMQFKWKPI 229
Query: 70 PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF---- 125
P+ E KR K+ P+ +PTMAGGLF+IDK++F ++GTYD G +IWG ENLELSFK
Sbjct: 230 PDYEMKRRKDETWPIRSPTMAGGLFAIDKSYFLEIGTYDPGLEIWGAENLELSFKIWMCG 289
Query: 126 -NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 184
N IP A++P P F + ++ D DI+ L+
Sbjct: 290 GNLEMIPCSHVGHVFRASQPYKFPEGNIKTFMRNNMRVAEVWM-DEYKDIFYA--LKPQL 346
Query: 185 KG-DFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
KG D+GDVT RKELR L C FKWYL+ +++ + + Q
Sbjct: 347 KGEDYGDVTERKELRDRLQCHDFKWYLQ-----------------NIYPELPIPDLKVQA 389
Query: 244 GNQFWMMSKHGEIRRDEACLDYAGGDVI-LYPCHGSKGNQYFEYDYK 289
+ + K G C+D G + + +PCHG NQ F + ++
Sbjct: 390 RGELRNLGKIG------YCMDTMGANAMCAHPCHGIGHNQMFSFSWQ 430
>gi|268580247|ref|XP_002645106.1| Hypothetical protein CBG16794 [Caenorhabditis briggsae]
Length = 568
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 101/315 (32%), Positives = 155/315 (49%), Gaps = 44/315 (13%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
EV + WL+PL+ +A + + +++P+I NI D+ F F GR GGF W L F
Sbjct: 227 EVSEGWLEPLISRVADDRTRIIAPIIDNISDEDFG--FSTGRTD-----LWGGFSWILSF 279
Query: 65 NWHAIPERERKRH-KNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
W + + +R AEP+ TPT+AGGLF+I++ +F ++G YD G ++WGGEN+E+SF
Sbjct: 280 KWFDMNGNDTQRLIAKKAEPIRTPTIAGGLFAINREYFYEMGAYDEGMEVWGGENVEISF 339
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
+ W E + T T F+ + F + + ++W E E
Sbjct: 340 RI-WMCGGSMEIHPCSHVGHVFRTKTPYS--FTKEVNFVIRRNQARTA-EVWMDEYKEFF 395
Query: 184 F-------KGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSNDWSGM 219
F K + GD+ RK LR L CK FKWYL+ + N +G
Sbjct: 396 FKMVPSAQKMEIGDLQERKSLRERLKCKPFKWYLKNVCSECHMPSEYHSLGAIVNKLNGK 455
Query: 220 CIDSACKPTDMHKPVGLYPC---HKQGGNQFWMMSKHGEIRRDEACL--DYAGGDVILYP 274
C+D + + P GL C H+Q GNQ W + + EIR CL + G ++ +
Sbjct: 456 CVDRGGRV--LGGPPGLGTCIHSHEQQGNQVWSWTGNKEIRSQNFCLSSNKKGSELKIEM 513
Query: 275 CHGSKGNQYFEYDYK 289
C+GS+ +Q FE++ K
Sbjct: 514 CNGSE-DQKFEFNRK 527
>gi|71896101|ref|NP_001026749.1| polypeptide N-acetylgalactosaminyltransferase 6 [Gallus gallus]
gi|60098353|emb|CAH65007.1| hypothetical protein RCJMB04_1b1 [Gallus gallus]
Length = 621
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 94/300 (31%), Positives = 146/300 (48%), Gaps = 33/300 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A + VVSP I I +TFE P + + G FDW+L
Sbjct: 271 CECFHGWLEPLLSRIAEEPTAVVSPDITTIDLNTFEFSKP---VQYGKQHSRGNFDWSLT 327
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P RER+R K+ P+ +PT AGGLF+I +++FE +G+YD +IWGGEN+E+SF
Sbjct: 328 FGWEVVPPRERQRRKDETVPIKSPTFAGGLFAISRSYFEHIGSYDDQMEIWGGENVEMSF 387
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWG 176
+ IP + P P + S ++ + + Y F
Sbjct: 388 RVWQCGGQLEIIPCSVVGHVFRSKSPHTFPK-GTQVISRNQVRLAEVWMDDYKEIFYRRN 446
Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSG 218
+ +++ + +GD+T R+ LR L CK+F WYL+ + N+ +
Sbjct: 447 QQAAQMAREKTYGDITERRRLRERLHCKNFTWYLQNVYPEMFVPDLNPTSYGAIKNEGTN 506
Query: 219 MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVILYPC 275
C+D + KP+ +YPCH GGNQ++ + ++R + + CL G V L C
Sbjct: 507 SCLDVG-ENNHGGKPLIMYPCHGMGGNQYFEYTTQRDLRHNVGKQLCLRAGAGPVQLGEC 565
>gi|221130543|ref|XP_002162500.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Hydra magnipapillata]
Length = 578
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 107/324 (33%), Positives = 150/324 (46%), Gaps = 59/324 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL + N V SP+I I + F + SS GGF WNL
Sbjct: 233 CECNEMWLEPLLQAIKDNRKIVASPIIDVIGHEDF-------KYLSSSSDLRGGFGWNLN 285
Query: 64 FNWHAIPERERKRHKNAAEP-VWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W +P +H+ + +P +AGGLFSI K++FE+LG YD D+WGGENLE+S
Sbjct: 286 FKWDFLPPNHLIKHQQDGTAFILSPVIAGGLFSIHKSWFEELGKYDPQMDVWGGENLEIS 345
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP R RH P P GG ++ F+K
Sbjct: 346 FRTWQCGGEMYIIPCSRVGHVFRDRH-----PYKFP---GGSMNV----FQK--NTRRAA 391
Query: 173 DIWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSND--- 215
++W + + F FGD+ R +LR++L CKSFKWYLE V +D
Sbjct: 392 EVWMDDYKKYYFAAVPSARYSLFGDIRDRLQLRKDLNCKSFKWYLENIYPELKVPDDDVI 451
Query: 216 ---WSGMCIDSACKPTDMH---KPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD 269
+ C T H + +GL+PCH QGGNQ W +K +I+ + CL
Sbjct: 452 KYGQIKYKVSEDCLDTMGHIKGEGIGLFPCHGQGGNQDWSWTKSNQIKHESLCLSGISKK 511
Query: 270 ----VILYPCHGSKGNQYFEYDYK 289
V + PC + Q ++YD K
Sbjct: 512 SEEIVRMVPCVATDNFQKWKYDEK 535
>gi|326508656|dbj|BAJ95850.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 637
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 104/305 (34%), Positives = 151/305 (49%), Gaps = 38/305 (12%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL+ LL + ++ + VV P+I I DD F LT S GGF+W L F W+ +
Sbjct: 294 WLEYLLYEVKKDRTAVVCPIIDVINDDDF------AYLTGS-DMTWGGFNWRLNFRWYPV 346
Query: 70 PERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWH 128
P RE +R+ + + P+ +PTMAGGLF+ID+ +F ++G YD G ++WGGENLE+SF+ W
Sbjct: 347 PNREEVRRNYDHSLPLLSPTMAGGLFTIDRKYFYEIGAYDPGMEVWGGENLEMSFRV-WQ 405
Query: 129 AIPERERK--RHKNAAEPVWTP-TMAGG----LFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ H TP T GG +F +K E D D E
Sbjct: 406 CGGKVLIHPCSHVGHVFRKQTPYTFPGGTGKVIFHNNKRLVEVW--LDKYKDFVYAIMPE 463
Query: 182 LSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCID----SACKPTD-------- 229
L D GDV+ R LR L CK F+WYL+ S M +D A + D
Sbjct: 464 LK-NVDAGDVSERLALRERLQCKDFRWYLQNIYPESSMPVDFHHVGALRNQDHGCADSLG 522
Query: 230 ------MHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVI-LYPCHGSKGNQ 282
+++ G++PCH QGGNQ + SK GE++ D+ C++ + + L C Q
Sbjct: 523 YDSENGVNQNAGIFPCHNQGGNQIVVFSKSGELKFDDLCMEGSKNSAVKLQKCTEGNQKQ 582
Query: 283 YFEYD 287
+EY+
Sbjct: 583 VWEYN 587
>gi|410955524|ref|XP_003984401.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Felis
catus]
Length = 552
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 101/320 (31%), Positives = 143/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I D F S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIISLDNFNY-------IESAAELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF +DK++FE LG YD+ DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFEYLGKYDTDMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR ELR+NL C+SFKWYLE V ND S
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNVESRLELRKNLHCQSFKWYLENVYPELRVPNDSSIQ 420
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 421 KGTIRQRQKCLESQRQRNTEIYNLRLSPCVKIKGEDAKSQIWAFTYTQQILQEELCLSVV 480
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 481 TIFPGAPVVLVLCKNGDDRQ 500
>gi|405977048|gb|EKC41520.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Crassostrea gigas]
Length = 635
Score = 140 bits (353), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 101/313 (32%), Positives = 149/313 (47%), Gaps = 58/313 (18%)
Query: 11 LQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIP 70
L+P+L + + VV P++ I T E G + +GGF W+L F W +P
Sbjct: 294 LEPILSRIKEFPNSVVCPIVDAIDAHTLEYSKNGG-------YQVGGFSWSLHFTWRDVP 346
Query: 71 ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA- 129
R+ H+ +PV +PTMAGGLF+ D+ FF ++G YD G D+WGGENLE+SF+ W
Sbjct: 347 SRDLV-HRKYTDPVGSPTMAGGLFAADRKFFFEIGAYDPGMDVWGGENLEISFR-TWMCG 404
Query: 130 -----IPERERKRHKNAAEPVWTP--TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 182
IP A+ P P G+ S+ A ++W E L
Sbjct: 405 GKLEFIPCSRVGHIFRASHPYTFPGNKDTHGINSMRLA------------EVWMDEYKRL 452
Query: 183 SFK-------GDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWS 217
+ D+GD++ R ELR+ L CKSFKW+L+ V N S
Sbjct: 453 FYTHRKDLLGQDYGDISERVELRKRLNCKSFKWFLDNVYPEKFIPDENVHAWGMVRNPPS 512
Query: 218 GMCIDSACKPTDMHKPVGLYPCHK-QGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYP 274
+C+D+ K +G+Y C N+ + +S + E+RR+EACL GG V L
Sbjct: 513 NLCLDTLQKDEKTVFDMGIYSCQNGASANEVFSLSINDELRREEACLTVVSEGGRVPLES 572
Query: 275 CHGSKGNQYFEYD 287
C G+ NQ +++D
Sbjct: 573 CTGA-ANQKWKHD 584
Score = 110 bits (275), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 67/196 (34%), Positives = 97/196 (49%), Gaps = 29/196 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+P+L + + + V+ P I I +T + + F +GGF W+L
Sbjct: 219 CETNTGWLEPMLARIKEDRTAVLCPEIDLIDKNTLQY-------GGTGSFSVGGFWWSLH 271
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLF------------SIDKAFFE--KLGTYDS 109
F+W IPE E+KR + P+ + + +ID E K G Y
Sbjct: 272 FSWRPIPEHEQKRRSSGIAPIRLEPILSRIKEFPNSVVCPIVDAIDAHTLEYSKNGGYQV 331
Query: 110 GFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
G W S F W +P R+ H+ +PV +PTMAGGLF+ D+ FF ++G YD
Sbjct: 332 GGFSW-------SLHFTWRDVPSRDLV-HRKYTDPVGSPTMAGGLFAADRKFFFEIGAYD 383
Query: 170 SGFDIWGGENLELSFK 185
G D+WGGENLE+SF+
Sbjct: 384 PGMDVWGGENLEISFR 399
>gi|198415534|ref|XP_002121475.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
2, partial [Ciona intestinalis]
Length = 582
Score = 140 bits (353), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 104/322 (32%), Positives = 145/322 (45%), Gaps = 63/322 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E K WL+PLL +A + + VV P+I I D FE L GGFDWNL
Sbjct: 237 VECNKNWLEPLLQRIADDRTAVVCPIIDVINMDNFEYIGASADLR-------GGFDWNLV 289
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + ER+ R N P+ TP +AGGLFS+DK++F +LG YD+ D+WGGENLE+S
Sbjct: 290 FKWDYMSSEERRSRAGNPTAPISTPMIAGGLFSMDKSYFNQLGKYDTAMDVWGGENLEIS 349
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ IP RK+H P P +G +F+ +
Sbjct: 350 FRVWQCGGRLEIIPCSRVGHVFRKQH-----PYTFPGGSGNVFTRNTR---------RAA 395
Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS- 217
++W + E + FG++ +R ++R CK FKWYLE V + S
Sbjct: 396 EVWMDDYKEYYYAAVPSAKLIPFGNIENRLQIRVRNQCKPFKWYLENVYPELRVPSKESV 455
Query: 218 ---------GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA-- 266
CID+ + +GLY CH GGNQ + M+K +IR + C
Sbjct: 456 AFGSIKQGVNKCIDTLGHVQE--GSIGLYECHDSGGNQEFSMNKEMQIRHQDLCFTAGEG 513
Query: 267 ---GGDVILYPCHGSKGNQYFE 285
G + L C + Q FE
Sbjct: 514 AREGSIIKLRHCDENNTMQKFE 535
>gi|268569766|ref|XP_002648333.1| C. briggsae CBR-GLY-4 protein [Caenorhabditis briggsae]
Length = 523
Score = 140 bits (352), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 101/322 (31%), Positives = 150/322 (46%), Gaps = 68/322 (21%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
E ++WL+PLL +A N VV+P+I I D F L GGFDW L F
Sbjct: 171 ECNQKWLEPLLSRIAENPKAVVAPIIDVINVDNFNYVGASADLR-------GGFDWTLVF 223
Query: 65 NWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
W + E RK RH + P+ +PTMAGGLF+I K +FE+LGTYD ++WGGENLE+SF
Sbjct: 224 RWEFMNEELRKDRHAHPTAPIKSPTMAGGLFAISKEWFEELGTYDLDMEVWGGENLEMSF 283
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H+ T GG ++ F+K +
Sbjct: 284 RVWQCGGSLEILPCSRVGHVFRKKHQY--------TFPGGSGNV----FQK--NTRRAAE 329
Query: 174 IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV-------------- 212
+W E + K ++GD+ R +R L CKSFKWYL+
Sbjct: 330 VWMDEYKAIYLKNVPSARFVNYGDIGDRLAIRDRLQCKSFKWYLDTVYPQLASLTRNVSS 389
Query: 213 -SNDWS-------GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWM---MSKHGEIRRDEA 261
+ W +C+DS + + + L+ CH GGNQ W+ ++K + +
Sbjct: 390 QKDAWQIAPMKIGHLCLDSMARKEN--EAPALFACHGTGGNQEWIFDDLTKTFKNAISQM 447
Query: 262 CLDYAG--GDVILYPCHGSKGN 281
CLD++ DV++ C + N
Sbjct: 448 CLDFSAEKKDVVMVKCENLRSN 469
>gi|307207692|gb|EFN85329.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Harpegnathos
saltator]
Length = 598
Score = 140 bits (352), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 104/314 (33%), Positives = 145/314 (46%), Gaps = 45/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WLQPLL + + V+ P+I NI ++T E + F +GGF W+
Sbjct: 240 CEVIKDWLQPLLQRIKEKRNAVLMPIIDNISEETLEYFHD----NEASFFQVGGFTWSGH 295
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W I + E K + P +PTMAGGLF+ID+ +F ++G+YD D WGGENLE+SF
Sbjct: 296 FTWINIQKHELKSRLSLISPTRSPTMAGGLFAIDRKYFWEVGSYDDKMDGWGGENLEMSF 355
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPT--MAGGLFSIDKAFFEKLGTYDSGFDIWG 176
+ IP P P G+ + AF + Y F +
Sbjct: 356 RIWQCGGTLEIIPCSRVGHIFRNFHPYKFPNDKDTHGINTARLAFVW-MDEYKRLFLLHR 414
Query: 177 GENLELSFKGD---FGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
E FK FGD++ R +LRR L CKSFKWYL+ V
Sbjct: 415 SE-----FKNKSSLFGDISERLKLRRKLKCKSFKWYLDNIYPEKFIPDEHAIAYGRVRLR 469
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCH-KQGGNQFWMMSKHGEIRRDEACLDYAGGD----- 269
+C+D+ + D +GLY CH K +QF+ +S GE+RRD+ C D
Sbjct: 470 NRLLCLDNLQRDEDKPYNLGLYSCHSKLYPSQFFSLSNSGELRRDDNCARVNADDSRVHT 529
Query: 270 -VILYPCHGSKGNQ 282
V + C+ KG +
Sbjct: 530 QVEMSDCNNEKGGK 543
>gi|345782166|ref|XP_540140.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Canis
lupus familiaris]
Length = 552
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 98/320 (30%), Positives = 143/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I D F S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIISLDNFNY-------IESAAELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + AEP+ TP +AGGLF +DK++F LG YD+ DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPAEPIRTPIIAGGLFVMDKSWFNYLGKYDTDMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG++ SR +LR+NL C+SFKWYLE + ND S
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNIESRLDLRKNLQCQSFKWYLENVYPELRIPNDSSIQ 420
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 421 KGNIRQRQKCLESQRQKNTEIYDLRLSPCVKTKGKDAKSQIWAFTYTQQILQEELCLSVV 480
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 481 TVFPGAPVVLVVCKNGDDKQ 500
>gi|326434666|gb|EGD80236.1| polypeptide N-acetylgalactosaminyltransferase 13 [Salpingoeca sp.
ATCC 50818]
Length = 641
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 106/326 (32%), Positives = 155/326 (47%), Gaps = 67/326 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+P+LD++A N + VV+P+I I T E + TS+ +G FDW L
Sbjct: 299 CEANQGWLEPILDIIATNRTTVVTPVIDTIDHRTMEY----AKWTSNIPS-VGTFDWTLD 353
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNW + R ++ +P+ +PTMAGGLF+ID+ +F ++G+YD D WGGEN+E+SF
Sbjct: 354 FNWKSGVLRPGQK---LTDPIDSPTMAGGLFAIDRDYFYEIGSYDEDMDGWGGENVEMSF 410
Query: 124 KFNWHAIPE-------------RERKRHKNAAEPVWTPTMAGGLFSID------KAFFEK 164
+ W R+ +K + + M + + K FF
Sbjct: 411 RI-WQCGGRLVTAPCSHVGHIFRDTHPYKVPGKGIHHTFMKNSMRLAEVWMDDYKQFF-- 467
Query: 165 LGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW------- 216
YD+ EN+ D GD+T RK LR L CK FKWYL+ V D
Sbjct: 468 ---YDTKPK---RENI------DIGDLTKRKALRERLKCKPFKWYLKHVLPDLFVPDSEH 515
Query: 217 ----------SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRR-DEACLDY 265
+G+C+D G++ CH +GGNQ WM + + EIR D CLD
Sbjct: 516 VLHKGALRAGNGLCLDKMGHRAGGQ--AGVFSCHGEGGNQGWMYTVNDEIRTADSLCLDV 573
Query: 266 AGGD----VILYPCHGSKGNQYFEYD 287
+ L CH +GNQ ++Y+
Sbjct: 574 YSSKFPAPIHLQRCHQKQGNQAWKYE 599
>gi|312094065|ref|XP_003147897.1| hypothetical protein LOAG_12336 [Loa loa]
Length = 560
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/319 (31%), Positives = 145/319 (45%), Gaps = 46/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + +N + P+I I D + R +S K + G F+W L
Sbjct: 211 CEVNINWLPPLLAPIRQNRKVMTVPVIDGIDKDDWSYRI---VYSSVDKHYRGIFEWGLL 267
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +E R K+++EP +PT AGGLF+I K +FE+LG YD G IWGGE ELSF
Sbjct: 268 YKETEIPAQELLRRKHSSEPFRSPTHAGGLFAISKKWFEELGYYDPGLQIWGGEQYELSF 327
Query: 124 KFNWHA------IPERERKRHKNAAEPV------WTPTMAGGLFSIDKAFFEKLGTYDSG 171
K W IP + P P ++ + + K + ++ Y
Sbjct: 328 KI-WQCGGGILFIPCSHVGHVYRSHMPYGFGKLSGKPVISTNMLRVIKTWMDEYEKYYYI 386
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL--------------------- 210
+ L GD++S+ +LR L CKSF+WY+
Sbjct: 387 REPSAKHRLP-------GDISSQLKLRERLKCKSFEWYMEKVAYDVIVSYPLPPENHVWG 439
Query: 211 EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDV 270
E N +G CID+ + + VG PCH GGNQ ++K G++ + E C+ GG++
Sbjct: 440 EAKNHATGKCIDTIGQ--TIPGIVGAMPCHGYGGNQLIRLNKEGQLTQGEWCITPVGGNL 497
Query: 271 ILYPCHGSKGNQYFEYDYK 289
+ C + F YD K
Sbjct: 498 VTKYCVKGTVDGPFAYDEK 516
>gi|393911317|gb|EFO16172.2| hypothetical protein LOAG_12336 [Loa loa]
Length = 562
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/319 (31%), Positives = 145/319 (45%), Gaps = 46/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + +N + P+I I D + R +S K + G F+W L
Sbjct: 213 CEVNINWLPPLLAPIRQNRKVMTVPVIDGIDKDDWSYRI---VYSSVDKHYRGIFEWGLL 269
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +E R K+++EP +PT AGGLF+I K +FE+LG YD G IWGGE ELSF
Sbjct: 270 YKETEIPAQELLRRKHSSEPFRSPTHAGGLFAISKKWFEELGYYDPGLQIWGGEQYELSF 329
Query: 124 KFNWHA------IPERERKRHKNAAEPV------WTPTMAGGLFSIDKAFFEKLGTYDSG 171
K W IP + P P ++ + + K + ++ Y
Sbjct: 330 KI-WQCGGGILFIPCSHVGHVYRSHMPYGFGKLSGKPVISTNMLRVIKTWMDEYEKYYYI 388
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL--------------------- 210
+ L GD++S+ +LR L CKSF+WY+
Sbjct: 389 REPSAKHRLP-------GDISSQLKLRERLKCKSFEWYMEKVAYDVIVSYPLPPENHVWG 441
Query: 211 EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDV 270
E N +G CID+ + + VG PCH GGNQ ++K G++ + E C+ GG++
Sbjct: 442 EAKNHATGKCIDTIGQ--TIPGIVGAMPCHGYGGNQLIRLNKEGQLTQGEWCITPVGGNL 499
Query: 271 ILYPCHGSKGNQYFEYDYK 289
+ C + F YD K
Sbjct: 500 VTKYCVKGTVDGPFAYDEK 518
>gi|72000997|ref|NP_001024216.1| Protein GLY-4, isoform a [Caenorhabditis elegans]
gi|51316004|sp|Q8I136.2|GALT4_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
Short=pp-GaNTase 4; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 4; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 4
gi|3047189|gb|AAC13670.1| GLY4 [Caenorhabditis elegans]
gi|11064525|emb|CAC14394.1| Protein GLY-4, isoform a [Caenorhabditis elegans]
Length = 589
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/301 (32%), Positives = 140/301 (46%), Gaps = 62/301 (20%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
E ++WL+PLL +A N VV+P+I I D F L GGFDW L F
Sbjct: 243 ECNQKWLEPLLARIAENPKAVVAPIIDVINVDNFNYVGASADLR-------GGFDWTLVF 295
Query: 65 NWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
W + E+ RK RH + P+ +PTMAGGLF+I K +F +LGTYD ++WGGENLE+SF
Sbjct: 296 RWEFMNEQLRKERHAHPTAPIRSPTMAGGLFAISKEWFNELGTYDLDMEVWGGENLEMSF 355
Query: 124 KFNWHAIPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
+ W E RK+H P P +G +F +
Sbjct: 356 RV-WQCGGSLEIMPCSRVGHVFRKKH-----PYTFPGGSGNVFQKNTR---------RAA 400
Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-------------- 211
++W E + K +FGD+T R +R L CKSFKWYLE
Sbjct: 401 EVWMDEYKAIYLKNVPSARFVNFGDITDRLAIRDRLQCKSFKWYLENVYPQLEIPRKTPG 460
Query: 212 --VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYA 266
+C+DS + + P GL+ CH GGNQ W+ + + ++ + CLD++
Sbjct: 461 KSFQMKIGNLCLDSMAR-KESEAP-GLFGCHGTGGNQEWVFDQLTKTFKNAISQLCLDFS 518
Query: 267 G 267
Sbjct: 519 S 519
>gi|260794623|ref|XP_002592308.1| hypothetical protein BRAFLDRAFT_206872 [Branchiostoma floridae]
gi|229277524|gb|EEN48319.1| hypothetical protein BRAFLDRAFT_206872 [Branchiostoma floridae]
Length = 374
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 83/219 (37%), Positives = 114/219 (52%), Gaps = 32/219 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLLD + +N SHVV+P+I I TFE R + GFDW L
Sbjct: 158 CECNIGWLEPLLDRIVQNRSHVVTPVIDVIDFKTFEYRHLA-------IIQVRGFDWRLI 210
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP KR + +P+ +PTMAGGLF+IDK +F LG YD+G +IWGGENLELSF
Sbjct: 211 FRWEKIPASYEKRRGLSVDPILSPTMAGGLFAIDKEYFHHLGLYDTGMEIWGGENLELSF 270
Query: 124 KF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
+ +P R+R ++ + E L + + + ++ Y
Sbjct: 271 RIWQCGGTLEIMPCSRVGHVFRQRFPYQTSTE-----VTTRNLMRVAEVWMDQYKEY--- 322
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL 210
+ +++ K FGDVT R+ELRR L C+ F WYL
Sbjct: 323 --FYQIRHIK---KKSFGDVTERQELRRRLQCRDFHWYL 356
>gi|350582569|ref|XP_003481303.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Sus scrofa]
Length = 552
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 98/320 (30%), Positives = 144/320 (45%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF+ S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFDY-------IESATELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF +DK++F+ LG YD+ DIWGGEN E+SF
Sbjct: 255 FQWEQLTPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFDYLGKYDTDMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360
Query: 174 IWGGE-------NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG++ SR +LRRNL C+SFKWYLE + D S
Sbjct: 361 VWMDEYKQYYYASRPFALERPFGNIESRLDLRRNLQCQSFKWYLENVYPELRIPKDSSIQ 420
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 421 KGNIRQRQKCLESQKQKDQEISNLRLSPCVKIEGKDAKSQIWAFTYTQQILQEELCLSVI 480
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 481 TLFPGAPVVLVLCKNGDDRQ 500
>gi|390347275|ref|XP_003726736.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Strongylocentrotus purpuratus]
Length = 507
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 102/315 (32%), Positives = 146/315 (46%), Gaps = 49/315 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD + RNS+ VVSP I +I D F F G + GGF W +
Sbjct: 145 CECTEGWLEPLLDCINRNSTRVVSPAIDSISDTDFSYTFIRGIART------GGFSWFPE 198
Query: 64 FNWHAIPERERKRH-KNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W P+RE KR ++AA P+ TPT+AGGLF+ID+ FF+ LG YD +WG ENLELS
Sbjct: 199 FMWTHAPQREMKRVWQDAATPLRTPTIAGGLFAIDRKFFKSLGYYDPELHVWGSENLELS 258
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
FK + +P R H ++P + G ++ L ++WGG
Sbjct: 259 FKVWQCGGSLEVVP-CSRVGHVFRSKPPY--DFPGNPETV------LLRNNKRVLEVWGG 309
Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
+ L + D GD++SR +R L CK+F+WYLE
Sbjct: 310 QIKHLFYGLTPEYQAVDAGDISSRIRIRDELKCKNFEWYLENVYPENILPLNFQALGRFM 369
Query: 214 NDWSGMCID--SACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD-- 269
N+ +CID A M + + C + Q + + ++R D C+ GD
Sbjct: 370 NEGVNLCIDVLHATDGRRMGAHLAVNACREGALAQTFSWNDLSQLRHDRFCITAVEGDNH 429
Query: 270 VILYPCHGSKGNQYF 284
V+L C N+
Sbjct: 430 VMLLECQDVHYNRLL 444
>gi|224047294|ref|XP_002195048.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Taeniopygia guttata]
Length = 552
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 103/321 (32%), Positives = 143/321 (44%), Gaps = 64/321 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WL PLL + + + VVSP+I I DTF ++ GGFDW+L
Sbjct: 202 CEVNKDWLLPLLQRIKEDPTRVVSPVIDIINLDTFAY-------VAASSDLRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ + + EP+ TP +AGGLF IDKA+F LG YDS DIWGGEN E+SF
Sbjct: 255 FKWEQLSPEQKAKRLDPTEPIKTPIIAGGLFVIDKAWFNHLGKYDSAMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + IP RK+H P P + + +
Sbjct: 315 RVWMCGGSLEIIPCSRVGHVFRKKH-----PYVFPEGNANTY---------IKNTKRTAE 360
Query: 174 IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE----------VSNDW 216
+W E + + FG++ SR ELR+ L C SFKWYLE S
Sbjct: 361 VWMDEFKQYYYAARPAAQGRPFGNIQSRVELRKKLKCHSFKWYLENVYPELRIPKESLYQ 420
Query: 217 SGM------CIDSACKPTDMHKPV-GLYPCHKQGG----NQFWMMSKHGEIRRDEACLD- 264
+G+ C++S K D P+ L PC+ G Q W + + IR+ + CL
Sbjct: 421 TGIIRQRQSCLESH-KSEDQEFPILSLTPCNSSKGIVPKAQEWTYTYNHHIRQQQLCLSV 479
Query: 265 ---YAGGDVILYPCHGSKGNQ 282
+ G V+L PC Q
Sbjct: 480 YTLFPGSQVLLSPCKEGDNKQ 500
>gi|197099330|ref|NP_001124852.1| polypeptide N-acetylgalactosaminyltransferase 14 [Pongo abelii]
gi|55726129|emb|CAH89838.1| hypothetical protein [Pongo abelii]
Length = 552
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGHANTY---------IKNTKRTAE 360
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 420
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 421 KGNIRQRQKCLESQRQSNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 480
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 481 TLFPGAPVVLVLCKNGDDRQ 500
>gi|426335177|ref|XP_004029109.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
1 [Gorilla gorilla gorilla]
Length = 552
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 420
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 421 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQIWAFTYTQQILQEELCLSVI 480
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 481 TLFPGAPVVLVLCKNGDDRQ 500
>gi|327274386|ref|XP_003221958.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Anolis carolinensis]
Length = 608
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 104/329 (31%), Positives = 144/329 (43%), Gaps = 66/329 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNELWLQPLLTPIRESRKTVVCPVIDIISADTLTY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + + A P+ +PTMAGGLF++D+ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELEGPEGATAPIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISF 359
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RKR + P TMA + + D D
Sbjct: 360 RIWMCGGKLLIIPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWM------DEYKD 412
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
+ EL + ++G++T R ELR+ L CKSFKWYL+
Sbjct: 413 QYFALRPELRMR-NYGNITDRVELRKKLNCKSFKWYLDNIYPEMQISGSNAKVQPPLFFN 471
Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK-HGEIR 257
+ + S C+ + P+ V + C NQ W+ ++ H I
Sbjct: 472 KGQKRPKTLQRGRLRHLQSDKCLVAQGHPSQKGGLVVVRECDYSDQNQVWLYNEDHELIL 531
Query: 258 RDEACLDYAGGDVI----LYPCHGSKGNQ 282
+ CLD + L CHGS G+Q
Sbjct: 532 NNLLCLDVSETRTSDPPRLMKCHGSGGSQ 560
>gi|153792142|ref|NP_001093363.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 16 [Xenopus laevis]
gi|148744516|gb|AAI42582.1| LOC100101309 protein [Xenopus laevis]
Length = 563
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 97/319 (30%), Positives = 140/319 (43%), Gaps = 62/319 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + + VVSP+I I D F L GGFDW+L
Sbjct: 222 CEVNNEWLQPLLQRVKDDHTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 274
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + + TP +AGG+F IDK++F +LG YD+ DIWGGEN ELSF
Sbjct: 275 FKWEQIPIEQKMSRTDPTSSIRTPVIAGGIFVIDKSWFNQLGKYDTQMDIWGGENFELSF 334
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P D + +
Sbjct: 335 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYEFP---------DGNALTYIKNTKRTVE 380
Query: 174 IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-------------VS 213
+W E + ++ +G V R ELR+ L CKSF+WYL+ +S
Sbjct: 381 VWMDEYKQYYYQARPSAIGKSYGSVADRAELRKKLSCKSFQWYLQNVYPELKVPEKEVIS 440
Query: 214 N--DWSGMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA- 266
G C++S + T + P+ L C + Q W +S + IR+ + CL +
Sbjct: 441 GLIKQGGNCLESQTRDTTGNNPIMLTQCKGSANSAPAAQEWALSDNV-IRQQDRCLTISS 499
Query: 267 ---GGDVILYPCHGSKGNQ 282
G V++ PC+ Q
Sbjct: 500 FSTGALVMMEPCNQKDSRQ 518
>gi|426335179|ref|XP_004029110.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Gorilla gorilla gorilla]
Length = 532
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 182 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 234
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 235 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 294
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 295 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 340
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 341 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 400
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 401 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQIWAFTYTQQILQEELCLSVI 460
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 461 TLFPGAPVVLVLCKNGDDRQ 480
>gi|307173963|gb|EFN64693.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Camponotus
floridanus]
Length = 597
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 99/288 (34%), Positives = 140/288 (48%), Gaps = 39/288 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WLQPLL + N + V+ P+I NI ++T E + F +GGF W+
Sbjct: 240 CEVIKDWLQPLLQRIKDNKNAVLMPIIDNISEETLEYFHD----NEASFFQVGGFTWSGH 295
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W I + E + + P +PTMAGGLF+I++ +F ++G+YD D WGGENLE+SF
Sbjct: 296 FTWINIQKHEVESRPSPISPTRSPTMAGGLFAINRKYFWEIGSYDDKMDGWGGENLEMSF 355
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPT--MAGGLFSIDKAFFEKLGTYDSGFDIWG 176
+ IP P P G+ + AF G Y F +
Sbjct: 356 RIWQCGGTLEIIPCSRVGHIFRNFHPYKFPNDKDTHGINTARLAFVWMDG-YKRLFLLHR 414
Query: 177 GENLELSFKGD---FGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
E FK + FGDV+ R ELR+ L CKSFKWYL+ V
Sbjct: 415 SE-----FKDNPKLFGDVSERLELRKRLKCKSFKWYLDNIYPEKFIPDEDAVAYGRVRLR 469
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCH-KQGGNQFWMMSKHGEIRRDEAC 262
+C+D+ + D +GLY CH K +QF+ +S GE+R+D++C
Sbjct: 470 NKPLCLDNLQQEEDKPYNLGLYTCHSKLYPSQFFSLSNAGELRKDDSC 517
>gi|426335181|ref|XP_004029111.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
3 [Gorilla gorilla gorilla]
Length = 517
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 167 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 219
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 220 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 279
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 280 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 325
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 326 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 385
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 386 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQIWAFTYTQQILQEELCLSVI 445
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 446 TLFPGAPVVLVLCKNGDDRQ 465
>gi|426335183|ref|XP_004029112.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
4 [Gorilla gorilla gorilla]
Length = 557
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 96/315 (30%), Positives = 138/315 (43%), Gaps = 52/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 207 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 259
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 260 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 319
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ + +P P P + + ++W E
Sbjct: 320 RVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTY---------IKNTKRTAEVWMDE 370
Query: 179 NLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS------- 217
+ + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 371 YKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQKGNIR 430
Query: 218 --GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAG 267
C++S + + L PC K G +Q W + +I ++E CL + G
Sbjct: 431 QRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQIWAFTYTQQILQEELCLSVITLFPG 490
Query: 268 GDVILYPCHGSKGNQ 282
V+L C Q
Sbjct: 491 APVVLVLCKNGDDRQ 505
>gi|387017710|gb|AFJ50973.1| Polypeptide N-acetylgalactosaminyltransferase 11-like [Crotalus
adamanteus]
Length = 608
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 103/329 (31%), Positives = 143/329 (43%), Gaps = 66/329 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNEMWLQPLLTPIQESRRTVVCPVIDIISADTLTY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + + A P+ +PTMAGGLF++D+ +F LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLLEMEGPEQATAPIKSPTMAGGLFAMDREYFNALGQYDSGMDIWGGENLEISF 359
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RKR + P TMA + + D +
Sbjct: 360 RIWMCGGKLVIIPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWM------DEYKE 412
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
+ EL + ++G++T R ELR+ L CKSFKWYL+
Sbjct: 413 QYFALRPELRTR-NYGNITDRVELRKKLNCKSFKWYLDNVYPEMQISGPNAKVQPPIFFN 471
Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK-HGEIR 257
+ + + C+ + P+ V + C NQ WM ++ H I
Sbjct: 472 KGQKRPKLLQQGRLYHLQTNKCLVAQSNPSQKGGLVVVKECDYSNKNQIWMYNEDHELIL 531
Query: 258 RDEACLDYAGGDVI----LYPCHGSKGNQ 282
+ CLD + L CHGS G+Q
Sbjct: 532 NNLLCLDVSETRTSDPPRLMKCHGSGGSQ 560
>gi|156397428|ref|XP_001637893.1| predicted protein [Nematostella vectensis]
gi|156225009|gb|EDO45830.1| predicted protein [Nematostella vectensis]
Length = 398
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 92/272 (33%), Positives = 130/272 (47%), Gaps = 47/272 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + N S VV+P I I TF G G F+W L
Sbjct: 145 CEANLGWLEPLLARIGENRSIVVTPDIEVIDLRTFGYTHEHGANNR------GIFNWELT 198
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IPE ER+R K+ ++P+ +PTMAGGLF+IDK++F ++G+YD+ WGGEN+E+SF
Sbjct: 199 FKWRGIPEYERRRRKSDSDPIRSPTMAGGLFAIDKSYFYEIGSYDTEMSFWGGENVEISF 258
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG----FDIWGGE- 178
+ W + + + G +F + + G D ++W +
Sbjct: 259 RI-WMC----------GGSLEIIPCSKVGHVFRESQPYKIGEGAIDRNNMRLAEVWMDDY 307
Query: 179 -----NLELSFKG-DFGDVTSRKELRRNLGCKSFKWYL------------------EVSN 214
+ KG D+GDV+ RK LR L CKSFKWYL E+ N
Sbjct: 308 KKIFYAMRPQLKGKDYGDVSGRKALRERLMCKSFKWYLDNVISELAIPDLYPIGRGEIRN 367
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQ 246
+ C+D+ K +P G+Y CH G NQ
Sbjct: 368 LGTNTCLDTLAKNEAGGEP-GMYMCHGMGNNQ 398
>gi|60498976|ref|NP_078848.2| polypeptide N-acetylgalactosaminyltransferase 14 isoform 1 [Homo
sapiens]
gi|51316071|sp|Q96FL9.1|GLT14_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 14;
AltName: Full=Polypeptide GalNAc transferase 14;
Short=GalNAc-T14; Short=pp-GaNTase 14; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 14;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 14
gi|14714999|gb|AAH10659.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14) [Homo
sapiens]
gi|21749654|dbj|BAC03634.1| unnamed protein product [Homo sapiens]
gi|28268674|dbj|BAC56889.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 [Homo sapiens]
gi|37182635|gb|AAQ89118.1| RRLT2434 [Homo sapiens]
gi|119620891|gb|EAX00486.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
isoform CRA_a [Homo sapiens]
gi|325463357|gb|ADZ15449.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14)
[synthetic construct]
gi|345500006|emb|CAA70505.4| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 14 [Homo
sapiens]
Length = 552
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKESSIQ 420
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 421 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 480
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 481 TLFPGAPVVLVLCKNGDDRQ 500
>gi|417403257|gb|JAA48441.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 608
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 109/348 (31%), Positives = 149/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNTMWLQPLLATIQEDRRTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP E A P+ +PTMAGGLF++++ +F++LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLIPPSELGGPGGATAPIKSPTMAGGLFAMNRDYFDELGRYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGRDTMA 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELRR LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLRTRSYGNISERVELRRKLGCKSFKWYLDNIYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + +G C+ + +P+ V L C
Sbjct: 457 ISGPNAKPQQPLFINRGPKRPKVLQRGRLYHLQTGKCLVAQGRPSQKGGLVVLKACDYSD 516
Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ + +H + + CLD + L CHGS G+Q + +
Sbjct: 517 PNQVWIYNEEHELVLSNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|170038569|ref|XP_001847121.1| N-acetyl galactosaminyl transferase [Culex quinquefasciatus]
gi|167882320|gb|EDS45703.1| N-acetyl galactosaminyl transferase [Culex quinquefasciatus]
Length = 541
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 102/308 (33%), Positives = 145/308 (47%), Gaps = 39/308 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WLQP LD +AR+ + + P I I D L F + Y G DW Q
Sbjct: 202 CECTLGWLQPQLDRVARDPTTIAVPTIDWI--DEHNLAFVSNKSLGYY----GATDWGFQ 255
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +R + + N EP TP MAGGLFSI++ FF LG YD GF+I+GGEN+ELS
Sbjct: 256 FAWRGRWDR-KVQPANKLEPFPTPIMAGGLFSINRTFFGHLGWYDEGFEIYGGENVELSL 314
Query: 124 KF-----NWHAIPERERKRHKNAAEPVW----TPTMAGGLFSIDKAFFEKLGTYDSGFDI 174
K +P + A P T + + + + ++ +D+
Sbjct: 315 KAWMCGGRIETVPCSRVGHVQKAGHPYLRVETTDWVRINTVRVAEVWLDQYA--QVVYDM 372
Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSGMCIDSA 224
+GG F+G+FGDV+SRK+LR +L C SF+WYL E G I+ A
Sbjct: 373 FGGPQ----FRGNFGDVSSRKKLRESLKCHSFRWYLDNVFPELDDPEGRGVGHGEVINLA 428
Query: 225 CKPTD-MHKPV-----GLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGS 278
T + P GL C+ Q W+ + GE+ + C+D+ G + +Y CH
Sbjct: 429 AGATRCLQYPTAEGTFGLERCNGD-SRQHWVYNMLGELSTNNTCVDFTGTALAMYKCHKM 487
Query: 279 KGNQYFEY 286
+GNQ + Y
Sbjct: 488 RGNQEWRY 495
>gi|332227139|ref|XP_003262748.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
1 [Nomascus leucogenys]
Length = 552
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 420
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 421 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 480
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 481 TLFPGAPVVLVLCKNGDDRQ 500
>gi|449268007|gb|EMC78887.1| Polypeptide N-acetylgalactosaminyltransferase 14, partial [Columba
livia]
Length = 514
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 104/322 (32%), Positives = 142/322 (44%), Gaps = 65/322 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WL PLL + + + VVSP+I I DTF ++ GGFDW+L
Sbjct: 163 CEVNKDWLLPLLQRIKEDPTRVVSPVIDIINLDTFAY-------VAASSDLRGGFDWSLH 215
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ + + EP+ TP +AGGLF IDKA+F LG YDS DIWGGEN E+SF
Sbjct: 216 FKWEQLSPEQKAKRLDPTEPIKTPIIAGGLFMIDKAWFNHLGKYDSAMDIWGGENFEISF 275
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + IP RK+H P P + + +
Sbjct: 276 RVWMCGGSLEIIPCSRVGHVFRKKH-----PYVFPEGNANTY---------IKNTKRTAE 321
Query: 174 IWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYL----------EVSNDW 216
+W E + +G+V SR ELR+ L C SFKWYL E S
Sbjct: 322 VWMDEFKRYYYAARPAAQGRPYGNVQSRVELRKRLKCHSFKWYLENVYPELRIPEESLYQ 381
Query: 217 SGM------CIDSACKPTDMHKPV-GLYPCHKQGGN-----QFWMMSKHGEIRRDEACLD 264
+GM C++S K D PV L PC G Q W + + ++R+ + CL
Sbjct: 382 TGMIRQRQSCLESH-KSEDQEFPVLSLNPCTGSKGTTAATAQEWTYTYNHQVRQQQLCLS 440
Query: 265 ----YAGGDVILYPCHGSKGNQ 282
+ G V+L PC Q
Sbjct: 441 VYTLFPGSQVLLSPCKEGDNKQ 462
>gi|297265736|ref|XP_002799240.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Macaca
mulatta]
Length = 517
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 167 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 219
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 220 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 279
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 280 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 325
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 326 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 385
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 386 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 445
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 446 TLFPGAPVVLVLCKNGDDRQ 465
>gi|397513817|ref|XP_003827204.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
3 [Pan paniscus]
Length = 517
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 167 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 219
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 220 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 279
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 280 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 325
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 326 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 385
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 386 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 445
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 446 TLFPGAPVVLVLCKNGDDRQ 465
>gi|109102562|ref|XP_001105195.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
5 [Macaca mulatta]
Length = 552
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 420
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 421 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 480
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 481 TLFPGAPVVLVLCKNGDDRQ 500
>gi|397513815|ref|XP_003827203.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Pan paniscus]
Length = 532
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 182 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 234
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 235 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 294
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 295 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 340
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 341 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 400
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 401 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 460
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 461 TLFPGAPVVLVLCKNGDDRQ 480
>gi|332227141|ref|XP_003262749.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Nomascus leucogenys]
Length = 532
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 182 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 234
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 235 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 294
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 295 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 340
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 341 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 400
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 401 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 460
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 461 TLFPGAPVVLVLCKNGDDRQ 480
>gi|397513813|ref|XP_003827202.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
1 [Pan paniscus]
Length = 552
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 420
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 421 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 480
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 481 TLFPGAPVVLVLCKNGDDRQ 500
>gi|297265738|ref|XP_001104879.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Macaca mulatta]
Length = 532
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 182 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 234
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 235 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 294
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 295 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 340
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 341 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 400
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 401 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 460
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 461 TLFPGAPVVLVLCKNGDDRQ 480
>gi|62630154|gb|AAX88899.1| unknown [Homo sapiens]
Length = 452
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 102 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 154
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 155 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 214
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 215 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 260
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 261 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKESSIQ 320
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 321 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 380
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 381 TLFPGAPVVLVLCKNGDDRQ 400
>gi|359465585|ref|NP_001240756.1| polypeptide N-acetylgalactosaminyltransferase 14 isoform 3 [Homo
sapiens]
gi|119620894|gb|EAX00489.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
isoform CRA_d [Homo sapiens]
gi|193783719|dbj|BAG53701.1| unnamed protein product [Homo sapiens]
Length = 532
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 182 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 234
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 235 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 294
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 295 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 340
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 341 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKESSIQ 400
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 401 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 460
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 461 TLFPGAPVVLVLCKNGDDRQ 480
>gi|355565588|gb|EHH22017.1| hypothetical protein EGK_05198 [Macaca mulatta]
Length = 557
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 96/315 (30%), Positives = 138/315 (43%), Gaps = 52/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 207 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 259
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 260 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 319
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ + +P P P + + ++W E
Sbjct: 320 RVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTY---------IKNTKRTAEVWMDE 370
Query: 179 NLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS------- 217
+ + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 371 YKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQKGNIR 430
Query: 218 --GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAG 267
C++S + + L PC K G +Q W + +I ++E CL + G
Sbjct: 431 QRQKCLESQRQNNQETANLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVITLFPG 490
Query: 268 GDVILYPCHGSKGNQ 282
V+L C Q
Sbjct: 491 APVVLVLCKNGDDRQ 505
>gi|109102570|ref|XP_001104659.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
1 [Macaca mulatta]
Length = 557
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 96/315 (30%), Positives = 138/315 (43%), Gaps = 52/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 207 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 259
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 260 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 319
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ + +P P P + + ++W E
Sbjct: 320 RVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTY---------IKNTKRTAEVWMDE 370
Query: 179 NLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS------- 217
+ + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 371 YKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQKGNIR 430
Query: 218 --GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAG 267
C++S + + L PC K G +Q W + +I ++E CL + G
Sbjct: 431 QRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVITLFPG 490
Query: 268 GDVILYPCHGSKGNQ 282
V+L C Q
Sbjct: 491 APVVLVLCKNGDDRQ 505
>gi|359465583|ref|NP_001240755.1| polypeptide N-acetylgalactosaminyltransferase 14 isoform 2 [Homo
sapiens]
gi|10434341|dbj|BAB14227.1| unnamed protein product [Homo sapiens]
gi|119620892|gb|EAX00487.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
isoform CRA_b [Homo sapiens]
Length = 557
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 96/315 (30%), Positives = 138/315 (43%), Gaps = 52/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 207 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 259
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 260 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 319
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ + +P P P + + ++W E
Sbjct: 320 RVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTY---------IKNTKRTAEVWMDE 370
Query: 179 NLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS------- 217
+ + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 371 YKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKESSIQKGNIR 430
Query: 218 --GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAG 267
C++S + + L PC K G +Q W + +I ++E CL + G
Sbjct: 431 QRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVITLFPG 490
Query: 268 GDVILYPCHGSKGNQ 282
V+L C Q
Sbjct: 491 APVVLVLCKNGDDRQ 505
>gi|71896287|ref|NP_001025547.1| polypeptide N-acetylgalactosaminyltransferase 1 [Xenopus (Silurana)
tropicalis]
gi|60649677|gb|AAH90583.1| galnt1 protein [Xenopus (Silurana) tropicalis]
Length = 452
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 86/219 (39%), Positives = 113/219 (51%), Gaps = 25/219 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R + + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRRGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ K D+GD+++R LR L CK F WYLE
Sbjct: 378 KNFFYIISPGVTKVDYGDISTRVGLRHKLQCKPFSWYLE 416
>gi|441661684|ref|XP_004091530.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Nomascus leucogenys]
Length = 535
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 185 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 237
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 238 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 297
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 298 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 343
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 344 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 403
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 404 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 463
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 464 TLFPGAPVVLVLCKNGDDRQ 483
>gi|119620893|gb|EAX00488.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
isoform CRA_c [Homo sapiens]
Length = 519
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 169 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 221
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 222 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 281
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 282 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 327
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 328 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKESSIQ 387
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 388 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 447
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 448 TLFPGAPVVLVLCKNGDDRQ 467
>gi|397513819|ref|XP_003827205.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
4 [Pan paniscus]
Length = 557
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 96/315 (30%), Positives = 138/315 (43%), Gaps = 52/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 207 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 259
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 260 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 319
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ + +P P P + + ++W E
Sbjct: 320 RVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTY---------IKNTKRTAEVWMDE 370
Query: 179 NLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS------- 217
+ + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 371 YKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQKGNIR 430
Query: 218 --GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAG 267
C++S + + L PC K G +Q W + +I ++E CL + G
Sbjct: 431 QRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVITLFPG 490
Query: 268 GDVILYPCHGSKGNQ 282
V+L C Q
Sbjct: 491 APVVLVLCKNGDDRQ 505
>gi|296224175|ref|XP_002757934.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Callithrix jacchus]
Length = 552
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 420
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 421 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 480
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 481 TLFPGAPVVLALCKNGDDRQ 500
>gi|403307061|ref|XP_003944030.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Saimiri boliviensis boliviensis]
Length = 552
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 143/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 255 FHWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 420
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 421 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 480
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 481 TLFPGAPVVLALCKNGDDRQ 500
>gi|355751232|gb|EHH55487.1| hypothetical protein EGM_04701, partial [Macaca fascicularis]
Length = 516
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 96/315 (30%), Positives = 138/315 (43%), Gaps = 52/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 166 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 218
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 219 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 278
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ + +P P P + + ++W E
Sbjct: 279 RVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTY---------IKNTKRTAEVWMDE 329
Query: 179 NLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS------- 217
+ + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 330 YKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQKGNIR 389
Query: 218 --GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAG 267
C++S + + L PC K G +Q W + +I ++E CL + G
Sbjct: 390 QRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVITLFPG 449
Query: 268 GDVILYPCHGSKGNQ 282
V+L C Q
Sbjct: 450 APVVLVLCKNGDDRQ 464
>gi|221042368|dbj|BAH12861.1| unnamed protein product [Homo sapiens]
Length = 517
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 167 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 219
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 220 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 279
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 280 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 325
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 326 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKESSIQ 385
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 386 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQKILQEELCLSVI 445
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 446 TLFPGAPVVLVLCKNGDDRQ 465
>gi|113931290|ref|NP_001039091.1| polypeptide N-acetylgalactosaminyltransferase-like 1 [Xenopus
(Silurana) tropicalis]
gi|89268082|emb|CAJ83416.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Xenopus
(Silurana) tropicalis]
gi|111305589|gb|AAI21348.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Xenopus
(Silurana) tropicalis]
gi|134026192|gb|AAI35810.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Xenopus
(Silurana) tropicalis]
Length = 562
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 98/319 (30%), Positives = 141/319 (44%), Gaps = 62/319 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + + VVSP+I I D F L GGFDW+L
Sbjct: 221 CEVNNEWLQPLLQRVKDDHTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 273
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + + TP +AGG+F IDK++F +LG YD+ DIWGGEN ELSF
Sbjct: 274 FKWEQIPIEQKMSRTDPTSSIRTPVIAGGIFVIDKSWFNQLGKYDTQMDIWGGENFELSF 333
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P D + +
Sbjct: 334 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYEFP---------DGNALTYIKNTKRTVE 379
Query: 174 IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-------------VS 213
+W E + ++ +G V R ELR+ L CKSF+WYL+ +S
Sbjct: 380 VWMDEYKQYYYQARPSAIGKSYGSVADRVELRKKLSCKSFQWYLQNVYPELKIPEKEVIS 439
Query: 214 N--DWSGMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA- 266
G C++S + T + PV L C + Q W +S++ I++ + CL +
Sbjct: 440 GLIKQGGNCMESQTRDTTGNIPVMLTQCKGSANSAPAAQEWALSENV-IKQQDRCLTISS 498
Query: 267 ---GGDVILYPCHGSKGNQ 282
G V+L PC+ Q
Sbjct: 499 FSTGALVMLEPCNQKDSRQ 517
>gi|426223372|ref|XP_004005849.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Ovis
aries]
Length = 552
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 100/315 (31%), Positives = 145/315 (46%), Gaps = 49/315 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFNY-------IESASELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF +DK++F LG YD+ DIWGGEN E+SF
Sbjct: 255 FQWEQLTPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFYYLGKYDTDMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + IP RK+H P P G + K + +
Sbjct: 315 RVWMCGGSLEIIPCSRVGHVFRKKH-----PYVFPD--GNANTYIKNTKRTAEVWMDEYK 367
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS--------- 217
+ + + + FG++ SR LR+NL C+SFKWYLE V D S
Sbjct: 368 QYYYASRPFALERPFGNIESRLNLRKNLQCQSFKWYLENVYPELRVPKDSSIHKGSIRQR 427
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAGGD 269
C+++ + + L PC K G +Q W + +I ++E CL + G
Sbjct: 428 QKCLEAQKQKDQEISSLKLSPCVKTEGKDAKSQIWAFTYTQQILQEELCLSVITLFPGAP 487
Query: 270 VILYPC-HGSKGNQY 283
V+L C +G K Q+
Sbjct: 488 VVLVLCKNGDKRQQW 502
>gi|291386971|ref|XP_002709979.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Oryctolagus cuniculus]
Length = 551
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 102/313 (32%), Positives = 138/313 (44%), Gaps = 48/313 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 201 CEVNKDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDWSLH 253
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD+ DIWGGEN E+SF
Sbjct: 254 FHWEQLSPEQKARRLDPTEPIRTPVIAGGLFVIDKAWFDYLGKYDTDMDIWGGENFEISF 313
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + IP RK+H A T T + + + Y
Sbjct: 314 RVWMCRGSLEIIPCSRVGHVFRKKHPYAFPNGNTNTYIKNTKRTAEVWMDDYKQYYYAAR 373
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSGM------- 219
+ E FG++ SR LR NL C+ FKWYLE + D S +
Sbjct: 374 PFALER-------PFGNIRSRVMLRANLQCQDFKWYLENVYPELRIPKDSSILKGSIRQR 426
Query: 220 --CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLD----YAGGD 269
C+ S + + L PC K G Q W + +I ++E CL + G
Sbjct: 427 HKCLASQKQNNQGSPNLKLRPCVKFKGEESKAQVWAFTYTQQIIQEELCLSVVTLFPGAP 486
Query: 270 VILYPCHGSKGNQ 282
VIL C Q
Sbjct: 487 VILAVCKNGDEKQ 499
>gi|307198758|gb|EFN79561.1| Polypeptide N-acetylgalactosaminyltransferase 35A [Harpegnathos
saltator]
Length = 606
Score = 137 bits (344), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 93/302 (30%), Positives = 144/302 (47%), Gaps = 38/302 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV + W++PLL +A + + V P+I I DTF+ P GGF+W L
Sbjct: 234 IEVNEVWIEPLLSRIAHSKTIVAMPVIDIINADTFQYTGSP--------LVRGGFNWGLH 285
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P K+ + +P+ +PTMAGGLF+ID+ +F K+G YD+G D+WGGENLE+SF
Sbjct: 286 FKWDNLPIGTLKQEDDFVKPIKSPTMAGGLFAIDREYFTKIGEYDTGMDVWGGENLEISF 345
Query: 124 KF-----NWHAIPER------ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
+ N IP R+R + +P TM + + ++ Y
Sbjct: 346 RIWMCGGNIELIPCSRVGHVFRRRRPYGSDDP--QDTMLKNSLRVAHVWLDEYKDY---- 399
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDM-- 230
L K DFGD++ R+ LR+ L CK+F WYL+V + D+ + D
Sbjct: 400 ------FLRNVRKIDFGDISERQALRQRLKCKTFGWYLKVVYPELTLPDDTERRLKDKWS 453
Query: 231 ---HKPVGLYPCHKQG-GNQFWM-MSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
+PV + K+ +Q+ + +S + E + G +IL PC K ++E
Sbjct: 454 KLDQRPVQPWHSRKRNYTDQYQIRLSNSALCIQSEKDIKTKGSRLILMPCLRIKSQMWYE 513
Query: 286 YD 287
D
Sbjct: 514 TD 515
>gi|402890489|ref|XP_003908519.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Papio anubis]
Length = 551
Score = 137 bits (344), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 201 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 253
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 254 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 313
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 314 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 359
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG+V SR +LR+NL C+SFKWYLE + + S
Sbjct: 360 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 419
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 420 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 479
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 480 TLFPGAPVVLVLCKNGDDRQ 499
>gi|363731300|ref|XP_419370.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Gallus
gallus]
Length = 552
Score = 137 bits (344), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 100/321 (31%), Positives = 145/321 (45%), Gaps = 64/321 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WL PLL + + + VVSP+I I DTF ++ GGFDW+L
Sbjct: 202 CEVNKDWLLPLLQRIKEDPTRVVSPVIDIINLDTFAY-------VAASSDLRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ + + +P+ TP +AGGLF IDKA+F LG YD+ DIWGGEN E+SF
Sbjct: 255 FKWEQLSPEQKAKRLDPTKPIKTPIIAGGLFVIDKAWFNHLGKYDNAMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + IP RK+H P P + + +
Sbjct: 315 RVWMCGGSLEIIPCSRVGHVFRKKH-----PYVFPEGNANTY---------IKNTKRTAE 360
Query: 174 IWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSNDW--- 216
+W E + + +G++ SR ELR+ L C SFKWYLE + +
Sbjct: 361 VWMDEFKQYYYAARPAAQGRPYGNIQSRVELRKRLKCHSFKWYLENVYPELRIPEELLYQ 420
Query: 217 SGM------CIDSACKPTDMHKPV-GLYPCHKQGGN----QFWMMSKHGEIRRDEACLD- 264
+GM C++S K D P+ L PC G Q W + + ++R+ + CL
Sbjct: 421 TGMIRQRQSCLESH-KSEDQELPILSLNPCITSKGTSATAQEWTYTYNHQVRQQQLCLSV 479
Query: 265 ---YAGGDVILYPCHGSKGNQ 282
+ G V+L PC S Q
Sbjct: 480 YTLFPGSPVLLSPCKESDNKQ 500
>gi|194210168|ref|XP_001915003.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Equus
caballus]
Length = 609
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 106/351 (30%), Positives = 148/351 (42%), Gaps = 96/351 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL V+ + VV P+I I DT SS GGF+W L
Sbjct: 249 CEVNVMWLQPLLAVIQEDRRMVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 300
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++ + +F +LG YDSG DIWGGENLE+SF
Sbjct: 301 FKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMSRRYFSELGQYDSGMDIWGGENLEISF 360
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 361 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 397
Query: 177 GENLELSF------------------KGDFGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L++ +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 398 HNSLRLAYVWLDEYKEQYFSLRPDLRTKSYGNISERVELRKKLGCKSFKWYLDNIYPEMQ 457
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 458 ISGPNAKPQQPIFINRGPKRPKVLQRGRLCHLQTNKCLVAQSRPSQKGSLVVLKACDYGD 517
Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEYDYK 289
NQ W+ + +H + + CLD + L CHGS G+Q + + K
Sbjct: 518 PNQVWIYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTFGKK 568
>gi|440907821|gb|ELR57918.1| Polypeptide N-acetylgalactosaminyltransferase 14, partial [Bos
grunniens mutus]
Length = 509
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 99/315 (31%), Positives = 145/315 (46%), Gaps = 49/315 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 159 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFNY-------IESASELRGGFDWSLH 211
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF +DK++F LG YD+ DIWGGEN E+SF
Sbjct: 212 FQWEQLTPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFYYLGKYDTDMDIWGGENFEISF 271
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P G + K + +
Sbjct: 272 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYIFPD--GNANTYIKNTKRTAEVWMDEYK 324
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS--------- 217
+ + + + FG++ SR LR+NL C+SFKWYLE V D S
Sbjct: 325 QYYYASRPFALERPFGNIESRLNLRKNLQCQSFKWYLENVYPELRVPKDSSIHKGSIRQR 384
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAGGD 269
C+++ + + L PC K G +Q W + +I ++E CL + G
Sbjct: 385 QKCLEAQKQKDQEISNLKLSPCVKTEGKDAKSQIWAFTYTQQILQEELCLSVITLFPGAP 444
Query: 270 VILYPC-HGSKGNQY 283
V+L C +G K Q+
Sbjct: 445 VVLVLCKNGDKRQQW 459
>gi|268572569|ref|XP_002641355.1| C. briggsae CBR-GLY-9 protein [Caenorhabditis briggsae]
Length = 579
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 93/316 (29%), Positives = 145/316 (45%), Gaps = 51/316 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+P++ ++ + +V P+I +I D T + +GGF W L
Sbjct: 230 CEANHGWLEPIVQRISDERTAIVCPMIDSISDSTLAYH-------GDWSLSVGGFSWALH 282
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P+ E KR + + +PTMAGGL + ++ +F ++G YD DIWGGENLE+SF
Sbjct: 283 FTWEGLPDEELKRRTKVTDYIRSPTMAGGLLAANREYFFEVGGYDEEMDIWGGENLEISF 342
Query: 124 KFNWHA------IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
+ NW IP A P + T + ++L ++W
Sbjct: 343 R-NWMCGGSIEFIPCSHVGHIFRAGHP-YNMTGRNNNKDVHGTNSKRLA------EVWMD 394
Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE------------------V 212
+ L + D GD+T+R ELR+ L CKSFKW+L+ +
Sbjct: 395 DYKRLYYMHREDLRTKDVGDLTARHELRKRLNCKSFKWFLDNIAKGKFIMDEDVLAYGAL 454
Query: 213 SNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGN-QFWMMSKHGEIRRDEACLDYAGGD 269
SG MC D+ + M + +G++ C +G + Q +SK G +RR+ C G+
Sbjct: 455 HTVVSGTRMCTDTLQRDEKMSQLLGVFHCQGKGSSPQLMSLSKEGYLRRENTCAAEENGN 514
Query: 270 VILYPCHGSKGNQYFE 285
V + C SK Q+ E
Sbjct: 515 VRMKAC--SKRAQFNE 528
>gi|221042448|dbj|BAH12901.1| unnamed protein product [Homo sapiens]
Length = 527
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/348 (30%), Positives = 149/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 167 CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 218
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E R + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 219 FKWDLVPLSELGRAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 278
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 279 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 315
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 316 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 375
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 376 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 435
Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ ++ E+ + CLD + L CHGS G+Q + +
Sbjct: 436 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 483
>gi|156375693|ref|XP_001630214.1| predicted protein [Nematostella vectensis]
gi|156217230|gb|EDO38151.1| predicted protein [Nematostella vectensis]
Length = 575
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 100/315 (31%), Positives = 143/315 (45%), Gaps = 46/315 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE K WL+PLL + + +VSP+I I DTF+ L SS GGF WNL
Sbjct: 231 CECNKNWLEPLLLRIKESPKTIVSPIIDVINLDTFDY------LGSSADLR-GGFGWNLN 283
Query: 64 FNWHAIPER-ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W +P +R P+ +P +AGGLFS+ K +FE LG YD D+WGGENLE+S
Sbjct: 284 FKWDFLPPHILAERQGKPTLPIKSPVIAGGLFSVAKKWFETLGKYDMQMDVWGGENLEIS 343
Query: 123 FKFNWHA------IPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
F+ W IP R RH P P G + K + +
Sbjct: 344 FR-TWQCGGAMEIIPCSRVGHVFRNRH-----PYQFP--GGSMNVFQKNTRRAVEVWMDD 395
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------EVSNDWSGM 219
+ + + + +GD+ R ELRR L C+ FKWY+ E + + +
Sbjct: 396 YKRYYYAAVPYAKNTPYGDIEERVELRRKLRCRPFKWYVQNVYPELKLPSDESTKSFGEI 455
Query: 220 CIDSACKPTDMH---KPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD----VIL 272
+ C T H + +GL+ CH GGNQ W ++K ++ + CL G V L
Sbjct: 456 KQGNQCVDTLGHMRGQTIGLFECHGAGGNQMWSLTKSSLLKHETMCLGVNDGKATEPVQL 515
Query: 273 YPCHGSKGNQYFEYD 287
C + Q++EY+
Sbjct: 516 LDCDENNSMQHWEYE 530
>gi|351702714|gb|EHB05633.1| Polypeptide N-acetylgalactosaminyltransferase 14 [Heterocephalus
glaber]
Length = 553
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 100/320 (31%), Positives = 140/320 (43%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL+PLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 203 CEVNRDWLEPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 255
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 256 FRWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 315
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 316 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 361
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG++ SR LRRNL C+SFKWYLE V D S
Sbjct: 362 VWMDEYKQYYYAARPFALERPFGNIESRLNLRRNLQCQSFKWYLENVYPELSVPQDSSIQ 421
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G Q W + +I ++E CL
Sbjct: 422 KGNIRQRQKCLESQKQNNQEIPNLRLSPCVKLKGEEAKAQGWAFTYTQQIIQEELCLSVV 481
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 482 TLFPGAPVVLVLCKNGDERQ 501
>gi|193784963|dbj|BAG54116.1| unnamed protein product [Homo sapiens]
Length = 608
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/348 (30%), Positives = 149/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E R + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGRAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 457 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516
Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ ++ E+ + CLD + L CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|340727930|ref|XP_003402286.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Bombus terrestris]
Length = 643
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 91/291 (31%), Positives = 135/291 (46%), Gaps = 16/291 (5%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV KRW++PLL +A++ + V P+I I DTF+ P GGF+W L
Sbjct: 271 IEVNKRWIEPLLSQIAQSKTIVAMPIIDIINPDTFQYTGSP--------LVRGGFNWGLH 322
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P ++ +P+ +PTMAGGLF++D+ +F KLG YD+G DIWGGENLE+SF
Sbjct: 323 FKWDNVPVGTFAHDEDFIKPIKSPTMAGGLFAMDRKYFTKLGEYDAGMDIWGGENLEISF 382
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
+ W E G D L D + L+
Sbjct: 383 RI-WMCGGSIELIPCSRVGHVFRRRRPYGTFDQHDTMLKNSLRVAHVWLDEYKDYFLKNV 441
Query: 184 FKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDM-----HKPVGLYP 238
K D+GD++ R LR+ L CK+F WYL V + D+ + D KP+ +
Sbjct: 442 QKVDYGDISERLNLRKRLKCKNFAWYLNVVYPELALPDDNKNRLKDKWAKIEQKPIQPWH 501
Query: 239 CHKQG-GNQFWM-MSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEYD 287
K+ +Q+ + +S + E + G +IL PC K ++E D
Sbjct: 502 SRKRNYTDQYQIRLSNSALCIQSEKDIKTKGSKLILAPCLRIKSQMWYETD 552
>gi|153792095|ref|NP_071370.2| polypeptide N-acetylgalactosaminyltransferase 11 [Homo sapiens]
gi|51316030|sp|Q8NCW6.2|GLT11_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
AltName: Full=Polypeptide GalNAc transferase 11;
Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 11;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 11
gi|5630076|gb|AAD45821.1|AC006017_1 N-acetylgalactosaminyltransferase; similar to Q10473 (PID:g1709559)
[Homo sapiens]
gi|51105934|gb|EAL24518.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Homo
sapiens]
gi|119574361|gb|EAW53976.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
isoform CRA_b [Homo sapiens]
gi|189442406|gb|AAI67834.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[synthetic construct]
gi|345500003|emb|CAC79625.3| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 608
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/348 (30%), Positives = 149/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E R + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGRAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 457 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516
Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ ++ E+ + CLD + L CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|351712481|gb|EHB15400.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Heterocephalus
glaber]
Length = 399
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 87/219 (39%), Positives = 112/219 (51%), Gaps = 25/219 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + ++ VV P+I I DDTFE + GGF+W L
Sbjct: 191 CECTVGWLEPLLTRIKQDRRTVVCPIIDVISDDTFEC-------MAGSDMTYGGFNWKLN 243
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGG FSID+ +F+++GTYD+G DIWG ENLE+S
Sbjct: 244 FRWYLVPQREMDRRKGDRTLPVRTPTMAGGCFSIDRDYFQEIGTYDAGMDIWGRENLEIS 303
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
F+ W E H TP T GG I +L ++W E
Sbjct: 304 FRI-WQCGGTLEIVTCSHVGHVFQKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 356
Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ K D+GDV+SR LR L CK F WYLE
Sbjct: 357 KNFFYIISPGVTKVDYGDVSSRLGLRHKLQCKPFSWYLE 395
>gi|10437774|dbj|BAB15105.1| unnamed protein product [Homo sapiens]
Length = 608
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 107/348 (30%), Positives = 149/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E R + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGRAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 457 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516
Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ ++ E+ + CLD + L CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLVKCHGSGGSQQWTF 564
>gi|260793003|ref|XP_002591503.1| hypothetical protein BRAFLDRAFT_105269 [Branchiostoma floridae]
gi|229276709|gb|EEN47514.1| hypothetical protein BRAFLDRAFT_105269 [Branchiostoma floridae]
Length = 618
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/181 (38%), Positives = 105/181 (58%), Gaps = 24/181 (13%)
Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 182
F W IP+ ER R K+ +PV +PTMAGGLF+IDK +FE +GTYD+G D+WGGENLE+
Sbjct: 385 LTFTWGLIPDYERSRRKSPVDPVRSPTMAGGLFAIDKWYFEHIGTYDAGMDVWGGENLEM 444
Query: 183 SFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGMCIDSA 224
SF+ +GDV++R +L+ L CK FKW+++ + N S +C DS
Sbjct: 445 SFRERYGDVSARLDLKDKLHCKPFKWFMQTIMPDMYVPEDRPGRSGALRNSASNLCFDSE 504
Query: 225 CKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD----EACLDYAGGDVILYPCHGSKG 280
+P ++ CH GGNQ++ ++ E R + E C++ GG+ ++ H + G
Sbjct: 505 GAENAGKRPT-MWGCHGMGGNQYFELNSREEFRHNTGGKEMCVEAQGGEFVVL-MHCTSG 562
Query: 281 N 281
N
Sbjct: 563 N 563
>gi|391347961|ref|XP_003748222.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Metaseiulus occidentalis]
Length = 658
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 89/294 (30%), Positives = 137/294 (46%), Gaps = 52/294 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + R+ VV P+I I T + G +F IGGF+W +
Sbjct: 300 CETTPGWLEPLLEPIRRDRRAVVCPVIDVIDYRTLQYVAAEGD-----RFQIGGFNWRGE 354
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH IP R+ + AEP+ +PTMAGGLF+I++ +F + G+YD D WGGENLE+SF
Sbjct: 355 FTWHNIPSAWRRNRVSVAEPMRSPTMAGGLFAINREYFWESGSYDEEMDGWGGENLEMSF 414
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS-------GFDIWG 176
+ W V P G D ++ G D+ ++W
Sbjct: 415 RI-WQC-----------GGHIVIAPCSHVGHIFRDYQPYKIPGGKDTNAINTKRAVEVWM 462
Query: 177 GENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
E + ++ GD+++R+ R CK FKWYL+
Sbjct: 463 DEFKKYIYQARPELKKIRIGDISARRAFRELNRCKPFKWYLDNVYPHKYLIEEDSQGFGI 522
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCH---KQGGNQFWMMSKHGEIRRDEAC 262
V N + MC+D+ K +G++ CH ++ NQ +S+ GE+R+++ C
Sbjct: 523 VRNPLTNMCLDTYGKARGKTSDLGIFECHPIPEEATNQLLSLSRKGELRQEDLC 576
>gi|297682043|ref|XP_002818744.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11,
partial [Pongo abelii]
Length = 587
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 149/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELRGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRSDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 457 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLCHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516
Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ ++ E+ + CLD + L CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|10438776|dbj|BAB15338.1| unnamed protein product [Homo sapiens]
Length = 379
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 101/333 (30%), Positives = 147/333 (44%), Gaps = 66/333 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 19 CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 70
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E R + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 71 FKWDLVPLSELGRAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 130
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RKR + P TM + + ++ + F
Sbjct: 131 RIWMCGGKLFIIPCSRVGHIFRKR-RPYGSPEGQDTMTHNSLRLAHVWLDEY--KEQYFS 187
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
+ +L K +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 188 L----RPDLKTK-SYGNISERVELRKKLGCKSFKWYLDNVYPEMQISGSHAKPQQPIFVN 242
Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRR 258
+ + + C+ + +P+ V L C NQ W+ ++ E+
Sbjct: 243 RGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSDPNQIWIYNEEHELVL 302
Query: 259 DE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
+ CLD + L CHGS G+Q + +
Sbjct: 303 NSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 335
>gi|410909548|ref|XP_003968252.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Takifugu rubripes]
Length = 580
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/328 (30%), Positives = 143/328 (43%), Gaps = 65/328 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL+PLL + + VV P+I I DT L + P + GGF+W L
Sbjct: 221 CEVNQMWLEPLLASIHEDRRTVVCPVIDIISADT--LSYSPSPIVR------GGFNWGLH 272
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E K K +P+ +PTMAGGLF+I++ +F ++G YD+G DIWGGENLE+SF
Sbjct: 273 FKWDPVPPSELKSPKGPVDPIRSPTMAGGLFAINRKYFNEMGQYDAGMDIWGGENLEISF 332
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RKR + P TMA + + ++ Y +
Sbjct: 333 RIWMCGGQLLIIPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWMDE---YKEQYL 388
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
E E D+GD++ R LR L C+SF+WYL+
Sbjct: 389 SMRPELRE----RDYGDISDRVALRERLQCRSFRWYLDNVYPEMQTVSNGNKHPPLFINK 444
Query: 212 ------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGE-IRR 258
+ N + C+ + + + V L PC Q Q W + G+ +
Sbjct: 445 DLKRPKVLQRGRLHNRATNRCLVAQGRASQKGGAVVLRPCDPQDPEQEWAYDEEGQLVLA 504
Query: 259 DEACLDYAGGDVI----LYPCHGSKGNQ 282
CLD + L CHGS G+Q
Sbjct: 505 GLLCLDVSEVRTFDPPRLMKCHGSGGSQ 532
>gi|270008661|gb|EFA05109.1| hypothetical protein TcasGA2_TC015209 [Tribolium castaneum]
Length = 565
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 105/324 (32%), Positives = 150/324 (46%), Gaps = 74/324 (22%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE W++PLL + + + V+ P+I I +T L + TS + +GGF W+
Sbjct: 218 CEATTDWMEPLLSRIEQEPTAVLVPIIDVIEANT--LAYSTNGDTS---YQVGGFSWSGH 272
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W I + E +HK PV +PTMAGGLF+ID+ FF ++G+YD D WGGENLE+SF
Sbjct: 273 FTWIDI-QNEEDKHK--LTPVKSPTMAGGLFAIDRKFFWEIGSYDEQMDGWGGENLEMSF 329
Query: 124 K----------------------FNWHAIPERERKRHKNAAE--PVWTPTMAGGLFSIDK 159
+ F+ ++ P+ + N A VW F
Sbjct: 330 RIWQCGGRLETVPCSRVGHIFRDFHPYSFPDNKDTHGINTARLAHVWMDDYKRFFFMYQP 389
Query: 160 AFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------- 211
A EN + GD+T RK+LR+ L CKSFKWYLE
Sbjct: 390 AL----------------ENNPV-----VGDLTHRKQLRQKLRCKSFKWYLENVYPEKFI 428
Query: 212 ----------VSNDWSGMCIDSACKPTDMHKPVGLYPCHK-QGGNQFWMMSKHGEIRRDE 260
V ND+ GMC+D D P+GLY CH +Q++ ++ GE+R++
Sbjct: 429 PDENVYAHGQVQNDY-GMCLDDLQLGEDKIGPLGLYQCHPYLAMSQYFSLNFKGELRKEN 487
Query: 261 ACLDYAG-GDVILYPCHGSKGNQY 283
C + G +V L CHG K Q+
Sbjct: 488 FCAETFGVREVQLTECHGHKREQF 511
>gi|348574564|ref|XP_003473060.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Cavia porcellus]
Length = 552
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 135/315 (42%), Gaps = 52/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 255 FRWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ + +P P P + + ++W E
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTY---------IKNTKRTAEVWMDE 365
Query: 179 NLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS------- 217
+ + + FG++ SR LRRNL C SFKWYLE V D S
Sbjct: 366 YKQYYYAARPFALERPFGNIESRLNLRRNLQCHSFKWYLENVYPELSVPQDSSIQKGNIR 425
Query: 218 --GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAG 267
C++S + L PC K G +Q W + +I ++E CL + G
Sbjct: 426 QRQKCLESQKHNNQEIPNLRLSPCVKLKGEEAKSQGWAFTYTQQIIQEELCLSVITLFPG 485
Query: 268 GDVILYPCHGSKGNQ 282
V+L C Q
Sbjct: 486 APVVLVLCKNGDERQ 500
>gi|345483668|ref|XP_001601037.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Nasonia vitripennis]
Length = 587
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 98/288 (34%), Positives = 140/288 (48%), Gaps = 39/288 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
CEV K+WL+PLL + + VV+P+I NI ++TFE + FF +GGF W+
Sbjct: 228 CEVTKQWLEPLLQRIKEKKNAVVTPIIDNISEETFEYSH-----SDEPSFFQVGGFTWSG 282
Query: 63 QFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W I E + K +A PV +PTMAGGLF+I++ +F +G+YD + WGGENLE+S
Sbjct: 283 HFTWINIQEADLKSKTSAISPVKSPTMAGGLFAINRKYFWDIGSYDDKMEGWGGENLEMS 342
Query: 123 FKFNWH------AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
F+ W IP P P M I+ A + D ++
Sbjct: 343 FRI-WQCGGVLETIPCSRVGHVFRNFLPYKFP-MDKDTHGINTARLANVWM-DDYKRLYY 399
Query: 177 GENLELSFKGDF-GDVTSRKELRRNLGCKSFKWYLE------------------VSNDWS 217
E K + GD+ R LR L CKSFKWYL+ V
Sbjct: 400 LHREEYKDKPELIGDIKERVNLREKLKCKSFKWYLDNVYPEKFIPDENVQAFGRVQVQKG 459
Query: 218 GMCIDSACKPTDMHKP--VGLYPCHKQG-GNQFWMMSKHGEIRRDEAC 262
+C+D+ D KP +G+Y CH Q +Q++ +SK GE+RR++ C
Sbjct: 460 NLCLDNL--QNDEEKPYNLGVYECHSQLFPSQYFSLSKVGELRREDTC 505
>gi|432096766|gb|ELK27344.1| Polypeptide N-acetylgalactosaminyltransferase 14, partial [Myotis
davidii]
Length = 507
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 98/320 (30%), Positives = 141/320 (44%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 157 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFSY-------IESATELRGGFDWSLH 209
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ + + +EP+ TP +AGGLF +DK++F LG YD DIWGGEN E+SF
Sbjct: 210 FQWEQLSPEQKAQRLDPSEPIRTPIIAGGLFVMDKSWFNFLGKYDMDMDIWGGENFEMSF 269
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 270 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 315
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FGD+ SR +LRR L C+SFKWYLE V D S
Sbjct: 316 VWMDEYKQYFYAARPFALERPFGDIESRLDLRRKLRCQSFKWYLENVYPELRVPKDSSIQ 375
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 376 KGPIRQRQKCLESQRQKNQEVSNLKLRPCVKIKGEDAKSQIWAFTYTQQIIQEELCLSVI 435
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 436 TFFPGAPVVLVLCKNGDDKQ 455
>gi|300794826|ref|NP_001179661.1| polypeptide N-acetylgalactosaminyltransferase 14 [Bos taurus]
gi|296482443|tpg|DAA24558.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14) [Bos
taurus]
Length = 552
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/315 (31%), Positives = 144/315 (45%), Gaps = 49/315 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFNY-------IESASELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF +DK++F LG YD DIWGGEN E+SF
Sbjct: 255 FQWEQLTPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFYYLGKYDMDMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P G + K + +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYIFPD--GNANTYIKNTKRTAEVWMDEYK 367
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS--------- 217
+ + + + FG++ SR LR+NL C+SFKWYLE V D S
Sbjct: 368 QYYYASRPFALERPFGNIESRLNLRKNLQCQSFKWYLENVYPELRVPKDSSIHKGSIRQR 427
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAGGD 269
C+++ + + L PC K G +Q W + +I ++E CL + G
Sbjct: 428 QKCLEAQKQKDQEISNLKLSPCVKTEGKDAKSQIWAFTYTQQILQEELCLSVITLFPGAP 487
Query: 270 VILYPC-HGSKGNQY 283
V+L C +G K Q+
Sbjct: 488 VVLVLCKNGDKRQQW 502
>gi|344288741|ref|XP_003416105.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Loxodonta africana]
Length = 552
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 97/313 (30%), Positives = 140/313 (44%), Gaps = 48/313 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YDS DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDSEMDIWGGENFEMSF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + IP RK+H T T + + ++ Y
Sbjct: 315 RVWMCGGSLEIIPCSRVGHVFRKKHPYIFPDGNTNTYIKNTKRTAEVWMDEYKQYYYAAR 374
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGM--------- 219
+ E FG++ +R LR+NL C+SF+WYL E+S +
Sbjct: 375 PFALER-------PFGNIENRLSLRKNLQCESFQWYLKNVYPELSIPKDSLIQKGNIRQR 427
Query: 220 --CIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAGGD 269
C+++ + + L PC K G +Q W + +I ++E CL + G
Sbjct: 428 QKCLETQKRKNQEIPNLKLSPCIKIKGEEAKSQVWAFTYTQQILQEELCLSVITFFPGAP 487
Query: 270 VILYPCHGSKGNQ 282
V+L C Q
Sbjct: 488 VVLVLCKNGDDRQ 500
>gi|349605004|gb|AEQ00388.1| Polypeptide N-acetylgalactosaminyltransferase 3-like protein,
partial [Equus caballus]
Length = 337
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 94/282 (33%), Positives = 136/282 (48%), Gaps = 38/282 (13%)
Query: 25 VVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERERKRHKNAAEPV 84
VVSP IA+I +TFE P ++ + G FDW+L F W ++P+ ER+R K+ P+
Sbjct: 4 VVSPDIASIDMNTFEFNKPSPYRSNHNR---GNFDWSLSFGWESLPDHERQRRKDETYPI 60
Query: 85 WTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERK-------- 136
TPT AGGLFSI K +FE +GTYD +IWGGEN+E+SF+ W + E
Sbjct: 61 KTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFRV-WQCGGQLEIMPCSVVGHV 119
Query: 137 -RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSR 194
R K+ P T +A + + + Y F + ++ + FGD++ R
Sbjct: 120 FRSKSPHSFPKGTQVIARNQVRLAGEVW--MDEYKEIFYRRNTDAAKIVKQKSFGDLSKR 177
Query: 195 KELRRNLGCKSFKWYLE------------------VSNDWSGMCIDSACKPTDMHKPVGL 236
++ L CK+F WYL + + +C+D + KP+ L
Sbjct: 178 FAIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSFGQSLCLDVG-ENNQGGKPLIL 236
Query: 237 YPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVILYPC 275
Y CH GGNQ++ S EIR + E CL A G V L C
Sbjct: 237 YTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQLKAC 278
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 45/103 (43%), Positives = 63/103 (61%), Gaps = 5/103 (4%)
Query: 85 WTPTMAGGLFSIDKAFFE--KLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAA 142
+T ++ + SID FE K Y S + N + S F W ++P+ ER+R K+
Sbjct: 1 YTAVVSPDIASIDMNTFEFNKPSPYRSNHN---RGNFDWSLSFGWESLPDHERQRRKDET 57
Query: 143 EPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 185
P+ TPT AGGLFSI K +FE +GTYD +IWGGEN+E+SF+
Sbjct: 58 YPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFR 100
>gi|444727591|gb|ELW68073.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Tupaia chinensis]
Length = 554
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 73/178 (41%), Positives = 95/178 (53%), Gaps = 40/178 (22%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 115 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 167
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK++FE+LG YD D+WGGENL
Sbjct: 168 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENL--- 224
Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENL 180
GGLF +DK++FE+LG YD D+WGGENL
Sbjct: 225 -----------------------------GGLFVMDKSYFEELGKYDMMMDVWGGENL 253
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 86/182 (47%), Gaps = 20/182 (10%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLT---SSYKFFIGGFDWN 61
E + L+ ++ VL ++ H++ +I + DD R R+ ++ + D +
Sbjct: 57 EARSALLRTVVSVLKKSPPHLIKEII--LVDDYSNDRLMRSRVRGADAAQAKVLTFLDSH 114
Query: 62 LQFNWH---AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 118
+ N H + ER + P+ ID + + D+ GG +
Sbjct: 115 CECNEHWLEPLLERVAEDRTRVVSPI-----------IDVINMDNFQYVGASADLKGGFD 163
Query: 119 LELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
L FK+++ PE+ R R N P+ TP +AGGLF +DK++FE+LG YD D+WGGE
Sbjct: 164 WNLVFKWDYMT-PEQRRSRQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGE 222
Query: 179 NL 180
NL
Sbjct: 223 NL 224
>gi|357606408|gb|EHJ65055.1| putative UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Danaus plexippus]
Length = 389
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 91/231 (39%), Positives = 120/231 (51%), Gaps = 58/231 (25%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYK-FFIGGFDWNL 62
CE + WL+PLL+ L N V SP+I +I +TFE ++ + K +IGGF+WNL
Sbjct: 149 CECTEGWLEPLLERLVENPKIVASPVIDHIDPNTFEY------ISQNPKDIYIGGFNWNL 202
Query: 63 QFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
+F W +I E KR +N P+ TPT+AGGLF+IDK FF +G YD GFD+WGGENLELS
Sbjct: 203 KFIWRSI---EYKR-ENFLLPIKTPTIAGGLFAIDKEFFYSIGYYDEGFDVWGGENLELS 258
Query: 123 FK-----------------------FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDK 159
FK F ++ E ++ AE VW A K
Sbjct: 259 FKVWMCGGSLEIVPCSHVGHIFRENFPYYTSGETFKRNAARLAE-VWLDDYA-------K 310
Query: 160 AFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL 210
F+E++G D GDVT++KELR+ L CKSF WYL
Sbjct: 311 IFYERIGNADVS----------------LGDVTAQKELRKKLKCKSFNWYL 345
>gi|189240187|ref|XP_975207.2| PREDICTED: similar to AGAP008229-PA [Tribolium castaneum]
Length = 575
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 105/305 (34%), Positives = 146/305 (47%), Gaps = 49/305 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+ LL V+ ++ + VV P+I I DDTF S++ G F+WNLQ
Sbjct: 219 CECTTGWLEALLSVIKQDRTAVVCPVIDIINDDTFAY-------VKSFELHWGAFNWNLQ 271
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + RE K KN A +P TPTMAGGLF+ID+ +F ++G YD G +IWGGENLE+S
Sbjct: 272 FRWFTLGGRELKLRKNDATQPFNTPTMAGGLFAIDREYFFEMGAYDDGMNIWGGENLEMS 331
Query: 123 FKFNWHA-----IPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG-FDIW 175
F+ W I R H + P P GG I+K F L D W
Sbjct: 332 FRI-WQCGGKVQIAPCSRVGHLFRKSSPYSFP---GG---INKTLFSNLARVARVWMDDW 384
Query: 176 GGENLELSFKGDF----GDVTSRKELRRNLGCKSFKWYLE-----------------VSN 214
+ + D +VTSR ELRR CK F+WYL+ + N
Sbjct: 385 ARFYFKFNEPADRIKNEQNVTSRIELRRKHKCKGFEWYLDNVWPQHFFPKDDRFFGRIRN 444
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEA-CLDYAGGD 269
MC+ K ++P+G+ G+ + ++M+K G I D++ CLD A
Sbjct: 445 LGQNMCLIKPQKKVVSNQPMGIAKIDMCLGDEVILEMFVMTKEGFIMTDDSICLD-APEK 503
Query: 270 VILYP 274
V++ P
Sbjct: 504 VVIGP 508
>gi|449270901|gb|EMC81545.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Columba livia]
Length = 608
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 102/333 (30%), Positives = 146/333 (43%), Gaps = 66/333 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNEMWLQPLLTPIREDRRTVVCPVIDIISADTLTY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + + A P+ +PTMAGGLF++D+ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELEGPEGATAPIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISF 359
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RKR + P TMA + + D +
Sbjct: 360 RIWMCGGRLLIIPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWM------DEYKE 412
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
+ EL + ++G++T R ELR+ L CKSFKWYL+
Sbjct: 413 QYFALRPELRMR-NYGNITDRVELRKRLNCKSFKWYLDNIYPEMQISGPNAKAPQPVFIN 471
Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK-HGEIR 257
+ + + C+ + P+ V + C NQ W+ ++ H I
Sbjct: 472 RAQKRPKIIQRGRLYHLQTNKCLVAQGHPSQKGGLVVVRECDYNDPNQVWIYNEDHELIL 531
Query: 258 RDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
+ CLD + L CHGS G+Q + +
Sbjct: 532 NNLLCLDVSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|270011650|gb|EFA08098.1| hypothetical protein TcasGA2_TC005702 [Tribolium castaneum]
Length = 607
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 104/302 (34%), Positives = 143/302 (47%), Gaps = 48/302 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+ LL V+ ++ + VV P+I I DDTF S++ G F+WNLQ
Sbjct: 251 CECTTGWLEALLSVIKQDRTAVVCPVIDIINDDTFAY-------VKSFELHWGAFNWNLQ 303
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + RE K KN A +P TPTMAGGLF+ID+ +F ++G YD G +IWGGENLE+S
Sbjct: 304 FRWFTLGGRELKLRKNDATQPFNTPTMAGGLFAIDREYFFEMGAYDDGMNIWGGENLEMS 363
Query: 123 FKFNWHA-----IPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG-FDIW 175
F+ W I R H + P P GG I+K F L D W
Sbjct: 364 FRI-WQCGGKVQIAPCSRVGHLFRKSSPYSFP---GG---INKTLFSNLARVARVWMDDW 416
Query: 176 GGENLELSFKGDF----GDVTSRKELRRNLGCKSFKWYLE-----------------VSN 214
+ + D +VTSR ELRR CK F+WYL+ + N
Sbjct: 417 ARFYFKFNEPADRIKNEQNVTSRIELRRKHKCKGFEWYLDNVWPQHFFPKDDRFFGRIRN 476
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEA-CLDYAGGD 269
MC+ K ++P+G+ G+ + ++M+K G I D++ CLD
Sbjct: 477 LGQNMCLIKPQKKVVSNQPMGIAKIDMCLGDEVILEMFVMTKEGFIMTDDSICLDAPEKV 536
Query: 270 VI 271
VI
Sbjct: 537 VI 538
>gi|241746527|ref|XP_002414286.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215508140|gb|EEC17594.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 493
Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 90/276 (32%), Positives = 128/276 (46%), Gaps = 43/276 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL PLL + + VV P+I I ++F + + GGF+WNL
Sbjct: 212 CECNQGWLPPLLRRVKEDPRRVVCPVIDVINLESF-------KYFGASSDLRGGFNWNLV 264
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + +ER+ R N P+ TP +AGGLF +D+A FE+LG YD+ DIWGGENLELS
Sbjct: 265 FKWEFLSNKEREERANNPTLPIRTPMIAGGLFVVDRAQFERLGAYDTAMDIWGGENLELS 324
Query: 123 FKFNWHAIPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
F+ W E RK+H P P +G +F+ +
Sbjct: 325 FR-AWQCGGSLEILPCSRVGHVFRKQH-----PYSFPGGSGNVFARQANTRRAAEVWMDD 378
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE----------------VSND 215
+ + + ++ G V R LR++LGC SF+WYL+ S
Sbjct: 379 YKKYYYATVPVARNVPMGSVEERLNLRKSLGCHSFQWYLDNVYPELKVPAAGGERLASLR 438
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMS 251
MC+D+ PVGL+ CH GGNQ W ++
Sbjct: 439 QGQMCLDTLGGSEG--NPVGLFTCHGSGGNQQWSLA 472
>gi|345323153|ref|XP_001510349.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Ornithorhynchus anatinus]
Length = 479
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 94/320 (29%), Positives = 140/320 (43%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WL PLL + + + VVSP+I I DTF ++ GGFDW+L
Sbjct: 129 CEVNKDWLLPLLQRIKEDPTRVVSPVIDIINLDTFAY-------VAASSDLRGGFDWSLH 181
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ + + +P+ TP +AGGLF IDK++F LG YD+ DIWGGEN E+SF
Sbjct: 182 FKWEQLSPEQKAKRTDPTQPIKTPIIAGGLFVIDKSWFNHLGKYDTAMDIWGGENFEISF 241
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ +P RK+H P P + + +
Sbjct: 242 RVWMCGGTLEIVPCSRVGHVFRKKH-----PYVFPEGNANTY---------IKNTKRTAE 287
Query: 174 IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------SNDW 216
+W E + + +GD+ SR EL+++L C+ FKWYLE S
Sbjct: 288 VWMDEFKQYYYAARPAAQGRPYGDIQSRVELKKSLKCRPFKWYLETVYPELRIPEESLAQ 347
Query: 217 SGM------CIDSACKPTDMHKPVGLYPC----HKQGGNQFWMMSKHGEIRRDEACLD-- 264
+G+ C++S + L PC + G Q W + +IR+ + CL
Sbjct: 348 TGIIRQRQKCLESQRLEGQEFPALILSPCITSKGEASGTQEWTYTFAQQIRQQQLCLSVH 407
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+ PC G Q
Sbjct: 408 TLFPGSQVLFSPCKEEDGKQ 427
>gi|395539756|ref|XP_003771832.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 11 [Sarcophilus
harrisii]
Length = 970
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 103/329 (31%), Positives = 143/329 (43%), Gaps = 66/329 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WLQPLL + + VV P+I I DT + SS GGF+W L
Sbjct: 610 CEVNKMWLQPLLVPIHEDHRTVVCPVIDIISADTL--------MYSSSPIVRGGFNWGLH 661
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 662 FKWDLVPFSELGGPEGAIAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISF 721
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RKR + P TM + + D +
Sbjct: 722 RIWMCGGKLFIIPCSRVGHIFRKR-RPYGSPEGQDTMTHNSLRLAHVWL------DEYKE 774
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
+ EL K +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 775 QYFSLRPELKLK-SYGNISERVELRKKLGCKSFKWYLDNIYPEMQLSGPNAKPQQPVFIN 833
Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMS-KHGEIR 257
+ + + C+ + P+ V L C NQ W+ + +H I
Sbjct: 834 RGPKRPKILQRGRLYHLQTNKCLAAQGHPSQKGGLVVLKVCDYSDPNQVWIYNEEHELIL 893
Query: 258 RDEACLDY----AGGDVILYPCHGSKGNQ 282
+ CLD + L CHGS G+Q
Sbjct: 894 NNLLCLDMSETRSSDPPRLMKCHGSGGSQ 922
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 81/218 (37%), Positives = 111/218 (50%), Gaps = 26/218 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WLQPLL + + VV P+I I DT + SS GGF+W+L
Sbjct: 248 CEVNKMWLQPLLVPIHEDHRTVVCPVIDIISADTL--------MYSSSPIVCGGFNWDLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P + + A P+ +P MAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPFSKLGGPEGAIAPIKSPAMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISF 359
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RKR + P TM + + D +
Sbjct: 360 RIWMCGGKLFIIPCSRVGHIFRKR-RPYGSPEGQDTMTNNSLRMAHVWL------DEYKE 412
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ EL K +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 413 QYFSLRPELKLK-SYGNISERVELRKKLGCKSFKWYLD 449
>gi|291243602|ref|XP_002741690.1| PREDICTED: polypeptide GalNAc transferase 5-like [Saccoglossus
kowalevskii]
Length = 753
Score = 134 bits (338), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 93/285 (32%), Positives = 135/285 (47%), Gaps = 36/285 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + + VV+P + I D TF F T IGGF W +
Sbjct: 397 CECNIGWLEPLLSEIVNDRTTVVAPNLDVISDKTFGYTFIKPEQT-----MIGGFGWLVD 451
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+++P+RER R N + P+ TPT+AGGLF+ID +F ++G YD GFD WG ENLELS
Sbjct: 452 FKWYSLPKRERLRVNNDMSRPLRTPTIAGGLFAIDADYFHRIGLYDPGFDTWGAENLELS 511
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ +P ++ P +I K + + +
Sbjct: 512 FRVWQCGGTLEIVPCSHVGHVFRSSIPYKYKDNKNPGLTIAKNNMRLMDVWMDDLKYFFL 571
Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSNDWSGMC 220
L + +FGD + RK+LR NL CK FKWYLE + + SG C
Sbjct: 572 AILPHYAEQEFGDTSERKQLRSNLKCKDFKWYLENIYPENTMPMQYQILGHIKHVESGEC 631
Query: 221 IDSACKPTDMHKPVGLYPCHKQGG--NQFWMMSKHGEIRRDEACL 263
++ + K D + P + PC GG ++ M +K ++ D CL
Sbjct: 632 LEMSRK--DGNTP-AIQPC---GGHFDEVLMYTKQSNLQHDYLCL 670
>gi|443683118|gb|ELT87486.1| hypothetical protein CAPTEDRAFT_155466 [Capitella teleta]
Length = 644
Score = 134 bits (337), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 141/303 (46%), Gaps = 43/303 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL PLL + + + +V PL+ I TFE R L G FDWNLQ
Sbjct: 299 CECAEGWLPPLLLAIEADRTKIVCPLVDVIEFQTFEYRAAKEELH-------GAFDWNLQ 351
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +PE E KR + A+ + PT+ GGLF++D+ +F+++G+YDSG DIWG ENLELSF
Sbjct: 352 FIWKDLPEHEMKRRTSPADNIRAPTIIGGLFAVDRLYFKRIGSYDSGMDIWGSENLELSF 411
Query: 124 KFNWHA-----IPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEK----LGTYDSGFD 173
+ W I R H P P GG +I L Y F
Sbjct: 412 RV-WMCGGSLEISPCSRVGHVFRTRIPYGFPN--GGKRTIRNNAMRAAEVWLDDYKKFF- 467
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSGMCIDS 223
+ +N+ DV R +LRR L CKSF+WYL E +++ G I S
Sbjct: 468 -YASQNITRRLTT-VEDVVVRVDLRRKLKCKSFQWYLDNVIPEAVLPEDEDEYFGQ-IQS 524
Query: 224 ACKPT------DMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA-CLDYAGGDVILYPCH 276
P+ D H + L C +Q + ++ ++RD+ C D G D+I C
Sbjct: 525 LASPSKCLEFKDNH--LTLSHCKSMKESQMFHLTNQQLLKRDDVTCFDVNGRDLITRDCE 582
Query: 277 GSK 279
S+
Sbjct: 583 ISQ 585
>gi|118085566|ref|XP_418541.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Gallus
gallus]
Length = 608
Score = 134 bits (337), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 102/329 (31%), Positives = 144/329 (43%), Gaps = 66/329 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNEMWLQPLLTPIKEDRRTVVCPVIDIISADTLTY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + + A P+ +PTMAGGLF++D+ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELEGPEGATAPIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISF 359
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RKR + P TMA + + D +
Sbjct: 360 RIWMCGGRLLIIPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWM------DEYKE 412
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
+ EL + ++G++T R ELR+ L CKSFKWYL+
Sbjct: 413 QYFALRPELRTR-NYGNITDRVELRKRLNCKSFKWYLDNIYPEMQVSGPNAKAPQPVFIN 471
Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK-HGEIR 257
+ + + C+ + P+ V + C NQ W+ ++ H I
Sbjct: 472 RAQKRPKIIQRGRLYHLQTNKCLVAQGHPSQKGGLVVVRECDYNDQNQVWVYNEDHELIL 531
Query: 258 RDEACLDY----AGGDVILYPCHGSKGNQ 282
+ CLD + L CHGS G+Q
Sbjct: 532 NNLLCLDVSETRSSDPPRLMKCHGSGGSQ 560
>gi|157114760|ref|XP_001652408.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108883561|gb|EAT47786.1| AAEL001151-PA [Aedes aegypti]
Length = 592
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 97/304 (31%), Positives = 144/304 (47%), Gaps = 35/304 (11%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL+ LLD +ARNS+ + P I I + LR T + + G +DW+L F W
Sbjct: 247 WLEALLDPVARNSTTIAIPTIDWIDEHDMHLR------TENAPSYYGAYDWDLNFGWWGR 300
Query: 70 PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW-- 127
R K +N EP TP MAGGLF+I ++FFE+LG YD GFDI+G EN+ELS K +W
Sbjct: 301 WSRINK-PENKMEPFETPAMAGGLFAITRSFFERLGWYDEGFDIYGIENIELSMK-SWIC 358
Query: 128 ----HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDS-GFDIWGGENLE 181
+P + P T + + E + Y FDI+G
Sbjct: 359 GGKMVTVPCSRVAHIQKTGHPYLIQTKKDVVRANSLRLAEVWMDEYKQIIFDIYGLPRYP 418
Query: 182 LSFKGDFGDVTSRKELRRNLGCKSFKWYLEVS--------------NDWSGMCI--DSAC 225
+ + GDV+ RK++R CK+FK+Y++ + + M + D+
Sbjct: 419 VE---EIGDVSHRKQIREKAKCKTFKYYVQAAFPEMNNPMVEGAFHGEVKNMALGNDTCL 475
Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
+ V + C Q QFW + + E+ + CLDY G + +Y CH S+GNQ ++
Sbjct: 476 EYQLDTNTVRMATCDHQETGQFWAHNYYQELNSHKHCLDYTGDTMGVYGCHRSRGNQAWQ 535
Query: 286 YDYK 289
Y K
Sbjct: 536 YVKK 539
>gi|432097047|gb|ELK27545.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Myotis davidii]
Length = 558
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 107/318 (33%), Positives = 147/318 (46%), Gaps = 71/318 (22%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 233 CEVNVMWLQPLLAAIREDRRTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 284
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + + A P+ +PTMAGGLF++++++F +LG YDSG DIWGGENLE+SF
Sbjct: 285 FKWDLVPLSELEGPEGATAPIKSPTMAGGLFAMNRSYFSELGQYDSGMDIWGGENLEISF 344
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 345 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 381
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLEVSNDWSG 218
+L L S + D +G+V+ R ELR+ LGCKSFKWYL+ + +
Sbjct: 382 HNSLRLAHVWLDEYKEQYFSLRPDLRTRSYGNVSERVELRKKLGCKSFKWYLD--SIYPE 439
Query: 219 MCIDSA-CKPTDMHKPVGL-----YPCHKQGGNQFWMMSKHGEIRRDEACLDY----AGG 268
M I KP +P+ + P Q G + +H + + CLD +
Sbjct: 440 MQISGPNAKP---QQPIFINRGPKRPKILQRGRIWIYNEEHELVLSNLLCLDMSETRSSD 496
Query: 269 DVILYPCHGSKGNQYFEY 286
L CHGS G+Q + +
Sbjct: 497 PPRLMKCHGSGGSQQWTF 514
>gi|109068965|ref|XP_001105286.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
6 [Macaca mulatta]
gi|355561195|gb|EHH17881.1| hypothetical protein EGK_14364 [Macaca mulatta]
Length = 608
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGEAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNIYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 457 ISGPHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516
Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ ++ E+ + CLD + L CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|118403595|ref|NP_001072369.1| polypeptide N-acetylgalactosaminyltransferase 14 [Xenopus
(Silurana) tropicalis]
gi|111305707|gb|AAI21473.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 [Xenopus (Silurana)
tropicalis]
Length = 555
Score = 134 bits (336), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 139/320 (43%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WL PLL + + + VVSP+I I DTF ++ GGFDW+L
Sbjct: 203 CEVNKDWLPPLLHRIKEDPTRVVSPVIDIINLDTFAY-------IAASSDLRGGFDWSLH 255
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ + + EP+ TP +AGGLF I+K++F LG YD+ DIWGGEN E+SF
Sbjct: 256 FKWEQLSAEQKAKRLDPTEPIKTPVIAGGLFVIEKSWFNHLGKYDTAMDIWGGENFEISF 315
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + IP RK+H P P + + +
Sbjct: 316 RVWMCGGSLEIIPCSRVGHVFRKKH-----PYVFPEGNANTY---------IKNTKRTAE 361
Query: 174 IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE----------VSNDW 216
+W E + +GD+ R LRR L C+SFKWYLE S
Sbjct: 362 VWMDEFKNHYYAARPAAQGRPYGDIQKRLSLRRTLKCRSFKWYLENVYPELQIPAESLSK 421
Query: 217 SGM------CIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
SG+ CI+S + L PC G +Q W+ ++ +I + C+
Sbjct: 422 SGIIRQRQRCIESQKTEGPEPPSLNLVPCSSLKGVSPQSQEWVYTQVQQISQGPLCMSVH 481
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L PC G Q
Sbjct: 482 TLFPGTQVVLLPCREGDGKQ 501
>gi|380786043|gb|AFE64897.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
gi|383411811|gb|AFH29119.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
gi|384942402|gb|AFI34806.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
Length = 608
Score = 134 bits (336), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGEAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNIYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 457 ISGPHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516
Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ ++ E+ + CLD + L CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|345326650|ref|XP_003431069.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 4-like
[Ornithorhynchus anatinus]
Length = 580
Score = 134 bits (336), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 100/283 (35%), Positives = 136/283 (48%), Gaps = 46/283 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + RN + VV P+I I +TFE G + IGGFDW L
Sbjct: 232 CECGPGWLEPLLERIGRNETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 285
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +PERER+R ++ +P+ +PTMAGGLF++ K +FE LGTYD G ++WGGENLELSF
Sbjct: 286 FQWQTVPERERRRRRSRIDPIPSPTMAGGLFAVGKKYFEYLGTYDMGMEVWGGENLELSF 345
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
+ W + + G +F + L ++W E
Sbjct: 346 RV-WQC----------GGTLEILPCSHVGHVFPKRAPYARPSFLRNTARAAEVWMDGYKE 394
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
+ K + D++ R LR L C+SF W E V D W G
Sbjct: 395 HFYNRNPPARKESYWDLSERTSLREXLNCRSFDWLPENVLPRIHVPEDRPGWHGAVRSAG 454
Query: 221 IDSACKPTDM--HKPVG----LYPCHKQGGNQFWMMSKHGEIR 257
I S C + H P G L+ CH QGGNQF+ + + EIR
Sbjct: 455 ISSECLDYNAPEHNPTGARLSLFGCHGQGGNQFFEYTSNREIR 497
>gi|355748155|gb|EHH52652.1| hypothetical protein EGM_13122 [Macaca fascicularis]
Length = 608
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGEAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNIYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 457 ISGPHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516
Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ ++ E+ + CLD + L CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|241998138|ref|XP_002433712.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215495471|gb|EEC05112.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 653
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 77/220 (35%), Positives = 109/220 (49%), Gaps = 28/220 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL+PLL+ + N + V P+I I DTFE P GGF+W L
Sbjct: 286 CEVNVGWLEPLLERIRANRATVTCPIIDIINADTFEYTASP--------IVRGGFNWGLH 337
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + P ++ + A P+ +PTMAGGLF++D+ FF +LG YD G DIWGGENLE+SF
Sbjct: 338 FKWESPPAGLARKGRGAIAPIPSPTMAGGLFAMDRKFFHRLGEYDDGMDIWGGENLEISF 397
Query: 124 KF-----NWHAIPERER----KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY--DSGF 172
+ IP +R + P T+ + + + Y +
Sbjct: 398 RIWMCGGQLEIIPCSRVGHVFRRRRPYGSPNGEDTLTKNSLRVAHVWMDDYKKYYFQTRS 457
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV 212
D+ G +GD+TSR LR+ LGC+SF WY++
Sbjct: 458 DVVGKP---------YGDITSRVALRKRLGCRSFDWYMKT 488
>gi|195172682|ref|XP_002027125.1| GL20074 [Drosophila persimilis]
gi|194112938|gb|EDW34981.1| GL20074 [Drosophila persimilis]
Length = 597
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 96/308 (31%), Positives = 142/308 (46%), Gaps = 44/308 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
CE W +PLL + + + V+ P+I I + F+ T+ YK F +GGF WN
Sbjct: 243 CEGNVGWCEPLLHRIKESRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 296
Query: 63 QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
F+W +PERE++R + + P ++PTMAGGLF++D+ +F ++G+YD D WGG
Sbjct: 297 HFDWINLPEREKQRQRRECKQEREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGG 356
Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
ENLE+SF+ IP P P I+ A L D
Sbjct: 357 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEF 414
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
+I+ +L F D GDVT R LR+ L CKSF WYL +V
Sbjct: 415 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFAWYLKNIYPEKFVPNADVVGWGKVK 474
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
+ S +C+D + + VGLYPC K +Q + + +R + +C D
Sbjct: 475 SVSSNLCLDDLLQNNEKPYNVGLYPCGKVLQKSQLFSFTNSQVLRNELSCATVQHSDSPP 534
Query: 270 --VILYPC 275
V++ PC
Sbjct: 535 YRVVMVPC 542
>gi|126341064|ref|XP_001364304.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Monodelphis domestica]
Length = 609
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 102/329 (31%), Positives = 143/329 (43%), Gaps = 66/329 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WLQPLL + + VV P+I I DT + SS GGF+W L
Sbjct: 249 CEVNKMWLQPLLVPIQEDRRTVVCPVIDIISADTL--------MYSSSPIVRGGFNWGLH 300
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 301 FKWDLVPFSELEGPEGAIAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISF 360
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RKR + P TM + + D +
Sbjct: 361 RIWMCGGKLFIIPCSRVGHIFRKR-RPYGSPEGQDTMTYNSLRLAHVWL------DEYKE 413
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
+ EL K +G+++ R LR+ LGCKSFKWYL+
Sbjct: 414 QYFSLRPELKLK-SYGNISERIALRKKLGCKSFKWYLDNIYPEMQLSGPNAKPQQPVFIN 472
Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMS-KHGEIR 257
+ + + C+ + P+ V L C NQ W+ + +H I
Sbjct: 473 RGPKRPKILQRGRLYHLQTNKCLAAQGHPSQKGGLVVLRVCDYSDPNQVWIYNEEHELIL 532
Query: 258 RDEACLDY----AGGDVILYPCHGSKGNQ 282
+ CLD + L CHGS G+Q
Sbjct: 533 NNLLCLDMSETRSSDPPRLMKCHGSGGSQ 561
>gi|125810093|ref|XP_001361353.1| GA20875 [Drosophila pseudoobscura pseudoobscura]
gi|54636528|gb|EAL25931.1| GA20875 [Drosophila pseudoobscura pseudoobscura]
Length = 597
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 96/308 (31%), Positives = 142/308 (46%), Gaps = 44/308 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
CE W +PLL + + + V+ P+I I + F+ T+ YK F +GGF WN
Sbjct: 243 CEGNVGWCEPLLHRIKESRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 296
Query: 63 QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
F+W +PERE++R + + P ++PTMAGGLF++D+ +F ++G+YD D WGG
Sbjct: 297 HFDWINLPEREKQRQRRECKQEREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGG 356
Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
ENLE+SF+ IP P P I+ A L D
Sbjct: 357 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEF 414
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
+I+ +L F D GDVT R LR+ L CKSF WYL +V
Sbjct: 415 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFAWYLKNIYPEKFVPNADVVGWGKVK 474
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
+ S +C+D + + VGLYPC K +Q + + +R + +C D
Sbjct: 475 SVSSNLCLDDLLQNNEKPYNVGLYPCGKVLQKSQLFSFTNSQVLRNELSCATVQHSDSPP 534
Query: 270 --VILYPC 275
V++ PC
Sbjct: 535 YRVVMVPC 542
>gi|426337572|ref|XP_004032775.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3, partial
[Gorilla gorilla gorilla]
Length = 413
Score = 133 bits (335), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 87/265 (32%), Positives = 134/265 (50%), Gaps = 36/265 (13%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL+PLL +A N + VVSP IA+I +TFE P ++ + G FDW+L F W ++
Sbjct: 157 WLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLSFGWESL 213
Query: 70 PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA 129
P+ E++R K+ P+ TPT AGGLFSI K +FE +G+YD +IWGGEN+E+SF+ W
Sbjct: 214 PDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRV-WQC 272
Query: 130 IPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
+ E R K+ P T +A + + + ++ Y F +
Sbjct: 273 GGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFYRRNTDA 329
Query: 180 LELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGMCI 221
++ + FGD++ R E++ L CK+F WYL + + +C+
Sbjct: 330 AKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSVGQPLCL 389
Query: 222 DSACKPTDMHKPVGLYPCHKQGGNQ 246
D + KP+ +Y CH GGNQ
Sbjct: 390 DVG-ENNQGGKPLIMYTCHGLGGNQ 413
>gi|312383497|gb|EFR28562.1| hypothetical protein AND_03374 [Anopheles darlingi]
Length = 874
Score = 133 bits (335), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 98/319 (30%), Positives = 145/319 (45%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + R+ + + P+I I TFE R + + + G F+W +
Sbjct: 243 CEVNTNWLPPLLAPIHRDRTVMTVPIIDGIDHKTFEYR----PVYADGHHYRGIFEWGML 298
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +P RE+KR K+ +EP +PT AGGLF+I++ FF LG YDSG +WGGEN ELSF
Sbjct: 299 YKENEVPRREQKRRKHDSEPYRSPTHAGGLFAINRKFFLDLGAYDSGLLVWGGENFELSF 358
Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K W E R H + P G L S K + Y + W E
Sbjct: 359 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDE 413
Query: 179 NLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYL-------------------- 210
+ F D GD++ + L+ L CKSF+WY+
Sbjct: 414 PYKEYFYTREPLAQYLDMGDISEQLALKERLQCKSFQWYMDNVAYDVLDKYPMLPANLFW 473
Query: 211 -EVSNDWSGMCIDSACK-PTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGG 268
E+ N C+D+ + P + +GL CH QG NQ ++ G++ E C++
Sbjct: 474 GELQNTGMEKCVDALGRQPPAI---IGLQVCHGQGHNQLIRLNAAGQLGVGERCIEAYNA 530
Query: 269 DVILYPCHGSKGNQYFEYD 287
D+ L C + ++YD
Sbjct: 531 DIKLAFCRLGTVDGPWQYD 549
>gi|170593939|ref|XP_001901721.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158590665|gb|EDP29280.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 645
Score = 133 bits (335), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 97/318 (30%), Positives = 150/318 (47%), Gaps = 48/318 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + +N + P+I I + + R G S+ K + G F+W L
Sbjct: 296 CEVNVNWLPPLLAPIRQNRKVMTVPVIDGIDKNDWSYRIVYG---SADKHYRGIFEWGLL 352
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +E R K+ +EP +PT AGGLF+I+K +FE+LG YD G IWGGE ELSF
Sbjct: 353 YKETELSSQELLRRKHNSEPFRSPTHAGGLFAINKKWFEELGYYDPGLQIWGGEQYELSF 412
Query: 124 KFNWHA------IPERERKRHKNAAEPV------WTPTMAGGLFSIDKAFFEKLGTYDSG 171
K W +P + P P ++ + + K + ++ YD
Sbjct: 413 KI-WQCGGGILFVPCSHVGHVYRSHMPYGFGKLSGKPVISTNMLRVIKTWMDE---YDKY 468
Query: 172 FDIWGGENLELSFKGDF-GDVTSRKELRRNLGCKSFKWYL-------------------- 210
+ I E S + G+++S+ +LR++L CKSFKWY+
Sbjct: 469 YYI-----REPSARHRLPGNISSQLKLRKSLKCKSFKWYMEKVAYDVVVSYPFPPENHVW 523
Query: 211 -EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD 269
E N +G CID+ +P + VG PCH GGNQ +++ G++ + E C+ G+
Sbjct: 524 GEAKNHATGKCIDTMGRP--VPGIVGATPCHGYGGNQLIRLNRKGQLAQGEWCITAVHGN 581
Query: 270 VILYPCHGSKGNQYFEYD 287
+I C + F Y+
Sbjct: 582 LITNHCIKGTVDGPFTYN 599
>gi|402586829|gb|EJW80766.1| glycosyltransferase [Wuchereria bancrofti]
Length = 409
Score = 133 bits (334), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 97/318 (30%), Positives = 146/318 (45%), Gaps = 48/318 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + +N + P+I I + + R G + Y+ G F+W L
Sbjct: 60 CEVNVNWLPPLLAPIRQNRKIMTVPVIDGIDKNDWSYRIVYGSVDKHYR---GIFEWGLL 116
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +E R K+ +EP +PT AGGLF+I+K +FE+LG YD G IWGGE ELSF
Sbjct: 117 YKETELSSQELLRRKHNSEPFRSPTHAGGLFAINKKWFEELGYYDPGLQIWGGEQYELSF 176
Query: 124 KFNWHA------IPERERKRHKNAAEPV------WTPTMAGGLFSIDKAFFEKLGTYDSG 171
K W +P + P P ++ + + K + ++ YD
Sbjct: 177 KI-WQCGGGILFVPCSHVGHVYRSHMPYGFGKLSGKPVISTNMLRVIKTWMDE---YDKY 232
Query: 172 FDIWGGENLELSFKGDF-GDVTSRKELRRNLGCKSFKWYL-------------------- 210
+ I E S K G+++S+ +LR +L CKSFKWY+
Sbjct: 233 YYI-----REPSAKHRLPGNISSQLKLRESLKCKSFKWYMEKVAYDVIVSYPFPPENHVW 287
Query: 211 -EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD 269
E N +G CID+ +P VG PCH GGNQ ++ G++ + E C+ G+
Sbjct: 288 GEAKNHATGKCIDTMGRPVP--GIVGATPCHGYGGNQLIRLNMKGQLAQGEWCITAVHGN 345
Query: 270 VILYPCHGSKGNQYFEYD 287
+I C + F Y+
Sbjct: 346 LITNHCIKGTVDGPFTYN 363
>gi|260817709|ref|XP_002603728.1| hypothetical protein BRAFLDRAFT_126865 [Branchiostoma floridae]
gi|229289050|gb|EEN59739.1| hypothetical protein BRAFLDRAFT_126865 [Branchiostoma floridae]
Length = 501
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 90/305 (29%), Positives = 145/305 (47%), Gaps = 50/305 (16%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL+PLL + +N+S V P+I +I TF KF GGF W+L F W +
Sbjct: 152 WLEPLLARIRKNNSTVACPVIDHIDTKTFAY--------EQLKFLAGGFTWDLNFMWIYV 203
Query: 70 PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF---- 125
+ E R K+A +PV P MAGGLF+I K +F+ +G YD +I+GGEN+E+SF+
Sbjct: 204 NKEEMARRKSAIDPVRCPVMAGGLFAIYKDYFQHIGAYDQAMEIYGGENVEMSFRVWQCG 263
Query: 126 -NWHAIPERERKRHKNAAEP---VWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+P + +P V + ++KA ++ + ++ E
Sbjct: 264 GRIETVPCSRVGHIERTDKPYLYVRSNDTKDINIEVNKARVAEVWMDEYKRYLYAREPQL 323
Query: 182 LSFKGDFGDVTSRKELRRNLGCKSFKWYL---------------------EVSNDWSGMC 220
+ +GD++ R+ LR+ LGC+SF+WY+ E+ N +G+C
Sbjct: 324 KNI--SYGDISERQALRKRLGCQSFQWYMENVYPDRLEQTVENGYYRAWGELRNLQAGLC 381
Query: 221 IDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKG 280
+D + VGL+ CH QGG QF+ + + + +A GD+ C G++G
Sbjct: 382 LDLMDG-----RGVGLWDCHGQGGQQFFALRRP---EKRKALQTIGTGDM---QCMGTEG 430
Query: 281 NQYFE 285
+ FE
Sbjct: 431 TERFE 435
>gi|354468358|ref|XP_003496633.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Cricetulus griseus]
Length = 541
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/320 (30%), Positives = 140/320 (43%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 191 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDWSLH 243
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 244 FQWEQLSPEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISF 303
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + IP RK+H P P + + +
Sbjct: 304 RVWMCGGSLEIIPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 349
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG++ +R LR+NL C++FKWYLE V D S
Sbjct: 350 VWMDEYKQYYYAARPFALERPFGNIENRLNLRKNLHCQTFKWYLENVYPELRVPPDSSIQ 409
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G +Q W + +I ++E CL
Sbjct: 410 KGNIRQRQKCLESQKQKKQETPHLRLSPCTKVKGEEAKSQVWAFTYTQQIIQEELCLSVV 469
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 470 TLFPGAPVVLVLCKNGDERQ 489
>gi|332243650|ref|XP_003270991.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Nomascus leucogenys]
Length = 527
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 167 CEVNVMWLQPLLAAIREDQHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 218
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 219 FKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 278
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 279 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 315
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 316 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 375
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 376 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 435
Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ ++ E+ + CLD + L CHGS G+Q + +
Sbjct: 436 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 483
>gi|224044641|ref|XP_002188932.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Taeniopygia guttata]
Length = 608
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 102/333 (30%), Positives = 145/333 (43%), Gaps = 66/333 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNEMWLQPLLAPIREDPRTVVCPVIDIISADTLTY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + + A P+ +PTMAGGLF++D+ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLAELEGPEGATAPIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISF 359
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RKR + P TMA + + D +
Sbjct: 360 RIWMCGGRLLIIPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWM------DEYKE 412
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
+ EL + +G++T R ELR+ L CKSFKWYL+
Sbjct: 413 QYFALRPELRTRS-YGNITDRVELRKRLNCKSFKWYLDNIYPEMQISGPNAKAPQPVFIN 471
Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK-HGEIR 257
+ + + C+ + P+ V + C NQ W+ ++ H I
Sbjct: 472 RAQKRPKIIQRGRLYHLQTNKCLVAQGHPSQKGGLVVVRECDYNDQNQVWIYNEDHELIL 531
Query: 258 RDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
+ CLD + L CHGS G+Q + +
Sbjct: 532 NNLLCLDVSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|355689604|gb|AER98888.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 [Mustela putorius
furo]
Length = 306
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 79/221 (35%), Positives = 110/221 (49%), Gaps = 29/221 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGFDWNL
Sbjct: 94 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 146
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 147 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 206
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + +P P P +G +F+ + ++W
Sbjct: 207 FRVWQCGGSLEIVPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 257
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
E + +G++ SR ELR+ L CK FKWYLE
Sbjct: 258 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLE 298
>gi|332243648|ref|XP_003270990.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Nomascus leucogenys]
Length = 608
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVMWLQPLLAAIREDQHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 457 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516
Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ ++ E+ + CLD + L CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|426358557|ref|XP_004046575.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
3 [Gorilla gorilla gorilla]
Length = 527
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 167 CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 218
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 219 FKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 278
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 279 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 315
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 316 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 375
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 376 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNRCLVAQGRPSQKGGLVVLKACDYSD 435
Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ ++ E+ + CLD + L CHGS G+Q + +
Sbjct: 436 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 483
>gi|431895736|gb|ELK05155.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Pteropus alecto]
Length = 608
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 107/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVMWLQPLLAAIQEDRRTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YD G DIWGGENLE+SF
Sbjct: 300 FKWDLVPLPEPGGPEGATAPIKSPTMAGGLFAMNRDYFSELGQYDRGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLRTRSYGNISERVELRKKLGCKSFKWYLDNIYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + +G C+ + +P+ V L C
Sbjct: 457 VSGPNAKPQQPIFINRGPKRPKVLQRGRLYHLQTGKCLVAQGRPSQKGGLVVLKACDYSD 516
Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ + +H I + CLD + L CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELILSNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|426358553|ref|XP_004046573.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Gorilla gorilla gorilla]
gi|426358555|ref|XP_004046574.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Gorilla gorilla gorilla]
Length = 608
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 457 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNRCLVAQGRPSQKGGLVVLKACDYSD 516
Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ ++ E+ + CLD + L CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|332870119|ref|XP_003318977.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Pan
troglodytes]
Length = 527
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 167 CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 218
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 219 FKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 278
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 279 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 315
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 316 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 375
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 376 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 435
Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ ++ E+ + CLD + L CHGS G+Q + +
Sbjct: 436 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 483
>gi|114616856|ref|XP_001143140.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
3 [Pan troglodytes]
gi|114616860|ref|XP_001143304.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
4 [Pan troglodytes]
gi|410221964|gb|JAA08201.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410256658|gb|JAA16296.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410301646|gb|JAA29423.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410301648|gb|JAA29424.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410348810|gb|JAA41009.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
Length = 608
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 457 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516
Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ ++ E+ + CLD + L CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|402865473|ref|XP_003896947.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Papio
anubis]
Length = 608
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNIYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 457 ISGPHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516
Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ ++ E+ + CLD + L CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|395838351|ref|XP_003792079.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Otolemur garnettii]
Length = 608
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 147/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVMWLQPLLAAIREDQQTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGEEGATAPIKSPTMAGGLFAMNRQYFHDLGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNIYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 457 ISGPHAKPQQPIFINRGLSRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYGD 516
Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ + +H + + CLD + L CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|397469939|ref|XP_003806595.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Pan paniscus]
gi|397469941|ref|XP_003806596.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Pan paniscus]
Length = 608
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVMWLQPLLATIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 457 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516
Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ ++ E+ + CLD + L CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|58865788|ref|NP_001012109.1| polypeptide N-acetylgalactosaminyltransferase 14 [Rattus
norvegicus]
gi|50926091|gb|AAH79128.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14)
[Rattus norvegicus]
gi|149050682|gb|EDM02855.1| rCG61782, isoform CRA_b [Rattus norvegicus]
Length = 552
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/320 (30%), Positives = 139/320 (43%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 255 FQWEQLSVEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RK+H P P + + +
Sbjct: 315 RVWMCGGGLEIIPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG++ +R LR+NL C++FKWYLE V D S
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNIENRLNLRKNLHCQTFKWYLENVYPELRVPPDSSIQ 420
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLD-- 264
C++S + + L PC K G+ Q W + +I ++E CL
Sbjct: 421 KGNIRQRQKCLESQKQKNQETPHLRLSPCAKVKGDRAKSQVWAFTYTQQIIQEELCLSVV 480
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 481 TLFPGAPVVLVLCKNGDERQ 500
>gi|350400046|ref|XP_003485719.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Bombus impatiens]
Length = 643
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 89/289 (30%), Positives = 133/289 (46%), Gaps = 16/289 (5%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV KRW++PLL +A + + + P+I I DTF+ P GGF+W L
Sbjct: 271 IEVNKRWIEPLLSQIAHSKTIIAMPVIDIINPDTFQYTGSP--------LVRGGFNWGLH 322
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P ++ +P+ +PTMAGGLF++D+ +F KLG YD+G DIWGGENLE+SF
Sbjct: 323 FKWDNVPVGTFAHDEDFIKPIKSPTMAGGLFAMDRKYFTKLGEYDAGMDIWGGENLEISF 382
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
+ W E G D L D + L+
Sbjct: 383 RI-WMCGGSIELIPCSRVGHVFRRRRPYGTFDQHDTMLKNSLRVAHVWLDEYKDYFLKNV 441
Query: 184 FKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDM-----HKPVGLYP 238
K D+GD++ R LR+ L CK+F WYL V + D+ + D KP+ +
Sbjct: 442 QKVDYGDISERLNLRKRLKCKNFAWYLNVVYPELALPDDNKNRLKDKWAKIEQKPIQPWH 501
Query: 239 CHKQG-GNQFWM-MSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
K+ +Q+ + +S + E + G +IL PC K ++E
Sbjct: 502 SRKRNYTDQYQIRLSNSALCIQSEKDIKTKGSKLILAPCLRIKSQMWYE 550
>gi|47228512|emb|CAG05332.1| unnamed protein product [Tetraodon nigroviridis]
Length = 595
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 102/343 (29%), Positives = 145/343 (42%), Gaps = 95/343 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + ++ VV P+I I DT L + P + GGF+W L
Sbjct: 236 CEVNQMWLQPLLAPIRQDRRTVVCPVIDIISADT--LSYSPSPIVR------GGFNWGLH 287
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E K + P+ +PTMAGGLF+I++ +F ++G YD+G DIWGGENLE+SF
Sbjct: 288 FKWDPVPPAELKSPQGPVGPIRSPTMAGGLFAINRKYFNEIGQYDAGMDIWGGENLEISF 347
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 348 R--------------------IW---MCGGQLFIIPCSRVGHIFRKRRPYGSPGGQDTMA 384
Query: 177 GENLELSF------------------KGDFGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L+ + D+GD+ R LR+ L C+SF+WYL+
Sbjct: 385 HNSLRLAHVWMDEYKEQYLSMRPDLRQRDYGDIGERVALRKRLQCRSFRWYLDTVYPEMQ 444
Query: 212 ---------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGG 244
+ N + C+ + + + V L PC Q
Sbjct: 445 TVAGGNKHQPLFINKDLKRPKVLQRGRLRNLATNRCLVAQGRASQKGGVVVLRPCDPQDP 504
Query: 245 NQFWMMSKHGE-IRRDEACLDYAGGDVI----LYPCHGSKGNQ 282
Q W + G+ + CLD + L CHGS G+Q
Sbjct: 505 EQEWAYDEEGQLVLAGLLCLDVSEVRTFDPPRLMKCHGSGGSQ 547
>gi|195455372|ref|XP_002074693.1| GK23025 [Drosophila willistoni]
gi|194170778|gb|EDW85679.1| GK23025 [Drosophila willistoni]
Length = 599
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 99/321 (30%), Positives = 148/321 (46%), Gaps = 45/321 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
CE W +PLL + + + V+ P+I I + F+ T+ YK F +GGF WN
Sbjct: 244 CEGNVGWCEPLLQRIKESRTSVLVPIIDVIDANDFQYS------TNGYKAFQVGGFQWNG 297
Query: 63 QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
F+W +PERE++R + + P ++PTMAGGLF+ID+ +F ++G+YD D WGG
Sbjct: 298 HFDWVNLPEREKQRQRRECDQAREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGG 357
Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
ENLE+SF+ IP P P I+ A L D
Sbjct: 358 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDDY 415
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
+I+ +L F D GDVT R LR+ L CKSF WYL +V
Sbjct: 416 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFDWYLKNVYPEKFVPNKNVQYWGKVR 475
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
+ +C+D + + +GLYPC K +Q + + +R + +C D
Sbjct: 476 AVNANLCLDDLLQNNEKPFNLGLYPCGKTLQKSQLFSYTNSQVLRNELSCATVQHSDSPP 535
Query: 270 --VILYPCHGS-KGNQYFEYD 287
V++ PC S K N ++Y+
Sbjct: 536 RRVVMVPCSESDKFNDQWKYE 556
>gi|296488074|tpg|DAA30187.1| TPA: polypeptide N-acetylgalactosaminyltransferase 11-like [Bos
taurus]
Length = 605
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 99/333 (29%), Positives = 143/333 (42%), Gaps = 67/333 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 246 CEVNVLWLQPLLAAIREDRRTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 297
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 298 FKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISF 357
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RKR + P TM + + ++ Y S
Sbjct: 358 RIWMCGGKLFIIPCSRVGHIFRKR-RPYGSPEGQDTMTHNSLRLAHVWLDEYKQYFSLRP 416
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
N +G+++ R ELR+ L CKSFKWYL+
Sbjct: 417 DLRTRN--------YGNISERVELRKKLDCKSFKWYLDNIYPEMQISGPNVKPQQPIFIN 468
Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMS-KHGEIR 257
+ + + C+ + +P++ V L C NQ W+ + +H +
Sbjct: 469 RGPKRPKVLQRGRLYHLQTNKCLVAQGRPSEKGGLVVLKACDYSDPNQVWIYNEEHELVL 528
Query: 258 RDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
+ CLD + L CHGS G+Q + +
Sbjct: 529 NNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 561
>gi|156364641|ref|XP_001626455.1| predicted protein [Nematostella vectensis]
gi|156213331|gb|EDO34355.1| predicted protein [Nematostella vectensis]
Length = 512
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 97/338 (28%), Positives = 152/338 (44%), Gaps = 74/338 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + V P+I I DTFE SS GGF+W L
Sbjct: 150 CEVNINWLQPLLQHIHDDQKAVACPVIDVISSDTFEY--------SSSPMVRGGFNWGLH 201
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP + ++ +P+ +PTMAGGLF++D+ +F +LG YDSG DIWG ENLE+SF
Sbjct: 202 FTWEPIPPSLLVKPEDYVKPIRSPTMAGGLFAVDREYFTQLGKYDSGMDIWGAENLEISF 261
Query: 124 KF-----NWHAIPERER----KRHKNAAEPVWTPTMAGGLFSIDKAFFE--KLGTYDSGF 172
+ + +P +R + TM+ + + + + K Y
Sbjct: 262 RIWMCGGSLDILPCSRVGHLFRRFRPYGSDSKGDTMSRNSMRLAEVWLDGYKKYFYQIRH 321
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWY----------------------- 209
D+ G + FGD++ R +LR++L CKSF+WY
Sbjct: 322 DLEGKK---------FGDISQRIKLRKSLQCKSFEWYLKNIYPELKPPGQPGGGAFYPID 372
Query: 210 ----------------LEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKH 253
L+ S D G C+DS P++ ++ C + ++FW +++
Sbjct: 373 RRPQVVIWKGKVICIQLQTSFD-DGYCLDSPGHPSEKKASAVIHQC-ESTKSRFWSLNED 430
Query: 254 GEIRRDE-ACLDYAGGD----VILYPCHGSKGNQYFEY 286
GE++ + CL+ +G + L CH G Q +++
Sbjct: 431 GELKIESLLCLEASGYQSKLGLRLMKCHAQGGGQQWKF 468
>gi|260809642|ref|XP_002599614.1| hypothetical protein BRAFLDRAFT_217836 [Branchiostoma floridae]
gi|229284894|gb|EEN55626.1| hypothetical protein BRAFLDRAFT_217836 [Branchiostoma floridae]
Length = 432
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 88/289 (30%), Positives = 134/289 (46%), Gaps = 61/289 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDW-NL 62
CE WL+PLL+ ++ N + V P++ I + F F G L+ +G D +L
Sbjct: 160 CECMYGWLEPLLERISLNHTVVPWPVLDMIQHNDFAYLFHGGVLS------VGSVDLVDL 213
Query: 63 QFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
+FNWHA+P++E + K+ +P+ +PTM GG+FSI K +FE LG YD G +IWGGEN+ELS
Sbjct: 214 RFNWHAVPQKEFRARKSIIDPIRSPTMPGGVFSIHKKYFEYLGGYDDGMEIWGGENIELS 273
Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 182
F+ W + + G +F + Y + D W N L
Sbjct: 274 FRVIWQC----------GGTIELVPCSHVGHVFRVTSP-------YSAPVDKWMKNNKRL 316
Query: 183 S------FKG------------DFGDVTSRKELRRNLGCKSFKWYLEV------------ 212
+ +K + G+V RK LR+ L C F WY++
Sbjct: 317 AEVWMDDYKNVIYRKHPDYKTVETGNVMPRKVLRKALHCHDFSWYVQNVYPNLYVPDVRP 376
Query: 213 ----SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR 257
+G C+D+ + K L+ CH GGNQ+W ++ GE+R
Sbjct: 377 VAYGQVRMTGKCLDAVSPEKEQPK---LFGCHGLGGNQYWEFTRAGEVR 422
>gi|194220840|ref|XP_001500424.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Equus
caballus]
Length = 539
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 95/320 (29%), Positives = 140/320 (43%), Gaps = 62/320 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I D F S GGFDW+L
Sbjct: 189 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDNFNY-------IESATELRGGFDWSLH 241
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ + + AEP+ TP +AGGLF ++K++F+ LG YD DIWGGEN E+SF
Sbjct: 242 FQWEQLSPEQKAQRLDPAEPIRTPVIAGGLFVMNKSWFDYLGKYDMDMDIWGGENFEISF 301
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P + + +
Sbjct: 302 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTVE 347
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG++ SR +LR L C+SFKWYLE + D S
Sbjct: 348 VWMDEYKQYYYAARPFALERPFGNIDSRVDLRSTLLCQSFKWYLENVYPELRIPKDSSIQ 407
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
C++S + V L PC K G +Q W + +I ++E CL
Sbjct: 408 KGNIRQRQKCLESQKQDNQKISNVKLSPCVKSKGEDTMSQIWAFTYTQQIIQEELCLSVI 467
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 468 TVFPGAPVVLVLCKNEDDKQ 487
>gi|431904511|gb|ELK09894.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Pteropus alecto]
Length = 557
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 99/314 (31%), Positives = 140/314 (44%), Gaps = 50/314 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKISRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYL+ V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLDNVYPELTVPVKEVLPGIIKQGV 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
C++S + T + +G+ C N Q W S H I++ E CL G
Sbjct: 440 NCLESQGQDTAGNFLLGVGICRGSAKNPLASQAWTFSDH-LIQQQEKCLTATSTSISPGS 498
Query: 269 DVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 499 PVILQACNPREGRQ 512
>gi|301759363|ref|XP_002915525.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Ailuropoda melanoleuca]
gi|281339844|gb|EFB15428.1| hypothetical protein PANDA_003531 [Ailuropoda melanoleuca]
Length = 608
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + ++ VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVMWLQPLLAAIQQDQRTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELRR LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLRTKSYGNISERVELRRKLGCKSFKWYLDNIYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 457 ISGPNAKPQQPIFINRGPKRPKVLQRGRLYHLQTDKCLVAQGRPSQKGGLVVLKACDYSD 516
Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
Q W+ + +H + + CLD + L CHGS G+Q + +
Sbjct: 517 PGQIWIYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|19922324|ref|NP_611043.1| GalNAc-T1, isoform A [Drosophila melanogaster]
gi|24653878|ref|NP_725472.1| GalNAc-T1, isoform B [Drosophila melanogaster]
gi|51315876|sp|Q6WV20.2|GALT1_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
Short=pp-GaNTase 1; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 1; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1
gi|10121393|gb|AAG13184.1|AF218236_1 polypeptide N-acetylgalactosaminyltransferase [Drosophila
melanogaster]
gi|7303062|gb|AAF58130.1| GalNAc-T1, isoform B [Drosophila melanogaster]
gi|21064373|gb|AAM29416.1| RE14585p [Drosophila melanogaster]
gi|21645385|gb|AAM70974.1| GalNAc-T1, isoform A [Drosophila melanogaster]
gi|220947986|gb|ACL86536.1| GalNAc-T1-PA [synthetic construct]
Length = 601
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 97/324 (29%), Positives = 150/324 (46%), Gaps = 45/324 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
CE W +PLL + + + V+ P+I I + F+ T+ YK F +GGF WN
Sbjct: 247 CEGNIGWCEPLLQRIKESRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 300
Query: 63 QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
F+W +PERE++R + + P ++PTMAGGLF+ID+ +F ++G+YD D WGG
Sbjct: 301 HFDWINLPEREKQRQRRECKQEREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGG 360
Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
ENLE+SF+ IP P P I+ A L D
Sbjct: 361 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 418
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
+I+ +L F D GDVT R LR+ L CKSF+WYL +V
Sbjct: 419 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFEWYLKNIYPEKFVPTKDVQGWGKVH 478
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
S +C+D + + GLYPC K +Q + + +R + +C +
Sbjct: 479 AVNSNICLDDLLQNNEKPYNAGLYPCGKVLQKSQLFSFTNTNVLRNELSCATVQHSESPP 538
Query: 270 --VILYPC-HGSKGNQYFEYDYKY 290
V++ PC + N+ + Y++++
Sbjct: 539 YRVVMVPCMENDEFNEQWRYEHQH 562
>gi|34042906|gb|AAQ56699.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 601
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 97/324 (29%), Positives = 150/324 (46%), Gaps = 45/324 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
CE W +PLL + + + V+ P+I I + F+ T+ YK F +GGF WN
Sbjct: 247 CEGNIGWCEPLLQRIKESRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 300
Query: 63 QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
F+W +PERE++R + + P ++PTMAGGLF+ID+ +F ++G+YD D WGG
Sbjct: 301 HFDWINLPEREKQRQRRECKQEREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGG 360
Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
ENLE+SF+ IP P P I+ A L D
Sbjct: 361 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 418
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
+I+ +L F D GDVT R LR+ L CKSF+WYL +V
Sbjct: 419 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFEWYLKNIYPEKFVPTKDVQGWGKVH 478
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
S +C+D + + GLYPC K +Q + + +R + +C +
Sbjct: 479 AVNSNICLDDLLQNNEKPYNAGLYPCGKVLQKSQLFSFTNTNVLRNELSCATVQHSESPP 538
Query: 270 --VILYPC-HGSKGNQYFEYDYKY 290
V++ PC + N+ + Y++++
Sbjct: 539 YRVVMVPCMENDEFNEQWRYEHQH 562
>gi|449274705|gb|EMC83783.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Columba livia]
Length = 502
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 98/312 (31%), Positives = 133/312 (42%), Gaps = 48/312 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L GGFDW+L
Sbjct: 161 CEVNSEWLQPMLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 213
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + + + TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 214 FKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSF 273
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + K + +
Sbjct: 274 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYDFP--EGNALTYIKNTKRTAEVWMDEYK 326
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSN---------------DWSG 218
+ E + FG V R E RR L CKSF+WYLE G
Sbjct: 327 QYYYEARPSAIGKSFGSVAERVEQRRKLNCKSFQWYLENVYPELKIPEKELIPGIIKQGG 386
Query: 219 MCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA----GGDV 270
C++S + T + VG+ C N Q W+ S IR+ + CL G +
Sbjct: 387 NCLESQAQDTTGNTLVGMGNCKGTVSNPPVTQEWVFS-DPLIRQQDKCLSITSFSMGSHI 445
Query: 271 ILYPCHGSKGNQ 282
L C+ G Q
Sbjct: 446 TLEACNQKDGRQ 457
>gi|156407314|ref|XP_001641489.1| predicted protein [Nematostella vectensis]
gi|156228628|gb|EDO49426.1| predicted protein [Nematostella vectensis]
Length = 353
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 75/211 (35%), Positives = 106/211 (50%), Gaps = 11/211 (5%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WLQPLL + + + V P+I I F P + IGGF W++Q
Sbjct: 131 CEANVDWLQPLLSRIHSDRTIVAVPVIDIISSTNFMYSGTPSAV-------IGGFSWDMQ 183
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F WH++P + K+ P+ TPTMAGGLFSID+ +F + G+YD G D+WGGENLE+SF
Sbjct: 184 FTWHSLPNNRQSERKDRTAPIRTPTMAGGLFSIDRKYFFESGSYDEGMDVWGGENLEMSF 243
Query: 124 KFNWHAIPERER---KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENL 180
+ W + E R + + + GG + + + ++ +
Sbjct: 244 RI-WQCGGKLEILPCSRVGHVFRTRFPYSFPGGYSEVSVNLARVVHVWMDEYNQYVYMKR 302
Query: 181 ELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
+GD+TSR LR L CKSFKWYLE
Sbjct: 303 PDLQSLKYGDITSRVALRNKLKCKSFKWYLE 333
>gi|195488539|ref|XP_002092358.1| GE11714 [Drosophila yakuba]
gi|194178459|gb|EDW92070.1| GE11714 [Drosophila yakuba]
Length = 601
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 97/324 (29%), Positives = 150/324 (46%), Gaps = 45/324 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
CE W +PLL + + + V+ P+I I + F+ T+ YK F +GGF WN
Sbjct: 247 CEGNIGWCEPLLQRIKESRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 300
Query: 63 QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
F+W +PERE++R + + P ++PTMAGGLF+ID+ +F ++G+YD D WGG
Sbjct: 301 HFDWINLPEREKQRQRRECKHDREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGG 360
Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
ENLE+SF+ IP P P I+ A L D
Sbjct: 361 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 418
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
+I+ +L F D GDVT R LR+ L CKSF+WYL +V
Sbjct: 419 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFEWYLKNIYPEKFVPTKDVQGWGKVH 478
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
S +C+D + + GLYPC K +Q + + +R + +C +
Sbjct: 479 ALNSNICLDDLLQNNEKPYNAGLYPCGKVLQKSQLFSFTNTNVLRNELSCATVQHSESPP 538
Query: 270 --VILYPC-HGSKGNQYFEYDYKY 290
V++ PC + N+ + Y++++
Sbjct: 539 YRVVMVPCMENDEFNEQWRYEHQH 562
>gi|432950788|ref|XP_004084611.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 11-like [Oryzias
latipes]
Length = 574
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 98/328 (29%), Positives = 141/328 (42%), Gaps = 65/328 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + ++ VV P+I I DT SS GGF+W L
Sbjct: 215 CEVNQDWLQPLLAPIQKDRRTVVCPIIDIISADTLTY--------SSSPIVRGGFNWGLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + AA P+ +PTMAGGLF++++ +F +LG YD G DIWGGENLE+SF
Sbjct: 267 FKWDPVPPSEISGPEGAAGPIRSPTMAGGLFAMNREYFNELGRYDPGMDIWGGENLEISF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RKR + P TMA + + ++ Y +
Sbjct: 327 RIWMCGGQLLIIPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWMDE---YKEQYL 382
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
E S +GD++ R LR+ L C+SF+WYL+
Sbjct: 383 SLRPELRNRS----YGDISERVALRKRLQCRSFRWYLDTVYPEMQAVASGNRPPPLFVNK 438
Query: 212 ------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGE-IRR 258
+ N G C+ + + + V + PC + Q W + G+ +
Sbjct: 439 GLKRPKVLQRGRLRNLAVGRCLTAQGRASQKGGAVVVRPCDPRDPEQEWSYDEEGQLVLA 498
Query: 259 DEACLDYAGGDVI----LYPCHGSKGNQ 282
CLD + L CHGS G+Q
Sbjct: 499 GLLCLDVSEVRTFDPPRLMKCHGSGGSQ 526
>gi|432111808|gb|ELK34851.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Myotis davidii]
Length = 539
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 93/297 (31%), Positives = 135/297 (45%), Gaps = 47/297 (15%)
Query: 20 RNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNW-HAIPERERKRHK 78
++ + VVSP+I I D F+ + GGFDWNL F W + PE+ R R
Sbjct: 223 QDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLVFKWDYMTPEQRRARQG 275
Query: 79 NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF-----NWHAIPER 133
N P+ TP +AGGLF +DK++FE+LG YD D+WGGENLE+SF+ + IP
Sbjct: 276 NPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCS 335
Query: 134 ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKG------- 186
P P +G +F+ + ++W E +
Sbjct: 336 RVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMDEYKNFYYAAVPSARNV 386
Query: 187 DFGDVTSRKELRRNLGCKSFKWYLE-----------VSNDWSGMCIDSACKPTDMH---K 232
+G++ SR ELR+ L CK F+WYLE + + + C T H
Sbjct: 387 PYGNIQSRLELRKKLSCKPFRWYLENVYPELRVPDHQDIAFGALQQGTNCLDTLGHFADG 446
Query: 233 PVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI-LYPCHGSKGNQYFE 285
VG+Y CH GGNQ W ++K ++ + CL D G +I L C + Q +E
Sbjct: 447 VVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRTPGSLIKLQGCRENDSRQKWE 503
>gi|324505926|gb|ADY42538.1| N-acetylgalactosaminyltransferase 7 [Ascaris suum]
Length = 640
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 97/311 (31%), Positives = 141/311 (45%), Gaps = 34/311 (10%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + RN + P+I I T+ R G S+ + F G F+W L
Sbjct: 289 CEVNINWLPPLLAPIRRNRKVMTVPVIDGIDMHTWSYRRVYG---SADRHFRGIFEWGLL 345
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ I + E +R K +EP +PT AGGLF+IDK +FE+LG YD G IWGGE ELSF
Sbjct: 346 YKETEITKEEARRRKYNSEPFRSPTHAGGLFAIDKKWFEELGYYDPGLQIWGGEQYELSF 405
Query: 124 KFNWHA------IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
K W +P + P ++G I + T+ ++ +
Sbjct: 406 KI-WQCGGGILFVPCSHVGHVYRSHMPYGFGKLSGKPV-ISTNMVRVIKTWMDEYEKYYY 463
Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL---------------------EVSNDW 216
+ GD++++ ELR+ L CKSFKWY+ E N
Sbjct: 464 IREPSAKHRSPGDISAQLELRKRLHCKSFKWYMEKVAYDVVYSYPFLPENHVWGEAKNLQ 523
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCH 276
+ CID+ + + VG PCH GGNQ ++K G++ + E C+ G + C
Sbjct: 524 TSKCIDTMGRA--IPGIVGATPCHGYGGNQLIRLNKKGQLTQGEWCMTPLGNQLQTGHCA 581
Query: 277 GSKGNQYFEYD 287
+ F+YD
Sbjct: 582 KGTVDGPFQYD 592
>gi|194755004|ref|XP_001959782.1| GF13042 [Drosophila ananassae]
gi|190621080|gb|EDV36604.1| GF13042 [Drosophila ananassae]
Length = 599
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 97/324 (29%), Positives = 152/324 (46%), Gaps = 45/324 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
CE W +PLL + + + V+ P+I I + F+ T+ YK F +GGF WN
Sbjct: 245 CEGNIGWCEPLLQRIKESRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 298
Query: 63 QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
F+W +PERE++R + + P ++PTMAGGLF++D+ +F ++G+YD D WGG
Sbjct: 299 HFDWINLPEREKQRQRRECKQQREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGG 358
Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
ENLE+SF+ IP P P I+ A L D
Sbjct: 359 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 416
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------VSNDW---- 216
+I+ +L F D GDVT R LR+ L CK+F+WYL+ N W
Sbjct: 417 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKNFEWYLKNIYPEKFVPTHNVNAWGKVQ 476
Query: 217 ---SGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
+C+D + + VGLYPC K +Q + +K +R + +C +
Sbjct: 477 AVSGNLCLDDLLQNNEKPYNVGLYPCGKTLQKSQLFSFTKSQVLRNELSCATVQHSESPP 536
Query: 270 --VILYPC-HGSKGNQYFEYDYKY 290
V++ PC + N+ ++Y+ ++
Sbjct: 537 YRVVMVPCLENDEFNEQWKYERQH 560
>gi|3047207|gb|AAC13679.1| GLY9 [Caenorhabditis elegans]
Length = 579
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 89/315 (28%), Positives = 143/315 (45%), Gaps = 49/315 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+P++ ++ + +V P+I +I D+T + GGF W L
Sbjct: 230 CEANHGWLEPIVQRISDERTAIVCPMIDSISDNTLAYH-------GDWSLSTGGFSWALH 282
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + E E+KR + + +PTMAGGL + ++ +F ++G YD DIWGGENLE+SF
Sbjct: 283 FTWEGLSEEEQKRRTKPTDYIRSPTMAGGLLAANREYFFEVGGYDEEMDIWGGENLEISF 342
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ + IP A P + T + ++L ++W +
Sbjct: 343 RAWMCGGSIEFIPCSHVGHIFRAGHP-YNMTGRNNNKDVHGTNSKRLA------EVWMDD 395
Query: 179 NLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
L + D GD+T+R ELR+ L CK FKW+L+ +
Sbjct: 396 YKRLYYMHREDLRTKDVGDLTARHELRKRLNCKPFKWFLDNIAKGKFIMDEDVVAYGALH 455
Query: 214 NDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGN-QFWMMSKHGEIRRDEACLDYAGGDV 270
SG MC D+ + M + +G++ C +G + Q +SK G +RR+ C G++
Sbjct: 456 TVVSGTRMCTDTLQRDEKMSQLLGVFHCQGKGSSPQLMSLSKEGNLRRENTCASEENGNI 515
Query: 271 ILYPCHGSKGNQYFE 285
+ C SK Q+ E
Sbjct: 516 RMKTC--SKKAQFNE 528
>gi|73979014|ref|XP_539924.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Canis
lupus familiaris]
Length = 608
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 105/348 (30%), Positives = 147/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVMWLQPLLAAIQEDQQTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLRTKSYGNISERVELRKKLGCKSFKWYLDNIYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 457 ISGPNAKPQQPIFINRGPKRPKILQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516
Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
Q W+ + +H + + CLD + L CHGS G+Q + +
Sbjct: 517 PTQIWIYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|167519663|ref|XP_001744171.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777257|gb|EDQ90874.1| predicted protein [Monosiga brevicollis MX1]
Length = 607
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 96/322 (29%), Positives = 148/322 (45%), Gaps = 61/322 (18%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
EV K WL+P++ + + HVV P+I +I D+F + GG D L F
Sbjct: 256 EVSKGWLEPMMARINEDRKHVVMPIIDSIDPDSF-------------NYMRGGLDI-LGF 301
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
+W + R + EP+ +P MAGGLFS+D+ +F LG YD G ++GGE LE+SF+
Sbjct: 302 SWGMGQKSIGSRRRTRVEPMPSPIMAGGLFSMDRKYFFDLGGYDPGMKLYGGEELEISFR 361
Query: 125 F-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
IP R H W G ++++ K ++W E
Sbjct: 362 IWQCGGTLECIP-CSRVGHVFRTGAYWK----GQVYTVPGHVIVK--NKLRAAEVWMDEY 414
Query: 180 LELSFK--------GDFGDVTSRKELRRNLGCKSFKWYL--------------------E 211
E+ + D GD+++ +E+RR CK FKW+L E
Sbjct: 415 KEVVQRVMPPLPRGMDLGDLSAMQEIRRKFQCKPFKWFLKNVYPEMFVPNDEESIEASGE 474
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD----EACLDYAG 267
+ N + C D+ K +G+YPCH G Q +++SK G++R + CLD
Sbjct: 475 IRNPQTNACFDTLGASHQGAK-IGVYPCHHSHGTQEFVLSKAGDVRVAAMDFDNCLDRGN 533
Query: 268 GD--VILYPCHGSKGNQYFEYD 287
GD V ++PCH + GNQ +++D
Sbjct: 534 GDGSVGIWPCHQTGGNQAWKWD 555
>gi|71994065|ref|NP_001022876.1| Protein GLY-9, isoform a [Caenorhabditis elegans]
gi|51316113|sp|Q9U2C4.1|GALT9_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 9;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 9; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 9; Short=pp-GaNTase 9
gi|6018409|emb|CAB57897.1| Protein GLY-9, isoform a [Caenorhabditis elegans]
Length = 579
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 89/315 (28%), Positives = 143/315 (45%), Gaps = 49/315 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+P++ ++ + +V P+I +I D+T + GGF W L
Sbjct: 230 CEANHGWLEPIVQRISDERTAIVCPMIDSISDNTLAYH-------GDWSLSTGGFSWALH 282
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + E E+KR + + +PTMAGGL + ++ +F ++G YD DIWGGENLE+SF
Sbjct: 283 FTWEGLSEEEQKRRTKPTDYIRSPTMAGGLLAANREYFFEVGGYDEEMDIWGGENLEISF 342
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ + IP A P + T + ++L ++W +
Sbjct: 343 RAWMCGGSIEFIPCSHVGHIFRAGHP-YNMTGRNNNKDVHGTNSKRLA------EVWMDD 395
Query: 179 NLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
L + D GD+T+R ELR+ L CK FKW+L+ +
Sbjct: 396 YKRLYYMHREDLRTKDVGDLTARHELRKRLNCKPFKWFLDNIAKGKFIMDEDVVAYGALH 455
Query: 214 NDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGN-QFWMMSKHGEIRRDEACLDYAGGDV 270
SG MC D+ + M + +G++ C +G + Q +SK G +RR+ C G++
Sbjct: 456 TVVSGTRMCTDTLQRDEKMSQLLGVFHCQGKGSSPQLMSLSKEGNLRRENTCASEENGNI 515
Query: 271 ILYPCHGSKGNQYFE 285
+ C SK Q+ E
Sbjct: 516 RMKTC--SKKAQFNE 528
>gi|344273523|ref|XP_003408571.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Loxodonta africana]
Length = 555
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 98/314 (31%), Positives = 141/314 (44%), Gaps = 52/314 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKISRTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGM-------------- 219
+ E + FG V +R E R+ + CKSF+WYLE N + +
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLE--NVYPELTVPEKEVLPGTIKQ 437
Query: 220 ---CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA----GG 268
C++S + T +G+ C N Q W+ S H I++ CL A G
Sbjct: 438 GVNCLESQGQDTAGDTLLGMGICRGSAKNPVAAQEWLFSDH-LIQQQGKCLAAAFPSPGA 496
Query: 269 DVILYPCHGSKGNQ 282
V L C+ +G+Q
Sbjct: 497 LVALQACNSKEGSQ 510
>gi|410953276|ref|XP_003983298.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Felis catus]
Length = 527
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 147/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 167 CEVNVLWLQPLLAAIREDPRTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 218
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 219 FKWDLVPLSELGGPEGATAPIRSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISF 278
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 279 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 315
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELRR LGCKSFKWYL+
Sbjct: 316 HNSLRLAHVWLDEYKEQYFSLRPDLRTKSYGNISERVELRRKLGCKSFKWYLDNIYPEMQ 375
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 376 ISGPNAKPQQPIFINRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 435
Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
Q W+ + +H + + CLD + L CHGS G+Q + +
Sbjct: 436 PGQVWIYNEEHELVLNNLLCLDVSETRSSDPPRLMKCHGSGGSQQWTF 483
>gi|410953274|ref|XP_003983297.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Felis catus]
Length = 608
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 147/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVLWLQPLLAAIREDPRTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGPEGATAPIRSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELRR LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLRTKSYGNISERVELRRKLGCKSFKWYLDNIYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 457 ISGPNAKPQQPIFINRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516
Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
Q W+ + +H + + CLD + L CHGS G+Q + +
Sbjct: 517 PGQVWIYNEEHELVLNNLLCLDVSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|444724231|gb|ELW64842.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Tupaia chinensis]
Length = 654
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 106/348 (30%), Positives = 147/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVLWLQPLLAAIREDRRTVVCPVIDIISADTLAY--------SSSPAVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELAGAGGATAPIKSPTMAGGLFAMNRQYFSELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGQLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTRSYGNISERVELRKRLGCKSFKWYLDNVYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 457 IPGPNARPQQPVFVHRGPKRPRVLLRGRLYHLQTSRCLVAQGRPSQKGGLVVLKACDYGD 516
Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ + +H + + CLD + L CHGS G+Q + +
Sbjct: 517 PNQVWVYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|296210174|ref|XP_002751861.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Callithrix jacchus]
Length = 607
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 105/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 247 CEVNVMWLQPLLAAIREDQHTVVCPVIDIISADTLAY--------SSSPIVRGGFNWGLH 298
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 299 FRWDLVPLSELGGAEGATTPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 358
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 359 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 395
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 396 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERIELRKKLGCKSFKWYLDNIYPEMQ 455
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 456 TSGPHAKPQQPIFVNKGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYTD 515
Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
+Q W+ ++ E+ + CLD + L CHGS G+Q + +
Sbjct: 516 PDQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 563
>gi|403276614|ref|XP_003929989.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Saimiri boliviensis boliviensis]
Length = 566
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 95/323 (29%), Positives = 133/323 (41%), Gaps = 88/323 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVMWLQPLLAAIREDQHTVVCPVIDIISADTLAY--------SSSPIVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++D+ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FRWDLVPLSELGGAEGATTPIKSPTMAGGLFAMDRQYFHELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
+ E+ FS+ K
Sbjct: 360 RVILFFCVLNEQ------------------YFSLRPDLKTK------------------- 382
Query: 184 FKGDFGDVTSRKELRRNLGCKSFKWYLE-------------------------------- 211
+G+++ R ELR+ LGCKSFKWYL+
Sbjct: 383 ---SYGNISERVELRKKLGCKSFKWYLDNIYPEMQISGPHAKPQQPIFVNRGPKRPKVLQ 439
Query: 212 ---VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE-ACLDY-- 265
+ + S C+ + +P+ V L C NQ W+ ++ E+ + CLD
Sbjct: 440 RGRLRHLQSNTCLVAQGRPSQKGGLVVLKACDYTDPNQIWIYNEEHELVLNSLLCLDMSE 499
Query: 266 --AGGDVILYPCHGSKGNQYFEY 286
+ L CHGS G+Q + +
Sbjct: 500 TRSSDPPRLMKCHGSGGSQQWTF 522
>gi|334310655|ref|XP_001378662.2| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Monodelphis domestica]
Length = 563
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 96/312 (30%), Positives = 134/312 (42%), Gaps = 48/312 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L GGFDW+L
Sbjct: 222 CEVNSEWLQPMLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 274
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDKA+F LG YD+ DIWGGEN ELSF
Sbjct: 275 FKWEQIPIEQKMSRTDPTQPIRTPVIAGGIFVIDKAWFNHLGKYDTQMDIWGGENFELSF 334
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + K + +
Sbjct: 335 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYDFP--EGNALTYIKNTKRTAEVWMDEYK 387
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSN---------------DWSG 218
+ E + FG + R+E R+ + CKSF+WYLE
Sbjct: 388 QYYYEARPSAIGKSFGSIADREEQRKKMNCKSFQWYLENVYPELKIPEKEMIPGIIKQGT 447
Query: 219 MCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDY----AGGDV 270
+C++S + T + V + C N Q W+ S IR+ + CL G V
Sbjct: 448 ICLESQGQDTAGNNLVVMGSCKGTSNNPSMTQEWVFSD-PLIRQQDKCLAITSFSTGSQV 506
Query: 271 ILYPCHGSKGNQ 282
L C+ G Q
Sbjct: 507 TLEACNQKDGRQ 518
>gi|170038571|ref|XP_001847122.1| N-acetyl galactosaminyl transferase [Culex quinquefasciatus]
gi|167882321|gb|EDS45704.1| N-acetyl galactosaminyl transferase [Culex quinquefasciatus]
Length = 560
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 143/303 (47%), Gaps = 39/303 (12%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL+ LLD +ARN + P I I + LR T + + G +DW+L F W
Sbjct: 214 WLEALLDPVARNWMTIAIPTIDWIDEHDMHLR------TENAPTYYGAYDWDLNFGWWGR 267
Query: 70 PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW-- 127
R K+ +N EP TP MAGGLF+I++ FFE LG YD GF+I+G EN+ELS K +W
Sbjct: 268 WSRV-KQPQNKLEPFETPAMAGGLFAINRTFFELLGWYDEGFEIYGIENIELSMK-SWIC 325
Query: 128 ----HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSG-FDIWGGENLE 181
+P + P + + E + Y FDI+G
Sbjct: 326 GGKMLTVPCSRVAHIQKTGHPYLMKANKDVVRANSLRLAEVWMDEYKQVIFDIYGLPRYP 385
Query: 182 LSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVSNDWSGMCIDS 223
+ + GDV+SRKE+RR CK+F++Y+ EV N + + D+
Sbjct: 386 VE---EVGDVSSRKEVRRKANCKTFRYYIETAYPEMKNPLIEGAFRGEVKN--AALGNDT 440
Query: 224 ACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQY 283
VG+ C +QFW+ + + E+ + CLDY G ++ ++ CH +GNQ
Sbjct: 441 CLTYHAATNTVGMASCDHAEKSQFWVHNYYQELNSYKHCLDYTGSELGVFGCHRGRGNQA 500
Query: 284 FEY 286
++Y
Sbjct: 501 WQY 503
>gi|348568069|ref|XP_003469821.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Cavia porcellus]
Length = 608
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 99/333 (29%), Positives = 144/333 (43%), Gaps = 66/333 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNEMWLQPLLATIRGDPHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGEDGATAPIKSPTMAGGLFAMNRQYFNELGQYDSGMDIWGGENLEISF 359
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RKR + P TM + + D D
Sbjct: 360 RIWMCGGKLFIIPCSRVGHIFRKR-RPYGSPEGQDTMTHNSLRLAHVWL------DEYKD 412
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
+ +L K +G+++ R ELR+ LGC+SFKWYL+
Sbjct: 413 QYFSLRPDLKTK-SYGNISERVELRKRLGCRSFKWYLDNIYPEMQVQGPNAKAQQPVFVN 471
Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMS-KHGEIR 257
+ + + C+ + +P+ V L C + Q W+ + +H +
Sbjct: 472 RGPKRPRVLRRGRLYHFQTNKCLVAQGRPSQKGSLVVLKACDYRDPAQVWIYNEEHELVL 531
Query: 258 RDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
+ CLD + L CHGS G+Q + +
Sbjct: 532 NNLLCLDVSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|328783898|ref|XP_003250361.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Apis
mellifera]
Length = 603
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 97/297 (32%), Positives = 138/297 (46%), Gaps = 40/297 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A+N + VVSP+I I DDTF T S++ G F+W+L
Sbjct: 250 CECTVGWLEPLLEAVAKNRTRVVSPVIDIINDDTFSY-------TRSFELHWGAFNWDLH 302
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + R K R +N EP TP MAGGLFS+++ +F +LG+YD+ IWGGENLELS
Sbjct: 303 FRWLTLNGRLLKERRENIVEPFRTPAMAGGLFSMNRDYFFELGSYDNQMKIWGGENLELS 362
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY--DSGFDIW 175
F+ + P + P T GG+ I ++ D + +
Sbjct: 363 FRVWQCGGSIEIAPCSHVGHLFRKSSPY---TFPGGVGEILYGNLARVALVWMDEWAEFY 419
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSNDWSG 218
N E + D + SR ELR+ L CK+F+WYL+ + + S
Sbjct: 420 FKFNAEAARLRDKQTIRSRLELRKKLQCKNFEWYLDNIWPEHFFPKDDRFFGRIVHILSK 479
Query: 219 MCIDSACKPTDMHKPVGLYPCH----KQGGNQFWMMSKHGEIRRDEA-CLDYAGGDV 270
CI +P G H + NQ ++M+ G I DE+ CLD D
Sbjct: 480 KCIMRPSAKGTYSQPSGYAILHSCVPRPLLNQMFVMTADGIIMTDESVCLDAPENDT 536
>gi|395849607|ref|XP_003797413.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Otolemur garnettii]
Length = 558
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 99/315 (31%), Positives = 139/315 (44%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEVLPGIIKQGV 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C N Q W+ S H I++ CL G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513
>gi|339242863|ref|XP_003377357.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
spiralis]
gi|316973849|gb|EFV57398.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
spiralis]
Length = 383
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 85/225 (37%), Positives = 119/225 (52%), Gaps = 35/225 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLLD +A + V+P+I I D+TF+ + + GGF+WNLQ
Sbjct: 155 CECTEGWLEPLLDRIAFDRKIAVAPVIDVINDETFQYQ-------KGIDVYRGGFNWNLQ 207
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W++ P E KR N PV TPT+AGGLFSID+ FF ++G YD IWGGENLE+S
Sbjct: 208 FRWYSSPPSELKRRGNDVTHPVRTPTIAGGLFSIDRQFFFEIGAYDKEMKIWGGENLEMS 267
Query: 123 FKF-----NWHAIP-------ERERKRHK----NAAEPVWTPTMAGGLFSIDKAFFEKLG 166
F+ IP R++ H N+A T+ L + + + ++
Sbjct: 268 FRIWQCGGQLEIIPCSHVGHVFRKKSPHDFPRGNSAR-----TLTTNLVRVAEVWMDE-- 320
Query: 167 TYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ S F I +S + DV+ RKELR+ L CKSF WYL+
Sbjct: 321 -WKSLFYIISSAAKNIS---EIIDVSERKELRKRLKCKSFAWYLD 361
>gi|313233395|emb|CBY24510.1| unnamed protein product [Oikopleura dioica]
Length = 679
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 107/327 (32%), Positives = 149/327 (45%), Gaps = 70/327 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E + WL+PLL + + + VVSP+I I D F L GGF+W+L
Sbjct: 333 VEANEGWLEPLLGRIHESRTAVVSPIIDVIGMDDFHYVGASADLK-------GGFNWDLV 385
Query: 64 FNWHAIPERERKRHKNA-AEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + E+ER+ + A P+ TP +AGGLFSIDK +F +LG YD D+WGGENLE+S
Sbjct: 386 FKWDYMSEQERRERRRAPTSPIRTPMIAGGLFSIDKNWFHELGEYDMDMDVWGGENLEIS 445
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ IP RK+H P P +G +F+ +
Sbjct: 446 FRVWQCHGTLEIIPCSRVGHVFRKKH-----PYTFPGGSGNVFAKNTR---------RAA 491
Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-------------- 211
++W E E F FGD++ R E+R L CKSF W+LE
Sbjct: 492 EVWMDEYKEFYFAAVPSAKMVKFGDISKRTEVRERLQCKSFSWFLENVYPELRIPNKDAI 551
Query: 212 ----VSNDWSGM--CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHG-EIRRDEACLD 264
VS G+ CI + T +G+Y CH GGNQ + ++K G E R ++ C+
Sbjct: 552 GWGAVSQTNKGLEECIGN----THGGGTLGMYRCHGDGGNQEFTLTKEGKEFRHNDLCIG 607
Query: 265 Y-----AGGDVILYPCHGSKGNQYFEY 286
Y G V CH +Q +EY
Sbjct: 608 YNAKEPVGNPVKFNTCH-QMSHQRWEY 633
>gi|380030377|ref|XP_003698825.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Apis florea]
Length = 595
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 97/297 (32%), Positives = 138/297 (46%), Gaps = 40/297 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A+N + VVSP+I I DDTF T S++ G F+W+L
Sbjct: 242 CECTVGWLEPLLEAVAKNRTRVVSPVIDIINDDTFSY-------TRSFELHWGAFNWDLH 294
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + R K R +N EP TP MAGGLFS+++ +F +LG+YD+ IWGGENLELS
Sbjct: 295 FRWLTLNGRLLKERRENIVEPFRTPAMAGGLFSMNRDYFFELGSYDNQMKIWGGENLELS 354
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY--DSGFDIW 175
F+ + P + P T GG+ I ++ D + +
Sbjct: 355 FRVWQCGGSIEIAPCSHVGHLFRKSSPY---TFPGGVGEILYGNLARVALVWMDEWAEFY 411
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSNDWSG 218
N E + D + SR ELR+ L CK+F+WYL+ + + S
Sbjct: 412 FKFNAEAARLRDKQTIRSRLELRKKLQCKNFEWYLDNIWPEHFFPKDDRFFGRIVHILSK 471
Query: 219 MCIDSACKPTDMHKPVGLYPCH----KQGGNQFWMMSKHGEIRRDEA-CLDYAGGDV 270
CI +P G H + NQ ++M+ G I DE+ CLD D
Sbjct: 472 KCIMRPSAKGTYSQPSGYAILHSCVPRPLLNQMFVMTTDGIIMTDESVCLDAPENDT 528
>gi|118097436|ref|XP_414578.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Gallus
gallus]
Length = 611
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 105/327 (32%), Positives = 144/327 (44%), Gaps = 61/327 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F G T + G FDW +
Sbjct: 245 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDHF------GYETQAGDAMRGAFDWEMY 298
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD+G +IWGGE E+SF
Sbjct: 299 YKRIPIPPELQKL--DPSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISF 356
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPT---MAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P PT +A L + + + ++ Y
Sbjct: 357 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPTGVSLARNLKRVAEVWMDEYAEY---IYQR 413
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDVT++KELR NL CKSFKW++ E+
Sbjct: 414 RPEYRHLS----AGDVTAQKELRNNLNCKSFKWFMNEVAWDLPKFYPPVEPPAAAWGEIR 469
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
N +G+C+D+ K + P+ L C K G W S +IR +
Sbjct: 470 NVGTGLCVDT--KHGSLGSPLRLESCVKDRGEAAWNNVQVFTFSWREDIRPGDPQHTKKF 527
Query: 262 CLDYA--GGDVILYPCHGSKGNQYFEY 286
C D V LY CHG KGNQ + Y
Sbjct: 528 CFDAISHSSPVTLYDCHGMKGNQLWRY 554
>gi|440895697|gb|ELR47827.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Bos grunniens
mutus]
Length = 606
Score = 130 bits (327), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 105/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 246 CEVNVLWLQPLLAAIREDRQTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 297
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 298 FKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISF 357
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 358 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 394
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ L CKSFKWYL+
Sbjct: 395 HNSLRLAHVWLDEYKEQYFSLRPDLRTRNYGNISERVELRKKLDCKSFKWYLDNIYPEMQ 454
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P++ V L C
Sbjct: 455 ISGPNVKPQQPIFINRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSEKGGLVVLKACDYSD 514
Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ + +H + + CLD + L CHGS G+Q + +
Sbjct: 515 PNQVWIYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 562
>gi|344276552|ref|XP_003410072.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Loxodonta africana]
Length = 527
Score = 130 bits (326), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 88/233 (37%), Positives = 115/233 (49%), Gaps = 56/233 (24%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + VV P+I I DT L SS GGF+W L
Sbjct: 248 CEVNEMWLQPLLAAVREDPHTVVCPVIDIISADTL--------LYSSSPIVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPFDELGGPEGATAPIKSPTMAGGLFAMNRHYFSELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTRSYGNISERVELRKKLGCKSFKWYLD 449
>gi|313246954|emb|CBY35800.1| unnamed protein product [Oikopleura dioica]
Length = 696
Score = 130 bits (326), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 107/327 (32%), Positives = 149/327 (45%), Gaps = 70/327 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E + WL+PLL + + + VVSP+I I D F L GGF+W+L
Sbjct: 350 VEANEGWLEPLLGRIHESRTAVVSPIIDVIGMDDFHYVGASADLK-------GGFNWDLV 402
Query: 64 FNWHAIPERERKRHKNA-AEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + E+ER+ + A P+ TP +AGGLFSIDK +F +LG YD D+WGGENLE+S
Sbjct: 403 FKWDYMSEQERRERRRAPTSPIRTPMIAGGLFSIDKNWFHELGEYDMDMDVWGGENLEIS 462
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ IP RK+H P P +G +F+ +
Sbjct: 463 FRVWQCHGTLEIIPCSRVGHVFRKKH-----PYTFPGGSGNVFAKNTR---------RAA 508
Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-------------- 211
++W E E F FGD++ R E+R L CKSF W+LE
Sbjct: 509 EVWMDEYKEFYFAAVPSAKMVKFGDISKRTEVRERLQCKSFSWFLENVYPELRIPNKDAI 568
Query: 212 ----VSNDWSGM--CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHG-EIRRDEACLD 264
VS G+ CI + T +G+Y CH GGNQ + ++K G E R ++ C+
Sbjct: 569 GWGAVSQTNKGLEECIGN----THGGGTLGMYRCHGDGGNQEFTLTKEGKEFRHNDLCIG 624
Query: 265 Y-----AGGDVILYPCHGSKGNQYFEY 286
Y G V CH +Q +EY
Sbjct: 625 YNAKEPVGNPVKFNTCH-QMSHQRWEY 650
>gi|426228257|ref|XP_004008230.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Ovis
aries]
Length = 606
Score = 130 bits (326), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 105/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 246 CEVNVLWLQPLLAAIREDRRAVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 297
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 298 FKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISF 357
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 358 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 394
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ L CKSFKWYL+
Sbjct: 395 HNSLRLAHVWLDEYKEQYFSLRPDLRTRNYGNISERVELRKKLDCKSFKWYLDNIYPEMQ 454
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P++ V L C
Sbjct: 455 ISGPNVKPQQPIFINRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSEKGGLVVLKACDYSD 514
Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ + +H + + CLD + L CHGS G+Q + +
Sbjct: 515 PNQVWIYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 562
>gi|155371981|ref|NP_001094597.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Bos taurus]
gi|151554939|gb|AAI47930.1| GALNTL1 protein [Bos taurus]
gi|296482974|tpg|DAA25089.1| TPA: polypeptide N-acetylgalactosaminyltransferase-like 1 [Bos
taurus]
Length = 557
Score = 130 bits (326), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 97/314 (30%), Positives = 141/314 (44%), Gaps = 50/314 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + F
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEFK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYL+ V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLDNVYPELTVPVKEVLPGIIKQGT 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
C++S + T + +G+ C N Q W+ + H I++ CL G
Sbjct: 440 NCLESQGQDTAGNFQLGMGICRGSAKNPPAAQAWLFTDH-LIQQQGKCLAATSTSVSPGS 498
Query: 269 DVILYPCHGSKGNQ 282
V+L C+ +G Q
Sbjct: 499 LVVLQACNPREGRQ 512
>gi|440897357|gb|ELR49068.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Bos grunniens mutus]
Length = 557
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 97/314 (30%), Positives = 141/314 (44%), Gaps = 50/314 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + F
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEFK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYL+ V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLDNVYPELTVPVKEVLPGIIKQGT 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
C++S + T + +G+ C N Q W+ + H I++ CL G
Sbjct: 440 NCLESQGQDTAGNFQLGMGICRGSAKNPPAAQAWLFTDH-LIQQQGKCLAATSTSVSPGS 498
Query: 269 DVILYPCHGSKGNQ 282
V+L C+ +G Q
Sbjct: 499 LVVLQACNPREGRQ 512
>gi|326928540|ref|XP_003210435.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Meleagris gallopavo]
Length = 562
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 105/328 (32%), Positives = 144/328 (43%), Gaps = 62/328 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F G T + G FDW +
Sbjct: 195 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDHF------GYETQAGDAMRGAFDWEMY 248
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD+G +IWGGE E+SF
Sbjct: 249 YKRIPIPPELQKL--DPSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISF 306
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPT---MAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P PT +A L + + + ++ Y
Sbjct: 307 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPTGVSLARNLKRVAEVWMDEYAEY---IYQR 363
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDVT++KELR NL CKSFKW++ E+
Sbjct: 364 RPEYRHLS----AGDVTAQKELRNNLNCKSFKWFMSEVAWDLPKFYPPVEPPAAAWGEIR 419
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW-------MMSKHGEIRRDEA----- 261
N +G+C+D+ K + P+ L C K G W S +IR +
Sbjct: 420 NVGTGLCVDT--KHGSLGSPLRLESCVKDRGEAAWNNVQVTXTFSWREDIRPGDPQHTKK 477
Query: 262 -CLDYA--GGDVILYPCHGSKGNQYFEY 286
C D V LY CHG KGNQ + Y
Sbjct: 478 FCFDAISHSSPVTLYDCHGMKGNQLWRY 505
>gi|358412070|ref|XP_870404.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
3 [Bos taurus]
gi|359064998|ref|XP_002687097.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Bos
taurus]
Length = 606
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 105/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 246 CEVNVLWLQPLLAAIREDRRTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 297
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 298 FKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISF 357
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 358 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 394
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ L CKSFKWYL+
Sbjct: 395 HNSLRLAHVWLDEYKEQYFSLRPDLRTRNYGNISERVELRKKLDCKSFKWYLDNIYPEMQ 454
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P++ V L C
Sbjct: 455 ISGPNVKPQQPIFINRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSEKGGLVVLKACDYSD 514
Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
NQ W+ + +H + + CLD + L CHGS G+Q + +
Sbjct: 515 PNQVWIYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 562
>gi|390341984|ref|XP_003725567.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Strongylocentrotus purpuratus]
Length = 654
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 75/217 (34%), Positives = 108/217 (49%), Gaps = 22/217 (10%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV ++WL+PLL+ + +S VV P+I I DTF P GGF+W +
Sbjct: 286 CEVNEQWLEPLLERIKADSHTVVCPIIDIINHDTFAYTASP--------LVKGGFNWGMH 337
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W I R+ ++ +P+ +PTMAGGLF++++ +F KLG YD G DIWGGENLE+SF
Sbjct: 338 FKWDTIRSRQLVGKEDYVKPIESPTMAGGLFAMNREYFHKLGDYDEGMDIWGGENLEISF 397
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ +P P +P D + + D +
Sbjct: 398 RIWQCGGKLEIVPCSRVGHVFRKRRPYGSPNRQ------DTTTKNAVRVAEVWMDEYKEH 451
Query: 179 NLELSFKG---DFGDVTSRKELRRNLGCKSFKWYLEV 212
++ K D+GD++SR LR L CKSFKWYL+
Sbjct: 452 FYQVQPKAKNIDYGDISSRVALREELKCKSFKWYLDT 488
>gi|357602062|gb|EHJ63261.1| putative n-acetylgalactosaminyltransferase [Danaus plexippus]
Length = 499
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 93/295 (31%), Positives = 136/295 (46%), Gaps = 45/295 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + R+ + P+I I TFE R P +Y+ G F+W +
Sbjct: 146 CEVNVNWLPPLLAPIYRDYKIMTVPVIDGIDHKTFEYR-PVYSHGINYR---GIFEWGML 201
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +P+RE HK+ +EP +PT AGGLF+I++ +F ++G YD G +WGGEN ELSF
Sbjct: 202 YKENEVPDREASLHKHKSEPYKSPTHAGGLFAINRNYFLEIGAYDPGLLVWGGENFELSF 261
Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K W E R H A + P G L K + Y + W E
Sbjct: 262 KI-WQCGGSIEWVPCSRVGHVYRA---FMPYSFGNLAKNRKGSLITI-NYKRVIETWFDE 316
Query: 179 NLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYLE------------------- 211
+ F D GD++ + LR L CKSF WY+E
Sbjct: 317 EHKEFFYTREPMARFLDMGDISEQVALRDKLNCKSFSWYMENVAYDVYDKFPKLPKNVHW 376
Query: 212 --VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD 264
V N G+C+D+ K + +G+ CH G NQ + +++ G++ E CL+
Sbjct: 377 GMVKNKAIGLCLDTMGKAAPSY--IGIQSCHGAGNNQLYRLNEAGQLGVGERCLE 429
>gi|158289457|ref|XP_311182.4| AGAP000656-PA [Anopheles gambiae str. PEST]
gi|157018524|gb|EAA06901.4| AGAP000656-PA [Anopheles gambiae str. PEST]
Length = 598
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 144/327 (44%), Gaps = 63/327 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + R+ + + P+I I TFE R + + + G F+W +
Sbjct: 245 CEVNTNWLPPLLAPIHRDRTVMTVPIIDGIDHKTFEYR----PVYADGHHYRGIFEWGML 300
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +P RE+KR K+ +EP +PT AGGLF+I++ FF +LG YDSG +WGGEN ELSF
Sbjct: 301 YKENEVPRREQKRRKHDSEPYRSPTHAGGLFAINRKFFLELGAYDSGLLVWGGENFELSF 360
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAF----FEKLGTYDSG--FDIWGG 177
K W E W P G + + F F KL G I
Sbjct: 361 KI-WQCGGSIE-----------WVPCSRVG--HVYRGFMPYNFGKLANKKKGPLITINYK 406
Query: 178 ENLELSFKG----------------DFGDVTSRKELRRNLGCKSFKWYL----------- 210
+E F G D GD++ + L+ L CKSF+WY+
Sbjct: 407 RVIETWFDGPYKEYFYTREPLARFLDMGDISEQLALKERLQCKSFQWYMDNVAYDVLDKY 466
Query: 211 ----------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
E+ N C+D+ + +GL CH QG NQ ++ G++ E
Sbjct: 467 PMLPANVKWGELQNVGKEKCVDALGRQPP--AVIGLQQCHGQGHNQLIRLNGAGQLGVGE 524
Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYD 287
C++ ++ L C + ++YD
Sbjct: 525 RCIEAYNSEIKLAFCRLGTVDGPWQYD 551
>gi|47228720|emb|CAG07452.1| unnamed protein product [Tetraodon nigroviridis]
Length = 611
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 97/302 (32%), Positives = 136/302 (45%), Gaps = 59/302 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VVSP I I +TF+ P + SS+ + G FDW L
Sbjct: 266 CECFHGWLEPLLARIVEEPTAVVSPEITTIDLETFQFNKP---VASSHAYNRGNFDWGLT 322
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IPE RK K+ PV TPT AGGLFSI K++FE +GTYD +IWGGEN+E+SF
Sbjct: 323 FGWEQIPEAARKLRKDETYPVKTPTFAGGLFSILKSYFEHIGTYDDKMEIWGGENIEMSF 382
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
+ W + E + ++ G +F T+ G D+ + L+
Sbjct: 383 RV-WQCGGQLE----------IIPCSVVGHVFRTKSPH-----TFPKGTDVITRNQVRLA 426
Query: 184 ------FKGDF------------GDVTSRK-----------ELRRNLGCKSFK------- 207
+K F D+T K L R+ K+
Sbjct: 427 EVWMDDYKKIFYRRNRNAENMAKEDLTPEKYGAVRHTFLSITLERSSFLKNVTPLFIFDP 486
Query: 208 WYLEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLD 264
+ ++ N S C+D + KPV +Y CH GGNQ++ S H E+R + E CL
Sbjct: 487 YVAQIQNQGSKTCLDVG-ENNKGGKPVIMYQCHNMGGNQYFEYSSHNELRHNIGKEFCLH 545
Query: 265 YA 266
A
Sbjct: 546 AA 547
>gi|308487864|ref|XP_003106127.1| CRE-GLY-6 protein [Caenorhabditis remanei]
gi|308254701|gb|EFO98653.1| CRE-GLY-6 protein [Caenorhabditis remanei]
Length = 693
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 79/215 (36%), Positives = 114/215 (53%), Gaps = 15/215 (6%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE K WL+PLL + N V P+I I D+TF+ + + F GGF+WNLQ
Sbjct: 254 CECTKGWLEPLLTRIKLNRKAVPCPVIDIINDNTFQYQ-------KGIEMFRGGFNWNLQ 306
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P K+H + P+ +PTMAGGLFSID+ +FE+LG YD G DIWGGENLE+S
Sbjct: 307 FRWYGMPTEMAKQHLLDPTGPIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMS 366
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ +P + P P + G ++ + + +
Sbjct: 367 FRIWQCGGRVEILPCSHVGHVFRKSSPHDFPGKSSGKV-LNANLLRVAEVWMDEWKYYFY 425
Query: 178 ENLELSFK-GDFGDVTSRKELRRNLGCKSFKWYLE 211
+ ++F+ + DV+ R ELR+ L CKSFKWYL+
Sbjct: 426 KIAPVAFRMRESIDVSERVELRKKLNCKSFKWYLQ 460
>gi|149634819|ref|XP_001513114.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Ornithorhynchus anatinus]
Length = 608
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 102/336 (30%), Positives = 143/336 (42%), Gaps = 66/336 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNAMWLQPLLVPIREDRRTVVCPVIDIIGADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGPGRATAPIKSPTMAGGLFAMNREYFRELGQYDSGMDIWGGENLEISF 359
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RKR + P TMA + + D +
Sbjct: 360 RIWMCGGQLFIIPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWM------DEYKE 412
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
+ EL + +G+++ R LR+ LGCKSFKWYL+
Sbjct: 413 QYFALRPELRLR-SYGNISERVTLRKKLGCKSFKWYLDTVYPEMQISGPNARPQPPAFVN 471
Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMS-KHGEIR 257
+ + + C+ + P+ V L C NQ W+ + +H I
Sbjct: 472 RGPKRPRILQRGRLYHLQTNKCLAAQGHPSQKGGRVVLKECDYGDLNQVWIYNEEHELIL 531
Query: 258 RDEACLDY----AGGDVILYPCHGSKGNQYFEYDYK 289
+ CLD + L CHGS G+Q + + K
Sbjct: 532 NNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTFGRK 567
>gi|224051278|ref|XP_002200509.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Taeniopygia guttata]
Length = 570
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 97/314 (30%), Positives = 134/314 (42%), Gaps = 52/314 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L GGFDW+L
Sbjct: 229 CEVNSEWLQPMLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 281
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + + + TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 282 FKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSF 341
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + K + +
Sbjct: 342 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYDFP--EGNALTYIKNTKRTAEVWMDEYK 394
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGM-------------- 219
+ E + FG V R ELR L CKSF+WYLE N + +
Sbjct: 395 QYYYEARPSAIGKSFGSVADRVELRHKLNCKSFQWYLE--NVYPELKIPEKELIPGIIRQ 452
Query: 220 ---CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA----GG 268
C++S + + G+ C N Q W+ S IR+ + CL A G
Sbjct: 453 GENCLESQAQDITGNVLAGMGNCKGTVNNPPVTQEWIFSDPS-IRQQDKCLSIASFSTGS 511
Query: 269 DVILYPCHGSKGNQ 282
+ L C+ G Q
Sbjct: 512 QITLEACNQKDGRQ 525
>gi|254910954|ref|NP_082140.2| polypeptide N-acetylgalactosaminyltransferase 14 [Mus musculus]
gi|115527999|gb|AAI17801.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 [Mus musculus]
Length = 550
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 98/320 (30%), Positives = 140/320 (43%), Gaps = 64/320 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 255 FQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RK+H P P + + +
Sbjct: 315 RVWMCGGGLEIIPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG++ +R LR+NL C++FKWYLE V D S
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNIENRLNLRKNLHCQTFKWYLENVYPELRVPPDSSIQ 420
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLD-- 264
C++S + + + L PC K G+ Q W + +I ++E CL
Sbjct: 421 KGNIRQRQKCLES--QKQKKQEILRLSPCAKVKGDGAKSQVWAFTYTQQIIQEELCLSVV 478
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 479 TLFPGAPVVLALCKNGDERQ 498
>gi|52851353|dbj|BAD52069.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase [Mus musculus]
Length = 550
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 95/315 (30%), Positives = 136/315 (43%), Gaps = 54/315 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 255 FQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ IP P P + + ++W E
Sbjct: 315 RVWMCGGGLEIIPCSRVGHVFRKKHPYVFPDGNANTY---------IKNTKRTAEVWMDE 365
Query: 179 NLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS------- 217
+ + + FG++ +R LR+NL C++FKWYLE V D S
Sbjct: 366 YKQYYYAARPFALERPFGNIENRLNLRKNLHCQTFKWYLENVYPELRVPPDSSIQKGNIR 425
Query: 218 --GMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLD----YAG 267
C++S + + + L PC K G+ Q W + +I ++E CL + G
Sbjct: 426 QRQKCLES--QKQKKQEILRLSPCAKVKGDGAKSQVWAFTYTQQIIQEELCLSVVTLFPG 483
Query: 268 GDVILYPCHGSKGNQ 282
V+L C Q
Sbjct: 484 APVVLALCKNGDERQ 498
>gi|157134100|ref|XP_001663146.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108870595|gb|EAT34820.1| AAEL012972-PA [Aedes aegypti]
Length = 600
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 90/311 (28%), Positives = 142/311 (45%), Gaps = 48/311 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +WL+PLL+ + ++S+ V+ P+I D E + F IGGF W+
Sbjct: 243 CECMHQWLEPLLERIKQSSTSVLVPII-----DVIEAKNFYYSTNGVTDFQIGGFTWDGH 297
Query: 64 FNWHAIPERERKRHKN-------AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
F+WH + +RE++R K A P ++PTMAGGLF+I + +F ++G+YD D WGG
Sbjct: 298 FDWHDVTQREKERQKRECPEKDMAICPTYSPTMAGGLFAISRDYFWEIGSYDEQMDGWGG 357
Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPT--MAGGLFSIDKAFFEKLGTYD 169
ENLE+SF+ IP P P G+ ++ A D
Sbjct: 358 ENLEMSFRVWQCGGTLETIPCSRIGHIFRDFHPYSFPNDRDTHGINTVRMATV----WMD 413
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
D+ +L + GDVT R+ LR L CKSF WY++
Sbjct: 414 DYIDLLYLNRPDLRDHPEVGDVTHRRVLREKLRCKSFDWYMKNVYPEKFIPTRNVRAYGR 473
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQ--GGNQFWMMSKHGEIRRDEACLDYAGGD 269
VS+ +C+D+ + D +G+Y C + +Q ++K G +R + +C +
Sbjct: 474 VSSLAENLCLDTLQQNADKPWNLGIYTCFRTEVSASQLMSLTKRGVLRTERSCATVQDNN 533
Query: 270 -----VILYPC 275
V++ PC
Sbjct: 534 AETRFVVMIPC 544
>gi|417402722|gb|JAA48197.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 557
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 97/314 (30%), Positives = 138/314 (43%), Gaps = 50/314 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L GGFDW+L
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEVLPGIIKQGV 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
C++S + T + +G+ C N Q W+ S H I++ CL G
Sbjct: 440 NCLESQGQDTAGNFLLGVGICRGSAKNPPAPQAWLFSDH-LIQQQGKCLTATSTSVSPGS 498
Query: 269 DVILYPCHGSKGNQ 282
V L C+ +G Q
Sbjct: 499 PVTLQACNLREGRQ 512
>gi|148706466|gb|EDL38413.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14, isoform CRA_b [Mus
musculus]
Length = 551
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 98/320 (30%), Positives = 140/320 (43%), Gaps = 64/320 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 203 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDWSLH 255
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 256 FQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISF 315
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RK+H P P + + +
Sbjct: 316 RVWMCGGGLEIIPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 361
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG++ +R LR+NL C++FKWYLE V D S
Sbjct: 362 VWMDEYKQYYYAARPFALERPFGNIENRLNLRKNLHCQTFKWYLENVYPELRVPPDSSIQ 421
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLD-- 264
C++S + + + L PC K G+ Q W + +I ++E CL
Sbjct: 422 KGNIRQRQKCLES--QKQKKQEILRLSPCAKVKGDGAKSQVWAFTYTQQIIQEELCLSVV 479
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 480 TLFPGAPVVLALCKNGDERQ 499
>gi|403264517|ref|XP_003924524.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Saimiri boliviensis boliviensis]
Length = 558
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 99/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V SR E R+ + CKSF+WYLE V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVASRIEQRKKMNCKSFRWYLENVYPELTVPVKEVLPGIIKQGM 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C N Q W+ S H I++ CL G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498
Query: 268 GDVILYPCHGSKGNQ 282
V L C+ +G Q
Sbjct: 499 SPVTLQMCNPREGKQ 513
>gi|157133631|ref|XP_001662949.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108870752|gb|EAT34977.1| AAEL012823-PA [Aedes aegypti]
Length = 600
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 90/311 (28%), Positives = 142/311 (45%), Gaps = 48/311 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +WL+PLL+ + ++S+ V+ P+I D E + F IGGF W+
Sbjct: 243 CECMHQWLEPLLERIKQSSTSVLVPII-----DVIEAKNFYYSTNGVTDFQIGGFTWDGH 297
Query: 64 FNWHAIPERERKRHKN-------AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
F+WH + +RE++R K A P ++PTMAGGLF+I + +F ++G+YD D WGG
Sbjct: 298 FDWHDVTQREKERQKRECPEKDMAICPTYSPTMAGGLFAISRDYFWEIGSYDEQMDGWGG 357
Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPT--MAGGLFSIDKAFFEKLGTYD 169
ENLE+SF+ IP P P G+ ++ A D
Sbjct: 358 ENLEMSFRVWQCGGTLETIPCSRIGHIFRDFHPYSFPNDRDTHGINTVRMATV----WMD 413
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
D+ +L + GDVT R+ LR L CKSF WY++
Sbjct: 414 DYIDLLYLNRPDLRDHPEVGDVTHRRVLREKLRCKSFDWYMKNVYPEKFIPTRNVRAYGR 473
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQ--GGNQFWMMSKHGEIRRDEACLDYAGGD 269
VS+ +C+D+ + D +G+Y C + +Q ++K G +R + +C +
Sbjct: 474 VSSLAENLCLDTLQQNADKPWNLGIYTCFRTEVSASQLMSLTKRGVLRTERSCATVQDNN 533
Query: 270 -----VILYPC 275
V++ PC
Sbjct: 534 AETRFVVMIPC 544
>gi|124487253|ref|NP_001074890.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Mus musculus]
gi|341940755|sp|Q9JJ61.2|GLTL1_MOUSE RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1;
AltName: Full=Polypeptide GalNAc transferase-like
protein 1; Short=GalNAc-T-like protein 1;
Short=pp-GaNTase-like protein 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase-like
protein 1; AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase-like protein 1
gi|52851357|dbj|BAD52071.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase [Mus musculus]
gi|74218446|dbj|BAE23810.1| unnamed protein product [Mus musculus]
gi|115527273|gb|AAI10635.1| Galntl1 protein [Mus musculus]
gi|115528977|gb|AAI25016.1| Galntl1 protein [Mus musculus]
Length = 558
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 140/315 (44%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNVEWLQPMLQRVMEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMDCKSFRWYLENVYPELTVPVKEVLPGVIKQGV 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C + Q W+ S H I++ CL G
Sbjct: 440 NCLESQGQNTAGDLLLGMGICRGSAKSPPPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 499 SPVILQTCNPKEGKQ 513
>gi|148230993|ref|NP_001087490.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[Xenopus laevis]
gi|51261644|gb|AAH80006.1| MGC81846 protein [Xenopus laevis]
Length = 603
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 82/217 (37%), Positives = 106/217 (48%), Gaps = 24/217 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + N VV P+I I DT + SS GGF+W L
Sbjct: 243 CEVNEMWLQPLLAPIRENPKTVVCPVIDIISSDTL--------IYSSSPVVRGGFNWGLH 294
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + P +PTMAGGLF +D+ +F LG YDSG DIWGGENLE+SF
Sbjct: 295 FKWDPVPLSELGGPEGYTAPFRSPTMAGGLFVMDREYFNTLGHYDSGMDIWGGENLEISF 354
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP----TMAGGLFSIDKAFFEKLGTYDSGFDI 174
+ + +P P +P TMA + + D D
Sbjct: 355 RIWMCGGSLLIVPCSRVGHIFRKRRPYGSPGGHDTMAYNSLRLAHVWM------DEYKDQ 408
Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ EL K D+GD++ R LR+ L CKSFKWYL+
Sbjct: 409 YFALRPELRNK-DYGDISERLALRKRLKCKSFKWYLD 444
>gi|307183924|gb|EFN70514.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Camponotus
floridanus]
Length = 471
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 108/320 (33%), Positives = 148/320 (46%), Gaps = 46/320 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + +N++ VVSP+I I DDTF T S++ G F+W+L
Sbjct: 155 CECTVGWLEPLLEAIGKNATRVVSPVIDIINDDTFSY-------TRSFELHWGAFNWDLH 207
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + R K R N EP TP MAGGLFS++K +F KLG+YD IWGGENLELS
Sbjct: 208 FRWLTLNGRLLKERRDNIIEPFRTPAMAGGLFSMNKDYFFKLGSYDDEMRIWGGENLELS 267
Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTY--DSGFDIWGG 177
F+ W E H +P T GG+ I ++ D D +
Sbjct: 268 FR-TWQCGGSVEIAPCSHVGHLFRKSSPYTFPGGVGDILYGNLARVALVWMDQWADFYFK 326
Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGMCIDSACKPT 228
N E + + SR LR L CKSF+WYLE + + G + +A K
Sbjct: 327 FNPEAAKLRYKQQIRSRLALREKLQCKSFEWYLENVWPEHFFPTDDRFFGKIVHAATKRC 386
Query: 229 DMHKPVGLYPCHKQGGN-------------QFWMMSKHGEIRRDEA-CLDYAGGD----- 269
M +P + GN Q ++M+K+G I DE+ CLD D
Sbjct: 387 LM-RPTAKSLYAQPSGNAILHSCIPRPILGQMFVMTKNGVIMTDESVCLDAPERDMQQRT 445
Query: 270 --VILYPCHGSKGNQYFEYD 287
V + C G + Q ++YD
Sbjct: 446 PKVKIMACSG-RERQRWQYD 464
>gi|351695439|gb|EHA98357.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Heterocephalus
glaber]
Length = 608
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 105/344 (30%), Positives = 147/344 (42%), Gaps = 96/344 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL V+ + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVMWLQPLLAVVHGDPHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E +A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGADSATAPIKSPTMAGGLFAMNRQYFNELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGC+SFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCQSFKWYLDNIYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C +
Sbjct: 457 ILGPNAKAQQPVFVNRGPKRPRVLQRGRLYHFQTNKCLVAQGRPSQKGGLVVLKACDYED 516
Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQ 282
Q W+ + +H + + CLD + L CHGS G+Q
Sbjct: 517 PAQVWIYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQ 560
>gi|297298138|ref|XP_001104403.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Macaca
mulatta]
Length = 558
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL P+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTIPVKEALPGIIKQGP 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C N Q W+ S H I++ CL G
Sbjct: 440 NCLESQGQSTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513
>gi|345304811|ref|XP_001505904.2| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Ornithorhynchus anatinus]
Length = 555
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 99/314 (31%), Positives = 136/314 (43%), Gaps = 52/314 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNSEWLQPLLQRVKEDYTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + + + TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + K + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYDFP--EGNALTYIKNTKRAAEVWMDDYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGM-------------- 219
+ E + FG V R E R+ + CKSF+WYLE N + +
Sbjct: 380 QYYYEARPSAIGKAFGSVAERVEQRQKMNCKSFQWYLE--NVYPELKVPEKEPAPGIIRQ 437
Query: 220 ---CIDSACKPTDMHKPVGLYPCHKQ----GGNQFWMMSKHGEIRRDEACLDY----AGG 268
C++S + P G+ C G Q W+ S IR+ + CL AG
Sbjct: 438 GASCLESRGRDAAGDSPAGVGGCRGTAGGPAGTQEWVFS-DPLIRQQDQCLSITSFSAGS 496
Query: 269 DVILYPCHGSKGNQ 282
V L C+ G Q
Sbjct: 497 QVTLERCNQKDGRQ 510
>gi|426233584|ref|XP_004010796.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 [Ovis
aries]
Length = 557
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 96/314 (30%), Positives = 141/314 (44%), Gaps = 50/314 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYL+ V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLDNVYPELTVPVKEVLPGIIKQGT 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
C++S + T + +G+ C N Q W+ + H I++ CL G
Sbjct: 440 NCLESQGQDTAGNFQLGMGICRGSAKNPPAAQAWLFTDH-LIQQQGKCLAATSTSVSPGS 498
Query: 269 DVILYPCHGSKGNQ 282
V+L C+ +G Q
Sbjct: 499 LVVLQACNPREGRQ 512
>gi|354478256|ref|XP_003501331.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Cricetulus griseus]
gi|344235668|gb|EGV91771.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Cricetulus
griseus]
Length = 608
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 87/233 (37%), Positives = 112/233 (48%), Gaps = 56/233 (24%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL ++ + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E A P+ +PTMAGGLF++++ +F LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPVSELGGADGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE 211
+L L S + D FG+++ R ELR+ LGC+SFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSFGNISERVELRKKLGCQSFKWYLD 449
>gi|324503401|gb|ADY41481.1| N-acetylgalactosaminyltransferase 6 [Ascaris suum]
Length = 927
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 78/215 (36%), Positives = 111/215 (51%), Gaps = 15/215 (6%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE K WL+PLL + N VV P+I I D TF + + F GGF+WNLQ
Sbjct: 255 CECTKGWLEPLLARIKENRKAVVCPVIDVINDRTFAYQ-------KGIELFRGGFNWNLQ 307
Query: 64 FNWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+A+P + + R + P+ +PTMAGGLFSIDK +FE+LG YD G +IWGGEN+E+S
Sbjct: 308 FRWYAVPPDIVKGRANDPTMPIQSPTMAGGLFSIDKRYFEELGAYDPGMEIWGGENIEIS 367
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKAFFEKLGTYDSGFDIWG 176
F+ +P A P P + G + + + ++ + + +
Sbjct: 368 FRIWQCGGRIEILPCSHVGHIFRKASPHDFPGKSSGKILNSNLLRVAEVWMDEWKYLFYK 427
Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
L + DV+ R ELR+ L CK F WYL+
Sbjct: 428 TAPQALQMRSSI-DVSERIELRKRLQCKDFNWYLQ 461
>gi|242020636|ref|XP_002430758.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212515955|gb|EEB18020.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 623
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 95/315 (30%), Positives = 148/315 (46%), Gaps = 61/315 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV WLQPLL + + +VV P+I I DTF+ SS GGF+W L
Sbjct: 255 IEVNVNWLQPLLSRIVDSKKNVVVPIIDIINADTFKY--------SSSPLVRGGFNWGLH 306
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P+ K +++ +P+ +PTMAGGLF+I++A+F++LG YD+G +IWGGENLE+SF
Sbjct: 307 FKWENLPKSTLKSNEDFVKPILSPTMAGGLFAINRAYFKELGEYDNGMNIWGGENLEISF 366
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ N IP RKR + P TM + +
Sbjct: 367 RIWMCGGNLELIPCSRVGHVFRKR-RPYGSPNGEDTMMRNSLRVA--------------N 411
Query: 174 IWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACK 226
+W + E +K FGD++ R +LR+ L C SF+WYL+ N + + +
Sbjct: 412 VWMDDYKEFFYKQHPEGKTFPFGDISDRLKLRKKLHCHSFEWYLQ--NIYPELIL----- 464
Query: 227 PTDMHKPVGL-YPCHKQGGNQFWMMSKHG-----EIR--------RDEACLDYAGGDVIL 272
P+D + + + +Q Q W + K ++R E + + G +IL
Sbjct: 465 PSDNEQKSKIKWNALEQQKFQPWHLRKRNYTAQFQLRLFNTSLCVTSERDVKHKGSPLIL 524
Query: 273 YPCHGSKGNQYFEYD 287
PC K +++ D
Sbjct: 525 SPCLRRKTQVWYQTD 539
>gi|148670721|gb|EDL02668.1| mCG7620, isoform CRA_b [Mus musculus]
Length = 667
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 140/315 (44%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 323 CEVNVEWLQPMLQRVMEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 375
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 376 FKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 435
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 436 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 488
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 489 QYYYEARPSAIGKAFGSVATRIEQRKKMDCKSFRWYLENVYPELTVPVKEVLPGVIKQGV 548
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C + Q W+ S H I++ CL G
Sbjct: 549 NCLESQGQNTAGDLLLGMGICRGSAKSPPPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 607
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 608 SPVILQTCNPKEGKQ 622
>gi|8918932|dbj|BAA97985.1| unnamed protein product [Mus musculus]
Length = 558
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 140/315 (44%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNVEWLQPMLQRVMEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDLTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMDCKSFRWYLENVYPELTVPVKEVLPGVIKQGV 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C + Q W+ S H I++ CL G
Sbjct: 440 NCLESQGQNTAGDLLLGMGICRGSAKSPPPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 499 SPVILQTCNPKEGKQ 513
>gi|391342179|ref|XP_003745400.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Metaseiulus occidentalis]
Length = 610
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 96/320 (30%), Positives = 144/320 (45%), Gaps = 44/320 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + + VV P+I I DDTF S++ G +W +
Sbjct: 253 CECTTGWLEPLLQRIKEDRTRVVCPIIDIIHDDTFAY-------VKSFELHWGAINWEMH 305
Query: 64 FNWHAI-PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ + P ++RH + +EP TP MAGGLFSIDK +F ++G YD DIWGGEN+E+S
Sbjct: 306 FRWYPVGPHVLKQRHGDPSEPFKTPVMAGGLFSIDKEYFYEMGAYDEQMDIWGGENVEMS 365
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVW--TPTMAGGLFSIDKAFFEKLGTYDSGFDIW 175
F+ + +P + P P GG+ + A ++ D + +
Sbjct: 366 FRIWQCGGSLEIVPCSHVGHVFRRSSPYTFPHPKGVGGILFSNLARVAEVWM-DDWAEFY 424
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSG 218
N E DV RK LR L CK F WYL ++ N +
Sbjct: 425 FNMNTEAKKLRSTMDVAKRKALRDRLHCKPFSWYLTNVWPENFFPSENRFFGKIRNRAAE 484
Query: 219 MCIDSACKPTDMHKPVG---LYPCH-KQGGNQFWMMSKHGEIRRDEA-CLD----YAGGD 269
C + H+P+G L C Q+++M+ G + DE+ CLD Y +
Sbjct: 485 KCFGRPVSKS-YHQPIGKVKLEDCAVTHYARQYFVMTGEGYLMTDESVCLDSPEGYEDTN 543
Query: 270 VILYPCHGSKGNQYFEYDYK 289
V++ C G + Q + +D K
Sbjct: 544 VVMIACQGIQ-RQKWRFDVK 562
>gi|157106440|ref|XP_001649323.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108879843|gb|EAT44068.1| AAEL004538-PA [Aedes aegypti]
Length = 596
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 95/319 (29%), Positives = 145/319 (45%), Gaps = 47/319 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + R+ + + P+I I TFE R + + + G F+W +
Sbjct: 243 CEVNTNWLPPLLAPIYRDRTVMTVPVIDGIDHKTFEYR----PVYADGHHYRGIFEWGML 298
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +P RE+KR K+ +EP +PT AGGLF+I++ FF ++G YD G +WGGEN ELSF
Sbjct: 299 YKENEVPRREQKRRKHDSEPYKSPTHAGGLFAINREFFLEIGAYDPGLLVWGGENFELSF 358
Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K W E R H + P G L + K + Y + W E
Sbjct: 359 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLANKKKGPLITI-NYKRVIETWFDE 413
Query: 179 NLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYL-------------------- 210
+ F D GD++ + L+ L CKSF+WY+
Sbjct: 414 QYKEYFYTREPLARFLDMGDISEQLALKERLQCKSFQWYMDNVAYDVLDKYPALPANLFW 473
Query: 211 -EVSNDWSGMCIDSACK-PTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGG 268
E+ N + C+D+ + P M +GL CH QG NQ ++ G++ E C++
Sbjct: 474 GELKNSGTEKCVDALGRQPPAM---IGLQHCHGQGHNQLIRLNAAGQLGVGERCIEADNM 530
Query: 269 DVILYPCHGSKGNQYFEYD 287
+ L C + ++YD
Sbjct: 531 GIKLAFCRMGTVDGPWQYD 549
>gi|380016857|ref|XP_003692388.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like,
partial [Apis florea]
Length = 556
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 90/295 (30%), Positives = 134/295 (45%), Gaps = 20/295 (6%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV K+W++PLL + + + P+I I DTF+ P GGF+W L
Sbjct: 184 IEVNKQWIEPLLSRIVYSKTITAMPVIDIINPDTFQYTGSP--------LVRGGFNWGLH 235
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P ++ +P+ +PTMAGGLF++++ +F KLG YD+G DIWGGENLE+SF
Sbjct: 236 FKWDNVPIGTFVHDEDFVKPIKSPTMAGGLFAMNREYFTKLGEYDAGMDIWGGENLEISF 295
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
+ W E G D L D + L+
Sbjct: 296 RI-WMCGGSIELIPCSRVGHVFRKRRPYGAYDQHDTMLKNSLRVAHVWLDEYKDYFLQNI 354
Query: 184 FKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDM-----HKPVGLYP 238
K D+GD+T R LR+ L CK+F WYL+V + D+ + D KP+ P
Sbjct: 355 KKIDYGDITERINLRKRLACKNFAWYLKVVYPELTLPDDNKNRLKDKWAKIEQKPIQ--P 412
Query: 239 CHKQGGN---QFWM-MSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
H + N Q+ + +S + E + G +IL PC K ++E D +
Sbjct: 413 WHSKKRNYTDQYQIRLSNSTLCIQSEKDIKTKGSKLILAPCLRIKSQMWYETDKR 467
>gi|291235412|ref|XP_002737638.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 497
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 92/295 (31%), Positives = 138/295 (46%), Gaps = 55/295 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PL+ + + V P+I I D +F + + GGF W LQ
Sbjct: 174 CECTEGWLEPLVSRIGDDRKTRVQPIIDIIDDRSFAY-------IGASESNSGGFTWQLQ 226
Query: 64 FNWHAIPERERKRHKNAAEPVW-------TPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
W IPE E+ R + + + TPTMAGGLFSI+K +FEK+G YD+G D+WGG
Sbjct: 227 HQWVRIPEYEQNRRVSEYDNIRQVTLFHRTPTMAGGLFSINKTYFEKMGAYDTGMDVWGG 286
Query: 117 ENLELSFKF-----NWHAIPERE----RKRHKNAAEPVWT-PTMAGGLFSIDKAFFEKLG 166
EN+E+SF+ IP +R+ + P + PT+ + + + +
Sbjct: 287 ENIEMSFRIWMCGGKIEIIPCSRIGHVYRRYIPYSFPNGSDPTIYRNAMRVAEVWMDHYK 346
Query: 167 TYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL---------------- 210
+ + +L D+GDV+ R ELRR LGC +F WYL
Sbjct: 347 KF------FYATQTKLHMV-DYGDVSDRLELRRKLGCHNFTWYLKNIIPEMILPVDDANY 399
Query: 211 --EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL 263
E+ ND +G+C+DSA KP+ + C F + S H ++R + CL
Sbjct: 400 FGEIRNDATGLCLDSASG-----KPLRVDICAATSDQIFTLTSDH-QLRIGKECL 448
>gi|242020557|ref|XP_002430719.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212515909|gb|EEB17981.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 511
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 99/319 (31%), Positives = 145/319 (45%), Gaps = 48/319 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE K WL+PLL ++ + VV P+I I DDTF S++ G F+WNL
Sbjct: 163 CECTKGWLEPLLVRVSEDRKKVVCPVIDIINDDTFAY-------VRSFELHWGAFNWNLH 215
Query: 64 FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ + E K+ KN EP TP MAGGLF+I + +F ++G YD IWGGENLE+S
Sbjct: 216 FRWYTLGTTEIKKRKNDVTEPFPTPAMAGGLFAIRRDYFYEIGAYDEQMKIWGGENLEMS 275
Query: 123 FKFNWHA------IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY--DSGFDI 174
F+ W +P + P T GG+ I A ++ D +
Sbjct: 276 FR-GWQCGGSVEIVPCSHVGHLFRKSSPY---TFPGGVGEILHANLARVALVWMDEWQEF 331
Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSNDWS 217
+ N E + + D V +R +LR L CKSF+WYL+ + +
Sbjct: 332 FFKFNPEAARQRDKQSVRARIQLRSRLKCKSFEWYLDNVWPQHFFPKNDRFFGLIKSASD 391
Query: 218 GMCIDSACKPTDMHKPVG---LYPCHKQGGNQFWMMSKHGEIRRDEA-CLDY-----AGG 268
C+ P ++P G L PC K+ F++ +K ++ DE+ CLD
Sbjct: 392 NKCLTRPHGPPSTNQPTGVVTLTPC-KETLEHFFVYTKFSDVMTDESVCLDLLDKNEMKA 450
Query: 269 DVILYPCHGSKGNQYFEYD 287
V + C GS ++ YD
Sbjct: 451 KVKVMACSGSPRQKWM-YD 468
>gi|50510795|dbj|BAD32383.1| mKIAA1130 protein [Mus musculus]
Length = 655
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 140/315 (44%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 311 CEVNVEWLQPMLQRVMEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 363
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 364 FKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 423
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 424 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 476
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 477 QYYYEARPSAIGKAFGSVATRIEQRKKMDCKSFRWYLENVYPELTVPVKEVLPGVIKQGV 536
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C + Q W+ S H I++ CL G
Sbjct: 537 NCLESQGQNTAGDLLLGMGICRGSAKSPPPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 595
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 596 SPVILQTCNPKEGKQ 610
>gi|348573294|ref|XP_003472426.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 [Cavia
porcellus]
Length = 556
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 97/313 (30%), Positives = 138/313 (44%), Gaps = 49/313 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNVEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDKA+F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKAWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMDCKSFRWYLENVYPELTVPVKEVLPGIIKQGL 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA-----GGD 269
C+++ + T +G+ C N Q W+ + H I++ CL G
Sbjct: 440 NCLETQGQDTAGDFLLGMGICRGSAKNPPPAQAWLFTDH-LIQQQGRCLAATSVSPPGSP 498
Query: 270 VILYPCHGSKGNQ 282
VIL C+ + Q
Sbjct: 499 VILQVCNSKESKQ 511
>gi|345803601|ref|XP_537492.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Canis lupus
familiaris]
Length = 557
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 97/314 (30%), Positives = 141/314 (44%), Gaps = 50/314 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYL+ V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLDNVYPELTVPVKEVLPGIIKQGV 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
C++S + + + +G+ C N Q W+ S H I++ CL G
Sbjct: 440 NCLESQGQDSAGNFLLGMGICRGSAKNPPAPQAWLFSDH-LIQQQGKCLTATSTSITPGS 498
Query: 269 DVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 499 LVILQVCNPREGRQ 512
>gi|297695402|ref|XP_002824932.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pongo abelii]
Length = 558
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL P+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIKQGV 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C N Q W+ S H I++ CL G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513
>gi|68534728|gb|AAH98578.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
gi|158260513|dbj|BAF82434.1| unnamed protein product [Homo sapiens]
Length = 558
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL P+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIKQGV 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C N Q W+ S H I++ CL G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513
>gi|341896063|gb|EGT51998.1| CBN-GLY-6 protein [Caenorhabditis brenneri]
Length = 617
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 100/299 (33%), Positives = 140/299 (46%), Gaps = 55/299 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE K WL+PLL + N V P+I I D+TF+ + + F GGF+WNLQ
Sbjct: 254 CECTKGWLEPLLTRIKLNRKAVPCPVIDIINDNTFQYQ-------KGIEMFRGGFNWNLQ 306
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P K H + P+ +PTMAGGLFSID+ +FE+LG YD G DIWGGENLE+S
Sbjct: 307 FRWYGMPSSMAKEHLLDPTGPIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMS 366
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGG------LFSIDKAFFEKLGTYDSG 171
F+ +P + P P + G L + + + ++ Y
Sbjct: 367 FRIWQCGGRVEILPCSHVGHVFRKSSPHDFPGKSSGKILNANLLRVAEVWMDEWKYY--- 423
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSN 214
+ + + DV+ R ELR+ L CKSFKWYL+ +SN
Sbjct: 424 --FYKLAPVAYRMRQSI-DVSERVELRKKLNCKSFKWYLQNVFKDHFLPTPLDKFGRISN 480
Query: 215 DWSGMCIDSACKPTDM----HKPVGLYPCHKQGGN--QFWMMSKHGEIRRDE-ACLDYA 266
S C +A +P D H+ +G PC G + Q W+ + IR DE CL
Sbjct: 481 --SNYC--AAFRPGDTGPKNHRLLGA-PC-TMGFDLWQLWLYTGDSRIRTDEHLCLSVV 533
>gi|291397404|ref|XP_002715111.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Oryctolagus cuniculus]
Length = 608
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 104/348 (29%), Positives = 148/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVLWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E+ + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSEQGGAEGATAPIKSPTMAGGLFAMNRLYFNELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ LGC+SFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCQSFKWYLDNIYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 457 VPGPNAKAQQPVFFNRGPKRPKVLRRGRLYHFQTNKCLVAQGRPSQKGGLVVLKACDYGD 516
Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
+Q W + +H + + CLD + L CHGS G+Q + +
Sbjct: 517 PDQVWFYNEEHELVLHNLLCLDVSETRSSDPPRLMKCHGSGGSQQWAF 564
>gi|332228990|ref|XP_003263671.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Nomascus leucogenys]
Length = 558
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL P+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIKQGV 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C N Q W+ S H I++ CL G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513
>gi|404434384|ref|NP_001258248.1| polypeptide N-acetylgalactosaminyltransferase 11 [Rattus
norvegicus]
gi|404501473|ref|NP_955425.2| polypeptide N-acetylgalactosaminyltransferase 11 [Rattus
norvegicus]
gi|149031397|gb|EDL86387.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
isoform CRA_b [Rattus norvegicus]
Length = 609
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 86/233 (36%), Positives = 113/233 (48%), Gaps = 56/233 (24%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL ++ + VV P+I I DT SS GGF+W L
Sbjct: 249 CEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 300
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P + +A P+ +PTMAGGLF++++ +F LG YDSG DIWGGENLE+SF
Sbjct: 301 FKWDLVPVSDLGGADSATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISF 360
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 361 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 397
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE 211
+L L S + D FG+++ R ELR+ LGC+SFKWYL+
Sbjct: 398 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSFGNISERVELRKKLGCQSFKWYLD 450
>gi|270265820|ref|NP_065743.2| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Homo sapiens]
gi|270265827|ref|NP_001161840.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Homo sapiens]
gi|332842578|ref|XP_522885.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|51316024|sp|Q8N428.2|GLTL1_HUMAN RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1;
AltName: Full=Polypeptide GalNAc transferase-like
protein 1; Short=GalNAc-T-like protein 1;
Short=pp-GaNTase-like protein 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase-like
protein 1; AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase-like protein 1
gi|51490858|emb|CAD44534.1| polypeptide N-acetylgalactosaminyltransferase 16 [Homo sapiens]
gi|112180422|gb|AAH36812.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
gi|112818460|gb|AAI22546.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
gi|119601392|gb|EAW80986.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1, isoform CRA_a
[Homo sapiens]
gi|119601394|gb|EAW80988.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1, isoform CRA_a
[Homo sapiens]
gi|164691113|dbj|BAF98739.1| unnamed protein product [Homo sapiens]
gi|410265456|gb|JAA20694.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
Length = 558
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL P+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIKQGV 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C N Q W+ S H I++ CL G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513
>gi|291167742|ref|NP_001094333.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Rattus norvegicus]
Length = 558
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 140/315 (44%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNVEWLQPMLQRVMEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMDCKSFRWYLENVYPELTVPVKEVLPGVIKQGV 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C + Q W+ S H I++ CL G
Sbjct: 440 NCLESQGQNTAGDLLLGMGICRGSAKSPPPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498
Query: 268 GDVILYPCHGSKGNQ 282
V+L C+ +G Q
Sbjct: 499 SPVVLQSCNPKEGKQ 513
>gi|189217666|ref|NP_001121278.1| uncharacterized protein LOC100158361 [Xenopus laevis]
gi|115528277|gb|AAI24896.1| LOC100158361 protein [Xenopus laevis]
Length = 600
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 100/328 (30%), Positives = 135/328 (41%), Gaps = 64/328 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + N VV P+I I DT + S GGF+W L
Sbjct: 240 CEVNEMWLQPLLAPIRENPKTVVCPVIDIISADTL--------IYSQSPVVRGGFNWGLH 291
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + P +PTMAGGLF++D+ +F LG YDSG DIWGGENLE+SF
Sbjct: 292 FKWDPVPLSELGGPEGFTAPFRSPTMAGGLFAMDREYFNTLGQYDSGMDIWGGENLEISF 351
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP----TMAGGLFSIDKAFFEKLGTYDSGFDI 174
+ + +P P +P TMA + + D D
Sbjct: 352 RIWMCGGSLLIVPCSRVGHIFRKRRPYGSPGGHDTMAHNSLRLAHVWM------DEYKDQ 405
Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE----------------------- 211
+ EL + DFGD+ R LR+ L CKSFKWYL+
Sbjct: 406 YFALRPELRNR-DFGDIRDRLTLRKRLNCKSFKWYLDNIYPEMQVSGPNAKPQPPVFINK 464
Query: 212 ------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMS-KHGEIRR 258
+ N + C+ + P+ V + C Q W + +H I
Sbjct: 465 GQKRPKILQRGRLINMQTNKCLVAQGHPSQKGGLVVVKDCDFNDSEQVWSYNEEHELILS 524
Query: 259 DEACLDY----AGGDVILYPCHGSKGNQ 282
+ CLD + L CHGS G+Q
Sbjct: 525 NLLCLDMSETRSSDPPRLMKCHGSGGSQ 552
>gi|158300689|ref|XP_320549.4| AGAP011984-PA [Anopheles gambiae str. PEST]
gi|157013282|gb|EAA00339.4| AGAP011984-PA [Anopheles gambiae str. PEST]
Length = 585
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 105/323 (32%), Positives = 146/323 (45%), Gaps = 55/323 (17%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
EV WL PLL+ +A + V P I I DTF+ R S + G FDW +F
Sbjct: 221 EVNTNWLPPLLEPIAEDYRTCVCPFIDVIAHDTFQYR-------SQDEGKRGAFDW--KF 271
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
+ +P + +P +P MAGGLF+I FF +LG YD G DIWGGE ELSFK
Sbjct: 272 YYKRLPLLPGDL-DDPTKPFNSPVMAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFK 330
Query: 125 FNWHA------IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
W P P P G+ + + F + + + E
Sbjct: 331 I-WQCGGRLVDAPCSRVGHVYRGYAPFGNPR---GVNFVVRNFKRVAEVWMDEYSQFLYE 386
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV-----------------------SND 215
K D GD+++++ELR L CK FKW+LEV S
Sbjct: 387 RNPQFAKTDPGDLSAQRELRERLQCKPFKWFLEVVAPDLLVRYPPRDPQPFASGRVQSVA 446
Query: 216 WSGMCIDSACKPTDMHKPVGLYPC-----HKQGGNQFWMMSKHGEI--RRDEACLDYA-- 266
+C+DS +P+GLY C H Q NQF+ +S H +I R ++ CLD A
Sbjct: 447 NPRLCLDSLNH--QAKEPIGLYACAFNKTHPQ-NNQFFTLSYHRDIRVRSNDKCLDAAKL 503
Query: 267 GGDVILYPCHGSKGNQYFEYDYK 289
+++L+ CH S+GNQ + YDY+
Sbjct: 504 NDEIVLFSCHESQGNQMWRYDYE 526
>gi|51315700|sp|Q6P6V1.1|GLT11_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
AltName: Full=Polypeptide GalNAc transferase 11;
Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 11;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 11
gi|38303875|gb|AAH62004.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[Rattus norvegicus]
Length = 608
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 86/233 (36%), Positives = 113/233 (48%), Gaps = 56/233 (24%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL ++ + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P + +A P+ +PTMAGGLF++++ +F LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPVSDLGGADSATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE 211
+L L S + D FG+++ R ELR+ LGC+SFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSFGNISERVELRKKLGCQSFKWYLD 449
>gi|443298648|gb|AGC81884.1| N-acetylgalactosaminyltransferase, partial [Bombyx mori]
Length = 499
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 92/306 (30%), Positives = 142/306 (46%), Gaps = 45/306 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + R+ + P+I I +TFE R P + ++Y+ G F+W +
Sbjct: 146 CEVNVNWLPPLLAPIYRDYRTMTVPVIDGIDYNTFEYR-PVYQHGTNYR---GIFEWGML 201
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +P+RE HK+ +EP +PT AGGLF+I++ +F ++G YD G +WGGEN ELSF
Sbjct: 202 YKENEVPDREAHLHKHKSEPYKSPTHAGGLFAINRRYFLEIGAYDPGLLVWGGENFELSF 261
Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K W E R H A + P G L K + Y + W E
Sbjct: 262 KI-WQCGGSIEWVPCSRVGHVYRA---FMPYTFGNLAKNRKGSLITI-NYKRVIETWFDE 316
Query: 179 NLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYLE------------------- 211
+ F D GD++ + L+ L CKSF W++E
Sbjct: 317 EHKEYFYTREPMARFLDMGDISEQVALKERLKCKSFGWFMENVAYDVYDKFPKLPKNVHW 376
Query: 212 --VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD 269
V N +G C+D+ K + +G CH G +Q + +++ G++ E C++ G +
Sbjct: 377 GMVKNKATGACLDTMGKAAPAY--IGTSSCHGMGNSQLFRLNEAGQLGVGERCVETDGDN 434
Query: 270 VILYPC 275
V C
Sbjct: 435 VKQAIC 440
>gi|327263882|ref|XP_003216746.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Anolis carolinensis]
Length = 536
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 75/216 (34%), Positives = 113/216 (52%), Gaps = 11/216 (5%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A + VVSP I I +TFE P + + G FDW+L
Sbjct: 271 CECFHGWLEPLLSRIAEEPTAVVSPDITTIDLNTFEFSKP---IQYGKQHSRGNFDWSLT 327
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W AIP+ E++R K+ P+ TPT AGGLF+I KA+FE +G+YD +IWGGEN+E+SF
Sbjct: 328 FGWEAIPQHEKERRKDETYPIKTPTFAGGLFAISKAYFEHVGSYDDQMEIWGGENVEMSF 387
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWG 176
+ IP + P P + S ++ + + Y F
Sbjct: 388 RVWQCGGQLEIIPCSVVGHVFRSKSPHTFPK-GTQVISRNQVRLAEVWMDDYKEIFYRRN 446
Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV 212
+ +++ + +GD++ R +L+ L CK+F WYL+
Sbjct: 447 QQASQMAREKTYGDLSDRLDLKERLHCKNFTWYLQT 482
>gi|296215364|ref|XP_002754093.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Callithrix jacchus]
Length = 558
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 136/315 (43%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------VSNDWSG 218
+ E + FG V SR E R+ + CKSF+WYLE V
Sbjct: 380 QYYYEARPSAIGKAFGSVASRIEQRKKMNCKSFRWYLENVYPELTVPVKEVLPVIIKQGM 439
Query: 219 MCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C N Q W+ S H I++ CL G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498
Query: 268 GDVILYPCHGSKGNQ 282
V L C+ +G Q
Sbjct: 499 SPVTLQMCNPREGKQ 513
>gi|449474909|ref|XP_002194974.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Taeniopygia guttata]
Length = 555
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 101/331 (30%), Positives = 141/331 (42%), Gaps = 69/331 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F G T + G FDW +
Sbjct: 189 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDHF------GYETQAGDAMRGAFDWEMY 242
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD+G +IWGGE E+SF
Sbjct: 243 YKRIPIPPELQK--PDPSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISF 300
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K IP P PT ++ + ++W E
Sbjct: 301 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPTGVSLARNLKRV-----------AEVWMDE 349
Query: 179 NLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL--------------------- 210
E ++ GDV ++KELR NL CKSFKW++
Sbjct: 350 YAEFIYQRRPEYRHLSAGDVAAQKELRNNLNCKSFKWFMNEVAWDLPKFYPPVEPPAAAW 409
Query: 211 -EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA-- 261
E+ N +G+C+D+ K + P+ L C K G W S +IR +
Sbjct: 410 GEIRNVGTGLCVDT--KHGALGSPLRLENCVKDRGEAAWNNVQVFTFSWREDIRPGDPQH 467
Query: 262 ----CLDYA--GGDVILYPCHGSKGNQYFEY 286
C D V LY CHG KGNQ + Y
Sbjct: 468 TKKFCFDAISHSSPVTLYDCHGMKGNQLWRY 498
>gi|62122367|dbj|BAD93178.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 16 [Homo sapiens]
gi|119601393|gb|EAW80987.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1, isoform CRA_b
[Homo sapiens]
gi|168269696|dbj|BAG09975.1| polypeptide N-acetylgalactosaminyltransferase-like protein 1
[synthetic construct]
Length = 542
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL P+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIKQGV 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C N Q W+ S H I++ CL G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513
>gi|410214072|gb|JAA04255.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410214074|gb|JAA04256.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410295440|gb|JAA26320.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410295442|gb|JAA26321.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410336845|gb|JAA37369.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
Length = 558
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 136/315 (43%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL P+L + + + VVSP+I I D F L GGFDW+L
Sbjct: 214 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIKQGV 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C N Q W+ S H I++ CL G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513
>gi|149031398|gb|EDL86388.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
isoform CRA_c [Rattus norvegicus]
Length = 560
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 80/218 (36%), Positives = 109/218 (50%), Gaps = 26/218 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL ++ + VV P+I I DT SS GGF+W L
Sbjct: 200 CEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 251
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P + +A P+ +PTMAGGLF++++ +F LG YDSG DIWGGENLE+SF
Sbjct: 252 FKWDLVPVSDLGGADSATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISF 311
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RKR + P TM + + D +
Sbjct: 312 RIWMCGGKLFIIPCSRVGHIFRKR-RPYGSPEGQDTMTHNSLRLAHVWL------DEYKE 364
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ +L K FG+++ R ELR+ LGC+SFKWYL+
Sbjct: 365 QYFSLRPDLKTKS-FGNISERVELRKKLGCQSFKWYLD 401
>gi|402876549|ref|XP_003902024.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 [Papio
anubis]
Length = 558
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL P+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTIPVKEALPGIIKQGP 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C N Q W+ S H I++ CL G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513
>gi|380786811|gb|AFE65281.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Macaca mulatta]
Length = 558
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL P+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTIPVKEALPGIIKQGP 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C N Q W+ S H I++ CL G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513
>gi|21450297|ref|NP_659157.1| polypeptide N-acetylgalactosaminyltransferase 11 [Mus musculus]
gi|51316059|sp|Q921L8.1|GLT11_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
AltName: Full=Polypeptide GalNAc transferase 11;
Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 11;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 11
gi|15030306|gb|AAH11428.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 [Mus musculus]
gi|18204499|gb|AAH21504.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 [Mus musculus]
gi|21529335|emb|CAC79626.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Mus
musculus]
gi|21707973|gb|AAH34185.1| Galnt11 protein [Mus musculus]
gi|23274082|gb|AAH36143.1| Galnt11 protein [Mus musculus]
gi|23274085|gb|AAH36145.1| Galnt11 protein [Mus musculus]
gi|33321872|gb|AAQ06668.1| UDP-GalNAc:polypeptide N-Acetylgalactosaminyltransferase T11 [Mus
musculus]
gi|74149639|dbj|BAE36442.1| unnamed protein product [Mus musculus]
gi|148671131|gb|EDL03078.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11, isoform CRA_b [Mus
musculus]
Length = 608
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 86/233 (36%), Positives = 112/233 (48%), Gaps = 56/233 (24%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL ++ + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E A P+ +PTMAGGLF++++ +F LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPVSELGGPDGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF + + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFILPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE 211
+L L S + D FG+++ R ELR+ LGC+SFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKNKSFGNISERVELRKKLGCQSFKWYLD 449
>gi|148671130|gb|EDL03077.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11, isoform CRA_a [Mus
musculus]
Length = 529
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 86/233 (36%), Positives = 112/233 (48%), Gaps = 56/233 (24%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL ++ + VV P+I I DT SS GGF+W L
Sbjct: 169 CEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 220
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E A P+ +PTMAGGLF++++ +F LG YDSG DIWGGENLE+SF
Sbjct: 221 FKWDLVPVSELGGPDGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISF 280
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF + + F K Y S G D
Sbjct: 281 R--------------------IW---MCGGKLFILPCSRVGHIFRKRRPYGSPEGQDTMT 317
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE 211
+L L S + D FG+++ R ELR+ LGC+SFKWYL+
Sbjct: 318 HNSLRLAHVWLDEYKEQYFSLRPDLKNKSFGNISERVELRKKLGCQSFKWYLD 370
>gi|26352932|dbj|BAC40096.1| unnamed protein product [Mus musculus]
Length = 608
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 86/233 (36%), Positives = 112/233 (48%), Gaps = 56/233 (24%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL ++ + VV P+I I DT SS GGF+W L
Sbjct: 248 CEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E A P+ +PTMAGGLF++++ +F LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPVSELGGPDGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF + + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFILPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE 211
+L L S + D FG+++ R ELR+ LGC+SFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKNKSFGNISERVELRKKLGCQSFKWYLD 449
>gi|397507535|ref|XP_003824250.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 [Pan
paniscus]
Length = 529
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL P+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 185 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 237
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 238 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 297
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 298 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 350
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 351 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIKQGV 410
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C N Q W+ S H I++ CL G
Sbjct: 411 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 469
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 470 SPVILQMCNPREGKQ 484
>gi|355693388|gb|EHH27991.1| hypothetical protein EGK_18322, partial [Macaca mulatta]
Length = 499
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL P+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 155 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 207
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 208 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 267
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 268 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 320
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 321 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTIPVKEALPGIIKQGP 380
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C N Q W+ S H I++ CL G
Sbjct: 381 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 439
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 440 SPVILQMCNPREGKQ 454
>gi|328792011|ref|XP_624873.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Apis mellifera]
Length = 637
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 89/294 (30%), Positives = 134/294 (45%), Gaps = 20/294 (6%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
EV ++W++PLL + + + P+I I DTF+ P GGF+W L F
Sbjct: 266 EVNRQWIEPLLSRIVYSKTITAMPVIDIINPDTFQYTGSP--------LVRGGFNWGLHF 317
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
W +P ++ +P+ +PTMAGGLF++++ +F KLG YD+G DIWGGENLE+SF+
Sbjct: 318 KWDNVPIGTFVHDEDFVKPIKSPTMAGGLFAMNREYFTKLGEYDAGMDIWGGENLEISFR 377
Query: 125 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 184
W E G D L D + L+
Sbjct: 378 I-WMCGGSIELIPCSRVGHVFRKRRPYGAYDQHDTMLKNSLRVAHVWLDEYKDYFLQNIK 436
Query: 185 KGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDM-----HKPVGLYPC 239
K D+GD+T R LR+ L CK+F WYL+V + D+ + D KP+ P
Sbjct: 437 KIDYGDITERINLRKRLACKNFAWYLKVVYPELTLPDDNKNRLKDKWAKIEQKPIQ--PW 494
Query: 240 HKQGGN---QFWM-MSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
H + N Q+ + +S + E + G +IL PC K ++E D +
Sbjct: 495 HSKKRNYTDQYQIRLSNSTLCIQSEKDIKTKGSKLILAPCLRIKSQMWYETDKR 548
>gi|6329812|dbj|BAA86444.1| KIAA1130 protein [Homo sapiens]
Length = 575
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL P+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 247 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 300 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 359
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 360 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 412
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 413 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIKQGV 472
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C N Q W+ S H I++ CL G
Sbjct: 473 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 531
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 532 SPVILQMCNPREGKQ 546
>gi|241133788|ref|XP_002404588.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215493637|gb|EEC03278.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 459
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 100/313 (31%), Positives = 142/313 (45%), Gaps = 59/313 (18%)
Query: 18 LARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERERKRH 77
+ R ++ VV P+I I D+TF S++ G F+W L F W + ERE KR
Sbjct: 117 ITRQATVVVCPVIDIINDETFAY-------VRSFEMHWGAFNWELHFRWFPVGEREHKRR 169
Query: 78 K-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF-----NWHAIP 131
NA P TP MAGGLFSID+ +F ++G YD DIWGGEN+E+SF+ + +P
Sbjct: 170 SGNATAPFRTPVMAGGLFSIDRGYFYEMGAYDDQMDIWGGENMEISFRIWQCGGSVEVVP 229
Query: 132 ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKGDFG-- 189
P P G + F L + +W E F + G
Sbjct: 230 CSHVGHLFRRTSPYTFPNPGG----VGSVLFSNLARVAA---VWMDEWAAFYFNMNRGEK 282
Query: 190 -----DVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKP 227
DVT+RK+LR L CKSFKWYL +V N SG C +P
Sbjct: 283 RHMLQDVTARKKLREKLQCKSFKWYLKNIWPENFLPNDNIFFGKVRNKKSGKCF---VRP 339
Query: 228 T--DMHKPVGLYPCHKQG----GNQFWMMSKHGEIRRDEA-CLDY----AGGDVILYPCH 276
+ + H+PVG + Q ++ ++ G I+ DE+ CLD A +V++ C+
Sbjct: 340 SSKNYHQPVGRVVLEECALTYYAMQHFVFTEEGFIKTDESICLDSPESKADTNVVMIACN 399
Query: 277 GSKGNQYFEYDYK 289
+ Q + YD K
Sbjct: 400 DLQ-RQKWRYDPK 411
>gi|444509912|gb|ELV09433.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Tupaia chinensis]
Length = 566
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 222 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 274
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 275 FKWEQIPLDQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 334
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 335 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 387
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 388 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEVLPGIMKQGV 447
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + +G+ C N Q W+ S H I++ CL G
Sbjct: 448 NCLESQGQSPAGDFLLGMGICRGSAKNPPSAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 506
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 507 SPVILQVCNPREGKQ 521
>gi|432107114|gb|ELK32537.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Myotis davidii]
Length = 518
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 99/316 (31%), Positives = 139/316 (43%), Gaps = 52/316 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 171 CEVNTEWLQPLLQRVQEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 223
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 224 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 283
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 284 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 336
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V SR E R+ + CKSF+WYLE V +
Sbjct: 337 QYYYEARPSAIGKAFGSVASRIEQRKKMNCKSFRWYLENVYPELTVPVKEVLPSIIKQGV 396
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA-------- 266
C++S + T + +G+ C N Q W+ S H I++ CL
Sbjct: 397 NCLESQGQDTAGNFLLGVGTCRGSAKNPPAPQAWLFSDH-LIQQQGKCLTATSTSASISP 455
Query: 267 GGDVILYPCHGSKGNQ 282
G V L C+ +G Q
Sbjct: 456 GSPVGLQTCNPREGKQ 471
>gi|311275138|ref|XP_003134591.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Sus
scrofa]
Length = 608
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 104/348 (29%), Positives = 147/348 (42%), Gaps = 96/348 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL + + VV P+I I DT P GGF+W L
Sbjct: 248 CEVNVLWLQPLLAAIREDRHTVVCPVIDIISADTLAYSASP--------VVRGGFNWGLH 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FRWDLVPLSELEGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
+ +W M GG LF I + F K Y S G D
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396
Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
+L L S + D +G+++ R ELR+ L CKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLRTRSYGNISERVELRKKLDCKSFKWYLDNIYPEMQ 456
Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
+ + + C+ + +P+ V L C
Sbjct: 457 VSGPNAKPQQPIFINRGPKRPKVLQRGRLYHLQTNKCLAAQGRPSQKGGLVVLKACDYGD 516
Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
+Q W+ + +H I + CLD + L CHGS G+Q + +
Sbjct: 517 PDQIWIYNEEHELILNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564
>gi|410962531|ref|XP_003987822.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1,
partial [Felis catus]
Length = 553
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 96/314 (30%), Positives = 141/314 (44%), Gaps = 50/314 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 210 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 262
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 263 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 322
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 323 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFPE--GNALTYIRNTKRTAEVWMDEYK 375
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYL+ V G+
Sbjct: 376 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLDNVYPELTVPVKEVLPGIIKQGV 435
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
C++S + + + +G+ C N Q W+ S H I++ CL G
Sbjct: 436 NCLESQGQDSAGNFLLGMGICRGSAKNPPAPQAWLFSDH-LIQQQGKCLTATSTSITPGS 494
Query: 269 DVILYPCHGSKGNQ 282
V+L C+ +G Q
Sbjct: 495 LVVLQVCNPREGRQ 508
>gi|195129477|ref|XP_002009182.1| GI11401 [Drosophila mojavensis]
gi|193920791|gb|EDW19658.1| GI11401 [Drosophila mojavensis]
Length = 673
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 106/334 (31%), Positives = 147/334 (44%), Gaps = 71/334 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E WL PLL+ +A+N V P I I F R + + G FDW+
Sbjct: 302 VEANYNWLPPLLEPIAQNKRTAVCPFIDVIDHSNFNYR-------AQDEGARGAFDWDFF 354
Query: 64 FN-WHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
+ +PE K+ AEP +P MAGGLF+I FF +LG YD G DIWGGE ELS
Sbjct: 355 YKRLPLLPED----LKHPAEPFKSPVMAGGLFAISAEFFWELGGYDEGLDIWGGEQYELS 410
Query: 123 FKF------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
FK + A R ++ V P L Y ++W
Sbjct: 411 FKIWMCGGQMYDAPCSRIGHIYRGPRNHVSNPRGGDYLHK----------NYKRVAEVWM 460
Query: 177 GENLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYLE------VSN-------D 215
E + + G D GD+T++K +R L CKSFKW++E + N D
Sbjct: 461 DEYKQYLYNGADGVYERIDAGDLTAQKAIRTKLKCKSFKWFMENVAFDLIKNYPPIDPPD 520
Query: 216 WSG----------MCIDSACKPTDMHKPVGLYPCH----KQGGNQFWMMSKHGEIR--RD 259
++ +C+D+ KP H VG+Y C K Q+W +S ++R R
Sbjct: 521 YASGAIQNVGDPTLCVDTLSKPR--HNRVGIYSCARNLVKPQRTQYWSLSWKRDLRLHRK 578
Query: 260 EACLDY----AGGDVILYPCHGSKGNQYFEYDYK 289
+ CLD A V L+ CHG +GNQY+ YDY+
Sbjct: 579 KDCLDVQIWDANAPVWLWDCHGQQGNQYWYYDYR 612
>gi|312370888|gb|EFR19193.1| hypothetical protein AND_22920 [Anopheles darlingi]
Length = 812
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 96/309 (31%), Positives = 147/309 (47%), Gaps = 41/309 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV WL+ LLDV+A N + + P I + D + +++ + FIG +DW+L
Sbjct: 459 VEVTIGWLEALLDVVAHNWTTIAIPTIDWV--DEYNMKYKDDKA----PIFIGAYDWDLN 512
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +K++ N P TP MAGGLF+I++ FFE+LG YD GFDI+G EN+ELS
Sbjct: 513 FGWWG-RWSMKKKYDNKMVPFDTPAMAGGLFTINRTFFERLGWYDEGFDIYGIENIELSM 571
Query: 124 KFNWH------AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG--FDIW 175
K +W +P + A P T + + E G +DI+
Sbjct: 572 K-SWMCGGKMVTVPCSRVGHIQKAGHPYLTRETKDVVRANSIRLAEVWMDEYKGIIYDIY 630
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVSNDWS 217
G + + +FG V RK +R+ GC+ F++YL EV N
Sbjct: 631 GIPHYS---EEEFGSVEHRKAIRQKAGCQPFRYYLENAFPEMHNPMVPGAFRGEVHNGAL 687
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHG 277
G + TD +G+ PC +QFW + + E+ + C+D G ++ +Y CH
Sbjct: 688 GNGTCLTYRGTD--NFLGMAPCDHLEKSQFWTHNYYQELNSYQNCID--GPNLAVYRCHK 743
Query: 278 SKGNQYFEY 286
S+GNQ ++Y
Sbjct: 744 SRGNQAWKY 752
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 75/255 (29%), Positives = 124/255 (48%), Gaps = 33/255 (12%)
Query: 54 FIGGFDWNLQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDI 113
+IG +DW+L F W +K++ N P TP MAGGLF+I++ FFE+LG YD GFDI
Sbjct: 11 YIGAYDWDLNFGWWG-RWSMKKKYDNKMVPFDTPAMAGGLFTINRTFFERLGWYDEGFDI 69
Query: 114 WGGENLELSFKFNWH------AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGT 167
+G EN+ELS K +W +P + P + + + ++
Sbjct: 70 YGIENIELSMK-SWMCGGKMVTVPCSRVAHIQKVGHP-YLRNEKKDVVRANSIRLAEVWM 127
Query: 168 YDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV------SNDWSG--- 218
+ I+ + + +FG V +RK +R GC+ F++Y+E S D +G
Sbjct: 128 DEYKHVIFDIHGIPHYLEEEFGSVENRKAIRERAGCRDFRYYIENAFPEMHSPDVAGAFR 187
Query: 219 -----------MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG 267
MC++ + TD +G+ PC + +QFW + + E+ C+D+ G
Sbjct: 188 GEVHSVVLGVTMCLEY--RHTDSF--LGMGPCDGKQRSQFWTHNYYEELNSYRYCIDFTG 243
Query: 268 GDVILYPCHGSKGNQ 282
++ ++ CH S+GNQ
Sbjct: 244 SNLGVFGCHRSRGNQ 258
>gi|194225134|ref|XP_001495036.2| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Equus caballus]
Length = 619
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 95/315 (30%), Positives = 139/315 (44%), Gaps = 52/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F ++ GGFDW+L
Sbjct: 276 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNFAY-------LAASAILRGGFDWSLH 328
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 329 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 388
Query: 124 KF-----NWHAIPERE-----RKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
+ + +P RKRH N E G + + + +
Sbjct: 389 RVWMCGGSLEIVPCSRVGHVFRKRHPYNFPE--------GNALTYIRNTKRTAEVWMDEY 440
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM---- 219
+ E + FG V +R E R+ + CKSF+WYL+ V G+
Sbjct: 441 KQYYYEARPSAIGKAFGSVATRIEQRKKMSCKSFRWYLDNVYPELTVPVKEVLPGIIKQG 500
Query: 220 --CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------G 267
C++S + T + +G+ C N Q W+ S H I++ CL G
Sbjct: 501 VNCLESQGQDTAGNFLLGMGICRGSVKNPPAPQAWLFSDH-LIQQQGKCLTATSTSVSPG 559
Query: 268 GDVILYPCHGSKGNQ 282
V L C+ +G Q
Sbjct: 560 SLVTLQVCNPREGRQ 574
>gi|426377334|ref|XP_004055422.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Gorilla gorilla gorilla]
Length = 598
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL P+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 254 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 306
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 307 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 366
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 367 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 419
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 420 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIKQGV 479
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C N Q W+ S H I++ CL G
Sbjct: 480 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 538
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 539 SPVILQMCNPREGKQ 553
>gi|268574330|ref|XP_002642142.1| C. briggsae CBR-GLY-6 protein [Caenorhabditis briggsae]
Length = 617
Score = 127 bits (319), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 81/218 (37%), Positives = 114/218 (52%), Gaps = 21/218 (9%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE K WL+PLL + N V P+I I D+TF+ + + F GGF+WNLQ
Sbjct: 254 CECTKGWLEPLLTRIKLNRKAVPCPVIDIINDNTFQYQ-------KGIEMFRGGFNWNLQ 306
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P K+H + P+ +PTMAGGLFSID+ +FE+LG YD G DIWGGENLE+S
Sbjct: 307 FRWYGMPSSMAKQHLLDPTGPIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMS 366
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ +P + P P + G + A ++ + D W
Sbjct: 367 FRIWQCGGRVEILPCSHVGHVFRKSSPHDFPGKSSG--KVLNANLLRVA--EVWMDEWKY 422
Query: 178 ENLELSFKG----DFGDVTSRKELRRNLGCKSFKWYLE 211
+++ + DV+ R ELR+ L CKSFKWYL+
Sbjct: 423 YFYKIAPQAYRMRPSIDVSERVELRKTLNCKSFKWYLQ 460
>gi|195028169|ref|XP_001986949.1| GH20244 [Drosophila grimshawi]
gi|193902949|gb|EDW01816.1| GH20244 [Drosophila grimshawi]
Length = 599
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 92/308 (29%), Positives = 139/308 (45%), Gaps = 44/308 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
CE + W +PLL + + + V+ P+I I D+ + ++ T+ YK F +GGF WN
Sbjct: 244 CEANEGWCEPLLQRIKDSRTSVLVPIIDVI--DSVDFQYS----TNGYKSFQVGGFQWNG 297
Query: 63 QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
F+W +PERE+ R P ++PTMAGGLF++D+ +F ++G+YD D WGG
Sbjct: 298 HFDWVNLPEREKLRQSRECNQPREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGG 357
Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
ENLE+SF+ IP P P I+ A L D
Sbjct: 358 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 415
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
+++ +L F D GDVT R LR+ L CKSF WYL+ V
Sbjct: 416 INVFFLNRPDLKFHPDIGDVTHRVVLRKKLRCKSFDWYLQNVYPEKFVPNKNVKAWGRVR 475
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
+ +CID + +GLYPC K +Q + + +R + +C
Sbjct: 476 SVHDNLCIDDLLNNNEKPYNLGLYPCGKTLQHSQLFSFTNSQVLRNELSCATVQHSSSPP 535
Query: 270 --VILYPC 275
+++ PC
Sbjct: 536 YRIVMVPC 543
>gi|405951291|gb|EKC19216.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Crassostrea
gigas]
Length = 613
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 83/266 (31%), Positives = 130/266 (48%), Gaps = 34/266 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL+PLL ++ + + VV P+I I DT E + P GGF+W L
Sbjct: 242 CEVNTDWLEPLLLRISHDPTTVVVPVIDIINHDTMEYQQSP--------LVRGGFNWGLH 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+W +P+ E+ ++P+ +PTMAGGLF++ + +F LG YD G DIWGGENLE+SF
Sbjct: 294 FSWDRLPDNEKNDPDLGSKPILSPTMAGGLFAMKRDYFHHLGEYDLGMDIWGGENLEISF 353
Query: 124 KF-----NWHAIPE-------RERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
+ IP R+R+ + N P T + + +K Y
Sbjct: 354 RIWMCGGKLEIIPCSRVGHIFRKRRPYGN---PKGRDTFLKNSLRVANVWMDKYKEY--- 407
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMH 231
+ + D+GD++ R LR++L CKSFKWYL+ + + + + KP++
Sbjct: 408 ----FLKQRPQAQVVDYGDISDRISLRKHLSCKSFKWYLD--HVYPELSLPGDVKPSN-- 459
Query: 232 KPVGLYPCHKQGGNQFWMMSKHGEIR 257
K P + ++ +HG I+
Sbjct: 460 KSSHHQPMKSNDKKKKPVIVRHGRIK 485
>gi|312371733|gb|EFR19844.1| hypothetical protein AND_21714 [Anopheles darlingi]
Length = 637
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 107/322 (33%), Positives = 149/322 (46%), Gaps = 55/322 (17%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
EV WL PLL+ +A + V P I I DTF+ R + + G FDW +F
Sbjct: 252 EVNNNWLPPLLEPIAEDYRTCVCPFIDVIAHDTFQYR-------AQDEGKRGAFDW--KF 302
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
+ +P + +P +P MAGGLF+I FF +LG YD G DIWGGE ELSFK
Sbjct: 303 YYKRLPLLPGDL-DDPTKPFNSPVMAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFK 361
Query: 125 FNWHA------IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
W P P P G+ + + F + + + E
Sbjct: 362 I-WQCGGRLVDAPCSRVGHVYRGYAPFGNPR---GVNFVVRNFKRVAEVWMDEYAKFLYE 417
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-EVSNDW-------------SG------ 218
L K D GD+T+++ELR L C+ FKW+L E++ D SG
Sbjct: 418 RNPLFEKTDPGDLTAQRELRERLQCRPFKWFLEEIAPDLLIRYPVREPQPFASGRVQSVA 477
Query: 219 ---MCIDSACKPTDMHKPVGLYPC-----HKQGGNQFWMMSKHGEI--RRDEACLDYA-- 266
+C+DS +P+GLY C H Q NQF+ +S H +I R ++ CLD +
Sbjct: 478 DRRLCLDSLNH--QAKQPIGLYTCASNQTHPQ-NNQFFTLSFHRDIRVRSNDKCLDASRL 534
Query: 267 GGDVILYPCHGSKGNQYFEYDY 288
+VIL+ CH S+GNQ + YDY
Sbjct: 535 NDEVILFSCHESQGNQMWRYDY 556
>gi|194882801|ref|XP_001975498.1| GG20529 [Drosophila erecta]
gi|190658685|gb|EDV55898.1| GG20529 [Drosophila erecta]
Length = 601
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 97/324 (29%), Positives = 150/324 (46%), Gaps = 45/324 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
CE W +PLL + + + V+ P+I I + F+ T+ YK F +GGF WN
Sbjct: 247 CEGNIGWCEPLLQRIKESRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 300
Query: 63 QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
F+W +PERE++R + P ++PTMAGGLF+ID+ +F ++G+YD D WGG
Sbjct: 301 HFDWINLPEREKQRQRRECRQQREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGG 360
Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
ENLE+SF+ IP P P I+ A L D
Sbjct: 361 ENLEMSFRIWQCGGTIETIPCSRVGHVFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 418
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
+I+ +L F D GDVT R LR+ L CKSF+WYL +V
Sbjct: 419 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFEWYLKNIYPEKFVPTKDVQGWGKVH 478
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
S +C+D + + GLYPC K +Q + + +R + +C +
Sbjct: 479 ALNSNICLDDLLQNNEKPYNAGLYPCGKVLQKSQLFSFTNTNVLRNELSCATVQHSESPP 538
Query: 270 --VILYPC-HGSKGNQYFEYDYKY 290
V++ PC + N+++ Y++++
Sbjct: 539 YRVVMVPCMENDEFNEHWRYEHQH 562
>gi|383848548|ref|XP_003699911.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Megachile rotundata]
Length = 604
Score = 127 bits (318), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 105/323 (32%), Positives = 147/323 (45%), Gaps = 52/323 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A+N + VVSP+I I DDTF T S++ G F+W+L
Sbjct: 251 CECTVGWLEPLLEAVAKNKTRVVSPVIDIINDDTFSY-------TRSFELHWGAFNWDLH 303
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + R K R +N EP TP MAGGLFS+++ +F +LG+YD IWGGENLELS
Sbjct: 304 FRWLTLNGRLLKERRENIVEPFRTPAMAGGLFSMNRDYFFELGSYDDQMKIWGGENLELS 363
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY--DSGFDIW 175
F+ + P + P T GG+ I ++ D + +
Sbjct: 364 FRVWQCGGSVEIAPCSHVGHLFRKSSPY---TFPGGVGEILYGNLARVALVWMDEWAEFY 420
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDW------------------- 216
N E S V +R LR+ L CKSF+WYL+ N W
Sbjct: 421 FKFNAEASRLRHKQPVRARLALRKRLQCKSFEWYLD--NVWPEHFFPKNDRFFGRIVHVS 478
Query: 217 SGMCIDSACKPTDMHKPVG---LYPC-HKQGGNQFWMMSKHGEIRRDEA-CLDYAGGD-- 269
+ CI +P G L C + NQ ++M+K G + DE+ CLD D
Sbjct: 479 TKKCIMRPTAKGTYSQPSGYALLESCIPRPVLNQMFVMTKSGIVMTDESICLDAPDRDTQ 538
Query: 270 -----VILYPCHGSKGNQYFEYD 287
V + C S+ Q ++YD
Sbjct: 539 HKTPRVKIMAC-SSQSRQNWQYD 560
>gi|340378190|ref|XP_003387611.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Amphimedon queenslandica]
Length = 512
Score = 127 bits (318), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 94/297 (31%), Positives = 134/297 (45%), Gaps = 39/297 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL ++++ + VVSP+I I DTF+ L GGFDW+L
Sbjct: 178 CECNIGWLEPLLHRVSQDRTIVVSPIIDVISMDTFDYIGASSELR-------GGFDWSLH 230
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +R + K+ EP+ TP +AGGLFSI++ F + G YD DIWGGEN E+SF
Sbjct: 231 FKWDGFTPAQRAKRKSPIEPIKTPMIAGGLFSINRQRFIETGKYDDQMDIWGGENFEISF 290
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + IP RKRH P P G + K + +
Sbjct: 291 RTWMCGGSLEIIPCSRVGHVFRKRH-----PYVFP--GGNAMTYMKNTKRAAEVWMDNYK 343
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------VSNDWSGMCID 222
+ + D G + SR LR+ L C +F WY++ +N + +
Sbjct: 344 DYYYSARPSAKGRDMGSIKSRVALRKRLNCTTFDWYMKNVYPELSVPSSTNNKHGKLKQN 403
Query: 223 SACKPTDMHK---PVGLYPCHK-QGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPC 275
+ C T H+ PVGL C + + G Q W ++ G IR CL+ G V L C
Sbjct: 404 NLCLDTLGHQAGEPVGLQDCQQSRQGYQDWSIAMKGLIRHLNLCLEARGQIVHLQYC 460
>gi|395828928|ref|XP_003787614.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Otolemur garnettii]
Length = 678
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 77/222 (34%), Positives = 110/222 (49%), Gaps = 32/222 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRIKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF-------DIWG 176
+ W + + + G +F + G ++ ++W
Sbjct: 315 RV-WMC----------GGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAEVWM 363
Query: 177 GENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
E + + + FG++ SR +LR+NL C+SFKWYLE
Sbjct: 364 DEYKQYYYAARPFALERPFGNIESRLDLRKNLRCQSFKWYLE 405
>gi|344235750|gb|EGV91853.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Cricetulus griseus]
Length = 797
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 139/315 (44%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 198 CEVNIEWLQPMLQRVMEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 250
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 251 FKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 310
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 311 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFPE--GNALTYIRNTKRTAEVWMDEYK 363
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE G+
Sbjct: 364 QYYYEARPSAIGKAFGSVATRIEQRKKMDCKSFRWYLENVYPELTVPAKEVLPGVIKQGV 423
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C + Q W+ S H I++ CL G
Sbjct: 424 NCLESQGQNTAGDLLLGMGICRGSAKSPPPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 482
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 483 SPVILQVCNPKEGKQ 497
>gi|321469963|gb|EFX80941.1| hypothetical protein DAPPUDRAFT_224457 [Daphnia pulex]
Length = 498
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 74/222 (33%), Positives = 112/222 (50%), Gaps = 34/222 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + W+QPL+ + N + VV+P+I I DTF+ P GGF+W L
Sbjct: 122 CEVNREWVQPLIARIQENRTFVVTPIIDIINSDTFQYTSSP--------LVRGGFNWGLH 173
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ K +++ +P+ +PTMAGGLF+I++ +F +G YD+G ++WGGENLE+SF
Sbjct: 174 FKWDSLPDDTLKTNEDFVKPILSPTMAGGLFAIEREYFFDIGEYDAGMNVWGGENLEISF 233
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS--GFDIWG 176
+ IP P +P E TY+S +W
Sbjct: 234 RIWMCGGRLEIIPCSRVGHVFRRRRPYGSPN------------GEDTMTYNSLRAAHVWL 281
Query: 177 GENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ +E F +GDV R+ LRR + C+SF WYL+
Sbjct: 282 DDYIEHFFHVRPDARHVSYGDVGPRQRLRRLMKCQSFDWYLK 323
>gi|354472196|ref|XP_003498326.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Cricetulus griseus]
Length = 513
Score = 126 bits (317), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 139/315 (44%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 169 CEVNIEWLQPMLQRVMEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 221
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 222 FKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 281
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 282 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 334
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE G+
Sbjct: 335 QYYYEARPSAIGKAFGSVATRIEQRKKMDCKSFRWYLENVYPELTVPAKEVLPGVIKQGV 394
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C + Q W+ S H I++ CL G
Sbjct: 395 NCLESQGQNTAGDLLLGMGICRGSAKSPPPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 453
Query: 268 GDVILYPCHGSKGNQ 282
VIL C+ +G Q
Sbjct: 454 SPVILQVCNPKEGKQ 468
>gi|391346326|ref|XP_003747427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Metaseiulus occidentalis]
Length = 622
Score = 126 bits (317), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 95/304 (31%), Positives = 146/304 (48%), Gaps = 35/304 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + W++PLL + S VV P+I + DTF FP +S + GGFDWNL
Sbjct: 259 CECNEGWIEPLLARIRDEPSKVVCPVIDVLSMDTFGY-FPA---SSDLR---GGFDWNLV 311
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W I + + A +P+ TP MAGGLF+I K FE+LG+YD+ DIWG ENLE+SF
Sbjct: 312 FKWEFITSKP----ELATDPIKTPAMAGGLFAITKKEFERLGSYDTQMDIWGAENLEMSF 367
Query: 124 KFNWHA-----IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ W I R H + +T G + + + + E
Sbjct: 368 RV-WQCGSGIEILPCSRVGHVFRKQHPYTFPGGGSGKVFARNSRRAAEVWMDDYKKYYYE 426
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWY-------LEVSNDWSG------MCIDSAC 225
+ + +GD++ R +LR L CKSF+WY L++ ++ G C+D+
Sbjct: 427 QVPAAKSVAYGDISERLKLREKLRCKSFEWYMKNVYPELKLPSNVHGYVRQNNRCLDTLG 486
Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI-LYPCHGSKGN 281
+D V +YPCH GGNQ + ++K+ + + C+ AG ++ L C+G
Sbjct: 487 AISD-GSTVHVYPCHYLGGNQDFRLAKNHLLMVHDMCVSLGSLAGQQLVKLRTCNGENSQ 545
Query: 282 QYFE 285
++
Sbjct: 546 KWVR 549
>gi|301763305|ref|XP_002917071.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Ailuropoda melanoleuca]
Length = 555
Score = 126 bits (317), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 95/314 (30%), Positives = 142/314 (45%), Gaps = 50/314 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 212 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 264
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 265 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 324
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 325 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 377
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + C+SF+WYL+ V G+
Sbjct: 378 QYYYEARPSAIGKAFGSVATRIEQRKKMNCRSFRWYLDNVYPELTVPVKEVLPGIIKQGV 437
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
C++S + + + +G+ C N Q W+ S H I++ CL + G
Sbjct: 438 NCLESQGQDSAGNFLLGMGICRGSAKNPPAPQAWLFSDH-LIQQQGKCLTVSSTSVTPGS 496
Query: 269 DVILYPCHGSKGNQ 282
V+L C+ +G Q
Sbjct: 497 LVLLQGCNPREGRQ 510
>gi|71987795|ref|NP_001022646.1| Protein GLY-6, isoform c [Caenorhabditis elegans]
gi|3047201|gb|AAC13676.1| GLY6c [Caenorhabditis elegans]
gi|14530525|emb|CAC42318.1| Protein GLY-6, isoform c [Caenorhabditis elegans]
Length = 562
Score = 126 bits (317), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 80/218 (36%), Positives = 111/218 (50%), Gaps = 21/218 (9%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE K WL+PLL + N V P+I I D+TF+ + + F GGF+WNLQ
Sbjct: 254 CECTKGWLEPLLTRIKLNRKAVPCPVIDIINDNTFQYQ-------KGIEMFRGGFNWNLQ 306
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P K+H + P+ +PTMAGGLFSI++ +FE+LG YD G DIWGGENLE+S
Sbjct: 307 FRWYGMPTAMAKQHLLDPTGPIESPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMS 366
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ +P + P P + G L + D W
Sbjct: 367 FRIWQCGGRVEILPCSHVGHVFRKSSPHDFPGKSSG----KVLNTNLLRVAEVWMDDWKH 422
Query: 178 ENLELSFKG----DFGDVTSRKELRRNLGCKSFKWYLE 211
+++ + DV+ R ELR+ L CKSFKWYL+
Sbjct: 423 YFYKIAPQAHRMRSSIDVSERVELRKKLNCKSFKWYLQ 460
>gi|350416150|ref|XP_003490858.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Bombus impatiens]
Length = 604
Score = 126 bits (317), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 106/323 (32%), Positives = 147/323 (45%), Gaps = 52/323 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A+N + VVSP+I I DDTF T S++ G F+W+L
Sbjct: 251 CECTVGWLEPLLEAVAKNRTRVVSPVIDIINDDTFSY-------TRSFELHWGAFNWDLH 303
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + R K R +N EP TP MAGGLFS+++ +F +LG+YD IWGGENLELS
Sbjct: 304 FRWLTLNGRLLKERRENIVEPFRTPAMAGGLFSMNRNYFFELGSYDDQMKIWGGENLELS 363
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY--DSGFDIW 175
F+ + P + P T GG+ I ++ D + +
Sbjct: 364 FRVWQCGGSIEIAPCSHVGHLFRKSSPY---TFPGGVGEILYGNLARVALVWMDEWAEFY 420
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDW------------------- 216
N E + D V R ELR+ L CK+F+WYL +N W
Sbjct: 421 FKFNTEAARLRDKQPVRGRLELRKRLQCKNFEWYL--NNIWPEHFFPKDDRFFGRILHIS 478
Query: 217 SGMCIDSACKPTDMHKPVG---LYPCHKQGG-NQFWMMSKHGEIRRDEA-CLDYAGGD-- 269
S CI +P G L C + +Q ++M+K G I DE+ CLD D
Sbjct: 479 SNKCIMRPTAKGTYSQPSGYAVLETCLPRPILSQMFVMTKDGIIMTDESVCLDAPDHDTQ 538
Query: 270 -----VILYPCHGSKGNQYFEYD 287
V + C G+ Q + YD
Sbjct: 539 HKTPKVKIMACSGN-DRQKWRYD 560
>gi|281349386|gb|EFB24970.1| hypothetical protein PANDA_005243 [Ailuropoda melanoleuca]
Length = 553
Score = 126 bits (317), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 95/314 (30%), Positives = 142/314 (45%), Gaps = 50/314 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 210 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 262
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 263 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 322
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 323 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 375
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + C+SF+WYL+ V G+
Sbjct: 376 QYYYEARPSAIGKAFGSVATRIEQRKKMNCRSFRWYLDNVYPELTVPVKEVLPGIIKQGV 435
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
C++S + + + +G+ C N Q W+ S H I++ CL + G
Sbjct: 436 NCLESQGQDSAGNFLLGMGICRGSAKNPPAPQAWLFSDH-LIQQQGKCLTVSSTSVTPGS 494
Query: 269 DVILYPCHGSKGNQ 282
V+L C+ +G Q
Sbjct: 495 LVLLQGCNPREGRQ 508
>gi|427797631|gb|JAA64267.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Rhipicephalus pulchellus]
Length = 641
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 94/314 (29%), Positives = 136/314 (43%), Gaps = 62/314 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + N + P+I I DTFE R + + F G F+W +
Sbjct: 290 CEVGINWLPPLLAPIRANRRAMTVPVIDGIDKDTFEYR----PVYHGRQHFRGIFEWGML 345
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP+ E KR K +EP +PT AGGLF+I++ +F +LG YD G +WGGEN ELSF
Sbjct: 346 YKEIEIPDEEIKRRKYHSEPYKSPTHAGGLFAINRKYFLELGGYDPGLLVWGGENFELSF 405
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-------LFSIDKAFFEKLG-----TYDSG 171
K W W P G +S K ++ G Y
Sbjct: 406 KI-WQC-----------GGMIYWVPCSRVGHVYRGFMPYSFGKLAQKRKGPLITVNYKRV 453
Query: 172 FDIWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-------------- 210
++W E E L+ D GD+ + LR L CKSF+W++
Sbjct: 454 VEVWMDEYKEYFYTREPLATYYDAGDLKQQLALREKLKCKSFRWFMKNVAYDVLKNFPLL 513
Query: 211 -------EVSNDWSGMCIDSACKPTDMHKP--VGLYPCHKQGGNQFWMMSKHGEIRRDEA 261
E+ +D + C+D+ H P L CH GGNQ + ++ G++ E
Sbjct: 514 PRNLYWGEIRHDATDQCLDA----MGAHPPSTAALTACHGTGGNQVFRLNAEGQLGLGER 569
Query: 262 CLDYAGGDVILYPC 275
C+D + + + C
Sbjct: 570 CMDASSHSMDVVYC 583
>gi|291410883|ref|XP_002721722.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 1,
partial [Oryctolagus cuniculus]
Length = 499
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 97/315 (30%), Positives = 138/315 (43%), Gaps = 51/315 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 155 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 207
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F IDKA+F LG YD+ DIWGGEN ELSF
Sbjct: 208 FKWEQIPLEQKITRTDPTRPIRTPVIAGGIFVIDKAWFNHLGKYDAQMDIWGGENFELSF 267
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 268 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 320
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CKSF+WYLE V G+
Sbjct: 321 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEVLPGIIKQGV 380
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
C++S + T +G+ C + Q W+ + H I++ CL G
Sbjct: 381 NCLESQGQNTAGDFLLGMGICRGSAKSPPPAQAWLFTDH-LIQQQGKCLAATSTLMSSPG 439
Query: 268 GDVILYPCHGSKGNQ 282
V L C+ +G Q
Sbjct: 440 SPVTLQVCNPREGKQ 454
>gi|427797629|gb|JAA64266.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Rhipicephalus pulchellus]
Length = 641
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 94/314 (29%), Positives = 136/314 (43%), Gaps = 62/314 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + N + P+I I DTFE R + + F G F+W +
Sbjct: 290 CEVGINWLPPLLAPIRANRRAMTVPVIDGIDKDTFEYR----PVYHGRQHFRGIFEWGML 345
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP+ E KR K +EP +PT AGGLF+I++ +F +LG YD G +WGGEN ELSF
Sbjct: 346 YKEIEIPDEEIKRRKYHSEPYKSPTHAGGLFAINRKYFLELGGYDPGLLVWGGENFELSF 405
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-------LFSIDKAFFEKLG-----TYDSG 171
K W W P G +S K ++ G Y
Sbjct: 406 KI-WQC-----------GGMIYWVPCSRVGHVYRGFMPYSFGKLAQKRKGPLITVNYKRV 453
Query: 172 FDIWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-------------- 210
++W E E L+ D GD+ + LR L CKSF+W++
Sbjct: 454 VEVWMDEYKEYFYTREPLATYYDAGDLKQQLALREKLKCKSFRWFMKNVAYDVLKNFPLL 513
Query: 211 -------EVSNDWSGMCIDSACKPTDMHKP--VGLYPCHKQGGNQFWMMSKHGEIRRDEA 261
E+ +D + C+D+ H P L CH GGNQ + ++ G++ E
Sbjct: 514 PRNLYWGEIRHDATDQCLDA----MGAHPPSTAALTACHGTGGNQVFRLNAEGQLGLGER 569
Query: 262 CLDYAGGDVILYPC 275
C+D + + + C
Sbjct: 570 CMDASSHSMDVVYC 583
>gi|345497732|ref|XP_001601595.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Nasonia vitripennis]
Length = 610
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 101/326 (30%), Positives = 145/326 (44%), Gaps = 54/326 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +++N + VVSP+I I DDTF T S++ G F+W+L
Sbjct: 254 CECTAGWLEPLLEAISKNRTRVVSPVIDIINDDTFSY-------TRSFELHWGAFNWDLH 306
Query: 64 FNWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + R+R +N +P TP MAGGLFS+D+ +F +LG+YD IWGGENLELS
Sbjct: 307 FRWLMLNGALLRERRENIVDPFKTPAMAGGLFSMDREYFFELGSYDEHMRIWGGENLELS 366
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTP-----TMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + P + P P + G L + + ++ G + F
Sbjct: 367 FRVWQCGGSVEIAPCSHVGHIFRKSSPYTFPGGVDEILYGNLARVALVWMDEWGKFYFNF 426
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSND 215
N + D + SR ELR L CKSF+WYL+ + +
Sbjct: 427 ------NPQAQRVRDKQQIRSRLELRERLKCKSFEWYLDNVWPDHFFPKDDRFFGYILHP 480
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHK----QGGNQFWMMSKHGEIRRDEA-CLDYAGGD- 269
+ C+ +P G +Q ++M K G I DE+ CLD D
Sbjct: 481 SNKKCLMRPMSKGAYSQPSGFVAYQDCIVPPNLSQMFVMRKDGVIMTDESVCLDAPEKDN 540
Query: 270 ------VILYPCHGSKGNQYFEYDYK 289
V L C G +Q +EYD K
Sbjct: 541 RHEKPKVKLMACSGF-ASQKWEYDEK 565
>gi|270006170|gb|EFA02618.1| hypothetical protein TcasGA2_TC008338 [Tribolium castaneum]
Length = 613
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 107/329 (32%), Positives = 146/329 (44%), Gaps = 70/329 (21%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
E WL PLL+ +A++ V P I I +TFE R + + G FDW +F
Sbjct: 251 EANVNWLPPLLEPIAQDYKTCVCPFIDVIQYETFEYR-------AQDEGARGAFDW--EF 301
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
+ +P ++ EP +P MAGGLF+I + FF +LG YD G DIWGGE ELSFK
Sbjct: 302 FYKRLPLLPEDL-EHPTEPFKSPVMAGGLFAISRKFFWELGGYDEGLDIWGGEQYELSFK 360
Query: 125 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-------TYDSGFDIWGG 177
W V P G A F G Y ++W
Sbjct: 361 I-WQC-----------GGLMVDAPCSRVGHIYRKYAPFPNPGKGDFVGRNYRRVAEVWMD 408
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-VSNDW------------- 216
E E +K D GD+T +K LR L CK FKW++E V+ D
Sbjct: 409 EYAEYLYKRRPHYRDIDPGDLTKQKALREKLHCKPFKWFMEKVAFDLPLKYPPIEPGDFG 468
Query: 217 ---------SGMCIDSACKPTDMHKPVGLYPCHK---QGGNQFWMMSKHGEIRR--DEAC 262
+C+DS K D + +GL C K + G Q + ++ H ++R C
Sbjct: 469 VGEIRNLAAPELCVDSGHK--DRDQVIGLAECVKGTNKNGEQNFALTWHKDLRVKGKTLC 526
Query: 263 LDYAG----GDVILYPCHGSKGNQYFEYD 287
LD + D++LYPCHGS+GNQY+ YD
Sbjct: 527 LDVSDPNDKADIVLYPCHGSQGNQYWRYD 555
>gi|268370155|ref|NP_001161257.1| polypeptide GalNAc transferase 6-like [Tribolium castaneum]
Length = 591
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 107/331 (32%), Positives = 147/331 (44%), Gaps = 70/331 (21%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
E WL PLL+ +A++ V P I I +TFE R + + G FDW +F
Sbjct: 229 EANVNWLPPLLEPIAQDYKTCVCPFIDVIQYETFEYR-------AQDEGARGAFDW--EF 279
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
+ +P ++ EP +P MAGGLF+I + FF +LG YD G DIWGGE ELSFK
Sbjct: 280 FYKRLPLLPEDL-EHPTEPFKSPVMAGGLFAISRKFFWELGGYDEGLDIWGGEQYELSFK 338
Query: 125 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-------TYDSGFDIWGG 177
W V P G A F G Y ++W
Sbjct: 339 I-WQC-----------GGLMVDAPCSRVGHIYRKYAPFPNPGKGDFVGRNYRRVAEVWMD 386
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-VSNDW------------- 216
E E +K D GD+T +K LR L CK FKW++E V+ D
Sbjct: 387 EYAEYLYKRRPHYRDIDPGDLTKQKALREKLHCKPFKWFMEKVAFDLPLKYPPIEPGDFG 446
Query: 217 ---------SGMCIDSACKPTDMHKPVGLYPCHK---QGGNQFWMMSKHGEIRR--DEAC 262
+C+DS K D + +GL C K + G Q + ++ H ++R C
Sbjct: 447 VGEIRNLAAPELCVDSGHK--DRDQVIGLAECVKGTNKNGEQNFALTWHKDLRVKGKTLC 504
Query: 263 LDYAG----GDVILYPCHGSKGNQYFEYDYK 289
LD + D++LYPCHGS+GNQY+ YD +
Sbjct: 505 LDVSDPNDKADIVLYPCHGSQGNQYWRYDVE 535
>gi|108935842|sp|Q8BVG5.2|GLT14_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 14;
AltName: Full=Polypeptide GalNAc transferase 14;
Short=GalNAc-T14; Short=pp-GaNTase 14; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 14;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 14
Length = 550
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 139/320 (43%), Gaps = 64/320 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 255 FQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RK+H P P + + +
Sbjct: 315 RVWMCGGGLEIIPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG++ +R LR+NL C++FKW LE V D S
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNIENRLNLRKNLHCQTFKWNLENVYPELRVPPDSSIQ 420
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLD-- 264
C++S + + + L PC K G+ Q W + +I ++E CL
Sbjct: 421 KGNIRQRQKCLES--QKQKKQEILRLSPCAKVKGDGAKSQVWAFTYTQQIIQEELCLSVV 478
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 479 TLFPGAPVVLALCKNGDERQ 498
>gi|348539520|ref|XP_003457237.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Oreochromis niloticus]
Length = 619
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 96/332 (28%), Positives = 141/332 (42%), Gaps = 73/332 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + ++ VV P+I I DT L + P + GGF+W L
Sbjct: 224 CEVNQAWLQPLLAPIQKDHRTVVCPVIDIISADT--LAYSPSPIVR------GGFNWGLH 275
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + A+ P+ +PTMAGGLF++++ +F +LG YD+G DIWGGENLE+SF
Sbjct: 276 FKWDPVPPSELSGPEGASGPIRSPTMAGGLFAMNRKYFNELGQYDAGMDIWGGENLEISF 335
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP----TMAGGLFSIDKAFFEKLGTYDSGFDI 174
+ IP P +P TMA + + +
Sbjct: 336 RIWMCGGQLFIIPCSRVGHIFRKRRPYGSPGGHDTMAHNSLRLAHVWMD----------- 384
Query: 175 WGGENLELSFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------------------ 211
G + LS + + +GD+ R LR+ L C SF+WYL+
Sbjct: 385 -GYKEQYLSLRPELRNRSYGDIGERVALRKRLQCHSFRWYLDTVYPEMQTAANGNKQQPL 443
Query: 212 ----------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGE 255
+ N C+ + + + V L PC + +Q W + G+
Sbjct: 444 FINKGLKRPKVLQRGRLRNLAIRRCLVAQGRASQKGGAVVLRPCDPRDPDQDWAYDEEGQ 503
Query: 256 -IRRDEACLDYAGGDVI----LYPCHGSKGNQ 282
I CLD + L CHGS G+Q
Sbjct: 504 LILAGLLCLDVSEVRTFDPPRLMKCHGSGGSQ 535
>gi|26347119|dbj|BAC37208.1| unnamed protein product [Mus musculus]
Length = 550
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 139/320 (43%), Gaps = 64/320 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDWSLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 255 FQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ IP RK+H P P + + +
Sbjct: 315 RVWMCGGGLEIIPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360
Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
+W E + + + FG++ +R LR+NL C++FKW LE V D S
Sbjct: 361 VWMDEYKQYYYAARPFALERHFGNIENRLNLRKNLHCQTFKWNLENVYPELRVPPDSSIQ 420
Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLD-- 264
C++S + + + L PC K G+ Q W + +I ++E CL
Sbjct: 421 KGNIRQRQKCLES--QKQKKQEILRLSPCAKVKGDGAKSQVWAFTYTQQIIQEELCLSVV 478
Query: 265 --YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 479 TLFPGAPVVLALCKNGDERQ 498
>gi|307186272|gb|EFN71935.1| Polypeptide N-acetylgalactosaminyltransferase 35A [Camponotus
floridanus]
Length = 667
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 91/304 (29%), Positives = 143/304 (47%), Gaps = 42/304 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV + W++PLL +A + + V P+I I DTF+ P GGF+W L
Sbjct: 295 IEVNEIWIEPLLSRIAYSKTIVPMPVIDIINADTFQYTGSP--------LVRGGFNWGLH 346
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P K + +P+ +PTMAGGLF+ID+ +F K+G YD+G D+WGGENLE+SF
Sbjct: 347 FKWDNLPIGTLKHENDFVKPIKSPTMAGGLFAIDREYFIKIGEYDTGMDVWGGENLEISF 406
Query: 124 KF-----NWHAIPER------ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
+ + IP R+R + +P TM + + ++ Y
Sbjct: 407 RIWMCGGSIELIPCSRVGHVFRRRRPYGSDDP--HDTMLKNSLRVAHVWMDEYKDY---- 460
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDM-- 230
L+ + D+GD++ R LR+ L CK+F WYL+V + D+ + D
Sbjct: 461 ------FLKNAKAIDYGDISERLALRQKLECKTFDWYLKVVYPELTLPDDTEKRLKDKWS 514
Query: 231 ---HKPVGLYPCHKQGGN---QFWM-MSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQY 283
+P + P H + N Q+ + +S + E + G +IL PC K +
Sbjct: 515 KLEQRP--MQPWHSRKRNYTDQYQIRLSNSVLCIQSEKDIKTKGSKLILMPCLRIKSQMW 572
Query: 284 FEYD 287
+E D
Sbjct: 573 YETD 576
>gi|395504161|ref|XP_003756425.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Sarcophilus harrisii]
Length = 563
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 95/314 (30%), Positives = 135/314 (42%), Gaps = 52/314 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L GGFDW+L
Sbjct: 222 CEVNSEWLQPMLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 274
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 275 FKWEQIPIEQKMSRTDPTQPIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSF 334
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + K + +
Sbjct: 335 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYDFP--EGNALTYIKNTKRTAEVWMDEYK 387
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGM-------------- 219
+ E + FG + R+E R+ + CKSF+WYLE N + +
Sbjct: 388 QYYYEARPSAIGKSFGSIADREEQRKKMNCKSFQWYLE--NVYPELKIPEKEVIPGIIKQ 445
Query: 220 ---CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDY----AGG 268
C++S + T + V + C N Q W+ S IR+ + CL G
Sbjct: 446 GTNCLESQGQDTAGNNLVVMGGCKGTSNNPLMTQEWVFS-DPVIRQQDKCLSITSFSTGS 504
Query: 269 DVILYPCHGSKGNQ 282
V L C+ Q
Sbjct: 505 QVTLEVCNQKDDRQ 518
>gi|195334637|ref|XP_002033984.1| GM21620 [Drosophila sechellia]
gi|194125954|gb|EDW47997.1| GM21620 [Drosophila sechellia]
Length = 601
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 96/324 (29%), Positives = 149/324 (45%), Gaps = 45/324 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
CE W +PLL + + + V+ P+I I + F+ T+ YK F +GGF WN
Sbjct: 247 CEGNIGWCEPLLQRIKESRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 300
Query: 63 QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
F+W +PERE++R + P ++PTMAGGLF+ID+ +F ++G+YD D WGG
Sbjct: 301 HFDWINLPEREKQRQRRECRQEREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGG 360
Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
ENLE+SF+ IP P P I+ A L D
Sbjct: 361 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 418
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
+I+ +L F D GDVT R LR+ L CKSF+WYL +V
Sbjct: 419 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFEWYLKNIYPEKFVPTKDVQGWGKVH 478
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
+ +C+D + + GLYPC K +Q + + +R + +C +
Sbjct: 479 AVNANLCLDDLLQNNEKPYNAGLYPCGKVLQKSQLFSFTNTNALRNELSCATVQHSESPP 538
Query: 270 --VILYPC-HGSKGNQYFEYDYKY 290
V++ PC + N+ + Y++++
Sbjct: 539 YRVVMVPCMENDEFNEQWRYEHQH 562
>gi|241651003|ref|XP_002411252.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215503882|gb|EEC13376.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 478
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 98/314 (31%), Positives = 140/314 (44%), Gaps = 62/314 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + N + P+I I DTFE R + + F G F+W +
Sbjct: 173 CEVGINWLPPLLAPIRANRYTMTVPVIDGIDKDTFEYR----PVYHGGQHFRGIFEWGML 228
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IPE E KR K +EP +PT AGGLF+ID+ +F KLG YD G +WGGEN ELSF
Sbjct: 229 YKEIEIPEEEIKRRKYHSEPYKSPTHAGGLFAIDRKYFLKLGGYDPGLLVWGGENFELSF 288
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-------LFSIDKAFFEKLG-----TYDSG 171
K W W P G +S K ++ G Y
Sbjct: 289 KI-WQC-----------GGSIYWVPCSRVGHVYRGFMPYSFGKLAHKRKGPIVTVNYKRV 336
Query: 172 FDIWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-------------- 210
++W E E ++ D GD++ + ELR++LGCK F W++
Sbjct: 337 VEVWMDEYKEYFYTREPMARHYDPGDLSGQLELRQSLGCKGFDWFMKNVSYDVLKNFPLL 396
Query: 211 -------EVSNDWSGMCIDSACKPTDMHKP--VGLYPCHKQGGNQFWMMSKHGEIRRDEA 261
E+ +G C+D+ + H P V + CH GGNQ + ++ G++ E
Sbjct: 397 PRNIHWGEIRTMVTGQCLDT----MNAHPPSTVSVSSCHGTGGNQIFRLNAEGQLGVGER 452
Query: 262 CLDYAGGDVILYPC 275
C+D + + L C
Sbjct: 453 CVDASSHSMQLVFC 466
>gi|158286701|ref|XP_565317.3| AGAP006881-PA [Anopheles gambiae str. PEST]
gi|157020594|gb|EAL41927.3| AGAP006881-PA [Anopheles gambiae str. PEST]
Length = 587
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 96/334 (28%), Positives = 139/334 (41%), Gaps = 71/334 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSH---VVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDW 60
CE WL+PLL+++A N + V P I + + T L+ + G FDW
Sbjct: 225 CECLAGWLEPLLELVASNQENRKVVAVPTIDWLNETTLALQ------VGASSGLYGAFDW 278
Query: 61 NLQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 120
NL F W +R + +N EP TP MAGGLF I+KAFF +LG YD G ++GGEN+E
Sbjct: 279 NLSFQWRPRYDRLQAPQENLLEPFDTPVMAGGLFCIEKAFFAQLGWYDPGLQVYGGENME 338
Query: 121 LSFKF-----NWHAIP------------------ERERK---RHKNAAEPVWTPTMAGGL 154
LSFK +P +ER R+ VW A L
Sbjct: 339 LSFKVWMCGGAIRTVPCSHVAHIQKRNNPYIGSYTKERDLTMRNSLRVAEVWMDEYAEFL 398
Query: 155 FSIDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE--- 211
+ + + L + S N+ L +R++LR LGCKSF+WYL+
Sbjct: 399 YRLHPDYRALLASRTSH----SLSNVNLD---------ARRQLRSELGCKSFRWYLQHVF 445
Query: 212 ----------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGE 255
N+ +C+ + + + L CH GG Q W K GE
Sbjct: 446 PEQDDPSEAQAAGWIRHENEAGQLCLTWPMR----DRSLALLHCHGLGGQQIWFHRKTGE 501
Query: 256 IRRDEACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
I R+ CL +V + C + + + Y+
Sbjct: 502 IAREGHCLGVDSAEVTIALCSSEGSSGAYRWLYR 535
>gi|355689592|gb|AER98884.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 [Mustela putorius
furo]
Length = 609
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 105/351 (29%), Positives = 149/351 (42%), Gaps = 99/351 (28%)
Query: 4 CEVQKRWL---QPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDW 60
CEV WL QPLL + ++ VV P+I I DT SS GGF+W
Sbjct: 248 CEVNVMWLMWLQPLLAAIQQDRRTVVCPVIDIISADTLAY--------SSSPVVRGGFNW 299
Query: 61 NLQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 120
L F W +P E + A P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE
Sbjct: 300 GLHFKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLE 359
Query: 121 LSFKFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFD 173
+SF+ +W M GG LF I + F K Y S G D
Sbjct: 360 ISFR--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD 396
Query: 174 IWGGENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE---- 211
+L L S + D +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 397 TMTHNSLRLAHVWLDDYKEQYFSLRPDLRTKSYGNISERVELRKKLGCKSFKWYLDNIYP 456
Query: 212 -------------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCH 240
+ + + C+ + +P+ V L C
Sbjct: 457 EMQISGPNAKPQQPIFINRGPKRPKILQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACD 516
Query: 241 KQGGNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
+Q W+ + +H + + CLD + L CHGS G+Q + +
Sbjct: 517 YSDPSQIWIYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 567
>gi|156537099|ref|XP_001602659.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Nasonia
vitripennis]
Length = 583
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 97/330 (29%), Positives = 143/330 (43%), Gaps = 65/330 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL +A ++ + P+I I TFE R + + G F+W +
Sbjct: 230 CEVNVNWLPPLLSPIAEDNKVMTVPIIDGIDHKTFEYR----PVYQEGHLYRGIFEWGML 285
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +P+RE K K+ +EP +PT AGGLF+I++ +F LG YD G +WGGEN ELSF
Sbjct: 286 YKENELPQREAKTRKHNSEPYRSPTHAGGLFAINREYFLSLGGYDEGLLVWGGENFELSF 345
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGL-------FSIDKAFFEKLG-----TYDSG 171
K W +W P G ++ K +K G Y
Sbjct: 346 KI-WQC-----------GGSILWVPCSHVGHVYRGFMPYNFGKLAQKKKGPLITINYKRV 393
Query: 172 FDIWGGENLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL------------- 210
+ W E + L+ D GD+T + E +R GCKSF+W++
Sbjct: 394 IETWFDEKHKEFFYTREPLARLLDHGDITEQLEFKRRKGCKSFQWFMDNIAYDVLDKFPE 453
Query: 211 --------EVSNDWSGMCIDSACKPTDMHKPVGLYP---CHKQGGNQFWMMSKHGEIRRD 259
E+ N + MC+D T H P L CH G NQ ++ G++
Sbjct: 454 LPPNIHWGEMKNVATQMCLD-----TMGHAPPNLMATSHCHGFGNNQLIRLNAKGQLGVG 508
Query: 260 EACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
E C++ G V L C + ++YD K
Sbjct: 509 ERCVEADGQGVKLAFCRLGTVDGPWQYDEK 538
>gi|327277504|ref|XP_003223504.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Anolis carolinensis]
Length = 612
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 101/327 (30%), Positives = 144/327 (44%), Gaps = 61/327 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F G T + G FDW +
Sbjct: 246 CEANVNWLPPLLDRIARNHKTIVCPMIDVIDHDHF------GYETQAGDAMRGAFDWEMY 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD+G +IWGGE E+SF
Sbjct: 300 YKRIPIPPELQK--PDPSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISF 357
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPT---MAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P PT +A L + + + ++ Y
Sbjct: 358 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPTGVSLARNLKRVAEVWMDEYAEY---IYQR 414
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDV ++KELR NL CKSF+W++ E+
Sbjct: 415 RPEYRHLS----AGDVATQKELRSNLNCKSFRWFMNEVAWDLRKFYPPVEPPAAAWGEIH 470
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
N + +C+D+ K + P+ + C K G W S +IR +
Sbjct: 471 NVGTSLCVDT--KHGALGSPLKIETCVKSRGEAAWNNVQVFTFSWREDIRPGDPQHTKKF 528
Query: 262 CLDYAGGD--VILYPCHGSKGNQYFEY 286
C D + V LY CHG KGNQ ++Y
Sbjct: 529 CFDAVSHNSPVTLYDCHGMKGNQLWKY 555
>gi|71987784|ref|NP_001022644.1| Protein GLY-6, isoform a [Caenorhabditis elegans]
gi|51315809|sp|O61394.1|GALT6_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 6;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 6; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6; Short=pp-GaNTase 6
gi|3047197|gb|AAC13674.1| GLY6a [Caenorhabditis elegans]
gi|3878104|emb|CAA19707.1| Protein GLY-6, isoform a [Caenorhabditis elegans]
Length = 618
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 80/218 (36%), Positives = 111/218 (50%), Gaps = 21/218 (9%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE K WL+PLL + N V P+I I D+TF+ + + F GGF+WNLQ
Sbjct: 254 CECTKGWLEPLLTRIKLNRKAVPCPVIDIINDNTFQYQ-------KGIEMFRGGFNWNLQ 306
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P K+H + P+ +PTMAGGLFSI++ +FE+LG YD G DIWGGENLE+S
Sbjct: 307 FRWYGMPTAMAKQHLLDPTGPIESPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMS 366
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ +P + P P + G L + D W
Sbjct: 367 FRIWQCGGRVEILPCSHVGHVFRKSSPHDFPGKSSGKVLNTNL----LRVAEVWMDDWKH 422
Query: 178 ENLELSFKG----DFGDVTSRKELRRNLGCKSFKWYLE 211
+++ + DV+ R ELR+ L CKSFKWYL+
Sbjct: 423 YFYKIAPQAHRMRSSIDVSERVELRKKLNCKSFKWYLQ 460
>gi|10436305|dbj|BAB14795.1| unnamed protein product [Homo sapiens]
Length = 457
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 80/218 (36%), Positives = 109/218 (50%), Gaps = 24/218 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + + + VV P+I I DTF S GGFDW+L
Sbjct: 169 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 221
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + ++ R + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E+SF
Sbjct: 222 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 281
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P G + K + +
Sbjct: 282 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPD--GNANTYIKNTKRTAEVWMDEYK 334
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ + + FG+V SR +LR+NL C+SFKWYLE
Sbjct: 335 RYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLE 372
>gi|312068074|ref|XP_003137043.1| polypeptide N-acetylgalactosaminyltransferase [Loa loa]
Length = 547
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 77/222 (34%), Positives = 114/222 (51%), Gaps = 29/222 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE K W++PLL + N VV P+I I + TF + + F GGF+WNLQ
Sbjct: 230 CECTKGWMEPLLARIKENRKAVVCPVIDVINERTFAYQ-------KGIELFRGGFNWNLQ 282
Query: 64 FNWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+A+P E + R + +P+ +PTMAGGLFSID+ +FE++GTYD +IWGGEN+E+S
Sbjct: 283 FRWYALPPEMIKSRSNDPTKPIISPTMAGGLFSIDRKYFEEIGTYDHEMNIWGGENIEIS 342
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGG------LFSIDKAFFE--KLGTYD 169
+ +P A P P+ G L + + + + K Y
Sbjct: 343 LRVWQCGGRIEILPCSHVGHVFRRASPHDFPSHKSGTILNSNLLRVAEVWMDEWKFHFYR 402
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ ++ + DV+ R ELR+ L CKSFKW+L+
Sbjct: 403 TAPQVYKMR--------ETVDVSDRVELRKRLHCKSFKWFLD 436
>gi|71987788|ref|NP_001022645.1| Protein GLY-6, isoform b [Caenorhabditis elegans]
gi|3047199|gb|AAC13675.1| GLY6b [Caenorhabditis elegans]
gi|14530524|emb|CAC42317.1| Protein GLY-6, isoform b [Caenorhabditis elegans]
Length = 617
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 80/218 (36%), Positives = 111/218 (50%), Gaps = 21/218 (9%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE K WL+PLL + N V P+I I D+TF+ + + F GGF+WNLQ
Sbjct: 254 CECTKGWLEPLLTRIKLNRKAVPCPVIDIINDNTFQYQ-------KGIEMFRGGFNWNLQ 306
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P K+H + P+ +PTMAGGLFSI++ +FE+LG YD G DIWGGENLE+S
Sbjct: 307 FRWYGMPTAMAKQHLLDPTGPIESPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMS 366
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ +P + P P + G L + D W
Sbjct: 367 FRIWQCGGRVEILPCSHVGHVFRKSSPHDFPGKSSGKVLNTNL----LRVAEVWMDDWKH 422
Query: 178 ENLELSFKG----DFGDVTSRKELRRNLGCKSFKWYLE 211
+++ + DV+ R ELR+ L CKSFKWYL+
Sbjct: 423 YFYKIAPQAHRMRSSIDVSERVELRKKLNCKSFKWYLQ 460
>gi|393911417|gb|EFO27036.2| polypeptide N-acetylgalactosaminyltransferase [Loa loa]
Length = 597
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 77/222 (34%), Positives = 114/222 (51%), Gaps = 29/222 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE K W++PLL + N VV P+I I + TF + + F GGF+WNLQ
Sbjct: 219 CECTKGWMEPLLARIKENRKAVVCPVIDVINERTFAYQ-------KGIELFRGGFNWNLQ 271
Query: 64 FNWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+A+P E + R + +P+ +PTMAGGLFSID+ +FE++GTYD +IWGGEN+E+S
Sbjct: 272 FRWYALPPEMIKSRSNDPTKPIISPTMAGGLFSIDRKYFEEIGTYDHEMNIWGGENIEIS 331
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGG------LFSIDKAFFE--KLGTYD 169
+ +P A P P+ G L + + + + K Y
Sbjct: 332 LRVWQCGGRIEILPCSHVGHVFRRASPHDFPSHKSGTILNSNLLRVAEVWMDEWKFHFYR 391
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ ++ + DV+ R ELR+ L CKSFKW+L+
Sbjct: 392 TAPQVYKMR--------ETVDVSDRVELRKRLHCKSFKWFLD 425
>gi|392923087|ref|NP_001256888.1| Protein GLY-4, isoform c [Caenorhabditis elegans]
gi|255068800|emb|CBA11615.1| Protein GLY-4, isoform c [Caenorhabditis elegans]
Length = 480
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 83/227 (36%), Positives = 111/227 (48%), Gaps = 41/227 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E ++WL+PLL +A N VV+P+I I D F L GGFDW L
Sbjct: 242 IECNQKWLEPLLARIAENPKAVVAPIIDVINVDNFNYVGASADLR-------GGFDWTLV 294
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + E+ RK RH + P+ +PTMAGGLF+I K +F +LGTYD ++WGGENLE+S
Sbjct: 295 FRWEFMNEQLRKERHAHPTAPIRSPTMAGGLFAISKEWFNELGTYDLDMEVWGGENLEMS 354
Query: 123 FKFNWHAIPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
F+ W E RK+H P P +G +F +
Sbjct: 355 FRV-WQCGGSLEIMPCSRVGHVFRKKH-----PYTFPGGSGNVFQKNTR---------RA 399
Query: 172 FDIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
++W E + K +FGD+T R +R L CKSFKWYLE
Sbjct: 400 AEVWMDEYKAIYLKNVPSARFVNFGDITDRLAIRDRLQCKSFKWYLE 446
>gi|72000999|ref|NP_507850.2| Protein GLY-4, isoform b [Caenorhabditis elegans]
gi|27151758|emb|CAB81985.3| Protein GLY-4, isoform b [Caenorhabditis elegans]
Length = 453
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 83/227 (36%), Positives = 111/227 (48%), Gaps = 41/227 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E ++WL+PLL +A N VV+P+I I D F L GGFDW L
Sbjct: 242 IECNQKWLEPLLARIAENPKAVVAPIIDVINVDNFNYVGASADLR-------GGFDWTLV 294
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + E+ RK RH + P+ +PTMAGGLF+I K +F +LGTYD ++WGGENLE+S
Sbjct: 295 FRWEFMNEQLRKERHAHPTAPIRSPTMAGGLFAISKEWFNELGTYDLDMEVWGGENLEMS 354
Query: 123 FKFNWHAIPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
F+ W E RK+H P P +G +F +
Sbjct: 355 FRV-WQCGGSLEIMPCSRVGHVFRKKH-----PYTFPGGSGNVFQKNTR---------RA 399
Query: 172 FDIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
++W E + K +FGD+T R +R L CKSFKWYLE
Sbjct: 400 AEVWMDEYKAIYLKNVPSARFVNFGDITDRLAIRDRLQCKSFKWYLE 446
>gi|195172039|ref|XP_002026809.1| GL27027 [Drosophila persimilis]
gi|194111748|gb|EDW33791.1| GL27027 [Drosophila persimilis]
Length = 567
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 89/292 (30%), Positives = 141/292 (48%), Gaps = 41/292 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + R+ + + P+I I FE R P T ++ F G F+W +
Sbjct: 238 CEVNLNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYR--PVYGTDNH--FRGIFEWGML 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +P RE++R + +EP +PT AGGLF+I++ +F +LG YD G +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRTHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSID-KAFFEKLGTYDSGFDIWGGENLEL 182
K W ++K+ G L +I+ K E +D + L
Sbjct: 354 KI-WQCGGSIDKKK--------------GPLITINYKRVIETW--FDDTHKEYFYTREPL 396
Query: 183 SFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGM-----------CIDSACK 226
+ D GD+T + L++ LGCKSF+W++ +V + + G+ C
Sbjct: 397 ARYLDMGDITEQLALKKRLGCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASDGCL 456
Query: 227 PTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPC 275
+ H+P +GL CH G NQ ++ G++ E C++ + L C
Sbjct: 457 DSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGIKLAVC 508
>gi|195583656|ref|XP_002081633.1| GD11122 [Drosophila simulans]
gi|194193642|gb|EDX07218.1| GD11122 [Drosophila simulans]
Length = 601
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 96/324 (29%), Positives = 149/324 (45%), Gaps = 45/324 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
CE W +PLL + + + V+ P+I I + F+ T+ YK F +GGF WN
Sbjct: 247 CEGNIGWCEPLLQRIKESRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 300
Query: 63 QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
F+W +PERE++R + P ++PTMAGGLF+ID+ +F ++G+YD D WGG
Sbjct: 301 HFDWINLPEREKQRQRRECRQEREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGG 360
Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
ENLE+SF+ IP P P I+ A L D
Sbjct: 361 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 418
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
+I+ +L F D GDVT R LR+ L CKSF+WYL +V
Sbjct: 419 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFEWYLKNIYPEKFVPTKDVQGWGKVH 478
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
+ +C+D + + GLYPC K +Q + + +R + +C +
Sbjct: 479 AVNANICLDDLLQNNEKPYNAGLYPCGKVLQKSQLFSFTNTNVLRNELSCATVQHSESPP 538
Query: 270 --VILYPC-HGSKGNQYFEYDYKY 290
V++ PC + N+ + Y++++
Sbjct: 539 YRVVMVPCMENDEFNEQWRYEHQH 562
>gi|156544564|ref|XP_001602677.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Nasonia vitripennis]
Length = 637
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/208 (36%), Positives = 106/208 (50%), Gaps = 9/208 (4%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV K WL+PLL ++ + + V P+I I DTF+ SS GGF+W L
Sbjct: 269 IEVNKMWLEPLLARISHSRTIVPMPVIDIINADTFQY--------SSSPLVRGGFNWGLH 320
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P ++ +P+ +PTMAGGLF++D+ +F +LG YD+G D+WGGENLE+SF
Sbjct: 321 FKWDSLPIGTLSLEQDFVKPIKSPTMAGGLFAMDRKYFFELGEYDAGMDVWGGENLEISF 380
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
+ W E GG D L D + L+
Sbjct: 381 RI-WMCGGSIELIPCSRVGHVFRRRRPYGGNDQQDTMLKNSLRVAYVWMDQYKKYFLKNV 439
Query: 184 FKGDFGDVTSRKELRRNLGCKSFKWYLE 211
K D+GD+T R++LR+ L CK F WYLE
Sbjct: 440 KKIDYGDITERQQLRQKLHCKDFAWYLE 467
>gi|170056949|ref|XP_001864263.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
gi|167876550|gb|EDS39933.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
Length = 608
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 108/327 (33%), Positives = 145/327 (44%), Gaps = 67/327 (20%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
EV WL PLL+ +A++ V PLI I DTFE R S + G FDW +F
Sbjct: 245 EVNVNWLPPLLEPIAQDYRTCVCPLIDVIVHDTFEYR-------SQDEGKRGAFDW--KF 295
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
+ +P R + EP +P MAGGLF+I FF +LG YD G DIWGGE ELSFK
Sbjct: 296 YYKRLPLRPGDL-DDPTEPFESPIMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFK 354
Query: 125 FNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGT------YDSGFDIWGG 177
W V P + G ++ F G + ++W
Sbjct: 355 I-WQC-----------GGRMVDAPCSRVGHVYRGYSPFPNPRGVNFVTRNFKRVAEVWMD 402
Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE------------------V 212
E + + K + GD+T +K LR L CK FKW+LE
Sbjct: 403 EYKQFLYERNPQFDKTNPGDLTKQKALRERLKCKPFKWFLEEVAPDLLVRYPLREPLPFA 462
Query: 213 SNDWSGMCIDSACKPTDMHK---PVGLYPC-----HKQGGNQFWMMSKHGEIRRD--EAC 262
S + C T HK P+G++ C H Q NQF+ ++ + +IR E C
Sbjct: 463 SGRVQSVANPKLCLDTLNHKAKEPIGVFGCAPNKTHPQ-NNQFFTLTYYRDIRAASVEKC 521
Query: 263 LDYAGGD--VILYPCHGSKGNQYFEYD 287
LD + D VIL+ CH S+GNQ + YD
Sbjct: 522 LDASSDDAEVILFNCHESQGNQLWRYD 548
>gi|363734723|ref|XP_003641443.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 isoform 2
[Gallus gallus]
Length = 557
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 94/312 (30%), Positives = 132/312 (42%), Gaps = 48/312 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L GGFDW+L
Sbjct: 216 CEVNSEWLQPMLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 268
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + + + TP +AGG+F I+K++F LG YD+ DIWGGEN ELSF
Sbjct: 269 FKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVINKSWFNHLGKYDTQMDIWGGENFELSF 328
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + K + +
Sbjct: 329 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYDFP--EGNALTYIKNTKRTAEVWMDEYK 381
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSN---------------DWSG 218
+ E + +G + R E RR L CKSF+WYLE G
Sbjct: 382 QYYYEARPSAIGKSYGSIADRVEQRRKLNCKSFQWYLEKVYPELKVPEKDLIPGIIRQGG 441
Query: 219 MCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDY----AGGDV 270
C++S + T + G+ C N Q W+ S IR+ + CL G +
Sbjct: 442 NCLESWAQDTTGNTLAGIGNCKGTVNNPPVTQEWVFS-DPLIRQQDKCLSITSFSTGSHI 500
Query: 271 ILYPCHGSKGNQ 282
L C+ G Q
Sbjct: 501 TLEACNQKDGRQ 512
>gi|307215388|gb|EFN90069.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Harpegnathos
saltator]
Length = 493
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 99/324 (30%), Positives = 150/324 (46%), Gaps = 54/324 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + +N++ ++SP+I I D+TF T S++ G F+W+L
Sbjct: 139 CECTVGWLEPLLEAVGKNATRIISPVIDIINDNTFSY-------TRSFELHWGAFNWDLH 191
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + R K R ++ EP TP MAGGLFS+++ +F +LG+YD IWGGENLELS
Sbjct: 192 FRWLTLNGRLLKERRESIVEPFRTPAMAGGLFSMNRNYFFQLGSYDDQMRIWGGENLELS 251
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTP-----TMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + P + P P + G L + + ++ + F
Sbjct: 252 FRAWQCGGSIEIAPCSHVGHLFRKSSPYTFPGGVGDILYGNLVRVASVWMDQWAEFYFKF 311
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSND 215
+ E L +K V SR LR L CKSF+WYLE V +
Sbjct: 312 N---PEAARLRYK---QQVRSRLALREKLQCKSFEWYLENVWPEHFFPTDDRFFGRVIHA 365
Query: 216 WSGMCIDSACKPTDMHKPVG---LYPC-HKQGGNQFWMMSKHGEIRRDEA-CLDYAGGD- 269
+ C+ +P G L+ C + +Q ++M+K+G I DE+ CLD D
Sbjct: 366 TTNRCLMRPTAKGSYTQPSGHAVLHSCIPRPMLSQMFVMTKNGVIMTDESVCLDAPERDT 425
Query: 270 ------VILYPCHGSKGNQYFEYD 287
V + C G + Q ++YD
Sbjct: 426 QQKTPKVKIMACSG-RDRQKWQYD 448
>gi|47216191|emb|CAG01225.1| unnamed protein product [Tetraodon nigroviridis]
Length = 586
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 94/325 (28%), Positives = 138/325 (42%), Gaps = 79/325 (24%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP++ + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 253 CEVNTDWLQPMIQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 305
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F +DK++F +LG YD+ DIWGGEN ELSF
Sbjct: 306 FKWEQIPIEQKMARSDPTQPIRTPVIAGGIFVMDKSWFNRLGQYDTHMDIWGGENFELSF 365
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--------- 169
+ VW M GG I F K Y+
Sbjct: 366 R--------------------VW---MCGGSLEILPCSRVGHVFRKRHPYEFPEGNALTY 402
Query: 170 -----SGFDIWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLEVSNDWS 217
++W E + + FG +T R LR+ L CK F+WY+E N +
Sbjct: 403 IRNTRRAAEVWMDEYKQYYYSARPSAQGKAFGSITDRVSLRKKLNCKPFRWYME--NVYP 460
Query: 218 GMCIDSACKPTDMHKP------------VGLYPCHKQGGN----QFWMMSKHGEIRRDEA 261
+ + T + + +GL C G N Q W + + IR+ +
Sbjct: 461 ELRVPEQEAVTSVLRQGGLCLEARGAEWLGLAECRGVGTNRPQSQRWELIE-PLIRQQDL 519
Query: 262 CLDYA----GGDVILYPCHGSKGNQ 282
CL + G V + PC+ + Q
Sbjct: 520 CLAISAFSPGSKVKMEPCNAKEARQ 544
>gi|363734725|ref|XP_001231965.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 isoform 1
[Gallus gallus]
Length = 563
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 94/312 (30%), Positives = 132/312 (42%), Gaps = 48/312 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L GGFDW+L
Sbjct: 222 CEVNSEWLQPMLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 274
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + + + TP +AGG+F I+K++F LG YD+ DIWGGEN ELSF
Sbjct: 275 FKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVINKSWFNHLGKYDTQMDIWGGENFELSF 334
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + K + +
Sbjct: 335 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYDFP--EGNALTYIKNTKRTAEVWMDEYK 387
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSN---------------DWSG 218
+ E + +G + R E RR L CKSF+WYLE G
Sbjct: 388 QYYYEARPSAIGKSYGSIADRVEQRRKLNCKSFQWYLEKVYPELKVPEKDLIPGIIRQGG 447
Query: 219 MCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDY----AGGDV 270
C++S + T + G+ C N Q W+ S IR+ + CL G +
Sbjct: 448 NCLESWAQDTTGNTLAGIGNCKGTVNNPPVTQEWVFS-DPLIRQQDKCLSITSFSTGSHI 506
Query: 271 ILYPCHGSKGNQ 282
L C+ G Q
Sbjct: 507 TLEACNQKDGRQ 518
>gi|340711409|ref|XP_003394268.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Bombus terrestris]
Length = 604
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 105/323 (32%), Positives = 145/323 (44%), Gaps = 52/323 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ +A+N + VVSP+I I DDTF T S++ G F+W+L
Sbjct: 251 CECTVGWLEPLLEAVAKNRTRVVSPVIDIINDDTFSY-------TRSFELHWGAFNWDLH 303
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + R K R +N EP TP MAGGLFS+++ +F +LG+YD IWGGENLELS
Sbjct: 304 FRWLTLNGRLLKERRENIVEPFRTPAMAGGLFSMNRNYFFELGSYDDQMKIWGGENLELS 363
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY--DSGFDIW 175
F+ + P + P T GG+ I ++ D + +
Sbjct: 364 FRVWQCGGSIEIAPCSHVGHLFRKSSPY---TFPGGVGEILYGNLARVALVWMDEWAEFY 420
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDW------------------- 216
N E + D V R ELR+ L CK+F+WYL +N W
Sbjct: 421 FKFNTEAARLRDKQPVRGRLELRKRLQCKNFEWYL--NNIWPEHFFPKDDRFFGRILHIS 478
Query: 217 SGMCIDSACKPTDMHKPVG---LYPCHKQGG-NQFWMMSKHGEIRRDEA-CLDYAGGD-- 269
S CI +P G L C + +Q ++M+ G I DE+ CLD D
Sbjct: 479 SNKCIMRPTAKGTYSQPSGYAVLETCLPRPILSQMFVMTTDGIIMTDESVCLDAPDHDTQ 538
Query: 270 -----VILYPCHGSKGNQYFEYD 287
V + C G Q + YD
Sbjct: 539 HKTPKVKIMACSG-HSRQKWRYD 560
>gi|148706465|gb|EDL38412.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14, isoform CRA_a [Mus
musculus]
Length = 515
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 98/323 (30%), Positives = 141/323 (43%), Gaps = 67/323 (20%)
Query: 4 CEVQKRWLQPLL---DVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDW 60
CEV + WLQPLL + ++ + VV P+I I DTF S GGFDW
Sbjct: 164 CEVNRDWLQPLLHRVKEVLQDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDW 216
Query: 61 NLQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 120
+L F W + ++ + EP+ TP +AGGLF IDKA+F+ LG YD DIWGGEN E
Sbjct: 217 SLHFQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFE 276
Query: 121 LSFKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS 170
+SF+ IP RK+H P P + +
Sbjct: 277 ISFRVWMCGGGLEIIPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKR 322
Query: 171 GFDIWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDW 216
++W E + + + FG++ +R LR+NL C++FKWYLE V D
Sbjct: 323 TAEVWMDEYKQYYYAARPFALERPFGNIENRLNLRKNLHCQTFKWYLENVYPELRVPPDS 382
Query: 217 S---------GMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL 263
S C++S + + + L PC K G+ Q W + +I ++E CL
Sbjct: 383 SIQKGNIRQRQKCLES--QKQKKQEILRLSPCAKVKGDGAKSQVWAFTYTQQIIQEELCL 440
Query: 264 D----YAGGDVILYPCHGSKGNQ 282
+ G V+L C Q
Sbjct: 441 SVVTLFPGAPVVLALCKNGDERQ 463
>gi|326920610|ref|XP_003206562.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Meleagris gallopavo]
Length = 509
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 95/312 (30%), Positives = 132/312 (42%), Gaps = 48/312 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L GGFDW+L
Sbjct: 168 CEVNSEWLQPMLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 220
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + + + TP +AGG+F I+K++F LG YD+ DIWGGEN ELSF
Sbjct: 221 FKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVINKSWFNHLGKYDTQMDIWGGENFELSF 280
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + K + +
Sbjct: 281 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYDFP--EGNALTYIKNTKRTAEVWMDEYK 333
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV--------SNDW-------SG 218
+ E + +G + R E RR L CKSF+WYLE D G
Sbjct: 334 QYYYEARPSAIGKSYGSIADRVEQRRKLNCKSFQWYLEKVYPELKVPEKDLIPGIIRQGG 393
Query: 219 MCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDY----AGGDV 270
C++S + T + G+ C N Q W S IR+ + CL G +
Sbjct: 394 NCLESWAQDTTGNTLAGIGNCKGTVNNPPVTQEWAFS-DPLIRQQDKCLSITSFSTGSQI 452
Query: 271 ILYPCHGSKGNQ 282
L C+ G Q
Sbjct: 453 TLEACNQKDGRQ 464
>gi|348533009|ref|XP_003453998.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Oreochromis niloticus]
Length = 600
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 141/331 (42%), Gaps = 69/331 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +A+N +V P+I I D F G T + G FDW +
Sbjct: 235 CEANVNWLPPLLDRIAQNRKAIVCPMIDVIDHDNF------GYDTQAGDAMRGAFDWEMY 288
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP ++ + +EP +P MAGGLF++D+ +F +LG YD+G +IWGGE E+SF
Sbjct: 289 YKRIPIPPEMQR--DDPSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISF 346
Query: 124 KFNWHAIPERERKRHKNAAEPV--WTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
K W E + P G S+ K ++W E E
Sbjct: 347 KL-WMCGGRMEDIPCSRVGHIYRKYVPYKVPGGISLAKNL-------KRVAEVWMDEYAE 398
Query: 182 LSFKG-------DFGDVTSRKELRRNLGCKSFKWYL----------------------EV 212
++ GD+T++KELR L CKSFKW++ E+
Sbjct: 399 YVYQRRPEYRHLSAGDMTAQKELRTRLNCKSFKWFMNEVAWDLPKHYPPVEPPAAAWGEI 458
Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI-----RRD-------- 259
N SGMC++ K P+ L C K G W HG++ R D
Sbjct: 459 QNVGSGMCME--VKHFVSGSPIRLENCVKGRGEVGW---SHGQVLTFGWREDIRVGDPMH 513
Query: 260 --EACLDYA--GGDVILYPCHGSKGNQYFEY 286
+ C D V LY CHG KGNQ + Y
Sbjct: 514 TRKLCFDAVSHSSPVTLYDCHGMKGNQLWRY 544
>gi|391345232|ref|XP_003746894.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Metaseiulus occidentalis]
Length = 585
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 101/302 (33%), Positives = 138/302 (45%), Gaps = 46/302 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV +RWLQPLL + +N + V P+I I DTFE + P L GGF+W +
Sbjct: 223 VEVNERWLQPLLVPIQQNQTTVTCPVIDIINADTFE--YSPSPLVK------GGFNWGMH 274
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P+ K K P+ +PTMAGGLF+I K F +LG YD G D+WGGENLELSF
Sbjct: 275 FRWDNLPKGYFKSEKERIAPLPSPTMAGGLFAIHKDEFRRLGEYDWGMDVWGGENLELSF 334
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKR A T+A + + + Y
Sbjct: 335 RIWMCGGSLKIMPCSRVGHVFRKRRPYGASN-GEDTLAKNSLRVANVWMDDYKKY----- 388
Query: 174 IWGGENLELSFKG-DFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHK 232
+ K DFGD+++R ELR L CKSF WYL+ N + + + S
Sbjct: 389 ---YYRMRPDLKDIDFGDISARVELRNRLKCKSFDWYLK--NIYPDLQLPS--------N 435
Query: 233 PVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA------GGDVILYPCH-GSKGNQYFE 285
GL + Q M K +IR D+ C+ GG +L C SK +FE
Sbjct: 436 RTGLRNVNLYKRKQPTMTGKF-QIRVDKLCVQSQDSIFRRGGAFVLQKCDPHSKKQMWFE 494
Query: 286 YD 287
+
Sbjct: 495 TE 496
>gi|170039457|ref|XP_001847550.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
gi|167863027|gb|EDS26410.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
Length = 619
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 107/327 (32%), Positives = 145/327 (44%), Gaps = 67/327 (20%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
EV WL PLL+ +A++ V PLI I DTFE R S + G FDW +F
Sbjct: 256 EVNVNWLPPLLEPIAQDYRTCVCPLIDVIVHDTFEYR-------SQDEGKRGAFDW--KF 306
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
+ +P R + EP +P MAGGLF+I FF +LG YD G DIWGGE ELSFK
Sbjct: 307 YYKRLPLRPGDL-DDPTEPFESPIMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFK 365
Query: 125 FNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGT------YDSGFDIWGG 177
W V P + G ++ F G + ++W
Sbjct: 366 I-WQC-----------GGRMVDAPCSRVGHVYRGYSPFPNPRGVNFVTRNFKRVAEVWMD 413
Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE------------------V 212
E + + K + GD+T +K LR L CK FKW+LE
Sbjct: 414 EYKQFLYERNPQFDKTNPGDLTKQKALREKLKCKPFKWFLEEVAPDLLVRYPLREPLPFA 473
Query: 213 SNDWSGMCIDSACKPTDMHK---PVGLYPC-----HKQGGNQFWMMSKHGEIRRD--EAC 262
S + C T HK P+G++ C H Q NQF+ ++ + +IR E C
Sbjct: 474 SGRVQSVANPKLCLDTLNHKAKEPIGVFGCAPNKTHPQ-NNQFFTLTYYRDIRAASVEKC 532
Query: 263 LDYAG--GDVILYPCHGSKGNQYFEYD 287
LD + +VIL+ CH S+GNQ + YD
Sbjct: 533 LDASSDNAEVILFNCHESQGNQLWRYD 559
>gi|349732170|ref|NP_001231847.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1-like [Sus
scrofa]
Length = 557
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 95/314 (30%), Positives = 140/314 (44%), Gaps = 50/314 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKIAWTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + + + +
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
+ E + FG V +R E R+ + CK+F+WYLE V +
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKTFRWYLENVYPELTVPVKEVLPSIIKQGA 439
Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
C+++ + T + +G+ C N Q W+ S H I++ CL G
Sbjct: 440 NCLETQGQDTAGNFLLGMGICRGSAKNPPAAQAWLFSDH-LIQQQGKCLAATSTSISPGS 498
Query: 269 DVILYPCHGSKGNQ 282
V+L C+ +G Q
Sbjct: 499 LVVLQGCNPREGRQ 512
>gi|157107410|ref|XP_001649764.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108884050|gb|EAT48275.1| AAEL000639-PA [Aedes aegypti]
Length = 613
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 105/326 (32%), Positives = 150/326 (46%), Gaps = 65/326 (19%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
EV WL PL++ +A + V P I I DTF+ R + + G FDW +F
Sbjct: 250 EVNVNWLPPLIEPIAEDYRTCVCPFIDVIAHDTFQYR-------AQDEGKRGAFDW--KF 300
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
+ +P R + + EP +P MAGGLF+I FF +LG YD G DIWGGE ELSFK
Sbjct: 301 LYKRLPLRAQD-MVDPTEPFESPIMAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFK 359
Query: 125 FNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGT------YDSGFDIWGG 177
W V P + G ++ F GT + ++W
Sbjct: 360 V-WQC-----------GGRMVDAPCSRVGHVYRGYAPFPNPRGTNFVTRNFKRVAEVWMD 407
Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-EVSNDW------------- 216
E + + + D GD+T +K LR L CK FKW+L EV+ D
Sbjct: 408 EYKQFLYERNPQFDQTDAGDLTKQKALRERLQCKPFKWFLEEVAPDLVVRYPLRDPKPFA 467
Query: 217 SGMCIDSA----CKPTDMHK---PVGLYPCHKQ----GGNQFWMMSKHGEIRRD--EACL 263
SG +A C + HK P+G++ C NQF+ ++ + +IR + CL
Sbjct: 468 SGRVQSAANPKLCLDSMNHKAKEPIGVFSCAANRTYPQNNQFFTLTYYRDIRVSSVDKCL 527
Query: 264 DYA--GGDVILYPCHGSKGNQYFEYD 287
D + G +VIL+ CH S+GNQ ++YD
Sbjct: 528 DASSDGSEVILFNCHESQGNQLWQYD 553
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/90 (25%), Positives = 43/90 (47%), Gaps = 11/90 (12%)
Query: 205 SFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRR 258
+ +Y ++ C+D++ ++ V L+ CH+ GNQ W M +HG+ R
Sbjct: 511 TLTYYRDIRVSSVDKCLDASSDGSE----VILFNCHESQGNQLWQYDTETQMIRHGKPTR 566
Query: 259 DEACLDYAGGDVILYPCHGSKGNQYFEYDY 288
++ CLD V++ C K Q +E+ +
Sbjct: 567 NQ-CLDLVERKVVVSKCDHRKKTQRWEWGF 595
>gi|17561826|ref|NP_503512.1| Protein GLY-7 [Caenorhabditis elegans]
gi|51315810|sp|O61397.1|GALT7_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 7;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 7; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7; Short=pp-GaNTase 7
gi|3047203|gb|AAC13677.1| GLY7 [Caenorhabditis elegans]
gi|373219860|emb|CCD70652.1| Protein GLY-7 [Caenorhabditis elegans]
Length = 601
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 92/314 (29%), Positives = 141/314 (44%), Gaps = 37/314 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + RN + P+I I +++E R G + + G F+W L
Sbjct: 252 CEVNTNWLPPLLAPIKRNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHS---GIFEWGLL 308
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ I ERE K+ ++P +PT AGGLF+I++ +F++LG YD G IWGGE ELSF
Sbjct: 309 YKETQITERETAHRKHNSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSF 368
Query: 124 KFNWHA------IPERERKRHKNAAEPVWTPTMAGG-LFSIDKAFFEKLGTYDSGFDIWG 176
K W +P + P +G + SI+ + T+ + +
Sbjct: 369 KI-WQCGGGIVFVPCSHVGHVYRSHMPYSFGKFSGKPVISIN--MMRVVKTWMDDYSKYY 425
Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL---------------------EVSND 215
+ + GD++++ LR L CKSFKWY+ E N
Sbjct: 426 LTREPQATNVNPGDISAQLALRDKLQCKSFKWYMENVAYDVLKSYPMLPPNDVWGEARNP 485
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPC 275
+G C+D + + P+G CH GGNQ ++ G++ + E CL G + C
Sbjct: 486 ATGKCLD---RMGGIPGPMGATGCHGYGGNQLIRLNVQGQMAQGEWCLTANGIRIQANHC 542
Query: 276 HGSKGNQYFEYDYK 289
N ++ YD K
Sbjct: 543 VKGTVNGFWSYDRK 556
>gi|432098984|gb|ELK28470.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Myotis davidii]
Length = 501
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 99/307 (32%), Positives = 135/307 (43%), Gaps = 53/307 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + ++ VV P+I I DDTFE + GGF+W L
Sbjct: 212 CECTVGWLEPLLARIKQDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGE- 178
F+ W E H TP T GG I +L ++W E
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377
Query: 179 -NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWS----------GMCIDSACKP 227
N D G + L C S + V +S +C+D +
Sbjct: 378 KNFFYIISPDIG------RIEHWLYCDSLHGGMLVFQVFSYTANKEIRTDDLCLDVS--- 428
Query: 228 TDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEACLDYAGGDVILYP----CHG 277
++ PV + CH GNQ W + +H CLD A + P C G
Sbjct: 429 -KLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVN---SNQCLDKATEEDSQVPSIRDCSG 484
Query: 278 SKGNQYF 284
+ Q+
Sbjct: 485 GRSQQWL 491
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 72/215 (33%), Positives = 108/215 (50%), Gaps = 45/215 (20%)
Query: 107 YDSGFDI-WGGENLELSFKFNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEK 164
Y +G D+ +GG N +L+F+ W+ +P+RE R K + PV TPTMAGGLFSID+ +F++
Sbjct: 248 YMAGSDMTYGGFNWKLNFR--WYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQE 305
Query: 165 LGTYDSGFDIWGGENLELS-----------------------------FKGDFGDVTSRK 195
+GTYD+G DIWGGENLE+S F G G + ++
Sbjct: 306 IGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKN 365
Query: 196 ELR-RNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHG 254
R + FK + + + G I+ +H + ++ Q + + +
Sbjct: 366 NRRLAEVWMDEFKNFFYIISPDIGR-IEHWLYCDSLHGGMLVF--------QVFSYTANK 416
Query: 255 EIRRDEACLDYA--GGDVILYPCHGSKGNQYFEYD 287
EIR D+ CLD + G V + CH KGNQ +EYD
Sbjct: 417 EIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYD 451
>gi|313234048|emb|CBY19624.1| unnamed protein product [Oikopleura dioica]
Length = 827
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 93/297 (31%), Positives = 134/297 (45%), Gaps = 67/297 (22%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE W +PLL+ +A + + V++P+I I TF G T + F G F WNL
Sbjct: 541 CECFPGWAEPLLERIAEDPTRVMTPVIEVIDAGTFRT----GE-TKTANIFKGVFGWNLV 595
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
FNW A + A P+ +PTMAGGLF++DKA+F LGTYD IWGGENLE+S
Sbjct: 596 FNWIEAYGPKNPYTSAYEARPIRSPTMAGGLFTMDKAYFNWLGTYDEEMKIWGGENLEMS 655
Query: 123 FK-------FNW-------HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY 168
F+ FN+ H E+ H EP+ ++
Sbjct: 656 FRVTNCSEFFNYHLCRFGCHVFREKSPYSHPGGEEPIMRNSIRVA--------------- 700
Query: 169 DSGFDIWGGENLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYLE--------V 212
D+W E E+ F+ D G+++SR +LR NL C+ F WY+E
Sbjct: 701 ----DVWLDEFKEVYFRRGAPILKNIDPGNMSSRIQLRENLQCQPFSWYMENVLPELDYT 756
Query: 213 SND---WSGMCIDSACKPTDMH---------KPVGLYPCHKQGGNQFWMMSKHGEIR 257
ND ++G I A T+ + ++PCH GNQ++ + +IR
Sbjct: 757 MNDDLIFAGEIISQARNRTNRQCFDSTGKDNAQIQIFPCHGLLGNQYYEYTNIKDIR 813
>gi|427778457|gb|JAA54680.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 568
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 108/221 (48%), Gaps = 36/221 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL+P+L + N + V P+I I DTFE P GGF+W L
Sbjct: 205 CEVNVGWLEPMLARIGANRTTVTCPVIDIINADTFEYSASP--------IVRGGFNWGLH 256
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + P + + A +P+ +PTMAGGLF++D+ +F +LG YD G DIWGGENLE+SF
Sbjct: 257 FKWESPPRL--RGPQQAIDPIPSPTMAGGLFAMDRQYFHELGEYDDGMDIWGGENLEISF 314
Query: 124 KF-----NWHAIPERER----KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDI 174
+ +P +R + P T+ + + ++ TY
Sbjct: 315 RIWMCGGRLEILPCSRVGHVFRRRRPYGSPSGEDTLTKNSLRVAHVWMDEYKTY------ 368
Query: 175 WGGENLELSFKGD-----FGDVTSRKELRRNLGCKSFKWYL 210
L + D +GDV++RKELR+ L C SF WY+
Sbjct: 369 ------YLQTRRDARNQWYGDVSARKELRKRLKCHSFDWYM 403
>gi|158286608|ref|XP_308833.4| AGAP006925-PA [Anopheles gambiae str. PEST]
gi|157020549|gb|EAA04096.4| AGAP006925-PA [Anopheles gambiae str. PEST]
Length = 622
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 90/320 (28%), Positives = 141/320 (44%), Gaps = 66/320 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYK-FFIGGFDWNL 62
CE +WL+PLL+ + + + V+ P+I I F T+ Y F IGGF W+
Sbjct: 264 CECMVQWLEPLLERIKESPTSVLVPIIDVIEAKNFYYS------TNDYNDFQIGGFTWDG 317
Query: 63 QFNWHAIPERERKRHKNAAE-------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 115
F+WH + +RER+R K P ++PTMAGGLF+I + +F +G+YD D WG
Sbjct: 318 HFDWHDVTKRERERQKRECAEKDLEICPTYSPTMAGGLFAIARDYFWDIGSYDEQMDGWG 377
Query: 116 GENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPT--MAGGLFSIDKAFFEKLGTY 168
GENLE+SF+ IP P P G+ ++ A
Sbjct: 378 GENLEMSFRVWQCGGTLETIPCSRIGHIFRDFHPYSFPNDRDTHGINTVRMAI------- 430
Query: 169 DSGFDIWGGENLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYLE--------- 211
+W + +EL + + GDVT RK LR L CKSF WY++
Sbjct: 431 -----VWMDDYVELLYLNRPDLKDHPELGDVTHRKVLREKLHCKSFDWYMKNVYPEKFIP 485
Query: 212 ---------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQ--GGNQFWMMSKHGEIRRDE 260
+++ +C+D+ + D +G+Y C K +Q + ++K +R +
Sbjct: 486 TRNVRAFGRLASQADNLCLDTLQQNADKPWNLGIYTCFKPEVSASQLFSLTKRNVLRNER 545
Query: 261 ACLDYAGGD-----VILYPC 275
+C V++ PC
Sbjct: 546 SCATVQASKSESKFVVMIPC 565
>gi|427794265|gb|JAA62584.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Rhipicephalus pulchellus]
Length = 591
Score = 124 bits (310), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 108/221 (48%), Gaps = 36/221 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL+P+L + N + V P+I I DTFE P GGF+W L
Sbjct: 223 CEVNVGWLEPMLARIGANRTTVTCPVIDIINADTFEYSASP--------IVRGGFNWGLH 274
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + P + + A +P+ +PTMAGGLF++D+ +F +LG YD G DIWGGENLE+SF
Sbjct: 275 FKWESPPRL--RGPQQAIDPIPSPTMAGGLFAMDRQYFHELGEYDDGMDIWGGENLEISF 332
Query: 124 KF-----NWHAIPERER----KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDI 174
+ +P +R + P T+ + + ++ TY
Sbjct: 333 RIWMCGGRLEILPCSRVGHVFRRRRPYGSPSGEDTLTKNSLRVAHVWMDEYKTY------ 386
Query: 175 WGGENLELSFKGD-----FGDVTSRKELRRNLGCKSFKWYL 210
L + D +GDV++RKELR+ L C SF WY+
Sbjct: 387 ------YLQTRRDARNQWYGDVSARKELRKRLKCHSFDWYM 421
>gi|332021082|gb|EGI61469.1| Polypeptide N-acetylgalactosaminyltransferase 35A [Acromyrmex
echinatior]
Length = 580
Score = 124 bits (310), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 90/304 (29%), Positives = 141/304 (46%), Gaps = 42/304 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV + W++PLL +A + + + P+I I DTF+ P GGF+W L
Sbjct: 208 IEVNEIWIEPLLSRIAYSRNIIPMPVIDIINADTFQYTGSP--------LVRGGFNWGLH 259
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P + +P+ +PTMAGGLF+ID+ +F K+G YD G DIWGGENLE+SF
Sbjct: 260 FKWDNLPIGTLNHDVDFVKPIKSPTMAGGLFAIDREYFTKMGEYDIGMDIWGGENLEISF 319
Query: 124 KF-----NWHAIPER------ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
+ + IP R+R + +P TM + + ++ Y
Sbjct: 320 RIWMCGGSIELIPCSRVGHVFRRRRPYGSDDP--QDTMLKNSLRVAHVWMDEYKDY---- 373
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDM-- 230
L+ + D+GD++ R LR+ L CK+F WYL+V + D+ + D
Sbjct: 374 ------FLKNAKTIDYGDISERLALRQKLKCKTFGWYLKVVYPELTLPDDTERRLKDKWA 427
Query: 231 ---HKPVGLYPCHKQGGN---QFWM-MSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQY 283
+P + P H + N Q+ + +S + E + G +IL PC K +
Sbjct: 428 KLDQRP--MQPWHSRKRNYTDQYQIRLSNTALCIQSEKDIKTKGSKLILMPCLRVKSQMW 485
Query: 284 FEYD 287
+E D
Sbjct: 486 YETD 489
>gi|225007540|ref|NP_001070030.2| polypeptide N-acetylgalactosaminyltransferase 11 [Danio rerio]
Length = 590
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 97/305 (31%), Positives = 138/305 (45%), Gaps = 47/305 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + N VV P+I I DT L + P + GGF+W L
Sbjct: 234 CEVNEAWLQPLLTPIKENRKTVVCPVIDIISADT--LVYTPSPIVR------GGFNWGLH 285
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E A + +PTMAGGLF++D+ +F +LG YD G DIWGGENLE+SF
Sbjct: 286 FKWDPVPMSELNSPDGA---IRSPTMAGGLFAMDRNYFYELGQYDRGMDIWGGENLEISF 342
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ +P RKR + P TMA + + D +
Sbjct: 343 RIWMCGGQLLIVPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWM------DDYKE 395
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKP 233
+ EL + D+GD++ R +R+ L C SFKWYL+ N + M + S KP
Sbjct: 396 QYFALRPELRNR-DYGDISERVSIRKRLQCHSFKWYLD--NIYPEMQVSSPHKPQQ---- 448
Query: 234 VGLYPCHKQGGNQFWMMSKHGEIRR--DEACL------DYAGGDVILYPCHGSKGNQYFE 285
P G + + + G +R + CL GG V++ C Q +
Sbjct: 449 ----PVFINKGLKRPKVLQRGRLRNLLADKCLVAQGRPSQKGGAVVVKDCDPQDPEQEWA 504
Query: 286 YDYKY 290
YD ++
Sbjct: 505 YDEEH 509
>gi|391332245|ref|XP_003740546.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
N-acetylgalactosaminyltransferase 10-like [Metaseiulus
occidentalis]
Length = 590
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 100/329 (30%), Positives = 142/329 (43%), Gaps = 70/329 (21%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
E WL PLLD +ARN VV P I I +TF R S + G FDW L +
Sbjct: 235 EANVNWLPPLLDPIARNRRTVVCPFIDVIHYETFAYR-------SQDEGARGAFDWELYY 287
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
+ + KR EP +P MAGGLF+ID+++F +LG YD G D+WGGE ELSFK
Sbjct: 288 KRLPLLSEDLKR---PTEPFRSPVMAGGLFAIDRSYFWELGGYDEGLDVWGGEQYELSFK 344
Query: 125 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-------TYDSGFDIWGG 177
W + P G A F G Y ++W
Sbjct: 345 I-WQC-----------GGQMFDAPCSRVGHIYRKFAPFPNPGIGDFVGRNYRRVAEVWMD 392
Query: 178 ENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE------------------- 211
E E + +GDV+ +K LR+ L CK FKW++E
Sbjct: 393 EYKEFLYNRRPHYRTLGYGDVSKQKALRKKLKCKPFKWFMETVAFDQPLRYPPVEPPDFA 452
Query: 212 ---VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIR--RDEAC 262
+ N + C+D+ K + K L C G+ Q ++++ H ++R + C
Sbjct: 453 WGAIRNVGADKCLDTKFK--EQGKRFSLETCISSNGDVSGEQNFVLTWHKDLRPAKRNVC 510
Query: 263 LDYAGGD----VILYPCHGSKGNQYFEYD 287
D + G+ V+L+ CHG GNQ F+Y+
Sbjct: 511 FDVSSGEKKAPVVLWTCHGMHGNQLFKYN 539
>gi|115313271|gb|AAI24298.1| Zgc:153274 [Danio rerio]
Length = 590
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 97/305 (31%), Positives = 138/305 (45%), Gaps = 47/305 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + N VV P+I I DT L + P + GGF+W L
Sbjct: 234 CEVNEAWLQPLLTPIKENRKTVVCPVIDIISADT--LVYTPSPIVR------GGFNWGLH 285
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E A + +PTMAGGLF++D+ +F +LG YD G DIWGGENLE+SF
Sbjct: 286 FKWDPVPMSELNSPDGA---IRSPTMAGGLFAMDRNYFYELGQYDRGMDIWGGENLEISF 342
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ +P RKR + P TMA + + D +
Sbjct: 343 RIWMCGGQLLIVPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWM------DDYKE 395
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKP 233
+ EL + D+GD++ R +R+ L C SFKWYL+ N + M + S KP
Sbjct: 396 QYFALRPELRNR-DYGDISERVSIRKRLQCHSFKWYLD--NIYPEMQVSSPHKPQQ---- 448
Query: 234 VGLYPCHKQGGNQFWMMSKHGEIRR--DEACL------DYAGGDVILYPCHGSKGNQYFE 285
P G + + + G +R + CL GG V++ C Q +
Sbjct: 449 ----PVFINKGLKRPKVLQRGRLRNLLADKCLVAQGRPSQKGGAVVVKDCDPQDPEQEWA 504
Query: 286 YDYKY 290
YD ++
Sbjct: 505 YDEEH 509
>gi|410914862|ref|XP_003970906.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Takifugu rubripes]
Length = 600
Score = 123 bits (309), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 100/331 (30%), Positives = 142/331 (42%), Gaps = 69/331 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +A+N +V P+I I D F G T + G FDW +
Sbjct: 235 CEANVNWLPPLLDRIAQNRKSIVCPMIDVIDHDNF------GYDTQAGDAMRGAFDWEMY 288
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP ++ + ++P +P MAGGLF++D+ +F +LG YD+G +IWGGE E+SF
Sbjct: 289 YKRIPIPAEMQR--DDPSQPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISF 346
Query: 124 KFNWHAIPERERKRHKNAAEPV--WTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
K W E + P G S+ K ++W E E
Sbjct: 347 KV-WMCGGRMEDIPCSRVGHIYRKYVPYKVPGGISLAKNL-------KRVAEVWMDEYAE 398
Query: 182 LSFKG-------DFGDVTSRKELRRNLGCKSFKWYL----------------------EV 212
++ GD+T +KELR LGCK+FKW++ E+
Sbjct: 399 YVYQRRPEYRHLSAGDMTPQKELRSRLGCKNFKWFMSNVAWDLPKHYPPVEPPAAAWGEI 458
Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI-----RRD-------- 259
N SG+C++ K P+ L C K G W HG++ R D
Sbjct: 459 QNVGSGLCME--IKHFVSGSPIRLENCVKSRGEVGW---SHGQVLTFGWREDIRVGDPMH 513
Query: 260 --EACLDYAGGD--VILYPCHGSKGNQYFEY 286
+ C D + V LY CHG KGNQ + Y
Sbjct: 514 TRKVCFDAVSHNSPVTLYDCHGMKGNQLWRY 544
>gi|332030446|gb|EGI70134.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Acromyrmex
echinatior]
Length = 595
Score = 123 bits (309), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 94/297 (31%), Positives = 139/297 (46%), Gaps = 40/297 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL+ + +N++ +V+P+I I D+TF T S++ G F+W+L
Sbjct: 260 CECTIGWLEPLLEAVGKNATRIVAPVIDIINDNTFSY-------TRSFELHWGAFNWDLH 312
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + R K R N EP TP MAGGLFS+++ +F KLG+YD IWGGENLELS
Sbjct: 313 FRWLTLNGRLLKERRDNIVEPFRTPAMAGGLFSMNRDYFFKLGSYDDQMRIWGGENLELS 372
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLF--SIDKAFFEKLGTYDSGFDIW 175
F+ + P + P P G + ++ + + + + +
Sbjct: 373 FRAWQCGGSIEIAPCSHVGHLFRKSSPYTFPGGVGDILYGNLARVALVWMDQWAEFYFKF 432
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSNDWSG 218
E L +K V SR LR L CKSF+WYLE V + +
Sbjct: 433 NPEAARLRYK---QQVRSRLALREKLQCKSFEWYLENVWPEHFFPTDDRFFGRVVHAGTK 489
Query: 219 MCIDSACKPTDMHKPVG---LYPC-HKQGGNQFWMMSKHGEIRRDEA-CLDYAGGDV 270
CI +P G L+ C + +Q ++M+K+G I DE+ CLD D+
Sbjct: 490 KCIMRPAAKGSYGQPSGNAVLHSCIPRPMLSQMFVMTKNGVIMTDESVCLDAPERDM 546
>gi|170051778|ref|XP_001861920.1| polypeptide N-acetylgalactosaminyltransferase 12 [Culex
quinquefasciatus]
gi|167872876|gb|EDS36259.1| polypeptide N-acetylgalactosaminyltransferase 12 [Culex
quinquefasciatus]
Length = 601
Score = 123 bits (309), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 88/311 (28%), Positives = 139/311 (44%), Gaps = 48/311 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE +WL+PLL+ + + + V+ P+I D E + F IGGF W+
Sbjct: 244 CECMPQWLEPLLERIRESRTSVLVPII-----DVIEAKNFFYSTNGFTDFQIGGFTWDGH 298
Query: 64 FNWHAIPERERKRHKN-------AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
F+WH + +RE++R K A P ++PTMAGGLF+I + +F ++G+YD D WGG
Sbjct: 299 FDWHDVTQREKERQKRECSEKDVAICPTYSPTMAGGLFAISRDYFWEIGSYDEQMDGWGG 358
Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPT--MAGGLFSIDKAFFEKLGTYD 169
ENLE+SF+ IP P P G+ ++ A D
Sbjct: 359 ENLEMSFRVWQCGGTLETIPCSRIGHIFRDFHPYSFPNDRDTHGINTVRMATV----WMD 414
Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
D+ +L + GDVT R+ LR L CKSF WY++
Sbjct: 415 DYIDLLYLNRPDLRDHPEVGDVTHRRVLREKLRCKSFDWYMKNVYPEKFIPTRNVRAFGR 474
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQ--GGNQFWMMSKHGEIRRDEACLDYAGGD 269
V++ +C+D+ + D +G+Y C + +Q ++K G +R + +C
Sbjct: 475 VTSLAENLCLDTLQQNADKPWNLGIYTCFRTEVSASQLMSLTKRGVLRTERSCATVQDNK 534
Query: 270 -----VILYPC 275
V++ PC
Sbjct: 535 ADTRYVVMIPC 545
>gi|195384663|ref|XP_002051034.1| GJ22477 [Drosophila virilis]
gi|194145831|gb|EDW62227.1| GJ22477 [Drosophila virilis]
Length = 598
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 90/308 (29%), Positives = 136/308 (44%), Gaps = 44/308 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
CE W +PLL + + + V+ P+I I + F+ T+ YK F +GGF WN
Sbjct: 243 CEANVGWCEPLLQRIKDSRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 296
Query: 63 QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
F+W + ERE+ R P ++PTMAGGLF++D+ +F ++G+YD D WGG
Sbjct: 297 HFDWVNLSEREKLRQSRECSQPREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGG 356
Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
ENLE+SF+ IP P P I+ A L D
Sbjct: 357 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 414
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
+++ +L F D GDVT R LR+ L CKSF WYL+ +
Sbjct: 415 INVFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFDWYLKNVYPEKFVPNKNVKAWGRIK 474
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
+ +C D + +GLYPC K+ +Q + +K +R + +C
Sbjct: 475 AVHANLCADDLLSNNEKPYNLGLYPCGKELQKSQLFSYTKSQVLRNEISCATVQHSSSPP 534
Query: 270 --VILYPC 275
+++ PC
Sbjct: 535 YRIVMVPC 542
>gi|355689613|gb|AER98891.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Mustela putorius
furo]
Length = 302
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 82/249 (32%), Positives = 121/249 (48%), Gaps = 36/249 (14%)
Query: 58 FDWNLQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 117
FDW L F W ++P+ E++R K+ P+ TPT AGGLFSI K +FE +GTYD +IWGGE
Sbjct: 1 FDWILSFGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGE 60
Query: 118 NLELSFKFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGT 167
N+E+SF+ W + E R K+ P T +A + + + ++
Sbjct: 61 NIEMSFRV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE--- 116
Query: 168 YDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------- 211
Y F + ++ + FGD++ R E++ L CK+F WYL
Sbjct: 117 YKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNTIYPEAYVPDLNPVIS 176
Query: 212 --VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYA 266
+ + +C+D + KP+ LY CH GGNQ++ S EIR + E CL A
Sbjct: 177 GYIKSVGQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQHEIRHNIQRELCLHAA 235
Query: 267 GGDVILYPC 275
G V L C
Sbjct: 236 QGLVQLRAC 244
>gi|410914790|ref|XP_003970870.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Takifugu rubripes]
Length = 552
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 100/331 (30%), Positives = 143/331 (43%), Gaps = 69/331 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +A+N +V P+I I D F G T + G FDW +
Sbjct: 187 CEANVNWLPPLLDRIAQNRKTIVCPMIDVIDHDNF------GYETQAGDAMRGAFDWEMY 240
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K ++ +EP +P MAGGLF++D+ +F +LG YD+G +IWGGE E+SF
Sbjct: 241 YKRIPIPLELQK--EDPSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISF 298
Query: 124 KFNWHAIPERERKRHKNAAEPV--WTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
K W E + P G S+ + ++W E E
Sbjct: 299 KV-WMCGGRMEDTPCSRVGHIYRKYVPYKVPGGVSLARNL-------KRVAEVWMDEYAE 350
Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYL----------------------EV 212
++ GD+ +K+LR L CKSFKW++ E+
Sbjct: 351 YIYQRRPEYRHLAAGDMAVQKDLRSQLNCKSFKWFMTKVAWDLPKHYPPVEPPAAAWGEI 410
Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI-----RRD-------- 259
N SGMC+++ K PV + C K G W HG++ R D
Sbjct: 411 RNVASGMCLET--KHFASGSPVRMESCLKGRGEGGW---SHGQVFTFGWREDIRVGDPMH 465
Query: 260 --EACLDYAGGD--VILYPCHGSKGNQYFEY 286
+ C D + V LY CHG KGNQ++ Y
Sbjct: 466 TKKVCFDAVSNNSPVTLYDCHGMKGNQFWHY 496
>gi|348510947|ref|XP_003443006.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Oreochromis niloticus]
Length = 567
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 99/328 (30%), Positives = 141/328 (42%), Gaps = 85/328 (25%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP++ + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 229 CEVNTDWLQPMIQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 281
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + + + TP +AGG+F +D+++F LG YD+ DIWGGEN ELSF
Sbjct: 282 FKWEQIPIEQKMARSDPTQAIRTPVIAGGIFVMDRSWFNHLGQYDTHMDIWGGENFELSF 341
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--------- 169
+ VW + GG I F K YD
Sbjct: 342 R--------------------VW---LCGGSLEILPCSRVGHVFRKRHPYDFPEGNALTY 378
Query: 170 -----SGFDIWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE------ 211
++W E + + FG VT R LRR L CK F+WY+E
Sbjct: 379 IKNTRRAAEVWMDEYKQYYYSARPSAQGKAFGSVTDRLALRRKLNCKPFRWYMENVYPEL 438
Query: 212 -------VSN--DWSGMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRR 258
VS+ G+C+++ + TD +GL C G N Q W + + IR+
Sbjct: 439 RVPEQEAVSSVLKQGGLCLET--RGTD---GLGLAECRGLGANRPQSQRWELIE-PLIRQ 492
Query: 259 DEACLDY----AGGDVILYPCHGSKGNQ 282
+ CL AG V + PC+ + Q
Sbjct: 493 QDLCLAISAFTAGSKVKMEPCNTKEPRQ 520
>gi|292623437|ref|XP_001339749.3| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Danio rerio]
Length = 567
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 95/322 (29%), Positives = 133/322 (41%), Gaps = 74/322 (22%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP++ + + S VVSP+I I D F L +S GGFDW+L
Sbjct: 230 CEVNTDWLQPMIQRVKEDHSRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 282
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + +P+ TP +AGG+F I+K +F LG YD+ DIWGGEN ELSF
Sbjct: 283 FKWEQIPIEQKMARNDPTQPIRTPVIAGGIFVIEKGWFNHLGQYDTHMDIWGGENFELSF 342
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--------- 169
+ VW M GG I F K YD
Sbjct: 343 R--------------------VW---MCGGSLEILPCSRVGHVFRKRHPYDFPEGNALTY 379
Query: 170 -----SGFDIWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYL------- 210
++W + + + FG + R L+R L C SF+WYL
Sbjct: 380 IKNTRRAAEVWMDDYKQYYYAARPSAQGKAFGSIADRLALKRKLNCNSFRWYLENVYPEL 439
Query: 211 ---EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQ---GGNQFWMMSKHGEIRRDEACLD 264
E +S + C + +GL C +Q W + + +IR+ + CL
Sbjct: 440 KIPEQEEAYSLLKQGGLCLESHGTDSLGLAECRSTPSIPASQKWTLIE-PQIRQHDLCLA 498
Query: 265 Y----AGGDVILYPCHGSKGNQ 282
AG V L PC+ + Q
Sbjct: 499 ITAFTAGSKVRLEPCNIKESRQ 520
>gi|402594510|gb|EJW88436.1| hypothetical protein WUBG_00649 [Wuchereria bancrofti]
Length = 612
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 84/237 (35%), Positives = 117/237 (49%), Gaps = 49/237 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE K W++PLL + N VV P+I I + TF + + F GGF+WNLQ
Sbjct: 201 CECTKGWMEPLLARIKENRKAVVCPVIDIINERTFAYQ-------KGIELFRGGFNWNLQ 253
Query: 64 FNWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+A+P E + R + +P+ +PTMAGGLFSID+ +FE++GTYD DIWGGEN+E+S
Sbjct: 254 FRWYALPPEMIKSRSDDPTKPIISPTMAGGLFSIDRKYFEEIGTYDHEMDIWGGENIEIS 313
Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD-----SGF 172
+ K K VW GG I F + +D SG
Sbjct: 314 LRL----------KLLKKNCFLVW---QCGGRVEILPCSHVGHVFRRTSPHDFPGRKSGT 360
Query: 173 ----------DIWGGE-------NLELSFK-GDFGDVTSRKELRRNLGCKSFKWYLE 211
++W E ++K + DV+ R ELR+ L CKSFKW+L+
Sbjct: 361 ILNSNLLRVAEVWMDEWKFHFYRTAPQAYKMRETVDVSDRVELRKRLHCKSFKWFLD 417
>gi|241622516|ref|XP_002407424.1| pp-GalNAc-transferase, putative [Ixodes scapularis]
gi|215500988|gb|EEC10482.1| pp-GalNAc-transferase, putative [Ixodes scapularis]
Length = 471
Score = 123 bits (308), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 103/330 (31%), Positives = 145/330 (43%), Gaps = 72/330 (21%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
E WL PLL+ +A++ VV P I I +TF R + + G FDW L +
Sbjct: 116 EANTNWLPPLLEPIAKDYRTVVCPFIDVIDYETFAYR-------AQDEGARGSFDWELYY 168
Query: 65 N-WHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+P+ K EP +P MAGGLF+I + +F +LG YD G D+WGGE ELSF
Sbjct: 169 KRLPLLPDDLAK----PTEPFKSPVMAGGLFAISRKYFWELGGYDEGLDVWGGEQYELSF 224
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-------TYDSGFDIWG 176
K W V P G A F G Y ++W
Sbjct: 225 KI-WQC-----------GGTMVDAPCSRVGHIYRKFAPFPNPGIGDFVGRNYRRVAEVWM 272
Query: 177 GENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
E E + D GD+T++K LR+ L CKSFKW++E
Sbjct: 273 DEYKEHLYHRRPHYRHLDPGDLTAQKALRKRLNCKSFKWFMEQVAFDQPSKYPALLTPVA 332
Query: 212 ----VSNDWSGMCIDSACKPTDMHKPVGLYPCHK----QGGNQFWMMSKHGEIR--RDEA 261
V N+ SG+CID+ K ++ L PC K + G Q +++ H ++R +
Sbjct: 333 HWPQVRNEESGLCIDTQFK--GQNERFSLAPCLKDQRGRSGEQQLVLTWHKDVRPAKRSV 390
Query: 262 CLDYAGGD----VILYPCHGSKGNQYFEYD 287
C D + D V+L+ CHG GNQ ++YD
Sbjct: 391 CFDVSSSDVHAPVMLWSCHGMHGNQLWKYD 420
>gi|321473823|gb|EFX84789.1| hypothetical protein DAPPUDRAFT_209135 [Daphnia pulex]
Length = 521
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/323 (29%), Positives = 145/323 (44%), Gaps = 55/323 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + + + + PLI I + FE R + F G F+W +
Sbjct: 169 CEVGLNWLPPLLYPIYLDRTTMTVPLIDGIDHENFEYR----PVYQGETNFRGVFEWGML 224
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +PERE + +EP PT AGGLF+I++A+F ++G YD G +WGGEN ELSF
Sbjct: 225 YKENEVPEREAQSRTYNSEPYKAPTHAGGLFAINRAYFLEIGAYDPGLLVWGGENFELSF 284
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-------LFSIDKAFFEKLGT-----YDSG 171
K W + +W P G ++ K K G+ Y
Sbjct: 285 KI-WQC-----------GGKILWVPCSRVGHVYRGFMPYTFGKLAANKKGSLITINYKRV 332
Query: 172 FDIWGGENLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYL-EVSND------- 215
++W + + F D G++T + E+++ L CKSF W++ EV+ D
Sbjct: 333 IEVWFDDKYKEFFYTREPTARFLDMGNITQQLEMKKRLNCKSFAWFMEEVAYDVLDKYPE 392
Query: 216 ------WSGMCIDSA--CKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLD 264
W + +A C T H+P +G+ CH G NQ + ++K G++ E C++
Sbjct: 393 LPANLHWGELRNTAARQCLDTMGHQPPSLMGISHCHGFGNNQLFRLNKAGQLGVGERCVN 452
Query: 265 YAGGDVILYPCHGSKGNQYFEYD 287
V L C +EYD
Sbjct: 453 ADSQGVKLVVCRLGSVEGPWEYD 475
>gi|268370157|ref|NP_001161259.1| polypeptide GalNAc transferase 6-like [Nasonia vitripennis]
Length = 615
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 108/330 (32%), Positives = 149/330 (45%), Gaps = 71/330 (21%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
E WL PLL+ +A++ V P I I +TFE R + + G FDW L +
Sbjct: 244 EANVNWLPPLLEPIAKDYKTCVCPFIDVIAYETFEYR-------AQDEGARGAFDWELYY 296
Query: 65 N-WHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+PE KN +EP +P MAGGLF+I FF +LG YD G DIWGGE ELSF
Sbjct: 297 KRLPLLPED----LKNPSEPFKSPVMAGGLFAISAKFFWELGGYDPGLDIWGGEQYELSF 352
Query: 124 KFNWHAIPER-----ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-TYDSGFDIWGG 177
K W + R H P + P G F LG Y ++W
Sbjct: 353 KI-WQCGGQMYDAPCSRVGHIYRKFPPF-PNPGRGDF---------LGKNYKRVAEVWMD 401
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-------------VSNDWS 217
E + ++ D GD+T +K LR L CKSFKW++E +D++
Sbjct: 402 EYADFIYRRRPHLRAMDPGDLTEQKALRDKLKCKSFKWFMENIAFDLVEVYPPIEPDDFA 461
Query: 218 ----------GMCIDSACKPTDMHKPVGLYPCHKQG----GNQFWMMSKHGEIR--RDEA 261
+C+D+ K D + + + C K G Q + ++ H +IR R
Sbjct: 462 YGEMRNIGVPNLCLDAKGKGKD--EEIAVDYCQKDTPKIKGEQEFQLTWHKDIRPNRRTE 519
Query: 262 CLDYAGGD----VILYPCHGSKGNQYFEYD 287
CLD + GD V LYPCHG +GNQ + Y+
Sbjct: 520 CLDVSRGDDKSPVTLYPCHGKQGNQLWRYN 549
>gi|221041542|dbj|BAH12448.1| unnamed protein product [Homo sapiens]
Length = 360
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 88/272 (32%), Positives = 122/272 (44%), Gaps = 48/272 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL+ +A + + VVSP+I I D F+ L GGF NL
Sbjct: 108 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFG-NLV 159
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + PE+ R R N P+ TP +AGGLF +DK +FE+LG YD D+WGGENLE+S
Sbjct: 160 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 219
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F+ + ++W
Sbjct: 220 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 270
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
E + +G++ SR ELR+ L CK FKWYLE +
Sbjct: 271 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 330
Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQ 246
C+D+ D VG+Y CH G Q
Sbjct: 331 QQGTNCLDTLGHFAD--GVVGVYECHVAGLRQ 360
>gi|348533011|ref|XP_003453999.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Oreochromis niloticus]
Length = 587
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 103/330 (31%), Positives = 144/330 (43%), Gaps = 67/330 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +A+N +V P+I I D F G T + G FDW +
Sbjct: 222 CEANVNWLPPLLDRIAQNRKTIVCPMIDVIDHDNF------GYETQAGDAMRGAFDWEMY 275
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + +EP +P MAGGLF++D+ +F +LG YD+G +IWGGE E+SF
Sbjct: 276 YKRIPIPTELQK--DDPSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISF 333
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P P ++A L + + + ++ Y
Sbjct: 334 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPGGVSLARNLKRVAEVWMDEYAEY---IYQR 390
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GD+T +KELR L CK+FKW++ E+
Sbjct: 391 RPEYRHLS----AGDMTVQKELRNRLNCKNFKWFMSEVAWDLPKHYPPVEPPAAAWGEIR 446
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI-----RRD--------- 259
N S MC++S K P+ L C K G+ W HG++ R D
Sbjct: 447 NVGSSMCMES--KHFVSGSPIRLENCVKGRGDVSW---SHGQVFTFGWREDIRVGDPMHT 501
Query: 260 -EACLDYAGGD--VILYPCHGSKGNQYFEY 286
+ C D + V LY CHG KGNQ + Y
Sbjct: 502 KKVCFDAISHNSPVTLYDCHGMKGNQLWRY 531
>gi|341881851|gb|EGT37786.1| hypothetical protein CAEBREN_30257 [Caenorhabditis brenneri]
gi|341887866|gb|EGT43801.1| CBN-GLY-7 protein [Caenorhabditis brenneri]
Length = 601
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 91/312 (29%), Positives = 141/312 (45%), Gaps = 37/312 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + RN + P+I I +++E R G + + G F+W L
Sbjct: 252 CEVNTNWLPPLLAPIKRNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHS---GIFEWGLL 308
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ I ERE K++++P +PT AGGLF+I++ +F++LG YD G IWGGE ELSF
Sbjct: 309 YKETQITERETAHRKHSSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSF 368
Query: 124 KFNWHA------IPERERKRHKNAAEPVWTPTMAGG-LFSIDKAFFEKLGTYDSGFDIWG 176
K W +P + P +G + SI+ + T+ ++ +
Sbjct: 369 KI-WQCGGGIVFVPCSHVGHVYRSHMPYGFGKFSGKPVISIN--MMRVVKTWMDDYEKYY 425
Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------VSND 215
+ + GD++++ LR L CKSFKWY+E N
Sbjct: 426 LTREPQAAHVNPGDISAQLALRDKLQCKSFKWYMENVAYDVLKSYPLLPPNDVWGGAQNP 485
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPC 275
+G C+D + + P+G CH GGNQ ++ G++ + E CL G + C
Sbjct: 486 ATGKCLD---RMGGIPGPLGASGCHGYGGNQLLRLNVQGQLAQGEWCLTANGIRIQANHC 542
Query: 276 HGSKGNQYFEYD 287
N + YD
Sbjct: 543 VKGSVNGNWVYD 554
>gi|291241093|ref|XP_002740445.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 7-like [Saccoglossus
kowalevskii]
Length = 594
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 87/320 (27%), Positives = 143/320 (44%), Gaps = 58/320 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL +A+N + V P+I I + + +R S + GGFDW+L
Sbjct: 245 CEVGINWLPPLLSPIAQNRTTVTVPIIDVIDNMDYTMRS-----QGSGELSRGGFDWSLY 299
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + + E ++ ++EP +P MAGGLF++ + +F +LG YD G ++WGGEN ELSF
Sbjct: 300 WKHLPMSKEETRKRSLSSEPYRSPAMAGGLFAMARDYFFELGAYDPGLEVWGGENFELSF 359
Query: 124 KFNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSI-DKAFFE---------KLGTYDSGF 172
K W +W P + G ++ I K + L Y
Sbjct: 360 KI-WQC-----------GGSMLWVPCSHVGHVYRILGKVPYRAPNATMTQWSLRNYRRVV 407
Query: 173 DIWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYL--------------- 210
++W + E ++ FGD++ + E + CK+F W++
Sbjct: 408 EVWMDDYKEFFYRSKPESQLLHFGDISKQLEFKTKHNCKNFDWFMKEVAPDLLAVYPVPA 467
Query: 211 ------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD 264
E+ ++ + +C+D+ +G+ CH QGGNQ + +++ E R E CL
Sbjct: 468 ANQAWGEIKSNTNKVCVDTMGNREG--GTIGISGCHGQGGNQLFRITEDHEFRIHELCLY 525
Query: 265 YAGGDVILYPCHGSKGNQYF 284
+V L C G +F
Sbjct: 526 EIYSEVKLRRCDGKSKYSWF 545
>gi|327281948|ref|XP_003225707.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Anolis carolinensis]
Length = 574
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 103/218 (47%), Gaps = 24/218 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L GGFDW+L
Sbjct: 233 CEVNSEWLQPMLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 285
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + + + TP +AGG+F IDK++F LG YD+ DIWGGEN ELSF
Sbjct: 286 FKWEQIPIEQKLSRTDPTQSIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSF 345
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RKRH P P G + K + +
Sbjct: 346 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYDFP--EGNALTYIKNTKRTAEVWMDEYK 398
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ E + FG + R + RR L CKSF+WYLE
Sbjct: 399 QYYYEARPSAIGKSFGSIADRVDQRRKLNCKSFQWYLE 436
>gi|328700065|ref|XP_003241139.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Acyrthosiphon pisum]
Length = 588
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 118/218 (54%), Gaps = 26/218 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV W+QPLL + N + +++P+I I DTF+ + P GGF+W L
Sbjct: 222 VEVNTDWIQPLLTRVRDNRTQIIAPIIDIIQPDTFDYKSSP--------LVRGGFNWGLH 273
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++P+ K+ +P+ TPT+AGGLF++D+ +F ++G YDSG +IWGGENLELSF
Sbjct: 274 FKWDSLPKGTLVTDKDFVKPIKTPTIAGGLFAVDREYFNEIGQYDSGMNIWGGENLELSF 333
Query: 124 KFNWHA-----IPERER-----KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ W I R ++H+ + P TMA + + + +
Sbjct: 334 RV-WMCGGSLYIEPCSRVGHVFRQHRPYSAPNNEDTMARNSLRLANVWMDDFKKF----- 387
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ + ++L + D+GDV+ RK LR LGC +F+WYLE
Sbjct: 388 -FISKRMDL-LRLDYGDVSERKALRTKLGCNNFEWYLE 423
>gi|344288103|ref|XP_003415790.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2 [Loxodonta africana]
Length = 640
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 90/300 (30%), Positives = 134/300 (44%), Gaps = 38/300 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A + S VVSP+I I TF+ +P S G DWNL
Sbjct: 288 CECHQGWLEPLLSRIAGDRSRVVSPVIDVIDWKTFQY-YP------SEALQRGVLDWNLD 340
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+W +PE E+K ++ P+ +P + GG+ +ID+ +F+ G YD +WGGENLELS
Sbjct: 341 FHWEPLPEHEKKALQSPISPIRSPVVPGGVVAIDRHYFQNTGAYDPLMSLWGGENLELSL 400
Query: 124 KF-----NWHAIP-ERERKRHKNA-AEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
K + +P R ++N A PV L + + L ++ F
Sbjct: 401 KTWLCGGSVEILPCSRVGHVYRNQDAHPVL--DQEATLQNKIRIAETWLASFKETFYKHS 458
Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVSNDWSG 218
E LS + + D T R +L+R LGC+ F W+L ++ N G
Sbjct: 459 PEAFSLS-QAEKPDCTERLQLQRRLGCRMFHWFLANIYPELYPSEHMPRFSGKLHNTGLG 517
Query: 219 MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR---RDEACLDYAGGDVILYPC 275
C D + + P+ L+PC+ Q + EIR C G V+L C
Sbjct: 518 FCADCQAEGDTLGCPMMLFPCNDNRKQQHLQHTSRKEIRFGSPQHLCFGVRGAQVVLQNC 577
>gi|312087698|ref|XP_003145574.1| glycosyl transferase [Loa loa]
gi|307759263|gb|EFO18497.1| glycosyl transferase [Loa loa]
Length = 520
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 86/290 (29%), Positives = 137/290 (47%), Gaps = 46/290 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL+PLL + N S V+ P+I +I +T RL + +GGF W+L
Sbjct: 169 CEVSEGWLEPLLARIKENRSVVLCPIIDHISAETLAYS-GSDRLAN-----VGGFWWSLH 222
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +PE + +P+ +PTMAGGLF++D+ +F ++G YD DIWGGENLE+SF
Sbjct: 223 FRWDPLPEE--YYGIDPTKPIRSPTMAGGLFAVDRLYFFEVGGYDPKMDIWGGENLEISF 280
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ IP A P + T G + ++L ++W +
Sbjct: 281 RVWMCGGGIEFIPCSHVGHIFRAGHP-YNMTGPGNNEDVHGTNSKRLA------EVWMDD 333
Query: 179 NLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
+ + + GD++ R+ LR+ L CKSFKWYLE +
Sbjct: 334 YKRFYYIHRSDLKEKNVGDLSERRALRKKLKCKSFKWYLENVAKNKFILDENVAAFGALR 393
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHK-QGGNQFWMMSKHGEIRRDEAC 262
N SG C+D+ + + ++PC + Q + ++ G++RR+ C
Sbjct: 394 NPSSGFCLDTLQQDEKEAVSLAVFPCQNGKSEAQIFSLTNDGKLRRELTC 443
>gi|351714167|gb|EHB17086.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Heterocephalus
glaber]
Length = 330
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 61/123 (49%), Positives = 78/123 (63%), Gaps = 8/123 (6%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 152 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 204
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ +P+RE R K + PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 205 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEIS 264
Query: 123 FKF 125
F+
Sbjct: 265 FRI 267
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 46/81 (56%), Positives = 63/81 (77%), Gaps = 4/81 (4%)
Query: 107 YDSGFDI-WGGENLELSFKFNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEK 164
Y +G D+ +GG N +L+F+ W+ +P+RE R K + PV TPTMAGGLFSID+ +FE+
Sbjct: 188 YMAGSDMTYGGFNWKLNFR--WYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEE 245
Query: 165 LGTYDSGFDIWGGENLELSFK 185
+GTYD+G DIWGGENLE+SF+
Sbjct: 246 IGTYDAGMDIWGGENLEISFR 266
>gi|125980684|ref|XP_001354365.1| GA19561 [Drosophila pseudoobscura pseudoobscura]
gi|54642673|gb|EAL31418.1| GA19561 [Drosophila pseudoobscura pseudoobscura]
Length = 591
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 92/304 (30%), Positives = 142/304 (46%), Gaps = 41/304 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + R+ + + P+I I FE R P T ++ F G F+W +
Sbjct: 238 CEVNLNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYR--PVYGTDNH--FRGIFEWGML 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +P RE++R + +EP +PT AGGLF+I++ +F +LG YD G +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353
Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K W E R H + P G L S K + Y + W +
Sbjct: 354 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 408
Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGM------ 219
+ L+ D GD+T + L++ LGCKSF+W++ +V + + G+
Sbjct: 409 THKEYFYTREPLARYLDMGDITEQLALKKRLGCKSFQWFMDHIAYDVYDKFPGLPANLHW 468
Query: 220 -----CIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVI 271
C + H+P +GL CH G NQ ++ G++ E C++ +
Sbjct: 469 GELRSVASDGCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGIK 528
Query: 272 LYPC 275
L C
Sbjct: 529 LAVC 532
>gi|157107408|ref|XP_001649763.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108884049|gb|EAT48274.1| AAEL000646-PA [Aedes aegypti]
Length = 582
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 111/332 (33%), Positives = 150/332 (45%), Gaps = 73/332 (21%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELR-FPPGRLTSSYKFFIGGFDWNLQ 63
EV WL PL++ +A N V P I I DTFE + GR G FDW +
Sbjct: 219 EVNVNWLPPLIEPIAENYRTCVCPYIDGIAHDTFEYKPQSEGRR--------GAFDW--K 268
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F + +P R + + + EP +P MAGGLF+I FF +LG YD DIWGGE ELSF
Sbjct: 269 FLYKRLPLRPQDQ-TDPTEPFDSPIMAGGLFAISAKFFWELGGYDEELDIWGGEQYELSF 327
Query: 124 KFNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGT------YDSGFDIWG 176
K W V P + G ++ F GT + ++W
Sbjct: 328 KI-WQC-----------GGRMVDAPCSHVGHVYRGLAPFPNPRGTNFVTRNFKRVAEVWM 375
Query: 177 GENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-EVSNDW------------ 216
E + F K D GD+T +K LR L CK FKW+L EV+ D
Sbjct: 376 DEYKQFLFERNPEYDKTDAGDLTKQKALRERLQCKPFKWFLEEVAPDLLLKYPLRDPKPF 435
Query: 217 -SG---------MCIDSACKPTDMHKPVGLYPC-----HKQGGNQFWMMSKHGEIR--RD 259
SG +C+DS +P+G++ C H Q NQF+ +S +IR
Sbjct: 436 ASGRVQSLANPILCLDSLNHKE--KEPIGVFSCAANKTHPQ-SNQFFTLSYFRDIRVASV 492
Query: 260 EACLDYA--GGDVILYPCHGSKGNQYFEYDYK 289
+ CLD A G +V L+ CH +GNQ ++YD K
Sbjct: 493 DKCLDAASEGSEVRLFNCHEIQGNQLWQYDMK 524
>gi|260814835|ref|XP_002602119.1| hypothetical protein BRAFLDRAFT_125760 [Branchiostoma floridae]
gi|229287425|gb|EEN58131.1| hypothetical protein BRAFLDRAFT_125760 [Branchiostoma floridae]
Length = 1164
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 69/180 (38%), Positives = 94/180 (52%), Gaps = 24/180 (13%)
Query: 48 TSSYKFFIGGFDWNLQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY 107
+SS GGFDW + F W+ +P+ E R K P+ +PTMAGGLFSI K FFE+LGTY
Sbjct: 885 SSSGHMTRGGFDWRMHFRWNTVPDYEMARRKMEKAPIRSPTMAGGLFSIHKMFFEELGTY 944
Query: 108 DSGFDIWGGENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFF 162
D G +IWGGENLELSFK +P ++P P GG+ ++ +
Sbjct: 945 DPGLEIWGGENLELSFKTWMCGGTLEILPCSRVGHIFRQSQPYRFP--GGGMQTVQRNSL 1002
Query: 163 EKLGTYDSGFDIWGGENLELSFKG----------DFGDVTSRKELRRNLGCKSFKWYLEV 212
+ +W E +F +GDV+ R++LR LGCKSF+WYL+
Sbjct: 1003 RVV-------QVWMDERHRKAFYAVNPELKDMNISYGDVSERRQLRDRLGCKSFQWYLDT 1055
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 52/120 (43%), Positives = 64/120 (53%), Gaps = 11/120 (9%)
Query: 69 IPERE---RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF 125
+PER R R + AA + G +D + GFD W F
Sbjct: 850 LPERAGLIRARLRGAAVRRLLESKGGISLHLDHLYSSSGHMTRGGFD-W-------RMHF 901
Query: 126 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 185
W+ +P+ E R K P+ +PTMAGGLFSI K FFE+LGTYD G +IWGGENLELSFK
Sbjct: 902 RWNTVPDYEMARRKMEKAPIRSPTMAGGLFSIHKMFFEELGTYDPGLEIWGGENLELSFK 961
>gi|260787295|ref|XP_002588689.1| hypothetical protein BRAFLDRAFT_248153 [Branchiostoma floridae]
gi|229273857|gb|EEN44700.1| hypothetical protein BRAFLDRAFT_248153 [Branchiostoma floridae]
Length = 415
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 79/216 (36%), Positives = 111/216 (51%), Gaps = 25/216 (11%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL+PLLD + + + VV P I + + TF + + GGFDW L F W ++
Sbjct: 190 WLEPLLDRIREDRTRVVCPSIDRVNEATFAYEV-------ANENVRGGFDWELFFQWVSL 242
Query: 70 PERERKRHKNAA---EPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF- 125
P E KR + E + +PTMAGGLFSID+ FF +LG YD GF IWGGENLELSFK
Sbjct: 243 PAVEAKRRTHNVFQHEVIRSPTMAGGLFSIDRGFFYELGGYDPGFQIWGGENLELSFKIW 302
Query: 126 ----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY------DSGFDIW 175
+ +P ++P + + A + + +L + +
Sbjct: 303 MCGGSLEILPCSRVGHVFRKSQP-YNYSNATSIMEVVHHNNVRLAEVWLDEYKKIYYALH 361
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
G +EL+ GD++ RK LR NLGC+SF+WYLE
Sbjct: 362 PGVEVELA---KMGDISERKLLRENLGCRSFQWYLE 394
>gi|196006600|ref|XP_002113166.1| hypothetical protein TRIADDRAFT_27135 [Trichoplax adhaerens]
gi|190583570|gb|EDV23640.1| hypothetical protein TRIADDRAFT_27135, partial [Trichoplax
adhaerens]
Length = 491
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 72/214 (33%), Positives = 112/214 (52%), Gaps = 16/214 (7%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV ++W +PLL+ + N +VSP++ NI +TFE + + GGFDW+L
Sbjct: 146 CEVNQQWAEPLLEQIVLNPKAIVSPVLDNIDMNTFEYQ-------EGTEDVRGGFDWSLT 198
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + E + + P+ TPT+AGG++++ K +F LG YD G IWGGENLELSF
Sbjct: 199 FRWDYMTEAMINQRIDPTSPIKTPTIAGGIYAVSKQWFNDLGEYDMGQKIWGGENLELSF 258
Query: 124 KFNW------HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
+ W IP P P AG + + + + + ++
Sbjct: 259 R-AWMCGGFMKIIPCSRVGHVFRLQHPYIFPEGAGRTYY--RNLRRVVEVWLDEYKVYFY 315
Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ ++ D+G+V SRK+LR+ L C++FKWYL+
Sbjct: 316 QIRKIIKSIDYGNVKSRKQLRKRLHCQTFKWYLD 349
>gi|390332219|ref|XP_781199.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like
[Strongylocentrotus purpuratus]
Length = 606
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 88/310 (28%), Positives = 140/310 (45%), Gaps = 36/310 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL +A N + V P+I I D + R P + GGFDW+L
Sbjct: 257 CEVGVNWLPPLLTPIAVNRTTAVCPIIDVI--DNMDYRVYPQGTGDQDR---GGFDWSLY 311
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ +P+ E+ R ++A+EP +P MAGGLF++D+ +F +LG YD G +IWGGEN ELSF
Sbjct: 312 WKHLPVPQFEKSRRQHASEPYRSPAMAGGLFAMDRKYFFELGAYDEGLEIWGGENFELSF 371
Query: 124 KFNWHA------IP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
K W +P R ++ + ++ L ++ + + + +
Sbjct: 372 KI-WMCGGSLLWVPCSRVGHVYRILGKVPYSAPNGSMLILSERNLRRVVEVWFDDYKEYF 430
Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL---------------------EVSND 215
+ S G++ + R CKSF W++ E+
Sbjct: 431 YRSKPESLLVSTGNIEKQLAFREKFHCKSFGWFMKEIAPDIIEKYPLPHANKYWGEIRTK 490
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPC 275
+C+DS VG+ CH GGNQ + ++++G++R + C +V L C
Sbjct: 491 KGSLCVDSMGSKDGGR--VGMSYCHGAGGNQLFRVTENGQLRIHDQCAYDHYKEVRLRRC 548
Query: 276 HGSKGNQYFE 285
GS G F+
Sbjct: 549 GGSGGGWSFD 558
>gi|157117587|ref|XP_001658839.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108875983|gb|EAT40208.1| AAEL008037-PA [Aedes aegypti]
Length = 662
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 92/306 (30%), Positives = 141/306 (46%), Gaps = 46/306 (15%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
EV W++PLL + N + + P+I I DTF + SS GGF+W L F
Sbjct: 295 EVNVDWVEPLLQRIKTNKTILAMPVIDIINSDTF--------IYSSSPLVRGGFNWGLHF 346
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
W +P+ + + P +PTMAGGLF++D+ +F+ LG YD G D+WGGENLE+SF+
Sbjct: 347 KWDNLPKGTLAKESDFVGPFQSPTMAGGLFAVDRQYFKDLGEYDMGMDVWGGENLEISFR 406
Query: 125 FNWHA-----------IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
W I RKR + P + TM + + + + Y
Sbjct: 407 -TWQCGGSIELVPCSRIGHVFRKR-RPYGSPDGSDTMIRNSLRLSRVWMDDYIKY----- 459
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTD--MH 231
EN + K D GD+T R +LR+ L CKSF+WYL+ N + + + K TD +
Sbjct: 460 --FLENQPQAKKVDPGDLTDRHDLRKRLNCKSFEWYLK--NIYPQLKLPGE-KTTDSNVS 514
Query: 232 KPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY----------AGGDVILYPCHGSKGN 281
+P P H + N ++ S + C+ G ++L+PC K
Sbjct: 515 QP-KFQPWHSRKRN--YISSFQIRLSNSSLCVTTESAKEKSLWKKGSHLVLHPCLRVKAQ 571
Query: 282 QYFEYD 287
++E +
Sbjct: 572 MWYETE 577
>gi|195338421|ref|XP_002035823.1| GM15572 [Drosophila sechellia]
gi|194129703|gb|EDW51746.1| GM15572 [Drosophila sechellia]
Length = 604
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 91/306 (29%), Positives = 146/306 (47%), Gaps = 42/306 (13%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
EV ++WL+PLL ++ ++ + P+I I DTFE + P L GGF+W L F
Sbjct: 249 EVNQQWLEPLLRLIKSENATLAVPVIDLINADTFE--YTPSPLVR------GGFNWGLHF 300
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
W +PE K ++ P +PTMAGGLF++++ +F+ LG YD DIWGGEN+E+SF+
Sbjct: 301 RWENLPEGTLKVPEDFRGPFRSPTMAGGLFAVNRKYFQHLGEYDMAMDIWGGENIEISFR 360
Query: 125 FNWHA------IPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
W +P RKR + P TM + + ++ Y
Sbjct: 361 -AWQCGGAIKIVPCSRVGHIFRKR-RPYTSPDGANTMLKNSLRLAHVWMDQYKDY----- 413
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGMCIDSACKPT 228
E + ++ D+GD++ R +LR L C+ F WYL E+ +G + +A
Sbjct: 414 YLKHEKVPKAY--DYGDISDRLKLRERLQCRDFAWYLKNVYPELHLRLTGTELCAAVVAP 471
Query: 229 DMH------KPVGLYPCHKQGGNQFWMMSKHGEIRRDE-ACLDYAG-GDVILYPCHGSKG 280
+ + L C ++ NQ W ++ EI D+ CL+ +G V + CH G
Sbjct: 472 KVKGFWKKGSSLQLQTC-RRTPNQLWYETEKAEIVLDKLLCLEASGDAQVTVNKCHEMLG 530
Query: 281 NQYFEY 286
+Q + +
Sbjct: 531 DQQWRH 536
>gi|334348942|ref|XP_001380115.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2-like [Monodelphis domestica]
Length = 642
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 89/292 (30%), Positives = 130/292 (44%), Gaps = 33/292 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE K WL+PLL +A + S +VSP+I I F+ S G FDW L
Sbjct: 288 CECHKGWLEPLLSRIAGDRSRLVSPIIDVIDWKNFQYYH-------SMDLQRGVFDWELN 340
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+W +PE+ERK ++ P+ +P + GG+ +ID+ +F+ G YDS IWG ENLELS
Sbjct: 341 FHWRPLPEQERKMRQSPISPIRSPVLPGGVLAIDRHYFQNTGAYDSLMSIWGSENLELSI 400
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ + IP P +P L + + LG++ F +
Sbjct: 401 RVWLCGGSVEIIPCSRVGHVYRHQPPNASPDPEAALKNKIRIVETWLGSFKDTFYQHSPK 460
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYP 238
LS + + D + R +L+R LGC++F W+L + P LYP
Sbjct: 461 AFSLS-QAEKQDCSERLQLQRRLGCRTFHWFL------------ANLSPE-------LYP 500
Query: 239 C-HKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
HK G + S G + GG V+L PC S+ Q+ EY K
Sbjct: 501 SEHKPGFSGKLYSSGVGSCAECVSGQGLPGGWVMLSPCSDSRQPQHLEYTSK 552
>gi|402586218|gb|EJW80156.1| glycosyltransferase, partial [Wuchereria bancrofti]
Length = 448
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 81/226 (35%), Positives = 112/226 (49%), Gaps = 39/226 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + N VV+P+I I DTF+ L GGF+WNL
Sbjct: 236 CECNVNWLEPLLARVKENHRAVVAPVIDIIDKDTFKYIAASADLR-------GGFEWNLI 288
Query: 64 FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + + R RH P+ TP +AGGLF I K +FEKLGTYD D+WGGENLELS
Sbjct: 289 FKWEYLLGKLRDDRHAQPTAPIRTPVIAGGLFMIQKDWFEKLGTYDEEMDVWGGENLELS 348
Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
F+ + IP RK+H T GG ++ + ++
Sbjct: 349 FRVWLCGGSLEIIPCSRVGHVFRKQHPY--------TFPGGSSNVFQKNTRRVA------ 394
Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
++W G+ L + +FGD+T+R +L++ L CK F WYL+
Sbjct: 395 EVWLGDYKHLYLRKVPSARYVNFGDITARLDLKKRLHCKDFDWYLK 440
>gi|449267121|gb|EMC78087.1| Polypeptide N-acetylgalactosaminyltransferase 10, partial [Columba
livia]
Length = 560
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 103/338 (30%), Positives = 143/338 (42%), Gaps = 72/338 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F G T + G FDW +
Sbjct: 183 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDHF------GYETQAGDAMRGAFDWEMY 236
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD+G +IWGGE E+SF
Sbjct: 237 YKRIPIPPELQKL--DPSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISF 294
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPT---MAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P PT +A L + + + ++ Y
Sbjct: 295 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPTGVSLARNLKRVAEVWMDEYAEY---IYQR 351
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------------- 210
E LS GDV ++KELR NL CKSFKW++
Sbjct: 352 RPEYRHLS----AGDVAAQKELRNNLNCKSFKWFMNEVAWDLPKFYPPVEPPAAAWGEAR 407
Query: 211 --------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEI 256
++ N +G+C+D+ K + P+ L C K G W S +I
Sbjct: 408 DSATSLLFQIRNVGTGLCVDT--KHGALGSPLRLENCVKDRGEAAWNNVQVFTFSWREDI 465
Query: 257 RRDEA------CLDYA--GGDVILYPCHGSKGNQYFEY 286
R + C D V LY CHG KGNQ + Y
Sbjct: 466 RPGDPQHTKKFCFDAISHSSPVTLYDCHGMKGNQLWRY 503
>gi|170591418|ref|XP_001900467.1| Polypeptide N-acetylgalactosaminyltransferase [Brugia malayi]
gi|158592079|gb|EDP30681.1| Polypeptide N-acetylgalactosaminyltransferase, putative [Brugia
malayi]
Length = 575
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 77/212 (36%), Positives = 112/212 (52%), Gaps = 23/212 (10%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE K W++PLL + N VV P+I I D TF + S + F GGF+WNLQ
Sbjct: 185 CECTKGWMEPLLARIKENRKAVVCPVIDIINDRTFAYQ-------KSIELFRGGFNWNLQ 237
Query: 64 FNWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+A+P E + R + +P+ +PTMAGGLFSID+ +FE++GTYD DIWGGEN+E+S
Sbjct: 238 FRWYALPSEMIKSRSDDPTKPIISPTMAGGLFSIDRKYFEEIGTYDHEMDIWGGENIEIS 297
Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE---N 179
+ + +P P P G +I + ++ ++W E +
Sbjct: 298 LRV-FEILPCSHVGHVFRRTSPHDFPGRKSG--TILNSNLLRVA------EVWMDEWKFH 348
Query: 180 LELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ FG V + R+ L CKSFKW+L+
Sbjct: 349 FYRTAPRRFGCVVNS---RKRLHCKSFKWFLD 377
>gi|383862333|ref|XP_003706638.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Megachile rotundata]
Length = 637
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 92/305 (30%), Positives = 139/305 (45%), Gaps = 41/305 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV K W++PLL +A + + V P+I I DTF+ P GGF+W L
Sbjct: 266 IEVNKMWIEPLLSRIAHSKTIVAMPVIDIINADTFQYTASP--------LVRGGFNWGLH 317
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P + ++ +P+ +PTMAGGLF++D+ +F +LG YD+G D+WGGENLE+SF
Sbjct: 318 FKWEQLPTK-LVHDEDFIKPIKSPTMAGGLFAMDREYFVELGEYDAGMDVWGGENLEISF 376
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + IP RKR A+ + L + + L Y +
Sbjct: 377 RIWMCGGSIELIPCSRVGHVFRKRRPYGADDKHDTMLKNSL----RVAYVWLDEYKHYY- 431
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGMCIDSACKPT 228
L+ K D+GD+T R LR+ L CK F WY+ E++
Sbjct: 432 ------LKDVNKIDYGDITDRLNLRQKLKCKDFAWYVKEVYPELTFPDDDKKRLKDKWAR 485
Query: 229 DMHKPVGLYPCHKQGGN---QFWM-MSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYF 284
KP + P H + N Q+ + +S + E + G +IL PC K ++
Sbjct: 486 IEQKP--MQPWHSRKRNYTDQYQIRLSNTALCIQSEKDIKTKGAKLILMPCLRIKSQMWY 543
Query: 285 EYDYK 289
E D K
Sbjct: 544 ETDKK 548
>gi|313228070|emb|CBY23220.1| unnamed protein product [Oikopleura dioica]
Length = 467
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 91/287 (31%), Positives = 133/287 (46%), Gaps = 47/287 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A + V+P+I NI D+ FE+R T+ + IG F W +
Sbjct: 190 CEAISGWLEPLLQRVAEKPNVAVTPVILNIRDNDFEIR-----ATAPHNVQIGIFTWGMT 244
Query: 64 FNWHAIPERERKRH--KNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
F W R+R + N+ + V +PTMAGGLF+I++ +F G+YD WGGENLE+
Sbjct: 245 FTWERYFWRKRLNNVKNNSTKCVPSPTMAGGLFAINREYFYYSGSYDEQMHGWGGENLEM 304
Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
SF+ P + P P A G ++++ D+W
Sbjct: 305 SFRLWQCGGGIETHPCSQVGHVFRTHSPYKIPEGAEG-YNLNMRRL---------VDVWL 354
Query: 177 GENLELSFK------GDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDW----- 216
E EL + GD GD++ R L+ L CKSF WY++ +ND
Sbjct: 355 DEFKELYYSRSGGVWGDEGDISERLALKEKLQCKSFAWYMDNVATSIDYFFANDTRTGFL 414
Query: 217 --SGMCIDSACKPTDM--HKPVGLYPCH-KQGGNQFWMMSKHGEIRR 258
+G C+D P + VG YPCH + GGNQ M ++ + R
Sbjct: 415 HSNGHCLDVGNLPMPLAPQNDVGTYPCHFEVGGNQVVMFTRLKGLTR 461
>gi|242005043|ref|XP_002423384.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212506428|gb|EEB10646.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 573
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 110/334 (32%), Positives = 141/334 (42%), Gaps = 77/334 (23%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFP-PGRLTSSYKFFIGGFDWNLQ 63
E WL PLL+ +A N V P I I DTFE R GR G FDW +
Sbjct: 220 EANVNWLPPLLEPIAENYKTCVCPFIDVIAHDTFEYRAQDEGRR--------GAFDW--E 269
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F + +P K+ EP +P MAGGLF+I FF +LG YD G IWGGE ELSF
Sbjct: 270 FFYKRLPLLPEDL-KHPTEPFQSPVMAGGLFAISAKFFWELGGYDEGLAIWGGEQYELSF 328
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-------TYDSGFDIWG 176
K W + V P G A F G Y ++W
Sbjct: 329 KI-WQC-----------GGKMVDAPCSRVGHIYRKFAPFPNPGIGDFVGKNYRRVAEVWM 376
Query: 177 GENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-VSNDW------------ 216
E E +K D GD+T +K +R L CK FKW++E ++ D
Sbjct: 377 DEYAEYLYKRRPHYRNIDPGDLTVQKAVRERLNCKPFKWFIENIAFDLPLKYPPIEPPDL 436
Query: 217 ----------SGMCIDSACK-PTDMHKPVGLYPCHKQ------GGNQFWMMSKHGEIRRD 259
G+C+D+ K P D GL PC K Q+++++ H +IR
Sbjct: 437 AEGEIRSIADPGLCVDTERKEPEDT---FGLKPCEKNFKSKNTRTEQYFILTWHEDIRPK 493
Query: 260 --EACLDYAGGD----VILYPCHGSKGNQYFEYD 287
C D + D V LY CHG KGNQY+ YD
Sbjct: 494 GRNVCWDVSSIDNKASVNLYKCHGMKGNQYWHYD 527
>gi|260800261|ref|XP_002595052.1| hypothetical protein BRAFLDRAFT_125761 [Branchiostoma floridae]
gi|229280294|gb|EEN51063.1| hypothetical protein BRAFLDRAFT_125761 [Branchiostoma floridae]
Length = 941
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 85/269 (31%), Positives = 119/269 (44%), Gaps = 81/269 (30%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL+PLLD + RN + V P I I D+TF ++ + GGF+W ++F+W ++
Sbjct: 643 WLEPLLDRIGRNRTTVPCPSIDRINDNTFGYE-------AANENMRGGFNWGMKFDWVSL 695
Query: 70 PERERKRHK----NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF 125
P E R + E + +PTMAGGLFSID+ FF +LG YD GF IWG ENLE+SFK
Sbjct: 696 PPGEDDRRYQDIWSQNEIIKSPTMAGGLFSIDRRFFWELGGYDPGFQIWGAENLEISFKD 755
Query: 126 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 185
++A+ NA
Sbjct: 756 IFYALNPHVENEIANA-------------------------------------------- 771
Query: 186 GDFGDVTSRKELRRNLGCKSFKWYL------------------EVSNDWSGMCIDSACKP 227
GDV+ RK +R LGCKSF+WY+ EV N +C+D+
Sbjct: 772 ---GDVSDRKRMREQLGCKSFQWYIDHVYPEITIPDLRAKARGEVKNRAMSLCLDAV--- 825
Query: 228 TDMHKPVGLYPCHKQGGNQFWMMSKHGEI 256
+ VG Y CH +GG Q + + +I
Sbjct: 826 --YGEKVGAYFCHGEGGQQSFTLRMDDKI 852
>gi|170582702|ref|XP_001896248.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158596593|gb|EDP34915.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 520
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 87/296 (29%), Positives = 137/296 (46%), Gaps = 58/296 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL+PLL + S V+ P+I +I +T + +GGF W+L
Sbjct: 169 CEVGEGWLEPLLARIKDKRSAVLCPIINHISAETLTYS------ANDRPTNVGGFSWSLH 222
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P+ + EP+ +PTMAGGL ++D+++F ++G YD DIWGGENLE+SF
Sbjct: 223 FLWDPMPKE--YFDADPTEPIRSPTMAGGLLAVDRSYFFEVGGYDPKMDIWGGENLEMSF 280
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
+ W E + P G D + +G D+ D+ G + L+
Sbjct: 281 RV-WMCGGSIE-----------FIPCSHVGHIFRDGHPYNMIGPGDNK-DVHGTNSKRLA 327
Query: 184 -----------------FKG-DFGDVTSRKELRRNLGCKSFKWYLE-------------- 211
KG D GD++ R+ LR+ L CKSFKWYL+
Sbjct: 328 EVWMDDYKKFYYIHRLDLKGKDVGDLSERRALRQKLRCKSFKWYLQNVAKNKFVLDENVA 387
Query: 212 ----VSNDWSGMCIDSACKPTDMHKPVGLYPCHK-QGGNQFWMMSKHGEIRRDEAC 262
+ N SG+C+D+ + D P+ ++ C + Q + ++ G +RR+ C
Sbjct: 388 AFGALRNPSSGLCLDTLQRNEDEVIPLCVFSCQNGKSQTQIFSLTNDGILRRELTC 443
>gi|332243646|ref|XP_003270989.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5
[Nomascus leucogenys]
Length = 443
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/220 (34%), Positives = 106/220 (48%), Gaps = 31/220 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL+PLL +A++ VV PLI I D T E + P G FDWNLQ
Sbjct: 230 CEVNRVWLEPLLHAIAKDPKVVVCPLIDVIDDRTLEYKPSP--------VVRGTFDWNLQ 281
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + E + +P+W+P M+GG+F+I + +F ++G YD D WGGENLELS
Sbjct: 282 FKWDNVFSYEMDGPEGPTKPIWSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGGENLELSL 341
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ IP R H + + T+ + Y +W E
Sbjct: 342 RIWMCGGQLFIIP-CSRVGHISKKQTGKPSTIISAM----------THNYLRLVHVWLDE 390
Query: 179 NLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE 211
E F +G++ +R ELR+ LGCKSF+WYL+
Sbjct: 391 YKEQFFLRKPGLKYVTYGNIRARVELRKRLGCKSFQWYLD 430
>gi|402593617|gb|EJW87544.1| glycosyltransferase [Wuchereria bancrofti]
Length = 520
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 88/287 (30%), Positives = 135/287 (47%), Gaps = 40/287 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL+PLL + S V+ P+I +I +T + +GGF W+L
Sbjct: 169 CEVGEGWLEPLLARIKDKRSAVLCPIINHISPETLTYS------ANDRPAHVGGFWWSLH 222
Query: 64 FNWHAIPERERKRHKNA--AEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
F W +P K + +A EP+ +PTMAGGL ++D+ +F ++G YD DIWGGENLE+
Sbjct: 223 FRWDPMP----KEYSDADPTEPIRSPTMAGGLLAVDRLYFFEVGGYDPEMDIWGGENLEM 278
Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY--DSGFDI 174
SF+ + IP A P + G + ++L D
Sbjct: 279 SFRVWMCGGSVEFIPCSHVGHIFRAGHP-YNMIGPGNNKDVHGTNSKRLAEVWMDDYKKF 337
Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDW 216
+ L+L K D GD++ RK LR+ L CKSFKWYLE + N
Sbjct: 338 YYIHRLDLKEK-DVGDLSERKALRQKLKCKSFKWYLENVAKNKFVLDENVAAFGSLRNPS 396
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHK-QGGNQFWMMSKHGEIRRDEAC 262
S +C+D+ + P+ ++PC + Q + ++ G +RR+ C
Sbjct: 397 SELCLDTLQRDEGEAIPLSVFPCQNGKSEAQIFSLTNDGILRRELTC 443
>gi|195429102|ref|XP_002062603.1| GK16570 [Drosophila willistoni]
gi|194158688|gb|EDW73589.1| GK16570 [Drosophila willistoni]
Length = 679
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 107/331 (32%), Positives = 146/331 (44%), Gaps = 65/331 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E WL PLL+ +A N V P I I F R + + G FDW +
Sbjct: 307 VEANYNWLPPLLEPIAINERTAVCPFIDVIDHSNFNYR-------AQDEGARGAFDW--E 357
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F + +P K+ +EP +P MAGGLF+I FF +LG YD G DIWGGE ELSF
Sbjct: 358 FFYKRLPLLPEDL-KHPSEPFKSPVMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSF 416
Query: 124 KF------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFE------KLGTYDSG 171
K + A R ++ V +P L K E K YD G
Sbjct: 417 KIWMCGGEMYDAPCSRVGHIYRGPRNHVPSPRTGDYLHKNYKRVAEVWMDEYKKYLYDHG 476
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-EVSNDW-------------- 216
I+ + D GD+T++K +R L CKSFKW++ EV+ D
Sbjct: 477 DGIYD--------RVDAGDLTAQKAIRTKLKCKSFKWFMEEVAFDLMKSYPPIDPPDYAS 528
Query: 217 --------SGMCIDSACKPTDMHKPVGLYPC----HKQGGNQFWMMSKHGEI--RRDEAC 262
S +C+D+ H +G+Y C K NQF+ +S ++ RR + C
Sbjct: 529 GAIQNVGDSSLCVDTHG--LRKHNRMGVYSCAEDLQKPQRNQFFQLSWKRDLRQRRKKDC 586
Query: 263 LDY----AGGDVILYPCHGSKGNQYFEYDYK 289
LD A V L+ CHG +GNQY+ YDY+
Sbjct: 587 LDVQIWDANAPVWLWDCHGQQGNQYWFYDYR 617
>gi|195377912|ref|XP_002047731.1| GJ13596 [Drosophila virilis]
gi|194154889|gb|EDW70073.1| GJ13596 [Drosophila virilis]
Length = 675
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 106/330 (32%), Positives = 146/330 (44%), Gaps = 65/330 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E WL PLLD +A+N V P I I F R + + G FDW+
Sbjct: 307 VEANYNWLPPLLDPIAQNKRAAVCPFIDVIDHSNFNYR-------AQDEGARGAFDWD-- 357
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F + +P K+ ++P +P MAGGLF+I + FF +LG YD G DIWGGE ELSF
Sbjct: 358 FFYKRLPLLPEDL-KHPSDPFKSPVMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSF 416
Query: 124 KF------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFE------KLGTYDSG 171
K + A R ++ + V P L K E K Y+ G
Sbjct: 417 KIWMCGGEMYDAPCSRVGHIYRGPRQGVKNPRSGDYLHKNYKRVAEVWMDEYKNYLYNHG 476
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW-------------- 216
I+ D GD+T++K +R L CKSFKW++E V+ D
Sbjct: 477 DGIYDN--------VDPGDLTAQKAIRTKLKCKSFKWFMENVAFDLMKSYPPVDPPDYAS 528
Query: 217 --------SGMCIDSACKPTDMHKPVGLYPCH----KQGGNQFWMMS--KHGEIRRDEAC 262
+ +CID+ + H VG+Y C K Q+W +S + +RR + C
Sbjct: 529 GAIQNVGDNTLCIDTLGRVR--HNRVGMYRCAIDLVKPQRTQYWSLSWKRDLRLRRKKDC 586
Query: 263 LDY----AGGDVILYPCHGSKGNQYFEYDY 288
LD A V L+ CHG +GNQY+ YDY
Sbjct: 587 LDVQIWDANAPVWLWDCHGQQGNQYWFYDY 616
>gi|195120520|ref|XP_002004772.1| GI19414 [Drosophila mojavensis]
gi|193909840|gb|EDW08707.1| GI19414 [Drosophila mojavensis]
Length = 604
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 91/308 (29%), Positives = 137/308 (44%), Gaps = 44/308 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
CE + W +PLL + + + V+ P+I I F+ T+ YK F +GGF W+
Sbjct: 249 CEANEGWCEPLLQRIKESRTSVLVPIIDVIDAKDFQYS------TNGYKSFQVGGFQWSG 302
Query: 63 QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
F+W +PERE++R P ++PTMAGGLF++D+ +F ++G+YD D WGG
Sbjct: 303 HFDWVNLPEREKQRQLRECSQPREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGG 362
Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
ENLE+SF+ IP P P I+ A L D
Sbjct: 363 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 420
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
+++ +L F D GDVT R LR+ L CKSF+WYL +V
Sbjct: 421 INVFFLNRPDLKFHPDIGDVTHRVVLRKKLRCKSFEWYLKNVYPEKFVPNMNVKAWGKVK 480
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
S +C+D + +GLY C K +Q + + +R + +C
Sbjct: 481 AVNSNLCLDDLLNNNEKPYNLGLYACGKALQKSQLFSYTNSLVLRNELSCATVQHSSSPP 540
Query: 270 --VILYPC 275
V++ PC
Sbjct: 541 HRVVMVPC 548
>gi|260790280|ref|XP_002590171.1| hypothetical protein BRAFLDRAFT_90906 [Branchiostoma floridae]
gi|229275360|gb|EEN46182.1| hypothetical protein BRAFLDRAFT_90906 [Branchiostoma floridae]
Length = 1466
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 94/325 (28%), Positives = 145/325 (44%), Gaps = 83/325 (25%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIG-GFDWNL 62
E WL+PL+D +AR+ VVSP I I DTF + L ++ + +G GFD
Sbjct: 1113 VECNTGWLEPLVDRIARDRKTVVSPGIDWIHGDTFAYDYGIDTLRVTWGWNLGFGFDHEH 1172
Query: 63 QFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
W + E E+ +PV +P + GGLF+ID+ +F ++G YD G + WGGE+ E+S
Sbjct: 1173 AERWVQLSEDEQ------VKPVRSPMLLGGLFAIDRQYFREIGMYDPGLEYWGGEHFEIS 1226
Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG------TYDSG----F 172
FK W M GG SI+ ++G TY +G
Sbjct: 1227 FK--------------------AW---MCGG--SIEVLPCSRVGHVWGKKTYSTGNMTLH 1261
Query: 173 DIWGGENLELS------------------FKGDFGDVTSRKELRRNLGCKSFKWYL---- 210
D N+ ++ K FGD++ R+ LR L CK F+WYL
Sbjct: 1262 DWASRNNMRVAEVWMDHYKVHYYIRRPYLMKRKFGDISDRRRLRERLQCKDFRWYLDNAF 1321
Query: 211 --------------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI 256
+V N+ + C+D KP + + ++PCH G QF+ ++ ++
Sbjct: 1322 PDLYIPDDIPGRYGQVRNNGTNTCLDWTSKP---QRELEMFPCHHGLGTQFFELTGQNQL 1378
Query: 257 RRDEACLDYA--GGDVILYPCHGSK 279
R + +CL+ G DV+L C S+
Sbjct: 1379 RDERSCLEARDDGSDVMLVTCGRSE 1403
>gi|427784527|gb|JAA57715.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 612
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 101/330 (30%), Positives = 144/330 (43%), Gaps = 72/330 (21%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
E WL PLL+ +A++ VV P I I +TF R + + G FDW L +
Sbjct: 257 EANVNWLPPLLEPIAKDYRTVVCPFIDVIDYETFAYR-------AQDEGARGSFDWELYY 309
Query: 65 N-WHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+PE N EP +P MAGGLF+I + +F +LG YD G D+WGGE ELSF
Sbjct: 310 KRLPLLPED----LANPTEPFKSPVMAGGLFAISRRYFWELGGYDEGLDVWGGEQYELSF 365
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-------TYDSGFDIWG 176
K W V P G A F G Y ++W
Sbjct: 366 KI-WQC-----------GGTMVDAPCSRVGHIYRKFAPFPNPGIGDFVGRNYRRVAEVWM 413
Query: 177 GENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL------------------- 210
E E + + GD+T++KELR+ L CKSFKW++
Sbjct: 414 DEYKEYLYMRRPHYRNLEPGDLTAQKELRKRLNCKSFKWFMENVAFDQPSKYPAIEPPDY 473
Query: 211 ---EVSNDWSGMCIDSACKPTDMHKPVGLYPCHK----QGGNQFWMMSKHGEIR--RDEA 261
E+ ++ S +CID+ K ++ L C + Q G Q +++ H +IR +
Sbjct: 474 AWGEIRHEKSSLCIDTQFK--GQNERFSLEKCIRDHRDQSGEQHLVLTWHKDIRPQKRTV 531
Query: 262 CLDYAGGD----VILYPCHGSKGNQYFEYD 287
C D + + V+L+ CHG GNQ F+YD
Sbjct: 532 CFDVSSSEPRAPVVLWSCHGMHGNQLFKYD 561
>gi|432936506|ref|XP_004082149.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Oryzias latipes]
Length = 533
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 78/234 (33%), Positives = 104/234 (44%), Gaps = 56/234 (23%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP++ + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 195 CEVNTDWLQPMIQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 247
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ++ + P+ TP +AGG+F +DK++F LG YD+ DIWGGEN ELSF
Sbjct: 248 FKWEQIPIEQKMARSDPTLPIRTPVIAGGIFVMDKSWFNHLGQYDTHMDIWGGENFELSF 307
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--------- 169
+ VW M GG I F K YD
Sbjct: 308 R--------------------VW---MCGGSLEILPCSRVGHVFRKRHPYDFPEGNALTY 344
Query: 170 -----SGFDIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
++W E + + FG +T R LRR L CK F+WY+E
Sbjct: 345 IKNTRRAAEVWMDEYKQFYYSARPSAQGKAFGSITERLSLRRKLNCKPFRWYME 398
>gi|432901709|ref|XP_004076908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Oryzias latipes]
Length = 677
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 102/330 (30%), Positives = 142/330 (43%), Gaps = 67/330 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +A N +V P+I I D F G T + G FDW +
Sbjct: 312 CEANINWLPPLLDRIALNRKTIVCPMIDVIDHDNF------GYETQAGDAMRGAFDWEMY 365
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + +EP +P MAGGLF++D+ +F +LG YD+G +IWGGE E+SF
Sbjct: 366 YKRIPIPAELQK--NDPSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISF 423
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P P ++A L + + + ++ Y
Sbjct: 424 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPGGVSLARNLKRVAEVWMDEYAEYVYQRR-- 481
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDV ++KELR L CKSFKW++ EV
Sbjct: 482 -PEYRHLS----AGDVAAQKELRSTLNCKSFKWFMKEVAWDLPKHYPPVEPPAAAWGEVR 536
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI-----RRD--------- 259
+ SG+C++S K P+ L C K + W HG++ R D
Sbjct: 537 SAASGLCLES--KHFVSGTPIRLESCVKGRADVSW---GHGQVFTFGWREDIRVGDPMHT 591
Query: 260 -EACLDYAG--GDVILYPCHGSKGNQYFEY 286
+ C D V LY CHG +GNQ + Y
Sbjct: 592 KKVCFDAVSHHSPVTLYDCHGMRGNQLWRY 621
>gi|260812139|ref|XP_002600778.1| hypothetical protein BRAFLDRAFT_127524 [Branchiostoma floridae]
gi|229286068|gb|EEN56790.1| hypothetical protein BRAFLDRAFT_127524 [Branchiostoma floridae]
Length = 561
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 101/326 (30%), Positives = 139/326 (42%), Gaps = 65/326 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLL+ +A N +V P I I D F T + G FDW +
Sbjct: 195 CEANVNWLPPLLEPIALNKKTIVCPNIDVIDKDDFHYE------TQAGDAMRGAFDWEMY 248
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP+ ++ + ++P +P MAGGLF++D+ +FE+LG YD G DIWGGE ELSF
Sbjct: 249 YKRIPIPDE--IKNPDPSDPFESPVMAGGLFAVDREYFEELGGYDPGLDIWGGEQYELSF 306
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG------FDIWGG 177
K W V P G ++ + G ++W
Sbjct: 307 KV-WQC-----------GGRMVDAPCSRVGHVYRKFVPYKVPAGVNLGKNLKRVAEVWMD 354
Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLEVS----------------- 213
E E + K D GD++ + +LR L CK FKW+++V
Sbjct: 355 EYKEHLYKRRPHLRKTDMGDISGQLQLRERLKCKPFKWFMKVVAPDIILHYPPVEPEPAA 414
Query: 214 -----NDWSGMCIDSACKPTDMHKPVGLYPCHKQG----GNQFWMMSKHGEIRRD--EAC 262
N S +CIDS K V L C K G G Q + MS H +IR C
Sbjct: 415 SGEIWNKASNLCIDS--KHGGGQAEVRLDQCVKGGGIMNGEQNFHMSWHNDIRPKGRTFC 472
Query: 263 LD--YAGGDVILYPCHGSKGNQYFEY 286
D GG +IL+ CH GNQ++ Y
Sbjct: 473 FDAQMKGGTLILFACHQMLGNQHWLY 498
>gi|47221376|emb|CAF97294.1| unnamed protein product [Tetraodon nigroviridis]
Length = 675
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 144/331 (43%), Gaps = 69/331 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +A+N +V P+I I D F G T + G FDW +
Sbjct: 312 CEANVNWLPPLLDRIAQNRKTIVCPMIDVIDHDNF------GYETQAGDAMRGAFDWEMY 365
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K ++ +EP +P MAGGLF++D+ +F +LG YD+G +IWGGE E+SF
Sbjct: 366 YKRIPIPPELQK--EDPSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISF 423
Query: 124 KFNW------HAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDI 174
K W IP P P ++A L + + + ++ Y
Sbjct: 424 KV-WMCGGCMEDIPCSRVGHIYRKYVPYKVPGGVSLARNLKRVAEVWMDEYAEY---IYQ 479
Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EV 212
E LS GD ++K+LR L CKSFKW++ E+
Sbjct: 480 RRPEYRHLS----AGDTAAQKDLRSQLNCKSFKWFMTKVAWDLSKHYPPVEPPAAAWGEI 535
Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI-----RRD-------- 259
N S MC+++ K PV + C K G W HG++ R D
Sbjct: 536 RNVGSSMCLET--KHFVSGSPVWMESCLKGRGEVGW---NHGQVFTFGWREDIRVGDPMH 590
Query: 260 --EACLDYAGGD--VILYPCHGSKGNQYFEY 286
+ C D + V LY CHG KGNQ + Y
Sbjct: 591 TKKVCFDAVSNNSPVTLYDCHGMKGNQLWRY 621
>gi|195447414|ref|XP_002071203.1| GK25256 [Drosophila willistoni]
gi|194167288|gb|EDW82189.1| GK25256 [Drosophila willistoni]
Length = 587
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 90/311 (28%), Positives = 140/311 (45%), Gaps = 55/311 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + R+ + + P+I I FE R P T ++ F G F+W +
Sbjct: 234 CEVNLNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYR--PVYGTDNH--FRGIFEWGML 289
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +P RE++R + +EP +PT AGGLF+I++ +F +LG YD G +WGGEN ELSF
Sbjct: 290 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 349
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGL-------FSIDKAFFEKLG-----TYDSG 171
K W W P G ++ K +K G Y
Sbjct: 350 KI-WQC-----------GGSIEWVPCSRVGHVYRGFMPYNFGKLANKKKGPLITINYKRV 397
Query: 172 FDIWGGENLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSG 218
+ W E + L+ D GD+T + L++ L CKSF+W++ +V + + G
Sbjct: 398 IETWFDETHKEYFYTREPLARYLDMGDITEQLALKKRLNCKSFQWFMDHIAYDVYDKFPG 457
Query: 219 M-----------CIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLD 264
+ C + H+P +GL CH G NQ ++ G++ E C++
Sbjct: 458 LPANLHWGELRSVASDGCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCIE 517
Query: 265 YAGGDVILYPC 275
+ L C
Sbjct: 518 ADRQGIKLAVC 528
>gi|115497708|ref|NP_001069909.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
5 [Bos taurus]
gi|83405338|gb|AAI11261.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [Bos taurus]
gi|440895696|gb|ELR47826.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
5 [Bos grunniens mutus]
Length = 448
Score = 120 bits (301), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 109/217 (50%), Gaps = 25/217 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WL+PLL+ +A++ VV PLI I D L + P + G F+W L+
Sbjct: 235 CEVNKVWLEPLLNAIAKDPKMVVCPLIDVI--DYMTLEYQPSPIVR------GAFNWRLE 286
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + E + + P+ +P MAGG+F+I++ +F ++G YD G ++WGGENLELS
Sbjct: 287 FKWDHVLSYEIEGPEGPTTPIRSPAMAGGIFAINRHYFNEIGQYDKGMNLWGGENLELSL 346
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAF-FEKLGTYDSGFDIWGG 177
+ + IP R H N F I K + L + D + G
Sbjct: 347 RIWMCGGQLYVIP-CSRVGHINRQH-------VTNRFEIMKVVEYNNLRLVHTWLDEYKG 398
Query: 178 E---NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ +G+++ R ELR+ LGCKSF+WYL+
Sbjct: 399 QFFLRRPALKSAAYGNISERVELRKRLGCKSFQWYLD 435
>gi|291244621|ref|XP_002742193.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 7-like
[Saccoglossus kowalevskii]
Length = 634
Score = 120 bits (301), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 88/307 (28%), Positives = 132/307 (42%), Gaps = 51/307 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLL + +N VV PL+ + D F G + G F+W+
Sbjct: 292 CECSPNWLPPLLSRIKQNRKAVVCPLVDAVDADNF------GYAPQADGMARGVFNWDFF 345
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +E R + +EP +P MAGGLF++ ++FF +G YD+G DIWGGE E+SF
Sbjct: 346 YKRIPIPPKEANRRERNSEPYRSPVMAGGLFALSRSFFFDIGGYDNGLDIWGGEQYEISF 405
Query: 124 KFNWHA------IP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
K W +P R ++ P P G+ ++K + ++W
Sbjct: 406 KI-WMCGGILEFVPCSRVGHIYRRGGIPYSYPQSDDGISIVNKNYLRVA-------EVWM 457
Query: 177 GENLELSFK-------GDFGDVTSRKELRRNLGCKSFKWYL------------------- 210
E E ++ +GD+T + + R+ FKW++
Sbjct: 458 DEYKEYFYRMKPELRGKPYGDITEQVQFRQEHCPHDFKWFMDEVAYDITERFPLISKNIG 517
Query: 211 --EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGG 268
EV S C+DS + V LY CH GG+Q +++ GE R +E CL G
Sbjct: 518 WGEVRGVGSSKCVDSMGRSPSGK--VALYGCHGYGGSQLLRLNEGGEFRVNEECLYTDGS 575
Query: 269 DVILYPC 275
V L C
Sbjct: 576 TVKLERC 582
>gi|357622639|gb|EHJ74065.1| putative N-acetylgalactosaminyltransferase [Danaus plexippus]
Length = 646
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 105/359 (29%), Positives = 146/359 (40%), Gaps = 109/359 (30%)
Query: 4 CEVQKRWLQPLLDVLA--------RNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFI 55
EV WL PLL L+ R S V+P+I I DTFE P
Sbjct: 275 IEVNVDWLPPLLTRLSEGVDGVNVRFSPRAVTPIIDVINADTFEYTSSP--------LVR 326
Query: 56 GGFDWNLQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 115
GGF+W L F W +P+ K ++ +P+ +PTMAGGLF+I + +F K+G YDSG ++WG
Sbjct: 327 GGFNWGLHFKWDNLPKGTLKDDEDFIKPIRSPTMAGGLFAIYREYFNKIGKYDSGMNLWG 386
Query: 116 GENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYDS 170
GENLE+SF+ +W M GG+ + F K Y +
Sbjct: 387 GENLEISFR--------------------IW---MCGGVLELCPCSRVGHVFRKRRPYGA 423
Query: 171 GFD-----------IWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE- 211
G D +W E + + + GD++ R ELR++L CKSFKWYLE
Sbjct: 424 GEDYMLRNSMRMARVWMDEYVNKVIEQNPSAAHVSIGDISERVELRKSLKCKSFKWYLEN 483
Query: 212 ------------------VSND--------W-----------------SGMCIDSACKPT 228
ND W + +CI SA
Sbjct: 484 VYPELETGEDTAARKRIAALNDPEKNKFQPWHSRKRNYTDSYQIRLRNTSLCIQSAKDIK 543
Query: 229 DMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA-CLDYAGGDVILYPCHGSKGNQYFEY 286
P+ L C + NQ W + GE+ CLD A I+ CH G Q +++
Sbjct: 544 SKGSPLLLAGCTRT-INQMWFETDRGELVLGRTLCLD-ANTSPIIAKCHELGGTQEWKH 600
>gi|296488205|tpg|DAA30318.1| TPA: polypeptide N-acetylgalactosaminyltransferase-like 5 [Bos
taurus]
Length = 447
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 109/217 (50%), Gaps = 25/217 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WL+PLL+ +A++ VV PLI I D L + P + G F+W L+
Sbjct: 235 CEVNKVWLEPLLNAIAKDPKMVVCPLIDVI--DYMTLEYQPSPIVR------GAFNWRLE 286
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + E + + P+ +P MAGG+F+I++ +F ++G YD G ++WGGENLELS
Sbjct: 287 FKWDHVLSYEIEGPEGPTTPIRSPAMAGGIFAINRHYFNEIGQYDKGMNLWGGENLELSL 346
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAF-FEKLGTYDSGFDIWGG 177
+ + IP R H N F I K + L + D + G
Sbjct: 347 RIWMCGGQLYVIP-CSRVGHINRQH-------VTNRFEIMKVVEYNNLRLVHTWLDEYKG 398
Query: 178 E---NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ +G+++ R ELR+ LGCKSF+WYL+
Sbjct: 399 QFFLRRPALKSAAYGNISERVELRKRLGCKSFQWYLD 435
>gi|170065987|ref|XP_001868085.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
gi|167862691|gb|EDS26074.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
Length = 639
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 108/218 (49%), Gaps = 28/218 (12%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
EV W++PLL + N + + P+I I DTF P GGF+W L F
Sbjct: 272 EVNVDWIEPLLQRIKVNRTILAMPVIDIINSDTFAYTSSP--------LVRGGFNWGLHF 323
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
W +P+ + + P +PTMAGGLF++D+ +F++LG YD G D+WGGENLE+SF+
Sbjct: 324 KWDNLPKGSLAKETDFVGPFQSPTMAGGLFAMDRKYFKELGEYDMGMDVWGGENLEISFR 383
Query: 125 FNWHA-----------IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
W I RKR + P T TM + + + + Y
Sbjct: 384 -AWQCGGSIELLPCSRIGHVFRKR-RPYGSPDGTDTMIRNSLRLARVWMDDYIKY----- 436
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
EN + K D GD++ R+ELR L CKSF+WYL+
Sbjct: 437 --FFENQPHANKLDAGDLSERQELRNRLNCKSFEWYLK 472
>gi|332027983|gb|EGI68034.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Acromyrmex
echinatior]
Length = 597
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 104/314 (33%), Positives = 147/314 (46%), Gaps = 45/314 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WLQPLL + N + V+ P+I NI ++T E ++ F +GGF W+
Sbjct: 239 CEVIKDWLQPLLQRIKDNKNAVLMPIIDNISEETLEYFHD----NEAFFFQVGGFTWSGH 294
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W I + E + + P +PTMAGGLF+I++ +F ++G+YD D WGGENLE+SF
Sbjct: 295 FTWITIQKHEVESRFSPISPTRSPTMAGGLFAINRKYFWEIGSYDDKMDGWGGENLEISF 354
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPT--MAGGLFSIDKAFFEKLGTYDSGFDIWG 176
+ IP P P G+ + AF + Y F +
Sbjct: 355 RIWQCGGTLEIIPCSRVGHIFRNFHPYKFPNDKDTHGINTARLAFVW-MDEYKRLFLLHR 413
Query: 177 GE---NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVSND 215
E N EL GD++ R +LR+ L CKSFKWYL V
Sbjct: 414 SEFKDNPEL-----IGDISERLKLRKKLKCKSFKWYLNNVYPEKFIPDENAIAYGRVRLR 468
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCH-KQGGNQFWMMSKHGEIRRDEAC---LDYAGG--- 268
+C+D+ D +GLY CH K +QF+ +SK GE+RR+E C LD G
Sbjct: 469 NRRLCLDNLQHDDDKPYNLGLYNCHTKLYPSQFFSLSKSGELRREETCGRILDTDSGPYA 528
Query: 269 DVILYPCHGSKGNQ 282
+ + C KG +
Sbjct: 529 QIEMSDCSNEKGGK 542
>gi|224496010|ref|NP_001139074.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Danio rerio]
Length = 600
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 93/324 (28%), Positives = 142/324 (43%), Gaps = 55/324 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +A+N +V P+I I + F G + G FDW +
Sbjct: 234 CEANINWLPPLLDQIAQNPKTIVCPMIDVIDHNHF------GYEAQAGDAMRGAFDWEMY 287
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP + + ++P +P MAGGLF++++ +F +LG YD+G +IWGGE E+SF
Sbjct: 288 YKRIPIPPELQG--PDPSDPYQSPVMAGGLFAVNRQWFWELGGYDTGLEIWGGEQFEISF 345
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K + + +P P P+ ++ + + Y E
Sbjct: 346 KVWMCGGSMYDVPCSRVGHIYRKYVPYKVPSGTSLARNLKRVAETWMDEYTEYIYQRRPE 405
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVSNDW 216
LS GD+T++KELR++L CK FKWY+ E+ N
Sbjct: 406 YRHLS----TGDLTAQKELRKHLKCKDFKWYMNTVAWDLPKYYPPVEPLPAAWGEIRNAA 461
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIRRDEA------CLD 264
SG+C+DS T + L C K+G + W + +IR + C D
Sbjct: 462 SGLCVDSKHGSTGTE--LRLDNCLKEGAERTWAHEQIFTFGWREDIRPGDPLHTRKFCFD 519
Query: 265 YAGGD--VILYPCHGSKGNQYFEY 286
+ + LY CHG KGNQ++ Y
Sbjct: 520 AISQNSPITLYDCHGMKGNQHWSY 543
>gi|301607546|ref|XP_002933365.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like isoform 1 [Xenopus (Silurana) tropicalis]
Length = 600
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 99/324 (30%), Positives = 136/324 (41%), Gaps = 55/324 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL+ +A N +V P+I I + F G + G FDW +
Sbjct: 234 CEVNVNWLPPLLNQIALNHKTIVCPMIDVIDHNHF------GYEAQAGDAMRGAFDWEMY 287
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP ++ + +EP +P MAGGLF++D+ +F +LG YD G +IWGGE ELSF
Sbjct: 288 YKRIPIPPELQR--TDPSEPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYELSF 345
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K +P P PT ++ + + Y E
Sbjct: 346 KVWMCGGEMFDVPCSRVGHIYRKYVPYKVPTGTSLARNLKRVAETWMDEYAEYIYQRRPE 405
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVSNDW 216
LS GD++S+KELR++L CK FKWY+ E+ N
Sbjct: 406 YRHLS----TGDISSQKELRKHLKCKDFKWYMSEVAWDVPKFYPPVEPPPASWGEIRNVA 461
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIRRDEA------CLD 264
S +CIDS T + L C K G + W + +IR E C D
Sbjct: 462 SNLCIDSKHGATGTE--LRLDTCVKDGSERTWSHEQLFTFGWREDIRPGEPLHTRKFCFD 519
Query: 265 YA--GGDVILYPCHGSKGNQYFEY 286
V LY CHG KGNQ + Y
Sbjct: 520 SISHSSPVTLYDCHGMKGNQQWSY 543
>gi|195039904|ref|XP_001990971.1| GH12336 [Drosophila grimshawi]
gi|193900729|gb|EDV99595.1| GH12336 [Drosophila grimshawi]
Length = 591
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 92/309 (29%), Positives = 140/309 (45%), Gaps = 51/309 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + R+ + + P+I I TFE R + S F G F+W +
Sbjct: 238 CEVNLNWLPPLLAPIYRDRTVMTVPIIDGIDHKTFEYR----PVYGSDNHFRGIFEWGML 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +P RE++R + +EP +PT AGGLF+I++ +F +LG YD G +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353
Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K W E R H + P G L S K + Y + W +
Sbjct: 354 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 408
Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-------------------- 210
+ L+ D GD++ + L++ L CKSF+W++
Sbjct: 409 THKEFFYTREPLARYLDMGDISEQLALKKRLNCKSFQWFMDNIAYDVVDKFPALPANLHW 468
Query: 211 -EVSNDWSGMCIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA 266
E+ + S C+DS H+P +GL CH G NQ ++ G++ E C++
Sbjct: 469 GELRSVASDGCLDSMG-----HQPPAIMGLSYCHGGGNNQLVRLNAVGQLGVGERCVEAD 523
Query: 267 GGDVILYPC 275
+ L C
Sbjct: 524 RQGIKLAVC 532
>gi|301607548|ref|XP_002933366.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like isoform 2 [Xenopus (Silurana) tropicalis]
Length = 601
Score = 120 bits (300), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 99/324 (30%), Positives = 136/324 (41%), Gaps = 55/324 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL+ +A N +V P+I I + F G + G FDW +
Sbjct: 235 CEVNVNWLPPLLNQIALNHKTIVCPMIDVIDHNHF------GYEAQAGDAMRGAFDWEMY 288
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP ++ + +EP +P MAGGLF++D+ +F +LG YD G +IWGGE ELSF
Sbjct: 289 YKRIPIPPELQR--TDPSEPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYELSF 346
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K +P P PT ++ + + Y E
Sbjct: 347 KVWMCGGEMFDVPCSRVGHIYRKYVPYKVPTGTSLARNLKRVAETWMDEYAEYIYQRRPE 406
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVSNDW 216
LS GD++S+KELR++L CK FKWY+ E+ N
Sbjct: 407 YRHLS----TGDISSQKELRKHLKCKDFKWYMSEVAWDVPKFYPPVEPPPASWGEIRNVA 462
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIRRDEA------CLD 264
S +CIDS T + L C K G + W + +IR E C D
Sbjct: 463 SNLCIDSKHGATGTE--LRLDTCVKDGSERTWSHEQLFTFGWREDIRPGEPLHTRKFCFD 520
Query: 265 YA--GGDVILYPCHGSKGNQYFEY 286
V LY CHG KGNQ + Y
Sbjct: 521 SISHSSPVTLYDCHGMKGNQQWSY 544
>gi|322787059|gb|EFZ13283.1| hypothetical protein SINV_13249 [Solenopsis invicta]
Length = 540
Score = 120 bits (300), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 106/327 (32%), Positives = 145/327 (44%), Gaps = 65/327 (19%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
E WL PLL+ +AR+ V P I I +TFE R + + G FDW L +
Sbjct: 178 EANVNWLPPLLEPIARDYKTCVCPFIDVIAYETFEYR-------AQDEGARGAFDWELYY 230
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
+ + KR AEP +P MAGGLF+I FF +LG YD G DIWGGE ELSFK
Sbjct: 231 KRLPLLPEDLKR---PAEPFKSPIMAGGLFAISTKFFWELGGYDPGLDIWGGEQYELSFK 287
Query: 125 FNWHAIPER-----ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-TYDSGFDIWGGE 178
W + R H P + P G F LG Y ++W E
Sbjct: 288 I-WQCGGQMYDAPCSRVGHIYRKFPPF-PNPGRGDF---------LGKNYKRVAEVWMDE 336
Query: 179 NLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL--------------------- 210
E +K D GD++ +K LR L CKSF W++
Sbjct: 337 YAEYIYKRRPHLRTLDPGDLSEQKALRTKLHCKSFNWFMKNIAFDLVEVYPPIEPDDFAF 396
Query: 211 -EVSN-DWSGMCIDSACKPTD--MHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA--CLD 264
E+ N + +C+DS + D + + + K G Q + ++ H +IR + CLD
Sbjct: 397 GEIRNMGATELCLDSKKRKRDEVVVMDICMKDDPKMSGEQEFRLTWHKDIRPKDRTDCLD 456
Query: 265 YAGGD----VILYPCHGSKGNQYFEYD 287
+ G+ V LYPCHG +GNQ + YD
Sbjct: 457 VSRGEEKAPVSLYPCHGKQGNQLWRYD 483
>gi|332839183|ref|XP_001147578.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
5 [Pan troglodytes]
Length = 638
Score = 120 bits (300), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 97/313 (30%), Positives = 142/313 (45%), Gaps = 60/313 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
CE WL+PLL +A + + VVSP I I +TFE P GR+ S G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
L F W +P E++R K+ P+ +PT AGGLFSI K++FE +GTYD+ +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386
Query: 122 SFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
SF+ W + E + ++ G +F T+ G + +
Sbjct: 387 SFRV-WQCGGQLE----------IIPCSVVGHVFRTKSPH-----TFPKGTSVIARNQVR 430
Query: 182 LSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYPCHK 241
L+ + D + RRNL ++ K EV + S +D + + P L+
Sbjct: 431 LA--EVWMDSYKKIFYRRNL--QAAKMAQEVRGNGSRRGLDE-----EKYGPQTLFMGLM 481
Query: 242 QGG---NQFWMMSK------------------HGEIRR--DEACLDY-----AGGDVILY 273
G + W + +G I+ CLD G +I+Y
Sbjct: 482 AGTHLISTIWRLPSPSGTFYPERFVPDLTPTFYGAIKNLGTNQCLDVGENNRGGKPLIMY 541
Query: 274 PCHGSKGNQYFEY 286
CHG GNQYFEY
Sbjct: 542 SCHGLGGNQYFEY 554
>gi|195397828|ref|XP_002057530.1| GJ18184 [Drosophila virilis]
gi|194141184|gb|EDW57603.1| GJ18184 [Drosophila virilis]
Length = 625
Score = 120 bits (300), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 96/336 (28%), Positives = 145/336 (43%), Gaps = 71/336 (21%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
EV ++WL+PLL ++ ++ + P+I I DTFE + P L GGF+W L F
Sbjct: 243 EVNRQWLEPLLRLVHAENATLAVPVIDLINADTFE--YTPSPLVR------GGFNWGLHF 294
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
W +PE K ++ P +PTMAGGLF++ + +F+ +G YD DIWGGEN+E+SF+
Sbjct: 295 RWENLPEGTLKVPEDFKGPFRSPTMAGGLFAVSRLYFQHIGEYDMAMDIWGGENIEISFR 354
Query: 125 FNWHA------IPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
W +P RKR A P TM + + +K Y +
Sbjct: 355 V-WQCGGAIKIVPCSRVGHIFRKRRPYTA-PDGANTMLKNSLRLAHVWMDKYKDYYLKHE 412
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE--------------------VS 213
++ DFGD+++R +LR L CK F WYL+ V
Sbjct: 413 -------KVPKDYDFGDISARLQLRERLHCKDFDWYLKHVYPELRVPGDESKKPAVAPVF 465
Query: 214 NDW---------------SGMCIDSACKPTDMH------KPVGLYPCHKQGGNQFWMMSK 252
W SG + +A + + L C + NQ W ++
Sbjct: 466 QPWHSRKRNYLDSFQLRLSGTQLCAAVVAPKVKGFWKKGSSLTLQNCRTRAANQMWYETE 525
Query: 253 HGEIRRDE-ACLDYAGGD-VILYPCHGSKGNQYFEY 286
EI D+ CL+ A VI+ CH G+Q + +
Sbjct: 526 KSEIILDKLLCLEAAADTLVIINKCHEMLGDQQWRH 561
>gi|432901498|ref|XP_004076865.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Oryzias latipes]
Length = 607
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 99/335 (29%), Positives = 143/335 (42%), Gaps = 77/335 (22%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD + +N +V P+I I D F G T + G FDW +
Sbjct: 242 CEANVNWLPPLLDRIVQNRKTIVCPMIDVIDHDNF------GYDTQAGDAMRGAFDWEMY 295
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP R + EP +P MAGGLF++D+ +F +LG YD+G +IWGGE E+SF
Sbjct: 296 YKRIPIPAEMRT--DDPTEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISF 353
Query: 124 KF-----NWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
K IP R ++ + P G S+ K ++W
Sbjct: 354 KVWMCGGRMEDIPCSRVGHIYRK-----YVPYKVPGGISLAKNL-------KRVAEVWMD 401
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL-------------------- 210
E E ++ GD++++KELR +L CK+F+W++
Sbjct: 402 EYAEYVYQRRPEYRHLSAGDMSAQKELRSHLNCKNFRWFMEEVAWDLPKHYPPVEPPAAA 461
Query: 211 --EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI-----RRD---- 259
E+ + SGMC++ K P+ L C K G+ W HG++ R D
Sbjct: 462 WGEIRSVGSGMCME--IKHFVSGSPIRLESCVKGRGDVSW---SHGQVLTFGWREDIRVG 516
Query: 260 ------EACLDYAG--GDVILYPCHGSKGNQYFEY 286
+ C D V LY CHG KGNQ + Y
Sbjct: 517 DPMHTRKVCFDAVSHHSPVTLYDCHGMKGNQLWRY 551
>gi|326437922|gb|EGD83492.1| hypothetical protein PTSG_04099 [Salpingoeca sp. ATCC 50818]
Length = 699
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 99/317 (31%), Positives = 146/317 (46%), Gaps = 56/317 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL +A++ + VV P I I T + + G +S G F W L
Sbjct: 367 CEANLGWLEPLLAWMAKDKTRVVCPTIDRISAQTMD--YVGGGASSR-----GTFHWTLD 419
Query: 64 FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W +A+ R+ + A+P+ +PTMAGGLF I++ +F +LGTYD G D WGGENLE+S
Sbjct: 420 FTWEYAV----RQHGETPADPIKSPTMAGGLFGINRDYFYELGTYDMGMDGWGGENLEMS 475
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + H IP P P ++++ F + ++W
Sbjct: 476 FRIWQCGGSLHIIPCSRVGHIFRDWHPYAIPNS-----TVNETFLKNSIRL---AEVWMD 527
Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCI---DSACKP 227
E ++ + DFGDV+ RK LR LGCKSFKWYL+ N G I D
Sbjct: 528 EYKDIFYDIKPSARSVDFGDVSERKALREKLGCKSFKWYLD--NVVPGKLIPNSDVVLHK 585
Query: 228 TDMHKPVGL----------YPCHKQG----GNQFWMMSKHGEIRR--DEACLDYAGGDVI 271
+ + + YPCH G FW ++ + E+R D + V+
Sbjct: 586 GQVRNSLNICMDKGAGSLAYPCHTPGVHSTSQAFW-LTVYKEVRHVWDLCLTSHDNKRVM 644
Query: 272 LYPCHGSKGNQYFEYDY 288
L C ++ +EYD+
Sbjct: 645 LSTC--GPNSRKWEYDH 659
>gi|55742075|ref|NP_001006904.1| polypeptide N-acetylgalactosaminyltransferase 11 [Xenopus
(Silurana) tropicalis]
gi|49522064|gb|AAH75106.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[Xenopus (Silurana) tropicalis]
Length = 563
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 83/217 (38%), Positives = 107/217 (49%), Gaps = 24/217 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WLQPLL + N VV P+I I DT + SS GGF+W L
Sbjct: 203 CEVNEMWLQPLLAPIKENPRTVVCPVIDIISADTL--------IYSSSPVVRGGFNWGLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P E + + P +PTMAGGLF++D+ +F LG YDSG DIWGGENLE+SF
Sbjct: 255 FKWDPVPLAELGGPEGFSAPFRSPTMAGGLFAMDREYFNMLGQYDSGMDIWGGENLEISF 314
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP----TMAGGLFSIDKAFFEKLGTYDSGFDI 174
+ + +P P +P TMA + + D D
Sbjct: 315 RIWMCGGSLLIVPCSRVGHIFRKRRPYGSPGGHDTMAHNSLRLAHVWM------DEYKDQ 368
Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ EL + DFGD+ R LRR L CKSFKWYL+
Sbjct: 369 YFALRPELRNR-DFGDIRERLALRRRLNCKSFKWYLD 404
>gi|195400935|ref|XP_002059071.1| GJ15190 [Drosophila virilis]
gi|194141723|gb|EDW58140.1| GJ15190 [Drosophila virilis]
Length = 591
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 92/309 (29%), Positives = 140/309 (45%), Gaps = 51/309 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + R+ + + P+I I +FE R + S F G F+W +
Sbjct: 238 CEVNLNWLPPLLAPIYRDRTVMTVPIIDGIDHKSFEYR----PVYGSDTHFRGIFEWGML 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +P RE++R + +EP +PT AGGLF+I++ +F +LG YD G +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353
Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K W E R H + P G L S K + Y + W +
Sbjct: 354 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 408
Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-------------------- 210
+ L+ D GD+T + L++ L CKSF+W++
Sbjct: 409 THKEFFYTREPLARYLDMGDITEQLALKKRLNCKSFQWFMDNIAYDVVDKFPALPANLHW 468
Query: 211 -EVSNDWSGMCIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA 266
E+ + S C+DS H+P +GL CH G NQ ++ G++ E C++
Sbjct: 469 GELRSVASDGCLDSMG-----HQPPAIMGLSYCHGGGNNQLVRLNAVGQLGVGERCVEAD 523
Query: 267 GGDVILYPC 275
+ L C
Sbjct: 524 RQGIKLAIC 532
>gi|126303658|ref|XP_001380711.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Monodelphis domestica]
Length = 552
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 74/213 (34%), Positives = 103/213 (48%), Gaps = 14/213 (6%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WL PLL + + + VV P+I I DTF SS GGFDW L
Sbjct: 202 CEVNKDWLLPLLHRIKEDPTRVVCPVIDIINRDTFAY-------VSSSPDMRGGFDWTLH 254
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + RE+ + +P+ TP ++GGLF ++K++F LG YD+ DIWGGEN E+SF
Sbjct: 255 FKWEELTLREKALRVDPIQPIETPIISGGLFVMNKSWFNHLGKYDAAMDIWGGENFEISF 314
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ + +P P P G L + K + F +
Sbjct: 315 RVWMCGGSLEILPCSRVGHVFRKKHPYTFP--EGNLNTYIKNTKRTAEVWMDEFKHYFYA 372
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
++ FG++ SR ELR+ L C +FKWYLE
Sbjct: 373 ARPVAQGRPFGNIQSRVELRKRLKCHTFKWYLE 405
>gi|16198165|gb|AAL13889.1| LD36616p [Drosophila melanogaster]
Length = 486
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/304 (29%), Positives = 141/304 (46%), Gaps = 41/304 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + R+ + + P+I I FE R P T ++ F G F+W +
Sbjct: 133 CEVNTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYR--PVYGTDNH--FRGIFEWGML 188
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +P RE++R + +EP +PT AGGLF+I++ +F +LG YD G +WGGEN ELSF
Sbjct: 189 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 248
Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K W E R H + P G L S K + Y + W +
Sbjct: 249 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 303
Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGM------ 219
+ L+ D GD++ + L++ L CKSF+W++ +V + + G+
Sbjct: 304 THKEYFYTREPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHW 363
Query: 220 -----CIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVI 271
C + H+P +GL CH G NQ ++ G++ E C++ +
Sbjct: 364 GELRSVASDGCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGIK 423
Query: 272 LYPC 275
L C
Sbjct: 424 LAVC 427
>gi|21552985|gb|AAM62412.1|AF493067_1 UDP-N-acetylgalactosamine: polypeptide
N-acetylgalactosaminyltransferase 2 [Drosophila
melanogaster]
Length = 591
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/304 (29%), Positives = 141/304 (46%), Gaps = 41/304 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + R+ + + P+I I FE R P T ++ F G F+W +
Sbjct: 238 CEVNTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYR--PVYGTDNH--FRGIFEWGML 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +P RE++R + +EP +PT AGGLF+I++ +F +LG YD G +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353
Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K W E R H + P G L S K + Y + W +
Sbjct: 354 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 408
Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGM------ 219
+ L+ D GD++ + L++ L CKSF+W++ +V + + G+
Sbjct: 409 THKEYFYTREPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHW 468
Query: 220 -----CIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVI 271
C + H+P +GL CH G NQ ++ G++ E C++ +
Sbjct: 469 GELRSVASDGCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGIK 528
Query: 272 LYPC 275
L C
Sbjct: 529 LAVC 532
>gi|195481361|ref|XP_002101619.1| GE15519 [Drosophila yakuba]
gi|194189143|gb|EDX02727.1| GE15519 [Drosophila yakuba]
Length = 591
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/304 (29%), Positives = 141/304 (46%), Gaps = 41/304 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + R+ + + P+I I FE R P T ++ F G F+W +
Sbjct: 238 CEVNTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYR--PVYGTDNH--FRGIFEWGML 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +P RE++R + +EP +PT AGGLF+I++ +F +LG YD G +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353
Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K W E R H + P G L S K + Y + W +
Sbjct: 354 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 408
Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGM------ 219
+ L+ D GD++ + L++ L CKSF+W++ +V + + G+
Sbjct: 409 THKEYFYTREPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHW 468
Query: 220 -----CIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVI 271
C + H+P +GL CH G NQ ++ G++ E C++ +
Sbjct: 469 GELRSVASDGCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGIK 528
Query: 272 LYPC 275
L C
Sbjct: 529 LAVC 532
>gi|195345467|ref|XP_002039290.1| GM22807 [Drosophila sechellia]
gi|194134516|gb|EDW56032.1| GM22807 [Drosophila sechellia]
Length = 591
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/304 (29%), Positives = 141/304 (46%), Gaps = 41/304 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + R+ + + P+I I FE R P T ++ F G F+W +
Sbjct: 238 CEVNTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYR--PVYGTDNH--FRGIFEWGML 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +P RE++R + +EP +PT AGGLF+I++ +F +LG YD G +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353
Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K W E R H + P G L S K + Y + W +
Sbjct: 354 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 408
Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGM------ 219
+ L+ D GD++ + L++ L CKSF+W++ +V + + G+
Sbjct: 409 THKEYFYTREPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHW 468
Query: 220 -----CIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVI 271
C + H+P +GL CH G NQ ++ G++ E C++ +
Sbjct: 469 GELRSVASDGCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGIK 528
Query: 272 LYPC 275
L C
Sbjct: 529 LAVC 532
>gi|194892500|ref|XP_001977673.1| GG18114 [Drosophila erecta]
gi|190649322|gb|EDV46600.1| GG18114 [Drosophila erecta]
Length = 591
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/304 (29%), Positives = 141/304 (46%), Gaps = 41/304 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + R+ + + P+I I FE R P T ++ F G F+W +
Sbjct: 238 CEVNTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYR--PVYGTDNH--FRGIFEWGML 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +P RE++R + +EP +PT AGGLF+I++ +F +LG YD G +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353
Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K W E R H + P G L S K + Y + W +
Sbjct: 354 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 408
Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGM------ 219
+ L+ D GD++ + L++ L CKSF+W++ +V + + G+
Sbjct: 409 THKEYFYTREPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHW 468
Query: 220 -----CIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVI 271
C + H+P +GL CH G NQ ++ G++ E C++ +
Sbjct: 469 GELRSVASDGCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGIK 528
Query: 272 LYPC 275
L C
Sbjct: 529 LAVC 532
>gi|119508144|gb|ABL75647.1| IP16941p [Drosophila melanogaster]
Length = 245
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 85/240 (35%), Positives = 115/240 (47%), Gaps = 67/240 (27%)
Query: 78 KNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKR 137
++ AEPV++PTMAGGLFSID+ FF++LGTYDSGFDIWGGENLELSFK W
Sbjct: 22 ESTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFK-TW---------- 70
Query: 138 HKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWGGENLELSF------ 184
M GG I F K Y SG ++ ++ L+
Sbjct: 71 ------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLKKNSVRLAEVWMDEY 118
Query: 185 -----------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDW 216
KGD+GDV+ R++LR +L CKSFKWYL E++N
Sbjct: 119 SQYYYHRIGNDKGDWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFIPGDSVAHGEIANVP 178
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQ--GGNQFWMMSKHGEIRRDEACLDYAGGDVILYP 274
+GMC+D+ K ++ PV +Y C ++ G++F MS + C V P
Sbjct: 179 NGMCLDAKEK-SEEETPVSIYECKRKIRRGHRFPFMSAMAKEEISTGCSARRAKSVATTP 237
>gi|24643052|ref|NP_573301.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2, isoform A
[Drosophila melanogaster]
gi|24643054|ref|NP_728178.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2, isoform B
[Drosophila melanogaster]
gi|51316019|sp|Q8MV48.2|GALT7_DROME RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 7;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7; Short=pp-GaNTase 7;
AltName: Full=dGalNAc-T2
gi|7293476|gb|AAF48851.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2, isoform A
[Drosophila melanogaster]
gi|22832507|gb|AAN09470.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2, isoform B
[Drosophila melanogaster]
gi|34043004|gb|AAQ56704.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
gi|54650858|gb|AAV37008.1| LD01328p [Drosophila melanogaster]
gi|220950352|gb|ACL87719.1| GalNAc-T2-PA [synthetic construct]
Length = 591
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/304 (29%), Positives = 141/304 (46%), Gaps = 41/304 (13%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + R+ + + P+I I FE R P T ++ F G F+W +
Sbjct: 238 CEVNTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYR--PVYGTDNH--FRGIFEWGML 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +P RE++R + +EP +PT AGGLF+I++ +F +LG YD G +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353
Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K W E R H + P G L S K + Y + W +
Sbjct: 354 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 408
Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGM------ 219
+ L+ D GD++ + L++ L CKSF+W++ +V + + G+
Sbjct: 409 THKEYFYTREPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHW 468
Query: 220 -----CIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVI 271
C + H+P +GL CH G NQ ++ G++ E C++ +
Sbjct: 469 GELRSVASDGCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGIK 528
Query: 272 LYPC 275
L C
Sbjct: 529 LAVC 532
>gi|291397402|ref|XP_002715124.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Oryctolagus cuniculus]
Length = 439
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 73/218 (33%), Positives = 114/218 (52%), Gaps = 27/218 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WL+PLL V+A++ VV P+I I + T E + P G F+W LQ
Sbjct: 226 CEVNKVWLEPLLSVIAKDPHTVVCPIIDVIDEMTLEYKPSP--------IVRGTFNWMLQ 277
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + E + + A+P+ +P+MAGG+F+I + +F+++G YD D+WGGEN+E+S
Sbjct: 278 FKWDNVFSYEMEGPEGPAKPIRSPSMAGGIFAIHRHYFKEIGQYDKDMDLWGGENVEISL 337
Query: 124 KF-----NWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
+ IP R + + EP T A + + + + T+ +
Sbjct: 338 RIWMCGGQLFIIPCSRVGHITRKSPEPNLAVTKA-----VTRNYLRLVHTWLDEYK---- 388
Query: 178 ENLELSFKG----DFGDVTSRKELRRNLGCKSFKWYLE 211
E L G +G+++ R ELR+ LGCKSF+WYL+
Sbjct: 389 EQFFLHRPGLRSIPYGNISERVELRKRLGCKSFQWYLD 426
>gi|196001845|ref|XP_002110790.1| hypothetical protein TRIADDRAFT_23005 [Trichoplax adhaerens]
gi|190586741|gb|EDV26794.1| hypothetical protein TRIADDRAFT_23005 [Trichoplax adhaerens]
Length = 519
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/314 (28%), Positives = 134/314 (42%), Gaps = 54/314 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL+PLL + + + VV+P I I +F + P L G FDWNL+
Sbjct: 170 CEVNVGWLEPLLRRVNEDPTVVVTPEIDLIDASSFRYLYGPSGLIR------GVFDWNLK 223
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W IP ER K+ E V +PTM G +F+ID+ FF+ +G YDS + W E+LE+SF
Sbjct: 224 FKWKVIPREERLARKSPIESVRSPTMGGDIFAIDRKFFQSIGKYDSQVETWEVEHLEISF 283
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF-DIWGG 177
+ IP + + +P P ++F + LG +IW
Sbjct: 284 RIWLCGGKIEIIPCSHVGQVLRSFQPYQPP----------QSFDDYLGKNSQRIAEIWLD 333
Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL------------------EV 212
+ E + + GD+T+ R+ LGCK+F+WYL ++
Sbjct: 334 DYKEFYYQRYPHLRQNFLGDITAELRQRQKLGCKNFRWYLNNVFTDAVFPNESVMAEGKI 393
Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY----AGG 268
N S C+ A K V L C + + + EI + CLD G
Sbjct: 394 RNPASANCLMVAGKTNSY---VRLITCVHDTSSMIFRFTIRREIEINGKCLDANRSKRGS 450
Query: 269 DVILYPCHGSKGNQ 282
+ L CH + +Q
Sbjct: 451 KIQLVDCHRMRDSQ 464
>gi|363730187|ref|XP_418741.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 2 [Gallus gallus]
Length = 638
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 93/331 (28%), Positives = 132/331 (39%), Gaps = 73/331 (22%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE QK WL+PLL L+ N + VVSP+I I TF+ S G FDW L
Sbjct: 287 CECQKGWLEPLLARLSSNRNSVVSPIIDVIDWKTFQYYH-------SVSLHRGVFDWKLD 339
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+W +PE E K ++ P+ +P +AG + ++D+ +F+ +G YDS +WG ENLELS
Sbjct: 340 FHWEPVPEHEEKVRQSPTSPIRSPAVAGAVVAMDRHYFQNIGAYDSDMTMWGAENLELSI 399
Query: 124 KFNW-------------------HAIP-----ERERKRHKNAAEPVWTPTMAGGLFSIDK 159
+ W H IP E R+K W + + D
Sbjct: 400 R-TWLCGGSVEIIPCSRVGHVYRHHIPHAFSYEEAIVRNKIRIAETWLDSFKENFYKNDT 458
Query: 160 AFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
F L K + D + R +L++ LGC+SF+W++
Sbjct: 459 VAF-------------------LISKAEKPDCSERLQLQKRLGCRSFQWFITNVYPELSR 499
Query: 211 ---------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA 261
++ N +G C D + L PC F SK EIR A
Sbjct: 500 PEDAPRLSGKLYNTGAGFCADYRPGMALADGSIKLSPCTNSLTQHFEYNSKK-EIRVGSA 558
Query: 262 ---CLDYAGGDVILYPCHGSKGNQYFEYDYK 289
CLD G VI C N +D +
Sbjct: 559 LLFCLDVRHGKVIPQNCTKETDNSEQHWDVQ 589
>gi|198426119|ref|XP_002128247.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
6 [Ciona intestinalis]
Length = 627
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 92/321 (28%), Positives = 142/321 (44%), Gaps = 78/321 (24%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+P+L+ +A +++ VV P+I I DTF + LT++ G W+L
Sbjct: 278 CECAPHWLEPMLERIAEDNTRVVCPVIEVIDADTFAMS-----LTTARSVQTGILSWSLG 332
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
FNW + KN E + + TMAGGLF++ + +F LG+YD+ +WGGEN+E+S
Sbjct: 333 FNWAPRKINPGQPIKND-EALTSATMAGGLFAMSRKYFYHLGSYDNDMLVWGGENIEMSL 391
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFD--- 173
+ +W M GG I F K Y G D
Sbjct: 392 R--------------------IW---MCGGSLEIHPCSHVGHVFRKRAPYSHPGGSDVIT 428
Query: 174 --------IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE------- 211
+W E E +K + GD+T+R +LR +L C++F+WY+
Sbjct: 429 HNNKRVAEVWLDEYKEQYYKRVPRARAVEAGDLTARIKLRHDLKCRNFQWYITNIYPALY 488
Query: 212 --------VSNDW------SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR 257
+W S C+DSA + + +Y CH G NQ + ++++GEIR
Sbjct: 489 ATPKEDILKGGEWHNKDRDSKYCLDSANPDGKVGVKMTMYVCHGMGVNQDFDLTRNGEIR 548
Query: 258 RD---EACLDYAGGDVILYPC 275
E CL +G ++ Y C
Sbjct: 549 HSYSKELCLQPSGNSIVTYDC 569
>gi|308506779|ref|XP_003115572.1| CRE-GLY-7 protein [Caenorhabditis remanei]
gi|308256107|gb|EFP00060.1| CRE-GLY-7 protein [Caenorhabditis remanei]
Length = 601
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 90/312 (28%), Positives = 139/312 (44%), Gaps = 37/312 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + +N + P+I I +++E R G + + G F+W L
Sbjct: 252 CEVNTNWLPPLLAPIKQNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHS---GIFEWGLL 308
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ I ERE K+ ++P +PT AGGLF+I++ +F++LG YD G IWGGE ELSF
Sbjct: 309 YKETQITERESAHRKHNSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSF 368
Query: 124 KFNWHA------IPERERKRHKNAAEPVWTPTMAGG-LFSIDKAFFEKLGTYDSGFDIWG 176
K W +P + P +G + SI+ + T+ + +
Sbjct: 369 KI-WQCGGGIVFVPCSHVGHVYRSHMPYGFGKFSGKPVISIN--MMRVVKTWMDDYSKYY 425
Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL---------------------EVSND 215
+ + GD++++ LR L CKSFKWY+ E N
Sbjct: 426 LTREPQAAHVNPGDISAQLALRDKLQCKSFKWYMENVAYDVLKSYPLLPPNDVWGEARNP 485
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPC 275
+G C+D + + P+G CH GGNQ ++ G++ + E CL G + C
Sbjct: 486 ATGKCLD---RMGGIPGPLGASGCHGYGGNQLIRLNVQGQMAQGEWCLTANGIRIQANHC 542
Query: 276 HGSKGNQYFEYD 287
+ F YD
Sbjct: 543 VKGSVSGNFVYD 554
>gi|328794283|ref|XP_001122865.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like,
partial [Apis mellifera]
Length = 372
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 61/122 (50%), Positives = 79/122 (64%), Gaps = 8/122 (6%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A + + VV P+I I DDTFE P +T GGF+W L
Sbjct: 258 CECTEGWLEPLLSRIAEDRTTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 310
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W+ + +RE +R + P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 311 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 370
Query: 123 FK 124
F+
Sbjct: 371 FR 372
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 41/73 (56%), Positives = 54/73 (73%), Gaps = 3/73 (4%)
Query: 114 WGGENLELSFKFNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
WGG N +L+F+ W+ + +RE +R + P+ TPTMAGGLFSIDK +F +LG YD G
Sbjct: 302 WGGFNWKLNFR--WYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGM 359
Query: 173 DIWGGENLELSFK 185
DIWGGENLE+SF+
Sbjct: 360 DIWGGENLEMSFR 372
>gi|71996085|ref|NP_001022948.1| Protein GLY-11, isoform a [Caenorhabditis elegans]
gi|51315905|sp|Q7K755.2|GLT11_CAEEL RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase 11; Short=pp-GaNTase
11; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 11; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 11
gi|3980030|emb|CAA22098.1| Protein GLY-11, isoform a [Caenorhabditis elegans]
Length = 605
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 109/221 (49%), Gaps = 33/221 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL PLLD + +N VV P+I I D +++ + + GG +W +
Sbjct: 258 CEVNEEWLPPLLDQIKQNRRRVVCPIIDII--DAITMKYVESPVCT------GGVNWAMT 309
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + N P+ +PTMAGGLF+IDK +F ++G+YD G D+WG EN+E+S
Sbjct: 310 FKWDYPHRSYFEDPMNYVNPLKSPTMAGGLFAIDKEYFFEIGSYDEGMDVWGAENVEISV 369
Query: 124 KFNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSG----FDIWGGE 178
+ W E + P + G +F + + K + +W E
Sbjct: 370 RI-WTC-----------GGELLIMPCSRVGHIFRRQRPYGIKTDSMGKNSVRLARVWLDE 417
Query: 179 NLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYLE 211
LE F+ D+GD+TSR LRRNL CK FKWYLE
Sbjct: 418 YLENFFEARPNYRTFTDYGDLTSRISLRRNLQCKPFKWYLE 458
>gi|390348396|ref|XP_787966.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like
[Strongylocentrotus purpuratus]
Length = 403
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 90/320 (28%), Positives = 138/320 (43%), Gaps = 50/320 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLL +A N + VV P + +I D FE R + G DW+
Sbjct: 44 CECSPNWLVPLLTEIALNRTTVVCPTVDSISADNFEYR------SQGDGLCRGAMDWD-- 95
Query: 64 FNWHAIP---ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 120
F + IP R+R K +EP +P MAGGLF++D+ FF +LG YD G IWGGEN E
Sbjct: 96 FWYKRIPVDLSRQRLGLKYQSEPYDSPMMAGGLFALDREFFFELGGYDPGLQIWGGENFE 155
Query: 121 LSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIW 175
+SFK + +P P P S+ + ++ ++W
Sbjct: 156 ISFKAWMCGGSLKFVPCSRVGHVYRKGVPYTYPDSGVPGVSVIHMNYMRVA------EVW 209
Query: 176 GGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL------------------ 210
E E + +GD+ + R++ KSFKW++
Sbjct: 210 LDEFKEFFYTSRPDLRGKPYGDIGEQIRFRKHHCPKSFKWFMEEVAFDSLEKFPPPQPNQ 269
Query: 211 ---EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG 267
E+ +D +GMC+DS G+Y CH GGNQ + ++ G+I ++ C G
Sbjct: 270 AWGEIKSDHTGMCVDSMGHQATAGGEAGVYYCHGMGGNQRFRLTGPGQIMFNDYCFYVDG 329
Query: 268 GDVILYPCHGSKGNQYFEYD 287
V + C+ + ++ +D
Sbjct: 330 SRVRIDKCNKVQWPSFWVHD 349
>gi|345781283|ref|XP_853759.2| PREDICTED: LOW QUALITY PROTEIN:
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [Canis lupus
familiaris]
Length = 559
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 76/227 (33%), Positives = 108/227 (47%), Gaps = 45/227 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQPLL +A++S VV PLI I T E + P G F+W+L
Sbjct: 244 CEVNTAWLQPLLHAIAKDSKMVVCPLIDVIDSMTLEYQSSP--------VVRGAFNWHLD 295
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++ E + P+ +P MAGG+F+I++ +F ++G YD G D+WG ENLELS
Sbjct: 296 FKWDSVYSYEMDGPEGPTRPIRSPAMAGGIFAINRHYFNEIGQYDKGMDLWGAENLELSL 355
Query: 124 KF-----NWHAIP-----ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS--G 171
+ IP ++R N E V K TY++
Sbjct: 356 RIWMCGGQLFIIPCSRVGHISKQRFSNQPELV------------------KAMTYNNLRL 397
Query: 172 FDIWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE 211
+W E E F +G+++ R ELR+ LGCKSF+WYL+
Sbjct: 398 VHVWLDEYKEQFFLQQPGLKSVAYGNISERVELRKRLGCKSFQWYLD 444
>gi|393912281|gb|EFO21646.2| glycosyl transferase [Loa loa]
Length = 470
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 80/247 (32%), Positives = 114/247 (46%), Gaps = 29/247 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + N VV+P+I I DTF+ L GGF+WNL
Sbjct: 236 CECNVNWLEPLLARVKENHRTVVAPVIDVIDRDTFKYVAASADLR-------GGFEWNLV 288
Query: 64 FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W + + R +RH P+ TP +AGGLF I K +FEKLGTYD DIWGGENLELS
Sbjct: 289 FKWEYLTGKLRDERHARPTAPIRTPVIAGGLFMIQKDWFEKLGTYDEEMDIWGGENLELS 348
Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
F+ + IP P P +G +F + ++W G
Sbjct: 349 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGNVFQKNTR---------RAAEVWLG 399
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDM 230
+ L + +FGD+T+R +L++ + F+ E + ++ A K +D+
Sbjct: 400 DYKHLYLRKVPSARYVNFGDITARLDLKKKFALQGFRLVFERNLSGVDDSLERARKISDI 459
Query: 231 HKPVGLY 237
LY
Sbjct: 460 QTGKSLY 466
>gi|338724473|ref|XP_001495495.2| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5-like
[Equus caballus]
Length = 448
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 77/223 (34%), Positives = 111/223 (49%), Gaps = 37/223 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WL+PLL +A++ VV PLI I D L++ P + G F+W+LQ
Sbjct: 235 CEVNKVWLEPLLLAIAKDPKMVVCPLIDVI--DYMTLKYKPSPVVR------GAFNWHLQ 286
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + E + P+ +P MAGG+F+ID+ +F ++G YD ++WGGENLELS
Sbjct: 287 FKWDNVFSYEMDGPEGPIAPIRSPAMAGGIFAIDRQYFNEIGRYDKDMNLWGGENLELSL 346
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFE------KLGTYDS--GFDIW 175
+ W + P G IDK E K TY++ +W
Sbjct: 347 RI-WMC-----------GGQLFVLPCSRVG--HIDKQRIENKREYLKAMTYNNLRMVHVW 392
Query: 176 GGENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE 211
E+ E F +G+++ R ELR+ LGCKSF+WYL+
Sbjct: 393 LDEHKEQVFLRRPGLKSVAYGNISERVELRKRLGCKSFQWYLD 435
>gi|426228255|ref|XP_004008229.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5 [Ovis
aries]
Length = 448
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 75/223 (33%), Positives = 111/223 (49%), Gaps = 35/223 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WL+PLL+ +A++ VV PLI I D L + P + G F+W+L+
Sbjct: 235 CEVNKVWLEPLLNAIAKDPKMVVCPLIDVI--DYMTLEYQPSPIVR------GAFNWHLE 286
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + E + + P+ +P MAGG+F+I + +F ++G YD G ++WGGENLELS
Sbjct: 287 FKWDHVLSYEIEGPEGPTTPIRSPAMAGGIFAISRNYFNEIGQYDKGMNLWGGENLELSL 346
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS--GFDIWG 176
+ + IP R H N + + K+ Y+S IW
Sbjct: 347 RIWMCGGQLYVIP-CSRVGHINRQH------------MTNDSEIMKVVEYNSLRLAHIWL 393
Query: 177 GENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLEV 212
E E F +G+++ R ELR+ LGCKSF+WYL+
Sbjct: 394 DEYKEEFFLRRPALKSAAYGNISERVELRKRLGCKSFQWYLDT 436
>gi|194766810|ref|XP_001965517.1| GF22410 [Drosophila ananassae]
gi|190619508|gb|EDV35032.1| GF22410 [Drosophila ananassae]
Length = 591
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 91/307 (29%), Positives = 139/307 (45%), Gaps = 47/307 (15%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + R+ + + P+I I FE R G T F G F+W +
Sbjct: 238 CEVNLNWLAPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTETH----FRGIFEWGML 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +P RE++R + +EP +PT AGGLF+I++ +F +LG YD G +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRSHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353
Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K W E R H + P G L S K + Y + W +
Sbjct: 354 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 408
Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-------------------- 210
+ L+ D GD++ + L++ L CKSF+W++
Sbjct: 409 THKEYFYTREPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHW 468
Query: 211 -EVSNDWSGMCIDS-ACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGG 268
E+ + S C+DS +P + +GL CH G NQ ++ G++ E C++
Sbjct: 469 GELRSVASDGCLDSMGLQPPAI---MGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ 525
Query: 269 DVILYPC 275
+ L C
Sbjct: 526 GIKLAVC 532
>gi|393910679|gb|EFO20658.2| glycosyl transferase [Loa loa]
Length = 601
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 78/231 (33%), Positives = 103/231 (44%), Gaps = 54/231 (23%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV +RWL+PLLD + + VV P+I I +T + P GG W+L
Sbjct: 247 CEVNERWLEPLLDRIVTDRHTVVCPIIDIIDANTLKYIESP--------ICKGGMSWSLA 298
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P K PV +PTMAGGLF+IDK +F+KLG YD G +IWG EN+E+S
Sbjct: 299 FKWDYLPSSYFDEPKQYVRPVKSPTMAGGLFAIDKKYFDKLGQYDRGMEIWGAENVEISL 358
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYDSGFD----- 173
+ +W M GG I F + Y G D
Sbjct: 359 R--------------------IW---MCGGRLEIIPCSRIGHIFRQRRPYGFGIDSMGHN 395
Query: 174 ------IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
IW E ++ + D GD+ K LR+ L CKSF WYL+
Sbjct: 396 AARTANIWLDEYIDQFYAARPNLRGIDIGDIKEMKALRKKLHCKSFFWYLQ 446
>gi|268555252|ref|XP_002635614.1| C. briggsae CBR-GLY-7 protein [Caenorhabditis briggsae]
Length = 601
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 90/312 (28%), Positives = 139/312 (44%), Gaps = 37/312 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL + +N + P+I I +++E R G + + G F+W L
Sbjct: 252 CEVNTNWLPPLLAPIKQNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHS---GIFEWGLL 308
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ I ERE K+ ++P +PT AGGLF+I++ +F++LG YD G IWGGE ELSF
Sbjct: 309 YKETQITERESGHRKHTSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSF 368
Query: 124 KFNWHA------IPERERKRHKNAAEPVWTPTMAGG-LFSIDKAFFEKLGTYDSGFDIWG 176
K W +P + P +G + SI+ + T+ + +
Sbjct: 369 KI-WQCGGGIVFVPCSHVGHVYRSHMPYGFGKFSGKPVISIN--MMRVVKTWMDDYSKYY 425
Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL---------------------EVSND 215
+ + GD++++ LR L CKSFKWY+ E N
Sbjct: 426 LTREPQAAHVNPGDISAQLALRDKLQCKSFKWYMENVAYDVLQSYPLLPPNDVWGEARNP 485
Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPC 275
+G C+D + + P+G CH GGNQ ++ G++ + E CL G + C
Sbjct: 486 ATGKCLD---RMGGIPGPLGASGCHGYGGNQLIRLNVQGQMAQGEWCLTANGIRIQANHC 542
Query: 276 HGSKGNQYFEYD 287
N + YD
Sbjct: 543 VKGTVNGNWIYD 554
>gi|312082359|ref|XP_003143412.1| glycosyl transferase [Loa loa]
Length = 599
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 78/231 (33%), Positives = 103/231 (44%), Gaps = 54/231 (23%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV +RWL+PLLD + + VV P+I I +T + P GG W+L
Sbjct: 245 CEVNERWLEPLLDRIVTDRHTVVCPIIDIIDANTLKYIESP--------ICKGGMSWSLA 296
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W +P K PV +PTMAGGLF+IDK +F+KLG YD G +IWG EN+E+S
Sbjct: 297 FKWDYLPSSYFDEPKQYVRPVKSPTMAGGLFAIDKKYFDKLGQYDRGMEIWGAENVEISL 356
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYDSGFD----- 173
+ +W M GG I F + Y G D
Sbjct: 357 R--------------------IW---MCGGRLEIIPCSRIGHIFRQRRPYGFGIDSMGHN 393
Query: 174 ------IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
IW E ++ + D GD+ K LR+ L CKSF WYL+
Sbjct: 394 AARTANIWLDEYIDQFYAARPNLRGIDIGDIKEMKALRKKLHCKSFFWYLQ 444
>gi|395507115|ref|XP_003757873.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Sarcophilus harrisii]
Length = 633
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 76/218 (34%), Positives = 107/218 (49%), Gaps = 24/218 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WL PLL + + + VV P+I I DTF SS GGFDW L
Sbjct: 311 CEVNKDWLLPLLHRIKEDPTRVVCPVIDIINRDTFAY-------VSSSPDMRGGFDWTLH 363
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + RE+ + +P+ TP ++GGLF ++K++F LG YD+ DIWGGEN E+SF
Sbjct: 364 FKWEELSLREKALRVDPIQPIKTPIISGGLFVMNKSWFNHLGKYDAAMDIWGGENFEISF 423
Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
+ + +P RK+H P P G L + K + F
Sbjct: 424 RVWMCGGSLEILPCSRVGHVFRKKH-----PYTFPE--GNLNTYIKNTKRTAEVWMDEFK 476
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ ++ FG++ +R ELR+ L C +FKWYLE
Sbjct: 477 HYFYAARPVAQGRPFGNIQARVELRKRLKCHTFKWYLE 514
>gi|194669011|ref|XP_001788574.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Bos
taurus]
Length = 652
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 96/324 (29%), Positives = 136/324 (41%), Gaps = 55/324 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 289 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 342
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 343 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 400
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K IP P P ++ + + Y E
Sbjct: 401 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEHIYQRRPE 460
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVSNDW 216
LS GDVT++K+LR +L CKSFKW++ E+ N
Sbjct: 461 YRHLS----AGDVTAQKKLRSSLNCKSFKWFMTKIAWDLPQFYPPVEPPAAAWGEIRNVG 516
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------CLD 264
+G+C D+ K + P+ L C + G W + +IR + C D
Sbjct: 517 TGLCADT--KHGALGSPLRLESCIRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKFCFD 574
Query: 265 YAG--GDVILYPCHGSKGNQYFEY 286
V LY CH KGNQ ++Y
Sbjct: 575 AVSHTSPVTLYDCHSMKGNQLWKY 598
>gi|313230315|emb|CBY08019.1| unnamed protein product [Oikopleura dioica]
Length = 589
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 101/345 (29%), Positives = 146/345 (42%), Gaps = 80/345 (23%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
EV WL PLL ++ + VV P+I I ++ F+ PG G FDW L
Sbjct: 248 VEVSTNWLPPLLHPISLDRKTVVCPMIDIIDNENFQYVTQPGDAMR------GAFDWELY 301
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP KR K+ +EP +P MAGGLF+I++ +F ++G YD G +IWGGE ELSF
Sbjct: 302 YKRIPIPNE--KRPKDPSEPFESPVMAGGLFAIERNYFYEIGLYDEGLEIWGGEQYELSF 359
Query: 124 KFNWHA---IPERERKRHKNAAE---PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
K W I + R + P P G ++ Y ++W
Sbjct: 360 KV-WMCGGRILDSPCSRIGHIYRKFVPYTIPNNGGPNYN-----------YKRVAEVWMD 407
Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWY---------------LEVSND 215
E E + K D GD++ K LR+ L CKSF WY L S
Sbjct: 408 EYAEFFYRRRPYVRKIDAGDLSKAKALRKELKCKSFDWYIKNVIPDLVQYYPPILPPSAA 467
Query: 216 W-------SGMCID-----------SACKPTD-----------MHKPVGLYPCHKQGGNQ 246
W S +CID S C+ + K + + CH Q GNQ
Sbjct: 468 WGRLKHVVSNLCIDPQVKKGSQVVVSQCQTPEGAVRTCLDASYRSKSILTWDCHNQHGNQ 527
Query: 247 FWMMSKHGEIR-RDEACLDYAGGDVILYPCHGSKGNQYFEYDYKY 290
W + I + C A G +++ PC S G++ FE+++++
Sbjct: 528 LWKYFEKQLIHPSSKKCATVASGALLMMPC--SPGDRLFEWEWEH 570
>gi|297477445|ref|XP_002689374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Bos
taurus]
gi|296485129|tpg|DAA27244.1| TPA: polypeptide N-acetylgalactosaminyltransferase 10-like [Bos
taurus]
Length = 620
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 96/324 (29%), Positives = 136/324 (41%), Gaps = 55/324 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 257 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 310
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 311 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 368
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K IP P P ++ + + Y E
Sbjct: 369 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEHIYQRRPE 428
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVSNDW 216
LS GDVT++K+LR +L CKSFKW++ E+ N
Sbjct: 429 YRHLS----AGDVTAQKKLRSSLNCKSFKWFMTKIAWDLPQFYPPVEPPAAAWGEIRNVG 484
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------CLD 264
+G+C D+ K + P+ L C + G W + +IR + C D
Sbjct: 485 TGLCADT--KHGALGSPLRLESCIRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKFCFD 542
Query: 265 YAG--GDVILYPCHGSKGNQYFEY 286
V LY CH KGNQ ++Y
Sbjct: 543 AVSHTSPVTLYDCHSMKGNQLWKY 566
>gi|15207811|dbj|BAB62930.1| hypothetical protein [Macaca fascicularis]
Length = 373
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 73/220 (33%), Positives = 104/220 (47%), Gaps = 31/220 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL+PLL +A++ VV PLI I D T E + P G FDWNLQ
Sbjct: 160 CEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLEYKPSP--------VVRGAFDWNLQ 211
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + E + +P+ +P M+GG+F+I + +F ++G YD D WGGENLELS
Sbjct: 212 FKWDNVFSYEMDGPEGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGGENLELSL 271
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ IP R H + + T + + Y +W E
Sbjct: 272 RIWMCGGQLFIIP-CSRVGHISKKQTRKTSAIISA----------TIHNYLRLVHVWLDE 320
Query: 179 NLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
E F +G++ R +LR+ LGCKSF+WYL+
Sbjct: 321 YKEQFFLRKPGLKYVTYGNIHERVQLRKRLGCKSFQWYLD 360
>gi|195115752|ref|XP_002002420.1| GI12891 [Drosophila mojavensis]
gi|193912995|gb|EDW11862.1| GI12891 [Drosophila mojavensis]
Length = 622
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 94/335 (28%), Positives = 144/335 (42%), Gaps = 69/335 (20%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
EV ++WL+PLL ++ +S + P+I I DTFE + P L GGF+W L F
Sbjct: 242 EVNRQWLEPLLRLVKAENSTLAVPVIDLINADTFE--YTPSPLVR------GGFNWGLHF 293
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
W +PE K ++ P +PTMAGGLF++++ +F+ +G YD DIWGGEN+E+SF+
Sbjct: 294 RWENLPEGTLKVPEDFKGPFRSPTMAGGLFAVNRLYFQHIGEYDMAMDIWGGENIEISFR 353
Query: 125 F-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDI 174
+ +P RKR A P TM + + +K + +
Sbjct: 354 VWQCGGSIKIVPCSRVGHIFRKRRPYTA-PDGANTMLKNSLRLAHVWMDKYKEFYLKHE- 411
Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE--------------------VSN 214
+++ D+GD+++R +LR L CK F WYL+ V
Sbjct: 412 ------KVAKDYDYGDISARLQLRERLHCKDFGWYLKHVYPELRLPGDESKKSGAAPVFQ 465
Query: 215 DWSG------------MCIDSACKPTDMHKPVG---------LYPCHKQGGNQFWMMSKH 253
W + C K G L C + NQ W ++
Sbjct: 466 PWHSRKRNYLDSFQLRLAGTQLCAAVVAPKVKGFWKKGSSLTLQICKPRAPNQMWYETEK 525
Query: 254 GEIRRDEA-CLDYAGGD-VILYPCHGSKGNQYFEY 286
EI D+ CL+ A VI+ CH G+Q + +
Sbjct: 526 SEIILDKLFCLEAAADTLVIINKCHEMLGDQQWRH 560
>gi|347971791|ref|XP_003436799.1| AGAP004375-PB [Anopheles gambiae str. PEST]
gi|333469031|gb|EGK97157.1| AGAP004375-PB [Anopheles gambiae str. PEST]
Length = 585
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 73/219 (33%), Positives = 112/219 (51%), Gaps = 24/219 (10%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL+PL D LA + + ++SP+I I TFE R RL GGFDW+L
Sbjct: 218 CEVNRGWLEPLHDRLAIDPTAILSPVIDIIDPHTFEYRANSARLR-------GGFDWSLH 270
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W I E E R + + P ++P ++GG+F + K+ F++LG +D G DIWGGE+LE+S
Sbjct: 271 FRWLPIAEEEFEHRRHDESLPFYSPAISGGIFIVAKSLFQQLGGFDPGMDIWGGESLEMS 330
Query: 123 FK-----FNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
K + +P R++H + +P G + + + F
Sbjct: 331 LKAWMCGAHVEVVPCSRIGHVFRRKHPFSFQP------DGSHLTYLRNTKRVALVWMDEF 384
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ E + D G +T ++ELRR+L C+ F WYL+
Sbjct: 385 KNFFYETRPEAVAVDAGSITEQQELRRSLNCRKFSWYLQ 423
>gi|395838452|ref|XP_003792129.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5
[Otolemur garnettii]
Length = 869
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 76/224 (33%), Positives = 106/224 (47%), Gaps = 37/224 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WL+PLL +A++ VV PLI I + T E R P G FDW L+
Sbjct: 416 CEVNKGWLEPLLYSIAKDHKMVVCPLIDVIDETTLEYRASP--------VVRGAFDWELK 467
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + E +P+ +P MAGG+F+I + +F ++G YD G D+WGGENLELS
Sbjct: 468 FKWDNVFSYEMDGPDRPIKPIRSPAMAGGIFAIYRHYFNEIGQYDKGMDLWGGENLELSL 527
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD--------IW 175
+ W + P G I K F+++ F +W
Sbjct: 528 RI-WMC-----------GGQLFIIPCSRVG--HITKKQFKEVSAITRAFTRNSLRMVHVW 573
Query: 176 GGENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLEV 212
E E F +G+++ R ELR+ LGCKSF+WYL+
Sbjct: 574 LDEYKEQFFLRKPGLRSIAYGNISERVELRKRLGCKSFQWYLDT 617
>gi|16769916|gb|AAL29177.1| SD10722p [Drosophila melanogaster]
Length = 666
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 102/345 (29%), Positives = 139/345 (40%), Gaps = 91/345 (26%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E WL PLL+ +A N V P I I F R + + G FDW +
Sbjct: 298 VEANYNWLPPLLEPIALNKRTAVCPFIDVIDHTNFHYR-------AQDEGARGAFDW--E 348
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F + +P K+ A+P +P MAGGLF+I + FF +LG YD G DIWGGE ELSF
Sbjct: 349 FFYKRLPLLPEDL-KHPADPFKSPIMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSF 407
Query: 124 KF-----------------------NWHAIPERERKRHKN--AAEPVWTPTMAGGLFSID 158
K N P + HKN VW L+S
Sbjct: 408 KIWMCGGEMYDAPCSRIGHIYRGPRNHQPSPRKGDYLHKNYKRVAEVWMDEYKNYLYSHG 467
Query: 159 KAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-EVSNDW- 216
+E + D GD+T +K +R L CKSFKW++ EV+ D
Sbjct: 468 DGLYESV---------------------DPGDLTEQKAIRTKLNCKSFKWFMKEVAFDLM 506
Query: 217 ---------------------SGMCIDSACKPTDMHKPVGLYPC----HKQGGNQFWMMS 251
+C+D+ + H +G+Y C QFW +S
Sbjct: 507 KTYPPVDPPSYAMGALQNVGNQNLCLDTLGR--KKHNKMGMYACADNIKTPQRTQFWELS 564
Query: 252 --KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEYDYKY 290
+ +RR + CLD A V L+ CH GNQY+ YDY++
Sbjct: 565 WKRDLRLRRKKECLDVQIWDANAPVWLWDCHSQGGNQYWYYDYRH 609
>gi|350400167|ref|XP_003485756.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Bombus
impatiens]
Length = 582
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 94/325 (28%), Positives = 139/325 (42%), Gaps = 55/325 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELR--FPPGRLTSSYKFFIGGFDWN 61
CEV WL PLL +A + + + P+I I TFE R + G L + G F+W
Sbjct: 229 CEVNVNWLPPLLAPIAVDRTVMTVPIIDGIDHKTFEYRPVYQEGHL------YRGIFEWG 282
Query: 62 LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
+ + + +P RE+K + P +PT AGGLF+I++ +F LG YD G +WGGEN EL
Sbjct: 283 MLYKENELPAREKKSRPYNSMPYKSPTHAGGLFAINREYFLSLGGYDDGLLVWGGENFEL 342
Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
SFK N +P H + P G L K + Y + W
Sbjct: 343 SFKIWQCGGNILWVP----CSHVGHVYRGFMPYTFGKLAQKKKGPLITI-NYKRVVETWF 397
Query: 177 GENLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------ 210
+ + L+ D GD++ + E +R CKSF+WY+
Sbjct: 398 DDKYKEFFYTREPLAQLLDHGDISEQLEFKRRKRCKSFQWYMENVAYDVFDKFPELPPNI 457
Query: 211 ---EVSNDWSGMCIDSACKPTDMHKPVGLYP---CHKQGGNQFWMMSKHGEIRRDEACLD 264
E+ N +GMC+D+ H P L CH G NQ ++ G++ E C+
Sbjct: 458 HWGELRNIATGMCLDTMS-----HSPPSLMATTDCHGFGNNQLIRLNAKGQLGVGERCIS 512
Query: 265 YAGGDVILYPCHGSKGNQYFEYDYK 289
G V C + ++YD K
Sbjct: 513 ADGQGVKFVFCRLGTVDGPWQYDEK 537
>gi|193683588|ref|XP_001951150.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Acyrthosiphon
pisum]
Length = 588
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 94/301 (31%), Positives = 137/301 (45%), Gaps = 35/301 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PL+ +AR+ + P+I I +T+E R + F G F+W +
Sbjct: 233 CEVGYNWLPPLIAPIARDRKIMTVPVIDGIDHNTWEYR----PVYEKDHLFRGIFEWGML 288
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +E ++ +EP +PT AGGLF+ID+ +F +LG YD G +WGGEN ELSF
Sbjct: 289 YKEIEIPAQEERKRIYKSEPYKSPTHAGGLFAIDRNYFLELGAYDPGLLVWGGENFELSF 348
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY------DSGFDIWGG 177
K W E V+ M + K L TY ++ FD
Sbjct: 349 KI-WQCGGSIEWVPCSRVGH-VYRGFMPYNFGELGKKVKGPLITYNYKRVIETWFDNKHK 406
Query: 178 E----NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSND-------------WSGM 219
E L+ D GD++ + EL+ L CK F W++E V+ D W +
Sbjct: 407 EFFYTREPLARYLDMGDISKQLELKDKLQCKDFSWFMENVAYDVYTKFPELPPNLYWGEL 466
Query: 220 --CIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYP 274
+ C T H+P VGL CH QG NQ + ++ G++ E C+ +V L
Sbjct: 467 RNIGKTTCLDTRGHQPPSLVGLELCHGQGNNQLFRLNTKGQLSVGERCIFADRQNVKLVV 526
Query: 275 C 275
C
Sbjct: 527 C 527
>gi|402865469|ref|XP_003896945.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5 [Papio
anubis]
Length = 475
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 73/220 (33%), Positives = 104/220 (47%), Gaps = 31/220 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL+PLL +A++ VV PLI I D T E + P G FDWNLQ
Sbjct: 262 CEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLEYKPSP--------VVRGAFDWNLQ 313
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + E + +P+ +P M+GG+F+I + +F ++G YD D WGGENLELS
Sbjct: 314 FKWDNVFSYEMDGPEGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGGENLELSL 373
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ IP R H + + T + + Y +W E
Sbjct: 374 RIWMCGGQLFIIP-CSRVGHISKKQTRKTSAIISA----------TIHNYLRLVHVWLDE 422
Query: 179 NLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
E F +G++ R +LR+ LGCKSF+WYL+
Sbjct: 423 YKEQFFLRKPGLKYVTYGNIHERVQLRKRLGCKSFQWYLD 462
>gi|51316066|sp|Q95JX4.2|GLTL5_MACFA RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5;
AltName: Full=Polypeptide GalNAc transferase 15;
Short=GalNAc-T15; Short=pp-GaNTase 15; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 15;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 15
gi|15207881|dbj|BAB62965.1| hypothetical protein [Macaca fascicularis]
Length = 443
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 73/220 (33%), Positives = 104/220 (47%), Gaps = 31/220 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL+PLL +A++ VV PLI I D T E + P G FDWNLQ
Sbjct: 230 CEVNRVWLEPLLHAIAKDPKMVVRPLIDVIDDRTLEYKPSP--------VVRGAFDWNLQ 281
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + E + +P+ +P M+GG+F+I + +F ++G YD D WGGENLELS
Sbjct: 282 FKWDNVFSYEMDGPEGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGGENLELSL 341
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ IP R H + + T + + Y +W E
Sbjct: 342 RIWMCGGQLFIIP-CSRVGHISKKQTRKTSAIISA----------TIHNYLRLVHVWLDE 390
Query: 179 NLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE 211
E F +G++ R +LR+ LGCKSF+WYL+
Sbjct: 391 YKEQFFLRKPGLKYVTYGNIHERVQLRKRLGCKSFQWYLD 430
>gi|195167889|ref|XP_002024765.1| GL22638 [Drosophila persimilis]
gi|194108170|gb|EDW30213.1| GL22638 [Drosophila persimilis]
Length = 676
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 104/330 (31%), Positives = 143/330 (43%), Gaps = 65/330 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E WL PLL+ +A+N V P I I TF R + + G FDW +
Sbjct: 307 VEANYNWLPPLLEPIAKNKRTAVCPFIDVIDHATFNYR-------AQDEGARGAFDW--E 357
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F + +P + K A+P +P MAGGLF+I + FF +LG YD G DIWGGE ELSF
Sbjct: 358 FYYKRLPLLDEDL-KYPADPFKSPVMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSF 416
Query: 124 KF------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFE------KLGTYDSG 171
K + A R ++ V +P L K E K YD
Sbjct: 417 KIWMCGGEMYDAPCSRIGHIYRGPRNHVPSPRKGDYLHRNYKRVAEVWMDEYKNYLYDHA 476
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-EVSNDWSG------------ 218
I+ + D GD+T + +R+ L CKSFKW++ EV+ D
Sbjct: 477 DGIYD--------RIDAGDLTEQMAIRKKLKCKSFKWFMEEVAFDLINSYPPVDPPTFAL 528
Query: 219 ----------MCIDSACKPTDMHKPVGLYPCHKQ----GGNQFWMMS--KHGEIRRDEAC 262
+CID+ + HK +G+Y C + QFW +S + +RR + C
Sbjct: 529 GAIQNVGDKRLCIDTMGRRK--HKRMGVYACAEDLKVPQKTQFWELSWKRDLRLRRKKEC 586
Query: 263 LDY----AGGDVILYPCHGSKGNQYFEYDY 288
LD V L+ CH GNQY+ YDY
Sbjct: 587 LDVQIWTVNAPVWLWDCHLQGGNQYWSYDY 616
>gi|347971789|ref|XP_001237517.3| AGAP004375-PA [Anopheles gambiae str. PEST]
gi|333469030|gb|EAU76847.3| AGAP004375-PA [Anopheles gambiae str. PEST]
Length = 575
Score = 117 bits (294), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 73/219 (33%), Positives = 112/219 (51%), Gaps = 24/219 (10%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL+PL D LA + + ++SP+I I TFE R RL GGFDW+L
Sbjct: 208 CEVNRGWLEPLHDRLAIDPTAILSPVIDIIDPHTFEYRANSARLR-------GGFDWSLH 260
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W I E E R + + P ++P ++GG+F + K+ F++LG +D G DIWGGE+LE+S
Sbjct: 261 FRWLPIAEEEFEHRRHDESLPFYSPAISGGIFIVAKSLFQQLGGFDPGMDIWGGESLEMS 320
Query: 123 FK-----FNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
K + +P R++H + +P G + + + F
Sbjct: 321 LKAWMCGAHVEVVPCSRIGHVFRRKHPFSFQP------DGSHLTYLRNTKRVALVWMDEF 374
Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
+ E + D G +T ++ELRR+L C+ F WYL+
Sbjct: 375 KNFFYETRPEAVAVDAGSITEQQELRRSLNCRKFSWYLQ 413
>gi|125977364|ref|XP_001352715.1| GA15243 [Drosophila pseudoobscura pseudoobscura]
gi|54641464|gb|EAL30214.1| GA15243 [Drosophila pseudoobscura pseudoobscura]
Length = 676
Score = 117 bits (294), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 104/330 (31%), Positives = 143/330 (43%), Gaps = 65/330 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E WL PLL+ +A+N V P I I TF R + + G FDW +
Sbjct: 307 VEANYNWLPPLLEPIAKNKRTAVCPFIDVIDHATFNYR-------AQDEGARGAFDW--E 357
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F + +P + K A+P +P MAGGLF+I + FF +LG YD G DIWGGE ELSF
Sbjct: 358 FYYKRLPLLDEDL-KYPADPFKSPVMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSF 416
Query: 124 KF------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFE------KLGTYDSG 171
K + A R ++ V +P L K E K YD
Sbjct: 417 KIWMCGGEMYDAPCSRIGHIYRGPRNHVPSPRKGDYLHRNYKRVAEVWMDEYKNYLYDHA 476
Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-EVSNDWSG------------ 218
I+ + D GD+T + +R+ L CKSFKW++ EV+ D
Sbjct: 477 DGIYD--------RIDAGDLTEQMAIRKKLKCKSFKWFMEEVAFDLINSYPPVDPPTFAL 528
Query: 219 ----------MCIDSACKPTDMHKPVGLYPCHKQ----GGNQFWMMS--KHGEIRRDEAC 262
+CID+ + HK +G+Y C + QFW +S + +RR + C
Sbjct: 529 GAIQNVGDKRLCIDTMGRRK--HKRMGVYACAEDLKVPQKTQFWELSWKRDLRLRRKKEC 586
Query: 263 LDY----AGGDVILYPCHGSKGNQYFEYDY 288
LD V L+ CH GNQY+ YDY
Sbjct: 587 LDVQIWTVNAPVWLWDCHLQGGNQYWSYDY 616
>gi|403285674|ref|XP_003934138.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Saimiri boliviensis boliviensis]
Length = 682
Score = 117 bits (294), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 98/327 (29%), Positives = 141/327 (43%), Gaps = 61/327 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 319 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 372
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 373 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 430
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P P ++A L + + + ++ Y
Sbjct: 431 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 487
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDVT++K+LR +L CKSFKW++ E+
Sbjct: 488 RPEYRHLS----AGDVTAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 543
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
N +G+C D+ K + P+ L C + G W + +IR +
Sbjct: 544 NVGTGLCADT--KHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKF 601
Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
C D V LY CH KGNQ ++Y
Sbjct: 602 CFDAISHTSPVTLYDCHSMKGNQLWKY 628
>gi|15207947|dbj|BAB62998.1| hypothetical protein [Macaca fascicularis]
Length = 443
Score = 117 bits (294), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 73/220 (33%), Positives = 104/220 (47%), Gaps = 31/220 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL+PLL +A++ VV PLI I D T E + P G FDWNLQ
Sbjct: 230 CEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLEYKPSP--------VVRGAFDWNLQ 281
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + E + +P+ +P M+GG+F+I + +F ++G YD D WGGENLELS
Sbjct: 282 FKWDNVFSYEMDGPEGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGGENLELSL 341
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ IP R H + + T + + Y +W E
Sbjct: 342 RIWMCGGQLFIIP-CSRVGHISKKQTRKTSAIISA----------TIHNYLRLVHVWLDE 390
Query: 179 NLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE 211
E F +G++ R +LR+ LGCKSF+WYL+
Sbjct: 391 YKEQFFLRKPGLKYVTYGNIHERVQLRKRLGCKSFQWYLD 430
>gi|148237032|ref|NP_001084848.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Xenopus
laevis]
gi|47124654|gb|AAH70527.1| MGC78803 protein [Xenopus laevis]
Length = 653
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 92/326 (28%), Positives = 149/326 (45%), Gaps = 60/326 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV W PL+ +A++ + PLI I +T+EL P F G +DW++
Sbjct: 300 CEVGINWYAPLIAPIAKDRTTCTVPLIDVIEGNTYELI--PQAGGDEDGFARGAWDWSML 357
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +E+++ K EP +P MAGGLF+I++ +F +LG YD G IWGGEN E+S+
Sbjct: 358 WKRVPLTSKEKEQRKTKTEPYRSPAMAGGLFAIEREYFFELGLYDPGLQIWGGENFEISY 417
Query: 124 KFNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSI----------DKAFFEKLGTYDSGF 172
K W + ++TP + G ++ + L Y
Sbjct: 418 KI-WQC-----------GGKLLFTPCSRVGHIYRLHGWQGNPTPAHVGSSPTLKNYVRVV 465
Query: 173 DIWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-----VSN------ 214
++W E + + +GD+++ K+ R + CKSFKW++E + N
Sbjct: 466 EVWWDEYRDYFYASRPETKALAYGDISALKKFREDHNCKSFKWFMEEIAYDIPNYYPLPP 525
Query: 215 ---DW-------SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD 264
DW SG CIDS +G CH+ GGNQ + +++ ++ + + CL
Sbjct: 526 RNVDWGEIRGFESGYCIDSMGHTNGGLAELG--GCHRMGGNQLFRINEANQLMQYDQCLT 583
Query: 265 YA--GGDVILYPCHGSKGNQYFEYDY 288
G VIL C+ N+Y E+ Y
Sbjct: 584 KGTDGSKVILTHCN---LNEYKEWQY 606
>gi|296193322|ref|XP_002744461.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Callithrix jacchus]
Length = 667
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 98/327 (29%), Positives = 141/327 (43%), Gaps = 61/327 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 304 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 357
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 358 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 415
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P P ++A L + + + ++ Y
Sbjct: 416 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 472
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDVT++K+LR +L CKSFKW++ E+
Sbjct: 473 RPEYRHLS----AGDVTAQKKLRSSLNCKSFKWFMMKIAWDLPKFYPPVEPPAAAWGEIR 528
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
N +G+C D+ K + P+ L C + G W + +IR +
Sbjct: 529 NVGTGLCADT--KHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKF 586
Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
C D V LY CH KGNQ ++Y
Sbjct: 587 CFDAISHTSPVTLYDCHSMKGNQLWKY 613
>gi|405950576|gb|EKC18555.1| Putative polypeptide N-acetylgalactosaminyltransferase 10
[Crassostrea gigas]
Length = 526
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 104/329 (31%), Positives = 140/329 (42%), Gaps = 69/329 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLL+ +A + VV P I I + F R + + G FDW +
Sbjct: 167 CEANINWLPPLLEPIAEDYKTVVCPFIDVIDFENFAYR-------AQDEGARGAFDW--E 217
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F + +P E K+ AEP +P MAGGLF+I +F ++G YD G DIWGGE ELSF
Sbjct: 218 FFYKRLPLLEEDL-KHPAEPFKSPVMAGGLFAISAKWFWEMGGYDPGLDIWGGEQYELSF 276
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGT-------YDSGFDIWG 176
K W V P G A F G Y ++W
Sbjct: 277 KL-WQC-----------GGMMVDAPCSRIGHIYRKFAPFPNPGVGDFVGRNYRRVAEVWM 324
Query: 177 GENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL------------------- 210
E E +K D GDV+ +K +R L CK FKW++
Sbjct: 325 DEYAEYLYKRRPHYRNIDPGDVSEQKAIRDKLHCKPFKWFMEEVAFDLPKFYPPVEPPPF 384
Query: 211 ---EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQ---GGNQFWMMSKHGEIR--RDEAC 262
EV N + MC+D+ K ++ L PC K GG Q + + H +IR + C
Sbjct: 385 ASGEVRNKAANMCLDTRYK--GQNERFDLQPCLKDGKGGGEQQFEFTWHKDIRPGKRTVC 442
Query: 263 LDYA----GGDVILYPCHGSKGNQYFEYD 287
D + VIL+ CHG GNQ F+Y+
Sbjct: 443 FDVSQSIKKAPVILFNCHGMGGNQRFKYN 471
>gi|198473174|ref|XP_001356196.2| GA20382 [Drosophila pseudoobscura pseudoobscura]
gi|198139336|gb|EAL33256.2| GA20382 [Drosophila pseudoobscura pseudoobscura]
Length = 617
Score = 117 bits (293), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 95/336 (28%), Positives = 147/336 (43%), Gaps = 72/336 (21%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
EV ++WL+PLL ++ ++ + P+I I DTFE + P L GGF+W L F
Sbjct: 241 EVNRQWLEPLLRLIKAENASLAVPVIDLINADTFE--YTPSPLVR------GGFNWGLHF 292
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
W +PE K ++ P +PTMAGGLF++++ +F+ +G YD DIWGGEN+E+SF+
Sbjct: 293 RWENLPEGTLKVPEDFRGPFRSPTMAGGLFAVNRLYFQDIGEYDMAMDIWGGENIEISFR 352
Query: 125 FNWHA------IPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
W +P RKR A P TM + + +K + +
Sbjct: 353 V-WQCGGAIKIVPCSRVGHIFRKRRPYTA-PDGANTMLKNSLRLAYVWMDKYKDFYLKHE 410
Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------EVSNDWSG--- 218
+++ D+GD++ R +LR L C+ F+WYL E SG
Sbjct: 411 -------KVAKDYDYGDISDRLQLRERLQCRDFEWYLRNVYPELHIPGEEPKKSASGPVF 463
Query: 219 -----------------MCIDSACKPTDMHKPVG---------LYPCHKQGGNQFWMMSK 252
+ C K G L PC ++ NQ W ++
Sbjct: 464 QPWHSRKRNYIDFYMLRLAGTELCASVMAPKVKGFWKKGSSLQLQPC-RRTPNQLWYETE 522
Query: 253 HGEIRRDE-ACLDYAG-GDVILYPCHGSKGNQYFEY 286
EI D+ CL+ +G VI+ CH G+Q + +
Sbjct: 523 KSEIILDKLLCLEASGDSQVIINKCHEMLGDQQWRH 558
>gi|194865210|ref|XP_001971316.1| GG14889 [Drosophila erecta]
gi|190653099|gb|EDV50342.1| GG14889 [Drosophila erecta]
Length = 666
Score = 117 bits (293), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 103/343 (30%), Positives = 139/343 (40%), Gaps = 91/343 (26%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E WL PLL+ +A N V P I I F R + + G FDW +
Sbjct: 298 VEANYNWLPPLLEPIALNKRTAVCPFIDVIDHSNFNYR-------AQDEGARGAFDW--E 348
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F + +P + K+ A+P +P MAGGLF+I + FF +LG YD G DIWGGE ELSF
Sbjct: 349 FFYKRLPLL-KDDLKHPADPFKSPIMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSF 407
Query: 124 KF-----------------------NWHAIPERERKRHKN--AAEPVWTPTMAGGLFSID 158
K N P R H+N VW L+S
Sbjct: 408 KIWMCGGEMYDAPCSRIGHIYRGPRNHQPSPRRGDYLHRNYKRVAEVWMDEYKNYLYSHG 467
Query: 159 KAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW- 216
+E + D GD+T +K +R L CKSFKW++E V+ D
Sbjct: 468 DGVYESV---------------------DPGDLTEQKAIRTKLKCKSFKWFMEAVAFDLM 506
Query: 217 ---------------------SGMCIDSACKPTDMHKPVGLYPCHKQ----GGNQFWMMS 251
+C+D+ K H +G+Y C +QFW +S
Sbjct: 507 KTYPPVDPPAYAMGALQNVGNQNLCLDTMGKKK--HNRMGMYSCASDIKVPQRSQFWELS 564
Query: 252 --KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEYDY 288
+ +RR + CLD A V L+ CH GNQY+ YDY
Sbjct: 565 WKRDLRLRRKKECLDVQIWDANAPVWLWDCHSQGGNQYWYYDY 607
>gi|307186144|gb|EFN71869.1| N-acetylgalactosaminyltransferase 6 [Camponotus floridanus]
Length = 602
Score = 117 bits (293), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 106/329 (32%), Positives = 146/329 (44%), Gaps = 69/329 (20%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
E WL PLL+ +A+N V P I I +TFE R + + G FDW L +
Sbjct: 238 EANVNWLPPLLEPIAQNYKTCVCPFIDVIAYETFEYR-------AQDEGARGAFDWELYY 290
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
+ + KR AEP +P MAGGLF+I FF +LG YD G DIWGGE ELSFK
Sbjct: 291 KRLPLLPEDLKR---PAEPFKSPIMAGGLFAISAKFFWELGGYDPGLDIWGGEQYELSFK 347
Query: 125 FNWHAIPER-----ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-TYDSGFDIWGGE 178
W + R H P + P G F LG Y ++W E
Sbjct: 348 I-WQCGGQMYDAPCSRVGHIYRKFPPF-PNPGRGDF---------LGKNYKRVAEVWMDE 396
Query: 179 NLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-VSNDW-------------- 216
E +K D GD++ +K LR L CK F W++E ++ D
Sbjct: 397 YAEYIYKRRPHLRALDPGDLSEQKALRVKLHCKPFNWFIENIAFDLVEVYPPIEPDDFAY 456
Query: 217 --------SGMCIDSACKPTDMHKPVGLYPCHKQ----GGNQFWMMSKHGEIRRDEA--C 262
+ +C+DS + D + + + C K G Q + ++ H +IR + C
Sbjct: 457 GEIRNMGATELCLDSKKRKRD--ELIVVDTCVKDDPKVSGEQEFRLTWHKDIRPKDRTDC 514
Query: 263 LDYAGGD----VILYPCHGSKGNQYFEYD 287
LD + G+ V LYPCHG +GNQ + YD
Sbjct: 515 LDVSRGEEKAPVSLYPCHGKQGNQLWRYD 543
>gi|74186700|dbj|BAE34806.1| unnamed protein product [Mus musculus]
Length = 603
Score = 117 bits (293), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 240 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 294 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 351
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P P ++A L + + + ++ Y
Sbjct: 352 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 408
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDV ++K+LR +L CKSFKW++ E+
Sbjct: 409 RPEYRHLS----AGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 464
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIRRDEA------ 261
N +G+C D+ K + P+ L C + G W + +IR +
Sbjct: 465 NVGTGLCTDT--KLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPGDPQHTKKF 522
Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
C D V LY CH KGNQ ++Y
Sbjct: 523 CFDAVSHTSPVTLYDCHSMKGNQLWKY 549
>gi|148675838|gb|EDL07785.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Mus musculus]
Length = 603
Score = 117 bits (293), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 240 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 294 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 351
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P P ++A L + + + ++ Y
Sbjct: 352 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 408
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDV ++K+LR +L CKSFKW++ E+
Sbjct: 409 RPEYRHLS----AGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 464
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIRRDEA------ 261
N +G+C D+ K + P+ L C + G W + +IR +
Sbjct: 465 NVGTGLCTDT--KLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPGDPQHTKKF 522
Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
C D V LY CH KGNQ ++Y
Sbjct: 523 CFDAVSHTSPVTLYDCHSMKGNQLWKY 549
>gi|268576230|ref|XP_002643095.1| C. briggsae CBR-GLY-11 protein [Caenorhabditis briggsae]
Length = 619
Score = 117 bits (293), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 110/221 (49%), Gaps = 33/221 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL PLLD + +N VV P+I I D +++ + + GG +W +
Sbjct: 272 CEVNEDWLPPLLDQIKQNRRRVVCPIIDII--DAITMKYVESPVCT------GGVNWAMT 323
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + N P+ +PTMAGGLF+ID+ +F ++G+YD G D+WG EN+E+SF
Sbjct: 324 FKWDYPHRSYFEDPMNYLNPLKSPTMAGGLFAIDRDYFFEIGSYDEGMDVWGAENVEISF 383
Query: 124 KFNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSG----FDIWGGE 178
+ W E + P + G +F + + K + +W E
Sbjct: 384 RI-WTC-----------GGELLIMPCSRVGHIFRRQRPYGIKTDSMGKNSVRLARVWLDE 431
Query: 179 NLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYLE 211
LE F+ D+GD+TSR LR+NL CK FKWYLE
Sbjct: 432 YLENFFEARPTYRTFTDYGDLTSRINLRQNLQCKPFKWYLE 472
>gi|26329191|dbj|BAC28334.1| unnamed protein product [Mus musculus]
Length = 528
Score = 117 bits (293), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 165 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 218
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 219 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 276
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P P ++A L + + + ++ Y
Sbjct: 277 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 333
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDV ++K+LR +L CKSFKW++ E+
Sbjct: 334 RPEYRHLS----AGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 389
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIRRDEA------ 261
N +G+C D+ K + P+ L C + G W + +IR +
Sbjct: 390 NVGTGLCTDT--KLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPGDPQHTKKF 447
Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
C D V LY CH KGNQ ++Y
Sbjct: 448 CFDAVSHTSPVTLYDCHSMKGNQLWKY 474
>gi|195033813|ref|XP_001988768.1| GH11345 [Drosophila grimshawi]
gi|193904768|gb|EDW03635.1| GH11345 [Drosophila grimshawi]
Length = 620
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 97/337 (28%), Positives = 147/337 (43%), Gaps = 73/337 (21%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
EV + W++PLL ++ ++ + P+I I DTFE + P L GGF+W L F
Sbjct: 241 EVNREWVEPLLRLVKAENATLAVPVIDLINADTFE--YTPSPLVR------GGFNWGLHF 292
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
W +PE K ++ P +PTMAGGLF++++ +F+ +G YD DIWGGEN+E+SF+
Sbjct: 293 RWENLPEGTLKVPEDFKGPFRSPTMAGGLFAVNRLYFQHIGEYDMAMDIWGGENIEISFR 352
Query: 125 FNWHA------IPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
W +P RKR A P TM + + + TY +
Sbjct: 353 V-WQCGGAIKIVPCSRVGHIFRKRRPYTA-PDGANTMLKNSMRVAHVWMD---TYKEYY- 406
Query: 174 IWGGENLELSFKG-DFGDVTSRKELRRNLGCKSFKWYLE--------------------V 212
LE KG DFGD++ R +LR L C++F WYL+ V
Sbjct: 407 ----LKLEKVPKGYDFGDISDRLQLRERLECQNFDWYLKHVYPELRVPGEESKKPVSAPV 462
Query: 213 SNDW---------------SGMCIDSACKPTDMH------KPVGLYPCHKQGGNQFWMMS 251
W SG + +A + P+ L C + NQ W +
Sbjct: 463 FQPWHSRKRNYLDSFQMRLSGTQLCAAVVSPKVKGFWKKGSPLTLQLCRPRAPNQLWYET 522
Query: 252 KHGEIRRDE-ACLDYAGGD-VILYPCHGSKGNQYFEY 286
+ EI D+ CL+ V++ CH G+Q + +
Sbjct: 523 EKSEIILDKLLCLEAVEDTMVVVNKCHEMLGDQQWRH 559
>gi|46877107|ref|NP_598950.2| polypeptide N-acetylgalactosaminyltransferase 10 [Mus musculus]
gi|51315866|sp|Q6P9S7.1|GLT10_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
AltName: Full=Polypeptide GalNAc transferase 10;
Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 10;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|38148689|gb|AAH60617.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Mus musculus]
gi|74196924|dbj|BAE35020.1| unnamed protein product [Mus musculus]
Length = 603
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 240 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 294 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 351
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P P ++A L + + + ++ Y
Sbjct: 352 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 408
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDV ++K+LR +L CKSFKW++ E+
Sbjct: 409 RPEYRHLS----AGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 464
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIRRDEA------ 261
N +G+C D+ K + P+ L C + G W + +IR +
Sbjct: 465 NVGTGLCTDT--KLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPGDPQHTKKF 522
Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
C D V LY CH KGNQ ++Y
Sbjct: 523 CFDAVSHTSPVTLYDCHSMKGNQLWKY 549
>gi|403276501|ref|XP_003929936.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5
[Saimiri boliviensis boliviensis]
Length = 455
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 73/220 (33%), Positives = 107/220 (48%), Gaps = 31/220 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL+PLL +A++ VV P+I I D T L++ P + G FDWNLQ
Sbjct: 242 CEVNRVWLEPLLHAIAKDPKMVVCPVIDVIDDRT--LKYKPSPVVR------GAFDWNLQ 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + E + +P+ +P MAGG+F+I + +F ++G YD D WGGENLELS
Sbjct: 294 FKWDNVFSYEMDGPEGPTKPIRSPAMAGGIFAIRRHYFNEIGQYDKDMDFWGGENLELSL 353
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ IP R H + +P + + Y +W E
Sbjct: 354 RIWMCGGQLFIIP-CSRVGHISKKQPGKGSELINAVAR----------NYLRLVHVWLDE 402
Query: 179 NLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE 211
E F +G+++ R ELR+ LGC+SF+WYL+
Sbjct: 403 YKEQFFLRKPGLKYMTYGNISERVELRKRLGCQSFQWYLD 442
>gi|327279823|ref|XP_003224655.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Anolis carolinensis]
Length = 941
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 92/305 (30%), Positives = 127/305 (41%), Gaps = 83/305 (27%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL+PLL+ + N V P+I I D + F G F+W + F W I
Sbjct: 598 WLEPLLERIHLNRKKVPCPVIEVISDKDMSY-------MTVDNFQRGIFNWPMNFGWKPI 650
Query: 70 PERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWH 128
P +++K + + P MAGGLFSIDK +F +LGTYD G D+WGGEN+E+SFK W
Sbjct: 651 PPDVIEKNKIKETDVIRCPVMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKV-WM 709
Query: 129 AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFF---EKLGTYDSGF------------D 173
E E + + G +F D + ++L T + D
Sbjct: 710 CGGEIE----------IIPCSRVGHIFRSDNPYSFPKDRLTTVERNLARVAEVWLDDYKD 759
Query: 174 IWGGENLELSFKG-DFGDVTSRKELRRNLGCKSFKWYLE----------------VSNDW 216
++ G L K D GD+T +KELR+ L CKSFKWYLE + N
Sbjct: 760 LFYGHGYHLVQKNLDVGDLTQQKELRKRLQCKSFKWYLENVYPDIEAPLVKASGLIINIA 819
Query: 217 SGMCI--------------------------------DSACKPTDMHKPVGLYPCHKQGG 244
CI DS PTD +GL+PC K+
Sbjct: 820 LAKCITVNQSSLAFETCDVNNKDQKFNYTWMRLIQHGDSCVAPTDAKGTLGLHPCDKRNK 879
Query: 245 NQFWM 249
+ W+
Sbjct: 880 SLKWL 884
>gi|344249957|gb|EGW06061.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Cricetulus
griseus]
Length = 494
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 95/331 (28%), Positives = 137/331 (41%), Gaps = 63/331 (19%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 125 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 178
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 179 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 236
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K IP + P P D L ++W E
Sbjct: 237 KVWMCGGRMEDIPCSRVGHIYRKSVPYKVPAGPA-----DPCNCLSLQNLKRVAEVWMDE 291
Query: 179 NLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL--------------------- 210
E ++ GDV ++K LR +L CKSFKW++
Sbjct: 292 YAEYIYQRRPEYRHLSAGDVVAQKRLRGSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAW 351
Query: 211 -EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIR------ 257
E+ N +G+C D+ + + P+ L C + G W + +IR
Sbjct: 352 GEIRNVGTGLCTDTKHGTSGL--PLRLETCIRGRGEAAWNSMQVFTFTWKEDIRPGDPQH 409
Query: 258 RDEACLDYAGGD--VILYPCHGSKGNQYFEY 286
+ C D + V LY CH KGNQ ++Y
Sbjct: 410 TKKLCFDAVSHNSPVTLYDCHSMKGNQLWKY 440
>gi|351708673|gb|EHB11592.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Heterocephalus glaber]
Length = 570
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 101/354 (28%), Positives = 148/354 (41%), Gaps = 83/354 (23%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WLQP+L + + + VVSP+I I D F L +S GGFDW+L
Sbjct: 180 CEVNIEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 232
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN---LE 120
F W IP ++ + +P+ TP +AGG+F IDKA+F LG YD+ DIWGGEN +
Sbjct: 233 FKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKAWFNHLGKYDAQMDIWGGENFGPVA 292
Query: 121 LSFK------------FNWHAIPERE---RKRHKNAAEPVWTPT-----MAGGLFSIDKA 160
L+ K ++ +P + ++ A+P+ M GG I
Sbjct: 293 LALKQPAQLEGVGDNFISYWCLPVAKPIIQREGSPMAQPIRAELSFRVWMCGGSLEIVPC 352
Query: 161 -----FFEKLGTYD--------------SGFDIWGGENLELSFKG-------DFGDVTSR 194
F K Y+ ++W E + ++ FG V +R
Sbjct: 353 SRVGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIGKAFGSVATR 412
Query: 195 KELRRNLGCKSFKWYLE---------VSNDWSGM------CIDSACKPTDMHKPVGLYPC 239
E R+ + CKSF+WYLE V G+ C++S + T +G+ C
Sbjct: 413 IEQRKKMDCKSFRWYLENVYPELTVPVKEVLPGIIKQGVNCLESQGQDTAGDFLLGMGIC 472
Query: 240 HKQGGN----QFWMMSKHGEIRRDEACLDYA-------GGDVILYPCHGSKGNQ 282
N Q W+ S H I++ CL G VIL C+ +G Q
Sbjct: 473 RGSAKNPPPPQAWLFSDH-LIQQQGKCLAATSTSTASPGSPVILQVCNSREGKQ 525
>gi|198434303|ref|XP_002132126.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
17 [Ciona intestinalis]
Length = 870
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 99/324 (30%), Positives = 149/324 (45%), Gaps = 61/324 (18%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
EV WL PLL+ +A + + P+I I D F PG G FDW L +
Sbjct: 508 EVTNNWLPPLLEPIALDRKVITCPMIDIINKDDFHYLTQPGDAMR------GAFDWELYY 561
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
IP K+ K+ ++P P MAGGLF+ID+ +F+++G YD G +IWGGE ELSFK
Sbjct: 562 KRIPIPPE--KQLKDPSDPFEDPVMAGGLFAIDRLYFKEIGEYDDGLEIWGGEQYELSFK 619
Query: 125 FNWHA---IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
W I + R + ++ G +I+K F ++W E E
Sbjct: 620 -AWMCGGKILDAPCSRVGHIYREFMPYSLPPGT-NINKNF-------KRVAEVWMDEYAE 670
Query: 182 LSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE----------------------V 212
+K GD++ +K LR L C+SF W+++ +
Sbjct: 671 YFYKKRPHVRGIHPGDLSKQKALRELLECRSFDWFMKEVAPDIIKHYPPVMPEPAAWGML 730
Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQG-GNQFWMMSKHGEIR----RDEA---CLD 264
SN+ S C+D K P+ L PC ++G +Q ++++ +IR D+A CLD
Sbjct: 731 SNEGSKRCLDGLYKKEG--APLSLMPCREEGTADQSFILTWKEDIRPGTSMDKARKFCLD 788
Query: 265 YAG--GDVILYPCHGSKGNQYFEY 286
G V+L+ CHG GNQ ++Y
Sbjct: 789 GQGLNSPVVLWQCHGQYGNQLWKY 812
>gi|47847466|dbj|BAD21405.1| mFLJ00205 protein [Mus musculus]
Length = 634
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 271 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 324
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 325 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 382
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P P ++A L + + + ++ Y
Sbjct: 383 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 439
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDV ++K+LR +L CKSFKW++ E+
Sbjct: 440 RPEYRHLS----AGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 495
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIRRDEA------ 261
N +G+C D+ K + P+ L C + G W + +IR +
Sbjct: 496 NVGTGLCTDT--KLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPGDPQHTKKF 553
Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
C D V LY CH KGNQ ++Y
Sbjct: 554 CFDAVSHTSPVTLYDCHSMKGNQLWKY 580
>gi|18543347|ref|NP_570098.1| polypeptide N-acetylgalactosaminyltransferase 10 [Rattus
norvegicus]
gi|51315730|sp|Q925R7.1|GLT10_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
AltName: Full=Polypeptide GalNAc transferase 10;
Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 10;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|14150450|gb|AAK54498.1|AF241241_1 UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase T9 [Rattus
norvegicus]
gi|149052685|gb|EDM04502.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Rattus norvegicus]
Length = 603
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 240 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 294 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 351
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P P ++A L + + + ++ Y
Sbjct: 352 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 408
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDV ++K+LR +L CKSFKW++ E+
Sbjct: 409 RPEYRHLS----AGDVVAQKKLRGSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 464
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIRRDEA------ 261
N +G+C D+ K + P+ L C + G W + +IR +
Sbjct: 465 NVGTGLCTDT--KHGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPGDPQHTKKF 522
Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
C D V LY CH KGNQ ++Y
Sbjct: 523 CFDAVSHTSPVTLYDCHSMKGNQLWKY 549
>gi|345799489|ref|XP_546283.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Canis
lupus familiaris]
Length = 603
Score = 117 bits (292), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 95/324 (29%), Positives = 135/324 (41%), Gaps = 55/324 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 240 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 294 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 351
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K IP P P ++ + + Y E
Sbjct: 352 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEHIYQRRPE 411
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVSNDW 216
LS GDV ++K+LR L CKSFKW++ E+ N
Sbjct: 412 YRHLS----AGDVAAQKKLRSALNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIHNVG 467
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------CLD 264
+G+C+D+ K + P+ L C + G W + +IR + C D
Sbjct: 468 TGLCVDT--KHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKFCFD 525
Query: 265 YAGGD--VILYPCHGSKGNQYFEY 286
V LY CH KGNQ ++Y
Sbjct: 526 AISNTSPVTLYDCHSMKGNQLWKY 549
>gi|312377569|gb|EFR24376.1| hypothetical protein AND_11091 [Anopheles darlingi]
Length = 1150
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 100/326 (30%), Positives = 136/326 (41%), Gaps = 72/326 (22%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
WL PLL+ +A N V P I I DDTFEL T + G FDWN+ + +
Sbjct: 788 WLPPLLEPIAHNPRTCVCPFIDVIMDDTFEL-------TPQDQGARGAFDWNMLYK--RL 838
Query: 70 PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA 129
P R + K+ +P +P MAGGLF+I FF +LG YD +IWG E ELSFK W
Sbjct: 839 PLRPEDQ-KDPTQPFESPVMAGGLFAISSMFFWELGGYDEMLEIWGAEQYELSFKI-WQC 896
Query: 130 IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD-------SGFDIWGGENLE- 181
+ P G + F + +YD ++W E +
Sbjct: 897 -----------GGRMIDAPCSRVGHIYRSYSPFPNVKSYDYVAKNHKRVAEVWMDEYKKY 945
Query: 182 ------LSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSN----DW--------------- 216
+ F D GD+T KELRR L CK F+W++E DW
Sbjct: 946 VYRKDPMRFSIDAGDLTKMKELRRRLNCKPFRWFIENVAPDLIDWYPPIEPEPFAFGVIQ 1005
Query: 217 ----SGMCIDSACKPTDMHKPVGLYPCHKQGGN-----QFWMMSKHGEIRRD--EACLDY 265
G+C+ D K L C K N Q + + E++ CLD
Sbjct: 1006 SQANKGLCV-GVVNVVD-QKGTALVACAKDKVNPERAEQHFQFTWRREVKSMLWAQCLDV 1063
Query: 266 A----GGDVILYPCHGSKGNQYFEYD 287
A G ++ L+ CH +GNQ F+YD
Sbjct: 1064 ANHSVGVELQLFSCHTQQGNQLFQYD 1089
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 97/336 (28%), Positives = 137/336 (40%), Gaps = 97/336 (28%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWNLQFNWH 67
WL PLL+ +A N V PLI I D TF + GR G FDW +
Sbjct: 385 WLPPLLEPIAENPKTCVCPLIDVIDDQTFNIHPQDDGGR---------GLFDWRFHYKRL 435
Query: 68 AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF-- 125
A+ E +R + P +P MAGGLF+I FF +LG YD DIWG E ELSFK
Sbjct: 436 ALKESDRV---SPTAPFPSPVMAGGLFAIGTNFFWELGGYDEELDIWGAEQYELSFKIWQ 492
Query: 126 ------------------NWHAIPERER-----KRHKNAAEPVWTPTMAGGLFSIDKAFF 162
++ P + + HK AE +W ++ D +
Sbjct: 493 CGGRMLDAPCSRFSHIYRSYSPFPNSRKYDFITRNHKRVAE-IWMDEYKQYIYDRDPERY 551
Query: 163 EKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-EVSNDW----- 216
+ D GD+T K LR L CK F+W+L EV+ +
Sbjct: 552 A---------------------RSDAGDLTKMKALREKLQCKPFEWFLKEVAPEILQLYP 590
Query: 217 --------SG---------MCIDSACKPTDMHKPVGLYPC-----HKQGGNQFWMMSKHG 254
SG +CID+ +P P+G++PC H + NQ++++S H
Sbjct: 591 PVEPEPFASGAIQSIAEPTLCIDTMQRPRG--NPIGMHPCDSDLIHPKNMNQYFVLSWHR 648
Query: 255 EIRR--DEACLDYAG----GDVILYPCHGSKGNQYF 284
+I++ DE C D V +Y CH K Q+
Sbjct: 649 DIQQKSDEQCFDVPESAPRSPVTIYTCHNIKYLQHL 684
>gi|395840002|ref|XP_003792859.1| PREDICTED: N-acetylgalactosaminyltransferase 7 isoform 1 [Otolemur
garnettii]
Length = 657
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 89/319 (27%), Positives = 142/319 (44%), Gaps = 37/319 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV W PL+ ++++ + PLI I +T+E+ G Y G +DW++
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYAR--GAWDWSML 361
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + RE+ K EP +P MAGGLF+I++ FF +LG YD G IWGGEN E+S+
Sbjct: 362 WKRVPLTLREKSLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISY 421
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
K W + G + L Y ++W E +
Sbjct: 422 KI-WQCGGKLLFVPCSRVGHIYRLEGWQGNPPPVSVGSSPTLKNYVRVVEVWWDEYKDYF 480
Query: 184 FKG-------DFGDVTSRKELRRNLGCKSFKWYLE--------------VSNDW------ 216
+ +GD++ K+ R + CKSFKW++E + DW
Sbjct: 481 YASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDIPSHYPLPPKNIDWGEIRGF 540
Query: 217 -SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ CIDS K V L PCH+ GGNQ + +++ ++ + + CL G +++
Sbjct: 541 ETAYCIDSMGKTNGGF--VELGPCHRMGGNQLFRINEANQLMQYDQCLTKGPDGSKIMIT 598
Query: 274 PC--HGSKGNQYFEYDYKY 290
C +G K QYF+ Y++
Sbjct: 599 HCSLNGFKEWQYFKNLYRF 617
>gi|324520154|gb|ADY47570.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Ascaris suum]
Length = 286
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 81/252 (32%), Positives = 115/252 (45%), Gaps = 48/252 (19%)
Query: 73 ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA--- 129
ER+ H +A P+ PT+AGGLF+ID+ FF +G+YD G +WGGENLE+SF+ W
Sbjct: 2 ERRNHDRSA-PIQAPTIAGGLFAIDRQFFYDIGSYDEGMQVWGGENLEISFRV-WTCGGS 59
Query: 130 --IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKG- 186
I R H + +T GG + + ++W E E +K
Sbjct: 60 LEIHPCSRVGHVFRKQTPYT--FPGGTAKVIHHNAARTA------EVWMDEYKEFFYKMV 111
Query: 187 ------DFGDVTSRKELRRNLGCKSFKWYLE-----------------VSNDWSGMCIDS 223
D GD+ RK LR NL C+SF+WYLE + N + C+D+
Sbjct: 112 PAARSVDVGDLADRKALRENLQCRSFRWYLENIYPEAPIPRGFKSIGQIKNPSTTKCVDT 171
Query: 224 ACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGG-------DVILYPCH 276
+ + G+ CH GGNQ W ++ GE+R DE CL DV L C
Sbjct: 172 LGRSAG--EAAGVTVCHGIGGNQAWSLTSDGEVRSDETCLAADRAADKAKKIDVKLEKCS 229
Query: 277 GSKGNQYFEYDY 288
+ N ++DY
Sbjct: 230 TTSVNVNHQFDY 241
>gi|354481325|ref|XP_003502852.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Cricetulus griseus]
Length = 715
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 96/327 (29%), Positives = 141/327 (43%), Gaps = 61/327 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 352 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 405
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 406 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 463
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP + P P ++A L + + + ++ Y
Sbjct: 464 KVWMCGGRMEDIPCSRVGHIYRKSVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 520
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDV ++K LR +L CKSFKW++ E+
Sbjct: 521 RPEYRHLS----AGDVVAQKRLRGSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 576
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIR------RDEA 261
N +G+C D+ + + P+ L C + G W + +IR +
Sbjct: 577 NVGTGLCTDTKHGTSGL--PLRLETCIRGRGEAAWNSMQVFTFTWKEDIRPGDPQHTKKL 634
Query: 262 CLDYAGGD--VILYPCHGSKGNQYFEY 286
C D + V LY CH KGNQ ++Y
Sbjct: 635 CFDAVSHNSPVTLYDCHSMKGNQLWKY 661
>gi|449664489|ref|XP_002168298.2| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Hydra
magnipapillata]
Length = 599
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 85/306 (27%), Positives = 133/306 (43%), Gaps = 62/306 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PL+ + + + + +P+I I D F + P S+ G F+W +
Sbjct: 228 CEVGGNWLPPLIAPIQEDPTTLTAPIIDGINWDDFSIN--PVYQKGSHSR--GIFEWGML 283
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ +PE+E ++ +EP +PT AGGLF+I +++F++LG YD G IWGGEN ELSF
Sbjct: 284 YKETDLPEKEARKRLYHSEPYNSPTHAGGLFAIKRSWFKELGWYDPGLLIWGGENYELSF 343
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTM---------------AGGLFSIDKAFFEKLGTY 168
K W +W P +G + L Y
Sbjct: 344 KL-WQC-----------GGRSLWVPCSHVSHVYRGHSCSSCHSGDMGRKWSGIPLSLRNY 391
Query: 169 DSGFDIWGGENLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYL---------- 210
++W + + F D GDV+ + L++ + CKSF W++
Sbjct: 392 KRLIEVWFDDKYKEFFYTREPLARFIDTGDVSEQMALKKRMNCKSFTWFMEEIAYDVLKK 451
Query: 211 -----------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD 259
EV N + +CID+ + +GL CHK GGNQ W ++ G++
Sbjct: 452 YPEPPPNAHWGEVRNIATNLCIDTLNRSPPYR--IGLSGCHKSGGNQLWRLNTLGQLASG 509
Query: 260 EACLDY 265
E C+ Y
Sbjct: 510 EWCVRY 515
>gi|170591827|ref|XP_001900671.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158591823|gb|EDP30426.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 597
Score = 116 bits (291), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 103/342 (30%), Positives = 134/342 (39%), Gaps = 98/342 (28%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV +RWL+PLLD + + VV P+I I DT + P GG W+L
Sbjct: 245 CEVNERWLEPLLDRIVADRHTVVCPVIDIIDADTLKYIESP--------VCKGGMSWSLA 296
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W P K PV +PTMAGGLF+IDK +F LG YD G +IWG EN+E+S
Sbjct: 297 FKWDYFPPLYFDEPKQYVRPVKSPTMAGGLFAIDKKYFNMLGQYDPGMEIWGAENVEISL 356
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYDSGFD----- 173
+ +W M GG I F + Y G D
Sbjct: 357 R--------------------IW---MCGGRLEIVPCSRVGHIFRQRRPYGLGIDSMGRN 393
Query: 174 ------IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE------VSN 214
IW E ++ + + GD+ KELRR L CK F WYL+ + N
Sbjct: 394 AARTANIWLDEYIDQFYAAKPNLRGINIGDIREMKELRRKLHCKPFLWYLQNIYPELLPN 453
Query: 215 DWSGMCIDSACKPTDMHK-------------------------------PVGLYPCHKQG 243
+ M ID K +DM + V + C K
Sbjct: 454 NHPTM-ID--LKKSDMLRSRNIARYHIILYNTSLCLTAQSVNGRLVRGSSVVVEYCRKGD 510
Query: 244 GNQFWMMSKHGEIR---RDEACLDYAGGDVILYPCHGSKGNQ 282
+Q W +K GE+R CLD G IL CH +Q
Sbjct: 511 RHQIWRWTKLGELRPMGSATLCLDSLKGPRIL-KCHLQGAHQ 551
>gi|195436945|ref|XP_002066406.1| GK18112 [Drosophila willistoni]
gi|194162491|gb|EDW77392.1| GK18112 [Drosophila willistoni]
Length = 588
Score = 116 bits (291), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 92/331 (27%), Positives = 144/331 (43%), Gaps = 61/331 (18%)
Query: 5 EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
EV ++W++PLL ++ +S + P+I I DTF + P L GGF+W L F
Sbjct: 215 EVNRQWVEPLLRLIKAENSTLAVPVIDLINADTFG--YTPSPLVR------GGFNWGLHF 266
Query: 65 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
W +PE K+ ++ P +PTMAGGLF++++ +F+ +G YD DIWGGEN+E+SF+
Sbjct: 267 RWENLPEGTLKQPEDFRGPFRSPTMAGGLFAVNRLYFQHIGEYDMAMDIWGGENIEISFR 326
Query: 125 F-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
+ +P P +P A + K + F + ++
Sbjct: 327 AWQCGGSIKIVPCSRVGHIFRKRRPYTSPDGANTML---KNSLRLAYVWMDRFKDYYIKH 383
Query: 180 LELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------VSNDWSG 218
++S D+GD++ R +LR L C F WYL+ + W
Sbjct: 384 EKVSKDFDYGDISERVKLREKLQCHDFDWYLKNIYPELPIPGEEPKKTAAAAPIYQPWHS 443
Query: 219 M---CIDS---------ACKPTDMHKPVG---------LYPCHKQGGNQFWMMSKHGEIR 257
IDS C K G L PCH NQ W ++ EI
Sbjct: 444 RKRNYIDSYQLRLSGTELCASVVAPKVKGFWKKGSGLQLQPCH-NSPNQIWYETEKSEII 502
Query: 258 RDE-ACLDYAG-GDVILYPCHGSKGNQYFEY 286
D+ CL+ +G V++ CH G+Q + +
Sbjct: 503 LDKLLCLEASGDAQVVINKCHEMLGDQQWRH 533
>gi|395840004|ref|XP_003792860.1| PREDICTED: N-acetylgalactosaminyltransferase 7 isoform 2 [Otolemur
garnettii]
Length = 657
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 87/319 (27%), Positives = 143/319 (44%), Gaps = 37/319 (11%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV W PL+ ++++ + PLI I + + + P + F G +DW+L
Sbjct: 304 CEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIE--PQQGGDEDGFARGAWDWSLL 361
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ + +E+ + K+ EP +P MAGGLF+I++ FF +LG YD G IWGGEN E+S+
Sbjct: 362 WKRIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISY 421
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
K W + G + L Y ++W E +
Sbjct: 422 KI-WQCGGKLLFVPCSRVGHIYRLEGWQGNPPPVSVGSSPTLKNYVRVVEVWWDEYKDYF 480
Query: 184 FKG-------DFGDVTSRKELRRNLGCKSFKWYLE--------------VSNDW------ 216
+ +GD++ K+ R + CKSFKW++E + DW
Sbjct: 481 YASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDIPSHYPLPPKNIDWGEIRGF 540
Query: 217 -SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
+ CIDS K V L PCH+ GGNQ + +++ ++ + + CL G +++
Sbjct: 541 ETAYCIDSMGKTNGGF--VELGPCHRMGGNQLFRINEANQLMQYDQCLTKGPDGSKIMIT 598
Query: 274 PC--HGSKGNQYFEYDYKY 290
C +G K QYF+ Y++
Sbjct: 599 HCSLNGFKEWQYFKNLYRF 617
>gi|348568063|ref|XP_003469818.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5-like
[Cavia porcellus]
Length = 499
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 70/217 (32%), Positives = 113/217 (52%), Gaps = 23/217 (10%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL+PLL ++++S VV+P+I I D L++ P L G FDW LQ
Sbjct: 287 CEVNRVWLEPLLAAISKDSRTVVTPVIDII--DGISLQYLPSPLVR------GAFDWKLQ 338
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W ++ E + P+ +P MAGG+F++ + FF +LG YD D+WGGENLELS
Sbjct: 339 FKWDSVFSYETDSEGSPTNPIRSPAMAGGIFAMHRPFFYELGEYDKDMDLWGGENLELSL 398
Query: 124 KF-----NWHAIP-ERERKRHKNAAEP--VWTPTMAGGLFSIDKAFFEKLGTYDSGFDIW 175
+ IP R K ++P + +A + + ++ Y F +
Sbjct: 399 RIWMCGGQLLIIPCSRVGHITKLYSKPDSALSKAVARNHLRLVHVWLDE---YKEQFFLR 455
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV 212
+ ++ +G+++ R +LR+ LGC+SF+WYL+
Sbjct: 456 NPDLKSMT----YGNISERVQLRKQLGCRSFQWYLDT 488
>gi|308485607|ref|XP_003105002.1| CRE-GLY-11 protein [Caenorhabditis remanei]
gi|308257323|gb|EFP01276.1| CRE-GLY-11 protein [Caenorhabditis remanei]
Length = 624
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 108/221 (48%), Gaps = 33/221 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL PLLD + +N VV P+I I D +++ + + GG +W +
Sbjct: 277 CEVNEEWLPPLLDQIKQNRRRVVCPIIDII--DAITMKYVESPVCT------GGVNWAMT 328
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W N P+ +PTMAGGLF+ID+ +F ++G+YD G D+WG EN+E+SF
Sbjct: 329 FKWDYPHRSYFDDPMNYVNPLKSPTMAGGLFAIDRDYFFEIGSYDEGMDVWGAENVEISF 388
Query: 124 KFNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSG----FDIWGGE 178
+ W E + P + G +F + + K + +W E
Sbjct: 389 RI-WTC-----------GGELLIMPCSRVGHIFRRQRPYGIKTDSMGKNSVRVARVWLDE 436
Query: 179 NLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYLE 211
LE F D+GD+TSR LR+NL CK FKWYLE
Sbjct: 437 YLENFFVARPTYRTFTDYGDLTSRINLRQNLQCKPFKWYLE 477
>gi|341889625|gb|EGT45560.1| hypothetical protein CAEBREN_24622 [Caenorhabditis brenneri]
Length = 625
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 74/221 (33%), Positives = 110/221 (49%), Gaps = 33/221 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL PLLD + +N VV P+I I D +++ + + GG +W +
Sbjct: 278 CEVNEEWLPPLLDQIKQNRRRVVCPIIDII--DAITMKYVESPVCT------GGVNWAMT 329
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + N P+ +PTMAGGLF+ID+ +F ++G+YD G D+WG EN+E+SF
Sbjct: 330 FKWDYPHRSYFEDPMNYVNPLKSPTMAGGLFAIDRDYFFEIGSYDEGMDVWGAENVEISF 389
Query: 124 KFNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSG----FDIWGGE 178
+ W E + P + G +F + + K + +W E
Sbjct: 390 RI-WTC-----------GGELLIMPCSRVGHIFRRQRPYGIKTDSMGKNSVRLARVWLDE 437
Query: 179 NLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYLE 211
LE F+ ++GD+TSR LR+NL CK FKWYLE
Sbjct: 438 YLENFFEARPTYRTFTEYGDLTSRINLRQNLQCKPFKWYLE 478
>gi|194222233|ref|XP_001490001.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Equus
caballus]
Length = 539
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 94/312 (30%), Positives = 137/312 (43%), Gaps = 59/312 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL+PLL + + VV P+I I DDTFE + GGF+W L
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263
Query: 64 FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG-GENLEL 121
F W+ +P+RE R K + PV +G + ++ ++ IW G +LE+
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPV--SCFSGNMTALPTGLLYNSCSFSQ---IWQCGGSLEI 318
Query: 122 SFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
+ RK A P T GG + +L ++W E +
Sbjct: 319 ---VTCSHVGHVFRK-----ATPY---TFPGGTGHVINKNNRRLA------EVWMDEFKD 361
Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWS 217
+ K D+GDV+ RK LR NL CK F WYL E+ N +
Sbjct: 362 FFYIISPGVVKVDYGDVSVRKSLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVET 421
Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPC 275
C+D+ + + + VG++ CH GGNQ + + EIR D+ CLD + G VI+ C
Sbjct: 422 NQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIMLKC 479
Query: 276 HGSKGNQYFEYD 287
H +GNQ +EYD
Sbjct: 480 HHMRGNQLWEYD 491
>gi|260789712|ref|XP_002589889.1| hypothetical protein BRAFLDRAFT_81982 [Branchiostoma floridae]
gi|229275074|gb|EEN45900.1| hypothetical protein BRAFLDRAFT_81982 [Branchiostoma floridae]
Length = 534
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 144/335 (42%), Gaps = 70/335 (20%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV WL PLL+ ++ + + V P I I TFE + G G FDW Q
Sbjct: 168 CEVNVNWLPPLLEPISVSMTTVTIPTIDVIDHATFEYKEQQGGPMR------GVFDW--Q 219
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
N+ IP + R R P TP M GG+F+IDK FF LG YDSG +IWGGE ELS
Sbjct: 220 LNYKRIPVLDGRGRKVRPTLPFSTPVMPGGVFAIDKEFFHHLGGYDSGLEIWGGEQFELS 279
Query: 123 FKFNWH---AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
FK W + E R + ++P ++ D + L Y ++W +
Sbjct: 280 FKI-WQCGGVLQEVPCSRVGHVFRK-FSP------YATDNDVLQILKNYMRVAEVWMDDY 331
Query: 180 LELSFKG-----------DFGDVTSRKELRRNLGCKSFKWYL-EVSNDW----------- 216
+ +K D GD++S+K LR+ LGC+ F W++ EV++D
Sbjct: 332 KQYYYKRMLRGPKNVTNFDLGDLSSQKPLRQRLGCRDFGWFMREVASDLVKHYPLKDPDV 391
Query: 217 ----------SGMCIDSACKPTDMHKPVGLYPCHKQGG---------NQFWMMSKHGEIR 257
+G+C+DS + PV L C + G NQ + + EI
Sbjct: 392 LQQGRIQSVGTGLCLDS--DGLNSEDPVVLRRCRDRQGAFVLTKTYPNQNFTYTGLKEIE 449
Query: 258 RDE--ACLDYAG----GDVILYPCHGSKGNQYFEY 286
+ C D ++ CHG GNQ +EY
Sbjct: 450 TTDRHLCFDVDSLSREKTLVFLTCHGEGGNQMWEY 484
>gi|395504936|ref|XP_003756802.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Sarcophilus harrisii]
Length = 651
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 99/327 (30%), Positives = 141/327 (43%), Gaps = 61/327 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +A N +V P+I I +D F G T + G FDW +
Sbjct: 285 CEANVNWLPPLLDRIASNRKTIVCPMIDVIDNDHF------GYKTQAGDAMRGAFDWEMY 338
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 339 YKRIPIPLELQK--SDPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 396
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPT---MAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P PT +A L + + + ++ Y I+
Sbjct: 397 KVWMCGGRMEDIPCSRVGHIYRKYIPYKIPTGVSLARNLKRVAEVWMDEYAEY-----IY 451
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
+ L GDVT++K+LR +L CKSFKW++ E+
Sbjct: 452 --QRLPEYRHLSTGDVTAQKDLRNHLNCKSFKWFMTEIAWDLPRYYPPVEPAAAAWGEIR 509
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
N + +CI + K P+ L C K W S +IR +
Sbjct: 510 NVGTQLCIGT--KHGAPGSPLRLESCVKGRAEAAWSNVQVFTFSWREDIRPGDPQHTKKF 567
Query: 262 CLDYA--GGDVILYPCHGSKGNQYFEY 286
C D V LY CHG KGNQ ++Y
Sbjct: 568 CFDTISHSSPVTLYDCHGMKGNQLWKY 594
>gi|410949405|ref|XP_003981412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Felis
catus]
Length = 603
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 95/324 (29%), Positives = 135/324 (41%), Gaps = 55/324 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 240 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 294 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 351
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K IP P P ++ + + Y E
Sbjct: 352 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEHIYQRRPE 411
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVSNDW 216
LS GDV ++K+LR +L CKSFKW++ E+ N
Sbjct: 412 YRHLS----AGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVG 467
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------CLD 264
+G+C D+ K + P+ L C + G W + +IR + C D
Sbjct: 468 TGLCADT--KHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKFCFD 525
Query: 265 YAGGD--VILYPCHGSKGNQYFEY 286
V LY CH KGNQ ++Y
Sbjct: 526 AISNTSPVTLYDCHSMKGNQLWKY 549
>gi|291387688|ref|XP_002710374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Oryctolagus cuniculus]
Length = 603
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 240 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 294 YKRIPIPPELQK--VDPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 351
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P P ++A L + + + ++ Y
Sbjct: 352 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 408
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDV ++K+LR +L CKSFKW++ E+
Sbjct: 409 RPEYRHLS----AGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 464
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
N +G+C D+ K + P+ L C + G W + +IR +
Sbjct: 465 NVGTGLCADT--KHWALGSPLRLESCVRDRGEAAWNSMQVFTFTWREDIRPGDPQHTKKF 522
Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
C D V LY CH KGNQ ++Y
Sbjct: 523 CFDAISHTSPVTLYDCHSMKGNQLWKY 549
>gi|417515619|gb|JAA53628.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Sus
scrofa]
Length = 506
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 95/324 (29%), Positives = 135/324 (41%), Gaps = 55/324 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 143 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 196
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 197 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 254
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K IP P P ++ + + Y E
Sbjct: 255 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEHIYQRRPE 314
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVSNDW 216
LS GDV ++K+LR +L CKSFKW++ E+ N
Sbjct: 315 YRHLS----AGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVG 370
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------CLD 264
+G+C D+ K + P+ L C + G W + +IR + C D
Sbjct: 371 TGLCADT--KHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKFCFD 428
Query: 265 YAG--GDVILYPCHGSKGNQYFEY 286
V LY CH KGNQ ++Y
Sbjct: 429 AISHTSPVTLYDCHSMKGNQLWKY 452
>gi|395817210|ref|XP_003782067.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Otolemur garnettii]
Length = 603
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 139/327 (42%), Gaps = 61/327 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 240 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 294 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 351
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P P ++A L + + + ++ Y
Sbjct: 352 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 408
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDV ++K LR +L CKSFKW++ E+
Sbjct: 409 RPEYRHLS----AGDVAAQKRLRTSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 464
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
N +G+C D+ K + P+ L C + G W + +IR +
Sbjct: 465 NVGTGLCADT--KHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKF 522
Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
C D V LY CH KGNQ ++Y
Sbjct: 523 CFDAISHTSPVTLYDCHSMKGNQLWKY 549
>gi|327262637|ref|XP_003216130.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Anolis carolinensis]
Length = 500
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 56/122 (45%), Positives = 74/122 (60%), Gaps = 7/122 (5%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WL PLL + + SHVVSP+I I DTF ++ GGFDW+L
Sbjct: 178 CEVNKDWLLPLLQRIKEDPSHVVSPVIDIINLDTFAY-------VAASSDLRGGFDWSLH 230
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + +++ + + EP+ TP +AGGLF IDKA+F LG YD+ DIWGGEN E+SF
Sbjct: 231 FKWEQLSPKQKAKRTDPTEPIKTPIIAGGLFVIDKAWFNHLGKYDAAMDIWGGENFEISF 290
Query: 124 KF 125
+
Sbjct: 291 RV 292
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 65/220 (29%), Positives = 93/220 (42%), Gaps = 36/220 (16%)
Query: 96 IDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLF 155
ID + + D+ GG + S F W + +++ + + EP+ TP +AGGLF
Sbjct: 204 IDIINLDTFAYVAASSDLRGG--FDWSLHFKWEQLSPKQKAKRTDPTEPIKTPIIAGGLF 261
Query: 156 SIDKAFFEKLGTYDSGFDIWGGENLELSFK-------------GDFGDVTSRK------E 196
IDKA+F LG YD+ DIWGGEN E+SF+ G V +K E
Sbjct: 262 VIDKAWFNHLGKYDAAMDIWGGENFEISFRVWMCGGSLEIIPCSRVGHVFRKKHPYVFPE 321
Query: 197 LRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI 256
N K+ K EV D A +P +P G P + + G I
Sbjct: 322 GNANTYIKNTKRTAEVWMDEYKQYY-YAARPAAQGRPYGEIPEES--------LYQTGMI 372
Query: 257 RRDEACLDYAGGD------VILYPCHGSKGNQYFEYDYKY 290
R+ + CL+ + VIL PC SKG ++ Y
Sbjct: 373 RQRQRCLETQKSEGQDFPVVILNPCITSKGPASAAQEWTY 412
>gi|312381524|gb|EFR27256.1| hypothetical protein AND_06164 [Anopheles darlingi]
Length = 377
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 73/214 (34%), Positives = 108/214 (50%), Gaps = 14/214 (6%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL+PLLD L + + ++SP+I I +TF R RL GGFDW+L
Sbjct: 7 CEVNRGWLEPLLDRLQLDPTGLLSPVIDIIDAETFGYRANSARLR-------GGFDWSLH 59
Query: 64 FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
F W I E E R + ++P ++P ++GG+F + K+ FE+LG +D G DIWGGE+LE+S
Sbjct: 60 FRWLPIAEEELEHRRHDESQPFYSPAISGGIFIVAKSLFEQLGGFDPGMDIWGGESLEMS 119
Query: 123 FK-----FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
K + +P P P L + L D + +
Sbjct: 120 LKAWLCGAHVEVVPCSRIGHVFRRKHPFSFPPDGSHLTYLRNTKRVALVWMDEFKNFFYD 179
Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
LE + D G V +++ELR+ L C+ F WYL+
Sbjct: 180 VRLE-AIAIDAGSVRAQQELRQKLSCRRFSWYLQ 212
>gi|443727149|gb|ELU14019.1| hypothetical protein CAPTEDRAFT_197005 [Capitella teleta]
Length = 613
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 101/330 (30%), Positives = 140/330 (42%), Gaps = 70/330 (21%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +A + VV P I + +TF R + + G FDW +
Sbjct: 248 CEANVNWLPPLLDPIAEDYRTVVCPFIDVVDYETFAYR-------AQDEGARGAFDW--E 298
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F + +P K+ A P +P MAGGLF+I +F +LG YD G DIWGGE ELSF
Sbjct: 299 FFYKRLPLLPEDL-KHPARPFKSPVMAGGLFAISAKWFWELGGYDPGLDIWGGEQYELSF 357
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGT-------YDSGFDIWG 176
K W + + P G A F G Y ++W
Sbjct: 358 KL-WQC-----------GGQMLDAPCSRVGHIYRKFAPFPNPGVGDFVGRNYRRVAEVWM 405
Query: 177 GENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL------------------- 210
E E +K G++T + +R+ L CK FKW++
Sbjct: 406 DEYAEFLYKRRPQYRSIQPGNITEQLAIRKKLNCKPFKWFMEEIAFDLPKKYPPIEPPAV 465
Query: 211 ---EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQ----GGNQFWMMSKHGEIR--RDEA 261
E+ N + +C+D+ K + GL C K GG Q ++ H +IR +
Sbjct: 466 AEGEMRNVGANLCVDTRFK--GQGETFGLEKCAKDEPGIGGEQRLQITWHKDIRPGKRSF 523
Query: 262 CLDYAG----GDVILYPCHGSKGNQYFEYD 287
C D + VILY CHG KGNQ+F+YD
Sbjct: 524 CFDVSTSVEKAPVILYNCHGMKGNQWFKYD 553
>gi|348575151|ref|XP_003473353.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Cavia porcellus]
Length = 602
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 139/327 (42%), Gaps = 61/327 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 239 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 292
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 293 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 350
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P P ++A L + + + + Y
Sbjct: 351 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDDYAEY---IYQR 407
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDV ++K+LR +L CKSFKW++ E+
Sbjct: 408 RPEYRHLS----AGDVVAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 463
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
N +G+C D+ K + P+ L C + G W + +IR +
Sbjct: 464 NVGTGLCADT--KHGALGAPLRLESCIRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKF 521
Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
C D V LY CH KGNQ ++Y
Sbjct: 522 CFDAISHTSPVTLYDCHSMKGNQLWKY 548
>gi|350594474|ref|XP_003134177.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Sus
scrofa]
Length = 624
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 95/324 (29%), Positives = 135/324 (41%), Gaps = 55/324 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 261 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 314
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 315 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 372
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
K IP P P ++ + + Y E
Sbjct: 373 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEHIYQRRPE 432
Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVSNDW 216
LS GDV ++K+LR +L CKSFKW++ E+ N
Sbjct: 433 YRHLS----AGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVG 488
Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------CLD 264
+G+C D+ K + P+ L C + G W + +IR + C D
Sbjct: 489 TGLCADT--KHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKFCFD 546
Query: 265 YAG--GDVILYPCHGSKGNQYFEY 286
V LY CH KGNQ ++Y
Sbjct: 547 AISHTSPVTLYDCHSMKGNQLWKY 570
>gi|355691777|gb|EHH26962.1| hypothetical protein EGK_17053, partial [Macaca mulatta]
gi|355750353|gb|EHH54691.1| hypothetical protein EGM_15579, partial [Macaca fascicularis]
Length = 551
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 188 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 241
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 242 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 299
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P P ++A L + + + ++ Y
Sbjct: 300 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 356
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDV ++K+LR +L CKSFKW++ E+
Sbjct: 357 RPEYRHLS----AGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 412
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
N +G+C D+ K + P+ L C + G W + +IR +
Sbjct: 413 NVGTGLCADT--KHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKF 470
Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
C D V LY CH KGNQ ++Y
Sbjct: 471 CFDAISHTSPVTLYDCHSMKGNQLWKY 497
>gi|402873191|ref|XP_003900469.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Papio
anubis]
Length = 637
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 274 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 327
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 328 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 385
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P P ++A L + + + ++ Y
Sbjct: 386 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 442
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDV ++K+LR +L CKSFKW++ E+
Sbjct: 443 RPEYRHLS----AGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 498
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
N +G+C D+ K + P+ L C + G W + +IR +
Sbjct: 499 NVGTGLCADT--KHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKF 556
Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
C D V LY CH KGNQ ++Y
Sbjct: 557 CFDAISHTSPVTLYDCHSMKGNQLWKY 583
>gi|449679600|ref|XP_004209371.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Hydra magnipapillata]
Length = 565
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 91/320 (28%), Positives = 140/320 (43%), Gaps = 57/320 (17%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PL+ + +N V P + I D+F R + G F+W
Sbjct: 200 CEANVGWLPPLVSEIEKNYRCVTCPTVDFIDHDSFYYR-------GVDPYIRGTFNWRFD 252
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ I E ++ K+ E V +P MAGGLF+I K F+E+LG YD G +WGGE E+SF
Sbjct: 253 YKERGITEHQKAARKSVTEGVRSPVMAGGLFAISKKFWEELGKYDPGMYVWGGEQYEISF 312
Query: 124 KFNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAF-----FEKLGTYDSGFDIWGG 177
K W E + P + G ++ + + F L + ++W
Sbjct: 313 KL-WMC-----------GGEMLNMPCSRVGHVYRRNVPYTYNKPFASLINFKRVAEVWMD 360
Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL-------------------E 211
E E ++G + G+++ R ++R CKSFKWYL E
Sbjct: 361 EFKEFLYRGNPMVRSQNAGNISERIKVRERNKCKSFKWYLLNVANDTVRTRYEPDRASGE 420
Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRR-DEACLD--YAGG 268
+ N + +C+D+ + + + L C ++ NQ + + E+ + E CLD YA
Sbjct: 421 IENTHTKLCLDTYG--ANAGRKIKLSKCGQRNSNQIFRWTYIYELHQYPEECLDARYADM 478
Query: 269 D-VILYPCHGSKGNQYFEYD 287
D V + CH GNQ F YD
Sbjct: 479 DNVYIEKCHEMGGNQKFLYD 498
>gi|109079467|ref|XP_001111603.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
isoform 5 [Macaca mulatta]
Length = 603
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 240 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 293
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 294 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 351
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P P ++A L + + + ++ Y
Sbjct: 352 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 408
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDV ++K+LR +L CKSFKW++ E+
Sbjct: 409 RPEYRHLS----AGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 464
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
N +G+C D+ K + P+ L C + G W + +IR +
Sbjct: 465 NVGTGLCADT--KHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKF 522
Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
C D V LY CH KGNQ ++Y
Sbjct: 523 CFDAISHTSPVTLYDCHSMKGNQLWKY 549
>gi|311275140|ref|XP_003134592.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5-like
[Sus scrofa]
Length = 446
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 73/223 (32%), Positives = 105/223 (47%), Gaps = 37/223 (16%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV K WL+PLLD + ++ VV P++ I D L + P + G F+W+LQ
Sbjct: 234 CEVNKIWLEPLLDAIVKDPKMVVCPIMDVI--DYVTLEYKPSPVVR------GVFNWHLQ 285
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + E P+ +P M GGLF+I + +F ++G YD G ++WGGENLELS
Sbjct: 286 FEWDRVFSYEMDGPDGPTRPIRSPAMVGGLFAIHRHYFNEIGQYDKGMNLWGGENLELSL 345
Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF--------DIW 175
+ W + P G I+K +F G +W
Sbjct: 346 RI-WMC-----------GGQLFLLPCSRVG--HINKPYFTNQGEIKKAMAYNNLRIVHVW 391
Query: 176 GGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE 211
E E F + +G+V+ R ELR+ LGCKSF+WYL+
Sbjct: 392 LDEYKEQFFLQNPRLKSLAYGNVSERVELRKRLGCKSFQWYLD 434
>gi|18314429|gb|AAH22021.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [Homo sapiens]
gi|51105933|gb|EAL24517.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 15 [Homo sapiens]
gi|119574364|gb|EAW53979.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5, isoform CRA_c
[Homo sapiens]
gi|123979772|gb|ABM81715.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [synthetic
construct]
gi|123994539|gb|ABM84871.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [synthetic
construct]
Length = 443
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 73/220 (33%), Positives = 104/220 (47%), Gaps = 31/220 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL+PLL +A++ VV PLI I D T E + P G FDWNLQ
Sbjct: 230 CEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLEYKPSP--------LVRGTFDWNLQ 281
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + E + + +P+ +P M+GG+F+I + +F ++G YD D WG ENLELS
Sbjct: 282 FKWDNVFSYEMDGPEGSTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGRENLELSL 341
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ IP R H + + T+ + Y +W E
Sbjct: 342 RIWMCGGQLFIIP-CSRVGHISKKQTGKPSTIISAM----------THNYLRLVHVWLDE 390
Query: 179 NLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE 211
E F +G++ R ELR+ LGCKSF+WYL+
Sbjct: 391 YKEQFFLRKPGLKYVTYGNIRERVELRKRLGCKSFQWYLD 430
>gi|345321967|ref|XP_001514624.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2 [Ornithorhynchus anatinus]
Length = 484
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 84/299 (28%), Positives = 134/299 (44%), Gaps = 37/299 (12%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE + WL+PLL +A N + VV+P++ I TF+ S G FDW L
Sbjct: 114 CECHRGWLEPLLSRIASNRNRVVTPILDVIDWKTFQY-------FHSEDLQQGVFDWKLD 166
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F+W +PE++RK ++ P+ +P + GG+ ++D+ +F+ G YDS +WGGENLELS
Sbjct: 167 FHWELLPEQKRKVRQSPISPIRSPVVPGGVMAMDRHYFQNTGAYDSLMTLWGGENLELSI 226
Query: 124 KF-----NWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
+ + +P R ++N A P L + + LG++ F
Sbjct: 227 RVWLCGGSVEVLPCSRVGHVYRNQASDT-LPNQEAILRNKIRIAETWLGSFKEIFYQHSP 285
Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVSNDWSGM 219
E L K + D + R +L+R LGC++F W+L ++ + G
Sbjct: 286 EAFSLR-KVEKPDCSERLQLQRRLGCRTFHWFLSNIYPELYPSERRPGFSGKLFSTRVGF 344
Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR---RDEACLDYAGGDVILYPC 275
C+D K + L PC +Q + EIR + + C D +IL C
Sbjct: 345 CVDGGSKGKIPGSSITLLPC-SDSQHQHLEYTSRKEIRSGTKLQLCFDVREEQLILQNC 402
>gi|417411867|gb|JAA52354.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 599
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 96/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CE WL PLLD +ARN +V P+I I D F T + G FDW +
Sbjct: 236 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 289
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
+ IP +K + ++P +P MAGGLF++D+ +F +LG YD G +IWGGE E+SF
Sbjct: 290 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 347
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
K IP P P ++A L + + + ++ +
Sbjct: 348 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEFAEH---IYQR 404
Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
E LS GDV ++K+LR +L CKSFKW++ E+
Sbjct: 405 RPEYRHLS----AGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 460
Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
N +G+C D+ K + P+ L C + G W + +IR +
Sbjct: 461 NVGTGLCADT--KHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKF 518
Query: 262 CLDYA--GGDVILYPCHGSKGNQYFEY 286
C D V LY CH KGNQ ++Y
Sbjct: 519 CFDAVSHSSPVTLYDCHSMKGNQLWKY 545
>gi|194749276|ref|XP_001957065.1| GF24250 [Drosophila ananassae]
gi|190624347|gb|EDV39871.1| GF24250 [Drosophila ananassae]
Length = 662
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 100/344 (29%), Positives = 141/344 (40%), Gaps = 93/344 (27%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
E WL PLL+ +A N V P I I F+ R + + G FDW
Sbjct: 294 VEANYNWLPPLLEPIALNKRTAVCPFIDVIDHSNFQYR-------AQDEGARGAFDWEFY 346
Query: 64 FN-WHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
+ +PE K+ A+P +P MAGGLF+I FF +LG YD G DIWGGE ELS
Sbjct: 347 YKRLRLLPED----LKHPADPFKSPVMAGGLFAISAEFFWELGGYDEGLDIWGGEQYELS 402
Query: 123 FKF-----------------------NWHAIPERERKRHKN--AAEPVWTPTMAGGLFSI 157
FK N + P + H+N VW L+S
Sbjct: 403 FKIWMCGGQMYDAPCSRIGHIYRGPRNHNPSPRKGDYLHRNYKRVAEVWMDEYKNYLYSH 462
Query: 158 DKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW 216
+E++ D GD+T++K +R L CKSF+W++E V+ D
Sbjct: 463 GDGIYERV---------------------DAGDLTAQKAIRTKLKCKSFRWFMEEVAFDL 501
Query: 217 ----------------------SGMCIDSACKPTDMHKPVGLYPCHKQ----GGNQFWMM 250
+C+D+ + H +G++ C +QFW +
Sbjct: 502 MKNYPPVDPPNYAMGAIQSVGNPQLCLDTMGRKK--HNRMGMFACADDLKVPQKSQFWEL 559
Query: 251 SKHGEIR--RDEACLDY----AGGDVILYPCHGSKGNQYFEYDY 288
S ++R R + CLD A V L+ CHG GNQY+ YDY
Sbjct: 560 SWKRDLRQRRKKECLDVQIWEANAPVWLWDCHGQGGNQYWYYDY 603
>gi|158289989|ref|XP_311577.4| AGAP010367-PA [Anopheles gambiae str. PEST]
gi|157018424|gb|EAA07231.4| AGAP010367-PA [Anopheles gambiae str. PEST]
Length = 587
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 139/340 (40%), Gaps = 97/340 (28%)
Query: 10 WLQPLLDVLARNSSHVVSPLIANICDDTFEL--RFPPGRLTSSYKFFIGGFDWNLQFNWH 67
WL PLL+ +A N V PLI I D TF++ + GR G FDW +
Sbjct: 224 WLPPLLEPIAENPKTCVCPLIDVIDDQTFDVHPQDEGGR---------GLFDWTFHYKRV 274
Query: 68 AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF-- 125
I +R + EP +P MAGGLF+I FF +LG YD DIWG E E+SFK
Sbjct: 275 VIKNEDRI---SPTEPFPSPVMAGGLFAIGADFFWELGGYDEELDIWGAEQYEISFKIWQ 331
Query: 126 ------------------NWHAIPERER-----KRHKNAAEPVWTPTMAGGLFSIDKAFF 162
+ P + + HK AE +W ++ D
Sbjct: 332 CGGRMLDAPCSRFGHIYRTYSPFPNSRKYDFITRNHKRVAE-IWMDEYKQYIYDRDP--- 387
Query: 163 EKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-EVSNDW----- 216
E K D GD++ K +R L CK FKW+L EV+ +
Sbjct: 388 ------------------ERYAKTDAGDMSKMKTIREKLMCKPFKWFLQEVAPEIIELYP 429
Query: 217 -----------------SGMCIDSACKPTDMHKPVGLYPCHKQ-----GGNQFWMMSKHG 254
S +CID+ + +P+GLYPC NQ+++ S H
Sbjct: 430 PVEPEPYASGSIQSVADSSLCIDTMQR--GRGEPIGLYPCSNSLIEPTNHNQYFVHSWHR 487
Query: 255 EIRRD--EACLDY----AGGDVILYPCHGSKGNQYFEYDY 288
+I+ E C D G V ++ CH +GNQ+F+YD+
Sbjct: 488 DIQHKYGEGCFDVPQSKPGSPVTIFTCHMHQGNQFFQYDH 527
>gi|281485547|ref|NP_660335.2| putative polypeptide N-acetylgalactosaminyltransferase-like protein
5 [Homo sapiens]
gi|322510123|sp|Q7Z4T8.3|GLTL5_HUMAN RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5;
AltName: Full=Polypeptide GalNAc transferase 15;
Short=GalNAc-T15; Short=pp-GaNTase 15; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 15;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 15
Length = 443
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 73/220 (33%), Positives = 104/220 (47%), Gaps = 31/220 (14%)
Query: 4 CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
CEV + WL+PLL +A++ VV PLI I D T E + P G FDWNLQ
Sbjct: 230 CEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLEYKPSP--------LVRGTFDWNLQ 281
Query: 64 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
F W + E + + +P+ +P M+GG+F+I + +F ++G YD D WG ENLELS
Sbjct: 282 FKWDNVFSYEMDGPEGSTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGRENLELSL 341
Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
+ IP R H + + T+ + Y +W E
Sbjct: 342 RIWMCGGQLFIIP-CSRVGHISKKQTGKPSTIISAM----------THNYLRLVHVWLDE 390
Query: 179 NLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE 211
E F +G++ R ELR+ LGCKSF+WYL+
Sbjct: 391 YKEQFFLRKPGLKYVTYGNIRERVELRKRLGCKSFQWYLD 430
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.140 0.474
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,339,044,611
Number of Sequences: 23463169
Number of extensions: 241110088
Number of successful extensions: 415141
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1891
Number of HSP's successfully gapped in prelim test: 158
Number of HSP's that attempted gapping in prelim test: 404195
Number of HSP's gapped (non-prelim): 5918
length of query: 290
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 149
effective length of database: 9,050,888,538
effective search space: 1348582392162
effective search space used: 1348582392162
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.9 bits)