BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy3380
         (290 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|242008519|ref|XP_002425051.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
 gi|212508700|gb|EEB12313.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
          Length = 657

 Score =  303 bits (776), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 173/362 (47%), Positives = 202/362 (55%), Gaps = 106/362 (29%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLLD +A++ + VV P+I  I D T E  F       S    +GGFDWNLQ
Sbjct: 276 CECTVGWLEPLLDRIAKDPTTVVCPVIDVIDDTTLEYNF-----RDSGGVNVGGFDWNLQ 330

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHA+PERE+KRHKN AEPVW+PTMAGGLF+IDK FFE++GTYDSGFDIWGGENLELSF
Sbjct: 331 FNWHAVPEREKKRHKNTAEPVWSPTMAGGLFAIDKNFFERIGTYDSGFDIWGGENLELSF 390

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F +   Y   SG ++  
Sbjct: 391 K--------------------TW---MCGGTLEIVPCSHVGHIFRRRSPYKWRSGVNVLK 427

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+                  KGDFGDV++RKELR+ L CKSFKWYL         
Sbjct: 428 RNSVRLAEVWLDDYAKYYYQRIGDDKGDFGDVSARKELRKRLNCKSFKWYLDNIYPELFI 487

Query: 211 --------EVSN-------------------------------------DWSGMCIDSAC 225
                   EV N                                     +WSG C+DS C
Sbjct: 488 PGEAVAGGEVRNKGLGGKTCLDSPARKADLHKAVGLFPCHRQGGNQVSNNWSGQCLDSPC 547

Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
           K  DMHKPVGL+PCHKQGGNQ+WM+SK GEIRRDEACLDYAG DVILYPCHGSKGNQY+ 
Sbjct: 548 KSEDMHKPVGLWPCHKQGGNQYWMLSKAGEIRRDEACLDYAGQDVILYPCHGSKGNQYWH 607

Query: 286 YD 287
           Y+
Sbjct: 608 YN 609


>gi|350426661|ref|XP_003494505.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 1 [Bombus impatiens]
          Length = 602

 Score =  298 bits (764), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 165/329 (50%), Positives = 198/329 (60%), Gaps = 71/329 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +AR+ + VV P+I  I D T E  +       S    +GGFDWNLQ
Sbjct: 256 CECTEGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 310

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHA+PERE+KRHKN AEPVW+PTMAGGLFSID+AFF++LGTYDSGFDIWGGENLELSF
Sbjct: 311 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSF 370

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K  W                      M GG   I         F K   Y   SG ++  
Sbjct: 371 K-TW----------------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 407

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ LS                  KG +GDV+ RK LR+ LGCKSFKWYL         
Sbjct: 408 RNSIRLSEVWLDEYAKYYYQRIGHDKGKYGDVSERKALRKKLGCKSFKWYLDNVYPELFI 467

Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   EV N   G   C+DS  +  D+HKP GLYPCH+QGGNQ+WM+SK GEIRRDE
Sbjct: 468 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPAGLYPCHRQGGNQYWMLSKTGEIRRDE 527

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
           +CLDY+G DVILYPCHGSKGNQ + Y+++
Sbjct: 528 SCLDYSGTDVILYPCHGSKGNQQWIYNHQ 556


>gi|91089275|ref|XP_970398.1| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
           castaneum]
          Length = 586

 Score =  298 bits (763), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 165/328 (50%), Positives = 198/328 (60%), Gaps = 73/328 (22%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLLD +AR+ + VV P+I  I D T E  F       S    +GGFDWNLQ
Sbjct: 240 CECTTGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHF-----HDSGGVNVGGFDWNLQ 294

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHA+PE E+KRHKN AEPV++PTMAGGLFSIDK FFE+LGTYD+GFDIWGGENLELSF
Sbjct: 295 FNWHAVPEHEKKRHKNPAEPVYSPTMAGGLFSIDKKFFERLGTYDNGFDIWGGENLELSF 354

Query: 124 K-------------------------FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSID 158
           K                         + W +     R+     AE VW    A       
Sbjct: 355 KTWMCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLRRNSVRLAE-VWLDEYA------- 406

Query: 159 KAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-------- 210
           K +++++G                  KGDFGD+TSRK LR  LGCKSFKWYL        
Sbjct: 407 KYYYQRIGNE----------------KGDFGDITSRKALREKLGCKSFKWYLDNIYPELF 450

Query: 211 ---------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD 259
                    E+ N   G   C+DS  + +D+HKPVGLYPCH+QGGNQFWM SK GEIRRD
Sbjct: 451 IPGEAVASGEIRNLGIGGKTCLDSPARRSDLHKPVGLYPCHRQGGNQFWMYSKSGEIRRD 510

Query: 260 EACLDYAGGDVILYPCHGSKGNQYFEYD 287
           EACLDY+G +VILYPCHGSKGNQ+++Y+
Sbjct: 511 EACLDYSGQEVILYPCHGSKGNQFWDYN 538


>gi|340723540|ref|XP_003400147.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 1 [Bombus terrestris]
 gi|340723542|ref|XP_003400148.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 2 [Bombus terrestris]
          Length = 602

 Score =  298 bits (763), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 165/329 (50%), Positives = 198/329 (60%), Gaps = 71/329 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +AR+ + VV P+I  I D T E  +       S    +GGFDWNLQ
Sbjct: 256 CECTEGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 310

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHA+PERE+KRHKN AEPVW+PTMAGGLFSID+AFF++LGTYDSGFDIWGGENLELSF
Sbjct: 311 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSF 370

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K  W                      M GG   I         F K   Y   SG ++  
Sbjct: 371 K-TW----------------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 407

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ LS                  KG +GDV+ RK LR+ LGCKSFKWYL         
Sbjct: 408 RNSIRLSEVWLDEYAKYYYQRIGHDKGKYGDVSERKALRKKLGCKSFKWYLDNVYPELFI 467

Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   EV N   G   C+DS  +  D+HKP GLYPCH+QGGNQ+WM+SK GEIRRDE
Sbjct: 468 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPAGLYPCHRQGGNQYWMLSKTGEIRRDE 527

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
           +CLDY+G DVILYPCHGSKGNQ + Y+++
Sbjct: 528 SCLDYSGTDVILYPCHGSKGNQQWIYNHQ 556


>gi|345484986|ref|XP_003425168.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 2 [Nasonia vitripennis]
          Length = 610

 Score =  297 bits (760), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 163/327 (49%), Positives = 196/327 (59%), Gaps = 69/327 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARN + VV P+I  I D T E  +       S    +GGFDWNLQ
Sbjct: 266 CECTEGWLEPLLDRIARNQTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 320

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHA+PERE+KRHKN AEPVW+PTMAGGLF+ID+ FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 321 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFAIDRLFFERLGTYDSGFDIWGGENLELSF 380

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   SG ++  
Sbjct: 381 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 417

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ LS                  KG++GDV+ RK LR+NLGCKSFKWYL         
Sbjct: 418 RNSIRLSEVWLDEYAKYYYQRIGHDKGNYGDVSDRKALRKNLGCKSFKWYLDNIYPELFI 477

Query: 211 --------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEAC 262
                   E+ +  S +CIDS   P D+H+ VG Y CH QGGNQ+WM+SK GEIRRDE+C
Sbjct: 478 PGEAVASGEIRHLASRLCIDSPGNPEDLHQAVGFYECHNQGGNQYWMLSKTGEIRRDESC 537

Query: 263 LDYAGGDVILYPCHGSKGNQYFEYDYK 289
           LDY+G DVILYPCHGSKGNQ + Y+ +
Sbjct: 538 LDYSGTDVILYPCHGSKGNQQWTYNTQ 564


>gi|427779849|gb|JAA55376.1| Putative polypeptide n-acetylgalactosaminyltransferase
           [Rhipicephalus pulchellus]
          Length = 683

 Score =  295 bits (755), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 167/327 (51%), Positives = 197/327 (60%), Gaps = 71/327 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARNS+ VV P+I  I D TFE  +       S    +GGFDWNLQ
Sbjct: 330 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDSTFEYHY-----RDSGGVNVGGFDWNLQ 384

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+WHA+PERER+R K++ +PVW+PTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF
Sbjct: 385 FSWHAVPERERQRRKHSWDPVWSPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 444

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K  W                      M GG   I         F K   Y   SG ++  
Sbjct: 445 K-TW----------------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLR 481

Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+      +K           GDFGDV++RK LR NL C+SF WY+         
Sbjct: 482 RNSVRLAEVWLDEYKQYYYQRIGDDLGDFGDVSARKRLRDNLKCRSFDWYVRTIYPELFV 541

Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   EV N   G   C+DS     +MHKPVG+YPCH QGGNQ+WM+SK GEIRRDE
Sbjct: 542 PGDAVASGEVRNKGQGGSSCLDSPSGRDNMHKPVGMYPCHGQGGNQYWMLSKEGEIRRDE 601

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYD 287
           ACLDYAG DVILYPCHGSKGNQ + YD
Sbjct: 602 ACLDYAGSDVILYPCHGSKGNQLWIYD 628


>gi|427789023|gb|JAA59963.1| Putative polypeptide n-acetylgalactosaminyltransferase
           [Rhipicephalus pulchellus]
          Length = 648

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 167/327 (51%), Positives = 197/327 (60%), Gaps = 71/327 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARNS+ VV P+I  I D TFE  +       S    +GGFDWNLQ
Sbjct: 295 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDSTFEYHY-----RDSGGVNVGGFDWNLQ 349

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+WHA+PERER+R K++ +PVW+PTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF
Sbjct: 350 FSWHAVPERERQRRKHSWDPVWSPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 409

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K  W                      M GG   I         F K   Y   SG ++  
Sbjct: 410 K-TW----------------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLR 446

Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+      +K           GDFGDV++RK LR NL C+SF WY+         
Sbjct: 447 RNSVRLAEVWLDEYKQYYYQRIGDDLGDFGDVSARKRLRDNLKCRSFDWYVRTIYPELFV 506

Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   EV N   G   C+DS     +MHKPVG+YPCH QGGNQ+WM+SK GEIRRDE
Sbjct: 507 PGDAVASGEVRNKGQGGSSCLDSPSGRDNMHKPVGMYPCHGQGGNQYWMLSKEGEIRRDE 566

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYD 287
           ACLDYAG DVILYPCHGSKGNQ + YD
Sbjct: 567 ACLDYAGSDVILYPCHGSKGNQLWIYD 593


>gi|328785249|ref|XP_393950.3| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like [Apis mellifera]
          Length = 635

 Score =  294 bits (752), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 171/362 (47%), Positives = 199/362 (54%), Gaps = 106/362 (29%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARN + VV P+I  I D T E  +       S    +GGFDWNLQ
Sbjct: 254 CECTEGWLEPLLDRIARNPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 308

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHA+PERE+KRHKN AEPVW+PTMAGGLFSID+AFFE+LGTYDSGFDIWGGENLELSF
Sbjct: 309 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFERLGTYDSGFDIWGGENLELSF 368

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K  W                      M GG   I         F K   Y   SG ++  
Sbjct: 369 K-TW----------------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 405

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ LS                  KG +GDV+ RK LR+ LGCKSFKWYL         
Sbjct: 406 RNSIRLSEVWLDEYAKYYYQRIGHDKGKYGDVSERKALRKRLGCKSFKWYLDNVYPELFI 465

Query: 211 --------EVSNDWSG-------------------------------------MCIDSAC 225
                   EV N   G                                     MCIDS  
Sbjct: 466 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPAGLYPCHRQGGNQIRHLVSSMCIDSPG 525

Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
           KP D+H+PVGLYPCH+QGGNQ+WM+SK GEIRRDE+CLDY+G DVILYPCHGSKGNQ + 
Sbjct: 526 KPEDLHQPVGLYPCHRQGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWI 585

Query: 286 YD 287
           Y+
Sbjct: 586 YN 587


>gi|340723544|ref|XP_003400149.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 3 [Bombus terrestris]
          Length = 637

 Score =  293 bits (751), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 170/364 (46%), Positives = 202/364 (55%), Gaps = 106/364 (29%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +AR+ + VV P+I  I D T E  +       S    +GGFDWNLQ
Sbjct: 256 CECTEGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 310

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHA+PERE+KRHKN AEPVW+PTMAGGLFSID+AFF++LGTYDSGFDIWGGENLELSF
Sbjct: 311 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSF 370

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K  W                      M GG   I         F K   Y   SG ++  
Sbjct: 371 K-TW----------------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 407

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ LS                  KG +GDV+ RK LR+ LGCKSFKWYL         
Sbjct: 408 RNSIRLSEVWLDEYAKYYYQRIGHDKGKYGDVSERKALRKKLGCKSFKWYLDNVYPELFI 467

Query: 211 --------EVSNDWSG-------------------------------------MCIDSAC 225
                   EV N   G                                     MCIDSA 
Sbjct: 468 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPAGLYPCHRQGGNQIRHLVSSMCIDSAG 527

Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
           KP D+H+PVGLYPCH+QGGNQ+WM+SK GEIRRDE+CLDY+G DVILYPCHGSKGNQ + 
Sbjct: 528 KPEDLHQPVGLYPCHRQGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWI 587

Query: 286 YDYK 289
           Y+++
Sbjct: 588 YNHQ 591


>gi|350426664|ref|XP_003494506.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 2 [Bombus impatiens]
          Length = 637

 Score =  293 bits (751), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 170/364 (46%), Positives = 202/364 (55%), Gaps = 106/364 (29%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +AR+ + VV P+I  I D T E  +       S    +GGFDWNLQ
Sbjct: 256 CECTEGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 310

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHA+PERE+KRHKN AEPVW+PTMAGGLFSID+AFF++LGTYDSGFDIWGGENLELSF
Sbjct: 311 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSF 370

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K  W                      M GG   I         F K   Y   SG ++  
Sbjct: 371 K-TW----------------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 407

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ LS                  KG +GDV+ RK LR+ LGCKSFKWYL         
Sbjct: 408 RNSIRLSEVWLDEYAKYYYQRIGHDKGKYGDVSERKALRKKLGCKSFKWYLDNVYPELFI 467

Query: 211 --------EVSNDWSG-------------------------------------MCIDSAC 225
                   EV N   G                                     MCIDSA 
Sbjct: 468 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPAGLYPCHRQGGNQIRHLVSSMCIDSAG 527

Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
           KP D+H+PVGLYPCH+QGGNQ+WM+SK GEIRRDE+CLDY+G DVILYPCHGSKGNQ + 
Sbjct: 528 KPEDLHQPVGLYPCHRQGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWI 587

Query: 286 YDYK 289
           Y+++
Sbjct: 588 YNHQ 591


>gi|157114750|ref|XP_001652403.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108883556|gb|EAT47781.1| AAEL001121-PA [Aedes aegypti]
          Length = 647

 Score =  293 bits (749), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 164/326 (50%), Positives = 195/326 (59%), Gaps = 71/326 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLLD +ARNS+ VV P+I  I D+T E  +       S    +GGFDWNLQ
Sbjct: 301 CECTTGWLEPLLDRIARNSTTVVCPVIDVIDDNTMEYHY-----RDSGGVNVGGFDWNLQ 355

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHA+P+RE+KRHK+ AEPV++PTMAGGLFSIDK FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 356 FNWHAVPDREKKRHKSTAEPVFSPTMAGGLFSIDKEFFERLGTYDSGFDIWGGENLELSF 415

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   +G ++  
Sbjct: 416 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRTGVNVIK 452

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+                  KGD+GDV+ RK+LR NLGCK F+WYL         
Sbjct: 453 RNSVRLAEVWLDEYAKYYYQRIGNDKGDYGDVSERKQLRENLGCKPFRWYLDNIFPELFI 512

Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   EV N   G   C+D+     ++ KPVGLYPCH QGGNQ+WM+SK GEIRRDE
Sbjct: 513 PGEAVASGEVRNMGYGNRTCLDAPGGKKNLRKPVGLYPCHNQGGNQYWMLSKTGEIRRDE 572

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
           ACLDYAG DVILYPCHGSKGNQY+ Y
Sbjct: 573 ACLDYAGQDVILYPCHGSKGNQYWNY 598


>gi|380021258|ref|XP_003694487.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like [Apis florea]
          Length = 537

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 171/362 (47%), Positives = 199/362 (54%), Gaps = 106/362 (29%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARN + VV P+I  I D T E  +       S    +GGFDWNLQ
Sbjct: 156 CECTEGWLEPLLDRIARNPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 210

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHA+PERE+KRHKN AEPVW+PTMAGGLFSID+AFFE+LGTYDSGFDIWGGENLELSF
Sbjct: 211 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFERLGTYDSGFDIWGGENLELSF 270

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   SG ++  
Sbjct: 271 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 307

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ LS                  KG +GDV+ RK LR+ LGCKSFKWYL         
Sbjct: 308 RNSIRLSEVWLDEYAKYYYQRIGHDKGKYGDVSERKALRKRLGCKSFKWYLDNVYPELFI 367

Query: 211 --------EVSNDWSG-------------------------------------MCIDSAC 225
                   EV N   G                                     MCIDS  
Sbjct: 368 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPAGLYPCHRQGGNQIRHLVSSMCIDSPG 427

Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
           KP D+H+PVGLYPCH+QGGNQ+WM+SK GEIRRDE+CLDY+G DVILYPCHGSKGNQ + 
Sbjct: 428 KPEDLHQPVGLYPCHRQGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWI 487

Query: 286 YD 287
           Y+
Sbjct: 488 YN 489


>gi|383857913|ref|XP_003704448.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like [Megachile rotundata]
          Length = 638

 Score =  291 bits (744), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 169/362 (46%), Positives = 200/362 (55%), Gaps = 106/362 (29%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +AR+ + VV P+I  I D T E  +       S    +GGFDWNLQ
Sbjct: 257 CECTEGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 311

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHA+PERE+KRHKN AEPVW+PTMAGGLFSID+AFFE+LGTYDSGFDIWGGENLELSF
Sbjct: 312 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFERLGTYDSGFDIWGGENLELSF 371

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K  W                      M GG   I         F K   Y   SG ++  
Sbjct: 372 K-TW----------------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 408

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ LS                  KG++GDV+ RK LR+ LGCKSFKWYL         
Sbjct: 409 RNSIRLSEVWLDEYAKYYYQRIGHDKGNYGDVSDRKALRKKLGCKSFKWYLDNVYPELFI 468

Query: 211 --------EVSNDWSG-------------------------------------MCIDSAC 225
                   EV N   G                                     +CIDS  
Sbjct: 469 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPAGLYPCHRQGGNQIRHLVSSICIDSPG 528

Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
           KP D+H+PVGLYPCH+QGGNQ+WM+SK GEIRRDE+CLDY+G DVILYPCHGSKGNQ + 
Sbjct: 529 KPEDLHQPVGLYPCHRQGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWI 588

Query: 286 YD 287
           Y+
Sbjct: 589 YN 590


>gi|328713087|ref|XP_001951943.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 1 [Acyrthosiphon pisum]
          Length = 674

 Score =  289 bits (740), Expect = 9e-76,   Method: Compositional matrix adjust.
 Identities = 168/362 (46%), Positives = 202/362 (55%), Gaps = 105/362 (29%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +AR +S VV P+I  I D T E  +       +    +GGFDWNLQ
Sbjct: 288 CECTEGWLEPLLDRIAREASTVVCPVIDVIDDSTLEFHY-----RDAGGVNVGGFDWNLQ 342

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWH +P++E+KRHKNAAEPVW+PTMAGGLF+IDK FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 343 FNWHVVPDKEKKRHKNAAEPVWSPTMAGGLFAIDKKFFERLGTYDSGFDIWGGENLELSF 402

Query: 124 K-------------------------FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSID 158
           K                         + W       +K     AE VW    A       
Sbjct: 403 KTWMCGGTLEIVPCSHVGHIFRKRSPYKWRTGVNVLKKNSIRLAE-VWMDDYA------- 454

Query: 159 KAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-------- 210
           K ++E++G      D+           GD+GD+TSRK+LRR L CKSFKWYL        
Sbjct: 455 KYYYERIGN-----DL-----------GDYGDITSRKDLRRKLKCKSFKWYLENIYPELF 498

Query: 211 ---------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHK------------------ 241
                    EV N   G   C+DS  + TD++KP GLYPCHK                  
Sbjct: 499 IPGDAVASGEVRNLGYGNKTCLDSPARKTDLNKPAGLYPCHKMGGNQIKNIVSNMCVDSK 558

Query: 242 --------------QGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEYD 287
                         QGGNQ+WM+SK GEIRRDE+CLDYAG DVILYPCHGSKGNQY+ YD
Sbjct: 559 GDANKPVDLWQCHQQGGNQYWMLSKIGEIRRDESCLDYAGNDVILYPCHGSKGNQYWNYD 618

Query: 288 YK 289
           +K
Sbjct: 619 HK 620


>gi|312379012|gb|EFR25425.1| hypothetical protein AND_09241 [Anopheles darlingi]
          Length = 671

 Score =  289 bits (739), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 160/326 (49%), Positives = 194/326 (59%), Gaps = 71/326 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLLD +ARNS+ VV P+I  I D+T E  +       S    +GGFDWNLQ
Sbjct: 325 CECTTGWLEPLLDRIARNSTTVVCPVIDVIDDNTMEYHY-----RDSGGVNVGGFDWNLQ 379

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHA+PERE+++HK+AAEPVW+PTMAGGLF+ID+ FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 380 FNWHAVPEREKRKHKSAAEPVWSPTMAGGLFAIDRVFFERLGTYDSGFDIWGGENLELSF 439

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   +G ++  
Sbjct: 440 K--------------------TW---MCGGSLEIIPCSHVGHIFRKRSPYKWRTGVNVIK 476

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+                  KGDFGDV+SRK+LR  L CK F+WYL         
Sbjct: 477 RNSVRLAEVWMDEYAQYYYQRIGNDKGDFGDVSSRKKLREELHCKPFRWYLDNIYPELFV 536

Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   EV N   G   C+D+     ++ K VGLYPCH QGGNQ+WM+SK GEIRRDE
Sbjct: 537 PGDAVASGEVRNMGYGNRTCLDAPAGKRNLRKAVGLYPCHNQGGNQYWMLSKTGEIRRDE 596

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
           ACLDYAG DV+LYPCHGS+GNQY+ Y
Sbjct: 597 ACLDYAGDDVVLYPCHGSRGNQYWNY 622


>gi|332019618|gb|EGI60096.1| Putative polypeptide N-acetylgalactosaminyltransferase 9
           [Acromyrmex echinatior]
          Length = 566

 Score =  287 bits (734), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 166/362 (45%), Positives = 200/362 (55%), Gaps = 106/362 (29%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +AR+ + VV P+I  I D T E  +       S    +GGFDWNLQ
Sbjct: 185 CECTEGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHW-----RDSSGVNVGGFDWNLQ 239

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHA+PERERKRHKN AEPVW+PTMAGGLFSID+AFFE++GTYDSGFDIWGGENLELSF
Sbjct: 240 FNWHAVPERERKRHKNPAEPVWSPTMAGGLFSIDRAFFERIGTYDSGFDIWGGENLELSF 299

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   +G ++  
Sbjct: 300 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRNGVNVLK 336

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ LS                  KG++GD++ RK LR+ LGCKSFKWYL         
Sbjct: 337 RNSIRLSEVWLDEYAKYYYQRIGHDKGNYGDISERKALRKKLGCKSFKWYLDNVYPELFI 396

Query: 211 --------EVSNDWSG-------------------------------------MCIDSAC 225
                   EV N   G                                     MCIDS+ 
Sbjct: 397 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPCGLYPCHRQGGNQIRQVTSGMCIDSSG 456

Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
           K  D+H+PVG+YPCH+QGGNQ+WM+SK GEIRRDE+CLDY+G DVILYPCHGSKGNQ + 
Sbjct: 457 KIEDLHQPVGMYPCHRQGGNQYWMLSKTGEIRRDESCLDYSGSDVILYPCHGSKGNQQWI 516

Query: 286 YD 287
           Y+
Sbjct: 517 YN 518


>gi|307172175|gb|EFN63700.1| Putative polypeptide N-acetylgalactosaminyltransferase 9
           [Camponotus floridanus]
          Length = 433

 Score =  284 bits (727), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 167/361 (46%), Positives = 198/361 (54%), Gaps = 106/361 (29%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +AR+ + VV P+I  I D T E  +       S    +GGFDWNLQ
Sbjct: 52  CECTEGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 106

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHA+PERE+KRHKN AEPVW+PTMAGGLFSID+AFFE++GTYDSGFDIWGGENLELSF
Sbjct: 107 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFERIGTYDSGFDIWGGENLELSF 166

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   SG ++  
Sbjct: 167 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 203

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ LS                  KG++GDV+ RK LR+ LGCKSFKWYL         
Sbjct: 204 RNSIRLSEVWLDEYAKYYYQRIGHDKGNYGDVSERKTLRKKLGCKSFKWYLDNIYPELFI 263

Query: 211 --------EVSNDWSG-------------------------------------MCIDSAC 225
                   EV N   G                                     +CIDS  
Sbjct: 264 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPCGLYPCHRQGGNQIRQIASGICIDSPG 323

Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
           K  D+H+PVGLYPCH+QGGNQ+WM+SK GEIRRDE+CLDY+G DVILYPCHGSKGNQ + 
Sbjct: 324 KSEDLHQPVGLYPCHRQGGNQYWMLSKTGEIRRDESCLDYSGSDVILYPCHGSKGNQQWI 383

Query: 286 Y 286
           Y
Sbjct: 384 Y 384


>gi|270011456|gb|EFA07904.1| hypothetical protein TcasGA2_TC005479 [Tribolium castaneum]
          Length = 621

 Score =  284 bits (726), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 168/362 (46%), Positives = 197/362 (54%), Gaps = 106/362 (29%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLLD +AR+ + VV P+I  I D T E  F       S    +GGFDWNLQ
Sbjct: 240 CECTTGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHF-----HDSGGVNVGGFDWNLQ 294

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHA+PE E+KRHKN AEPV++PTMAGGLFSIDK FFE+LGTYD+GFDIWGGENLELSF
Sbjct: 295 FNWHAVPEHEKKRHKNPAEPVYSPTMAGGLFSIDKKFFERLGTYDNGFDIWGGENLELSF 354

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   SG ++  
Sbjct: 355 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLR 391

Query: 177 GENLELS-----------------FKGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+                  KGDFGD+TSRK LR  LGCKSFKWYL         
Sbjct: 392 RNSVRLAEVWLDEYAKYYYQRIGNEKGDFGDITSRKALREKLGCKSFKWYLDNIYPELFI 451

Query: 211 --------EVSNDWSG--MCID-----------------------------------SAC 225
                   E+ N   G   C+D                                   S C
Sbjct: 452 PGEAVASGEIRNLGIGGKTCLDSPARRSDLHKPVGLYPCHRQGGNQISVLDRELCIDSPC 511

Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
           KP D+H P+GL+PCHKQGGNQFWM SK GEIRRDEACLDY+G +VILYPCHGSKGNQ+++
Sbjct: 512 KPEDLHNPIGLWPCHKQGGNQFWMYSKSGEIRRDEACLDYSGQEVILYPCHGSKGNQFWD 571

Query: 286 YD 287
           Y+
Sbjct: 572 YN 573


>gi|307203928|gb|EFN82835.1| Putative polypeptide N-acetylgalactosaminyltransferase 9
           [Harpegnathos saltator]
          Length = 482

 Score =  283 bits (725), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 166/362 (45%), Positives = 198/362 (54%), Gaps = 106/362 (29%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +AR+ + VV P+I  I D T E  +       S    +GGFDWNLQ
Sbjct: 101 CECTEGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 155

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHA+PERE+KRHKN AEPVW+PTMAGGLFSID+ FFE++GTYDSGFDIWGGENLELSF
Sbjct: 156 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFSIDRVFFERIGTYDSGFDIWGGENLELSF 215

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   SG ++  
Sbjct: 216 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 252

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ LS                  KG++GDV+ RK LR+ LGCKSFKWYL         
Sbjct: 253 RNSIRLSEVWLDEYAKYYYQRIGHDKGNYGDVSERKTLRKKLGCKSFKWYLDNVYPELFI 312

Query: 211 --------EVSNDWSG-------------------------------------MCIDSAC 225
                   EV N   G                                     +CIDS  
Sbjct: 313 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPCGLYPCHRQGGNQIRQVASGICIDSPG 372

Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
           K  D+H+PVGLYPCH+QGGNQ+WM+SK GEIRRDE+CLDY+G DVILYPCHGSKGNQ + 
Sbjct: 373 KSEDLHQPVGLYPCHRQGGNQYWMLSKTGEIRRDESCLDYSGSDVILYPCHGSKGNQQWI 432

Query: 286 YD 287
           Y+
Sbjct: 433 YN 434


>gi|345484988|ref|XP_001605337.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 1 [Nasonia vitripennis]
          Length = 646

 Score =  283 bits (725), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 166/364 (45%), Positives = 198/364 (54%), Gaps = 106/364 (29%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARN + VV P+I  I D T E  +       S    +GGFDWNLQ
Sbjct: 265 CECTEGWLEPLLDRIARNQTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 319

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHA+PERE+KRHKN AEPVW+PTMAGGLF+ID+ FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 320 FNWHAVPEREKKRHKNPAEPVWSPTMAGGLFAIDRLFFERLGTYDSGFDIWGGENLELSF 379

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   SG ++  
Sbjct: 380 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 416

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ LS                  KG++GDV+ RK LR+NLGCKSFKWYL         
Sbjct: 417 RNSIRLSEVWLDEYAKYYYQRIGHDKGNYGDVSDRKALRKNLGCKSFKWYLDNIYPELFI 476

Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHK------------------- 241
                   EV N   G   C+DS  +  D+HKP GLYPCH+                   
Sbjct: 477 PGEAVASGEVRNLGEGGNTCLDSPARKADLHKPAGLYPCHRQGGNQIRHLASRLCIDSPG 536

Query: 242 ----------------QGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
                           QGGNQ+WM+SK GEIRRDE+CLDY+G DVILYPCHGSKGNQ + 
Sbjct: 537 NPEDLHQAVGFYECHNQGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWT 596

Query: 286 YDYK 289
           Y+ +
Sbjct: 597 YNTQ 600


>gi|195124241|ref|XP_002006602.1| GI18492 [Drosophila mojavensis]
 gi|193911670|gb|EDW10537.1| GI18492 [Drosophila mojavensis]
          Length = 670

 Score =  280 bits (716), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 158/326 (48%), Positives = 192/326 (58%), Gaps = 71/326 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARNS+ VV P+I  I D T E  +       S    +GGFDWNLQ
Sbjct: 324 CECAEGWLEPLLDRIARNSTTVVCPVIDVIDDTTLEFHY-----RDSSGVNVGGFDWNLQ 378

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+WHA+PERE+KRH + +EPV++PTMAGGLFSID+ FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 379 FSWHAVPEREKKRHNSTSEPVYSPTMAGGLFSIDRKFFERLGTYDSGFDIWGGENLELSF 438

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   +G ++  
Sbjct: 439 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRTGVNVLK 475

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+                  KGDFGDV+ RK+LR +L CKSFKWYL         
Sbjct: 476 KNSVRLAEVWMDDYAKYYYQRIGMDKGDFGDVSERKKLREDLQCKSFKWYLDNVYPELFI 535

Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   E+ N   G   C+DS      + KPVGLYPCH+QGGNQ+WM SK GEIRRD+
Sbjct: 536 PGDAVANGEMRNLGYGGRTCLDSPSGKRYLKKPVGLYPCHRQGGNQYWMFSKTGEIRRDQ 595

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
           ACLDYAG DVIL+ CHGSKGNQ++ Y
Sbjct: 596 ACLDYAGKDVILFGCHGSKGNQFWTY 621


>gi|195425498|ref|XP_002061038.1| GK10725 [Drosophila willistoni]
 gi|194157123|gb|EDW72024.1| GK10725 [Drosophila willistoni]
          Length = 644

 Score =  277 bits (709), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 155/326 (47%), Positives = 192/326 (58%), Gaps = 71/326 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARNS+ VV P+I  I DDT E  +       S    +GGFDWNLQ
Sbjct: 298 CECTEGWLEPLLDRIARNSTTVVCPVIDVINDDTLEYHY-----RDSTGVNVGGFDWNLQ 352

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+WHA+PERE+KRH ++AEPV++PTMAGGLFSID+ FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 353 FSWHAVPEREKKRHNSSAEPVYSPTMAGGLFSIDRDFFERLGTYDSGFDIWGGENLELSF 412

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K  W                      M GG   I         F K   Y   SG ++  
Sbjct: 413 K-TW----------------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLR 449

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+                  KGD+GDV+ RK+LR +L CKSF+WYL         
Sbjct: 450 KNSVRLAEVWMDDYAQYYYHRIGNDKGDWGDVSDRKKLREDLQCKSFRWYLDNIYPELFI 509

Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   E+ N   G   C+D+      + K VG YPCH+QGGNQ+WM+SK GEIRRD+
Sbjct: 510 PGDAVAHGEIKNLGYGGRTCMDAPAGKKHLKKSVGTYPCHRQGGNQYWMLSKAGEIRRDD 569

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
           +CLDYAG DV LY CHGSKGNQ++ Y
Sbjct: 570 SCLDYAGKDVTLYACHGSKGNQFWTY 595


>gi|357619954|gb|EHJ72323.1| putative UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           [Danaus plexippus]
          Length = 533

 Score =  274 bits (700), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 156/361 (43%), Positives = 191/361 (52%), Gaps = 105/361 (29%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARN ++VV P+I  I D+T E  +       S    +GGFDWNLQ
Sbjct: 146 CECTEGWLEPLLDRIARNKTNVVCPVIDVIDDNTLEYHY-----RDSTSVNVGGFDWNLQ 200

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWH +P RER RHK+ AEPVW+PTMAGGLF+IDK FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 201 FNWHPVPARERARHKHTAEPVWSPTMAGGLFAIDKEFFERLGTYDSGFDIWGGENLELSF 260

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   +G ++  
Sbjct: 261 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRTGVNVLK 297

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+                  KGD+GD++ RKELR  L CKSF WYL         
Sbjct: 298 KNSVRLAEVWLDDYSKYYYQRVGNDKGDYGDISGRKELREKLKCKSFDWYLKNIYPELFI 357

Query: 211 --------------------------------------------EVSNDWSGMCIDSACK 226
                                                       +++N  S MC+DSA  
Sbjct: 358 PGESVAHGEIRNIGFERTCLDSPTRKSDHHKPVGLYPCHRQGGNQIANPSSDMCVDSAAG 417

Query: 227 PTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
           P DM KPV  +PCH + GNQ+WM SK+GEIRRDE CLDY+G DV+LYPCHG+KGNQ + Y
Sbjct: 418 PEDMKKPVNPWPCHGEYGNQYWMYSKNGEIRRDETCLDYSGHDVVLYPCHGAKGNQLWLY 477

Query: 287 D 287
           D
Sbjct: 478 D 478


>gi|198461537|ref|XP_002139017.1| GA25136 [Drosophila pseudoobscura pseudoobscura]
 gi|198137372|gb|EDY69575.1| GA25136 [Drosophila pseudoobscura pseudoobscura]
          Length = 658

 Score =  274 bits (700), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 155/326 (47%), Positives = 189/326 (57%), Gaps = 71/326 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARNS+ VV P+I  I DDT E  +       S    +GGFDWNLQ
Sbjct: 312 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDDTLEYHY-----RDSSGVNVGGFDWNLQ 366

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+WHA+PERE+KRH + AEPV++PTMAGGLFSID+ +F +LGTYDSGFDIWGGENLELSF
Sbjct: 367 FSWHAVPEREKKRHNSTAEPVYSPTMAGGLFSIDREYFNRLGTYDSGFDIWGGENLELSF 426

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   SG ++  
Sbjct: 427 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLR 463

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+                  KGD+GDV+ RK+LR +L CKSFKWYL         
Sbjct: 464 KNSVRLAEVWMDEYSQYYYHRIGNDKGDWGDVSDRKKLREDLQCKSFKWYLDNIYPELFI 523

Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   E+ N   G   C+DS        K VGLYPCH+QGGNQ+WM+SK GEIRRD+
Sbjct: 524 PGDAVAHGEIRNLGYGGRTCLDSPTGKKHQKKAVGLYPCHRQGGNQYWMLSKVGEIRRDD 583

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
            CLDYAG +VILY CHG KGNQ++ Y
Sbjct: 584 YCLDYAGKEVILYSCHGGKGNQFWTY 609


>gi|195171653|ref|XP_002026618.1| GL11821 [Drosophila persimilis]
 gi|194111544|gb|EDW33587.1| GL11821 [Drosophila persimilis]
          Length = 658

 Score =  273 bits (699), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 155/326 (47%), Positives = 189/326 (57%), Gaps = 71/326 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARNS+ VV P+I  I DDT E  +       S    +GGFDWNLQ
Sbjct: 312 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDDTLEYHY-----RDSSGVNVGGFDWNLQ 366

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+WHA+PERE+KRH + AEPV++PTMAGGLFSID+ +F +LGTYDSGFDIWGGENLELSF
Sbjct: 367 FSWHAVPEREKKRHNSTAEPVYSPTMAGGLFSIDREYFNRLGTYDSGFDIWGGENLELSF 426

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   SG ++  
Sbjct: 427 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLR 463

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+                  KGD+GDV+ RK+LR +L CKSFKWYL         
Sbjct: 464 KNSVRLAEVWMDEYSQYYYHRIGNDKGDWGDVSDRKKLREDLQCKSFKWYLDNIYPELFI 523

Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   E+ N   G   C+DS        K VGLYPCH+QGGNQ+WM+SK GEIRRD+
Sbjct: 524 PGDAVAHGEIRNLGYGGRTCLDSPTGKKHQKKAVGLYPCHRQGGNQYWMLSKVGEIRRDD 583

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
            CLDYAG +VILY CHG KGNQ++ Y
Sbjct: 584 YCLDYAGKEVILYSCHGGKGNQFWTY 609


>gi|161077154|ref|NP_725603.2| CG30463, isoform B [Drosophila melanogaster]
 gi|161077156|ref|NP_001097341.1| CG30463, isoform C [Drosophila melanogaster]
 gi|157400365|gb|AAF57964.3| CG30463, isoform B [Drosophila melanogaster]
 gi|157400366|gb|ABV53822.1| CG30463, isoform C [Drosophila melanogaster]
          Length = 647

 Score =  271 bits (693), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 151/324 (46%), Positives = 192/324 (59%), Gaps = 70/324 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARNS+ VV P+I  I D+T E  +       S    +GGFDWNLQ
Sbjct: 304 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHY-----RDSGGVNVGGFDWNLQ 358

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+WH +PERERKRH + AEPV++PTMAGGLFSID+ FF++LGTYDSGFDIWGGENLELSF
Sbjct: 359 FSWHPVPERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSF 418

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   SG ++  
Sbjct: 419 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 455

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+                  KGD+GDV+ R++LR +L CKSFKWYL         
Sbjct: 456 KNSVRLAEVWMDEYSQYYYHRIGNDKGDWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFI 515

Query: 211 --------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEAC 262
                   E++N  +GMC+D+  K ++   PV +Y CH QGGNQ+WM+SK GEIRRD++C
Sbjct: 516 PGDSVAHGEIANVPNGMCLDAKEK-SEEETPVSIYECHGQGGNQYWMLSKAGEIRRDDSC 574

Query: 263 LDYAGGDVILYPCHGSKGNQYFEY 286
           LDYAG DV L+ CHG KGNQ++ Y
Sbjct: 575 LDYAGKDVTLFGCHGGKGNQFWTY 598


>gi|195380503|ref|XP_002049010.1| GJ21354 [Drosophila virilis]
 gi|194143807|gb|EDW60203.1| GJ21354 [Drosophila virilis]
          Length = 693

 Score =  269 bits (687), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 159/359 (44%), Positives = 194/359 (54%), Gaps = 104/359 (28%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARNS+ VV P+I  I D T E  +       S    +GGFDWNLQ
Sbjct: 314 CECAEGWLEPLLDRIARNSTTVVCPVIDVIDDTTLEFHY-----RDSSGVNVGGFDWNLQ 368

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+WHA+PERE++RH N AEPV++PTMAGGLFSID+ FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 369 FSWHAVPEREKRRHNNTAEPVYSPTMAGGLFSIDREFFERLGTYDSGFDIWGGENLELSF 428

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   +G ++  
Sbjct: 429 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRTGVNVLK 465

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+                  KGD+GDV+ RK+LR +L CKSFKWYL         
Sbjct: 466 KNSVRLAEVWMDDYSKYYLQRIGMDKGDYGDVSERKKLREDLQCKSFKWYLDNIYPELFI 525

Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGN--------------- 245
                   E+ N   G   C+DS     +M KPVGLYPCHKQGGN               
Sbjct: 526 PGDAVANGEIRNLGYGGRTCLDSPTGKRNMKKPVGLYPCHKQGGNQIKSINTDMCVDAPK 585

Query: 246 ------------------QFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
                             Q+WM+SK GEIRRD++CLDYAG DVIL+ CHGSKGNQ++ Y
Sbjct: 586 TGDESPVGVYPCHGQGGHQYWMLSKAGEIRRDQSCLDYAGKDVILFGCHGSKGNQFWTY 644


>gi|195584006|ref|XP_002081807.1| GD25523 [Drosophila simulans]
 gi|194193816|gb|EDX07392.1| GD25523 [Drosophila simulans]
          Length = 650

 Score =  268 bits (685), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 151/326 (46%), Positives = 188/326 (57%), Gaps = 71/326 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARNS+ VV P+I  I D+T E  +       S    +GGFDWNLQ
Sbjct: 304 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHY-----RDSGGVNVGGFDWNLQ 358

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+WH +PERERKRH + AEPV++PTMAGGLFSID+ FF++LGTYDSGFDIWGGENLELSF
Sbjct: 359 FSWHPVPERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSF 418

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   SG ++  
Sbjct: 419 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 455

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+                  KGD+GDV+ R++LR +L CKSFKWYL         
Sbjct: 456 KNSVRLAEVWMDEYSQYYYHRIGNDKGDWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFI 515

Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   E+ N   G   C+D+        K VG YPCH+QGGNQ+WM+SK GEIRRD+
Sbjct: 516 PGDSVAHGEIRNLGYGGRTCLDAPAGKKHQKKAVGTYPCHRQGGNQYWMLSKAGEIRRDD 575

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
           +CLDYAG DV L+ CHG KGNQ++ Y
Sbjct: 576 SCLDYAGKDVTLFGCHGGKGNQFWTY 601


>gi|24654219|ref|NP_725602.1| CG30463, isoform A [Drosophila melanogaster]
 gi|161077158|ref|NP_001097342.1| CG30463, isoform D [Drosophila melanogaster]
 gi|51316018|sp|Q8MRC9.2|GALT9_DROME RecName: Full=Putative polypeptide
           N-acetylgalactosaminyltransferase 9; Short=pp-GaNTase 9;
           AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 9; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 9
 gi|21627105|gb|AAF57966.2| CG30463, isoform A [Drosophila melanogaster]
 gi|157400367|gb|ABV53823.1| CG30463, isoform D [Drosophila melanogaster]
          Length = 650

 Score =  268 bits (685), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 151/326 (46%), Positives = 188/326 (57%), Gaps = 71/326 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARNS+ VV P+I  I D+T E  +       S    +GGFDWNLQ
Sbjct: 304 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHY-----RDSGGVNVGGFDWNLQ 358

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+WH +PERERKRH + AEPV++PTMAGGLFSID+ FF++LGTYDSGFDIWGGENLELSF
Sbjct: 359 FSWHPVPERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSF 418

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   SG ++  
Sbjct: 419 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 455

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+                  KGD+GDV+ R++LR +L CKSFKWYL         
Sbjct: 456 KNSVRLAEVWMDEYSQYYYHRIGNDKGDWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFI 515

Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   E+ N   G   C+D+        K VG YPCH+QGGNQ+WM+SK GEIRRD+
Sbjct: 516 PGDSVAHGEIRNLGYGGRTCLDAPAGKKHQKKAVGTYPCHRQGGNQYWMLSKAGEIRRDD 575

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
           +CLDYAG DV L+ CHG KGNQ++ Y
Sbjct: 576 SCLDYAGKDVTLFGCHGGKGNQFWTY 601


>gi|21464370|gb|AAM51988.1| RE10344p [Drosophila melanogaster]
          Length = 650

 Score =  267 bits (683), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 151/326 (46%), Positives = 188/326 (57%), Gaps = 71/326 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARNS+ VV P+I  I D+T E  +       S    +GGFDWNLQ
Sbjct: 304 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHY-----RDSGGVNVGGFDWNLQ 358

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+WH +PERERKRH + AEPV++PTMAGGLFSID+ FF++LGTYDSGFDIWGGENLELSF
Sbjct: 359 FSWHPVPERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSF 418

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   SG ++  
Sbjct: 419 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVPK 455

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+                  KGD+GDV+ R++LR +L CKSFKWYL         
Sbjct: 456 KNSVRLAEVWMDEYSQCYYHRIGNDKGDWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFI 515

Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   E+ N   G   C+D+        K VG YPCH+QGGNQ+WM+SK GEIRRD+
Sbjct: 516 PGDSVAHGEIRNLGYGGRTCLDAPAGKKHQKKAVGTYPCHRQGGNQYWMLSKAGEIRRDD 575

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
           +CLDYAG DV L+ CHG KGNQ++ Y
Sbjct: 576 SCLDYAGKDVTLFGCHGGKGNQFWTY 601


>gi|195335001|ref|XP_002034165.1| GM20039 [Drosophila sechellia]
 gi|194126135|gb|EDW48178.1| GM20039 [Drosophila sechellia]
          Length = 650

 Score =  266 bits (681), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 150/326 (46%), Positives = 188/326 (57%), Gaps = 71/326 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARNS+ VV P+I  I D+T E  +       S    +GGFDWNLQ
Sbjct: 304 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHY-----RDSGGVNVGGFDWNLQ 358

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+WH +PERERKRH + AEPV++PTMAGGLFSID+ FF++LGTYDSGFDIWGGENLELSF
Sbjct: 359 FSWHPVPERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSF 418

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   SG ++  
Sbjct: 419 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 455

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+                  KG++GDV+ R++LR +L CKSFKWYL         
Sbjct: 456 KNSVRLAEVWMDEYSQYYYHRIGNDKGNWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFI 515

Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   E+ N   G   C+D+        K VG YPCH+QGGNQ+WM+SK GEIRRD+
Sbjct: 516 PGDSVAHGEIRNLGYGGRTCLDAPAGKKHQKKAVGTYPCHRQGGNQYWMLSKAGEIRRDD 575

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
           +CLDYAG DV L+ CHG KGNQ++ Y
Sbjct: 576 SCLDYAGKDVTLFGCHGGKGNQFWTY 601


>gi|3047195|gb|AAC13673.1| GLY5c [Caenorhabditis elegans]
          Length = 624

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 147/326 (45%), Positives = 189/326 (57%), Gaps = 69/326 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + W++PLLD + R+ + VV P+I  I D+TFE        TS     +GGFDW LQ
Sbjct: 271 CECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTS-----VGGFDWGLQ 325

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWH+IPER+RK      +PV +PTMAGGLFSIDK +FEKLGTYD GFDIWGGENLELSF
Sbjct: 326 FNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGENLELSF 385

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                    +W   M GG   I         F K   Y   +G ++  
Sbjct: 386 K--------------------IW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLK 422

Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+      +K           GDFGD++SRK+LR +LGCKSFKWYL         
Sbjct: 423 RNSIRLAEVWLDDYKTYYYERINNQLGDFGDISSRKKLREDLGCKSFKWYLDNIYPELFV 482

Query: 211 --------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEAC 262
                   E+ N  +  C+DSA      +K +  YPCH+QGGNQ+WM+SK GEIRRDE+C
Sbjct: 483 PGESVAKGELRNAQTSQCLDSAVGEEVENKAITPYPCHEQGGNQYWMLSKDGEIRRDESC 542

Query: 263 LDYAGGDVILYPCHGSKGNQYFEYDY 288
           +DYAG DV+++PCHG KGNQ + Y++
Sbjct: 543 VDYAGSDVMVFPCHGMKGNQEWRYNH 568


>gi|71993517|ref|NP_001022852.1| Protein GLY-5, isoform c [Caenorhabditis elegans]
 gi|14530627|emb|CAC42369.1| Protein GLY-5, isoform c [Caenorhabditis elegans]
          Length = 624

 Score =  265 bits (678), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 147/326 (45%), Positives = 189/326 (57%), Gaps = 69/326 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + W++PLLD + R+ + VV P+I  I D+TFE        TS     +GGFDW LQ
Sbjct: 271 CECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTS-----VGGFDWGLQ 325

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWH+IPER+RK      +PV +PTMAGGLFSIDK +FEKLGTYD GFDIWGGENLELSF
Sbjct: 326 FNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSF 385

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                    +W   M GG   I         F K   Y   +G ++  
Sbjct: 386 K--------------------IW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLK 422

Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+      +K           GDFGD++SRK+LR +LGCKSFKWYL         
Sbjct: 423 RNSIRLAEVWLDDYKTYYYERINNQLGDFGDISSRKKLREDLGCKSFKWYLDNIYPELFV 482

Query: 211 --------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEAC 262
                   E+ N  +  C+DSA      +K +  YPCH+QGGNQ+WM+SK GEIRRDE+C
Sbjct: 483 PGESVAKGELRNAQTSQCLDSAVGEEVENKAITPYPCHEQGGNQYWMLSKDGEIRRDESC 542

Query: 263 LDYAGGDVILYPCHGSKGNQYFEYDY 288
           +DYAG DV+++PCHG KGNQ + Y++
Sbjct: 543 VDYAGSDVMVFPCHGMKGNQEWRYNH 568


>gi|324507488|gb|ADY43175.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Ascaris suum]
          Length = 632

 Score =  264 bits (674), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 151/333 (45%), Positives = 189/333 (56%), Gaps = 76/333 (22%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + W++PLLD + RNSS VV P+I  I D+TFE  +     T+     +GGFDW+LQ
Sbjct: 277 CECMEGWIEPLLDRIKRNSSTVVCPVIDVIDDETFEYHYSKAYFTN-----VGGFDWSLQ 331

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHAIPER+RK  K   +PV +PTMAGGLFSID+A+FEKLGTYD GFDIWGGENLELSF
Sbjct: 332 FNWHAIPERDRKNRKRHIDPVRSPTMAGGLFSIDRAYFEKLGTYDPGFDIWGGENLELSF 391

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                    +W   M GG   I         F K   Y   +G ++  
Sbjct: 392 K--------------------IW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLK 428

Query: 177 GENLELS-----------------FKGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+                   GD+GDV+ RK LR  L CKSFKWYL         
Sbjct: 429 KNSVRLAEVWLDEYKVYYYERINNQTGDYGDVSDRKALRERLKCKSFKWYLDNIYPELFV 488

Query: 211 --------EVSN------DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI 256
                   EV N        +  C+DS     D+HK V  YPCH QGGNQ+WM+SK GEI
Sbjct: 489 PGDSVAKGEVRNYGYKEGGGAPQCLDSVVG-EDVHKDVTPYPCHGQGGNQYWMLSKDGEI 547

Query: 257 RRDEACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
           RRDE+C+DYAG +V+++PCHG KGNQ + Y++K
Sbjct: 548 RRDESCIDYAGANVMIFPCHGMKGNQEWRYNHK 580


>gi|195057673|ref|XP_001995302.1| GH22705 [Drosophila grimshawi]
 gi|193899508|gb|EDV98374.1| GH22705 [Drosophila grimshawi]
          Length = 693

 Score =  262 bits (669), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 155/360 (43%), Positives = 194/360 (53%), Gaps = 105/360 (29%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARNS+ VV P+I  I D T E  +       S    +GGFDWNLQ
Sbjct: 313 CECAEGWLEPLLDRIARNSTTVVCPVIDVIDDATLEFHY-----RDSSGVNVGGFDWNLQ 367

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+WH++PERE+KRH + +EPV++PTMAGGLFSID+ FFE+LGTYDSGFDIWGGENLELSF
Sbjct: 368 FSWHSVPEREKKRHNSTSEPVYSPTMAGGLFSIDREFFERLGTYDSGFDIWGGENLELSF 427

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   +G ++  
Sbjct: 428 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRTGVNVLK 464

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+                  KGDFGDV+ RK+LR +L CKSF+WYL         
Sbjct: 465 KNSVRLAEVWMDDYSKYYYQRIGMDKGDFGDVSDRKKLREDLQCKSFQWYLDTIYPELFI 524

Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGN--------------- 245
                   E+ N   G   C+DS     ++ K VGLYPCHKQGGN               
Sbjct: 525 PGNAVANGEIRNLGYGGRTCLDSPSGKRNLKKAVGLYPCHKQGGNQIRNINTNMCLDAML 584

Query: 246 -------------------QFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
                              Q+WM+SK GEIRRD+ACLDYAG DVIL+ CHGS+GNQ+++Y
Sbjct: 585 KNEDESPVGVYECHGQGGHQYWMLSKAGEIRRDQACLDYAGKDVILFGCHGSRGNQFWQY 644


>gi|391346483|ref|XP_003747502.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like [Metaseiulus occidentalis]
          Length = 514

 Score =  261 bits (667), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 151/322 (46%), Positives = 189/322 (58%), Gaps = 58/322 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E  + WL+PLLD +A NS++VVSP+I  I DDT E          S    +GGFDW+LQ
Sbjct: 167 VECTQGWLEPLLDRIAVNSTNVVSPVIDIIADDTLEYN-----AKESADVNVGGFDWSLQ 221

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+WH+IPER  K      +PV TPTMAGGLFSID+ FFE+LG YD GFDIWGGENLELSF
Sbjct: 222 FSWHSIPERILKSGYKRWQPVETPTMAGGLFSIDRKFFERLGMYDPGFDIWGGENLELSF 281

Query: 124 KFNW---------------HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA---FFEKL 165
           K  W               H   +R   + ++    +   ++      +D+    +FE+L
Sbjct: 282 K-TWMCGGRLEIIPCSHVGHIFRKRSPYKWRSGVNVLRRNSIRLAKVWMDEYANYYFERL 340

Query: 166 GTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL--------------- 210
           G      D+           GD+GD++ R  LR  L C SFKWY+               
Sbjct: 341 GN-----DL-----------GDYGDISDRIALRDKLKCHSFKWYIDEVYPELFVPGDAIG 384

Query: 211 --EVSNDWSG-MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG 267
             E+ N  SG MC+DS    + +HK VGLYPCH QGGNQ+W+ SK+GEIRRDEACLDYAG
Sbjct: 385 SGEMRNLGSGGMCLDSPAGKSSLHKAVGLYPCHGQGGNQYWLYSKNGEIRRDEACLDYAG 444

Query: 268 GDVILYPCHGSKGNQYFEYDYK 289
            DVILYPCHGSKGNQY+ YD +
Sbjct: 445 TDVILYPCHGSKGNQYWIYDQQ 466


>gi|268576200|ref|XP_002643080.1| C. briggsae CBR-GLY-5 protein [Caenorhabditis briggsae]
          Length = 630

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 148/328 (45%), Positives = 188/328 (57%), Gaps = 71/328 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + W++PLLD + R+ + VV P+I  I D+TFE        TS     +GGFDW LQ
Sbjct: 275 CECMEGWIEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTS-----VGGFDWGLQ 329

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWH+IPER+RK    A +PV +PTMAGGLFSIDK +FEKLGTYD GFDIWGGENLELSF
Sbjct: 330 FNWHSIPERDRKNRTRAIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSF 389

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                    +W   M GG   I         F K   Y   +G ++  
Sbjct: 390 K--------------------IW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLK 426

Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+      +K           GDFGDV++RK+LR +LGCKSFKWYL         
Sbjct: 427 RNSIRLAEVWLDDYKTYYYERINNQLGDFGDVSARKKLRSDLGCKSFKWYLDNIFPELFV 486

Query: 211 --------EVSND--WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   EV N       C+D      + ++PVG Y CH QGGNQ+WM+SK GEIRRDE
Sbjct: 487 PGESVAKGEVRNSAVQPARCLDCMVGRHEKNRPVGTYQCHGQGGNQYWMLSKDGEIRRDE 546

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYDY 288
           +C+DYAG DV+++PCHG KGNQ + Y++
Sbjct: 547 SCVDYAGSDVMVFPCHGMKGNQEWRYNH 574


>gi|3047193|gb|AAC13672.1| GLY5b [Caenorhabditis elegans]
          Length = 626

 Score =  260 bits (664), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 147/328 (44%), Positives = 187/328 (57%), Gaps = 71/328 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + W++PLLD + R+ + VV P+I  I D+TFE        TS     +GGFDW LQ
Sbjct: 271 CECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTS-----VGGFDWGLQ 325

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWH+IPER+RK      +PV +PTMAGGLFSIDK +FEKLGTYD GFDIWGGENLELSF
Sbjct: 326 FNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGENLELSF 385

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                    +W   M GG   I         F K   Y   +G ++  
Sbjct: 386 K--------------------IW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLK 422

Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+      +K           GDFGD++SRK+LR +LGCKSFKWYL         
Sbjct: 423 RNSIRLAEVWLDDYKTYYYERINNQLGDFGDISSRKKLREDLGCKSFKWYLDNIYPELFV 482

Query: 211 --------EVSND--WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   EV N       C+D      + ++PVG Y CH QGGNQ+WM+SK GEIRRDE
Sbjct: 483 PGESVAKGEVRNSAVQPARCLDCMVGRHEKNRPVGTYQCHGQGGNQYWMLSKDGEIRRDE 542

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYDY 288
           +C+DYAG DV+++PCHG KGNQ + Y++
Sbjct: 543 SCVDYAGSDVMVFPCHGMKGNQEWRYNH 570


>gi|71993511|ref|NP_001022850.1| Protein GLY-5, isoform a [Caenorhabditis elegans]
 gi|51316068|sp|Q95ZJ1.2|GALT5_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
           Short=pp-GaNTase 5; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 5; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 5
 gi|5824785|emb|CAB54435.1| Protein GLY-5, isoform a [Caenorhabditis elegans]
          Length = 626

 Score =  260 bits (664), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 147/328 (44%), Positives = 187/328 (57%), Gaps = 71/328 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + W++PLLD + R+ + VV P+I  I D+TFE        TS     +GGFDW LQ
Sbjct: 271 CECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTS-----VGGFDWGLQ 325

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWH+IPER+RK      +PV +PTMAGGLFSIDK +FEKLGTYD GFDIWGGENLELSF
Sbjct: 326 FNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSF 385

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                    +W   M GG   I         F K   Y   +G ++  
Sbjct: 386 K--------------------IW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLK 422

Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+      +K           GDFGD++SRK+LR +LGCKSFKWYL         
Sbjct: 423 RNSIRLAEVWLDDYKTYYYERINNQLGDFGDISSRKKLREDLGCKSFKWYLDNIYPELFV 482

Query: 211 --------EVSND--WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   EV N       C+D      + ++PVG Y CH QGGNQ+WM+SK GEIRRDE
Sbjct: 483 PGESVAKGEVRNSAVQPARCLDCMVGRHEKNRPVGTYQCHGQGGNQYWMLSKDGEIRRDE 542

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYDY 288
           +C+DYAG DV+++PCHG KGNQ + Y++
Sbjct: 543 SCVDYAGSDVMVFPCHGMKGNQEWRYNH 570


>gi|194756744|ref|XP_001960635.1| GF13455 [Drosophila ananassae]
 gi|190621933|gb|EDV37457.1| GF13455 [Drosophila ananassae]
          Length = 688

 Score =  260 bits (664), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 155/362 (42%), Positives = 191/362 (52%), Gaps = 107/362 (29%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARNS+ VV P+I  I DDT E  +       S    +GGFDWNLQ
Sbjct: 306 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDDTLEYHY-----RDSSGVNVGGFDWNLQ 360

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+WH++PERERKRH N+AEPV++PTMAGGLF+ID+ FF++LGTYDSGFDIWGGENLELSF
Sbjct: 361 FSWHSVPERERKRHNNSAEPVYSPTMAGGLFAIDREFFDRLGTYDSGFDIWGGENLELSF 420

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   SG ++  
Sbjct: 421 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLR 457

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+                  KGD+GDVT RK+LR +L CKSFKWYL         
Sbjct: 458 KNSVRLAEVWMDDYAQYYYHRIGNDKGDWGDVTDRKKLRADLKCKSFKWYLDNIYPELFI 517

Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHK------------------- 241
                   E+ N   G   C+D+        K VG YPCH+                   
Sbjct: 518 PGDSVAHGEIRNLGYGGRTCLDAPSGKKHQKKAVGTYPCHRQGGNQIANLPTGMCLDAKE 577

Query: 242 -----------------QGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYF 284
                            QGGNQ+WM+SK GEIRRD++CLDYAG +V LYPCHG KGNQ++
Sbjct: 578 LSTEGDDTSVSIYECHGQGGNQYWMLSKTGEIRRDDSCLDYAGKEVTLYPCHGGKGNQFW 637

Query: 285 EY 286
            Y
Sbjct: 638 SY 639


>gi|322792015|gb|EFZ16120.1| hypothetical protein SINV_06269 [Solenopsis invicta]
          Length = 433

 Score =  259 bits (662), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 149/309 (48%), Positives = 184/309 (59%), Gaps = 55/309 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +AR+ + VV P+I  I D T E  +       S    +GGFDWNLQ
Sbjct: 107 CECTEGWLEPLLDRIARDPTTVVCPVIDVIDDTTLEYHW-----RDSGGVNVGGFDWNLQ 161

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHA+PERERKRHKN AEPVW+PTMAGGLFSID+AFFE++GTYDSGFDIWGGENLELSF
Sbjct: 162 FNWHAVPERERKRHKNPAEPVWSPTMAGGLFSIDRAFFERIGTYDSGFDIWGGENLELSF 221

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   SG ++  
Sbjct: 222 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 258

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGM 219
             ++ LS                  KG++GDV+ RK LR+ LGCKSFKWYL+  N +  +
Sbjct: 259 RNSIRLSEVWLDEYAKYYYQRIGHDKGNYGDVSERKALRKKLGCKSFKWYLD--NVYPEL 316

Query: 220 CIDSACKPTDMHKPVGLYP-CHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGS 278
            I      +    P  +Y   ++   +Q+WM+SK GEIRRDE+CLDY+G DVILYPCHGS
Sbjct: 317 FIPGEAVASGEASPCRIYRGINRDRLSQYWMLSKTGEIRRDESCLDYSGSDVILYPCHGS 376

Query: 279 KGNQYFEYD 287
           KGNQ + Y+
Sbjct: 377 KGNQQWIYN 385


>gi|71993513|ref|NP_001022851.1| Protein GLY-5, isoform b [Caenorhabditis elegans]
 gi|14530626|emb|CAC42368.1| Protein GLY-5, isoform b [Caenorhabditis elegans]
          Length = 623

 Score =  254 bits (649), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 149/328 (45%), Positives = 189/328 (57%), Gaps = 74/328 (22%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + W++PLLD + R+ + VV P+I  I D+TFE        TS     +GGFDW LQ
Sbjct: 271 CECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTS-----VGGFDWGLQ 325

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWH+IPER+RK      +PV +PTMAGGLFSIDK +FEKLGTYD GFDIWGGENLELSF
Sbjct: 326 FNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSF 385

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                    +W   M GG   I         F K   Y   +G ++  
Sbjct: 386 K--------------------IW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLK 422

Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+      +K           GDFGD++SRK+LR +LGCKSFKWYL         
Sbjct: 423 RNSIRLAEVWLDDYKTYYYERINNQLGDFGDISSRKKLREDLGCKSFKWYLDNIYPELFV 482

Query: 211 --------EVSNDW--SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   E+ N    +  CID   KP+   K VG+Y CH QGGNQ+WM+SK GEIRRDE
Sbjct: 483 PGESVAKGEMRNAGGKNRQCIDY--KPSG-GKTVGMYQCHNQGGNQYWMLSKDGEIRRDE 539

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYDY 288
           +C+DYAG DV+++PCHG KGNQ + Y++
Sbjct: 540 SCVDYAGSDVMVFPCHGMKGNQEWRYNH 567


>gi|3047191|gb|AAC13671.1| GLY5a [Caenorhabditis elegans]
          Length = 623

 Score =  254 bits (648), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 149/328 (45%), Positives = 189/328 (57%), Gaps = 74/328 (22%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + W++PLLD + R+ + VV P+I  I D+TFE        TS     +GGFDW LQ
Sbjct: 271 CECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTS-----VGGFDWGLQ 325

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWH+IPER+RK      +PV +PTMAGGLFSIDK +FEKLGTYD GFDIWGGENLELSF
Sbjct: 326 FNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGENLELSF 385

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                    +W   M GG   I         F K   Y   +G ++  
Sbjct: 386 K--------------------IW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLK 422

Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+      +K           GDFGD++SRK+LR +LGCKSFKWYL         
Sbjct: 423 RNSIRLAEVWLDDYKTYYYERINNQLGDFGDISSRKKLREDLGCKSFKWYLDNIYPELFV 482

Query: 211 --------EVSNDW--SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   E+ N    +  CID   KP+   K VG+Y CH QGGNQ+WM+SK GEIRRDE
Sbjct: 483 PGESVAKGEMRNAGGKNRQCIDY--KPSG-GKTVGMYQCHNQGGNQYWMLSKDGEIRRDE 539

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYDY 288
           +C+DYAG DV+++PCHG KGNQ + Y++
Sbjct: 540 SCVDYAGSDVMVFPCHGMKGNQEWRYNH 567


>gi|195488108|ref|XP_002092174.1| GE14045 [Drosophila yakuba]
 gi|194178275|gb|EDW91886.1| GE14045 [Drosophila yakuba]
          Length = 684

 Score =  251 bits (642), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 152/360 (42%), Positives = 188/360 (52%), Gaps = 105/360 (29%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARNSS VV P+I  I D+T E  +       S    +GGFDWNLQ
Sbjct: 304 CECTEGWLEPLLDRIARNSSTVVCPVIDVINDETLEYHY-----RDSGGVNVGGFDWNLQ 358

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+WH +PERERKRH + AEPV++PTMAGGLFSID+ FF++LGTYDSGFDIWGGENLELSF
Sbjct: 359 FSWHPVPERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSF 418

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   SG ++  
Sbjct: 419 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 455

Query: 177 GENLELSF-----------------KGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+                  KGD+GDV+ R++LR +L CKSFKWYL         
Sbjct: 456 KNSVRLAEVWMDEYSQYYYHRIGNDKGDWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFI 515

Query: 211 --------EVSNDWSG--MCIDSACKPTDMHKPVGLYPCHK------------------- 241
                   E+ N   G   C+D+        K VG YPCH+                   
Sbjct: 516 PGDSVAHGEIRNLGYGGRTCLDAPAGKKHQKKAVGTYPCHRQGGNQIANMQHGMCLDAKE 575

Query: 242 ---------------QGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
                          QGGNQ+WM+SK GEIRRD++CLDYAG DV L+ CHG KGNQ++ Y
Sbjct: 576 KSEEETPVSIYECHGQGGNQYWMLSKAGEIRRDDSCLDYAGKDVTLFGCHGGKGNQFWTY 635


>gi|308485401|ref|XP_003104899.1| CRE-GLY-5 protein [Caenorhabditis remanei]
 gi|308257220|gb|EFP01173.1| CRE-GLY-5 protein [Caenorhabditis remanei]
          Length = 685

 Score =  251 bits (641), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 149/360 (41%), Positives = 191/360 (53%), Gaps = 102/360 (28%)

Query: 7   QKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNW 66
           +KRW++PLLD + R+ + VV P+I  I D+TFE        TS     +GGFDW LQFNW
Sbjct: 294 KKRWIEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTS-----VGGFDWGLQFNW 348

Query: 67  HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFN 126
           H+IPER+RK    A +PV +PTMAGGLFSIDK +FEKLGTYD GFDIWGGENLELSFK  
Sbjct: 349 HSIPERDRKNRTRAIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSFKVR 408

Query: 127 WHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWGGEN 179
                           + +W   M GG   I         F K   Y   +G ++    +
Sbjct: 409 ----------------KCIW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLKRNS 449

Query: 180 LELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL------------ 210
           + L+      +K           GDFGDV++RK+LR +LGCKSFKWYL            
Sbjct: 450 IRLAEVWLDDYKTYYYERINNQLGDFGDVSARKKLRSDLGCKSFKWYLDNIYPELFVPGE 509

Query: 211 -----EVSND-------------------------------------WSGMCIDSACKPT 228
                EV N                                       +  C+DSA    
Sbjct: 510 SVAKGEVRNSAVQPARCLDCMVGRHEKNRPVGTYQCHGQGGNQLRNAQTSQCLDSAVGDE 569

Query: 229 DMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEYDY 288
             +K +  YPCH+QGGNQ+WM+SK GEIRRDE+C+DYAG DV+++PCHG KGNQ + Y++
Sbjct: 570 VENKAITPYPCHEQGGNQYWMLSKDGEIRRDESCVDYAGTDVMVFPCHGMKGNQEWRYNH 629


>gi|391342054|ref|XP_003745339.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like [Metaseiulus occidentalis]
          Length = 641

 Score =  249 bits (635), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 150/329 (45%), Positives = 183/329 (55%), Gaps = 77/329 (23%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLLD +A   ++VV P+I  I D TFE  +P  R  + Y   +GGFDWNLQ
Sbjct: 295 CECSTGWLEPLLDRIAEADTNVVCPVIDVISDSTFE--YPHRR--AGYTVNVGGFDWNLQ 350

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+WH++P+R++   K +   V +PTMAGGLFSI KA+FEKLG YDSGFDIWG ENLELSF
Sbjct: 351 FSWHSLPQRDKDARKQSWSAVPSPTMAGGLFSISKAYFEKLGLYDSGFDIWGAENLELSF 410

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFD--- 173
           K                    VW   M GG   I         F K   Y    G +   
Sbjct: 411 K--------------------VW---MCGGRLEIVPCSHVGHVFRKRSPYKWLKGVNVLK 447

Query: 174 --------IWGGENLELSFK------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
                   +W  E  +  F       GD+GD++ R ELRR+L CKSF WY+         
Sbjct: 448 KNSVRLAKVWMDEYAQYYFDRIGPDLGDYGDISERVELRRSLNCKSFDWYVKNIYPDLFI 507

Query: 211 --------EVSNDWSGM----CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRR 258
                   EV N  SG     C+DSA     +H  V +YPCH QGGNQ+W+ SK GEIRR
Sbjct: 508 PGDAAASGEVRN--SGFERKWCLDSAAT---VHATVSVYPCHGQGGNQYWLFSKTGEIRR 562

Query: 259 DEACLDYAGGDVILYPCHGSKGNQYFEYD 287
           DE CLDY+GGDV+LY CHGSKGNQY+ YD
Sbjct: 563 DELCLDYSGGDVVLYSCHGSKGNQYWRYD 591


>gi|443720284|gb|ELU10082.1| hypothetical protein CAPTEDRAFT_93071, partial [Capitella teleta]
          Length = 518

 Score =  245 bits (626), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 139/328 (42%), Positives = 182/328 (55%), Gaps = 73/328 (22%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLLD +++N S+VV+P+I  I DDT + ++   + TS     +GGFDWNLQ
Sbjct: 159 CECTMGWLEPLLDRISQNKSNVVTPVIDVINDDTIQYQYSSAKSTS-----VGGFDWNLQ 213

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWH IP+ E+KR K+  +PV +PTMAGGLFSI + +FE LGTYD G DIWGGENLELSF
Sbjct: 214 FNWHGIPDHEKKRRKSDVDPVRSPTMAGGLFSISREYFEYLGTYDPGMDIWGGENLELSF 273

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           +                    +W   M GG   I         F K   Y   +G ++  
Sbjct: 274 R--------------------IW---MCGGSLDIAPCSHVGHIFRKRSPYSWKTGVNVVK 310

Query: 177 GENLELSFK-----------------GDFGDVTSRKELRRNLGCKSFKWYLE-------- 211
             ++ L+                   GD+GDV++RK LR  L CKSFKWYL+        
Sbjct: 311 KNSIRLAEVWLDEFSKYYYERFNYDLGDYGDVSARKALRERLHCKSFKWYLDNIYPDLFI 370

Query: 212 -----VSNDWSGM--------CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRR 258
                 S + +G+        C+DSA      +K + L+PCH  GGNQ+WM+SK GEIRR
Sbjct: 371 PGESLASGEVNGVFNSQSQPACLDSAADKKAYNKAIKLWPCHNMGGNQYWMLSKSGEIRR 430

Query: 259 DEACLDYAGGDVILYPCHGSKGNQYFEY 286
           DE C DYAG  V++YPCH  KGNQ + Y
Sbjct: 431 DEGCFDYAGQFVMIYPCHAMKGNQEWIY 458


>gi|393908333|gb|EFO20718.2| glycosyl transferase [Loa loa]
          Length = 622

 Score =  244 bits (624), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 146/326 (44%), Positives = 184/326 (56%), Gaps = 72/326 (22%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + W++PLLD + +N   VV P+I  I D+TFE  +     T+     +GGFDW+LQ
Sbjct: 271 CECLEGWMEPLLDRIKKNPKTVVCPVIDVIDDNTFEYHYSKAYFTN-----VGGFDWSLQ 325

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHAIPE++RK  ++  +PV +PTMAGGLFSID+ FFEKLG+YD G DIWGGENLELSF
Sbjct: 326 FNWHAIPEKDRKGRRDI-DPVKSPTMAGGLFSIDRTFFEKLGSYDPGLDIWGGENLELSF 384

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K  W                      M GG+  I         F K   Y   SG ++  
Sbjct: 385 K-TW----------------------MCGGILEIVPCSHVGHIFRKRSPYKWLSGVNVLK 421

Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+      +K           GDFGDV+SRK LR  L CKSFKWYL         
Sbjct: 422 RNSVRLAEVWMDEYKKYYYERINNNLGDFGDVSSRKALREKLQCKSFKWYLDNVYPELFV 481

Query: 211 --------EVSNDWSGM--CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                   E+ N   G   C+D A          GLY CHK+GGNQ+WM+SK GEIRRDE
Sbjct: 482 PGDAIGKGEIRNRGGGSKNCLDWASHGRQRSVNAGLYWCHKKGGNQYWMLSKDGEIRRDE 541

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEY 286
           +C+DYAG DV++YPCHG KGNQ ++Y
Sbjct: 542 SCIDYAGVDVMVYPCHGMKGNQEWKY 567


>gi|312082212|ref|XP_003143351.1| glycosyl transferase [Loa loa]
          Length = 580

 Score =  241 bits (614), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 148/350 (42%), Positives = 187/350 (53%), Gaps = 98/350 (28%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + W++PLLD + +N   VV P+I  I D+TFE  +     T+     +GGFDW+LQ
Sbjct: 252 CECLEGWMEPLLDRIKKNPKTVVCPVIDVIDDNTFEYHYSKAYFTN-----VGGFDWSLQ 306

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHAIPE++RK  ++  +PV +PTMAGGLFSID+ FFEKLG+YD G DIWGGENLELSF
Sbjct: 307 FNWHAIPEKDRKGRRDI-DPVKSPTMAGGLFSIDRTFFEKLGSYDPGLDIWGGENLELSF 365

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K  W                      M GG+  I         F K   Y   SG ++  
Sbjct: 366 K-TW----------------------MCGGILEIVPCSHVGHIFRKRSPYKWLSGVNVLK 402

Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------------EVS 213
             ++ L+ +GDFGDV+SRK LR  L CKSFKWYL                       EV+
Sbjct: 403 RNSVRLA-EGDFGDVSSRKALREKLQCKSFKWYLDNVYPELFVPGDAIGKGEIRNKGEVA 461

Query: 214 NDWSGMCIDSACKPTDMHKPV-------------------------------------GL 236
            D    C+DS     D+ K V                                     GL
Sbjct: 462 GDVVQHCLDSEVG-EDIQKVVIAFPCHRNGGNQIRNRGGGSKNCLDWASHGRQRSVNAGL 520

Query: 237 YPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
           Y CHK+GGNQ+WM+SK GEIRRDE+C+DYAG DV++YPCHG KGNQ ++Y
Sbjct: 521 YWCHKKGGNQYWMLSKDGEIRRDESCIDYAGVDVMVYPCHGMKGNQEWKY 570


>gi|170572320|ref|XP_001892064.1| glycosyl transferase, group 2 family protein [Brugia malayi]
 gi|158602953|gb|EDP39125.1| glycosyl transferase, group 2 family protein [Brugia malayi]
          Length = 576

 Score =  236 bits (603), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 148/366 (40%), Positives = 188/366 (51%), Gaps = 112/366 (30%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + W++PLLD + RN   VV P+I  I D+TFE  +     T+     +GGFDW+LQ
Sbjct: 185 CECLEGWVEPLLDRIKRNPKTVVCPVIDVIDDNTFEYHYSKAYFTN-----VGGFDWSLQ 239

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWHAIPE++RK  ++  +PV +PTMAGGLFSID+ FFE+LG+YD G DIWGGENLELSF
Sbjct: 240 FNWHAIPEKDRKGRRDI-DPVKSPTMAGGLFSIDRTFFEELGSYDPGLDIWGGENLELSF 298

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                    +W   M GG+  I         F K   Y   SG ++  
Sbjct: 299 K--------------------IW---MCGGILEIVPCSHVGHIFRKRSPYKWRSGVNVLK 335

Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             ++ L+      +K           GDFGDV+SRK LR+ L CKSFKWYL         
Sbjct: 336 RNSVRLAEVWMDEYKKYYYERINNNLGDFGDVSSRKALRKKLQCKSFKWYLDNVYPELFV 395

Query: 211 --------------EVSNDWSGMCIDS--------------------------------- 223
                         EV+ D    C+DS                                 
Sbjct: 396 PGDAIGKGEIRNKGEVAGDVVQHCLDSEVGEDIQKVVIAYPCHKSGGNQIRNRGGRSKNC 455

Query: 224 ---ACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKG 280
              A         VGLY CHK+GGNQ+WM+SK GEIRRDE+C+DYAG DV++YPCHG KG
Sbjct: 456 LDWASHGRQRSANVGLYWCHKKGGNQYWMLSKDGEIRRDESCIDYAGADVMVYPCHGMKG 515

Query: 281 NQYFEY 286
           NQ ++Y
Sbjct: 516 NQEWKY 521


>gi|339239855|ref|XP_003375853.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
           spiralis]
 gi|316975462|gb|EFV58902.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
           spiralis]
          Length = 625

 Score =  236 bits (602), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 137/312 (43%), Positives = 175/312 (56%), Gaps = 55/312 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL PLL  +  N S+VV P+I  I DDTF+       +T+     +GGFDWNLQ
Sbjct: 298 CECLEGWLPPLLSRIKENWSNVVCPVIDVIDDDTFKYHCGKSWMTN-----VGGFDWNLQ 352

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWH IPER RK   +   PV +PTMAGGLFSIDK +F+ LGTYD GFDIWGGENLELSF
Sbjct: 353 FNWHPIPERVRKSRSDPTAPVESPTMAGGLFSIDKQYFQHLGTYDPGFDIWGGENLELSF 412

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                    VW   M GG   I         F K   Y    G ++  
Sbjct: 413 K--------------------VW---MCGGKLEIVPCSHVGHIFRKRSPYKWRPGVNVVK 449

Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------------VSNDWSGM---- 219
              + L+ +G+FGDV+ R  LR+ L C SF+WY++                +   M    
Sbjct: 450 RNTVRLA-EGEFGDVSDRIALRQRLNCSSFEWYIKNVYPELFVPGNSIAKGEIRCMGQNK 508

Query: 220 --CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHG 277
             C+D A    + +KP+ +YPCH +GGNQ+WM+S  GEIRRDE+C+DYAG  V L  CHG
Sbjct: 509 RHCLDFASGRKEHNKPISMYPCHGEGGNQYWMLSPTGEIRRDESCVDYAGQKVFLSGCHG 568

Query: 278 SKGNQYFEYDYK 289
            KGNQ ++Y++K
Sbjct: 569 LKGNQEWKYNFK 580


>gi|341889853|gb|EGT45788.1| hypothetical protein CAEBREN_10062 [Caenorhabditis brenneri]
          Length = 597

 Score =  234 bits (597), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 135/309 (43%), Positives = 172/309 (55%), Gaps = 70/309 (22%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + W++PLLD + R+ + VV P+I  I D+TFE        TS     +GGFDW LQ
Sbjct: 279 CECMEGWIEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTS-----VGGFDWGLQ 333

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWH+IPER+RK    A +PV +PTMAGGLFSIDK +FEKLGTYD GFDIWGGENLELSF
Sbjct: 334 FNWHSIPERDRKNRTRAIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSF 393

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                    +W   M GG   I         F K   Y   +G ++  
Sbjct: 394 K--------------------IW---MCGGTLEIVPCSHVGHVFRKRSPYKWRTGVNVLK 430

Query: 177 GENLELS------FK-----------GDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGM 219
             ++ L+      +K           GDFGDV++RK+LR +LGCKSFKWYL         
Sbjct: 431 RNSIRLAEVWLDDYKTYYYERINNQLGDFGDVSARKKLRSDLGCKSFKWYL--------- 481

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSK 279
                    D   P    P       ++WM+SK GEIRRDE+C+DYAG DV+++PCHG K
Sbjct: 482 ---------DNIYPELFVPGESVAKGEYWMLSKDGEIRRDESCVDYAGSDVMVFPCHGMK 532

Query: 280 GNQYFEYDY 288
           GNQ + Y++
Sbjct: 533 GNQEWRYNH 541


>gi|405967231|gb|EKC32417.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
          Length = 570

 Score =  228 bits (582), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 135/331 (40%), Positives = 177/331 (53%), Gaps = 61/331 (18%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           WL+PLLD +A +  HVV P + NI DDT E R       S+    +G FDW L F W  +
Sbjct: 196 WLEPLLDRIAEDKRHVVYPQMPNIKDDTLEFR-----AFSARNIQVGRFDWQLIFRWMEL 250

Query: 70  PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW-- 127
           PE   K  K+   P  +PTMAGGLFSI + +F +LGTYD G DIWGGENLELSF+  W  
Sbjct: 251 PEYINKTRKSFISPTRSPTMAGGLFSISREYFTELGTYDPGMDIWGGENLELSFRV-WMC 309

Query: 128 -------------HAIPERERKRHKNAAEPVWTPTMAGGLFSID--KAFFEKLGTYDSG- 171
                        H   +R   + +     V   ++      +D  K ++ +   YD G 
Sbjct: 310 GGTLEIIPCSHVGHIFRKRSPYKWRTGVNVVKKNSIRLAEVWMDEYKNYYYERFNYDLGD 369

Query: 172 ------------------FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL--- 210
                             FD W  +N+     GD+GDVT RK+LR  L C SF W++   
Sbjct: 370 YGDVTDRKKLRERLQCHSFD-WFVKNVYPDLFGDYGDVTDRKKLRERLQCHSFDWFVKNV 428

Query: 211 --------------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI 256
                         E+ +    MCIDSA    + HKPV ++PCH QGGNQ+WM+SK+GEI
Sbjct: 429 YPDLFVPGEAIASGEIRSKAKPMCIDSAVDNHNYHKPVNMWPCHNQGGNQYWMLSKNGEI 488

Query: 257 RRDEACLDYAGGD-VILYPCHGSKGNQYFEY 286
           RRD+ CLDY+GG+ VI+YPCHG KGNQ ++Y
Sbjct: 489 RRDDGCLDYSGGESVIVYPCHGQKGNQEWQY 519



 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 64/209 (30%), Positives = 94/209 (44%), Gaps = 40/209 (19%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           WL+PLLD +A +   VV P+I NI  +T   +       S  ++ +GGFDW+L F W + 
Sbjct: 104 WLEPLLDRIATDRKKVVCPVIDNILAETLYFQ-------SLNQYSVGGFDWSLVFRWKSA 156

Query: 70  PERERKRHKNAAEPVWTPTMAGGLFSID-------------KAFFEKLGTYDSG------ 110
               R  +      +    +A  +   D             +   +++            
Sbjct: 157 KPHNRYYNSQNKTSLRAIRLARTIARSDSGGKAARGNPGWLEPLLDRIAEDKRHVVYPQM 216

Query: 111 ---------FDIWGGENLEL-----SFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFS 156
                    F  +   N+++        F W  +PE   K  K+   P  +PTMAGGLFS
Sbjct: 217 PNIKDDTLEFRAFSARNIQVGRFDWQLIFRWMELPEYINKTRKSFISPTRSPTMAGGLFS 276

Query: 157 IDKAFFEKLGTYDSGFDIWGGENLELSFK 185
           I + +F +LGTYD G DIWGGENLELSF+
Sbjct: 277 ISREYFTELGTYDPGMDIWGGENLELSFR 305



 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 47/82 (57%), Gaps = 7/82 (8%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELR-FPPGRLTSSYKFFIGGFDWNL 62
           C++   WL+PLL  +A +  HVV+P+I NI DDT +   F P ++       +G FDW+L
Sbjct: 33  CKLCIGWLEPLLGRIAEDKRHVVAPVIGNINDDTLQFAWFNPDQI------HVGKFDWDL 86

Query: 63  QFNWHAIPERERKRHKNAAEPV 84
            FNW  IP   + +  +  EP+
Sbjct: 87  TFNWMPIPSYVKDKMNSWLEPL 108


>gi|312083982|ref|XP_003144087.1| polypeptide N-acetylgalactosaminyltransferase 5 [Loa loa]
 gi|307760750|gb|EFO19984.1| polypeptide N-acetylgalactosaminyltransferase 5 [Loa loa]
          Length = 682

 Score =  227 bits (579), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 133/329 (40%), Positives = 179/329 (54%), Gaps = 71/329 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE   RWL+PLLD +A+NS++VV+P+I     DT  L      L+S  +  +GGF+W L 
Sbjct: 326 CECMNRWLEPLLDRIAQNSTNVVTPVI-----DTINLETLQYHLSSHRRLSVGGFNWGLV 380

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNWH +P+R+ +  K+  +P+ +PTMAGGLFSID+ +FEKLG YD GFDIWG ENLE+SF
Sbjct: 381 FNWHILPDRDYQAMKSRIDPIPSPTMAGGLFSIDRGYFEKLGGYDPGFDIWGSENLEISF 440

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                    +W   M GG   +         F K   Y    G ++  
Sbjct: 441 K--------------------IW---MCGGRLEVVPCSHVGHIFRKKSPYKWRKGINVLQ 477

Query: 177 GENLELS------FKG-----------DFGDVTSRKELRRNLGCKSFKWYLE-------- 211
             N+ L+      +K            DFGDV+ RK+LR +L C SFKWYL+        
Sbjct: 478 RNNIRLAEVWLDDYKEIYYNRINHKLVDFGDVSERKKLREHLKCHSFKWYLDNVFPDLFL 537

Query: 212 -----VSNDWSGM-----CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA 261
                 S +   +     C+D       ++  V  YPCH QGGNQFWM+SK GEIRRDE 
Sbjct: 538 PSEAIASGEIRNLGNQKYCVDHDVGRNAVNDSVIPYPCHLQGGNQFWMLSKSGEIRRDEY 597

Query: 262 CLDYAG-GDVILYPCHGSKGNQYFEYDYK 289
           C+DY G G  + Y CHGSKGNQ ++Y+++
Sbjct: 598 CIDYTGRGSPVTYECHGSKGNQLWDYNHE 626


>gi|194882445|ref|XP_001975321.1| GG22251 [Drosophila erecta]
 gi|190658508|gb|EDV55721.1| GG22251 [Drosophila erecta]
          Length = 721

 Score =  218 bits (555), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 148/399 (37%), Positives = 189/399 (47%), Gaps = 145/399 (36%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARNS+ VV P+I  I D+T E  +       S    +GGFDWNLQ
Sbjct: 303 CECTEGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHY-----RDSGGVNVGGFDWNLQ 357

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+WH +PERERKRH + AEPV++PTMAGGLFSID+ FF++LGTYDSGFDIWGGENLELSF
Sbjct: 358 FSWHPVPERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSF 417

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWG 176
           K                     W   M GG   I         F K   Y   SG ++  
Sbjct: 418 K--------------------TW---MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLK 454

Query: 177 GENLELSF-----------------KGDFGDVTSRKELR--------------------- 198
             ++ L+                  KGD+GDV+ R++LR                     
Sbjct: 455 KNSVRLAEVWMDEYSQYYYHRIGNDKGDWGDVSDRRKLRTDLKCKSFKWYLDNIYPELFI 514

Query: 199 ----------RNLG-----C-----------KSFKWYL-------EVSNDWSGMCIDSAC 225
                     RNLG     C           K+   Y        +++N   GMC+D+  
Sbjct: 515 PGDSVAHGEIRNLGYGGRTCLDAPAGKKHQKKAVGTYPCHRQGGNQIANVPKGMCLDAKE 574

Query: 226 KPTDMHKPVGLYPCHKQGGNQ--------------------------------------F 247
           K ++   PV +Y CH QGGNQ                                      +
Sbjct: 575 K-SEEETPVSVYECHGQGGNQVSASMSTSSELRKAGGGDSESLIPGFSISLLYGFIFQSY 633

Query: 248 WMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
           WM+SK GEIRRD++CLDYAG DV L+ CHG KGNQ++ Y
Sbjct: 634 WMLSKAGEIRRDDSCLDYAGKDVTLFGCHGGKGNQFWTY 672


>gi|443704818|gb|ELU01679.1| hypothetical protein CAPTEDRAFT_140956 [Capitella teleta]
          Length = 550

 Score =  211 bits (537), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 126/325 (38%), Positives = 176/325 (54%), Gaps = 60/325 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+P+LD + ++ SHVV+P+I  I D T    F P     S  F +GGFDW + 
Sbjct: 194 CECTPGWLEPMLDRIGQDWSHVVTPIIDVIDDKTLMYNFNP----LSRGFSVGGFDWAMG 249

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WHA+P  E++R K  ++P  +PTMAGGLF+ID+ +F  +G+YD G +IWGGENLE+SF
Sbjct: 250 FTWHALPNHEKERRKKISDPARSPTMAGGLFAIDREYFYHIGSYDPGMEIWGGENLEMSF 309

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          +P        RKR+ N               S     F +  +  +  +
Sbjct: 310 RIWMCGGTLETLPCSHVGHIFRKRNPN--------------HSAKHGNFVQRNSVRTA-E 354

Query: 174 IWGGENLELSFK------GDFGDVTSRKELRRNLGCKSFKWYLE-------VSND----- 215
           +W  E   L +       GDFGDV+ R+ LR  L CKSFKWYL+       V +D     
Sbjct: 355 VWMDEYKYLYYDRIGNHIGDFGDVSDRRALREELKCKSFKWYLDTIYPTLFVPSDAEASG 414

Query: 216 ----------WSGMCIDSA---CKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEAC 262
                      S +C+DSA    + +   K V  +PCH QGGNQ WM+S++GEIR+D+ C
Sbjct: 415 EVRCKAHFPKVSQVCLDSADIDPETSANGKEVQTWPCHGQGGNQMWMLSQNGEIRKDKGC 474

Query: 263 LDYAGGDVILYPCHGSKGNQYFEYD 287
           LDY  G + +YPCH SKG Q ++Y+
Sbjct: 475 LDYNDGKLRIYPCHSSKGPQDWKYN 499


>gi|358332242|dbj|GAA50924.1| polypeptide N-acetylgalactosaminyltransferase [Clonorchis sinensis]
          Length = 403

 Score =  198 bits (504), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 126/313 (40%), Positives = 171/313 (54%), Gaps = 47/313 (15%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           E  K WL+PLLD +  + ++VV P+I  I D T  L++   R  S     +GGFDW+L F
Sbjct: 61  ECTKGWLEPLLDRIRESETNVVVPIIEVISDKT--LQYNNARAESVQ---VGGFDWSLIF 115

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
           +WH+ P+R+++R      P+ TPTMAGGLF+I +AFF++LG YD G ++WGGENLELSFK
Sbjct: 116 HWHSPPKRDKERPGAPYSPLRTPTMAGGLFAISRAFFKRLGYYDEGMEVWGGENLELSFK 175

Query: 125 FNWHAIPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
             W    + E           R R     E  +T  +      + +A    LG +   + 
Sbjct: 176 V-WMCGGQLETIICSHIGHIFRSRSPYKWESKFTSPLRRNTARLAEAV---LGPFAKFYH 231

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDW 216
              G     S K DFGDV+ RK +   L C SF WYL                 ++ ++ 
Sbjct: 232 SQSG-----SRKIDFGDVSERKAILERLKCHSFDWYLKNVYPEFFVPTDSVAHGDIESEA 286

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG---GDVILY 273
              CIDS  K  D    VG++PCH++GGNQ+W+MSK GEIRRD  C D AG   G V L+
Sbjct: 287 GPHCIDSPLK-GDGKVIVGMWPCHREGGNQYWLMSKLGEIRRDNKCWD-AGIEVGRVALF 344

Query: 274 PCHGSKGNQYFEY 286
            CHG +GNQ+F Y
Sbjct: 345 DCHGVRGNQHFVY 357


>gi|198415713|ref|XP_002128877.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
           1 [Ciona intestinalis]
          Length = 573

 Score =  196 bits (497), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 117/312 (37%), Positives = 169/312 (54%), Gaps = 42/312 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A++ + VV P+I  I D+TFE                GGF+W L 
Sbjct: 225 CECTEGWLEPLLSEIAKDRTTVVCPIIDVISDETFEFMV-------GSDMTYGGFNWKLN 277

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV +PTMAGGLFSIDK++FE+LGTYD+G DIWGGENLE+S
Sbjct: 278 FRWYPVPQREMDRRKGDRTLPVRSPTMAGGLFSIDKSYFEELGTYDAGMDIWGGENLEIS 337

Query: 123 FKFNWHA-----IPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDI 174
           F+  W       I       H    A P   P   G + + +     +  + ++ + F I
Sbjct: 338 FRI-WQCGGTLLIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLAEVWMDSFKNFFYI 396

Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWS 217
                L    K ++GD++ R  LR  L CKSFKWYL                 E+ N+  
Sbjct: 397 ITPGVL----KQEYGDISERVRLREKLQCKSFKWYLENIYPDSQIPGEYYSLGEIRNEEG 452

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPC 275
           G+C+D+  +  +    VG++ CH+ GGNQ W  + + E+R D+ CLD +  GG +++  C
Sbjct: 453 GLCLDTMGRKEN--DKVGIFNCHEMGGNQVWAYTGNQELRCDDICLDASKVGGPIMMVKC 510

Query: 276 HGSKGNQYFEYD 287
           H  +GNQ +EYD
Sbjct: 511 HHMRGNQLWEYD 522


>gi|256071383|ref|XP_002572020.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
          Length = 697

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 122/311 (39%), Positives = 159/311 (51%), Gaps = 50/311 (16%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           WL+PLLD +A NSS VV P+I+ I D T ++ F       +    +GGFDW+L F WH  
Sbjct: 351 WLEPLLDRIAYNSSIVVVPVISTINDKTLKMNF-----LKADNVQVGGFDWSLTFRWHEQ 405

Query: 70  PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA 129
            ER+R R      PV +PTMAGGLF+I + +F  LG YDSG +IWGGENLELSFK  W  
Sbjct: 406 TERDRNRSGAPYSPVRSPTMAGGLFAISREYFSHLGKYDSGMEIWGGENLELSFKV-WMC 464

Query: 130 IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD-------SGFDIWGGE---- 178
                              ++ G +F     +   +   D          D+W  +    
Sbjct: 465 ----------GGILETVVCSLVGHIFRGRSPYKWNVNVKDPLKRNLLRLADVWLDDYKRF 514

Query: 179 -NLELSFKG-DFGDVTSRKELRRNLGCKSFKWYLE--------VSNDWSGMCIDSACKPT 228
               + FK  DFGDV+ RK LR  L C+SF WYL          S   +   I+SA  P 
Sbjct: 515 YYARIGFKTIDFGDVSERKALREKLKCRSFDWYLTNIYPELFIPSKALASGDIESAAGPH 574

Query: 229 DMHKP-----------VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD--VILYPC 275
            +  P           + ++PCHKQGGNQFW++S + EIRRDE C D    +  + LY C
Sbjct: 575 CLDSPTPRNGDKKRTVIKIWPCHKQGGNQFWLLSPNNEIRRDEYCFDSGMKNHTIGLYRC 634

Query: 276 HGSKGNQYFEY 286
           HG+KGNQ F Y
Sbjct: 635 HGAKGNQKFTY 645


>gi|350645519|emb|CCD59759.1| n-acetylgalactosaminyltransferase, putative [Schistosoma mansoni]
          Length = 654

 Score =  194 bits (494), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 122/311 (39%), Positives = 159/311 (51%), Gaps = 50/311 (16%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           WL+PLLD +A NSS VV P+I+ I D T ++ F       +    +GGFDW+L F WH  
Sbjct: 351 WLEPLLDRIAYNSSIVVVPVISTINDKTLKMNF-----LKADNVQVGGFDWSLTFRWHEQ 405

Query: 70  PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA 129
            ER+R R      PV +PTMAGGLF+I + +F  LG YDSG +IWGGENLELSFK  W  
Sbjct: 406 TERDRNRSGAPYSPVRSPTMAGGLFAISREYFSHLGKYDSGMEIWGGENLELSFKV-WMC 464

Query: 130 IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD-------SGFDIWGGE---- 178
                              ++ G +F     +   +   D          D+W  +    
Sbjct: 465 ----------GGILETVVCSLVGHIFRGRSPYKWNVNVKDPLKRNLLRLADVWLDDYKRF 514

Query: 179 -NLELSFKG-DFGDVTSRKELRRNLGCKSFKWYLE--------VSNDWSGMCIDSACKPT 228
               + FK  DFGDV+ RK LR  L C+SF WYL          S   +   I+SA  P 
Sbjct: 515 YYARIGFKTIDFGDVSERKALREKLKCRSFDWYLTNIYPELFIPSKALASGDIESAAGPH 574

Query: 229 DMHKP-----------VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD--VILYPC 275
            +  P           + ++PCHKQGGNQFW++S + EIRRDE C D    +  + LY C
Sbjct: 575 CLDSPTPRNGDKKRTVIKIWPCHKQGGNQFWLLSPNNEIRRDEYCFDSGMKNHTIGLYRC 634

Query: 276 HGSKGNQYFEY 286
           HG+KGNQ F Y
Sbjct: 635 HGAKGNQKFTY 645


>gi|344268426|ref|XP_003406061.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
           [Loxodonta africana]
          Length = 560

 Score =  194 bits (494), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 119/315 (37%), Positives = 164/315 (52%), Gaps = 44/315 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKDDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD---IWGGEN 179
           F+   +++ E E K        V +   A  +  ++      +       D   +W G N
Sbjct: 324 FRT--YSLMELESK--NTVPYSVMSCHEAHAVVYVNSRALTHVINKKQQEDWQEVWDGMN 379

Query: 180 LELSF--------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSN 214
           L+  F        K D+GDV+ RK LR NL CK F WYL                 E+ N
Sbjct: 380 LKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRN 439

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVIL 272
             +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+
Sbjct: 440 VETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIM 497

Query: 273 YPCHGSKGNQYFEYD 287
             CH  +GNQ +EYD
Sbjct: 498 LKCHHMRGNQLWEYD 512


>gi|260788889|ref|XP_002589481.1| hypothetical protein BRAFLDRAFT_125191 [Branchiostoma floridae]
 gi|229274659|gb|EEN45492.1| hypothetical protein BRAFLDRAFT_125191 [Branchiostoma floridae]
          Length = 488

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 125/316 (39%), Positives = 167/316 (52%), Gaps = 48/316 (15%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           E  + W +PLL  +A + + VV P+I  I DDTFE         +      GGF+W L F
Sbjct: 139 ECTEGWAEPLLTRIAEDRTTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLNF 191

Query: 65  NWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
            W+ +P+RE  +R  +   P+ TPTMAGGLF+IDK++FE++GTYDSG DIWGGENLE+SF
Sbjct: 192 RWYPVPQREMDRRGGDRTMPLRTPTMAGGLFAIDKSYFEEIGTYDSGMDIWGGENLEISF 251

Query: 124 KFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENL 180
           +  W      E     H        TP T  GG   I      +L       ++W  +N 
Sbjct: 252 RI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVW-MDNF 303

Query: 181 ELSF--------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
           +  F        K D+GDVT RKELR  L CK FKWYL                 E+ N 
Sbjct: 304 KDFFYIISPGVTKVDYGDVTGRKELRDKLNCKPFKWYLENIYPDSQIPTSYHSLGEIRNV 363

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            S  CID+  +  +  + VG++ CH  GGNQ +  +K  E+R D+ CLD +  GG V+L+
Sbjct: 364 DSNQCIDNMARKEN--EKVGIFSCHGMGGNQVFSYTKEKELRTDDLCLDVSKPGGPVMLF 421

Query: 274 PCHGSKGNQYFEYDYK 289
            CH   GNQ +EYD K
Sbjct: 422 KCHHLGGNQLWEYDEK 437


>gi|390336582|ref|XP_001187912.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Strongylocentrotus purpuratus]
          Length = 490

 Score =  192 bits (488), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 122/317 (38%), Positives = 163/317 (51%), Gaps = 50/317 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL+P+L  +A + +  V P+I  I DDTF+ +             +GGF W+L 
Sbjct: 152 CEVTEGWLEPMLARIAEDRTTSVCPVIDVISDDTFQYQH-------GNDPQMGGFGWSLF 204

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +P+RE+ R K +  EPV   TMAGGLF+IDK++FE+LG YD GF+IWGGENLELS
Sbjct: 205 FKWFPVPKREQIRRKGDPTEPVRVSTMAGGLFAIDKSYFEELGQYDPGFNIWGGENLELS 264

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           FK          IP            P   P     +   +K   E          +W  
Sbjct: 265 FKLWMCGGKLEFIPCSHVGHVFRKKSPYHFPPGTNYVNKNNKRLAE----------VWLD 314

Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVS 213
           E     +       K D GD++ R  LR++L CKSFKWYL                 EV 
Sbjct: 315 EYKNFYYRISPSVAKTDPGDISDRLNLRKSLSCKSFKWYLENIYPESSWPVNYQFMGEVR 374

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA-GGDVIL 272
           N  + +C+D+  K  +    VGLY CH QGGNQ W  +K+ E+R D+ CLD A GG V++
Sbjct: 375 NTEAHVCLDTMMK--EAGNKVGLYGCHGQGGNQIWAFTKNNELRHDDLCLDVARGGPVMM 432

Query: 273 YPCHGSKGNQYFEYDYK 289
             CH   GNQ++ YD K
Sbjct: 433 LSCHMQGGNQHWNYDEK 449


>gi|405966388|gb|EKC31681.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
          Length = 815

 Score =  189 bits (481), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 89/158 (56%), Positives = 112/158 (70%), Gaps = 18/158 (11%)

Query: 147 TPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSF 206
           +PTMA GLFSI + +F +LGTYD G DIWGGENLELSF+GD+G VT RK+LR  L C SF
Sbjct: 607 SPTMARGLFSISREYFTELGTYDPGIDIWGGENLELSFRGDYGHVTDRKKLRERLQCHSF 666

Query: 207 KWYL-----------------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWM 249
            W++                 E+ +    MCIDSA    + HKPV ++PCH QGGNQ+WM
Sbjct: 667 DWFVKNVYPDLFVPGEAIASGEIRSKAKPMCIDSAVDNHNYHKPVNMWPCHNQGGNQYWM 726

Query: 250 MSKHGEIRRDEACLDYAGGD-VILYPCHGSKGNQYFEY 286
           +SK+GEIRRD+ CLDY+GG+ VI+YPCHG KGNQ ++Y
Sbjct: 727 LSKNGEIRRDDGCLDYSGGESVIVYPCHGQKGNQEWQY 764


>gi|313227425|emb|CBY22572.1| unnamed protein product [Oikopleura dioica]
          Length = 588

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 123/313 (39%), Positives = 165/313 (52%), Gaps = 42/313 (13%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           E    WL+PLL  + ++ ++V+ P+I  I DDTFE       LT S     GGF+W L F
Sbjct: 241 EASPGWLEPLLYEIKKDRTNVICPIIDVISDDTFEF------LTGS-DLTYGGFNWKLNF 293

Query: 65  NWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
            W+ +P+RE  +R  + + P+ TPTMAGGLFSIDK++F ++G+YDSG DIWGGENLE+SF
Sbjct: 294 RWYPVPQREVDRRGGDRSLPMQTPTMAGGLFSIDKSYFYEIGSYDSGMDIWGGENLEMSF 353

Query: 124 KFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGT-----YDSGFDIW 175
           +  W            H        TP T  GG   I      +L       Y   F I 
Sbjct: 354 RI-WMCGGTVLIATCSHVGHVFRKATPYTFPGGTSQIINKNNRRLAEVWMDDYKKFFYIV 412

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSG 218
                    K  +GDV+ RK LR +L CKSF+WYL                 E+ N  + 
Sbjct: 413 N----PTVMKHKYGDVSDRKTLRNDLQCKSFQWYLDNVYPDAQIPRRYKVLGEIKNTGAN 468

Query: 219 MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCH 276
           +C+D+  +  +  K VG Y CH QGGNQ +  +   EIR D+ CLD A   G V++  CH
Sbjct: 469 ICLDTMGRKEN--KKVGCYSCHGQGGNQVFSFTMDNEIRIDDLCLDVANSKGPVMMVKCH 526

Query: 277 GSKGNQYFEYDYK 289
             KGNQY+EY+ K
Sbjct: 527 HQKGNQYWEYNIK 539


>gi|405959954|gb|EKC25926.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
          Length = 569

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 112/304 (36%), Positives = 166/304 (54%), Gaps = 29/304 (9%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           E  + W +PL+D +ARN S V++P+I  I  +TF+  F     T+     +GGFDW+L F
Sbjct: 220 ECAEGWFEPLIDPIARNWSTVMTPVIDVIDKETFQYGFQAASATN-----VGGFDWSLMF 274

Query: 65  NWHAIPERERKRHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
            WH +PE E+KR +N    PV +PTMAGGLF+I + +FE +GTYD G DIWGGENLELSF
Sbjct: 275 TWHFVPETEQKRRQNKHYLPVRSPTMAGGLFAISRKYFEHIGTYDEGMDIWGGENLELSF 334

Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-TYDSGFDIWGGENL 180
           +  W            H         P   G   ++ K    ++   +   F  +  +++
Sbjct: 335 RI-WMCGGTLLTAPCSHVGHVFRHTPPYSFGPKKNVVKNNLVRMAEVWLDDFKYYYYQHI 393

Query: 181 ELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDS 223
             +  G++GDV++R+ LR NL C SF WYL                 E+ +    +C++S
Sbjct: 394 NYTL-GNYGDVSARRALRANLQCHSFDWYLVNVYPELLIPAEALYSGEIRSKAEPLCLES 452

Query: 224 ACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD-EACLDYAGGDVILYPCHGSKGNQ 282
             +   ++KP+ ++ CH Q GNQ+W+ ++ GEIR D   C+D AG  V +  CHG  GNQ
Sbjct: 453 PYRFGKINKPLTVFHCHGQKGNQYWLYTQKGEIRHDLYGCMDDAGSTVYVNSCHGLGGNQ 512

Query: 283 YFEY 286
            + Y
Sbjct: 513 KWTY 516


>gi|348526962|ref|XP_003450988.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Oreochromis niloticus]
          Length = 557

 Score =  188 bits (478), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 118/314 (37%), Positives = 161/314 (51%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  + ++   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 210 CECTTGWLEPLLARIKQDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 262

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 263 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 322

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 323 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 375

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD+TSR  LR+ L CK F WYL                 E+ N 
Sbjct: 376 KNFFYIISPGVTKVDYGDITSRTALRQKLQCKPFSWYLENIYPDSQIPRHYYSLGEIRNV 435

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V++ 
Sbjct: 436 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVMML 493

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 494 KCHHLKGNQLWEYD 507


>gi|112418488|gb|AAI21876.1| galnt13 protein [Xenopus (Silurana) tropicalis]
          Length = 483

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 120/317 (37%), Positives = 161/317 (50%), Gaps = 46/317 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 138 CECTIGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 190

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSIDK +FE+LGTYDSG DIWGGENLE+S
Sbjct: 191 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEELGTYDSGMDIWGGENLEMS 250

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  + 
Sbjct: 251 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDDF 303

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL C  F WYL                 E+ N 
Sbjct: 304 KDFFYIISPGVVKVDYGDVSERKALRENLKCNPFSWYLETVYPDSQIPRRYFSLGEIRNV 363

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 364 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 421

Query: 274 PCHGSKGNQYFEYDYKY 290
            CH  +GNQ +EYD ++
Sbjct: 422 KCHHMRGNQLWEYDAEH 438


>gi|26337335|dbj|BAC32353.1| unnamed protein product [Mus musculus]
          Length = 556

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFKCHGMGGNQVFSYTADKEIRTDDLCLDVSRLSGPVIML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|62859717|ref|NP_001017277.1| polypeptide N-acetylgalactosaminyltransferase 13 [Xenopus
           (Silurana) tropicalis]
 gi|89267464|emb|CAJ81616.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13)
           [Xenopus (Silurana) tropicalis]
          Length = 498

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 120/314 (38%), Positives = 159/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 138 CECTIGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 190

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSIDK +FE+LGTYDSG DIWGGENLE+S
Sbjct: 191 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEELGTYDSGMDIWGGENLEMS 250

Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  + 
Sbjct: 251 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDDF 303

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL C  F WYL                 E+ N 
Sbjct: 304 KDFFYIISPGVVKVDYGDVSERKALRENLKCNPFSWYLETVYPDSQIPRRYFSLGEIRNV 363

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 364 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 421

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 422 KCHHMRGNQLWEYD 435


>gi|327281385|ref|XP_003225429.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 2 [Anolis carolinensis]
          Length = 557

 Score =  187 bits (476), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 120/314 (38%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTLGWLEPLLARIKEDRKIVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 324

Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 325 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDVT RK LR NL CK F WYL                 E+ N 
Sbjct: 378 KDFFYIISPGVVKVDYGDVTVRKALRDNLKCKPFSWYLENVYPDSQIPRRYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 438 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 496 KCHHMRGNQLWEYD 509


>gi|326670471|ref|XP_002663357.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Danio rerio]
          Length = 556

 Score =  187 bits (476), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PL+  +  +   VV P+I  I D+TFE         +      GGF+W L 
Sbjct: 211 CECTTGWLEPLMARIKEDRRAVVCPIIDVISDETFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYDSG DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRTYFEEIGTYDSGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP +  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYSFPGGTGQVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       + D+GDV+SRK LR +L CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVRVDYGDVSSRKALRESLKCKPFSWYLENVYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG + CH  GGNQ +  +   EIR D+ CLD +   G V++ 
Sbjct: 437 ETNQCVDNMGRKEN--EKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDASRLNGPVVML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ FEYD
Sbjct: 495 KCHHMKGNQMFEYD 508


>gi|327281383|ref|XP_003225428.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 1 [Anolis carolinensis]
          Length = 556

 Score =  187 bits (476), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 120/314 (38%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKEDRKIVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDVT RK LR NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVTVRKALRDNLKCKPFSWYLENVYPDSQIPRRYFSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|431894826|gb|ELK04619.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Pteropus alecto]
          Length = 519

 Score =  187 bits (476), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 174 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 226

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 227 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 286

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 287 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 339

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 340 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 399

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 400 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 457

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 458 KCHHMRGNQLWEYD 471


>gi|40018588|ref|NP_954537.1| polypeptide N-acetylgalactosaminyltransferase 13 [Rattus
           norvegicus]
 gi|51315705|sp|Q6UE39.1|GLT13_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
           AltName: Full=Polypeptide GalNAc transferase 13;
           Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 13;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 13
 gi|34577141|gb|AAQ75749.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 [Rattus norvegicus]
 gi|149047803|gb|EDM00419.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_a
           [Rattus norvegicus]
 gi|149047804|gb|EDM00420.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_a
           [Rattus norvegicus]
 gi|149047805|gb|EDM00421.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_a
           [Rattus norvegicus]
          Length = 556

 Score =  187 bits (476), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLSGPVIML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|281347645|gb|EFB23229.1| hypothetical protein PANDA_007284 [Ailuropoda melanoleuca]
          Length = 516

 Score =  187 bits (476), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 166 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 218

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 219 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 278

Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 279 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 331

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 332 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 391

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 392 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLSGPVIML 449

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 450 KCHHMRGNQLWEYD 463


>gi|403258987|ref|XP_003922020.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
           [Saimiri boliviensis boliviensis]
          Length = 556

 Score =  187 bits (476), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLQCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|332251760|ref|XP_003275017.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           1 [Nomascus leucogenys]
          Length = 556

 Score =  187 bits (476), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|149639572|ref|XP_001511824.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
           [Ornithorhynchus anatinus]
          Length = 556

 Score =  187 bits (476), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 161/314 (51%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTFGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR+NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKALRQNLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|116003987|ref|NP_001070354.1| polypeptide N-acetylgalactosaminyltransferase 13 [Bos taurus]
 gi|115304963|gb|AAI23663.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Bos
           taurus]
 gi|296490573|tpg|DAA32686.1| TPA: polypeptide N-acetylgalactosaminyltransferase 13 [Bos taurus]
          Length = 556

 Score =  187 bits (476), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|296204781|ref|XP_002749478.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           1 [Callithrix jacchus]
          Length = 556

 Score =  187 bits (476), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRTYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKILRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|76677928|ref|NP_766618.2| polypeptide N-acetylgalactosaminyltransferase 13 [Mus musculus]
 gi|51315989|sp|Q8CF93.1|GLT13_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
           AltName: Full=Polypeptide GalNAc transferase 13;
           Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 13;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 13
 gi|27531011|dbj|BAC54546.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 [Mus musculus]
 gi|124297181|gb|AAI31652.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 [Mus musculus]
 gi|124297498|gb|AAI31653.1| Galnt13 protein [Mus musculus]
 gi|148694972|gb|EDL26919.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
           musculus]
 gi|148694973|gb|EDL26920.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
           musculus]
 gi|148694975|gb|EDL26922.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
           musculus]
          Length = 556

 Score =  187 bits (476), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLSGPVIML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|291391573|ref|XP_002712184.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Oryctolagus cuniculus]
          Length = 557

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 324

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 325 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 378 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 438 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 496 KCHHMRGNQLWEYD 509


>gi|15620895|dbj|BAB67811.1| KIAA1918 protein [Homo sapiens]
          Length = 516

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 171 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 223

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 224 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 283

Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 284 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 336

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 337 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 396

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 397 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 454

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 455 KCHHMRGNQLWEYD 468


>gi|27530993|dbj|BAC54545.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 [Homo sapiens]
 gi|193785960|dbj|BAG54747.1| unnamed protein product [Homo sapiens]
          Length = 556

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|74004307|ref|XP_855648.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           3 [Canis lupus familiaris]
          Length = 556

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLSGPVIML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|145309313|ref|NP_443149.2| polypeptide N-acetylgalactosaminyltransferase 13 [Homo sapiens]
 gi|114581261|ref|XP_515839.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           2 [Pan troglodytes]
 gi|297668636|ref|XP_002812536.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           1 [Pongo abelii]
 gi|297668638|ref|XP_002812537.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           2 [Pongo abelii]
 gi|397525640|ref|XP_003832767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Pan
           paniscus]
 gi|116242497|sp|Q8IUC8.2|GLT13_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
           AltName: Full=Polypeptide GalNAc transferase 13;
           Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 13;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 13
 gi|51490969|emb|CAD44533.2| polypeptide N-acetylgalactosaminyltransferase 13 [Homo sapiens]
 gi|71680339|gb|AAI01032.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
           sapiens]
 gi|71681791|gb|AAI01034.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
           sapiens]
 gi|115528820|gb|AAI01035.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
           sapiens]
 gi|119631869|gb|EAX11464.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
           isoform CRA_a [Homo sapiens]
 gi|119631870|gb|EAX11465.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
           isoform CRA_a [Homo sapiens]
 gi|380783281|gb|AFE63516.1| polypeptide N-acetylgalactosaminyltransferase 13 [Macaca mulatta]
          Length = 556

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|301766697|ref|XP_002918769.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 1 [Ailuropoda melanoleuca]
          Length = 556

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLSGPVIML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|426221079|ref|XP_004004739.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Ovis
           aries]
          Length = 556

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|327281387|ref|XP_003225430.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 3 [Anolis carolinensis]
          Length = 498

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 120/314 (38%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 138 CECTLGWLEPLLARIKEDRKIVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 190

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 191 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 250

Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 251 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 303

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDVT RK LR NL CK F WYL                 E+ N 
Sbjct: 304 KDFFYIISPGVVKVDYGDVTVRKALRDNLKCKPFSWYLENVYPDSQIPRRYFSLGEIRNV 363

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 364 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 421

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 422 KCHHMRGNQLWEYD 435


>gi|115528959|gb|AAI01033.1| GALNT13 protein [Homo sapiens]
 gi|355564904|gb|EHH21393.1| hypothetical protein EGK_04446 [Macaca mulatta]
          Length = 561

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|332251762|ref|XP_003275018.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           2 [Nomascus leucogenys]
          Length = 557

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 324

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 325 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 378 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 438 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 496 KCHHMRGNQLWEYD 509


>gi|390464496|ref|XP_003733230.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           2 [Callithrix jacchus]
          Length = 561

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRTYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKILRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|354486376|ref|XP_003505357.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Cricetulus griseus]
          Length = 497

 Score =  187 bits (475), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 152 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 204

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 205 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 264

Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 265 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 317

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 318 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 377

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 378 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 435

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 436 KCHHMRGNQLWEYD 449


>gi|301766699|ref|XP_002918770.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 2 [Ailuropoda melanoleuca]
          Length = 557

 Score =  187 bits (475), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 324

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 325 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 378 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 438 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLSGPVIML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 496 KCHHMRGNQLWEYD 509


>gi|402888363|ref|XP_003907534.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13,
           partial [Papio anubis]
          Length = 444

 Score =  187 bits (475), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 145 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 197

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 198 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 257

Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 258 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 310

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 311 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 370

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 371 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 428

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 429 KCHHMRGNQLWEYD 442


>gi|297264099|ref|XP_002798960.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Macaca mulatta]
          Length = 375

 Score =  187 bits (475), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 30  CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 82

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 83  FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 142

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 143 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 195

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 196 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 255

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 256 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 313

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 314 KCHHMRGNQLWEYD 327


>gi|410968681|ref|XP_003990830.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Felis
           catus]
          Length = 546

 Score =  187 bits (474), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 201 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 253

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 254 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 313

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 314 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 366

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 367 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 426

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 427 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLSGPVIML 484

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 485 KCHHMRGNQLWEYD 498


>gi|387017208|gb|AFJ50722.1| Polypeptide N-acetylgalactosaminyltransferase 13-like [Crotalus
           adamanteus]
          Length = 556

 Score =  187 bits (474), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 118/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTTGWLEPLLARIKEDRKIVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKALRENLKCKPFSWYLEYVYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ C+D +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCMDVSRLNGPVIML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|148223895|ref|NP_001086128.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13)
           [Xenopus laevis]
 gi|49258003|gb|AAH74234.1| MGC83963 protein [Xenopus laevis]
          Length = 556

 Score =  187 bits (474), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 120/317 (37%), Positives = 161/317 (50%), Gaps = 46/317 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTFGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSIDK +FE+LGTYDSG DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKKYFEELGTYDSGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  + 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDDF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL C  F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSERKALRENLKCNPFSWYLETVYPDSQIPRRYFSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494

Query: 274 PCHGSKGNQYFEYDYKY 290
            CH  +GNQ +EYD ++
Sbjct: 495 KCHHMRGNQLWEYDAEH 511


>gi|432932497|ref|XP_004081768.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 3 [Oryzias latipes]
          Length = 558

 Score =  187 bits (474), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +  + + VV P+I  I D+TFE         +      GGF+W L 
Sbjct: 213 CECTEGWLEPLLARIKEDRTAVVCPIIDVISDETFEY-------MAGSDMTYGGFNWKLN 265

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSIDK +FE++G+YD G DIWGGENLE+S
Sbjct: 266 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKMYFEEIGSYDPGMDIWGGENLEMS 325

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP +  GG   +      +L       ++W  E 
Sbjct: 326 FRI-WQCGGSLEIVTCSHVGHVFRKATPYSFPGGTGQVINKNNRRLA------EVWMDEF 378

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       + D+GDV+SRK LR  L CK F WYL                 E+ N 
Sbjct: 379 KDFFYIISPGVMRVDYGDVSSRKALREALKCKPFAWYLENIYPDSQIPRRYYSLGEIRNV 438

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG + CH  GGNQ +  +   EIR D+ CLD +   G V++ 
Sbjct: 439 ETNQCVDNMGRKEN--EKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVVML 496

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ FEYD
Sbjct: 497 KCHHMKGNQMFEYD 510


>gi|432932493|ref|XP_004081766.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 1 [Oryzias latipes]
          Length = 557

 Score =  187 bits (474), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +  + + VV P+I  I D+TFE         +      GGF+W L 
Sbjct: 212 CECTEGWLEPLLARIKEDRTAVVCPIIDVISDETFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSIDK +FE++G+YD G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKMYFEEIGSYDPGMDIWGGENLEMS 324

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP +  GG   +      +L       ++W  E 
Sbjct: 325 FRI-WQCGGSLEIVTCSHVGHVFRKATPYSFPGGTGQVINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       + D+GDV+SRK LR  L CK F WYL                 E+ N 
Sbjct: 378 KDFFYIISPGVMRVDYGDVSSRKALREALKCKPFAWYLENIYPDSQIPRRYYSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG + CH  GGNQ +  +   EIR D+ CLD +   G V++ 
Sbjct: 438 ETNQCVDNMGRKEN--EKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVVML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ FEYD
Sbjct: 496 KCHHMKGNQMFEYD 509


>gi|432932495|ref|XP_004081767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 2 [Oryzias latipes]
          Length = 556

 Score =  186 bits (473), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +  + + VV P+I  I D+TFE         +      GGF+W L 
Sbjct: 211 CECTEGWLEPLLARIKEDRTAVVCPIIDVISDETFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSIDK +FE++G+YD G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKMYFEEIGSYDPGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP +  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYSFPGGTGQVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       + D+GDV+SRK LR  L CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVMRVDYGDVSSRKALREALKCKPFAWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG + CH  GGNQ +  +   EIR D+ CLD +   G V++ 
Sbjct: 437 ETNQCVDNMGRKEN--EKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVVML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ FEYD
Sbjct: 495 KCHHMKGNQMFEYD 508


>gi|395846602|ref|XP_003795992.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           1 [Otolemur garnettii]
          Length = 556

 Score =  186 bits (472), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 118/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ ++YD
Sbjct: 495 KCHHMRGNQLWDYD 508


>gi|432908535|ref|XP_004077909.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Oryzias latipes]
          Length = 557

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 161/314 (51%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  + ++   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 210 CECTLGWLEPLLTRIKQDKRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 262

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 263 FRWYPVPQREMDRRKGDRTIPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 322

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 323 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 375

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD+++R  LR+ L CK F WYL                 E+ N 
Sbjct: 376 KNFFYIISPGVTKVDYGDISTRTSLRQKLQCKPFSWYLENIYPDSQIPRHYYSLGEIRNV 435

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V++ 
Sbjct: 436 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVMML 493

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 494 KCHHLKGNQLWEYD 507


>gi|326674972|ref|XP_687472.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           2 [Danio rerio]
          Length = 557

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 210 CECTTGWLEPLLSRIKLDKKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 262

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 263 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 322

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 323 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 375

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD+++R  LR+ L CK F WYL                 E+ N 
Sbjct: 376 KNFFYIISPGVTKVDYGDISTRTSLRQRLQCKPFSWYLENVYPDSQIPRHYYSLGEIRNV 435

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V++ 
Sbjct: 436 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVMML 493

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 494 KCHHLKGNQLWEYD 507


>gi|395846604|ref|XP_003795993.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           2 [Otolemur garnettii]
          Length = 558

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 118/314 (37%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 213 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 265

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 266 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 325

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 326 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 378

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 379 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 438

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 439 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIML 496

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ ++YD
Sbjct: 497 KCHHMRGNQLWDYD 510


>gi|33440465|gb|AAH56215.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 [Mus musculus]
          Length = 559

 Score =  186 bits (471), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 118/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTAGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LRR L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|56554527|pdb|1XHB|A Chain A, The Crystal Structure Of Udp-Galnac: Polypeptide Alpha-N-
           Acetylgalactosaminyltransferase-T1
          Length = 472

 Score =  186 bits (471), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 118/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 125 CECTAGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 177

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 178 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 237

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 238 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 290

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LRR L CK F WYL                 E+ N 
Sbjct: 291 KNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 350

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 351 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 408

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 409 KCHHLKGNQLWEYD 422


>gi|237874259|ref|NP_038842.3| polypeptide N-acetylgalactosaminyltransferase 1 [Mus musculus]
 gi|237874270|ref|NP_001153876.1| polypeptide N-acetylgalactosaminyltransferase 1 [Mus musculus]
 gi|13878613|sp|O08912.1|GALT1_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
           AltName: Full=Polypeptide GalNAc transferase 1;
           Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 1;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 1; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 1
           soluble form
 gi|2149049|gb|AAB58477.1| polypeptide GalNAc transferase-T1 [Mus musculus]
 gi|60552620|gb|AAH90962.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 [Mus musculus]
          Length = 559

 Score =  186 bits (471), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 118/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTAGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LRR L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|291230380|ref|XP_002735141.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Saccoglossus kowalevskii]
          Length = 510

 Score =  186 bits (471), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 125/330 (37%), Positives = 165/330 (50%), Gaps = 61/330 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A + ++VV P+I  I D TFE       +  S    +GGFDW L 
Sbjct: 144 CECTRGWLEPLLARIAEDKTNVVCPVINIISDTTFEF------INGSDATQVGGFDWRLI 197

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           FNWH +P RE +R K +   PV +PTMAGGLFSI K FF +LGTYD GFD+WG ENLELS
Sbjct: 198 FNWHVVPHRELQRIKFDRTSPVRSPTMAGGLFSIHKEFFTRLGTYDPGFDVWGAENLELS 257

Query: 123 FKFNWHAIPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFE--KLGTYD 169
           FK  W      E           RKR  +   P     M      + + + +  K   Y+
Sbjct: 258 FK-TWMCGGTLEFVPCSHVGHVFRKRSPHRFPPTTHNVMQRNNRRLAEVWLDEYKYLYYN 316

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------V 212
           +  +I          K D GD++ R  LR  L CKSFKWYLE                 +
Sbjct: 317 AHPEI---------LKTDPGDISERLALRERLQCKSFKWYLENVYPENVFPIHFYGVVTI 367

Query: 213 SNDWSGMCID-----------SACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA 261
            +  SG C+D           +    TD  + V L+ CH  G  Q ++ +K  EIR ++ 
Sbjct: 368 KHIISGNCLDYGNLKMRGKQPTKAGKTDSGQKVELWKCHG-GPVQTFIYTKAKEIRLEKE 426

Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEYDYK 289
           CLDY+   G + LYPCHG  GNQ + Y+ K
Sbjct: 427 CLDYSAITGSLTLYPCHGQGGNQVWGYNKK 456


>gi|148664577|gb|EDK96993.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1, isoform CRA_a [Mus
           musculus]
 gi|148664578|gb|EDK96994.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1, isoform CRA_a [Mus
           musculus]
          Length = 400

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 118/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 53  CECTAGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 105

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 106 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 165

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 166 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 218

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LRR L CK F WYL                 E+ N 
Sbjct: 219 KNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 278

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 279 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 336

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 337 KCHHLKGNQLWEYD 350


>gi|224045872|ref|XP_002187347.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
           [Taeniopygia guttata]
          Length = 559

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 118/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKADRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LRR L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|126326410|ref|XP_001373038.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
           [Monodelphis domestica]
          Length = 556

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 118/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DD FE        T+      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKESRKTVVCPIIDLISDDNFEY-------TAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++G YD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGAYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKALRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 437 ETNQCLDNMGRKDN--EKVGMFNCHGMGGNQVFSYTAEKEIRTDDFCLDVSRLSGPVIML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|57530428|ref|NP_001006381.1| polypeptide N-acetylgalactosaminyltransferase 1 [Gallus gallus]
 gi|326917238|ref|XP_003204908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Meleagris gallopavo]
 gi|53133506|emb|CAG32082.1| hypothetical protein RCJMB04_17f16 [Gallus gallus]
          Length = 559

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 118/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKADRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LRR L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|449278148|gb|EMC86104.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Columba livia]
          Length = 553

 Score =  185 bits (469), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 162/314 (51%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE       +  S K + GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKADRRTVVCPIIDVISDDTFEY------MAGSDKTY-GGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LRR L C+ F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRRKLQCRPFSWYLENVYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|351714454|gb|EHB17373.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Heterocephalus
           glaber]
          Length = 559

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  + ++   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKQDRRTVVCPIICVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|417402739|gb|JAA48205.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
           rotundus]
          Length = 559

 Score =  184 bits (466), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 118/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  + ++   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKQDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GDV SR  LR  L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDVASRIGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|358336356|dbj|GAA28182.2| polypeptide N-acetylgalactosaminyltransferase [Clonorchis sinensis]
          Length = 592

 Score =  183 bits (465), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 112/307 (36%), Positives = 153/307 (49%), Gaps = 44/307 (14%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           WL+PLL+ +  ++S+VV P+I  I D    ++      T      +GGFDW+L F WH  
Sbjct: 253 WLEPLLERIKASTSNVVVPVIEIINDQDLSMK-----ATQEASVQVGGFDWSLTFTWHLP 307

Query: 70  PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA 129
           P+R++ R      P+ +PTMAGGLF+I + FF  LG YD   ++WGGENLELSFK  W  
Sbjct: 308 PKRDQIRLGAPYSPIRSPTMAGGLFAIHRDFFAYLGYYDEEMEVWGGENLELSFK-TWMC 366

Query: 130 IPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
             + E           R R   + E   T  +   L  + + + +     D  F  +   
Sbjct: 367 GGQLETVVCSHVGHIFRSRSPYSWESKRTSPIKFNLVRLAETWLD-----DYKFLYYDSL 421

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGMCI 221
           N +L   GD+GD++SRK +R    CKSF+WYL                 ++ N  S  CI
Sbjct: 422 NFDL---GDYGDISSRKAIRERNNCKSFQWYLDTIYPELFLPTRALASGDIENMVSPHCI 478

Query: 222 DSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD--YAGGDVILYPCHGSK 279
           D           V LYPCH+Q GNQ W  +   EIRR +AC D     G V L+ CHG  
Sbjct: 479 DGVFNDQKTDNLVKLYPCHRQKGNQLWFYTNKNEIRRHDACFDGNVKPGHVGLFSCHGLG 538

Query: 280 GNQYFEY 286
           G Q+FEY
Sbjct: 539 GTQFFEY 545


>gi|410897068|ref|XP_003962021.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Takifugu rubripes]
          Length = 556

 Score =  183 bits (465), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 115/312 (36%), Positives = 158/312 (50%), Gaps = 42/312 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  + + VV P+I  I D+TFE         +      GGF+W L 
Sbjct: 211 CECTVGWLEPLLARIKEDRTAVVCPIIDVISDETFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSIDK +FE++G+YD G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEEIGSYDPGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERERKRHKNA------AEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDI 174
           F+  W      E     +       A P   P   G + + +     +  +  +   F I
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYSFPGGTGQVINKNNRRLAEVWMDDFKDFFYI 382

Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWS 217
                + +    D+GDV+SRK LR  L CK F WYL                 E+ N  +
Sbjct: 383 ISPGVMRV----DYGDVSSRKGLRDALRCKPFSWYLENIYPDSQIPRRYYSLGEIRNVET 438

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPC 275
             C+D+  +  +  + VG + CH  GGNQ +  +   EIR D+ CLD +   G V++  C
Sbjct: 439 NQCVDNMGRKEN--EKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVLMLKC 496

Query: 276 HGSKGNQYFEYD 287
           H  KGNQ FEYD
Sbjct: 497 HHMKGNQMFEYD 508


>gi|296222514|ref|XP_002757211.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           1 [Callithrix jacchus]
 gi|403265072|ref|XP_003924779.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Saimiri
           boliviensis boliviensis]
          Length = 559

 Score =  183 bits (464), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|395749824|ref|XP_002828218.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           1 [Pongo abelii]
          Length = 612

 Score =  183 bits (464), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|348519902|ref|XP_003447468.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
           [Oreochromis niloticus]
          Length = 556

 Score =  183 bits (464), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 114/312 (36%), Positives = 158/312 (50%), Gaps = 42/312 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  + + VV P+I  I D+TFE         +      GGF+W L 
Sbjct: 211 CECTVGWLEPLLARIKEDRTAVVCPIIDVISDETFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSIDK +FE++G+YD G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEEIGSYDPGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERERKRHKNA------AEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDI 174
           F+  W      E     +       A P   P   G + + +     +  +  +   F I
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYSFPGGTGQVINKNNRRLAEVWMDDFKDFFYI 382

Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWS 217
                + +    ++GDV+SRK LR  L CK F WYL                 E+ N  +
Sbjct: 383 ISPGVMRV----EYGDVSSRKALREALKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVET 438

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPC 275
             C+D+  +  +  + VG + CH  GGNQ +  +   EIR D+ CLD +   G V++  C
Sbjct: 439 NQCMDNMGRKEN--EKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVVMLKC 496

Query: 276 HGSKGNQYFEYD 287
           H  KGNQ FEYD
Sbjct: 497 HHMKGNQMFEYD 508


>gi|13242273|ref|NP_077349.1| polypeptide N-acetylgalactosaminyltransferase 1 [Rattus norvegicus]
 gi|1709559|sp|Q10473.1|GALT1_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
           AltName: Full=Polypeptide GalNAc transferase 1;
           Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 1;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 1; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 1
           soluble form
 gi|1141792|gb|AAC52511.1| polypeptide GalNAc transferase [Rattus norvegicus]
 gi|149017082|gb|EDL76133.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 [Rattus norvegicus]
 gi|1587757|prf||2207253A UDP-GalNAc polypeptide N-acetylgalactosaminyltransferase
          Length = 559

 Score =  183 bits (464), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|402902957|ref|XP_003914352.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Papio
           anubis]
          Length = 559

 Score =  183 bits (464), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|73961264|ref|XP_537284.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           1 [Canis lupus familiaris]
 gi|301764431|ref|XP_002917637.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Ailuropoda melanoleuca]
 gi|281348455|gb|EFB24039.1| hypothetical protein PANDA_005970 [Ailuropoda melanoleuca]
          Length = 559

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|13124891|ref|NP_065207.2| polypeptide N-acetylgalactosaminyltransferase 1 [Homo sapiens]
 gi|386780838|ref|NP_001247531.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
 gi|332225596|ref|XP_003261968.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           1 [Nomascus leucogenys]
 gi|332849764|ref|XP_001135802.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           1 [Pan troglodytes]
 gi|397520346|ref|XP_003830280.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Pan
           paniscus]
 gi|426385782|ref|XP_004059381.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Gorilla
           gorilla gorilla]
 gi|1709558|sp|Q10472.1|GALT1_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
           AltName: Full=Polypeptide GalNAc transferase 1;
           Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 1;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 1; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 1
           soluble form
 gi|971459|emb|CAA59380.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase [Homo
           sapiens]
 gi|119621764|gb|EAX01359.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 (GalNAc-T1), isoform
           CRA_a [Homo sapiens]
 gi|119621765|gb|EAX01360.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 (GalNAc-T1), isoform
           CRA_a [Homo sapiens]
 gi|261861328|dbj|BAI47186.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 [synthetic
           construct]
 gi|355701910|gb|EHH29263.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
 gi|355754989|gb|EHH58856.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Macaca
           fascicularis]
 gi|380784241|gb|AFE63996.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
 gi|383411871|gb|AFH29149.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
 gi|384942418|gb|AFI34814.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
 gi|410258728|gb|JAA17331.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
           troglodytes]
 gi|410292416|gb|JAA24808.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
           troglodytes]
 gi|410338657|gb|JAA38275.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
           troglodytes]
          Length = 559

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|326923136|ref|XP_003207797.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Meleagris gallopavo]
          Length = 556

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTRGWLEPLLARIREDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++G+YD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGSYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRV-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV++RK LR  L CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSARKALREALKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G V + 
Sbjct: 437 DTNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVTML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|444723970|gb|ELW64593.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Tupaia chinensis]
          Length = 591

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 244 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 296

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 297 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 356

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 357 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 409

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L CK F WYL                 E+ N 
Sbjct: 410 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 469

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 470 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 527

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 528 KCHHLKGNQLWEYD 541


>gi|1582794|prf||2119305A UDP-GalNAc/polypeptide N-acetylgalactosaminyltransferase
          Length = 559

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLD 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|348576706|ref|XP_003474127.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Cavia porcellus]
          Length = 559

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRIGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|327275061|ref|XP_003222292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Anolis carolinensis]
          Length = 559

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKADRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|431896245|gb|ELK05661.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Pteropus alecto]
          Length = 559

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 157/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD+ SR  LR  L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDIASRLGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|1136285|gb|AAC50327.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
           sapiens]
          Length = 559

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L CK F WYL                 E+  +
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRKE 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|335775065|gb|AEH58447.1| polypeptide N-acetylgalactosaminyltransferase 1-like protein [Equus
           caballus]
          Length = 453

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 106 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 158

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 159 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 218

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 219 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 271

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L C+ F WYL                 E+ N 
Sbjct: 272 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNV 331

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 332 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 389

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 390 KCHHLKGNQLWEYD 403


>gi|344269062|ref|XP_003406374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Loxodonta africana]
          Length = 559

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L C+ F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|440911421|gb|ELR61095.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Bos grunniens
           mutus]
          Length = 564

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 217 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 269

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 270 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 329

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 330 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 382

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L C+ F WYL                 E+ N 
Sbjct: 383 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNV 442

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 443 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 500

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 501 KCHHLKGNQLWEYD 514


>gi|426253597|ref|XP_004020479.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Ovis
           aries]
          Length = 559

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L C+ F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|405956426|gb|EKC23041.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
          Length = 203

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 87/152 (57%), Positives = 111/152 (73%), Gaps = 15/152 (9%)

Query: 150 MAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWY 209
           MAGGLFSI + +F +LGTYD G DIWGGENLELSF+GD+GDVT+RK+LR  L C SF W+
Sbjct: 1   MAGGLFSISREYFTELGTYDPGMDIWGGENLELSFRGDYGDVTNRKKLRERLQCYSFDWF 60

Query: 210 --------------LEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGE 255
                         +++ +    MCIDSA    + HK V ++PCH QGGNQ+WM+SK+GE
Sbjct: 61  VKNVYPDPFVPVEAIDLESKAKPMCIDSAVDNHNYHKLVNMWPCHNQGGNQYWMLSKNGE 120

Query: 256 IRRDEACLDYAGGD-VILYPCHGSKGNQYFEY 286
           IRRD+ CLDY+GG+ VI+YPCHG KGNQ ++Y
Sbjct: 121 IRRDDGCLDYSGGESVIVYPCHGQKGNQEWQY 152


>gi|410977586|ref|XP_003995186.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Felis
           catus]
          Length = 559

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L C+ F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|350586068|ref|XP_003482105.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Sus scrofa]
          Length = 559

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L C+ F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|29135331|ref|NP_803485.1| polypeptide N-acetylgalactosaminyltransferase 1 precursor [Bos
           taurus]
 gi|1171989|sp|Q07537.1|GALT1_BOVIN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
           AltName: Full=Polypeptide GalNAc transferase 1;
           Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 1;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 1; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 1
           soluble form
 gi|289412|gb|AAA30532.1| UDP-GalNAc:polypeptide, N-acetylgalactosaminyltransferase [Bos
           taurus]
 gi|296473855|tpg|DAA15970.1| TPA: polypeptide N-acetylgalactosaminyltransferase 1 [Bos taurus]
          Length = 559

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L C+ F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|304259|gb|AAA68489.1| UDP-GalNAc:polypeptide, N-acetylgalactosaminyltransferase, partial
           [Bos taurus]
          Length = 519

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 172 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 224

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 225 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 284

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 285 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 337

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L C+ F WYL                 E+ N 
Sbjct: 338 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNV 397

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 398 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 455

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 456 KCHHLKGNQLWEYD 469


>gi|149720888|ref|XP_001496819.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Equus caballus]
          Length = 559

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L C+ F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|13878612|sp|Q29121.1|GALT1_PIG RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
           AltName: Full=Polypeptide GalNAc transferase 1;
           Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 1;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 1; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 1
           soluble form
 gi|1339955|dbj|BAA12800.1| N-acetylgalactosaminyl transferase [Sus sp.]
          Length = 559

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L C+ F WYL                 E+ N 
Sbjct: 378 KTFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYSSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|126320794|ref|XP_001362869.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
           [Monodelphis domestica]
          Length = 559

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKVDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRHYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD+++R  LR  L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISTRVGLRHKLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|116284114|gb|AAH38440.1| GALNT1 protein [Homo sapiens]
          Length = 499

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 157/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 152 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 204

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID  +F+++GTYD+G DIWGGENLE+S
Sbjct: 205 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDIDYFQEIGTYDAGMDIWGGENLEIS 264

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 265 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 317

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L CK F WYL                 E+ N 
Sbjct: 318 KNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 377

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 378 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 435

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 436 KCHHLKGNQLWEYD 449


>gi|395510712|ref|XP_003759616.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
           [Sarcophilus harrisii]
          Length = 559

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKVDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRHYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD+++R  LR  L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISTRVGLRHKLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|149412842|ref|XP_001510290.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           1 [Ornithorhynchus anatinus]
          Length = 559

 Score =  181 bits (460), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKFDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|345308178|ref|XP_003428667.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           2 [Ornithorhynchus anatinus]
          Length = 558

 Score =  181 bits (460), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 158/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTVGWLEPLLARIKFDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 323

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 324 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L CK F WYL                 E+ N 
Sbjct: 377 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 437 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 495 KCHHLKGNQLWEYD 508


>gi|118093951|ref|XP_422165.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Gallus
           gallus]
          Length = 556

 Score =  181 bits (459), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 160/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTRGWLEPLLARIWEDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++G+YD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGSYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRV-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV++RK LR  L CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSARKALREALKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G V + 
Sbjct: 437 DTNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVTML 494

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 495 KCHHMRGNQLWEYD 508


>gi|147900163|ref|NP_001083410.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Xenopus
           laevis]
 gi|38014522|gb|AAH60419.1| MGC68664 protein [Xenopus laevis]
          Length = 559

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 115/314 (36%), Positives = 157/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARINHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R + +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRRGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD+ +R  LR  L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDIATRVGLRHKLQCKPFSWYLENVYPDSQIPRHYYSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+ 
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTASKEIRTDDLCLDVSKLNGPVIML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  +GNQ +EYD
Sbjct: 496 KCHHLRGNQLWEYD 509


>gi|291243604|ref|XP_002741691.1| PREDICTED: Polypeptide N-acetylgalactosaminyltransferase 1-like
           [Saccoglossus kowalevskii]
          Length = 565

 Score =  180 bits (456), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 114/316 (36%), Positives = 164/316 (51%), Gaps = 48/316 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PL+  +A + S VV P+I  I D+TFE         +      GGF+W L 
Sbjct: 218 CECTQGWLEPLMARIAEDRSRVVCPIIDVISDETFEFH-------AGSDMTYGGFNWKLN 270

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+++P+RE  R K +   P+ TPTMAGGLF+I K +FE++GTYD+G DIWGGENLE+S
Sbjct: 271 FRWYSVPKREMDRRKGDRTIPLNTPTMAGGLFAIHKDYFEEIGTYDAGMDIWGGENLEMS 330

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP +  GG  +I      +L       ++W  + 
Sbjct: 331 FRI-WMCGGTLEIVTCSHVGHVFRKTTPYSFPGGTGAIINKNNRRLA------EVWMDDY 383

Query: 180 LEL-------SFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
                     S K ++GDVT+RK+LR  L CKSFKWYL                 E+ N 
Sbjct: 384 KTFFYKISPGSKKSEYGDVTNRKQLRDKLQCKSFKWYLENIYPESQFMMDYNMIGEIRNM 443

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG----GDVI 271
            +  C+D+  +  +    VG+Y CH QGGNQ +  +K  E++ D+ CLD +      D++
Sbjct: 444 ETKQCLDNMGRKEN--NKVGIYACHGQGGNQIFAWTKKKELKHDDLCLDASRQSGFNDIM 501

Query: 272 LYPCHGSKGNQYFEYD 287
              CH   GNQ + ++
Sbjct: 502 QLRCHNQGGNQEWSFN 517


>gi|196000745|ref|XP_002110240.1| hypothetical protein TRIADDRAFT_22839 [Trichoplax adhaerens]
 gi|190586191|gb|EDV26244.1| hypothetical protein TRIADDRAFT_22839 [Trichoplax adhaerens]
          Length = 481

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 111/310 (35%), Positives = 153/310 (49%), Gaps = 39/310 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL+PLL  +  N S+VV P I  I  + F   +  G          G F+WNL 
Sbjct: 173 CEVVDGWLEPLLARIHENRSNVVCPEIDVISFENFGYSYASG--------IRGVFNWNLH 224

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E++R K+  +P+ +PTMAGGLF+I K +FE +G YD   DIWGGENLE+SF
Sbjct: 225 FRWRTLPAVEQQRRKSVIDPIRSPTMAGGLFAIHKKYFEDIGLYDDEMDIWGGENLEMSF 284

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +      N   IP          ++P   P  AG   + +     ++   D+  DI+   
Sbjct: 285 RIWQCGGNLEIIPCSHVGHVFRKSQPYTFPKGAGETLNKNLQRVAEVWM-DNYKDIFYNR 343

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVSNDWSGMC 220
              L  +  +GD++ R ELR+ L CKSF WYL                  E+ N  +G C
Sbjct: 344 FPNLR-QHSYGDISKRIELRKKLKCKSFDWYLKNVFTDVQYPDMIFLAKGELRNPSTGYC 402

Query: 221 IDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD----YAGGDVILYPCH 276
           +DS       +  +G+YPCH QGGNQ    +   E+  DE CLD       G V + PCH
Sbjct: 403 LDSMGNKE--YADIGIYPCHGQGGNQLLTYTIRKELEMDEVCLDALSRRVAGTVKMAPCH 460

Query: 277 GSKGNQYFEY 286
             KG Q +E+
Sbjct: 461 RKKGTQLWEH 470


>gi|47225457|emb|CAG11940.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 534

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 106/305 (34%), Positives = 150/305 (49%), Gaps = 77/305 (25%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  + ++   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 236 CECTTGWLEPLLARIKKDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 288

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 289 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 348

Query: 123 FKFN-WHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           F+   W  +                   +  G+  +                        
Sbjct: 349 FRLQMWFVV------------------CVCVGVTKV------------------------ 366

Query: 182 LSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSA 224
                D+GD++SR  LR+ L CK F WYL                 E+ N  +  C+D+ 
Sbjct: 367 -----DYGDISSRTTLRQKLQCKPFSWYLENIYPDSQIPRHYYSLGEIRNVETNQCLDNM 421

Query: 225 CKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPCHGSKGNQ 282
            +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V++  CH  KGNQ
Sbjct: 422 ARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVMMLKCHHLKGNQ 479

Query: 283 YFEYD 287
            +EYD
Sbjct: 480 LWEYD 484


>gi|158259585|dbj|BAF85751.1| unnamed protein product [Homo sapiens]
          Length = 559

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 157/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +   +D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V + 
Sbjct: 438 ETNQFLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTML 495

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ +EYD
Sbjct: 496 KCHHLKGNQLWEYD 509


>gi|405975554|gb|EKC40113.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Crassostrea gigas]
          Length = 624

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 114/317 (35%), Positives = 164/317 (51%), Gaps = 52/317 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  + ++ + VV P+I  I DD+FE       +T S     GGF+W L 
Sbjct: 271 CECTEGWLEPLLYEIHKDRTAVVCPIIDVIGDDSFEY------ITGS-DMTWGGFNWKLN 323

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  +R  + + P  TPTMAGGLFSID+ +F ++G+YD G DIWGGENLE+S
Sbjct: 324 FRWYPVPQRELDRRGGDRSNPTKTPTMAGGLFSIDRDYFYEVGSYDEGMDIWGGENLEMS 383

Query: 123 FKF------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
           F+        +     R     +  +   W     GG+  I     +++       ++W 
Sbjct: 384 FRVWMCGGKVYIVTCSRVGHVFRKTSPYSW----PGGVARIINHNTQRI------VEVWM 433

Query: 177 GENLELSFK-------GDFGDVTSRKELRRNLGCKSFKWYL-----------------EV 212
            E  +  +K         +GDV+ RK LR  L CKSFKWYL                 E+
Sbjct: 434 DEYKDFFYKINPGVRSTSYGDVSERKALREKLHCKSFKWYLQNVYPESQMPVEYHALGEI 493

Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDV 270
            N  +G CIDS  + +   + VG+  CH  GGNQ +  +K   ++ D+ CLD +   G V
Sbjct: 494 RNKATGQCIDSMGRKSG--EKVGMVQCHGMGGNQIFSYTKKQALQTDDVCLDVSSLHGPV 551

Query: 271 ILYPCHGSKGNQYFEYD 287
            L+ CHG  GNQ +EYD
Sbjct: 552 KLFQCHGLGGNQKWEYD 568


>gi|449667968|ref|XP_002168066.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Hydra magnipapillata]
          Length = 548

 Score =  178 bits (451), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 113/320 (35%), Positives = 168/320 (52%), Gaps = 51/320 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + W++PLL  +  +  +VV P+I  I  D  +L +    L    +  +GGF W+L 
Sbjct: 239 CEATEGWVEPLLFRIKEDKRNVVCPVIEVI--DAVDLSYKKTELDRITQ--VGGFTWDLF 294

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNW  I E E++   +  +P+ +PTMAGGLF+IDK++F ++G+YD+  +IWGGENLE+SF
Sbjct: 295 FNWKEITEDEKRLRADGTQPLKSPTMAGGLFAIDKSYFYEIGSYDNQMEIWGGENLEMSF 354

Query: 124 KF-----NWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           +          IP  R     +    P   P    G+       F +L       ++W  
Sbjct: 355 RIWMCGGKLEIIPCSRVGHIFRKENSPYSFPN---GVSKTLAKNFNRLA------EVWMD 405

Query: 178 ENLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYL------------------E 211
           E  EL ++          +GD++ R ELR+ LGCKSFKWY+                  E
Sbjct: 406 EYKELYYRRKPPEDKLVKYGDISERVELRKKLGCKSFKWYIDNVIPDMIGADPNPPAHGE 465

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGE-IRRDEACLDYA---- 266
           V N  S MC+DS     +  + + ++PCH+ GGNQF+++SK GE I  DE+CLDY+    
Sbjct: 466 VRNVASNMCLDSMGNKGNRAQ-IKVFPCHRLGGNQFFVLSKRGEIIHNDESCLDYSLENE 524

Query: 267 GGDVILYPCHGSKGNQYFEY 286
              V ++ CHG  GNQ + Y
Sbjct: 525 ENKVDMWNCHGLGGNQEWIY 544


>gi|449685123|ref|XP_002167708.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like,
           partial [Hydra magnipapillata]
          Length = 411

 Score =  177 bits (449), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 113/319 (35%), Positives = 167/319 (52%), Gaps = 48/319 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    W++PLL  +    S+VV P I +I  DT E R       S      GGF W+L 
Sbjct: 51  CETTPGWIEPLLARINEAKSNVVVPTIESIDADTLEYR------ASDNPEQRGGFSWDLM 104

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           ++W++IPE E+   ++ ++P+ TPTMAGGLF+IDK++F ++G+YD   DIWGGENLELSF
Sbjct: 105 YDWNSIPENEKHLRQSPSDPIRTPTMAGGLFAIDKSYFFEMGSYDQEMDIWGGENLELSF 164

Query: 124 KFNWHA---IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-TYDSGFDIWGGEN 179
           +  W     I      R  +    V +P      ++  K   E L    +   ++W  E 
Sbjct: 165 RI-WMCGGRIEILPCSRVGHIFRKVTSP------YTFPKGVTETLSKNLNRLAEVWMDEY 217

Query: 180 LELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL------------------EVSN 214
            E  ++        ++G++T R ELR+ L CKSFKWY+                  E+ N
Sbjct: 218 KEYYYRSRPLFRGKEYGNITQRLELRQKLQCKSFKWYMENIYSDMEIPDLYPPAEGEIRN 277

Query: 215 DWSGMCIDS-ACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR-RDEACLDYA----GG 268
             S +CIDS      ++   VGLYPCH +GG Q + +S  GEI  +D+ CLD A    G 
Sbjct: 278 GASNLCIDSMGVVKENVKHQVGLYPCHGEGGAQHFQLSLKGEIIFQDKFCLDVAVASPGA 337

Query: 269 DVILYPCHGSKGNQYFEYD 287
            +  + CH  +GNQ ++++
Sbjct: 338 FIEFFKCHKQRGNQLWQHN 356


>gi|328723396|ref|XP_001946856.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 1 [Acyrthosiphon pisum]
          Length = 615

 Score =  177 bits (448), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 120/318 (37%), Positives = 159/318 (50%), Gaps = 50/318 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  N   VV P+I  I DDTFE         ++     GGF+W L 
Sbjct: 266 CECADGWLEPLLARIVLNRKTVVCPVIDVISDDTFEY-------VTASDMTWGGFNWKLN 318

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  +R+++   P+ TPTMAGGLFSIDK +F +LG+YD G DIWGGENLE+S
Sbjct: 319 FRWYRVPQREMTRRNQDRTAPLRTPTMAGGLFSIDKDYFYQLGSYDEGMDIWGGENLEMS 378

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
           F+  W      E     H        TP T  GG   I      +L   +   D W    
Sbjct: 379 FRI-WMCGGTLEISPCSHVGHVFRKSTPYTFPGGTSHIVNHNNARLA--EVWMDEWKHFY 435

Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVS 213
                G  N+E+      GDV+ R  LR  L CKSF+WYL                 E+ 
Sbjct: 436 YAINPGASNVEV------GDVSERLALREKLKCKSFRWYLENIYPESQMPLDYYYLGEIK 489

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
           N  S  C+D+  + +   + VG+  CH  GGNQ +  +K  +I  D+ CLD +   G V 
Sbjct: 490 NVDSQQCLDTMSRKSG--EKVGMSYCHGLGGNQVFAYTKRSQIMSDDNCLDASNIVGPVS 547

Query: 272 LYPCHGSKGNQYFEYDYK 289
           L  CHG +GNQ + YD K
Sbjct: 548 LIRCHGLEGNQAWVYDSK 565


>gi|328723394|ref|XP_003247832.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 2 [Acyrthosiphon pisum]
          Length = 615

 Score =  176 bits (447), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 117/320 (36%), Positives = 156/320 (48%), Gaps = 54/320 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  N   VV P+I  I DDTFE         ++     GGF+W L 
Sbjct: 266 CECADGWLEPLLARIVLNRKTVVCPVIDVISDDTFEY-------VTASDMTWGGFNWKLN 318

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  +R+++   P+ TPTMAGGLFSIDK +F +LG+YD G DIWGGENLE+S
Sbjct: 319 FRWYRVPQREMTRRNQDRTAPLRTPTMAGGLFSIDKDYFYQLGSYDEGMDIWGGENLEMS 378

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIW-- 175
           F+          IP            P   P   GG+  I           +   D W  
Sbjct: 379 FRVWQCGGTLEIIPCSHVGHVFRDKSPYSFP---GGVSKI--VLHNAARVAEVWMDEWRD 433

Query: 176 -------GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------E 211
                  G  N+E+      GDV+ R  LR  L CKSF+WYL                 E
Sbjct: 434 FYYAMNPGASNVEV------GDVSERLALREKLKCKSFRWYLENIYPESQMPLDYYYLGE 487

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GD 269
           + N  S  C+D+  + +   + VG+  CH  GGNQ +  +K  +I  D+ CLD +   G 
Sbjct: 488 IKNVDSQQCLDTMSRKSG--EKVGMSYCHGLGGNQVFAYTKRSQIMSDDNCLDASNIVGP 545

Query: 270 VILYPCHGSKGNQYFEYDYK 289
           V L  CHG +GNQ + YD K
Sbjct: 546 VSLIRCHGLEGNQAWVYDSK 565


>gi|321456141|gb|EFX67256.1| hypothetical protein DAPPUDRAFT_218737 [Daphnia pulex]
          Length = 639

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 115/312 (36%), Positives = 164/312 (52%), Gaps = 38/312 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A N   VV P+I  I D++FE         ++     GGF+W L 
Sbjct: 286 CECTEGWLEPLLARVAENRKIVVCPIIDVISDESFEY-------VTASDMTWGGFNWKLN 338

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  +R+ +  +P+ TPTMAGGLFSIDK +FE++GTYD G DIWGGENLE+S
Sbjct: 339 FRWYRVPQREMDRRNGDRTQPLRTPTMAGGLFSIDKDYFEEIGTYDEGMDIWGGENLEMS 398

Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W    E E     H        +P +  GG+  I      ++   +   D W    
Sbjct: 399 FRV-WQCGGELEIIPCSHVGHVFRDKSPYSFPGGVAKIVNKNAARVA--EVWMDRWKDFF 455

Query: 180 LEL---SFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
            E+   +   + GDV+SR+ LR+ L CKSF+WYL                 E+ N  +  
Sbjct: 456 YEMNPGARSVEVGDVSSRRSLRKKLQCKSFRWYLENVYPESQMPLDYFFLGEIRNAETQT 515

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD--VILYPCHG 277
           C+D+  +    +  VG+  CH  GGNQ +  +K  +I  D+ CLD  G D  V L  CHG
Sbjct: 516 CLDTMGRKGGEN--VGISYCHGLGGNQVFAYTKRQQIMSDDNCLDATGTDGIVKLIRCHG 573

Query: 278 SKGNQYFEYDYK 289
             GNQ + Y+ +
Sbjct: 574 MGGNQAWLYEAQ 585


>gi|156373014|ref|XP_001629329.1| predicted protein [Nematostella vectensis]
 gi|156216327|gb|EDO37266.1| predicted protein [Nematostella vectensis]
          Length = 499

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 116/319 (36%), Positives = 159/319 (49%), Gaps = 44/319 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A +  +VV P+I  I  D F  +       S      GGF W+L 
Sbjct: 157 CEATPGWLEPLLVRIAEDRRNVVCPVIEVINADDFRYQ------ASDVIHERGGFTWDLF 210

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W AIPE E+KR K+  + + +PTMAGGLF+I K +F  LG+YDS  +IWGGENLE+SF
Sbjct: 211 FTWKAIPEAEKKRRKDETDYIRSPTMAGGLFAIHKKYFYDLGSYDSKMEIWGGENLEMSF 270

Query: 124 KF-----NWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY--DSGFDIW 175
           +          +P  R     +    P   P    G  +     F +L     D   D +
Sbjct: 271 RIWMCGGQLEIVPCSRVGHVFRKYTSPYKFPK---GTTTTLARNFNRLAEVWMDEYKDHY 327

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL--------------------EVSND 215
             +  E     D GD++ R  LR+ LGCKSFKWYL                    ++ N 
Sbjct: 328 YRKKTEEERNVDIGDISDRVALRKRLGCKSFKWYLDNIYPDMTNKLPPKSYLYSHQIRNK 387

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR-RDEACLDYAGGD----V 270
            S +C+D+  +     K VGLY CH  GGNQF+ ++K  EI   D+ CLD   GD    V
Sbjct: 388 ESSLCLDTLGEKN--IKRVGLYTCHGMGGNQFFTLTKSNEILFNDDKCLDSPNGDPGSYV 445

Query: 271 ILYPCHGSKGNQYFEYDYK 289
            +  CHG KGNQ ++++ +
Sbjct: 446 EMITCHGLKGNQEWKHNKR 464


>gi|358332241|dbj|GAA27774.2| polypeptide N-acetylgalactosaminyltransferase [Clonorchis sinensis]
          Length = 584

 Score =  174 bits (441), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 117/318 (36%), Positives = 160/318 (50%), Gaps = 56/318 (17%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           E  K WL+PLLD + +N S VVSP+I  I DDTF   + P  L+   +  +GGFDW++ +
Sbjct: 231 ECNKGWLEPLLDCIQKNQSTVVSPVIDRINDDTFA--YEPLLLS---QIQVGGFDWDMTY 285

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
           NWH  P+R+ +R      P+  PT+AGGLFS+ + FF  LG YD   D+WGGENLELSFK
Sbjct: 286 NWHVPPKRDLERPGAPFTPIRAPTIAGGLFSVHRDFFAYLGYYDPQMDVWGGENLELSFK 345

Query: 125 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS-------GFDIWGG 177
             W                 V   +  G +F     +  K  T D+         ++W  
Sbjct: 346 -TWMC----------GGTLQVHPCSHVGHVFRTKSPYSAKNNTGDTLRHNLVRLAEVWMD 394

Query: 178 ENL-----ELSFK-GDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSN 214
           E         SFK GD+GDV+ RK LR  L C+SFKWYL                 ++ +
Sbjct: 395 EYKGYFYERFSFKLGDYGDVSERKALRERLKCRSFKWYLNNVFPELFVPSNSLANGDIES 454

Query: 215 DWSGMCIDSACKPTDMHKP----VGLYPCHKQGGNQFWMMSKHGEIRRDEAC--LDYAGG 268
               +C+D++    D H+P    +  YPCH+ GGNQ W  +   EIRRD  C  +D A G
Sbjct: 455 FKMAICLDAS---ADDHQPELHLLRGYPCHRLGGNQLWYWTPDKEIRRDNRCWSVDEASG 511

Query: 269 DVILYPCHGSKGNQYFEY 286
            + +  C G+   Q F Y
Sbjct: 512 FIGMAKCGGTD-KQKFNY 528


>gi|195114266|ref|XP_002001688.1| GI16986 [Drosophila mojavensis]
 gi|193912263|gb|EDW11130.1| GI16986 [Drosophila mojavensis]
          Length = 633

 Score =  174 bits (440), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 118/310 (38%), Positives = 156/310 (50%), Gaps = 38/310 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  + +N   VV P+I  I DDTFE       +T+S   + GGF+W L 
Sbjct: 286 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDDTFEY------ITASDSTW-GGFNWKLN 338

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R  N    P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 339 FRWYRVPQREMARRNNDRTAPLRTPTMAGGLFSIDKEYFYEIGSYDEGMDIWGGENLEMS 398

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W    I E     H        +P T  GG+  I           +   D W    
Sbjct: 399 FRI-WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKI--VLHNAARVAEVWLDEWRDFY 455

Query: 180 LELSF---KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
             +S    K   GDV+ RK LR  L CKSF+WYL                 E+ N  +  
Sbjct: 456 YAMSTGARKASAGDVSDRKALRERLQCKSFRWYLENVYPESLMPLDYYYLGEIRNAETET 515

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
           C+D+  +    ++ VG   CH  GGNQ +  +K  +I  D+ CLD A   G V +  CH 
Sbjct: 516 CLDTMGR--KYNEKVGSSYCHGLGGNQVFAYTKRQQIMSDDLCLDAASSNGPVNMVRCHN 573

Query: 278 SKGNQYFEYD 287
             GNQ + YD
Sbjct: 574 MGGNQEWVYD 583


>gi|195035019|ref|XP_001989024.1| GH11491 [Drosophila grimshawi]
 gi|193905024|gb|EDW03891.1| GH11491 [Drosophila grimshawi]
          Length = 621

 Score =  173 bits (439), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 117/310 (37%), Positives = 157/310 (50%), Gaps = 38/310 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  + +N   VV P+I  I D+TFE       +T+S   + GGF+W L 
Sbjct: 274 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 326

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R  N    P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 327 FRWYRVPQREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 386

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W    I E     H        +P T  GG+  I           +   D W    
Sbjct: 387 FRI-WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKI--VLHNAARVAEVWLDEWRDFY 443

Query: 180 LELSF---KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
             +S    K   GDV+ RK LR  L CKSF+WYL                 E+ N  +  
Sbjct: 444 YAMSTGARKASAGDVSDRKSLRDRLQCKSFRWYLENVYPESLMPLDYYYLGEIRNSETET 503

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
           C+D+  +    ++ VG+  CH  GGNQ +  +K  +I  D+ CLD A   G V +  CH 
Sbjct: 504 CLDTMGRK--YNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDAASSNGPVNMVRCHN 561

Query: 278 SKGNQYFEYD 287
             GNQ + YD
Sbjct: 562 MGGNQEWVYD 571


>gi|157135226|ref|XP_001663438.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108870268|gb|EAT34493.1| AAEL013274-PA [Aedes aegypti]
          Length = 592

 Score =  173 bits (439), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 115/310 (37%), Positives = 164/310 (52%), Gaps = 38/310 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +  +   VV P+I  I D+TFE       +T+S + + GGF+W L 
Sbjct: 236 CECTEGWLEPLLARIVLDRKTVVCPIIDVISDETFEY------VTASDQTW-GGFNWKLN 288

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P RE ++R+ +   P+ TPTMAGGLFSID+ +F ++G+YD G DIWGGENLE+S
Sbjct: 289 FRWYRVPAREMQRRNHDRTAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMS 348

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W    I E     H        +P T  GG+ +I           +   D W    
Sbjct: 349 FRI-WQCGGILEIAPCSHVGHVFRDKSPYTFPGGVANI--VLKNAARVAEVWLDEWKEFY 405

Query: 180 LELS---FKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
            ++S    K   GDV+ RKELR  L CKSF+WYL                 E+ N  +G 
Sbjct: 406 YQMSPGARKASAGDVSERKELRERLKCKSFRWYLENIYPESQMPLDYYFLGEIRNVETGN 465

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
           C+D+  + ++  + +G   CH  GGNQ +  +K  ++  D+ CLD +   G V L  CHG
Sbjct: 466 CLDTMGRKSN--EKIGSSYCHGLGGNQVFAYTKRHQVMSDDNCLDASNALGPVNLVRCHG 523

Query: 278 SKGNQYFEYD 287
             GNQ + YD
Sbjct: 524 MGGNQEWVYD 533


>gi|405967230|gb|EKC32416.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
          Length = 347

 Score =  173 bits (438), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 99/231 (42%), Positives = 129/231 (55%), Gaps = 41/231 (17%)

Query: 86  TPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPV 145
           +PTMA GLFSI + +F KLGTYD G DIWGGENLELSF+  W      E          +
Sbjct: 77  SPTMARGLFSISREYFTKLGTYDPGMDIWGGENLELSFRV-WMCCGTLE----------I 125

Query: 146 WTPTMAGGLFSIDKAFFEKLGTYDSG------FDIWGGENLELSFK------GDFGDVTS 193
              +  G +F     F  + G            ++W  E     ++      GD+GDVT 
Sbjct: 126 IPCSHVGHIFRKRSLFKCRTGVNVVKKNSIRLAEVWMDEYKNYYYERFNYDLGDYGDVTD 185

Query: 194 RKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHKPVGL 236
           RK+LR  L C SF W++                 E+ +    MCIDSA    + HKPV +
Sbjct: 186 RKKLRERLQCHSFDWFVKNVYPDLFVPGEAIASGEIRSKAKPMCIDSAVDNHNYHKPVNM 245

Query: 237 YPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD-VILYPCHGSKGNQYFEY 286
           +PCH QGGNQ+WM+SK+GEIRRD+ CLDY+GG+ VI+YPCHG KGNQ ++Y
Sbjct: 246 WPCHNQGGNQYWMLSKNGEIRRDDGCLDYSGGESVIVYPCHGQKGNQEWQY 296


>gi|357624971|gb|EHJ75544.1| hypothetical protein KGM_17358 [Danaus plexippus]
          Length = 626

 Score =  173 bits (438), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 115/310 (37%), Positives = 157/310 (50%), Gaps = 34/310 (10%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +  + S VV P+I  I D TFE          +     GGF+W L 
Sbjct: 276 CECTEGWLEPLLSRIVEDRSTVVCPIIDVISDTTFEY-------IQASDMTWGGFNWKLN 328

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +PERE ++R  +   P+ TPTMAGGLF+ID+ +F K+G+YD G DIWGGENLE+S
Sbjct: 329 FRWYRVPEREMQRRGGDRTAPLRTPTMAGGLFAIDREYFYKIGSYDEGMDIWGGENLEMS 388

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W    + E     H        +P +  GG+ ++           +   D WG   
Sbjct: 389 FRV-WQCGGVLEIVPCSHVGHVFRDKSPYSFPGGVQAV--VLKNAARVAEVWMDEWGEFY 445

Query: 180 LEL---SFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCID------------SA 224
             +   +     GDV+ RK LR  L CKSF+WYLE     S M +D            S 
Sbjct: 446 YAMNPGALNVPVGDVSERKALRERLKCKSFRWYLENIYPESQMPLDYYYLGEIRNAETSN 505

Query: 225 CKPT---DMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPCHGSK 279
           C  T      +P+G+  CH  GGNQ +  +K  +I  D+ CLD A   G + L  CHG +
Sbjct: 506 CLDTLGGKAGQPLGMGYCHGMGGNQVFAYTKRKQIMSDDNCLDAAHPRGPIKLIRCHGMR 565

Query: 280 GNQYFEYDYK 289
           GNQ + YD K
Sbjct: 566 GNQEWTYDTK 575


>gi|157113705|ref|XP_001652065.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108877647|gb|EAT41872.1| AAEL006558-PA [Aedes aegypti]
          Length = 368

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 166/314 (52%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +  +   VV P+I  I D+TFE       +T+S + + GGF+W L 
Sbjct: 12  CECTEGWLEPLLARIVLDRKTVVCPIIDVISDETFEY------VTASDQTW-GGFNWKLN 64

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P RE ++R+ +   P+ TPTMAGGLFSID+ +F ++G+YD G DIWGGENLE+S
Sbjct: 65  FRWYRVPAREMQRRNHDRTAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMS 124

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W    I E     H        +P T  GG+ +I      ++       ++W  E 
Sbjct: 125 FRI-WQCGGILEIAPCSHVGHVFRDKSPYTFPGGVANIVLKNAARVA------EVWLDEW 177

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            E  +       K   GDV+ RKELR  L CKSF+WYL                 E+ N 
Sbjct: 178 KEFYYQMSPGARKASAGDVSERKELRERLKCKSFRWYLENIYPESQMPLDYYFLGEIRNV 237

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY--AGGDVILY 273
            +G C+D+  + ++  + +G   CH  GGNQ +  +K  ++  D+ CLD   A G V L 
Sbjct: 238 ETGNCLDTMGRKSN--EKIGSSYCHGLGGNQVFAYTKRHQVMSDDNCLDASNALGPVNLV 295

Query: 274 PCHGSKGNQYFEYD 287
            CHG  GNQ + YD
Sbjct: 296 RCHGMGGNQEWVYD 309


>gi|125985507|ref|XP_001356517.1| GA16368 [Drosophila pseudoobscura pseudoobscura]
 gi|54644841|gb|EAL33581.1| GA16368 [Drosophila pseudoobscura pseudoobscura]
          Length = 630

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 117/310 (37%), Positives = 157/310 (50%), Gaps = 38/310 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  + +N   VV P+I  I D+TFE       +T+S   + GGF+W L 
Sbjct: 283 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 335

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P RE  R  N    P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 336 FRWYRVPSREMSRRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 395

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W    I E     H        +P T  GG+  I           +   D W    
Sbjct: 396 FRI-WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKI--VLHNAARVAEVWLDEWRDFY 452

Query: 180 LELSF---KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
             +S    K   GDV+ RK+LR  L CKSF+WYL                 E+ N  +  
Sbjct: 453 YAMSTGARKASAGDVSDRKDLRDRLKCKSFRWYLENVYPESLMPLDYYYLGEIRNAETET 512

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
           C+D+  +    ++ VG+  CH  GGNQ +  +K  +I  D+ CLD A   G V +  CH 
Sbjct: 513 CLDTMGRK--YNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDAASSNGPVNMVRCHN 570

Query: 278 SKGNQYFEYD 287
             GNQ + YD
Sbjct: 571 MGGNQEWVYD 580


>gi|195147490|ref|XP_002014712.1| GL18803 [Drosophila persimilis]
 gi|194106665|gb|EDW28708.1| GL18803 [Drosophila persimilis]
          Length = 630

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 117/310 (37%), Positives = 157/310 (50%), Gaps = 38/310 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  + +N   VV P+I  I D+TFE       +T+S   + GGF+W L 
Sbjct: 283 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 335

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P RE  R  N    P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 336 FRWYRVPSREMSRRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 395

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W    I E     H        +P T  GG+  I           +   D W    
Sbjct: 396 FRI-WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKI--VLHNAARVAEVWLDEWRDFY 452

Query: 180 LELSF---KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
             +S    K   GDV+ RK+LR  L CKSF+WYL                 E+ N  +  
Sbjct: 453 YAMSTGARKASAGDVSDRKDLRDRLKCKSFRWYLENVYPESLMPLDYYYLGEIRNAETET 512

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
           C+D+  +    ++ VG+  CH  GGNQ +  +K  +I  D+ CLD A   G V +  CH 
Sbjct: 513 CLDTMGRK--YNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDAASSNGPVNMVRCHN 570

Query: 278 SKGNQYFEYD 287
             GNQ + YD
Sbjct: 571 MGGNQEWVYD 580


>gi|161077160|ref|NP_001097343.1| CG30463, isoform E [Drosophila melanogaster]
 gi|157400368|gb|ABV53824.1| CG30463, isoform E [Drosophila melanogaster]
          Length = 264

 Score =  172 bits (436), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 105/239 (43%), Positives = 133/239 (55%), Gaps = 65/239 (27%)

Query: 89  MAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTP 148
           MAGGLFSID+ FF++LGTYDSGFDIWGGENLELSFK                     W  
Sbjct: 1   MAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFK--------------------TW-- 38

Query: 149 TMAGGLFSIDKA-----FFEKLGTYD--SGFDIWGGENLELSF----------------- 184
            M GG   I         F K   Y   SG ++    ++ L+                  
Sbjct: 39  -MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLKKNSVRLAEVWMDEYSQYYYHRIGND 97

Query: 185 KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKP 227
           KGD+GDV+ R++LR +L CKSFKWYL                 E++N  +GMC+D A + 
Sbjct: 98  KGDWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFIPGDSVAHGEIANVPNGMCLD-AKEK 156

Query: 228 TDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
           ++   PV +Y CH QGGNQ+WM+SK GEIRRD++CLDYAG DV L+ CHG KGNQ++ Y
Sbjct: 157 SEEETPVSIYECHGQGGNQYWMLSKAGEIRRDDSCLDYAGKDVTLFGCHGGKGNQFWTY 215


>gi|242001786|ref|XP_002435536.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
 gi|215498872|gb|EEC08366.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
          Length = 460

 Score =  172 bits (436), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 118/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A + + VV P+I  I D+TFE         S+     GGF+W L 
Sbjct: 113 CECTQNWLEPLLARIAEDRTRVVCPVIDVISDETFEY-------ISASDLTWGGFNWKLN 165

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  +R  +   PV TPTMAGGLF+IDK +F +LG YD G DIWGGENLELS
Sbjct: 166 FRWYRVPQRELDRRGGDRTLPVRTPTMAGGLFAIDKDYFVELGKYDEGMDIWGGENLELS 225

Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W    E E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 226 FRI-WMCGGELEIVPCSHVGHVFRKSTPYTFPGGTSKIVNHNNARLA------EVWLDEW 278

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            E  F         D GD++ R+ LR+ L C SF+WYL                 E+ + 
Sbjct: 279 KEFYFAINPAAKNVDKGDLSHRRNLRKKLKCNSFRWYLENIYPESHMPLDYYHLGEIKHA 338

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILY 273
            S +C+D+  + +  +  V +  CH QGGNQ +  +K  +I  D+ CLD +   G V L 
Sbjct: 339 DSPVCLDTFGRKSGEN--VAVSTCHGQGGNQVFAYTKRQQIMSDDNCLDASSPRGPVKLL 396

Query: 274 PCHGSKGNQYFEYD 287
            CHG  GNQ + YD
Sbjct: 397 RCHGMGGNQLWIYD 410


>gi|405966386|gb|EKC31679.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
          Length = 206

 Score =  172 bits (436), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 84/155 (54%), Positives = 106/155 (68%), Gaps = 18/155 (11%)

Query: 150 MAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWY 209
           MAGGLFSI + +F + GTYD G DIWGGE LELSF+ D+G VT RK+L   L C SF W+
Sbjct: 1   MAGGLFSISREYFTEPGTYDPGMDIWGGEKLELSFRVDYGVVTDRKKLLERLQCHSFDWF 60

Query: 210 L-----------------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK 252
           +                 E+ +    MCIDSA    + HKPV ++PCH QGGNQ+WM+SK
Sbjct: 61  VKNVYPDLFVPGEAIASGEIRSKAKPMCIDSAVDNHNYHKPVNMWPCHNQGGNQYWMLSK 120

Query: 253 HGEIRRDEACLDYAGGD-VILYPCHGSKGNQYFEY 286
           +GEIRRD+ CLDY+GG+ VI+YPCHG KGNQ ++Y
Sbjct: 121 NGEIRRDDGCLDYSGGESVIVYPCHGQKGNQEWQY 155


>gi|196001819|ref|XP_002110777.1| hypothetical protein TRIADDRAFT_22201 [Trichoplax adhaerens]
 gi|190586728|gb|EDV26781.1| hypothetical protein TRIADDRAFT_22201 [Trichoplax adhaerens]
          Length = 518

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 111/307 (36%), Positives = 150/307 (48%), Gaps = 40/307 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  + +N + VV P I  I D+TFE  +  G +        G F+WNL 
Sbjct: 168 CEANVGWLEPLLYRIMQNRTIVVCPEIDVISDETFEYTYSSGNVR-------GSFNWNLN 220

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W A+PE E KR     + + +PTMAGGLF+I   +F+ +G YD   +IWGGENLELSF
Sbjct: 221 FRWKAVPEYENKRRAARTDGIRSPTMAGGLFTIHSQYFKDIGLYDKQMEIWGGENLELSF 280

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +          IP          ++P   P   G   S  K        +  G+  +  +
Sbjct: 281 RIWQCGGQLEIIPCSHVGHVFRKSQPYSFPKGTGETLS--KNLQRVAEVWMDGYKRYFYK 338

Query: 179 NLELSFKGD-FGDVTSRKELRRNLGCKSFKWYL------------------EVSNDWSGM 219
             +   KG  FGD++ R ELR+ L CK+F WY+                  E+ N  SG 
Sbjct: 339 R-QPHLKGHPFGDISKRLELRKKLKCKNFDWYIKNVVPEIFLPNSSIIARGELRNPASGD 397

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD----VILYPC 275
           CIDS       H  +G+Y CHKQ GNQ+ + +K+ EI  D+ C DYA       V +  C
Sbjct: 398 CIDSLG--AGEHAYIGIYKCHKQMGNQYLVYTKNEEIIVDDNCFDYANSQPSSKVKMLDC 455

Query: 276 HGSKGNQ 282
           H  KGNQ
Sbjct: 456 HSMKGNQ 462


>gi|194856530|ref|XP_001968770.1| GG24317 [Drosophila erecta]
 gi|190660637|gb|EDV57829.1| GG24317 [Drosophila erecta]
          Length = 630

 Score =  171 bits (434), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 117/310 (37%), Positives = 156/310 (50%), Gaps = 38/310 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  + +N   VV P+I  I D+TFE       +T+S   + GGF+W L 
Sbjct: 283 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 335

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P RE  R  N    P+ TPTMAGGLFSIDK +F +LG+YD G DIWGGENLE+S
Sbjct: 336 FRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYELGSYDEGMDIWGGENLEMS 395

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W    I E     H        +P T  GG+  I           +   D W    
Sbjct: 396 FRI-WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKI--VLHNAARVAEVWLDEWRDFY 452

Query: 180 LELSF---KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
             +S    K   GDV+ RK LR  L CKSF+WYL                 E+ N  +  
Sbjct: 453 YSMSTGARKASAGDVSDRKALRDRLKCKSFRWYLENVYPESLMPLDYYYLGEIRNAETET 512

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
           C+D+  +    ++ VG+  CH  GGNQ +  +K  +I  D+ CLD +   G V +  CH 
Sbjct: 513 CLDTMGRK--YNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDASSSNGPVNMVRCHN 570

Query: 278 SKGNQYFEYD 287
             GNQ + YD
Sbjct: 571 MGGNQEWVYD 580


>gi|91088223|ref|XP_973543.1| PREDICTED: similar to polypeptide GalNAc transferase 5 CG31651-PA
           [Tribolium castaneum]
 gi|270011823|gb|EFA08271.1| hypothetical protein TcasGA2_TC005902 [Tribolium castaneum]
          Length = 602

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 112/316 (35%), Positives = 157/316 (49%), Gaps = 50/316 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  + ++   VV P+I  I D+TFE         ++     GGF+W L 
Sbjct: 249 CECTEGWLEPLLARIVQDRKTVVCPIIDVISDETFEY-------ITASDMTWGGFNWKLN 301

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE +R  N    P+ TPTMAGGLFSIDK +F +LG+YD G DIWGGENLE+S
Sbjct: 302 FRWYRVPQREMERRNNDRTAPLRTPTMAGGLFSIDKEYFYELGSYDEGMDIWGGENLEMS 361

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+          IP            P    T  GG+  I       L       ++W  
Sbjct: 362 FRVWQCGGKLEIIPCSHVGHVFRDKSPY---TFPGGVSKI------VLHNAARVAEVWMD 412

Query: 178 ENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
           E  +  +  +        GDV++R+ELR  L CKSF+WYLE                 + 
Sbjct: 413 EWRDFYYAMNPGARSVPVGDVSARRELRERLKCKSFRWYLENVYPESQMPLEYYYLGDIR 472

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
           N  +  C+D+  + +  +  +G+  CH  GGNQ +  +K  +I  D+ CLD +   G V 
Sbjct: 473 NVETKNCLDTMGRKSGEN--LGMTYCHNLGGNQVFAYTKRQQIMSDDNCLDASNKKGPVK 530

Query: 272 LYPCHGSKGNQYFEYD 287
           L  CHG  GNQ + YD
Sbjct: 531 LVRCHGMGGNQAWAYD 546


>gi|47226346|emb|CAG09314.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 632

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 119/338 (35%), Positives = 161/338 (47%), Gaps = 65/338 (19%)

Query: 4   CEVQKRWLQPLLD----------------VLARNS-------SHVVSPLIANICDDTFEL 40
           CE    WL+PLL                 V  R S       + VV P+I  I D+TFE 
Sbjct: 211 CECTVGWLEPLLARIKEDRWDCNTALCVCVFERPSFRCFLFRTAVVCPIIDVISDETFEY 270

Query: 41  RFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKA 99
                   +      GGF+W L F W+ +P+RE  R K +   PV TPTMAGGLFSIDK 
Sbjct: 271 -------MAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKT 323

Query: 100 FFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNA------AEPVWTPTMAGG 153
           +FE++G+YD G DIWGGENLE+SF+  W      E     +       A P   P   G 
Sbjct: 324 YFEEIGSYDPGMDIWGGENLEMSFRI-WQCGGSLEIVTCSHVGHVFRKATPYSFPGGTGQ 382

Query: 154 LFSIDKAFFEK--LGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL- 210
           + + +     +  +  +   F I     + +    D+GDV+SRK LR  L CK F WYL 
Sbjct: 383 VINKNNRRLAEVWMDDFKDFFYIISPGVMRV----DYGDVSSRKGLRDALHCKPFSWYLE 438

Query: 211 ----------------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHG 254
                           E+ N  +  C+D+  +  +  + VG + CH  GGNQ +  +   
Sbjct: 439 NIYPDSQIPRRYYSLGEIRNVETNQCVDNMGRKEN--EKVGFFNCHGMGGNQVFSYTADK 496

Query: 255 EIRRDEACLDYA--GGDVILYPCHGSKGNQYFEYDYKY 290
           EIR D+ CLD +   G V++  CH  KGNQ FEYD +Y
Sbjct: 497 EIRTDDLCLDVSRLNGPVLMLKCHHMKGNQMFEYDAEY 534


>gi|195386582|ref|XP_002051983.1| GJ24116 [Drosophila virilis]
 gi|194148440|gb|EDW64138.1| GJ24116 [Drosophila virilis]
          Length = 632

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 117/310 (37%), Positives = 156/310 (50%), Gaps = 38/310 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  + +N   VV P+I  I D+TFE       +T+S   + GGF+W L 
Sbjct: 285 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 337

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R  N    P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 338 FRWYRVPQREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 397

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W    I E     H        +P T  GG+  I           +   D W    
Sbjct: 398 FRI-WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKI--VLHNAARVAEVWLDEWRDFY 454

Query: 180 LELSF---KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
             +S    K   GDV+ RK LR  L CKSF+WYL                 E+ N  +  
Sbjct: 455 YAMSTGARKASAGDVSDRKALRDRLQCKSFRWYLENVYPESLMPLDYYYLGEIRNAETET 514

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPCHG 277
           C+D+  +    ++ VG   CH  GGNQ +  +K  +I  D+ CLD A   G V +  CH 
Sbjct: 515 CLDTMGRK--YNEKVGSSYCHGLGGNQVFAYTKRQQIMSDDLCLDAASSSGPVNMVRCHN 572

Query: 278 SKGNQYFEYD 287
             GNQ + YD
Sbjct: 573 MGGNQEWVYD 582


>gi|308481980|ref|XP_003103194.1| CRE-GLY-3 protein [Caenorhabditis remanei]
 gi|308260299|gb|EFP04252.1| CRE-GLY-3 protein [Caenorhabditis remanei]
          Length = 615

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 125/335 (37%), Positives = 165/335 (49%), Gaps = 83/335 (24%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV   WL+PL+  +A +   VV+P+I  I DDTFE       +T+S   + GGF+W+L 
Sbjct: 268 VEVTDGWLEPLVTRVAEDRKRVVAPIIDVISDDTFEY------VTASETTW-GGFNWHLN 320

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+A+P+RE  +R  + + P+ TPT+AGGLF+IDK FF  +G+YD G  +WGGENLE+S
Sbjct: 321 FRWYAVPKRELNRRGADRSMPIQTPTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEIS 380

Query: 123 FKFNWHAIPERE-----------RKR-------------HKNAAEP--VWTPTMAGGLFS 156
           F+  W      E           RK+             H NAA    VW          
Sbjct: 381 FRV-WMCGGSLEIHPCSRVGHVFRKQTPYTFPGGTAKVIHHNAARTAEVWMDEY------ 433

Query: 157 IDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE----- 211
             KAFF K+        +    N+E       GDVT RK+LR  L CKSFKWYLE     
Sbjct: 434 --KAFFYKM--------VPAARNVEA------GDVTERKKLRETLQCKSFKWYLENIYPE 477

Query: 212 ------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD 259
                       + N ++  CID+  K  D   P GL  CH  GGNQ W ++  GEIR D
Sbjct: 478 APLPADFKSLGAIVNRFTEKCIDTNGK-KDGQSP-GLQGCHGSGGNQAWSLTGKGEIRSD 535

Query: 260 EACLDYA-----GGDVILYPCHGSKGN--QYFEYD 287
           + CL        G ++ L  C  SK N    FE+D
Sbjct: 536 DLCLSSGHVYQIGSELKLERCSVSKINIKHVFEFD 570


>gi|170043866|ref|XP_001849590.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
 gi|167867153|gb|EDS30536.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
          Length = 600

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 114/310 (36%), Positives = 163/310 (52%), Gaps = 38/310 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +  +   VV P+I  I D+TFE       +T+S + + GGF+W L 
Sbjct: 241 CECTEGWLEPLLARIVLDRKTVVCPIIDVISDETFEY------VTASDQTW-GGFNWKLN 293

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P RE ++R+ +   P+ TPTMAGGLFSID+ +F ++G+YD G DIWGGENLE+S
Sbjct: 294 FRWYRVPSREMQRRNHDRTAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMS 353

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W    I E     H        +P T  GG+ +I           +   D W    
Sbjct: 354 FRI-WQCGGILEIAPCSHVGHVFRDKSPYTFPGGVANI--VLKNAARVAEVWLDEWKEFY 410

Query: 180 LELS---FKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
            ++S    K   GDV+ R+ LR  L CKSF+WYL                 E+ N+ S  
Sbjct: 411 YQMSPGARKASAGDVSERRALREKLKCKSFRWYLENIYPESQMPLDYYFLGEIRNEESQN 470

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
           C+D+  + ++  + +G   CH  GGNQ +  +K  +I  D+ CLD +   G V L  CHG
Sbjct: 471 CLDTMGRKSN--EKIGSSYCHGLGGNQVFAYTKRHQIMSDDNCLDASNALGPVNLVRCHG 528

Query: 278 SKGNQYFEYD 287
             GNQ + YD
Sbjct: 529 MGGNQEWVYD 538


>gi|34042969|gb|AAQ56702.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           [Drosophila melanogaster]
          Length = 617

 Score =  171 bits (432), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 116/310 (37%), Positives = 156/310 (50%), Gaps = 38/310 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  + +N   VV P+I  I D+TFE       +T+S   + GGF+W L 
Sbjct: 270 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 322

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P RE  R  N    P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 323 FRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 382

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W    I E     H        +P T  GG+  I           +   D W    
Sbjct: 383 FRI-WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKI--VLHNAARVAEVWLDEWRDFY 439

Query: 180 LELSF---KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
             +S    K   GDV+ RK LR  L CKSF+WYL                 E+ N  +  
Sbjct: 440 YSMSTGARKASAGDVSDRKALRDRLKCKSFRWYLENVYPESLMPLDYYYLGEIRNAETET 499

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
           C+D+  +    ++ VG+  CH  GGNQ +  +K  +I  D+ CLD +   G V +  CH 
Sbjct: 500 CLDTMGRK--YNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDASSSNGPVNMVRCHN 557

Query: 278 SKGNQYFEYD 287
             GNQ + YD
Sbjct: 558 MGGNQEWVYD 567


>gi|350402581|ref|XP_003486533.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 3 [Bombus impatiens]
          Length = 607

 Score =  171 bits (432), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 116/309 (37%), Positives = 157/309 (50%), Gaps = 35/309 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A + + VV P+I  I DDTFE   P   +T       GGF+W L 
Sbjct: 258 CECTEGWLEPLLSRIAEDRTTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 310

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ + +RE  +R  +   P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 311 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMS 370

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTY--DSGFDIWGG 177
           F+  W      E     H        +P T  GG+  +      ++     D   D +  
Sbjct: 371 FRV-WQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKVVLHNAARVAEVWMDEWRDFYYA 429

Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSNDWSGMC 220
            N E +     GDV+ R +LR  L CKSF+WYLE                 V N  +  C
Sbjct: 430 MNPEGARNVAVGDVSERIKLRERLKCKSFRWYLENIYPESPMPLDYYYLGDVQNVETQSC 489

Query: 221 IDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHGS 278
           +D+  + T   + VG+  CH  GGNQ +  +K  +I  D+ CLD A   G V +  CHG 
Sbjct: 490 LDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVKIVRCHGM 547

Query: 279 KGNQYFEYD 287
            GNQ + Y+
Sbjct: 548 GGNQAWVYN 556


>gi|24581865|ref|NP_608906.2| polypeptide GalNAc transferase 5, isoform A [Drosophila
           melanogaster]
 gi|195342664|ref|XP_002037920.1| GM18035 [Drosophila sechellia]
 gi|51315874|sp|Q6WV17.2|GALT5_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
           Short=pp-GaNTase 5; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 5; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 5
 gi|22945641|gb|AAF52218.2| polypeptide GalNAc transferase 5, isoform A [Drosophila
           melanogaster]
 gi|194132770|gb|EDW54338.1| GM18035 [Drosophila sechellia]
          Length = 630

 Score =  171 bits (432), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 116/310 (37%), Positives = 156/310 (50%), Gaps = 38/310 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  + +N   VV P+I  I D+TFE       +T+S   + GGF+W L 
Sbjct: 283 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 335

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P RE  R  N    P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 336 FRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 395

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W    I E     H        +P T  GG+  I           +   D W    
Sbjct: 396 FRI-WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKI--VLHNAARVAEVWLDEWRDFY 452

Query: 180 LELSF---KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
             +S    K   GDV+ RK LR  L CKSF+WYL                 E+ N  +  
Sbjct: 453 YSMSTGARKASAGDVSDRKALRDRLKCKSFRWYLENVYPESLMPLDYYYLGEIRNAETET 512

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
           C+D+  +    ++ VG+  CH  GGNQ +  +K  +I  D+ CLD +   G V +  CH 
Sbjct: 513 CLDTMGRK--YNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDASSSNGPVNMVRCHN 570

Query: 278 SKGNQYFEYD 287
             GNQ + YD
Sbjct: 571 MGGNQEWVYD 580


>gi|16648224|gb|AAL25377.1| GH23657p [Drosophila melanogaster]
          Length = 536

 Score =  171 bits (432), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 116/310 (37%), Positives = 156/310 (50%), Gaps = 38/310 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  + +N   VV P+I  I D+TFE       +T+S   + GGF+W L 
Sbjct: 189 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 241

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P RE  R  N    P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 242 FRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 301

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W    I E     H        +P T  GG+  I           +   D W    
Sbjct: 302 FRI-WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKI--VLHNAARVAEVWLDEWRDFY 358

Query: 180 LELSF---KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
             +S    K   GDV+ RK LR  L CKSF+WYL                 E+ N  +  
Sbjct: 359 YSMSTGARKASAGDVSDRKALRDRLKCKSFRWYLENVYPESLMPLDYYYLGEIRNAETET 418

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
           C+D+  +    ++ VG+  CH  GGNQ +  +K  +I  D+ CLD +   G V +  CH 
Sbjct: 419 CLDTMGR--KYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDASSSNGPVNMVRCHN 476

Query: 278 SKGNQYFEYD 287
             GNQ + YD
Sbjct: 477 MGGNQEWVYD 486


>gi|242011902|ref|XP_002426682.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
 gi|212510853|gb|EEB13944.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
          Length = 605

 Score =  171 bits (432), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 114/315 (36%), Positives = 159/315 (50%), Gaps = 50/315 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +  +   VV P+I  I D+TFE       +T+S   + GGF+W L 
Sbjct: 254 CECTEGWLEPLLARITEDRKTVVCPIIDVISDETFEY------ITASDTTW-GGFNWRLN 306

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R  N    P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 307 FRWYRVPKREMDRRNNDKTVPIRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 366

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+          +P            P    T  GG+  I       L   +   ++W  
Sbjct: 367 FRVWQCGGTLEIVPCSHVGHVFRDKSPY---TFPGGVSQI------VLHNANRVAEVWMD 417

Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVS 213
           E  +  +       K + GD+TSR +LR +L CKSF+WYL                 ++ 
Sbjct: 418 EWRDFYYAMNPGAKKIEVGDITSRLKLREDLKCKSFRWYLTNIYPESTMPLDYYFLGDIK 477

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
           N  +  C+D+  + +  +  VG+  CH  GGNQ +  +K  +I  D+ CLD A   G V 
Sbjct: 478 NVETEQCLDTMGRKSGEN--VGMSYCHGYGGNQVFSYTKRHQITADDNCLDAASVRGPVK 535

Query: 272 LYPCHGSKGNQYFEY 286
           L  CHG  GNQ ++Y
Sbjct: 536 LVRCHGMGGNQEWKY 550


>gi|307204529|gb|EFN83209.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Harpegnathos
           saltator]
          Length = 605

 Score =  170 bits (431), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 118/316 (37%), Positives = 156/316 (49%), Gaps = 50/316 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A +   VV P+I  I DDTFE   P   +T       GGF+W L 
Sbjct: 257 CECTEGWLEPLLSRIANDRHTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 309

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ + +RE  +R+ +   P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 310 FRWYRVAQREMDRRNSDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 369

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
           F+  W      E     H        +P T  GG+  I           +   D W    
Sbjct: 370 FRV-WQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKI--VLHNAARVAEVWMDEWRDFY 426

Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
                G  N+      D GDV+ R +LR  L CKSF+WYLE                 V 
Sbjct: 427 YAMNPGARNV------DVGDVSERVKLRERLKCKSFRWYLENIYPESPMPLDYYYLGDVK 480

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
           N  +  C+D+  + T   + VG+  CH  GGNQ +  +K  +I  D+ CLD A   G V 
Sbjct: 481 NVEAQTCLDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVK 538

Query: 272 LYPCHGSKGNQYFEYD 287
           +  CHG  GNQ + Y+
Sbjct: 539 IVRCHGMGGNQAWVYN 554


>gi|443703000|gb|ELU00789.1| hypothetical protein CAPTEDRAFT_190622 [Capitella teleta]
          Length = 507

 Score =  170 bits (431), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 112/318 (35%), Positives = 166/318 (52%), Gaps = 41/318 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE   +WL+PL+  +  + S ++ P+I  I  D   + +      S     +GGF W+L 
Sbjct: 154 CECNVQWLEPLVARIKESRSALLCPMIDVI--DAKAMSYNGIGAGS-----VGGFWWSLH 206

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+W  +P+RERKR K++ E + +PTMAGGLF+ D+ +F ++G YD G D+WGGENLE+SF
Sbjct: 207 FSWRPLPQRERKRRKSSVETIRSPTMAGGLFAADRKYFFEIGGYDPGMDVWGGENLEISF 266

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSGFDIWGG 177
           +          +P         ++ P   P          K   E  +  Y   F     
Sbjct: 267 RVWMCGGTLEFVPCSRVGHIFRSSHPYTFPGNKDTHGLNSKRLAEVWMDGYKRLFYHHRR 326

Query: 178 ENLELS--FKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWS 217
           + L ++  F  D GD + R +LRR+L CKSFKWYLE                  V N  S
Sbjct: 327 DLLVINPQFNADAGDFSDRLQLRRDLKCKSFKWYLENVYPEKFIPDENVIAYGMVRNPSS 386

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGG---NQFWMMSKHGEIRRDEACLDYAGGD---VI 271
            +C+D+  K   M   +GLY C  QGG   NQ + +S+  E+RR+E+C+D  GG+   V 
Sbjct: 387 NLCLDTLSKDEKMVFNLGLYGC--QGGVSSNQLFSLSQSNELRREESCMDSVGGEGSPVK 444

Query: 272 LYPCHGSKGNQYFEYDYK 289
           L PCHGS+G+Q + Y+ +
Sbjct: 445 LMPCHGSRGHQEWTYNLE 462


>gi|56756104|gb|AAW26230.1| SJCHGC09400 protein [Schistosoma japonicum]
          Length = 737

 Score =  170 bits (430), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 119/315 (37%), Positives = 154/315 (48%), Gaps = 56/315 (17%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           WL+PLLD +A NSS VV P+I  I D T +   P     S  +  IGGFDW+L F WH  
Sbjct: 346 WLEPLLDRIAYNSSIVVVPVITVINDKTLKYDLP-----SPSRVQIGGFDWSLSFIWHEQ 400

Query: 70  PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA 129
            ER + R      PV +PTMAGGLF+I + +F  LG YD G ++WGGENLELSFK  W  
Sbjct: 401 TERHKNRPGAPYSPVQSPTMAGGLFAISREYFNHLGMYDPGMEVWGGENLELSFKI-WMC 459

Query: 130 IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD-------SGFDIWGGE---- 178
                       +  +   +  G +F     +   +   D          D+W  +    
Sbjct: 460 ----------GGSLEIVICSQVGHIFRDRSPYIWDVDVKDPLKRNLLRLADVWLDDYKRF 509

Query: 179 -NLELSFKG-DFGDVTSRKELRRNLGCKSFKWYLE--------VSNDWSGMCIDSACKPT 228
            +  + F+  D G+V+ RK LR  L C SF WYL          S   +   I+SA  P 
Sbjct: 510 YHARIGFEMVDIGNVSERKALREKLKCHSFDWYLTNIYPELFVPSKALASGDIESAAGPH 569

Query: 229 DMHKP-----------VGLYPCHKQGGNQFWMMSKHGEIRRDEACLD-----YAGGDVIL 272
            +  P           +   PCHKQGGNQFW++S   EIRRD+ C D     Y+ G   L
Sbjct: 570 CLDAPLPSENDSSSVIIKTRPCHKQGGNQFWLLSSENEIRRDDYCFDSGIQKYSIG---L 626

Query: 273 YPCHGSKGNQYFEYD 287
           Y CHGS GNQ F Y+
Sbjct: 627 YHCHGSHGNQEFTYE 641


>gi|449676829|ref|XP_002167311.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Hydra magnipapillata]
          Length = 603

 Score =  170 bits (430), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 105/313 (33%), Positives = 152/313 (48%), Gaps = 44/313 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    W +PLL  +  +  +VV P+I  I +  F     P        F  G F W L+
Sbjct: 260 CECTLGWAEPLLAKIKEDRQNVVMPVIDEISETNFNYNAVPE------PFQRGVFKWRLE 313

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  E +R K+ ++ + TP MAGGLFSI++ +F ++G+YD+G DIWGGEN+E+SF
Sbjct: 314 FTWRPIPSYEEQRRKHESDGIKTPVMAGGLFSINRDYFYEMGSYDTGMDIWGGENIEISF 373

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +      +   +P            P   P   GG   +      ++       D+W  E
Sbjct: 374 RIWMCGGSIEMLPCSRVGHVFRPRFPYSFPNRRGGDGDVVSRNLMRVA------DVWMDE 427

Query: 179 ------NLELSFK-GDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
                 N+    K     DVT+R +LR  L CKSF+WYL                  E+ 
Sbjct: 428 YAKHFYNIRFDLKRKKHDDVTARVKLRSKLQCKSFQWYLENVYPELEIPDDKFLAAGEIR 487

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILY 273
           N  SG+C+D+  K      PVGLY CH QGGNQ++  +  GEI+ ++ C+D+ G D+ + 
Sbjct: 488 NPESGICLDTLGKQEG--APVGLYACHGQGGNQYYTYNNKGEIKAEDNCMDFNGHDLYIR 545

Query: 274 PCHGSKGNQYFEY 286
            C G   NQ + Y
Sbjct: 546 ECDGLGLNQKWTY 558


>gi|332025155|gb|EGI65335.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Acromyrmex
           echinatior]
          Length = 605

 Score =  170 bits (430), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 117/315 (37%), Positives = 154/315 (48%), Gaps = 50/315 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A +   VV P+I  I DDTFE         S+     GGF+W L 
Sbjct: 257 CECTEGWLEPLLSRIANDRHTVVCPIIDVISDDTFEY-------ISASDMTWGGFNWKLN 309

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ + +RE  +R+ +   P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 310 FRWYRVAQREMDRRNSDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 369

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
           F+  W      E     H        +P T  GG+  I           +   D W    
Sbjct: 370 FRV-WQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKI--VLHNAARVAEVWMDEWRDFY 426

Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
                G  N+      D GDV+ R +LR  L CKSF+WYLE                 V 
Sbjct: 427 YAMNPGARNV------DVGDVSERIKLRERLKCKSFRWYLENIYPESPMPLDYYYLGDVK 480

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
           N  +  C+D+  + T   + VG+  CH  GGNQ +  +K  +I  D+ CLD A   G V 
Sbjct: 481 NIETQTCLDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAANPQGPVK 538

Query: 272 LYPCHGSKGNQYFEY 286
           +  CHG  GNQ + Y
Sbjct: 539 IVRCHGMGGNQAWVY 553


>gi|148694974|gb|EDL26921.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_b [Mus
           musculus]
          Length = 594

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 119/350 (34%), Positives = 160/350 (45%), Gaps = 82/350 (23%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 213 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 265

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 266 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 325

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 326 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 378

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 379 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 438

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGN------------------------------ 245
            +  C+D+  +  +  + VG++ CH  GGN                              
Sbjct: 439 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVHDLCLSAPSLGVGAEECCSNHPLYGLVY 496

Query: 246 ------QFWMMSKHGEIRRDEACLDYA--GGDVILYPCHGSKGNQYFEYD 287
                 Q +  +   EIR D+ CLD +   G VI+  CH  +GNQ +EYD
Sbjct: 497 TPTINEQVFSYTADKEIRTDDLCLDVSRLSGPVIMLKCHHMRGNQLWEYD 546


>gi|291238116|ref|XP_002738977.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Saccoglossus kowalevskii]
          Length = 561

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 109/319 (34%), Positives = 156/319 (48%), Gaps = 53/319 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  K WL+PL+  +A + + VVSP+I +I D+TFE    P       +   GGF+W L 
Sbjct: 210 CECTKGWLEPLIARIAEDRTRVVSPVIDSISDETFEYNSVP-------ELGCGGFNWRLN 262

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ + +RE+KR K +A  P+ TPTMAGGLFSI K +F ++GTYD G DIWGGENLE+S
Sbjct: 263 FRWYPMSKREKKRRKGDATIPINTPTMAGGLFSIHKEYFYRIGTYDEGMDIWGGENLEMS 322

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+          +P            P    T  GG+ ++      +L       ++W  
Sbjct: 323 FRIWMCGGTLEIVPCSHVGHVFRGKSPY---TFPGGVATVVHNNNRRLA------EVWMD 373

Query: 178 ENLELSFK-------GDFGDVTSRKELRRNLGCKSFKWYL------------------EV 212
           E     +K        ++GD+  RK+LR  L C SF+WYL                  EV
Sbjct: 374 EYKSFYYKTVPNARNAEYGDIEDRKQLREKLQCNSFRWYLENIFPDSQFLLDNYFRFCEV 433

Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG----G 268
            N  +  C+D+  +         L  CH QGG+Q +  SK  E++ D+ CLD +      
Sbjct: 434 RNMETKQCLDNMGQKE--KSKAALSRCHGQGGHQIYAWSKLNELKHDDLCLDASAPSGFK 491

Query: 269 DVILYPCHGSKGNQYFEYD 287
           DV    C+   G Q + Y+
Sbjct: 492 DVEQSRCNSHGGTQEWRYN 510


>gi|74215848|dbj|BAE28617.1| unnamed protein product [Mus musculus]
          Length = 330

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 110/293 (37%), Positives = 149/293 (50%), Gaps = 46/293 (15%)

Query: 25  VVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERERKRHK-NAAEP 83
           VV P+I  I DDTFE         +      GGF+W L F W+ +P+RE  R K +   P
Sbjct: 4   VVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 56

Query: 84  VWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWH--AIPERERKRHKNA 141
           V TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+SF+  W      E     H   
Sbjct: 57  VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRI-WQCGGTLEIVTCSHVGH 115

Query: 142 AEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF-------KGDFGDVTS 193
                TP T  GG   I      +L       ++W  E     +       K D+G+++S
Sbjct: 116 VFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEFKNFFYIISPGVTKVDYGNISS 169

Query: 194 RKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHKPVGL 236
           R  LRR L CK F WYL                 E+ N  +  C+D+  +  +  + VG+
Sbjct: 170 RLGLRRKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKEN--EKVGI 227

Query: 237 YPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPCHGSKGNQYFEYD 287
           + CH  GGNQ +  + + EIR D+ CLD +   G V +  CH  KGNQ +EYD
Sbjct: 228 FNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYD 280


>gi|26332527|dbj|BAC29981.1| unnamed protein product [Mus musculus]
          Length = 592

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 119/350 (34%), Positives = 160/350 (45%), Gaps = 82/350 (23%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERE--RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   +      +L       ++W  E 
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLA------EVWMDEF 376

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  +       K D+GDV+ RK LR NL CK F WYL                 E+ N 
Sbjct: 377 KDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNV 436

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGN------------------------------ 245
            +  C+D+  +  +  + VG++ CH  GGN                              
Sbjct: 437 ETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVHDLCLSAPSLGVGAEECCSNHPLYGLVY 494

Query: 246 ------QFWMMSKHGEIRRDEACLDYA--GGDVILYPCHGSKGNQYFEYD 287
                 Q +  +   EIR D+ CLD +   G VI+  CH  +GNQ +EYD
Sbjct: 495 TPTINEQVFSYTADKEIRTDDLCLDVSRLSGPVIMLKCHHMRGNQLWEYD 544


>gi|307189895|gb|EFN74139.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Camponotus
           floridanus]
          Length = 608

 Score =  169 bits (428), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 118/316 (37%), Positives = 155/316 (49%), Gaps = 50/316 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A +   VV P+I  I DDTFE   P   +T       GGF+W L 
Sbjct: 260 CECTEGWLEPLLSRIANDRHTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 312

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ + +RE  +R+ +   P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 313 FRWYRVAQREMDRRNGDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 372

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
           F+  W      E     H        +P T  GG+  I           +   D W    
Sbjct: 373 FRV-WQCGGTLEISSCSHVGHVFRDKSPYTFPGGVSKI--VLHNAARVAEVWMDEWRDFY 429

Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
                G  N+      D GDV+ R +LR  L CKSF+WYLE                 V 
Sbjct: 430 YAMNPGARNV------DVGDVSERIKLRERLKCKSFRWYLENIYPESPMPLDYYYLGDVK 483

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
           N     C+D+  + T   + VG+  CH  GGNQ +  +K  +I  D+ CLD A   G V 
Sbjct: 484 NVEMQTCLDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVK 541

Query: 272 LYPCHGSKGNQYFEYD 287
           +  CHG  GNQ + Y+
Sbjct: 542 IVRCHGMGGNQAWVYN 557


>gi|350402574|ref|XP_003486532.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 2 [Bombus impatiens]
          Length = 606

 Score =  169 bits (428), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 119/316 (37%), Positives = 157/316 (49%), Gaps = 50/316 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A + + VV P+I  I DDTFE   P   +T       GGF+W L 
Sbjct: 258 CECTEGWLEPLLSRIAEDRTTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 310

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ + +RE  +R  +   P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 311 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMS 370

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
           F+  W      E     H        TP T  GG   I      +L   +   D W    
Sbjct: 371 FRI-WMCGGTLEIATCSHVGHVFRKSTPYTFPGGTSKIVNHNNARLA--EVWLDQWKYFY 427

Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
                G  N+ +      GDV+ R +LR  L CKSF+WYLE                 V 
Sbjct: 428 YNINPGARNVAV------GDVSERIKLRERLKCKSFRWYLENIYPESPMPLDYYYLGDVQ 481

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
           N  +  C+D+  + T   + VG+  CH  GGNQ +  +K  +I  D+ CLD A   G V 
Sbjct: 482 NVETQSCLDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVK 539

Query: 272 LYPCHGSKGNQYFEYD 287
           +  CHG  GNQ + Y+
Sbjct: 540 IVRCHGMGGNQAWVYN 555


>gi|391343213|ref|XP_003745907.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Metaseiulus occidentalis]
          Length = 583

 Score =  169 bits (428), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 118/314 (37%), Positives = 159/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A +++ VV P+I  I D+ F    P    T       GGF+W L 
Sbjct: 232 CECTEGWLEPLLARIAEDNTRVVCPVIDVISDENFAY-VPASDQT------WGGFNWKLN 284

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  +R  +   PV TPTMAGGLF++DKA+FEKLG YD G DIWGGENLE+S
Sbjct: 285 FRWYRVPQRENDRRGGDRTLPVRTPTMAGGLFAMDKAYFEKLGKYDEGMDIWGGENLEMS 344

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       D+W  E 
Sbjct: 345 FRI-WMCGGTLEIVTCSHVGHVFRKSTPYTFPGGTGKIVNHNNARLA------DVWLDEW 397

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            +  F       K D GD + R +LR++L CKSF+WYL                 E+ N 
Sbjct: 398 KDFYFAINPVAKKVDRGDTSGRHKLRQDLQCKSFRWYLENIYPESHMPLDYYHLGEIKNA 457

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
              +C+D+  K +     +G   CH  GGNQ +  +K  +I  D++CLD +   G V L+
Sbjct: 458 DGNLCLDTYGKKSGDVLYMG--KCHGLGGNQVFAYTKRQQIMADDSCLDASSPSGPVKLF 515

Query: 274 PCHGSKGNQYFEYD 287
            CH   GNQ + YD
Sbjct: 516 RCHNMGGNQMWTYD 529


>gi|383865231|ref|XP_003708078.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Megachile rotundata]
          Length = 605

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 118/316 (37%), Positives = 156/316 (49%), Gaps = 50/316 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A N S VV P+I  I DDTFE   P   +T       GGF+W L 
Sbjct: 257 CECTEGWLEPLLARIAENRSTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 309

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ + +RE  +R  +   P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 310 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 369

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
           F+  W      E     H        +P T  GG+  +           +   D W    
Sbjct: 370 FRV-WQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKV--VLHNAARVAEVWMDEWRDFY 426

Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
                G  N+ +      GDV+ R +LR  L CKSF+WYLE                 V 
Sbjct: 427 YAMNPGARNVAV------GDVSERIKLRERLKCKSFRWYLENIYPESPMPLDYYYLGDVQ 480

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
           N  +  C+D+  + T   + VG+  CH  GGNQ +  +K  +I  D+ CLD A   G V 
Sbjct: 481 NIDTQTCLDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVK 538

Query: 272 LYPCHGSKGNQYFEYD 287
           +  CHG  GNQ + Y+
Sbjct: 539 IVRCHGMGGNQAWVYN 554


>gi|341900678|gb|EGT56613.1| CBN-GLY-3 protein [Caenorhabditis brenneri]
          Length = 613

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 120/335 (35%), Positives = 165/335 (49%), Gaps = 83/335 (24%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV + WL+PL+  +A +   VV+P+I  I DDTFE       +T+S   + GGF+W+L 
Sbjct: 267 VEVTEGWLEPLISRVAEDRKRVVAPIIDVISDDTFEY------VTASETTW-GGFNWHLN 319

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+++P+RE  +R  + + P+ TPT+AGGLF+IDK FF  +G+YD G  +WGGENLE+S
Sbjct: 320 FRWYSVPKRELNRRGSDRSMPIQTPTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEIS 379

Query: 123 FKFNWHAIPERE-----------RKR-------------HKNAAEP--VWTPTMAGGLFS 156
           F+  W      E           RK+             H NAA    VW          
Sbjct: 380 FRV-WMCGGSLEIHPCSRVGHVFRKQTPYTFPGGTAKVIHHNAARTAEVWMDEY------ 432

Query: 157 IDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE----- 211
             KAFF K+        +    N+E       GDVT RK+LR  L CKSFKWYLE     
Sbjct: 433 --KAFFYKM--------VPAARNVEA------GDVTERKKLRETLQCKSFKWYLENIYPE 476

Query: 212 ------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD 259
                       + N ++  C+D+  K     +P G+  CH  GGNQ W ++  GEIR D
Sbjct: 477 APLPADFRSLGAIVNRFTEKCVDTNGKKDG--QPPGMQACHGAGGNQAWSLTGKGEIRSD 534

Query: 260 EACLDYA-----GGDVILYPCHGSKGN--QYFEYD 287
           + CL        G ++ L  C  SK N    F +D
Sbjct: 535 DLCLSSGHVYQIGSELKLERCSVSKINPKHVFTFD 569


>gi|268575444|ref|XP_002642701.1| C. briggsae CBR-GLY-3 protein [Caenorhabditis briggsae]
          Length = 611

 Score =  168 bits (425), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 123/337 (36%), Positives = 166/337 (49%), Gaps = 83/337 (24%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV   WL+PL+  +A +   VV+P+I  I DDTFE       +T+S   + GGF+W+L 
Sbjct: 267 VEVTDGWLEPLVHRVAEDRKRVVAPIIDVISDDTFEY------VTASETTW-GGFNWHLN 319

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+A+P+RE  +R  + + P+ TPT+AGGLF+IDK FF  +G+YD G  +WGGENLE+S
Sbjct: 320 FRWYAVPKRELNRRGSDRSMPIQTPTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEIS 379

Query: 123 FKFNWHAIPERE-----------RKR-------------HKNAAEP--VWTPTMAGGLFS 156
           F+  W      E           RK+             H NAA    VW          
Sbjct: 380 FRV-WMCGGSLEIHPCSRVGHVFRKQTPYTFPGGTAKVIHHNAARTAEVWMDEY------ 432

Query: 157 IDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE----- 211
             KAFF K+        +   +N+E       GDVT RK+LR  L CKSFKWYLE     
Sbjct: 433 --KAFFYKM--------VPAAKNVEA------GDVTDRKKLRETLQCKSFKWYLENIYPE 476

Query: 212 ------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD 259
                       + N ++  CID+  K  D   P G+  CH  GGNQ W ++  GEIR D
Sbjct: 477 APLPADFRSLGSIVNRFTEKCIDTNGK-KDGQAP-GMQACHGAGGNQAWSLTGKGEIRSD 534

Query: 260 EACLDYA-----GGDVILYPCHGSKGN--QYFEYDYK 289
           + CL        G ++ L  C  SK N    F +D +
Sbjct: 535 DLCLSSGHVYQIGSELKLERCSVSKLNPKHIFAFDAQ 571


>gi|158293352|ref|XP_314708.4| AGAP008613-PA [Anopheles gambiae str. PEST]
 gi|157016664|gb|EAA10180.4| AGAP008613-PA [Anopheles gambiae str. PEST]
          Length = 596

 Score =  168 bits (425), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 113/310 (36%), Positives = 162/310 (52%), Gaps = 38/310 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +  +   VV P+I  I D+TFE       +T+S + + GGF+W L 
Sbjct: 239 CECTEGWLEPLLARIVLDRKTVVCPIIDVISDETFEY------VTASDQTW-GGFNWKLN 291

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P RE ++R+ +   P+ TPTMAGGLFSID+ +F ++G+YD G DIWGGENLE+S
Sbjct: 292 FRWYRVPAREMQRRNHDRTAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMS 351

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W    I E     H        +P T  GG+ +I           +   D W    
Sbjct: 352 FRI-WQCGGILEISPCSHVGHVFRDKSPYTFPGGVANI--VLKNAARVAEVWLDEWKEFY 408

Query: 180 LELS---FKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGM 219
            ++S    K   GDV+ R+ LR  L CKSF+WYL                 E+ N  +  
Sbjct: 409 YQMSPGARKASAGDVSERRALRERLKCKSFRWYLENIYPESQMPLDYYFLGEIRNVKTHN 468

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHG 277
           C+D+  + ++  + +G   CH  GGNQ +  +K  +I  D+ CLD +   G V L  CHG
Sbjct: 469 CLDTMGRKSN--EKIGSSYCHGLGGNQVFAYTKRHQIMSDDNCLDASNALGPVNLVRCHG 526

Query: 278 SKGNQYFEYD 287
             GNQ + YD
Sbjct: 527 MGGNQEWIYD 536


>gi|116007284|ref|NP_001036338.1| polypeptide GalNAc transferase 5, isoform B [Drosophila
           melanogaster]
 gi|113194958|gb|ABI31292.1| polypeptide GalNAc transferase 5, isoform B [Drosophila
           melanogaster]
          Length = 630

 Score =  168 bits (425), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 117/317 (36%), Positives = 158/317 (49%), Gaps = 52/317 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  + +N   VV P+I  I D+TFE       +T+S   + GGF+W L 
Sbjct: 283 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 335

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P RE  R  N    P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 336 FRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 395

Query: 123 FKFNWHA-----IPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
           F+  W       I    R  H        TP T  GG   I      +L       ++W 
Sbjct: 396 FRV-WMCGGVLEIAPCSRVGHVFRKS---TPYTFPGGTTEIVNHNNARL------VEVWL 445

Query: 177 GENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EV 212
            +  E  +       K   GDV+ RK LR  L CKSF+WYL                 E+
Sbjct: 446 DDWKEFYYSFYPGARKASAGDVSDRKALRDRLKCKSFRWYLENVYPESLMPLDYYYLGEI 505

Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDV 270
            N  +  C+D+  +    ++ VG+  CH  GGNQ +  +K  +I  D+ CLD +   G V
Sbjct: 506 RNAETETCLDTMGRK--YNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDASSSNGPV 563

Query: 271 ILYPCHGSKGNQYFEYD 287
            +  CH   GNQ + YD
Sbjct: 564 NMVRCHNMGGNQEWVYD 580


>gi|170592315|ref|XP_001900914.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Brugia malayi]
 gi|158591609|gb|EDP30214.1| Polypeptide N-acetylgalactosaminyltransferase 3, putative [Brugia
           malayi]
          Length = 584

 Score =  167 bits (424), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 107/309 (34%), Positives = 154/309 (49%), Gaps = 34/309 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV + WL+PLLD ++ +   VV+P+I  I D+ FE         ++     GGF+W+L 
Sbjct: 242 VEVTEGWLEPLLDRVSTDRKRVVAPIIDVISDENFEY-------ITASDVTWGGFNWHLN 294

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P RE  +R+ + + P+ TPT+AGGLF+ID+ FF  +G+YD G +IWGGENLE+S
Sbjct: 295 FRWYPVPMREMERRNHDRSVPLQTPTIAGGLFAIDRQFFYDIGSYDEGMEIWGGENLEIS 354

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +    P            P   P     +   + A   ++   D   DI+ G
Sbjct: 355 FRVWMCGGSLEIHPCSRVGHVFRKHTPYSFPGGTARVIHHNAARTAEVWM-DEYKDIFYG 413

Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSNDWSGMC 220
             +  +   D GD+T RK LR NL CKSF+WYLE                 V N     C
Sbjct: 414 -MVPAAKNVDVGDLTERKILRENLQCKSFRWYLETIYPESPIPIDFFSLGQVQNMGVMEC 472

Query: 221 IDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKG 280
           +D+A +         + PCH +GGNQ W  +  GEIR DE CL +    V +  C GS  
Sbjct: 473 LDTAGRSAG--DSPAMLPCHGKGGNQLWTYTGKGEIRSDELCLAFTTKGVSMEKCTGSVP 530

Query: 281 NQYFEYDYK 289
                +DY+
Sbjct: 531 LSKMIFDYE 539


>gi|443683126|gb|ELT87494.1| hypothetical protein CAPTEDRAFT_198873 [Capitella teleta]
          Length = 495

 Score =  167 bits (424), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 106/316 (33%), Positives = 155/316 (49%), Gaps = 50/316 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  + +N   VV P+I  I D+TFE         +      GGF+W L 
Sbjct: 153 CECTEGWLEPLLFEIHKNRKSVVCPIIDVISDETFEY-------ITGSDMTWGGFNWKLN 205

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  +R  + + P+ +PTMAGGL +I++ +F ++G+YD G DIWGGENLE+S
Sbjct: 206 FRWYPVPQREVERRGGDRSLPLRSPTMAGGLLAIERDYFYEIGSYDDGMDIWGGENLEMS 265

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+          +           A P   P   G + + + A            ++W  
Sbjct: 266 FRIWMCGGTLLIVTCSHVGHVFRKATPYTFPGGTGRIINHNNARLA---------EVWMD 316

Query: 178 ENLELSFK-------GDFGDVTSRKELRRNLGCKSFKWYL-----------------EVS 213
           E     +K        D+GD++ R +LR  L CKSF+WYL                 E+ 
Sbjct: 317 EWRSFYYKINPGVKQTDYGDLSPRIQLREKLECKSFRWYLQNIYPESQMPLDYYSLGEIR 376

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
           N  +  C+DS  +     + VG+  CH  GGNQ +  SK    + D+ CLD +   G V 
Sbjct: 377 NKETNQCLDSMGRKAG--EKVGIVGCHGMGGNQIFSYSKKKAFQTDDLCLDVSALTGPVK 434

Query: 272 LYPCHGSKGNQYFEYD 287
           LY CHG  GNQ +E+D
Sbjct: 435 LYQCHGLGGNQLWEHD 450


>gi|390347277|ref|XP_780324.3| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 1-like
           [Strongylocentrotus purpuratus]
          Length = 580

 Score =  167 bits (423), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 105/295 (35%), Positives = 152/295 (51%), Gaps = 48/295 (16%)

Query: 24  HVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERER-KRHKNAAE 82
           +VV P+I  I DD F          +      GGF+W LQF W+ +P+RE  +R  +   
Sbjct: 252 NVVCPIIDVISDDNFAFH-------TGSDMTYGGFNWKLQFRWYPVPQREADRRGGDRTI 304

Query: 83  PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWH--AIPERERKRHKN 140
           P+ +PTMAGGLFSIDK +FE++GTYD+G D+WGGENLE+SF+  W      E     H  
Sbjct: 305 PLRSPTMAGGLFSIDKTYFEEIGTYDAGMDVWGGENLEISFRI-WMCGGTLEIVTCSHVG 363

Query: 141 AAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF-------KGDFGDVT 192
                 TP T  GG   I     ++L       ++W  +     +       K +FGDV+
Sbjct: 364 HVFRKSTPYTFPGGTGRIINRNNQRLA------EVWMDDFRHFYYRISPGVRKTEFGDVS 417

Query: 193 SRKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHKPVG 235
            RK+LR  L C +F+WYL                 E+ N  +  C+D+  +  +  + VG
Sbjct: 418 QRKKLRDRLKCHTFEWYLENIYPESQFRLDFKTIGEIRNIETHKCLDNMGRKEN--EKVG 475

Query: 236 LYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGG----DVILYPCHGSKGNQYFEY 286
           ++ CH QGGNQ + ++K  EI+ D+ CLD +      DV++  CHG  GNQ + Y
Sbjct: 476 IFSCHGQGGNQIFALTKQNEIKHDDLCLDASANSHYKDVVMIKCHGKHGNQEWLY 530


>gi|344237432|gb|EGV93535.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Cricetulus
           griseus]
          Length = 413

 Score =  167 bits (423), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 109/293 (37%), Positives = 148/293 (50%), Gaps = 46/293 (15%)

Query: 25  VVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERERKRHK-NAAEP 83
           VV P+I  I DDTFE         +      GGF+W L F W+ +P+RE  R K +   P
Sbjct: 87  VVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 139

Query: 84  VWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWH--AIPERERKRHKNA 141
           V TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+SF+  W      E     H   
Sbjct: 140 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRI-WQCGGTLEIVTCSHVGH 198

Query: 142 AEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF-------KGDFGDVTS 193
                TP T  GG   I      +L       ++W  E     +       K D+G+++S
Sbjct: 199 VFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEFKNFFYIISPGFTKVDYGEISS 252

Query: 194 RKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHKPVGL 236
           R  LR  L CK F WYL                 E+ N  +  C+D+  +  +  + VG+
Sbjct: 253 RLGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKEN--EKVGI 310

Query: 237 YPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPCHGSKGNQYFEYD 287
           + CH  GGNQ +  + + EIR D+ CLD +   G V +  CH  KGNQ +EYD
Sbjct: 311 FNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYD 363


>gi|380030098|ref|XP_003698695.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Apis florea]
          Length = 605

 Score =  167 bits (422), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 116/316 (36%), Positives = 157/316 (49%), Gaps = 50/316 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A + + VV P+I  I DDTFE   P   +T       GGF+W L 
Sbjct: 257 CECTEGWLEPLLSRIAEDRTTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 309

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ + +RE  +R  +   P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 310 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 369

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
           F+  W      E     H        +P T  GG+  +           +   D W    
Sbjct: 370 FRV-WQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKV--VLHNAARVAEVWMDEWRDFY 426

Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
                G  N+ +      GDV+ R +LR+ L CKSF+WYLE                 V 
Sbjct: 427 YAMNPGARNVAV------GDVSERIKLRQRLKCKSFRWYLENIYPESPMPLDYYYLGDVQ 480

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
           N  +  C+D+  + T   + VG+  CH  GGNQ +  +K  +I  D+ CLD A   G V 
Sbjct: 481 NVDTQTCLDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVK 538

Query: 272 LYPCHGSKGNQYFEYD 287
           +  CHG  GNQ + Y+
Sbjct: 539 IVRCHGMGGNQAWVYN 554


>gi|402592820|gb|EJW86747.1| hypothetical protein WUBG_02341 [Wuchereria bancrofti]
          Length = 584

 Score =  167 bits (422), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 112/330 (33%), Positives = 157/330 (47%), Gaps = 76/330 (23%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV + WL+PLLD ++ +   VV+P+I  I D+ FE       +T+S   + GGF+W+L 
Sbjct: 242 VEVTEGWLEPLLDRVSTDRKRVVAPIIDVISDENFEY------ITASDVTW-GGFNWHLN 294

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P RE  +R+ + + P+ TPT+AGGLF+ID+ FF  +G+YD G ++WGGENLE+S
Sbjct: 295 FRWYPVPMREMERRNHDRSVPLQTPTIAGGLFAIDRQFFYDIGSYDEGMEVWGGENLEIS 354

Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD-------- 169
           F+                    VW   M GG   I         F K   Y         
Sbjct: 355 FR--------------------VW---MCGGSLEIHPCSRVGHVFRKHTPYSFPGGTARV 391

Query: 170 ------SGFDIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE----- 211
                    ++W  E  ++ +         D GD+T RK LR NL CKSF+WYLE     
Sbjct: 392 IHHNTARTAEVWMDEYKDIFYSMVPAARNVDVGDLTERKILRENLQCKSFRWYLETIYPE 451

Query: 212 ------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD 259
                       V N     C+D+A +         + PCH QGGNQ W  +  GEIR D
Sbjct: 452 SPIPIDFFSLGQVQNMGVMECLDTAGRSAG--DSPAMLPCHGQGGNQLWTYTGKGEIRSD 509

Query: 260 EACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
           E CL +    V +  C GS       +DY+
Sbjct: 510 ELCLAFTTKGVGMEKCIGSVPLSKMIFDYE 539


>gi|48143331|ref|XP_397422.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Apis mellifera]
          Length = 606

 Score =  167 bits (422), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 116/316 (36%), Positives = 156/316 (49%), Gaps = 50/316 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A + + VV P+I  I DDTFE   P   +T       GGF+W L 
Sbjct: 258 CECTEGWLEPLLSRIAEDRTTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 310

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ + +RE  +R  +   P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 311 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 370

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
           F+  W      E     H        +P T  GG+  +           +   D W    
Sbjct: 371 FRV-WQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKV--VLHNAARVAEVWMDEWRDFY 427

Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
                G  N+ +      GDV+ R +LR  L CKSF+WYLE                 V 
Sbjct: 428 YAMNPGARNVAV------GDVSERIKLRERLKCKSFRWYLENIYPESPMPLDYYYLGDVQ 481

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
           N  +  C+D+  + T   + VG+  CH  GGNQ +  +K  +I  D+ CLD A   G V 
Sbjct: 482 NVDTQTCLDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVK 539

Query: 272 LYPCHGSKGNQYFEYD 287
           +  CHG  GNQ + Y+
Sbjct: 540 IVRCHGMGGNQAWVYN 555


>gi|350402571|ref|XP_003486531.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 1 [Bombus impatiens]
          Length = 606

 Score =  166 bits (421), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 116/316 (36%), Positives = 156/316 (49%), Gaps = 50/316 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A + + VV P+I  I DDTFE   P   +T       GGF+W L 
Sbjct: 258 CECTEGWLEPLLSRIAEDRTTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 310

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ + +RE  +R  +   P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 311 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMS 370

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
           F+  W      E     H        +P T  GG+  +           +   D W    
Sbjct: 371 FRV-WQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKV--VLHNAARVAEVWMDEWRDFY 427

Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
                G  N+ +      GDV+ R +LR  L CKSF+WYLE                 V 
Sbjct: 428 YAMNPGARNVAV------GDVSERIKLRERLKCKSFRWYLENIYPESPMPLDYYYLGDVQ 481

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVI 271
           N  +  C+D+  + T   + VG+  CH  GGNQ +  +K  +I  D+ CLD A   G V 
Sbjct: 482 NVETQSCLDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVK 539

Query: 272 LYPCHGSKGNQYFEYD 287
           +  CHG  GNQ + Y+
Sbjct: 540 IVRCHGMGGNQAWVYN 555


>gi|340712006|ref|XP_003394556.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 1 [Bombus terrestris]
 gi|340712008|ref|XP_003394557.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 2 [Bombus terrestris]
          Length = 606

 Score =  166 bits (421), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 115/314 (36%), Positives = 157/314 (50%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A + + VV P+I  I DDTFE   P   +T       GGF+W L 
Sbjct: 258 CECTEGWLEPLLSRIAEDRTTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 310

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ + +RE  +R  +   P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 311 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMS 370

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        +P T  GG+  +       L       ++W  E 
Sbjct: 371 FRV-WQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKV------VLHNAARVAEVWMDEW 423

Query: 180 LELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-----------------VSND 215
            +  +  +        GDV+ R +LR  L CKSF+WYLE                 V N 
Sbjct: 424 RDFYYAMNPGARSVAVGDVSERIKLRERLKCKSFRWYLENIYPESPMPLDYFYLGDVQNV 483

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILY 273
            +  C+D+  + T   + VG+  CH  GGNQ +  +K  +I  D+ CLD A   G V + 
Sbjct: 484 ETQSCLDTMGRRTG--ENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVKIV 541

Query: 274 PCHGSKGNQYFEYD 287
            CHG  GNQ + Y+
Sbjct: 542 RCHGMGGNQAWVYN 555


>gi|427796213|gb|JAA63558.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Rhipicephalus pulchellus]
          Length = 621

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 112/307 (36%), Positives = 153/307 (49%), Gaps = 32/307 (10%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A + + VV P+I  I D+TFE         S+     GGF+W L 
Sbjct: 274 CECTQHWLEPLLARIAEDRTRVVCPVIDVISDETFEY-------ISASDMTWGGFNWKLN 326

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  +R  +   P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLELS
Sbjct: 327 FRWYRVPQREVERRGGDRTLPIRTPTMAGGLFSIDKDYFNELGKYDEGMDIWGGENLELS 386

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+          +P          + P   P     + + + A   ++   D   D +  
Sbjct: 387 FRIWMCGGELEIVPCSHVGHVFRKSTPYSFPGGTSRIVNHNNARLAEVW-LDEWKDFYFA 445

Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCID------------SAC 225
            N   +   D GD++ RK+LR  L C +F+WYLE     S M +D            S C
Sbjct: 446 IN-PAAKNVDKGDLSYRKQLRTKLKCNTFRWYLENIYPESHMPLDYYHLGEIKHADTSDC 504

Query: 226 KPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHGSKG 280
             T   K    V +  CH  GGNQ +  +K  +I  D+ CLD +   G V L  CHG  G
Sbjct: 505 LDTFGRKSGENVAVSKCHGMGGNQVFAYTKRQQIMSDDNCLDASSPRGPVKLLRCHGMGG 564

Query: 281 NQYFEYD 287
           NQ + Y+
Sbjct: 565 NQLWIYN 571


>gi|147907290|ref|NP_001085038.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Xenopus
           laevis]
 gi|47506925|gb|AAH71009.1| MGC81150 protein [Xenopus laevis]
          Length = 582

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 109/283 (38%), Positives = 142/283 (50%), Gaps = 46/283 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +  N + VV P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 234 CECVTGWLEPLLERIGENETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 287

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WHA+PE+ER+R K+  +P+ +PTMAGGLF++ K +FE LGTYD G ++WGGENLELSF
Sbjct: 288 FQWHAVPEKERQRRKSRIDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMEVWGGENLELSF 347

Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E E   H     P   P                L       ++W     E
Sbjct: 348 RV-WQCGGTLEIEPCSHVGHVFPKKAPYARPNF----------LQNTARAAEVWMDGYKE 396

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
           L +       K ++GD++ RK LR  L CKSF WYL+       +  D   W G    M 
Sbjct: 397 LFYNRNPPAQKENYGDISERKLLRERLQCKSFDWYLKKVFPELHIPEDRPGWHGAVRSMG 456

Query: 221 IDSACKPTDM--HKPVG----LYPCHKQGGNQFWMMSKHGEIR 257
           I S C   +   H P G    L+ CH QGGNQF+  +   EIR
Sbjct: 457 ISSECLDYNAPEHNPTGAHLSLFGCHGQGGNQFFEYTTKREIR 499


>gi|17553814|ref|NP_498722.1| Protein GLY-3 [Caenorhabditis elegans]
 gi|21264486|sp|P34678.2|GALT3_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
           AltName: Full=GalNAc-T1; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 3; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 3; Short=pp-GaNTase 3
 gi|3047187|gb|AAC13669.1| GLY3 [Caenorhabditis elegans]
 gi|351020565|emb|CCD62541.1| Protein GLY-3 [Caenorhabditis elegans]
          Length = 612

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 119/327 (36%), Positives = 161/327 (49%), Gaps = 81/327 (24%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV   WL+PL+  +A +   VV+P+I  I DDTFE       +T+S   + GGF+W+L 
Sbjct: 266 VEVTDGWLEPLVSRVAEDRKRVVAPIIDVISDDTFEY------VTASETTW-GGFNWHLN 318

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+A+P+RE  +R  + + P+ TPT+AGGLF+IDK FF  +G+YD G  +WGGENLE+S
Sbjct: 319 FRWYAVPKRELNRRGSDRSMPIQTPTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEIS 378

Query: 123 FKFNWHAIPERE-----------RKR-------------HKNAAEP--VWTPTMAGGLFS 156
           F+  W      E           RK+             H NAA    VW          
Sbjct: 379 FRV-WMCGGSLEIHPCSRVGHVFRKQTPYTFPGGTAKVIHHNAARTAEVWMDEY------ 431

Query: 157 IDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE----- 211
             KAFF K+        +    N+E       GDV+ RK+LR  L CKSFKWYLE     
Sbjct: 432 --KAFFYKM--------VPAARNVEA------GDVSERKKLRETLQCKSFKWYLENIYPE 475

Query: 212 ------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD 259
                       + N ++  C+D+  K  D   P G+  CH  GGNQ W ++  GEIR D
Sbjct: 476 APLPADFRSLGAIVNRFTEKCVDTNGK-KDGQAP-GIQACHGAGGNQAWSLTGKGEIRSD 533

Query: 260 EACLDYA-----GGDVILYPCHGSKGN 281
           + CL        G ++ L  C  SK N
Sbjct: 534 DLCLSSGHVYQIGSELKLERCSVSKIN 560


>gi|196001849|ref|XP_002110792.1| hypothetical protein TRIADDRAFT_22976 [Trichoplax adhaerens]
 gi|190586743|gb|EDV26796.1| hypothetical protein TRIADDRAFT_22976 [Trichoplax adhaerens]
          Length = 515

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 105/319 (32%), Positives = 152/319 (47%), Gaps = 53/319 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +  + S VV P I  I D+ F  ++ P  L        G F+W+L 
Sbjct: 165 CEANTGWLEPLLERIYNDRSTVVCPEIDVISDENFAYQYGPSGLMR------GIFNWDLH 218

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W A+   E+KR ++  +PV TPTMAGGLF+I++ +F+++GTYD   DIWGGENLE+SF
Sbjct: 219 FRWRAVSTEEQKRRQSPIDPVRTPTMAGGLFAINRDYFKEIGTYDEEMDIWGGENLEISF 278

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF-DIWGG 177
           +          +P          ++P   P          K   + LG       ++W  
Sbjct: 279 RIWQCGGTLEIVPCSHVGHVFRKSQPYGFP----------KGVVDTLGKNSQRVAEVWMD 328

Query: 178 ENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE------------------V 212
              E  ++         +GD++ R E+R+ L CKSFKWYLE                  V
Sbjct: 329 GYKEFFYQRQPHLRGHAYGDISKRLEIRKKLKCKSFKWYLENIYTDAVLPNESVIAKGKV 388

Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GG 268
            N  S MC+DS  +P   +  +GL PC           +   E+   + CLD +    G 
Sbjct: 389 RNPASNMCLDSLSRPKLSY--IGLSPCTLSAMTMIISFTVRQELVVQDICLDVSDYNPGT 446

Query: 269 DVILYPCHGSKGNQYFEYD 287
            V LY CHG KGNQ + ++
Sbjct: 447 KVQLYECHGMKGNQLWMHE 465


>gi|358341053|dbj|GAA48824.1| polypeptide N-acetylgalactosaminyltransferase [Clonorchis sinensis]
          Length = 424

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 112/309 (36%), Positives = 158/309 (51%), Gaps = 36/309 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A +S  VVSP I  I + TFE  F PG   +      G FDW L 
Sbjct: 67  CEATTGWLEPLLHQIALDSHRVVSPSIDVIQESTFE--FVPGAPNT-----WGYFDWRLS 119

Query: 64  FNWHAIPERERKR-HKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F+W    ERER R H +   P+ TPTMAGGLFSI KAFFE+LGTYD G  +WGGEN+E+S
Sbjct: 120 FHWGQATERERARTHGDPNIPLRTPTMAGGLFSISKAFFEELGTYDEGMVVWGGENVEMS 179

Query: 123 FKFNWHAIPE---RERKRHKNAAEPVWTPTMAGGLFSI-DKAFFEKLGTYDSGFDIWGGE 178
            +  W    E       R  +    V   +  GG+  +  +        +     ++  +
Sbjct: 180 LRV-WQCGGELLILPCSRVGHVFRKVSPYSWPGGVSHVLSRNAMRTALVWMDDHKLFYLK 238

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGMCI 221
           +   +   D+GD++ R+ LR+ L CKSF+WYL                 E+ ++ SG+C+
Sbjct: 239 SSPDAVHTDYGDISERQALRKRLRCKSFRWYLENVDVESVFPVDFHGIGEIRHESSGLCL 298

Query: 222 DSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVILYPC-HG 277
           D+  +    H PVGL  CH QGGNQ ++ +  GEI+ +  C+   D     ++  PC   
Sbjct: 299 DTLGQ--KQHGPVGLSSCHGQGGNQLFVWTTKGEIQAEVGCVSPTDDGDTPLLFKPCLRL 356

Query: 278 SKGNQYFEY 286
             G Q F+Y
Sbjct: 357 DTGPQLFDY 365


>gi|327282475|ref|XP_003225968.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Anolis carolinensis]
          Length = 583

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 109/285 (38%), Positives = 143/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N S ++ P+I  I  +TFE    PG      +  IGGFDW L 
Sbjct: 235 CECVPGWLEPLLQRVAENESVIICPVIDTIDWNTFEFYMQPG------EPMIGGFDWRLT 288

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER+R K+  +P+ +PTMAGGLF++ K +FE LGTYD G D+WGGENLELSF
Sbjct: 289 FQWHSVPDYERQRRKSKVDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMDVWGGENLELSF 348

Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W    I E     H     P   P                L       ++W  +  E
Sbjct: 349 RV-WQCGGILEIHPCSHVGHVFPKRAPYARPNF----------LQNTARAAEVWMDDYKE 397

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
             +       K +FGD++ RK LR+ L C +F WYL+       V  D   W G    M 
Sbjct: 398 HFYNRNPPARKENFGDLSERKLLRKKLQCNNFDWYLKNIFPNLHVPEDRPGWHGAIRSMG 457

Query: 221 IDSAC--------KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C         PT  H  V L+ CH QGGNQF+  + + EIR
Sbjct: 458 ISSECLDYNSPEHNPTGAH--VSLFGCHGQGGNQFFEYTVNQEIR 500


>gi|395823173|ref|XP_003804166.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 1 [Otolemur garnettii]
          Length = 539

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 109/299 (36%), Positives = 149/299 (49%), Gaps = 52/299 (17%)

Query: 25  VVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERERKRHK-NAAEP 83
           VV P+I  I DDTFE         +      GGF+W L F W+ +P+RE  R K +   P
Sbjct: 207 VVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 259

Query: 84  VWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA-----IPERERKRH 138
           V TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+SF+  W       I        
Sbjct: 260 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRI-WQCGGTLEIVTCSXXXX 318

Query: 139 KNAAEPVW---TP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF-------KGD 187
                 V+   TP T  GG   I      +L       ++W  E     +       K D
Sbjct: 319 XXXVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEFKNFFYIISPGVTKVD 372

Query: 188 FGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDM 230
           +GD++SR  LR  L C+ F WYL                 E+ N  +  C+D+  +  + 
Sbjct: 373 YGDISSRLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKEN- 431

Query: 231 HKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPCHGSKGNQYFEYD 287
            + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V +  CH  KGNQ +EYD
Sbjct: 432 -EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYD 489


>gi|355753170|gb|EHH57216.1| Polypeptide N-acetylgalactosaminyltransferase 12, partial [Macaca
           fascicularis]
          Length = 542

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 114/298 (38%), Positives = 153/298 (51%), Gaps = 48/298 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +    S VV P+I  I  +TFE       L +S +  IGGFDW L 
Sbjct: 200 CECHEGWLEPLLQRIHEEESAVVCPVIDVIDWNTFEY------LGNSGEPQIGGFDWRLV 253

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH +PERER R ++  + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 254 FTWHTVPERERIRMRSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 313

Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W    + E     H     P   P      +S +KA    L       ++W  E  E
Sbjct: 314 RI-WQCGGVLETHPCSHVGHVFPKQAP------YSRNKA----LANSVRAAEVWMDEFKE 362

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSGM----CIDS 223
           L +  +       FGDVT RK+LR  L CK FKW+LE       V  D  G     C D 
Sbjct: 363 LYYHRNPHARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHVPEDRPGFFGMYCFDY 422

Query: 224 ACKPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR----RDEACLDY-AGGDVIL 272
              P D ++ VG    LY CH  G NQF+  +   EIR    + E C+   AG D+++
Sbjct: 423 --NPPDENQIVGHQVILYVCHGMGHNQFFEYTSQKEIRYNTHQPEGCIAVEAGMDILI 478


>gi|156397426|ref|XP_001637892.1| predicted protein [Nematostella vectensis]
 gi|156225008|gb|EDO45829.1| predicted protein [Nematostella vectensis]
          Length = 513

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 113/320 (35%), Positives = 153/320 (47%), Gaps = 52/320 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    W +PLL  +A +  +VV P I  I  DTF  +   G   +  +   GGF W+L 
Sbjct: 163 CEATPGWAEPLLARIAADRRNVVCPAIEVINADTFAYQ---GSTNADQR---GGFSWDLF 216

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  E+K   + ++P+ TPTMAGGLFSI + +F  +G+YD   DIWGGENLELSF
Sbjct: 217 FKWKGIPPEEQKLRNDDSDPIRTPTMAGGLFSIHRQYFFDIGSYDEEMDIWGGENLELSF 276

Query: 124 KFNWHA-----IPERERKRH--KNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
           +  W       I    R  H  +    P   P    G+       F +L       ++W 
Sbjct: 277 RV-WMCGGRLEIVTCSRVGHVFRKYTSPYKFP---DGVERTLTKNFNRLA------EVWM 326

Query: 177 GENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL------------------E 211
            E  +L +         D+GD++ R ELR+ L CKSFKWY+                  E
Sbjct: 327 DEYKDLYYNKKPQAKNSDYGDISKRLELRKRLKCKSFKWYINNIYPDVQMPELDPPARGE 386

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD----YAG 267
           V N  S  C+DS     + +  VG+Y CH QGGNQ         I  +E C D    + G
Sbjct: 387 VRNPSSNQCLDSLGAKPEHNARVGIYTCHGQGGNQVSKYMPRELIFEEENCFDVSKTHPG 446

Query: 268 GDVILYPCHGSKGNQYFEYD 287
             V L  CHG +GNQ +++D
Sbjct: 447 APVELMKCHGMRGNQEWKHD 466


>gi|196001853|ref|XP_002110794.1| hypothetical protein TRIADDRAFT_23130 [Trichoplax adhaerens]
 gi|190586745|gb|EDV26798.1| hypothetical protein TRIADDRAFT_23130 [Trichoplax adhaerens]
          Length = 536

 Score =  164 bits (415), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 107/311 (34%), Positives = 150/311 (48%), Gaps = 38/311 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL+PLLD + +N S VV P I  I D TF+ R        S     G F+W+++
Sbjct: 185 CEVTIGWLEPLLDRVHQNRSVVVCPEIDVIDDKTFQYR------AGSSGDIRGVFNWDMK 238

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W   P +E+KR  N       +PTMAGGLF+ID+ +F+++G YDS  DIWGGENLELS
Sbjct: 239 FRWRLTPSQEQKRRNNYNVLFARSPTMAGGLFAIDRQYFQEIGLYDSQMDIWGGENLELS 298

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+          +P            P   P  AG   +I+K        +  G+  +  
Sbjct: 299 FRIWQCGGQLEIMPCSHVGHVFRNVIPYKFPKDAG--LTINKNSVRTAEVWMDGYKEFVY 356

Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGM 219
           +         FG++T R ELR+ L CKSFKWYL+                  V N  S M
Sbjct: 357 QRQPYMRNIHFGNITERLELRKKLQCKSFKWYLDHVFTDVILPNESAIAKGKVRNPESEM 416

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY----AGGDVILYPC 275
           C+++  +P   H  +GL PC  +G      ++   E+  DE C D     +GG + L  C
Sbjct: 417 CLNTLGRPK--HAFLGLSPCAHEGKTMIISLTVLNELAMDEVCFDVSDHQSGGKITLLDC 474

Query: 276 HGSKGNQYFEY 286
           H   GNQ++ +
Sbjct: 475 HSMGGNQFWSH 485


>gi|345492127|ref|XP_001602037.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Nasonia vitripennis]
          Length = 635

 Score =  164 bits (414), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 117/321 (36%), Positives = 154/321 (47%), Gaps = 53/321 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A +   VV P+I  I DDTFE         ++     GGF+W L 
Sbjct: 280 CECTEGWLEPLLARIAHDKKTVVCPIIDVISDDTFEY-------ITASDMTWGGFNWKLN 332

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ + +RE  +R+ +   P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 333 FRWYRVAQREMDRRNGDRTAPLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMS 392

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIW---- 175
           F+  W    I E     H        +P T  GG+  I           +   D W    
Sbjct: 393 FRV-WQCGGILEISPCSHVGHVFRDKSPYTFPGGVSKI--VLHNAARVAEVWMDEWRDFY 449

Query: 176 -----GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCID----SACK 226
                G  N+ +      GDV+ R +LR  L CKSF+WYLE     S M +D       K
Sbjct: 450 YAMNPGARNVPV------GDVSERVKLREQLKCKSFRWYLENIYPESPMPLDYYYLGDIK 503

Query: 227 PTDMHKP------------------VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG- 267
             D + P                  VG+  CH  GGNQ +  +K  +I  D+ CLD A  
Sbjct: 504 NADPNNPEKVQNYCLDTMGRRTGENVGMSYCHGLGGNQIFAYTKRQQIMSDDMCLDAASP 563

Query: 268 -GDVILYPCHGSKGNQYFEYD 287
            G V +  CHG  GNQ + Y+
Sbjct: 564 QGPVKIVRCHGMGGNQAWIYN 584


>gi|118404432|ref|NP_001072705.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Xenopus
           (Silurana) tropicalis]
 gi|115313486|gb|AAI24052.1| polypeptide N-acetylgalactosaminyltransferase 4 [Xenopus (Silurana)
           tropicalis]
 gi|134026084|gb|AAI35912.1| polypeptide N-acetylgalactosaminyltransferase 4 [Xenopus (Silurana)
           tropicalis]
          Length = 582

 Score =  163 bits (413), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 110/284 (38%), Positives = 147/284 (51%), Gaps = 48/284 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  N + VV P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 234 CECISGWLEPLLQRIGENETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 287

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WHA+PE+ER+R K+  +P+ +PTMAGGLF++ K +FE LGTYD G ++WGGENLELSF
Sbjct: 288 FQWHAVPEKERQRRKSRIDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMEVWGGENLELSF 347

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK---LGTYDSGFDIWGGENL 180
           +  W      E        EP    +  G +F   KA + +   L       ++W     
Sbjct: 348 RV-WQCGGTLE-------IEPC---SHVGHVFP-KKAPYARPNFLQNTARAAEVWMDGYK 395

Query: 181 ELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----M 219
           EL +       K ++GD++ RK LR  L CKSF WYL+       +  D   W G    M
Sbjct: 396 ELFYNRNPPARKENYGDISERKLLRERLQCKSFDWYLKNVFPDLHIPEDRPGWHGAVRSM 455

Query: 220 CIDSACKPTDM--HKPVG----LYPCHKQGGNQFWMMSKHGEIR 257
            I + C   +   H P G    L+ CH QGGNQF+  +   EIR
Sbjct: 456 GISNECLDYNAPDHNPTGAHLSLFGCHGQGGNQFFEYTTMREIR 499


>gi|291220820|ref|XP_002730422.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
            [Saccoglossus kowalevskii]
          Length = 1082

 Score =  163 bits (413), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 103/319 (32%), Positives = 154/319 (48%), Gaps = 57/319 (17%)

Query: 4    CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            CEV   WL+PL++ + R+SS +  P+I  I  D+F     P           GG +W LQ
Sbjct: 739  CEVNYNWLEPLIERIYRDSSTIACPVIDIIDPDSFAYSASP--------LVRGGVNWGLQ 790

Query: 64   FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
            F W  +P  E  R  +  EP+ +P MAGGLF++D+ +FE +G+YD    IWGGE+LELSF
Sbjct: 791  FKWKNVPPVELLRRNSEIEPIKSPIMAGGLFAVDRNYFEHIGSYDKDMQIWGGEHLELSF 850

Query: 124  KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS--GFDIWG 176
            +          +P          + P    T+ GG+        E + T++S    ++W 
Sbjct: 851  RIWQCGGTLEIVPCSRVGHIFRKSHPY---TIPGGM--------ENVFTHNSIRVAEVWM 899

Query: 177  GENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL------------------E 211
             +     +          +GD++ R +L+  L CK FKWYL                  E
Sbjct: 900  DDYKRFFYATRPDAQGKTYGDLSERLKLKSRLKCKDFKWYLDNVYPELSVPNENAYAWGE 959

Query: 212  VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDV- 270
              N  S +C+D+  +  +  +PVGLY CH  GGNQ +  +K GE+R +E CLD +   V 
Sbjct: 960  CQNAASNVCLDTLMR--EAGQPVGLYICHGGGGNQVFSYTKLGEVRHEELCLDVSTKKVG 1017

Query: 271  ---ILYPCHGSKGNQYFEY 286
               +   CH   GNQ +E+
Sbjct: 1018 ETPVFEQCHALGGNQMWEH 1036


>gi|355689595|gb|AER98885.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 12 [Mustela putorius
           furo]
          Length = 452

 Score =  163 bits (412), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 110/303 (36%), Positives = 153/303 (50%), Gaps = 51/303 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ ++ + + VV P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 104 CECNSGWLEPLLERISYDETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 157

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R K+  +P+ +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 158 FQWHSVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 217

Query: 124 KFNWHA--IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S +KA    L       ++W  E  E
Sbjct: 218 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRNKA----LANCVRAAEVWMDEFKE 266

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMCIDSA 224
           L +  +       FGDVT RK+LR  L CK F+W+LE       V  D   + GM  +  
Sbjct: 267 LYYHRNPHARLEPFGDVTERKQLRAKLQCKDFRWFLENVYPELHVPEDRPGFFGMLQNKG 326

Query: 225 CK-------PTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR----RDEACLDYAGGD 269
            K       P + ++ +G    LY CH  G NQF+  +   EIR    + EAC+    G 
Sbjct: 327 LKDYCFDYNPPNENQIMGHQVLLYLCHGMGQNQFFEYTSQKEIRYNTHQPEACIAVEAGT 386

Query: 270 VIL 272
            IL
Sbjct: 387 DIL 389


>gi|326436254|gb|EGD81824.1| hypothetical protein PTSG_02538 [Salpingoeca sp. ATCC 50818]
          Length = 604

 Score =  163 bits (412), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 112/321 (34%), Positives = 160/321 (49%), Gaps = 59/321 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +  + + VV+P+I NI   TF     P  +T       G F W+L 
Sbjct: 249 CECNVGWLEPLLERIYLDRTTVVTPVIDNIDKKTFAYTGSPTVITR------GIFTWSLT 302

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+W  +P  E+K+ K+   P+ +PTMAGGLFS+D+ +F ++G+YD G D+WGGENLE+SF
Sbjct: 303 FSWLDLPWFEQKKRKDPIAPLPSPTMAGGLFSMDREYFFEIGSYDMGMDVWGGENLEISF 362

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +          IP            P   P  +G + +I+K         +   ++W  E
Sbjct: 363 RIWQCGGTLEFIPCSRVGHVYRDFHPYKFP--SGAVQTINKNL-------NRVAEVWMDE 413

Query: 179 NLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------------------V 212
             EL +           GD++ R ELR+ L CK FKWYL+                   V
Sbjct: 414 YKELYYGVRPHHRAIGTGDISDRLELRKKLNCKPFKWYLDNVFPDMMVPLPENLLGKGAV 473

Query: 213 SNDWSGMCIDS-ACKPTDMHKPVGLYPCH--KQGGNQFWMMSKHGEIRRD----EACLDY 265
            N  + MC+DS + +  DM    GLYPC   K     F+  +K+GEIRR+      CLD+
Sbjct: 474 KNAATNMCLDSLSSREVDMK--AGLYPCANGKSENQMFYFTTKYGEIRREGTFGARCLDF 531

Query: 266 AGG----DVILYPCHGSKGNQ 282
           AGG     + +Y CH  KGNQ
Sbjct: 532 AGGKPGSTLSMYGCHLMKGNQ 552


>gi|195433228|ref|XP_002064617.1| GK23729 [Drosophila willistoni]
 gi|194160702|gb|EDW75603.1| GK23729 [Drosophila willistoni]
          Length = 677

 Score =  163 bits (412), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 118/352 (33%), Positives = 160/352 (45%), Gaps = 77/352 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  + +N   VV P+I  I D+TFE       +T+S   + GGF+W L 
Sbjct: 285 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 337

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P RE  R  N    P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 338 FRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 397

Query: 123 FKF----------------------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA 160
           F+                       + +  P    K   + A  V    M GG+  I   
Sbjct: 398 FRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVAEVWMCGGILEIAPC 457

Query: 161 -----FFEKLGTYD---SGFDIWGGENLEL------------------SFKGDFGDVTSR 194
                 F K   Y       +I    N  L                  + K   GDV+ R
Sbjct: 458 SRVGHVFRKSTPYTFPGGTTEIVNHNNARLVEVWLDDWKEFYYSFYPGARKASAGDVSDR 517

Query: 195 KELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHKPVGLY 237
           K LR  L CKSF+WYL                 E+ N  +  C+D+  +    ++ VG+ 
Sbjct: 518 KNLRERLKCKSFRWYLENVYPESLMPLDYYYLGEIRNSETETCLDTMGR--KYNEKVGIS 575

Query: 238 PCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHGSKGNQYFEYD 287
            CH  GGNQ +  +K  +I  D+ CLD +   G V +  CH   GNQ + YD
Sbjct: 576 YCHGLGGNQVFAYTKRQQIMSDDLCLDASSSNGPVNMVRCHNMGGNQEWVYD 627


>gi|449507774|ref|XP_004186276.1| PREDICTED: LOW QUALITY PROTEIN:
           UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
           partial [Taeniopygia guttata]
          Length = 402

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 110/322 (34%), Positives = 155/322 (48%), Gaps = 56/322 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 51  CECTLGWLEPLLSRIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 103

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE +R K +   PV TPTMAGGLFSID+++FE++GTYD+G DIWGGENLE+S
Sbjct: 104 FRWYPVPQREMERRKGDRTLPVRTPTMAGGLFSIDRSYFEEIGTYDAGMDIWGGENLEMS 163

Query: 123 FKFNWHAIPERERKRHKNA------AEPVWTPTMAGGLFSID------------KAFFEK 164
           F+  W      E     +       A P   P   G + + +            K FF  
Sbjct: 164 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDDFKDFFYI 222

Query: 165 LGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-------------- 210
           +      F +W    L       +G V     L+  + C+ F WYL              
Sbjct: 223 ISPGAPRF-VWDKRIL-------YGIVPWCGTLKIRMKCQPFSWYLENVYPDSQIPRRYY 274

Query: 211 ---EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA- 266
              E+ N  +  C+D+  +  +  + VG + CH  GGNQ +  +   EIR D+ CLD + 
Sbjct: 275 SLGEIRNVETNQCLDNMGRKEN--EKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSR 332

Query: 267 -GGDVILYPCHGSKGNQYFEYD 287
             G V++  CH  +GNQ +EYD
Sbjct: 333 LNGPVLMLKCHHLRGNQLWEYD 354


>gi|195550891|ref|XP_002076130.1| GD11982 [Drosophila simulans]
 gi|194201779|gb|EDX15355.1| GD11982 [Drosophila simulans]
          Length = 541

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 119/352 (33%), Positives = 161/352 (45%), Gaps = 80/352 (22%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  + +N   VV P+I  I D+TFE       +T+S   + GGF+W L 
Sbjct: 152 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 204

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P RE  R  N    P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 205 FRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 264

Query: 123 FKF----------------------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA 160
           F+                       + +  P    K   + A  VW   M GG+  I   
Sbjct: 265 FRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVW---MCGGVLEIAPC 321

Query: 161 -----FFEKLGTYD---SGFDIWGGENLEL------------------SFKGDFGDVTSR 194
                 F K   Y       +I    N  L                  + K   GDV+ R
Sbjct: 322 SRVGHVFRKSTPYTFPGGTTEIVNHNNARLVEVWLDDWKEFYYSFYPGARKASAGDVSDR 381

Query: 195 KELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHKPVGLY 237
           K LR  L CKSF+WYL                 E+ N  +  C+D+  +    ++ VG+ 
Sbjct: 382 KALRDRLKCKSFRWYLENVYPESLMPLDYYYLGEIRNAETETCLDTMGR--KYNEKVGIS 439

Query: 238 PCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHGSKGNQYFEYD 287
            CH  GGNQ +  +K  +I  D+ CLD +   G V +  CH   GNQ + YD
Sbjct: 440 YCHGLGGNQVFAYTKRQQIMSDDLCLDASSSNGPVNMVRCHNMGGNQEWVYD 491


>gi|326917280|ref|XP_003204928.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
           [Meleagris gallopavo]
          Length = 528

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 114/305 (37%), Positives = 149/305 (48%), Gaps = 55/305 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A   S VV P+I  I  +TFE       L ++ +  IGGFDW L 
Sbjct: 176 CECHEGWLEPLLERIAEEESAVVCPVIDVIDWNTFEY------LGNAGEPQIGGFDWRLV 229

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH  PERE+KR K+  + + +PTMAGGLFS+ K +F+ LG+YD+G ++WGGENLE SF
Sbjct: 230 FTWHTTPEREQKRRKSKIDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSF 289

Query: 124 KFNWHAIPERERK--RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S  KA    L       ++W  E  E
Sbjct: 290 RI-WQCGGSLEIHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDEYKE 338

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSG--------- 218
           L +  +       +GDVT R+ LR  L CK FKW+LE       V  D  G         
Sbjct: 339 LYYHRNPHARLEPYGDVTERRLLREKLKCKDFKWFLENVYPELHVPEDRPGFFGMLKNRG 398

Query: 219 ---MCIDSACKPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEI----RRDEACLDYAG 267
               C D    P + H+  G    LYPCH  G NQF+  + H EI    R+ EAC     
Sbjct: 399 MANFCFDY--NPPNEHEITGHRVILYPCHGMGQNQFFEYTSHNEIRYNTRQPEACAAVIA 456

Query: 268 GDVIL 272
           G   L
Sbjct: 457 GTEYL 461


>gi|363730612|ref|XP_419065.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Gallus
           gallus]
          Length = 590

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 117/325 (36%), Positives = 157/325 (48%), Gaps = 61/325 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A   S VV P+I  I  +TFE       L ++ +  IGGFDW L 
Sbjct: 238 CECHEGWLEPLLERIAEEESAVVCPVIDVIDWNTFEY------LGNAGEPQIGGFDWRLV 291

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH  PERE+KR K+  + + +PTMAGGLFS+ K +F+ LG+YD+G ++WGGENLE SF
Sbjct: 292 FTWHTTPEREQKRRKSKIDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSF 351

Query: 124 KFNWHAIPERERK--RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S  KA    L       ++W  E  E
Sbjct: 352 RI-WQCGGSLEIHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDEYKE 400

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSG--------- 218
           L +  +       +GDV+ R+ LR  L CK FKW+LE       V  D  G         
Sbjct: 401 LYYHRNPHARLEPYGDVSERRLLREKLKCKDFKWFLENVYPELHVPEDRPGFFGMLKNRG 460

Query: 219 ---MCIDSACKPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEI----RRDEAC----- 262
               C D    P++ H+  G    LYPCH  G NQF+  + H EI    R+ EAC     
Sbjct: 461 MANFCFDY--NPSNEHEITGHRVILYPCHGMGQNQFFEYTSHNEIRYNTRQPEACAAVIA 518

Query: 263 -LDYAGGDVILYPCHGSKGNQYFEY 286
             DY   ++     H    NQ F +
Sbjct: 519 GTDYLTMNLCQENIHRVPENQKFAF 543


>gi|291230378|ref|XP_002735140.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Saccoglossus kowalevskii]
          Length = 621

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 117/347 (33%), Positives = 161/347 (46%), Gaps = 93/347 (26%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  K WL+PLLD +A N S VV P+I  I D +F        + ++    IGGFDWN+ 
Sbjct: 255 CECSKGWLEPLLDRIAANRSTVVCPVINQIDDRSFAF------VNATEVSHIGGFDWNII 308

Query: 64  FNWHAIPERERKR-HKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           FNW+ IP+ E+ R   + +EPV +PTMAGGLFSIDK++FE+LG+YD  F+ WGGEN+ELS
Sbjct: 309 FNWYNIPQSEKDRIGGDKSEPVRSPTMAGGLFSIDKSYFEELGSYDPEFEFWGGENIELS 368

Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTY---DSGFDI 174
            K                    +W   M GG+            F K   +   ++ +++
Sbjct: 369 LK--------------------IW---MCGGILEFVPCSHVGHVFRKHNPHKYKNTTYNV 405

Query: 175 WGGENLEL------------------SFKGDFGDVTSRKELRRNLGCKSFKWYL------ 210
            G  N  L                  + K D GD++ R +LR+NL CKSF+W+L      
Sbjct: 406 VGRNNRRLAEVWLDEYKYLFYANQPETMKIDPGDISQRVQLRKNLQCKSFRWFLQNIYPD 465

Query: 211 -----------EVSNDWSGMCID---------------SACKPTDMHKPVGLYPCHKQGG 244
                      ++ N  SG C+D                A   T     V L+PCH  G 
Sbjct: 466 SHYNFAFVGVGQLKNVASGACLDFGKAAGHGGKEFKGKDATNVTS--NTVELWPCH-DGK 522

Query: 245 NQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHGSKGNQYFEYDYK 289
            Q ++ +   E R    CLDY        LY CHG   NQ + +D K
Sbjct: 523 IQLFIRTDKKEFRYIHMCLDYNVQFSFPFLYECHGQGANQQWIHDLK 569


>gi|195114158|ref|XP_002001634.1| GI15842 [Drosophila mojavensis]
 gi|193912209|gb|EDW11076.1| GI15842 [Drosophila mojavensis]
          Length = 628

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 113/321 (35%), Positives = 150/321 (46%), Gaps = 60/321 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E  ++WL+PLL+ +  + + VV P+I  I  D F+       L        GGFDWNL 
Sbjct: 288 VECNEQWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 340

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   ER  RH +    + TP +AGGLF IDKA+F KLG YD   D+WGGENLE+S
Sbjct: 341 FKWEYLSPAERAARHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 400

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +              
Sbjct: 401 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFAKNTR---------RAA 446

Query: 173 DIWGGE-------NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW-------- 216
           ++W  E        + L+    FG++  R  L+  L CK FKWYLE V  D         
Sbjct: 447 EVWMDEYKQHYYNAVPLAKNIPFGNIDDRLALKEKLQCKPFKWYLEHVYPDLQTPDPQDV 506

Query: 217 ------SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA---- 266
                 +  C+D+     D    VGL+PCH  GGNQ W  SK GEI+ DE CL       
Sbjct: 507 GQFRQDATECLDTMGHIVD--GTVGLFPCHNTGGNQEWTFSKRGEIKHDELCLTLVQFAR 564

Query: 267 GGDVILYPCHGSKGNQYFEYD 287
           G  VIL PC  S+  ++   D
Sbjct: 565 GSQVILKPCDESENQRWVMKD 585


>gi|194761562|ref|XP_001962998.1| GF15722 [Drosophila ananassae]
 gi|190616695|gb|EDV32219.1| GF15722 [Drosophila ananassae]
          Length = 675

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 118/352 (33%), Positives = 160/352 (45%), Gaps = 77/352 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  + +N   VV P+I  I D+TFE       +T+S   + GGF+W L 
Sbjct: 283 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 335

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P RE  R  N    P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 336 FRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 395

Query: 123 FKF----------------------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA 160
           F+                       + +  P    K   + A  V    M GG+  I   
Sbjct: 396 FRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVAEVWMCGGVLEIAPC 455

Query: 161 -----FFEKLGTYD---SGFDIWGGENLEL------------------SFKGDFGDVTSR 194
                 F K   Y       +I    N  L                  + K   GDV+ R
Sbjct: 456 SRVGHVFRKSTPYTFPGGTTEIVNHNNARLVEVWLDDWKEFYYSFYPGARKASAGDVSDR 515

Query: 195 KELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHKPVGLY 237
           K LR  L CKSF+WYL                 E+ N  +  C+D+  +    ++ VG+ 
Sbjct: 516 KALRERLKCKSFRWYLENVYPESLMPLDYYYLGEIRNAETETCLDTMGR--KYNEKVGIS 573

Query: 238 PCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHGSKGNQYFEYD 287
            CH  GGNQ +  +K  +I  D+ CLD +   G V +  CH   GNQ + YD
Sbjct: 574 YCHGLGGNQVFAYTKRQQIMSDDLCLDASSSNGPVNMVRCHNMGGNQEWVYD 625


>gi|156392174|ref|XP_001635924.1| predicted protein [Nematostella vectensis]
 gi|156223022|gb|EDO43861.1| predicted protein [Nematostella vectensis]
          Length = 415

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 107/275 (38%), Positives = 136/275 (49%), Gaps = 46/275 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  K WL+PL   +A NSS+VV P+I  I D TF     P        F  G F W L+
Sbjct: 153 CECSKGWLEPLAAKIAENSSNVVMPVIDEISDTTFYYHAVPE------PFHRGVFRWRLE 206

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P+ E +R K+ A+ + TP MAGGLFSIDK +FEK+GTYD+G DIWGGENLE+SF
Sbjct: 207 FGWKPVPQYEMERRKDEADGIRTPVMAGGLFSIDKNYFEKIGTYDTGMDIWGGENLEISF 266

Query: 124 KFNWH---AIPERERKRHKNAAEPVWT---PTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           +  W    AI      R  +   P +    P   G    +      ++       D+W  
Sbjct: 267 RI-WMCGGAIEMLPCSRVGHVFRPRFPYSFPARPGHNTDVVSNNLMRVA------DVWMD 319

Query: 178 E------NLELSFK-GDFGDVTSRKELRRNLGCKSFKWYL------------------EV 212
           E      N+    K     DV+ R  LR  L CK+FKWYL                  +V
Sbjct: 320 EYKKHFYNIRFDLKRKQHDDVSQRLALREKLKCKNFKWYLDNVYPELEVPDTNFAASGQV 379

Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQF 247
            N  S MC+D+  K  D   P+GLY CH QGGNQ 
Sbjct: 380 RNPSSDMCLDTLGKKDDT--PLGLYQCHGQGGNQV 412


>gi|281341254|gb|EFB16838.1| hypothetical protein PANDA_002911 [Ailuropoda melanoleuca]
          Length = 496

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 114/304 (37%), Positives = 153/304 (50%), Gaps = 52/304 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +    S VV P+I  I  +TFE    PG         IGGFDW L 
Sbjct: 147 CECHEGWLEPLLQRIHEEESAVVCPVIDVIDWNTFEYLGNPGEPQ------IGGFDWRLV 200

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH +PERER R ++  + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 201 FTWHVVPERERMRMRSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 260

Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S +KA    L       ++W  E  E
Sbjct: 261 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRNKA----LANSVRAAEVWMDEFKE 309

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMCIDSA 224
           L +  +       FGDVT RK+LR  L CK F+W+LE       V  D   + GM  +  
Sbjct: 310 LYYHRNPHARLEPFGDVTERKQLRARLQCKDFRWFLENVYPELHVPEDRPGFFGMLQNKG 369

Query: 225 CK-------PTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR----RDEACLDY-AGG 268
            K       P + ++ VG    LY CH  G NQF+  +   EIR    + EAC+   AG 
Sbjct: 370 LKDYCFDYNPPNENQIVGHQVLLYLCHGLGQNQFFEYTSQEEIRYNTHQPEACIAVEAGK 429

Query: 269 DVIL 272
           DV++
Sbjct: 430 DVLI 433


>gi|301758254|ref|XP_002914993.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
           [Ailuropoda melanoleuca]
          Length = 540

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 114/304 (37%), Positives = 153/304 (50%), Gaps = 52/304 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +    S VV P+I  I  +TFE    PG         IGGFDW L 
Sbjct: 191 CECHEGWLEPLLQRIHEEESAVVCPVIDVIDWNTFEYLGNPGEPQ------IGGFDWRLV 244

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH +PERER R ++  + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 245 FTWHVVPERERMRMRSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 304

Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S +KA    L       ++W  E  E
Sbjct: 305 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRNKA----LANSVRAAEVWMDEFKE 353

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMCIDSA 224
           L +  +       FGDVT RK+LR  L CK F+W+LE       V  D   + GM  +  
Sbjct: 354 LYYHRNPHARLEPFGDVTERKQLRARLQCKDFRWFLENVYPELHVPEDRPGFFGMLQNKG 413

Query: 225 CK-------PTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR----RDEACLDY-AGG 268
            K       P + ++ VG    LY CH  G NQF+  +   EIR    + EAC+   AG 
Sbjct: 414 LKDYCFDYNPPNENQIVGHQVLLYLCHGLGQNQFFEYTSQEEIRYNTHQPEACIAVEAGK 473

Query: 269 DVIL 272
           DV++
Sbjct: 474 DVLI 477


>gi|195472767|ref|XP_002088670.1| GE18697 [Drosophila yakuba]
 gi|194174771|gb|EDW88382.1| GE18697 [Drosophila yakuba]
          Length = 675

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 118/352 (33%), Positives = 160/352 (45%), Gaps = 77/352 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  + +N   VV P+I  I D+TFE       +T+S   + GGF+W L 
Sbjct: 283 CECTEGWLEPLLARIVQNRRTVVCPIIDVISDETFEY------ITASDSTW-GGFNWKLN 335

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P RE  R  N    P+ TPTMAGGLFSIDK +F ++G+YD G DIWGGENLE+S
Sbjct: 336 FRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 395

Query: 123 FKF----------------------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA 160
           F+                       + +  P    K   + A  V    M GG+  I   
Sbjct: 396 FRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVAEVWMCGGVLEIAPC 455

Query: 161 -----FFEKLGTYD---SGFDIWGGENLEL------------------SFKGDFGDVTSR 194
                 F K   Y       +I    N  L                  + K   GDV+ R
Sbjct: 456 SRVGHVFRKSTPYTFPGGTTEIVNHNNARLVEVWLDDWKEFYYSFYPGARKASAGDVSDR 515

Query: 195 KELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHKPVGLY 237
           K LR  L CKSF+WYL                 E+ N  +  C+D+  +    ++ VG+ 
Sbjct: 516 KALRDRLKCKSFRWYLENVYPESLMPLDYYYLGEIRNAETETCLDTMGR--KYNEKVGIS 573

Query: 238 PCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILYPCHGSKGNQYFEYD 287
            CH  GGNQ +  +K  +I  D+ CLD +   G V +  CH   GNQ + YD
Sbjct: 574 YCHGLGGNQVFAYTKRQQIMSDDLCLDASSSNGPVNMVRCHNMGGNQEWVYD 625


>gi|312075557|ref|XP_003140470.1| Gly-3 protein [Loa loa]
 gi|307764367|gb|EFO23601.1| Gly-3 protein [Loa loa]
          Length = 584

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 100/304 (32%), Positives = 151/304 (49%), Gaps = 52/304 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV + WL+PLLD ++ +   VV+P+I  I D+ FE         ++     GGF+W+L 
Sbjct: 242 VEVTEGWLEPLLDRVSVDRKRVVAPIIDVISDENFEY-------ITASDITWGGFNWHLN 294

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P RE  +R+ + + P+ TPT+AGGLF+ID+ FF  +G+YD G ++WGGENLE+S
Sbjct: 295 FRWYPVPMREMERRNHDRSVPLQTPTIAGGLFAIDRQFFYDIGSYDEGMEVWGGENLEIS 354

Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD-------SGFDIW 175
           F+  W              +  +   +  G +F     +    GT +          ++W
Sbjct: 355 FRV-WMC----------GGSLEIHPCSRVGHVFRKHTPYSFPGGTANVIHRNAARTAEVW 403

Query: 176 GGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE----------------- 211
             E  ++ +K        D GD+T RK LR NL CKSF+WYLE                 
Sbjct: 404 MDEYKDIFYKMVPAAKNVDIGDLTERKVLRENLQCKSFRWYLETIYPESPIPIDFLSLGQ 463

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVI 271
           + N     C+D+A +         + PCH +GGNQ W  +  GEIR DE CL +    + 
Sbjct: 464 IQNMGVVGCLDTAGRSAG--DSPAILPCHGKGGNQLWAYTGKGEIRADELCLAFTVKGIS 521

Query: 272 LYPC 275
           +  C
Sbjct: 522 MEKC 525


>gi|312377724|gb|EFR24483.1| hypothetical protein AND_10876 [Anopheles darlingi]
          Length = 594

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 124/356 (34%), Positives = 165/356 (46%), Gaps = 85/356 (23%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +  +   VV P+I  I D+TFE       +T+S + + GGF+W L 
Sbjct: 246 CECTEGWLEPLLARIVLDRKTVVCPIIDVISDETFEY------VTASDQTW-GGFNWKLN 298

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P RE ++R+ +   P+ TPTMAGGLFSID+ +F ++G+YD G DIWGGENLE+S
Sbjct: 299 FRWYRVPAREMQRRNHDRTAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMS 358

Query: 123 FKFNWH--AIPERERKRH----------------------KNAAE--PVWTPTMAGGLFS 156
           F+  W    I E     H                      KNAA    VW   M GG   
Sbjct: 359 FRI-WQCGGILEIAPCSHVGHVFRDKSPYTFPGGVANIVLKNAARVAEVW---MCGGTLE 414

Query: 157 IDKA-----FFEKLGTYD---SGFDIWGGENLEL------------------SFKGDFGD 190
           I         F K   Y        I    N  L                  + K   GD
Sbjct: 415 IAPCSRVGHVFRKSTPYSFPGGTSQIVNKNNARLAEVWLDGWSEFYYNINPGARKASAGD 474

Query: 191 VTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHKP 233
           V+ R+ELR  L CKSF+WYL                 E+ N  S  C+D+  +  +  + 
Sbjct: 475 VSERRELRERLKCKSFRWYLENIYPESQMPLDYYFLGEIRNVESQNCLDTMGRKAN--EK 532

Query: 234 VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY--AGGDVILYPCHGSKGNQYFEYD 287
           +G   CH  GGNQ +  +K  +I  D+ CLD   A G V L  CHG  GNQ + YD
Sbjct: 533 IGSSYCHGLGGNQVFAYTKRHQIMSDDNCLDASNALGPVNLVRCHGMAGNQEWIYD 588


>gi|189236651|ref|XP_969621.2| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
           castaneum]
 gi|270005204|gb|EFA01652.1| hypothetical protein TcasGA2_TC007223 [Tribolium castaneum]
          Length = 564

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 112/311 (36%), Positives = 154/311 (49%), Gaps = 47/311 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A + + VV P+I  I  DTF+       L        GGFDWNL 
Sbjct: 222 CECNVNWLEPLLERVAEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWNLV 274

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   ER+ R ++  + + TP +AGGLF I+KA+FEKLG YD   D+WGGENLE+S
Sbjct: 275 FKWEYLGYAERESRQRDPTQAIRTPMIAGGLFVINKAYFEKLGKYDMKMDVWGGENLEIS 334

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D   
Sbjct: 335 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKH 389

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------VSNDWS 217
             +    + L+    FGD++ R ELRRNL CK FKWYL+               V     
Sbjct: 390 FYYAA--VPLAKNIPFGDISERLELRRNLQCKPFKWYLQHVYPELAIPQATSAHVGELRQ 447

Query: 218 GM-CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGG-DVIL 272
           GM C+D+     D    V LY CH  GGNQ W ++  G I+  + CL   DY  G  V++
Sbjct: 448 GMYCLDTMGHLID--GTVALYQCHHTGGNQEWGLTSGGLIKHHDLCLTLDDYMKGVQVVM 505

Query: 273 YPCHGSKGNQY 283
             C GS   ++
Sbjct: 506 RICDGSDSQKW 516


>gi|449276238|gb|EMC84873.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Columba livia]
          Length = 522

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 106/285 (37%), Positives = 142/285 (49%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A N + +V P+I  I   TFE          + +  IGGFDW L 
Sbjct: 174 CECVSGWLEPLLERIAENETVIVCPVIDTIDWKTFEYYM------QTAEPMIGGFDWRLT 227

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R K+  +P+ +PTMAGGLF++ K +FE LGTYD+G D+WGGENLELSF
Sbjct: 228 FQWHSVPKHERLRRKSETDPIRSPTMAGGLFAVSKKYFEYLGTYDTGMDVWGGENLELSF 287

Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W    + E     H     P   P                L       ++W  E  E
Sbjct: 288 RV-WQCGGMLEIHPCSHVGHVFPKRAPYARPNF----------LQNTARAAEVWMDEYKE 336

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGM----- 219
             +       K ++GD++ RK LR  L CKSF WYL+       V  D   W G      
Sbjct: 337 HFYNRNPSARKENYGDLSERKILRERLKCKSFNWYLKNIFAELHVPEDRPGWHGAIRSAG 396

Query: 220 ----CIDSAC---KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR 257
               C+D A     PT  H  + L+ CH QGGNQF+  + + EIR
Sbjct: 397 IASECLDYALPENHPTGAH--LSLFGCHGQGGNQFFEYTSNKEIR 439


>gi|326911650|ref|XP_003202170.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Meleagris gallopavo]
          Length = 579

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 106/285 (37%), Positives = 140/285 (49%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A N + V+ P+I  I  +TFE          S +  IGGFDW L 
Sbjct: 231 CECVSGWLEPLLERIAENETVVICPVIDTIDWNTFEYYM------QSAEPMIGGFDWRLT 284

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R K+  +P+ +PTMAGGLF++ K +FE LGTYD+G D+WGGENLELSF
Sbjct: 285 FQWHSVPKHERLRRKSETDPIRSPTMAGGLFAVSKKYFEYLGTYDTGMDVWGGENLELSF 344

Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W    + E     H     P   P                L       ++W  E  E
Sbjct: 345 RV-WQCGGMLEIHPCSHVGHVFPKRAPYARPNF----------LQNTARAAEVWMDEYKE 393

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
             +       K ++GD++ RK LR  L CKSF WYL        V  D   W G      
Sbjct: 394 HFYNRNPPARKENYGDISERKLLRERLKCKSFNWYLRNVFSELHVPEDRPGWHGAVRSVG 453

Query: 221 IDSAC--------KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C         PT  H  + L+ CH QGGNQF+  + + E R
Sbjct: 454 ISSECLDYVLPEHNPTGAH--LSLFGCHGQGGNQFFEYTSNKEFR 496


>gi|291382916|ref|XP_002708201.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
           [Oryctolagus cuniculus]
          Length = 476

 Score =  160 bits (406), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 117/318 (36%), Positives = 157/318 (49%), Gaps = 54/318 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +    S VV P+I  I  +TFE    PG         IGGFDW L 
Sbjct: 127 CECHEGWLEPLLHRIHEKESAVVCPVIDVIDWNTFEYLGNPGEPQ------IGGFDWRLV 180

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH +PERER R ++  + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 181 FTWHVVPERERLRMRSPIDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 240

Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S +KA    L       ++W  E  E
Sbjct: 241 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRNKA----LANSVRAAEVWMDEFKE 289

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMCIDSA 224
           L +  +       FGDVT R++LR  L CK FKW+LE       V  D   + GM  +  
Sbjct: 290 LYYHRNPRARLEPFGDVTERRQLRAKLQCKDFKWFLETVYPELHVPEDRPGFFGMLQNKG 349

Query: 225 CK-------PTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR----RDEACLDY-AGG 268
            K       P D ++  G    LY CH  G NQF+  +   EIR    + E C+   A  
Sbjct: 350 LKNFCFDYNPPDENQITGHQVILYTCHGMGQNQFFEYTSQMEIRYNTHQPEGCVAVEADK 409

Query: 269 DV-ILYPCHGSK-GNQYF 284
           DV +++PC  +   NQ F
Sbjct: 410 DVLVMHPCQDTTPENQKF 427


>gi|410905319|ref|XP_003966139.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Takifugu rubripes]
          Length = 557

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 106/314 (33%), Positives = 151/314 (48%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  + ++   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 210 CECTTGWLEPLLARIKKDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 262

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV     AGG     + +F+++GTYD+G DIWGGENLE+S
Sbjct: 263 FRWYPVPQREMDRRKGDRTLPVRWVRCAGGXXXXXRDYFQEIGTYDAGMDIWGGENLEIS 322

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 323 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 375

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD+ +R  LR+ L CK F WYL                 E+ N 
Sbjct: 376 KNFFYIISPGVTKVDYGDIATRTALRQKLQCKPFSWYLESIYPDSQIPRHYYSLGEIRNV 435

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD +   G V++ 
Sbjct: 436 ETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVMML 493

Query: 274 PCHGSKGNQYFEYD 287
            CH  KGNQ ++YD
Sbjct: 494 KCHHLKGNQLWDYD 507


>gi|198422185|ref|XP_002121130.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
           4 [Ciona intestinalis]
          Length = 582

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 103/299 (34%), Positives = 146/299 (48%), Gaps = 47/299 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +  + S +V P+I  I  +TFE  +        ++  IGGFDW L 
Sbjct: 236 CECVEGWLEPLLERIMEDESVIVVPVIDTIDWNTFEYYY------GGHEPQIGGFDWRLT 289

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH IP+ ERKR K+  +P+ +PTMAGGLF++ K +F ++GTYD+G +IWGGENLELSF
Sbjct: 290 FQWHTIPDHERKRRKSPVDPIRSPTMAGGLFAVSKRYFTRIGTYDAGMEIWGGENLELSF 349

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +          IP            P   P          + + +    Y   F I    
Sbjct: 350 RTWMCGGKLETIPCSHVGHVFPKQSPYPRPKFLTNTLRAAEVWMDD---YKRHFYIRNPP 406

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND------------WSGM 219
               + K ++GD+++RK+LR +L C  FKWYL+       V  D             S  
Sbjct: 407 ----ASKENYGDISARKDLRNSLQCHDFKWYLDNVYPDLHVPEDRPGYYGAFRNSGMSSF 462

Query: 220 CIDSACKPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR---RDEACLDYAGGDVI 271
           C+D A      H P G    ++ CH QGGNQF+  +   E+R     E C+     D I
Sbjct: 463 CLDYA---PPQHNPTGGRVSIFGCHGQGGNQFFEYTSKREVRFNSEKEMCMSAVEDDTI 518


>gi|442756891|gb|JAA70604.1| Putative polypeptide n-acetylgalactosaminyltransferase [Ixodes
           ricinus]
          Length = 582

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 114/314 (36%), Positives = 155/314 (49%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A + + VV P+I  I D+TFE         S+     GGF+W L 
Sbjct: 235 CECTQNWLEPLLARIAEDRTRVVCPVIDVISDETFEY-------ISASDLTWGGFNWKLN 287

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F  + +P+RE  +R  +   PV TPTMAGGLF+IDK +F +LG YD G DIWGGENLELS
Sbjct: 288 FRGYRVPQRELDRRGGDRTLPVRTPTMAGGLFAIDKDYFVELGKYDEGMDIWGGENLELS 347

Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W    E E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 348 FRI-WMCGGELEIVPCSHVGHVFRKSTPYTFPGGTSKIVNHNNARLA------EVWLDEW 400

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
            E  F         D GD++ R+ LR+ L C SF+WYL                 E+ + 
Sbjct: 401 KEFYFAINPAAKNVDKGDLSHRRNLRKKLKCNSFRWYLENIYPESHMPLDYYHLGEIKHA 460

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG--GDVILY 273
            S +C+D+  + +  +  V +  CH    NQ +  +K  +I  D+ CLD +   G V L 
Sbjct: 461 DSPVCLDTFGRKSGEN--VAVSTCHGXXXNQVFAYTKRQQIMSDDNCLDASSPRGPVKLL 518

Query: 274 PCHGSKGNQYFEYD 287
            CHG  GNQ + YD
Sbjct: 519 RCHGMGGNQLWIYD 532


>gi|345317797|ref|XP_001520970.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
           [Ornithorhynchus anatinus]
          Length = 467

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 113/309 (36%), Positives = 158/309 (51%), Gaps = 53/309 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +    S VV P+I  I  +TFE       L ++ +  IGGFDW L 
Sbjct: 118 CECHEGWLEPLLERIREEESAVVCPVIDVIDWNTFEY------LGNAGEPQIGGFDWRLV 171

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH IPERE+KR ++  + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 172 FTWHPIPEREQKRRRSKVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 231

Query: 124 KFNWHAIPERERK--RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S  KA    L       ++W     E
Sbjct: 232 RI-WQCGGSLEIHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDGYKE 280

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
           L +  +       +GDVT+R++LR  L C+ FKW+LE       V  D   + GM     
Sbjct: 281 LYYHRNPHARLEPYGDVTARRDLRSKLKCRDFKWFLENVYPELHVPEDRPGYFGMLKNKG 340

Query: 221 IDSAC---KPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIRRD----EAC--LDYAG 267
           +++ C    P D ++  G    LYPCH  G NQF+  + H EIR +    EAC  +D   
Sbjct: 341 MENHCFDYNPPDENEVTGQRLILYPCHGMGQNQFFEYTSHHEIRYNTRHPEACAAVDVGT 400

Query: 268 GDVILYPCH 276
             V +Y C 
Sbjct: 401 DYVTMYLCQ 409


>gi|432934600|ref|XP_004081948.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 3-like [Oryzias
           latipes]
          Length = 600

 Score =  160 bits (405), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 107/320 (33%), Positives = 158/320 (49%), Gaps = 43/320 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A+N + VVSP I+ I  +TFE   P     +  +   G FDW L 
Sbjct: 250 CECFNGWLEPLLARIAQNYTAVVSPDISTIDLNTFEFMKPSPYGQNHNR---GNFDWGLS 306

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +F ++G+YD   +IWGGEN+E+SF
Sbjct: 307 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFYQIGSYDEEMEIWGGENIEMSF 366

Query: 124 KF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           +          IP        R +  H     P  T  +A     + + + +    Y   
Sbjct: 367 RVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTQVIARNQVRLAEVWMDD---YKEI 420

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
           F     +  +++ +  FGD++ RK+LR  L CK+F WYL+                  V 
Sbjct: 421 FYRRNQQAAQIAKEETFGDISKRKDLRERLQCKNFSWYLKNIYPEIFMPDLNPLLFGSVK 480

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDV 270
           N     C+D A +  +  K + +YPCH  GGNQ++  S H E+R +   E CL  AGG V
Sbjct: 481 NVGKASCLD-AGENNEGGKELIMYPCHGLGGNQYFEYSTHREVRHNIQKELCLHGAGGVV 539

Query: 271 ILYPCHGSKGNQYFEYDYKY 290
            L  C     N +   + K+
Sbjct: 540 KLEECQYKGRNTFVGAEQKW 559


>gi|432110716|gb|ELK34193.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Myotis davidii]
          Length = 466

 Score =  160 bits (405), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 114/303 (37%), Positives = 149/303 (49%), Gaps = 52/303 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +    S VV P+I  I  +TFE       L +S +  IGGFDW L 
Sbjct: 116 CECHEGWLEPLLQRIQEEESAVVCPVIDVIDWNTFEY------LGNSGEPQIGGFDWRLV 169

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH +PERER R ++  + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 170 FTWHVVPERERMRMRSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 229

Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S  KA    L       ++W  E  E
Sbjct: 230 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRKKA----LANSVRAAEVWMDEFKE 278

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMCIDSA 224
           L +  +       FGDVT RK+LR  L CK FKW+LE       V  D   + GM  +  
Sbjct: 279 LYYHRNPHARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHVPEDRPGFFGMLQNKG 338

Query: 225 CK-------PTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR----RDEACLDY-AGG 268
            K       P   H   G    LY CH  G NQF+  +   EIR    + EAC+   AG 
Sbjct: 339 LKDYCFDYNPPSEHDLTGHQVLLYLCHGMGQNQFFEHTSQNEIRYNTHQPEACIAVEAGA 398

Query: 269 DVI 271
           D +
Sbjct: 399 DTL 401


>gi|312374382|gb|EFR21947.1| hypothetical protein AND_15990 [Anopheles darlingi]
          Length = 669

 Score =  160 bits (405), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 114/319 (35%), Positives = 150/319 (47%), Gaps = 61/319 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A + + VV P+I  I  DTF+       L        GGFDWNL 
Sbjct: 323 CECNVHWLEPLLARVAEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWNLV 375

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   ERK R ++   P+ TP +AGGLF ID+++FEKLGTYD+  DIWGGENLE+S
Sbjct: 376 FKWEYLSGAERKERQRDPTAPIRTPMIAGGLFVIDRSYFEKLGTYDTQMDIWGGENLEIS 435

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P   GG  +I    F K        
Sbjct: 436 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFP--GGGSGNI----FAK--NTRRAA 482

Query: 173 DIWGGE-------NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV------------- 212
           ++W  E        + L+    FGD+  R  LR  L CK F+WYLE              
Sbjct: 483 EVWMDEYKRYYYAAVPLATNIPFGDIEDRLRLREELQCKPFRWYLENVYPQLSVPERRNN 542

Query: 213 -SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG---- 267
            S      C+DS          VGLY CH  GGNQ W++++ GE++  + CL        
Sbjct: 543 GSIRQGAFCLDSLGNVAGA--IVGLYSCHGNGGNQNWILNRKGEVKHHDLCLTLIKFSVN 600

Query: 268 ---GDVILYPCHGSKGNQY 283
                VI+  C GS+  Q+
Sbjct: 601 ARYNSVIMKYCDGSENQQW 619


>gi|348519859|ref|XP_003447447.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Oreochromis niloticus]
          Length = 624

 Score =  160 bits (404), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 107/306 (34%), Positives = 150/306 (49%), Gaps = 43/306 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP I  I  +TFE   P     +  +   G FDW+L 
Sbjct: 274 CECFNGWLEPLLARIAENYTAVVSPDITTIDLNTFEFMKPSPYGQNHNR---GNFDWSLS 330

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +F ++G+YD   +IWGGEN+E+SF
Sbjct: 331 FGWESLPDHEKRRRKDETYPIKTPTFAGGLFSISKEYFYRIGSYDEEMEIWGGENIEMSF 390

Query: 124 KF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           +          IP        R +  H     P  T  +A     + + + +    Y   
Sbjct: 391 RVWQCGGQLEIIPCSIVGHVFRTKSPH---TFPKGTQVIARNQVRLAEVWMDD---YKEI 444

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
           F     +  +++  G FGD++ R ELR  L CKSF WYL+                  V 
Sbjct: 445 FYRRNQQAAQIAKDGAFGDISKRVELREKLQCKSFSWYLQNVYPEVFMPDLNPLRFGSVK 504

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDV 270
           N     C+D A +  +  K + +YPCH  GGNQ++  S H EIR +   E CL  A G V
Sbjct: 505 NVGKDSCLD-AGENNEGGKQLIMYPCHGLGGNQYFEYSTHHEIRHNIQKELCLHGAEGAV 563

Query: 271 ILYPCH 276
            L  C 
Sbjct: 564 KLEDCQ 569


>gi|167523942|ref|XP_001746307.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775069|gb|EDQ88694.1| predicted protein [Monosiga brevicollis MX1]
          Length = 2376

 Score =  160 bits (404), Expect = 9e-37,   Method: Composition-based stats.
 Identities = 102/294 (34%), Positives = 138/294 (46%), Gaps = 61/294 (20%)

Query: 5    EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
            EV + W +PLL  +  +  HVV+P+I  I D  F     P           GGFDW L F
Sbjct: 1124 EVNRDWAEPLLQRINEDPLHVVTPIIDVISDSNFRYSASP--------VVRGGFDWGLTF 1175

Query: 65   NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
             W ++P  ++     A  P+ +PTMAGGLF++ +  F +LGTYD G DIWG ENLE+SF+
Sbjct: 1176 KWKSVPRSQQSSDPTA--PIASPTMAGGLFAMKRTTFYELGTYDLGMDIWGAENLEMSFR 1233

Query: 125  FNWHAIPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
              W      E           RK H     P   P    G   +  +   +L       +
Sbjct: 1234 I-WQCGARLEIMPCSRVGHVFRKHH-----PYSFPGGGSGHVFLRNSL--RLA------E 1279

Query: 174  IWGGENLEL--SFKG------DFGDVTSRKELRRNLGCKSFKWYL--------------- 210
            +W  E  E   S KG      D GD++ R++LR +L CK FKWYL               
Sbjct: 1280 VWMDEYAEFFKSRKGSAARKIDIGDISERQKLREDLHCKPFKWYLDNVYPELRVPDPNPV 1339

Query: 211  -EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL 263
             E      G C+DSA K   +   V LY CH  GGNQ W +S +GE+  ++AC+
Sbjct: 1340 GEGQVQSGGFCLDSAGK--SVGHAVALYRCHGLGGNQLWTLSHNGELAHEDACV 1391


>gi|410978730|ref|XP_003995741.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12,
           partial [Felis catus]
          Length = 469

 Score =  159 bits (403), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 112/306 (36%), Positives = 154/306 (50%), Gaps = 56/306 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +    S VV P+I  I  +TFE       L ++ +  IGGFDW L 
Sbjct: 120 CECHEGWLEPLLERIHEEESAVVCPVIDVIDWNTFEY------LGNAGEPQIGGFDWRLV 173

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH +PERER R ++  + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 174 FTWHVVPERERTRMRSPIDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 233

Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S +KA    L       ++W  E  E
Sbjct: 234 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRNKA----LANSVRAAEVWMDEFKE 282

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSGM-------- 219
           L +  +       FGDVT RK+LR  L CK F+W+LE       V  D  G         
Sbjct: 283 LYYHRNPHARLEPFGDVTERKQLRAKLQCKDFRWFLENVYPELHVPEDRPGFFGMLQNKG 342

Query: 220 ----CIDSACKPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR----RDEACLDY-A 266
               C D    P + ++ VG    LY CH  G NQF+  +   EIR    + EAC+   A
Sbjct: 343 LRDYCFDY--NPPNENQIVGHQVLLYHCHGMGQNQFFEYTSRNEIRYNTHQPEACIAVDA 400

Query: 267 GGDVIL 272
           G D+++
Sbjct: 401 GMDILI 406


>gi|405966385|gb|EKC31678.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
          Length = 1019

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 82/175 (46%), Positives = 107/175 (61%), Gaps = 35/175 (20%)

Query: 147 TPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELR-------- 198
           +PTMAGGLFSI + +F +LGTY  G DIWGGENLELSF+    +V  +  +R        
Sbjct: 794 SPTMAGGLFSISREYFTELGTYHLGMDIWGGENLELSFRRTGVNVVKKNSIRLAKVWMDE 853

Query: 199 -RNL--------GCKSFKWYL-----------------EVSNDWSGMCIDSACKPTDMHK 232
            +N          C +F W++                 E+ +    MCIDSA    + HK
Sbjct: 854 YKNYYYERFNYDLCHNFDWFVKNVYPDLFVPGEAIASGEILSKAKPMCIDSAVDNRNYHK 913

Query: 233 PVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD-VILYPCHGSKGNQYFEY 286
           PV ++PCH QGGNQFWM+SK+GEIRRD+ CLDY+GG+ VI+YPCHG KGNQ ++Y
Sbjct: 914 PVNMWPCHNQGGNQFWMLSKNGEIRRDDGCLDYSGGESVIVYPCHGQKGNQEWQY 968


>gi|158299131|ref|XP_319236.4| AGAP010078-PA [Anopheles gambiae str. PEST]
 gi|157014221|gb|EAA14535.4| AGAP010078-PA [Anopheles gambiae str. PEST]
          Length = 504

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 112/323 (34%), Positives = 150/323 (46%), Gaps = 61/323 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A + + VV P+I  I  DTF+       L        GGFDWNL 
Sbjct: 160 CECNVNWLEPLLARVAEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWNLV 212

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   ERK R ++   P+ TP +AGGLF IDKA+FE+LGTYD+  DIWGGENLE+S
Sbjct: 213 FKWEYLSNAERKARQRDPTAPIRTPMIAGGLFVIDKAYFERLGTYDTQMDIWGGENLEIS 272

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P    G        F K        
Sbjct: 273 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGGSG------NIFAK--NTRRAA 319

Query: 173 DIWGGE-------NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGM------ 219
           ++W  E        + L+    FGD+  R +LR+ L CK F+WYLE      G+      
Sbjct: 320 EVWMDEYKKYYYAAVPLATNIPFGDIDDRLQLRKELQCKPFRWYLEHVYPQLGIPERRNN 379

Query: 220 --------CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG---- 267
                   C+DS          VGLY CH  GGNQ W++++ GE++  + CL        
Sbjct: 380 GSIRQGVYCLDSLGNVAG--AVVGLYSCHGNGGNQNWILNRKGELKHHDLCLTLVKFTIS 437

Query: 268 ---GDVILYPCHGSKGNQYFEYD 287
                V++  C  S+  Q+   D
Sbjct: 438 ARYNSVLMKYCDDSENQQWHLKD 460


>gi|338721407|ref|XP_001494570.3| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 4 [Equus caballus]
          Length = 703

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 104/285 (36%), Positives = 146/285 (51%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ ++++ + VV P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 355 CECNSGWLEPLLERISKDETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 408

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R K+  +P+ +PTMAGGLF++ K +FE LGTYD+G ++WGGENLELSF
Sbjct: 409 FQWHSVPKHERDRRKSRIDPISSPTMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSF 468

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 469 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 517

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
             +       K  +GD++ RK LR+ L CKSF WYL+       V  D   W G    M 
Sbjct: 518 HFYNRNPPARKEAYGDISERKLLRKRLKCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSMG 577

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  +   EIR
Sbjct: 578 IPSEC--LDYNAPDNNPTGANLSLFGCHGQGGNQFFEYTSKKEIR 620


>gi|350584684|ref|XP_003481802.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
           1 [Sus scrofa]
 gi|350596113|ref|XP_003360781.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Sus scrofa]
          Length = 582

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 103/285 (36%), Positives = 146/285 (51%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A + + +V P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 234 CECNTGWLEPLLERIAEDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 287

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R K+  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 288 FQWHSVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 347

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 348 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 396

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
             +       K  +GD++ RK LR  LGCKSF WYL+       V  D   W G    + 
Sbjct: 397 HFYNRNPPARKEAYGDISERKLLRERLGCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSIG 456

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 457 ISSEC--LDYNSPENNPTGANLSLFGCHGQGGNQFFEYTSNREIR 499


>gi|66507571|ref|XP_394527.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Apis mellifera]
 gi|380015445|ref|XP_003691712.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Apis florea]
          Length = 571

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 110/311 (35%), Positives = 155/311 (49%), Gaps = 47/311 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A + + VV P+I  I  DTF+       L        GGFDW+L 
Sbjct: 229 CECNADWLEPLLERVAEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWSLV 281

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  + + ER+ R K+  + + TP +AGGLF I+KA+FEKLG YD+  D+WGGENLE+S
Sbjct: 282 FKWEYLSQTERQARQKDPTQAIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEIS 341

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D  +
Sbjct: 342 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYSFPGGSGNVFARNTRRAAEVWMDD--Y 394

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------------SNDW 216
             +    + L+    +G++  R EL+R L CK F WYL+                 S   
Sbjct: 395 KQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNVYPELVIPTSEGGPGGSLKQ 454

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD---YAGGDVILY 273
              C+DS     D +  VGLYPCH  GGNQ W ++K G IR    CL    YA G  +L 
Sbjct: 455 GSACLDSMGHLLDGN--VGLYPCHDTGGNQEWGLTKDGLIRHHGLCLTLPVYAKGTTLLM 512

Query: 274 P-CHGSKGNQY 283
             C GS+  ++
Sbjct: 513 QICDGSENQKW 523


>gi|449683613|ref|XP_002154358.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Hydra magnipapillata]
          Length = 641

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 113/322 (35%), Positives = 153/322 (47%), Gaps = 64/322 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRF---PPGRLTSSYKFFIGGFDW 60
           CE    W +PLL  +A  SS+VV P+I  I  DT +      P  R         GGF W
Sbjct: 282 CETTPGWAEPLLARIAEKSSNVVVPIIEVINADTLQYAAAANPDQR---------GGFSW 332

Query: 61  NLQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 120
           +L + W  IP  E+   K+  + + TPTMAGGLF+ID+ +F  +GTYD   DIWGGENLE
Sbjct: 333 DLFYKWKPIPLDEQHLRKSPIDVIRTPTMAGGLFAIDRKYFYDMGTYDEEMDIWGGENLE 392

Query: 121 LSFKF-----NWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDI 174
           +SF+          IP  R     +    P   P        ++K   + L       ++
Sbjct: 393 MSFRIWMCGGRIDIIPCSRVGHIFRKFTSPYKFPD------GVEKTLSKNLNRL---AEV 443

Query: 175 WGGENLELSFK-------GDFGDVTSRKELRRNLGCKSFKWYL----------------- 210
           W  E  EL ++        D+GD++ R  LR  L CKSFKWY+                 
Sbjct: 444 WLDEYKELYYQKRPQSKGKDYGDISQRLALRNKLNCKSFKWYIENIYPDVQLPDLYPPAR 503

Query: 211 -EVSNDWSGMCIDSACKPTDMH----KPVGLYPCHKQGGNQFWMMSKHGEIRRDEA-CLD 264
            E+ N  S  C+DS     DM     K +G++PCH QGGNQ ++ S+ GEI  DE  CLD
Sbjct: 504 GEIKNPASSYCLDSM---GDMKGNNVKKLGIFPCHGQGGNQNFVFSRKGEIVFDEEYCLD 560

Query: 265 YA----GGDVILYPCHGSKGNQ 282
            +    G  + +  CH   GNQ
Sbjct: 561 VSSSKPGVLIDIMKCHNFGGNQ 582


>gi|449493914|ref|XP_004175359.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 12 [Taeniopygia
           guttata]
          Length = 594

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 110/323 (34%), Positives = 155/323 (47%), Gaps = 61/323 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A   + VV P+I  I  +TFE       L ++ +  IGGFD  L 
Sbjct: 242 CECHEGWLEPLLARIAEEETAVVCPVIDVIDWNTFEY------LGNAGEPQIGGFDXRLV 295

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH+ PERE+KR K+  + + +PTMAGGLFS+ K +F+ LG+YD+G ++WGGENLE SF
Sbjct: 296 FTWHSTPEREQKRRKSKTDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSF 355

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAF--FEKLGTYDSGFDIWGGENLE 181
           +  W              +  +   +  G +F     +   + L       ++W  E  +
Sbjct: 356 RI-WQC----------GGSLEIHPCSHVGHVFPKQAPYSRAKALANSVRAAEVWMDEYKQ 404

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSG--------- 218
           L +  +       +GDVT R+ LR  L CK FKW+LE       V  D  G         
Sbjct: 405 LYYHRNPHARLEPYGDVTERRLLREKLKCKDFKWFLENVYPELHVPEDRPGFFGMLKNRG 464

Query: 219 ---MCIDSACKPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIRRDE------ACLDY 265
               C D    PT+ H+  G    LYPCH  G NQF+  + H EIR +       A +D 
Sbjct: 465 MENFCFDY--NPTNEHQITGQRVILYPCHGMGQNQFFEYTSHNEIRYNTRQPEVCAAVDS 522

Query: 266 AGGDVILYPC----HGSKGNQYF 284
               + +Y C    H    NQ F
Sbjct: 523 GTDYLTMYLCQENAHSVPENQKF 545


>gi|350584686|ref|XP_003481803.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
           2 [Sus scrofa]
          Length = 578

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 103/285 (36%), Positives = 146/285 (51%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A + + +V P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 230 CECNTGWLEPLLERIAEDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R K+  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
             +       K  +GD++ RK LR  LGCKSF WYL+       V  D   W G    + 
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLGCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSIG 452

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 453 ISSEC--LDYNSPENNPTGANLSLFGCHGQGGNQFFEYTSNREIR 495


>gi|359320847|ref|XP_532008.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Canis
           lupus familiaris]
          Length = 578

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 112/304 (36%), Positives = 154/304 (50%), Gaps = 52/304 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +    S VV P+I  I  +TFE       L +  +  IGGFDW L 
Sbjct: 229 CECHEGWLEPLLQRIHEEESAVVCPVIDVIDWNTFEY------LGNPREPQIGGFDWRLV 282

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH +PERER R ++  + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 283 FTWHVVPERERMRMRSPIDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 342

Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S +KA    L       ++W  +  E
Sbjct: 343 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRNKA----LANSVRAAEVWMDDFKE 391

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMCIDSA 224
           L +  +       FGDVT RK+LR  L CK F+W+LE       V  D   + GM  +  
Sbjct: 392 LYYHRNPHARLEPFGDVTERKQLRAKLQCKDFRWFLENVYPELHVPEDRPGFFGMLQNKG 451

Query: 225 CK-------PTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR----RDEACLDY-AGG 268
            K       P + ++ VG    LY CH  G NQF+  +   EIR    + EAC+   AG 
Sbjct: 452 LKDYCFDYNPPNENQVVGYQVLLYICHGMGQNQFFEYTSQNEIRYNTHQPEACIAVDAGT 511

Query: 269 DVIL 272
           DV++
Sbjct: 512 DVLV 515


>gi|90078941|dbj|BAE89150.1| unnamed protein product [Macaca fascicularis]
          Length = 311

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 100/262 (38%), Positives = 136/262 (51%), Gaps = 39/262 (14%)

Query: 56  GGFDWNLQFNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIW 114
           GGF+W L F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIW
Sbjct: 9   GGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIW 68

Query: 115 GGENLELSFKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSG 171
           GGENLE+SF+  W      E     H        TP T  GG   I      +L      
Sbjct: 69  GGENLEISFRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA----- 122

Query: 172 FDIWGGENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-------------- 210
            ++W  E     +       K D+GD++SR  LR  L CK F WYL              
Sbjct: 123 -EVWMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYF 181

Query: 211 ---EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA- 266
              E+ N  +  C+D+  +  +  + VG++ CH  GGNQ +  + + EIR D+ CLD + 
Sbjct: 182 SLGEIRNVETNQCLDNMARKEN--EKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 239

Query: 267 -GGDVILYPCHGSKGNQYFEYD 287
             G V +  CH  KGNQ +EYD
Sbjct: 240 LNGPVTMLKCHHLKGNQLWEYD 261



 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 42/73 (57%), Positives = 58/73 (79%), Gaps = 3/73 (4%)

Query: 114 WGGENLELSFKFNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           +GG N +L+F+  W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G 
Sbjct: 8   YGGFNWKLNFR--WYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGM 65

Query: 173 DIWGGENLELSFK 185
           DIWGGENLE+SF+
Sbjct: 66  DIWGGENLEISFR 78


>gi|291389706|ref|XP_002711427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Oryctolagus cuniculus]
          Length = 579

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 103/285 (36%), Positives = 145/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + R+ + VV P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 231 CECNSGWLEPLLERIERDETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 284

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R K+  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 285 FQWHSVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 344

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  +  E
Sbjct: 345 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDDYKE 393

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
             +       K D+GD++ RK LR  L CKSF WYL+       V  D   W G      
Sbjct: 394 HFYNRNPPARKEDYGDISERKLLRERLKCKSFDWYLKNVFSSLHVPEDRPGWHGAIRSKG 453

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 454 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 496


>gi|431909863|gb|ELK12965.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Pteropus alecto]
          Length = 543

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 108/303 (35%), Positives = 148/303 (48%), Gaps = 51/303 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +    S VV P+I  I  +TFE       L +S +  IGGFDW L 
Sbjct: 193 CECHEGWLEPLLQRIHEEESAVVCPVIDVIDWNTFEY------LGNSGEPHIGGFDWRLV 246

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH +P RER R ++  + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 247 FTWHVVPTRERMRMRSPIDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 306

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS--GFDIWGGENLE 181
           +  W                 +   +  G +F     +  K    +S    ++W  E  E
Sbjct: 307 RI-WQC----------GGTLEIHPCSHVGHVFPKQAPYSRKKALANSVRAAEVWMDEFKE 355

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMCIDSA 224
           L +  +       FGDVT R++LR  L CK FKW+LE       V  D   + GM  +  
Sbjct: 356 LYYHRNPHARLEPFGDVTERRQLRAKLQCKDFKWFLETVYPELHVPEDRPGFFGMLQNRG 415

Query: 225 CK-------PTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR----RDEACLDYAGGD 269
            K       P + H   G    LY CH  G NQF+  +   EIR    + EAC+    G 
Sbjct: 416 LKDYCFDYNPPNEHDITGHQVLLYLCHGMGQNQFFEYTSQREIRYNTHQPEACIAVEAGT 475

Query: 270 VIL 272
            IL
Sbjct: 476 DIL 478


>gi|340712798|ref|XP_003394942.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Bombus terrestris]
          Length = 571

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 109/311 (35%), Positives = 156/311 (50%), Gaps = 47/311 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A + + VV P+I  I  DTF+       L        GGFDW+L 
Sbjct: 229 CECNADWLEPLLERVAEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWSLV 281

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  + + ER+ R K+  + + TP +AGGLF I+KA+FEKLG YD+  D+WGGENLE+S
Sbjct: 282 FKWEYLSQTERQARQKDPTQAIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEIS 341

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D  +
Sbjct: 342 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYSFPGGSGNVFARNTRRAAEVWMDD--Y 394

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------------SNDW 216
             +    + L+    +G++  R EL+R L CK F WYL+                 S   
Sbjct: 395 KQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNVYPELVIPTSEGGPGGSLKQ 454

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD---YAGGDVILY 273
              C+DS     D +  VGLYPCH  GGNQ W ++K G I+  + CL    YA G  +L 
Sbjct: 455 GTACLDSMGHLLDGN--VGLYPCHDTGGNQEWGLTKDGLIKHHDLCLTLPMYAKGTTLLM 512

Query: 274 P-CHGSKGNQY 283
             C GS+  ++
Sbjct: 513 QICDGSENQKW 523


>gi|350409232|ref|XP_003488663.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Bombus impatiens]
          Length = 571

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 109/311 (35%), Positives = 156/311 (50%), Gaps = 47/311 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A + + VV P+I  I  DTF+       L        GGFDW+L 
Sbjct: 229 CECNADWLEPLLERVAEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWSLV 281

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  + + ER+ R K+  + + TP +AGGLF I+KA+FEKLG YD+  D+WGGENLE+S
Sbjct: 282 FKWEYLSQTERQARQKDPTQAIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEIS 341

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D  +
Sbjct: 342 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYSFPGGSGNVFARNTRRAAEVWMDD--Y 394

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------------SNDW 216
             +    + L+    +G++  R EL+R L CK F WYL+                 S   
Sbjct: 395 KQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNVYPELVIPTSEGGPGGSLKQ 454

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD---YAGGDVILY 273
              C+DS     D +  VGLYPCH  GGNQ W ++K G I+  + CL    YA G  +L 
Sbjct: 455 GTACLDSMGHLLDGN--VGLYPCHDTGGNQEWGLTKDGLIKHHDLCLTLPVYAKGTTLLM 512

Query: 274 P-CHGSKGNQY 283
             C GS+  ++
Sbjct: 513 QICDGSENQKW 523


>gi|332020473|gb|EGI60888.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Acromyrmex
           echinatior]
          Length = 442

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 109/311 (35%), Positives = 156/311 (50%), Gaps = 47/311 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A + + VV P+I  I  DTF+       L        GGFDW+L 
Sbjct: 100 CECNADWLEPLLERVAEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWSLV 152

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  + + ER+ R K+  + + TP +AGGLF I+KA+FEKLG YD+  D+WGGENLE+S
Sbjct: 153 FKWEYLSQTERQARQKDPTQAIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEIS 212

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D  +
Sbjct: 213 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYSFPGGSGNVFARNTRRAAEVWMDD--Y 265

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------------SNDW 216
             +    + L+    +G++  R EL+R L CK F WYL+                 S   
Sbjct: 266 KQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNVYPELVIPTSEGGPGGSLKQ 325

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD---YAGGDVILY 273
              C+DS     D +  VGLYPCH  GGNQ W ++K G I+  + CL    YA G  +L 
Sbjct: 326 GTACLDSMGHLLDGN--VGLYPCHDTGGNQEWGLTKDGLIKHHDLCLTLPVYAKGTTLLM 383

Query: 274 P-CHGSKGNQY 283
             C GS+  ++
Sbjct: 384 QICDGSENQKW 394


>gi|326670821|ref|XP_003199296.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Danio rerio]
          Length = 435

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 104/301 (34%), Positives = 146/301 (48%), Gaps = 43/301 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A NSS VVSP I  I  +TFE   P        +   G FDW L 
Sbjct: 84  CECFHGWLEPLLARIAENSSAVVSPDITTIDLNTFEFMKPSPYGQHHNR---GNFDWGLS 140

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P+ ER+R K+   P+ TPT AGGLFSI + +F  +G+YD   +IWGGEN+E+SF
Sbjct: 141 FGWETLPDHERRRRKDETYPIKTPTFAGGLFSISRDYFYHIGSYDEEMEIWGGENIEMSF 200

Query: 124 KF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           +          IP        R +  H     P  T  +A     + + + +    Y   
Sbjct: 201 RVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTQVIARNQVRLAEVWMDD---YKEI 254

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
           F     +  +++ +  FGDV+ R +LR  L CKSF WYL+                  + 
Sbjct: 255 FYRRNQQAAQIAKEHSFGDVSRRVDLRERLQCKSFSWYLKNVYPEVFMPDLNPLQFGAIR 314

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDV 270
           N     C+D   +  +  KP+ +YPCH  GGNQ++  S H EIR +   E CLD   G +
Sbjct: 315 NMGKEACLDVG-ESNEGGKPLIMYPCHGMGGNQYFEYSTHHEIRHNIQKELCLDGTDGAM 373

Query: 271 I 271
           +
Sbjct: 374 V 374


>gi|449275388|gb|EMC84260.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Columba livia]
          Length = 632

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 109/312 (34%), Positives = 154/312 (49%), Gaps = 40/312 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N   VVSP IA+I  +TFE   P       +    G FDW+L 
Sbjct: 279 CECFYGWLEPLLARIAENPVAVVSPDIASIDLNTFEFTKPS---PYGHGHNRGNFDWSLS 335

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E KR K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 336 FGWESLPKHENKRRKDETYPIRTPTFAGGLFSISKDYFEHIGSYDEEMEIWGGENIEMSF 395

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +      + + + ++   Y   F 
Sbjct: 396 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVITRNQVRLAEVWMDE---YKEIFY 451

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               E  ++  +  FGD++ R +LR+ L CK+F WYL                   + N 
Sbjct: 452 RRNTEAAKIVKQKTFGDISKRLDLRQRLQCKNFTWYLSNVYPEAYVPDLNPLFSGYLKNT 511

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
            + MC+D   +     KP+ +Y CH  GGNQ++  S H EIR +   E CL  + G V L
Sbjct: 512 GNRMCLDVG-ENNHGGKPLIMYSCHGLGGNQYFEYSAHHEIRHNIQKELCLHASKGPVQL 570

Query: 273 YPCHGSKGNQYF 284
             CH  KG + F
Sbjct: 571 RECH-YKGQKTF 581


>gi|395824312|ref|XP_003785413.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
           [Otolemur garnettii]
          Length = 508

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 110/304 (36%), Positives = 148/304 (48%), Gaps = 57/304 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +    S VV P+I  I  +TFE       L +S +  IGGFDW L 
Sbjct: 158 CECHEGWLEPLLQRIHEEESAVVCPVIDVIDWNTFEY------LGNSGEPQIGGFDWRLV 211

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH +PERER+R K+  + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 212 FTWHTVPERERQRMKSPIDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 271

Query: 124 KFNWHAIPERERK--RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S +KA    L       ++W  +  E
Sbjct: 272 RI-WQCGGSLETHPCSHVGHVFPKQAP------YSRNKA----LANSVRAAEVWMDDYKE 320

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPV 234
           L +  +       FGDVT R++LR  L CK FKW+LE                 ++H P 
Sbjct: 321 LYYHRNPRARLEPFGDVTERRQLREKLQCKDFKWFLETVF-------------PELHVP- 366

Query: 235 GLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD--------VILYPCHGSKGNQYFEY 286
                 +     F M+   G     + C DY   D        VILY CHG   NQ+FEY
Sbjct: 367 ------EDRPGFFGMLQNKG---LKKYCFDYNPPDENQVAGRQVILYLCHGLGQNQFFEY 417

Query: 287 DYKY 290
             +Y
Sbjct: 418 TSQY 421


>gi|395820104|ref|XP_003783415.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
           [Otolemur garnettii]
          Length = 582

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 103/285 (36%), Positives = 145/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + R+ + VV P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 234 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 287

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R K+  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 288 FQWHSVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 347

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 348 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 396

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
             +       K  +GD++ RK LR+ L CKSF WYL+       V  D   W G      
Sbjct: 397 HFYNRNPPARKETYGDISERKLLRQRLRCKSFDWYLKTVFPNLHVPEDRPGWHGAIRSSG 456

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 457 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 499


>gi|410910794|ref|XP_003968875.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Takifugu rubripes]
          Length = 583

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 104/300 (34%), Positives = 152/300 (50%), Gaps = 55/300 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    W++PLL+ +  NSS +V P+I  I  +TFE          + +  IGGFDW L 
Sbjct: 236 CECVPGWIEPLLERIGENSSTIVCPVIDTIDWNTFEF------YMQTEEPMIGGFDWRLT 289

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++PERERKR K+  +P+ +PTMAGGLF+++K FFE LGTYD G ++WGGENLELSF
Sbjct: 290 FQWHSVPERERKRRKSPVDPIRSPTMAGGLFAVNKNFFEYLGTYDMGMEVWGGENLELSF 349

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK---LGTYDSGFDIWGGENL 180
           +  W              +  +   +  G +F   KA + +   L       ++W     
Sbjct: 350 RV-WQC----------GGSLEIHPCSHVGHVFP-KKAPYARPNFLQNTVRAAEVWMDSYK 397

Query: 181 ELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSG-------M 219
           +  +       K  +GD++ R  LR  L C+SF WYL+       +  D +G       +
Sbjct: 398 QHFYNRNPPARKETYGDISGRLLLRDKLKCQSFNWYLKNIYPDLHIPEDRAGWHGAVRHL 457

Query: 220 CIDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGG 268
            I+S C   D + P        + L+ CH QGGNQ++  +   EIR +   E C +   G
Sbjct: 458 GINSEC--LDYNAPEHSVTGAHLSLFGCHGQGGNQYFEYTSQKEIRFNTVTELCAEVVEG 515


>gi|383847543|ref|XP_003699412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Megachile rotundata]
          Length = 571

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 108/311 (34%), Positives = 156/311 (50%), Gaps = 47/311 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A + + VV P+I  I  DTF+          +     GGFDW+L 
Sbjct: 229 CECNADWLEPLLERVAEDPTRVVCPVIDVISMDTFQY-------IGASADLRGGFDWSLV 281

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  + + ER  R K+  + + TP +AGGLF I+KA+FEKLG YD+  D+WGGENLE+S
Sbjct: 282 FKWEYLSQSERLARQKDPTQAIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEIS 341

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D  +
Sbjct: 342 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYSFPGGSGNVFARNTRRAAEVWMDD--Y 394

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSG------- 218
             +    + L+    +G++  R EL+R L CK F WYL+       +     G       
Sbjct: 395 KQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNVYPELVIPTSEGGPGGSLKQ 454

Query: 219 --MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD---YAGGDVILY 273
              C+DS     D +  VGLYPCH  GGNQ W ++K G I+  + CL    YA G  +L 
Sbjct: 455 GPACLDSMGHLLDGN--VGLYPCHDTGGNQEWGLTKDGLIKHHDLCLTLPVYAKGTTLLM 512

Query: 274 P-CHGSKGNQY 283
             C GS+  ++
Sbjct: 513 QICDGSENQKW 523


>gi|198474621|ref|XP_001356764.2| GA16973 [Drosophila pseudoobscura pseudoobscura]
 gi|198138471|gb|EAL33829.2| GA16973 [Drosophila pseudoobscura pseudoobscura]
          Length = 639

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 109/311 (35%), Positives = 148/311 (47%), Gaps = 46/311 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E  ++WL+PLL+ +  + S VV P+I  I  D F+       L        GGFDWNL 
Sbjct: 299 VECNEKWLEPLLERVREDPSRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 351

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   ER  RH +    + TP +AGGLF IDKA+F KLG YD   D+WGGENLE+S
Sbjct: 352 FKWEYLSPAERSVRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 411

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D   
Sbjct: 412 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 466

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW--------------S 217
             +    + L+    FG++  R  L+  L CK FKWYLE V  D               S
Sbjct: 467 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEVGQFRQDS 524

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
             C+D+     D    VG++PCH  GGNQ W  SK GEI+ D+ CL       G  V+L 
Sbjct: 525 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAYSKRGEIKHDDLCLTLVQFARGSQVVLK 582

Query: 274 PCHGSKGNQYF 284
            C  S+  ++ 
Sbjct: 583 ACDESENQRWI 593


>gi|195148230|ref|XP_002015077.1| GL19517 [Drosophila persimilis]
 gi|194107030|gb|EDW29073.1| GL19517 [Drosophila persimilis]
          Length = 638

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 109/311 (35%), Positives = 148/311 (47%), Gaps = 46/311 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E  ++WL+PLL+ +  + S VV P+I  I  D F+       L        GGFDWNL 
Sbjct: 298 VECNEKWLEPLLERVREDPSRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 350

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   ER  RH +    + TP +AGGLF IDKA+F KLG YD   D+WGGENLE+S
Sbjct: 351 FKWEYLSPAERSVRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 410

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D   
Sbjct: 411 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 465

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW--------------S 217
             +    + L+    FG++  R  L+  L CK FKWYLE V  D               S
Sbjct: 466 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEVGQFRQDS 523

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
             C+D+     D    VG++PCH  GGNQ W  SK GEI+ D+ CL       G  V+L 
Sbjct: 524 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAYSKRGEIKHDDLCLTLVQFARGSQVVLK 581

Query: 274 PCHGSKGNQYF 284
            C  S+  ++ 
Sbjct: 582 ACDESENQRWI 592


>gi|326922813|ref|XP_003207639.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 3-like [Meleagris
           gallopavo]
          Length = 632

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 109/314 (34%), Positives = 154/314 (49%), Gaps = 40/314 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A NS  VVSP IA+I  +TFE   P       +    G FDW+L 
Sbjct: 279 CECFYGWLEPLLARIAENSVAVVSPDIASIDLNTFEFSKPS---PYGHNHNRGNFDWSLS 335

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E KR K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 336 FGWESLPKYENKRRKDETYPIRTPTFAGGLFSISKKYFEHIGSYDDEMEIWGGENIEMSF 395

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +      + + + ++   Y   F 
Sbjct: 396 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVITRNQVRLAEVWMDE---YKEIFY 451

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               E  ++  +  FGD++ R  LR+ L CK+F WYL                   + N 
Sbjct: 452 RRNTEAAKIVKQKTFGDISKRLNLRQRLQCKNFTWYLNNVYPEVYVPDLNPLFSGYLKNI 511

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
            + MC+D   +     KP+ +Y CH  GGNQ++  S H EIR +   E CL  + G V L
Sbjct: 512 GNHMCLDVG-ENNHGGKPLIMYSCHGLGGNQYFEYSAHHEIRHNIQKELCLHASKGPVQL 570

Query: 273 YPCHGSKGNQYFEY 286
             C   KG + F +
Sbjct: 571 REC-SYKGQKIFAF 583


>gi|194761420|ref|XP_001962927.1| GF15680 [Drosophila ananassae]
 gi|190616624|gb|EDV32148.1| GF15680 [Drosophila ananassae]
          Length = 630

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 108/311 (34%), Positives = 147/311 (47%), Gaps = 46/311 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E  +RWL+PLL+ +  + + VV P+I  I  D F+       L        GGFDWNL 
Sbjct: 290 VECNERWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 342

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   ER  RH +    + TP +AGGLF IDKA+F KLG YD   D+WGGENLE+S
Sbjct: 343 FKWEYLSPSERAMRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 402

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D   
Sbjct: 403 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 457

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDWSG------------- 218
             +    + L+    FG++  R  L+  L CK FKWYLE V  D                
Sbjct: 458 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEIGQFRQDG 515

Query: 219 -MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
             C+D+     D    VG++PCH  GGNQ W  SK GEI+ D+ CL       G  V+L 
Sbjct: 516 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAFSKRGEIKHDDLCLTLVQFARGSQVVLK 573

Query: 274 PCHGSKGNQYF 284
            C  S+  ++ 
Sbjct: 574 ACDESENQRWI 584


>gi|432882423|ref|XP_004074023.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Oryzias latipes]
          Length = 584

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 107/295 (36%), Positives = 145/295 (49%), Gaps = 37/295 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    W++PLL+ +A N+S +V P+I  I  ++FE     G      +  IGGFDW L 
Sbjct: 236 CECVPGWIEPLLERIAENASTIVCPVIDTIDWNSFEFYMQTG------EPMIGGFDWRLT 289

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++PE ERKR K+  +P  +PTMAGGLF++ K +FE LGTYD G ++WGGENLELSF
Sbjct: 290 FQWHSVPESERKRRKSRTDPFRSPTMAGGLFAVSKVYFEYLGTYDMGMEVWGGENLELSF 349

Query: 124 KFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENL 180
           +  W      E     H     P   P      L +  +A    + +Y   F        
Sbjct: 350 RV-WQCGGSLEIHPCSHVGHVFPKKAPYARPNFLQNTVRAAEVWMDSYKHHF----YNRN 404

Query: 181 ELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSGMC----IDSACK 226
             + K ++GD+T R +LR  L C SF WYL          E    W G      I S C 
Sbjct: 405 PPAKKENYGDITERLQLRERLKCNSFDWYLKNIYPELHVPEDREGWHGAIRSSGIQSECL 464

Query: 227 PTDM--HKPVG----LYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
             +   H P G    L+ CH QGGNQ++  +   EIR +   E C +   G   +
Sbjct: 465 DYNAPDHNPTGAHLSLFGCHGQGGNQYFEYTSQKEIRFNSVTELCAEVLDGQTSI 519



 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 75/248 (30%), Positives = 101/248 (40%), Gaps = 99/248 (39%)

Query: 109 SGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY 168
           +G  + GG +  L+F+  WH++PE ERKR K+  +P  +PTMAGGLF++ K +FE LGTY
Sbjct: 276 TGEPMIGGFDWRLTFQ--WHSVPESERKRRKSRTDPFRSPTMAGGLFAVSKVYFEYLGTY 333

Query: 169 DSG------------FDIWG-GENLEL--------------------------------- 182
           D G            F +W  G +LE+                                 
Sbjct: 334 DMGMEVWGGENLELSFRVWQCGGSLEIHPCSHVGHVFPKKAPYARPNFLQNTVRAAEVWM 393

Query: 183 -------------SFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTD 229
                        + K ++GD+T R +LR  L C SF WYL+                  
Sbjct: 394 DSYKHHFYNRNPPAKKENYGDITERLQLRERLKCNSFDWYLK------------------ 435

Query: 230 MHKPVGLYP-CHKQGGNQFWMMSKHGEIRR---DEACLDY-------AGGDVILYPCHGS 278
                 +YP  H     + W    HG IR       CLDY        G  + L+ CHG 
Sbjct: 436 -----NIYPELHVPEDREGW----HGAIRSSGIQSECLDYNAPDHNPTGAHLSLFGCHGQ 486

Query: 279 KGNQYFEY 286
            GNQYFEY
Sbjct: 487 GGNQYFEY 494


>gi|118093614|ref|XP_422023.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Gallus
           gallus]
          Length = 632

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 106/314 (33%), Positives = 153/314 (48%), Gaps = 40/314 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A NS  VVSP IA+I  +TFE   P       +    G FDW+L 
Sbjct: 279 CECFYGWLEPLLARIAENSVAVVSPDIASIDLNTFEFSKPS---PYGHNHNRGNFDWSLS 335

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E KR K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 336 FGWESLPKYENKRRKDETYPIRTPTFAGGLFSISKEYFEHIGSYDDEMEIWGGENIEMSF 395

Query: 124 KFNWH----------AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W           ++     +       P  T  +      + + + ++   Y   F 
Sbjct: 396 RV-WQCGGLLEIMPCSVVGHVFRSKSPHTFPKGTQVITRNQVRLAEVWMDE---YKEIFY 451

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               E  ++  +  FGD++ R +LR+ L CK+F WYL                   + N 
Sbjct: 452 RRNTEAAKIVKQKTFGDISKRLDLRQRLQCKNFTWYLNNVYPEVYVPDLNPLFSGYLKNV 511

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
            + MC+D   +     KP+ +Y CH  GGNQ++  S H EIR +   E CL  + G V L
Sbjct: 512 GNHMCLDVG-ENNHGGKPLIMYSCHGLGGNQYFEYSAHHEIRHNIQKELCLHASKGPVQL 570

Query: 273 YPCHGSKGNQYFEY 286
             C   KG + F +
Sbjct: 571 REC-SYKGQKIFAF 583


>gi|417411769|gb|JAA52311.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Desmodus rotundus]
          Length = 582

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 145/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ ++ + + ++ P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 234 CECNSGWLEPLLERISEDETVIICPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 287

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R K+  +P+ +PTMAGGLF++ K +FE LGTYD+G ++WGGENLELSF
Sbjct: 288 FQWHSVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSF 347

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 348 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 396

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
             +       K  +GD++ RK LR  L CKSF WYL+       V  D   W G    M 
Sbjct: 397 HFYNRNPPARKEAYGDISERKLLRERLKCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSMG 456

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 457 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 499


>gi|351709330|gb|EHB12249.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Heterocephalus
           glaber]
          Length = 582

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 146/285 (51%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + R+ + VV P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 234 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 287

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P++ER R  +  +P+ +PTMAGGLF++ K +FE LGTYD+G ++WGGENLELSF
Sbjct: 288 FQWHSVPKQERDRRTSRIDPIRSPTMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSF 347

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  +  E
Sbjct: 348 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDDYKE 396

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
             +       K  +GD++ RK LR+ L CKSF WYL+       V  D   W G    + 
Sbjct: 397 HFYNRNPPARKEAYGDISERKLLRKQLRCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSLG 456

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 457 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 499


>gi|281346614|gb|EFB22198.1| hypothetical protein PANDA_015357 [Ailuropoda melanoleuca]
          Length = 491

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 146/285 (51%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ ++++ + VV P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 169 CECNSGWLEPLLERISKDETTVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 222

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R K+  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 223 FQWHSVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 282

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 283 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 331

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
             +       K  +GD++ RK LR  L C+SF WYL+       V  D   W G    M 
Sbjct: 332 HFYNRNPPARKEAYGDISERKLLRERLKCQSFDWYLKNVFSNLHVPEDRPGWHGAVRSMG 391

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 392 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 434


>gi|322785490|gb|EFZ12159.1| hypothetical protein SINV_06585 [Solenopsis invicta]
          Length = 466

 Score =  157 bits (396), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 112/328 (34%), Positives = 163/328 (49%), Gaps = 57/328 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFE-----LRFPPGRLTSSYKFFI--- 55
           CE    WL+PLL+ +A + + VV P+I  I  DTF+     LR    R++ + +  I   
Sbjct: 100 CECNADWLEPLLERVAEDPTRVVCPVIDVISMDTFQYIEICLRCNLKRISETRRDKILFR 159

Query: 56  ---------GGFDWNLQFNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLG 105
                    GGFDW+L F W  + + ER+ R K+  + + TP +AGGLF I+KA+FEKLG
Sbjct: 160 FLGASADLRGGFDWSLVFKWEYLSQGERQARQKDPTQSIRTPMIAGGLFVINKAYFEKLG 219

Query: 106 TYDSGFDIWGGENLELSFKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLF 155
            YD+  D+WGGENLE+SF+      +   IP        RKRH     P   P  +G +F
Sbjct: 220 KYDTQMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRH-----PYSFPGGSGNVF 274

Query: 156 SIDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV--- 212
           + +     ++   D  +  +    + L+    +G++  R EL+R L CK F WYL+    
Sbjct: 275 ARNTRRAAEVWMDD--YKQFYYNAVPLARNIPYGNIQDRMELKRRLHCKPFSWYLKNVYP 332

Query: 213 -------------SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD 259
                        S      C+DS     D +  VGLYPCH  GGNQ W ++K G I+  
Sbjct: 333 ELVIPTSEGGPGGSLKQGTACLDSMGHLLDGN--VGLYPCHDTGGNQEWGLTKDGLIKHH 390

Query: 260 EACLD---YAGGDVILYP-CHGSKGNQY 283
           + CL    YA G  +L   C GS+  ++
Sbjct: 391 DLCLTLPVYAKGTTLLMQICDGSENQKW 418


>gi|328699727|ref|XP_001944936.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Acyrthosiphon pisum]
          Length = 581

 Score =  157 bits (396), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 107/317 (33%), Positives = 156/317 (49%), Gaps = 47/317 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E    WL+PLLD +A + + VV P+I  I  D F+       L        GGFDWNL 
Sbjct: 235 VECNVNWLEPLLDRVAEDPTRVVCPIIDVINMDNFQYIGASSELR-------GGFDWNLV 287

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  + +  R +R K+   P+ TP +AGGLF +DK +F KLGTYD   +IWGGENLE+S
Sbjct: 288 FKWEYLSKEVRAQRQKDPTLPIRTPMIAGGLFVMDKDYFVKLGTYDKEMNIWGGENLEIS 347

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++  +   +
Sbjct: 348 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFAHNTRRAAEV--WMDQY 400

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE----------VSNDWSG---- 218
             +    + LS    FG++  R  L++NLGCK FKWYL+            +++ G    
Sbjct: 401 KRYYYNAVPLSRIVPFGNIADRLALKKNLGCKPFKWYLDNVYPELKLPATVDEFVGSIRQ 460

Query: 219 --MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGD-VIL 272
             MC+D+      + K  G++PCH  GGNQ W  +  G I+ D  CL   DY+    +I+
Sbjct: 461 GYMCLDTL--ENQVGKTAGIFPCHDYGGNQEWTFTIGGSIKHDMMCLSPTDYSSMSLIIM 518

Query: 273 YPCHGSKGNQYFEYDYK 289
            PC  +     F+ + K
Sbjct: 519 KPCDSTTDEWKFDENTK 535


>gi|307214182|gb|EFN89299.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Harpegnathos
           saltator]
          Length = 442

 Score =  157 bits (396), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 108/311 (34%), Positives = 156/311 (50%), Gaps = 47/311 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    W++PLL+ +A + + VV P+I  I  DTF+       L        GGFDW+L 
Sbjct: 100 CECNADWIEPLLERVAEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWSLV 152

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  + + ER+ R K+  + + TP +AGGLF I+KA+FEKLG YD+  D+WGGENLE+S
Sbjct: 153 FKWEYLSQIERQARQKDPTQAIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEIS 212

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D  +
Sbjct: 213 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYSFPGGSGNVFARNTRRAAEVWMDD--Y 265

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------------SNDW 216
             +    + L+    +G++  R EL+R L CK F WYL+                 S   
Sbjct: 266 KQFYYNAVPLARNIPYGNIQDRMELKRRLHCKPFSWYLKNVYPELVIPTSEGGPGGSLKQ 325

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD---YAGGDVILY 273
              C+DS     D +  VGLYPCH  GGNQ W ++K G I+  + CL    YA G  +L 
Sbjct: 326 GTACLDSMGHLLDGN--VGLYPCHDTGGNQEWGLTKDGLIKHHDLCLTLPVYAKGTTLLM 383

Query: 274 P-CHGSKGNQY 283
             C GS+  ++
Sbjct: 384 QICDGSENQKW 394


>gi|301780762|ref|XP_002925798.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Ailuropoda melanoleuca]
          Length = 578

 Score =  157 bits (396), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 146/285 (51%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ ++++ + VV P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 230 CECNSGWLEPLLERISKDETTVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R K+  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
             +       K  +GD++ RK LR  L C+SF WYL+       V  D   W G    M 
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLKCQSFDWYLKNVFSNLHVPEDRPGWHGAVRSMG 452

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495


>gi|395519661|ref|XP_003763961.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Sarcophilus harrisii]
          Length = 631

 Score =  157 bits (396), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 108/317 (34%), Positives = 152/317 (47%), Gaps = 40/317 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P       Y    G FDW+L 
Sbjct: 278 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPS---PYGYNHNRGNFDWSLS 334

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++PE ER+R K+   P+ TPT AGGLFSI K +FE +GTYD    IWGGEN+E+SF
Sbjct: 335 FGWESLPEHERQRRKDETYPIRTPTFAGGLFSISKEYFEYIGTYDEEMKIWGGENIEMSF 394

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   +   F 
Sbjct: 395 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---FKEIFY 450

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               E  ++  +  FGD++ R E++  L CK+F WYL                   + N 
Sbjct: 451 RRNTEAAKIVKQKTFGDISKRLEIKHRLQCKNFTWYLNNVYPEIYVPDLNPVISGYIQNK 510

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR---RDEACLDYAGGDVIL 272
              +C+D   +     KP+ +Y CH  GGNQ++  S   EIR   + E CL    G V L
Sbjct: 511 GRHLCLDVG-ENNLGGKPLIMYTCHGLGGNQYFEYSAQHEIRHSIQQELCLHAVQGPVQL 569

Query: 273 YPCHGSKGNQYFEYDYK 289
             C   KG +    D +
Sbjct: 570 NTC-SYKGQKTLTIDVQ 585


>gi|345488662|ref|XP_003425959.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Nasonia vitripennis]
          Length = 572

 Score =  156 bits (395), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 107/311 (34%), Positives = 156/311 (50%), Gaps = 47/311 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A + S VV P+I  I  D F+          +     GGFDW+L 
Sbjct: 230 CECNADWLEPLLERVAEDPSRVVCPVIDVISMDNFQY-------IGASADLRGGFDWSLV 282

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  + + ER+ R K+  + + TP +AGGLF I+KA+FEKLG YD+  D+WGGENLE+S
Sbjct: 283 FKWEYLSQSERQARQKDPTQAIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEIS 342

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D  +
Sbjct: 343 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYSFPGGSGNVFARNTRRAAEVWMDD--Y 395

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSG------- 218
             +    + L+    +G++  R EL+R L CK F WYL+       +     G       
Sbjct: 396 KQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKHVYPELIIPTSEGGPGGSLKQ 455

Query: 219 --MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD---YA-GGDVIL 272
              C+DS     D +  VGLYPCH  GGNQ W M+  G I+  + CL    YA G  +++
Sbjct: 456 GTACLDSMGHLLDGN--VGLYPCHDTGGNQEWGMTNDGLIKHHDLCLTLPVYAKGTSLLM 513

Query: 273 YPCHGSKGNQY 283
             C GS+  ++
Sbjct: 514 QICDGSENQKW 524


>gi|195386226|ref|XP_002051805.1| GJ10330 [Drosophila virilis]
 gi|194148262|gb|EDW63960.1| GJ10330 [Drosophila virilis]
          Length = 631

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 108/314 (34%), Positives = 148/314 (47%), Gaps = 46/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E  ++WL+PLL+ +  + + VV P+I  I  D F+       L        GGFDWNL 
Sbjct: 291 VECNEQWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 343

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   ER  RH +    + TP +AGGLF IDKA+F KLG YD   D+WGGENLE+S
Sbjct: 344 FKWEYLSPTERAARHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 403

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D   
Sbjct: 404 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 458

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDWSG------------- 218
             +    + L+    FG++  R  L+  L CK FKWYLE V  D                
Sbjct: 459 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPEPQEVGQFRQDT 516

Query: 219 -MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
             C+D+     D    VGL+PCH  GGNQ W  SK GEI+ D+ CL       G  V+L 
Sbjct: 517 TECLDTMGHVID--GTVGLFPCHNTGGNQEWAYSKRGEIKHDDLCLTLVQFARGSQVVLK 574

Query: 274 PCHGSKGNQYFEYD 287
            C  ++  ++   D
Sbjct: 575 SCDDTENQRWIMRD 588


>gi|7657112|ref|NP_056552.1| polypeptide N-acetylgalactosaminyltransferase 4 [Mus musculus]
 gi|51315802|sp|O08832.1|GALT4_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
           AltName: Full=Polypeptide GalNAc transferase 4;
           Short=GalNAc-T4; Short=pp-GaNTase 4; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 4;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 4
 gi|2121220|gb|AAB58301.1| polypeptide GalNAc transferase-T4 [Mus musculus]
 gi|26329157|dbj|BAC28317.1| unnamed protein product [Mus musculus]
 gi|34786032|gb|AAH57882.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 [Mus musculus]
 gi|74140684|dbj|BAE31844.1| unnamed protein product [Mus musculus]
 gi|74195122|dbj|BAE28303.1| unnamed protein product [Mus musculus]
 gi|148689697|gb|EDL21644.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 [Mus musculus]
          Length = 578

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 145/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ ++R+ + +V P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 230 CECNTGWLEPLLERISRDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R  +  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKHERDRRTSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
             +       K  +GD++ RK LR  L CKSF WYL+       V  D   W G    M 
Sbjct: 393 HFYNRNPPARKEAYGDLSERKLLRERLKCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSMG 452

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 453 ISSEC--LDYNAPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495


>gi|77736615|ref|NP_001020224.2| polypeptide N-acetylgalactosaminyltransferase 4 [Rattus norvegicus]
 gi|76780269|gb|AAI05819.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Rattus
           norvegicus]
 gi|149067086|gb|EDM16819.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Rattus
           norvegicus]
          Length = 578

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 101/285 (35%), Positives = 145/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ ++R+ + +V P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 230 CECNTGWLEPLLERISRDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R  +  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKHERDRRTSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  +  E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDDYKE 392

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
             +       K  +GD++ RK LR  L CKSF WYL+       V  D   W G    M 
Sbjct: 393 HFYNRNPPARKETYGDISERKLLRERLQCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSMG 452

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 453 ISSEC--LDYNAPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495


>gi|410965222|ref|XP_003989149.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Felis
           catus]
          Length = 582

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 101/285 (35%), Positives = 145/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + ++ + +V P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 234 CECNSGWLEPLLERIGKDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 287

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R K+  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 288 FQWHSVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 347

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  +  E
Sbjct: 348 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDQYKE 396

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
             +       K  +GD++ RK LR  L CKSF WYL+       V  D   W G    M 
Sbjct: 397 HFYNRNPPARKEAYGDISERKLLRERLKCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSMG 456

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 457 ISSEC--LDYNSPDSNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 499


>gi|68392893|ref|XP_688194.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Danio
           rerio]
          Length = 578

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 111/293 (37%), Positives = 143/293 (48%), Gaps = 54/293 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +    S VV P+I  I  +TF+    PG         IGGFDW L 
Sbjct: 226 CECHEGWLEPLLQRIKEEPSAVVCPVIDVIDWNTFQYLGNPGEPQ------IGGFDWRLV 279

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH+IPE E+KR   A + V +PTMAGGLF+++K +F  LGTYD+G ++WGGENLE SF
Sbjct: 280 FTWHSIPEHEQKRRSAATDVVRSPTMAGGLFAVNKKYFLYLGTYDTGMEVWGGENLEFSF 339

Query: 124 KFNWHAIPERERK--RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S +KA    L       ++W  +  E
Sbjct: 340 RI-WQCGGSLEIHPCSHVGHVFPKKAP------YSRNKA----LANSVRAAEVWMDDFKE 388

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYL-------EVSNDWSGM-------- 219
           + +          +GDVT R++LR  L CK F+W+L       +V  D  GM        
Sbjct: 389 VYYHRSPHARLEAYGDVTDRRKLRMRLRCKDFRWFLDNIYPDIQVPEDKPGMFGMLKNKG 448

Query: 220 ----CIDSACKPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIR---RDEA 261
               C D    P D HK  G    LYPCH  G NQF+  S   EIR   RD A
Sbjct: 449 MTNYCFDY--NPPDEHKIAGHRVILYPCHGMGQNQFFEYSTLQEIRYNTRDPA 499



 Score = 87.8 bits (216), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 75/229 (32%), Positives = 100/229 (43%), Gaps = 90/229 (39%)

Query: 125 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG------------F 172
           F WH+IPE E+KR   A + V +PTMAGGLF+++K +F  LGTYD+G            F
Sbjct: 280 FTWHSIPEHEQKRRSAATDVVRSPTMAGGLFAVNKKYFLYLGTYDTGMEVWGGENLEFSF 339

Query: 173 DIWG-GENLEL----------------------------------SFKG----------- 186
            IW  G +LE+                                   FK            
Sbjct: 340 RIWQCGGSLEIHPCSHVGHVFPKKAPYSRNKALANSVRAAEVWMDDFKEVYYHRSPHARL 399

Query: 187 -DFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGN 245
             +GDVT R++LR  L CK F+W+L+  N +  + +     P D  KP G++   K  G 
Sbjct: 400 EAYGDVTDRRKLRMRLRCKDFRWFLD--NIYPDIQV-----PED--KP-GMFGMLKNKG- 448

Query: 246 QFWMMSKHGEIRRDEACLDY--------AGGDVILYPCHGSKGNQYFEY 286
               M+ +        C DY        AG  VILYPCHG   NQ+FEY
Sbjct: 449 ----MTNY--------CFDYNPPDEHKIAGHRVILYPCHGMGQNQFFEY 485


>gi|195032291|ref|XP_001988471.1| GH11183 [Drosophila grimshawi]
 gi|193904471|gb|EDW03338.1| GH11183 [Drosophila grimshawi]
          Length = 640

 Score =  155 bits (393), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 107/310 (34%), Positives = 147/310 (47%), Gaps = 46/310 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E  ++WL+PLL+ +  + + VV P+I  I  D F+       L        GGFDWNL 
Sbjct: 300 VECNEQWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 352

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   ER  RH +    + TP +AGGLF IDKA+F KLG YD   D+WGGENLE+S
Sbjct: 353 FKWEYLSASERTARHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 412

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D   
Sbjct: 413 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 467

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDWSG------------- 218
             +    + L+    FG++  R  L+  L CK FKWYLE V  D                
Sbjct: 468 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEVGQFRQDM 525

Query: 219 -MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
             C+D+     D    VGL+PCH  GGNQ W  SK GEI+ D+ CL       G  V+L 
Sbjct: 526 TECLDTMGHLVD--GTVGLFPCHNTGGNQEWAYSKRGEIKHDDLCLTLVQFSRGSQVVLK 583

Query: 274 PCHGSKGNQY 283
            C  ++  ++
Sbjct: 584 SCDDTENQRW 593


>gi|224054950|ref|XP_002197786.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Taeniopygia guttata]
          Length = 631

 Score =  155 bits (393), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 109/312 (34%), Positives = 154/312 (49%), Gaps = 40/312 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N   VVSP IA+I  +TFE   P     S  +   G FDW+L 
Sbjct: 278 CECFYGWLEPLLARIAENPVAVVSPDIASIDLNTFEFSKPSPYGHSHNR---GNFDWSLS 334

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E KR K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 335 FGWESLPKHENKRRKDETYPIRTPTFAGGLFSISKDYFEYIGSYDEEMEIWGGENIEMSF 394

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +      + + + ++   Y   F 
Sbjct: 395 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVITRNQVRLAEVWMDE---YKEIFY 450

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               E  ++  +  FGD++ R +LR+ L CK+F WYL                   + N 
Sbjct: 451 RRNTEAAKIVKQKTFGDISKRIDLRQRLQCKNFTWYLSNVYPEAYVPDLNPLFSGYLKNI 510

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
            + MC+D   +     KP+ +Y CH  GGNQ++  S H EIR +   E CL  + G V L
Sbjct: 511 GNRMCLDVG-ENNHGGKPLIMYSCHGLGGNQYFEYSAHHEIRHNIQKELCLHASKGPVQL 569

Query: 273 YPCHGSKGNQYF 284
             C   KG + F
Sbjct: 570 REC-TYKGQKTF 580


>gi|195342262|ref|XP_002037720.1| GM18147 [Drosophila sechellia]
 gi|194132570|gb|EDW54138.1| GM18147 [Drosophila sechellia]
          Length = 606

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 107/311 (34%), Positives = 147/311 (47%), Gaps = 46/311 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E  + WL+PLL+ +  + + VV P+I  I  D F+       L        GGFDWNL 
Sbjct: 266 VECNEMWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 318

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   ER  RH +    + TP +AGGLF IDKA+F KLG YD   D+WGGENLE+S
Sbjct: 319 FKWEYLSPSERAMRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 378

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D   
Sbjct: 379 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 433

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW--------------S 217
             +    + L+    FG++  R  L+  L CK FKWYLE V  D               S
Sbjct: 434 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEVGQFRQDS 491

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
             C+D+     D    VG++PCH  GGNQ W  +K GEI+ D+ CL       G  V+L 
Sbjct: 492 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCLTLVTFARGSQVVLK 549

Query: 274 PCHGSKGNQYF 284
            C  S+  ++ 
Sbjct: 550 ACDDSENQRWI 560


>gi|195471053|ref|XP_002087820.1| GE14879 [Drosophila yakuba]
 gi|194173921|gb|EDW87532.1| GE14879 [Drosophila yakuba]
          Length = 634

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 107/311 (34%), Positives = 147/311 (47%), Gaps = 46/311 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E  + WL+PLL+ +  + + VV P+I  I  D F+       L        GGFDWNL 
Sbjct: 294 VECNEMWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 346

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   ER  RH +    + TP +AGGLF IDKA+F KLG YD   D+WGGENLE+S
Sbjct: 347 FKWEYLSPSERAMRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 406

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D   
Sbjct: 407 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 461

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW--------------S 217
             +    + L+    FG++  R  L+  L CK FKWYLE V  D               S
Sbjct: 462 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEVGQFRQDS 519

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
             C+D+     D    VG++PCH  GGNQ W  +K GEI+ D+ CL       G  V+L 
Sbjct: 520 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCLTLVTFARGSQVVLK 577

Query: 274 PCHGSKGNQYF 284
            C  S+  ++ 
Sbjct: 578 ACDDSENQRWI 588


>gi|62484229|ref|NP_608773.2| polypeptide GalNAc transferase 2, isoform A [Drosophila
           melanogaster]
 gi|320594323|ref|NP_995625.2| polypeptide GalNAc transferase 2, isoform B [Drosophila
           melanogaster]
 gi|195576320|ref|XP_002078024.1| GD22759 [Drosophila simulans]
 gi|51315875|sp|Q6WV19.2|GALT2_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
           Short=pp-GaNTase 2; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 2; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 2
 gi|61678274|gb|AAF51113.3| polypeptide GalNAc transferase 2, isoform A [Drosophila
           melanogaster]
 gi|194190033|gb|EDX03609.1| GD22759 [Drosophila simulans]
 gi|318068299|gb|AAS64620.2| polypeptide GalNAc transferase 2, isoform B [Drosophila
           melanogaster]
          Length = 633

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 107/311 (34%), Positives = 147/311 (47%), Gaps = 46/311 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E  + WL+PLL+ +  + + VV P+I  I  D F+       L        GGFDWNL 
Sbjct: 293 VECNEMWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 345

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   ER  RH +    + TP +AGGLF IDKA+F KLG YD   D+WGGENLE+S
Sbjct: 346 FKWEYLSPSERAMRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 405

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D   
Sbjct: 406 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 460

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW--------------S 217
             +    + L+    FG++  R  L+  L CK FKWYLE V  D               S
Sbjct: 461 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEVGQFRQDS 518

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
             C+D+     D    VG++PCH  GGNQ W  +K GEI+ D+ CL       G  V+L 
Sbjct: 519 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCLTLVTFARGSQVVLK 576

Query: 274 PCHGSKGNQYF 284
            C  S+  ++ 
Sbjct: 577 ACDDSENQRWI 587


>gi|194855488|ref|XP_001968556.1| GG24441 [Drosophila erecta]
 gi|190660423|gb|EDV57615.1| GG24441 [Drosophila erecta]
          Length = 631

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 107/311 (34%), Positives = 147/311 (47%), Gaps = 46/311 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E  + WL+PLL+ +  + + VV P+I  I  D F+       L        GGFDWNL 
Sbjct: 291 VECNEMWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 343

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   ER  RH +    + TP +AGGLF IDKA+F KLG YD   D+WGGENLE+S
Sbjct: 344 FKWEYLSPSERAMRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 403

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D   
Sbjct: 404 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 458

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW--------------S 217
             +    + L+    FG++  R  L+  L CK FKWYLE V  D               S
Sbjct: 459 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEVGQFRQDS 516

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
             C+D+     D    VG++PCH  GGNQ W  +K GEI+ D+ CL       G  V+L 
Sbjct: 517 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCLTLVTFARGSQVVLK 574

Query: 274 PCHGSKGNQYF 284
            C  S+  ++ 
Sbjct: 575 ACDDSENQRWI 585


>gi|332839987|ref|XP_003313889.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pan
           troglodytes]
 gi|397505857|ref|XP_003823459.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pan
           paniscus]
 gi|410207422|gb|JAA00930.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410252142|gb|JAA14038.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410252144|gb|JAA14039.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410252146|gb|JAA14040.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410252148|gb|JAA14041.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410252150|gb|JAA14042.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410289758|gb|JAA23479.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410355493|gb|JAA44350.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410355495|gb|JAA44351.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
          Length = 578

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + R+ + VV P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 230 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P++ER R  +  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
             +       K  +GD++ RK LR  L CKSF WYL+       V  D   W G      
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495


>gi|33589464|gb|AAQ22499.1| RE02655p [Drosophila melanogaster]
          Length = 633

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 107/311 (34%), Positives = 147/311 (47%), Gaps = 46/311 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E  + WL+PLL+ +  + + VV P+I  I  D F+       L        GGFDWNL 
Sbjct: 293 VECNEMWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 345

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   ER  RH +    + TP +AGGLF IDKA+F KLG YD   D+WGGENLE+S
Sbjct: 346 FKWEYLSPSERAMRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 405

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D   
Sbjct: 406 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 460

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW--------------S 217
             +    + L+    FG++  R  L+  L CK FKWYLE V  D               S
Sbjct: 461 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEVGQFRQDS 518

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
             C+D+     D    VG++PCH  GGNQ W  +K GEI+ D+ CL       G  V+L 
Sbjct: 519 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCLTLVTFARGSQVVLK 576

Query: 274 PCHGSKGNQYF 284
            C  S+  ++ 
Sbjct: 577 ACDDSENQRWI 587


>gi|34042922|gb|AAQ56700.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           [Drosophila melanogaster]
          Length = 615

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 107/311 (34%), Positives = 147/311 (47%), Gaps = 46/311 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E  + WL+PLL+ +  + + VV P+I  I  D F+       L        GGFDWNL 
Sbjct: 275 VECNEMWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 327

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   ER  RH +    + TP +AGGLF IDKA+F KLG YD   D+WGGENLE+S
Sbjct: 328 FKWEYLSPSERAMRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 387

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D   
Sbjct: 388 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 442

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW--------------S 217
             +    + L+    FG++  R  L+  L CK FKWYLE V  D               S
Sbjct: 443 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDPQEVGQFRQDS 500

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
             C+D+     D    VG++PCH  GGNQ W  +K GEI+ D+ CL       G  V+L 
Sbjct: 501 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCLTLVTFARGSQVVLK 558

Query: 274 PCHGSKGNQYF 284
            C  S+  ++ 
Sbjct: 559 ACDDSENQRWI 569


>gi|348585735|ref|XP_003478626.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Cavia porcellus]
          Length = 568

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 108/322 (33%), Positives = 153/322 (47%), Gaps = 50/322 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323

Query: 123 FKFNWHAIPERERKRHKNA------AEPVWTPTMAGGLFSID------------KAFFEK 164
           F+  W      E     +       A P   P   G + + +            K FF  
Sbjct: 324 FRI-WQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYI 382

Query: 165 LGTYDSGF-----DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------- 211
           +      F     D++ GE         +      KE R  L  +++  Y+         
Sbjct: 383 ISPAKCNFLTRDLDVFMGETDSDIVGTKY--TYKLKEERFVLSHRNYSPYIPSQGNMAKE 440

Query: 212 ----VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA- 266
               + N  +  C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD + 
Sbjct: 441 KQSMIRNVETNQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSR 498

Query: 267 -GGDVILYPCHGSKGNQYFEYD 287
             G VI+  CH  +GNQ +EYD
Sbjct: 499 LNGPVIMLKCHHMRGNQLWEYD 520


>gi|327262105|ref|XP_003215866.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Anolis carolinensis]
          Length = 575

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 108/320 (33%), Positives = 152/320 (47%), Gaps = 61/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  +RWL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 231 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 283

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 284 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 343

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RK+H     P   P  +G +F+ +              
Sbjct: 344 FRVWQCGGSLEIIPCSRVGHVFRKQH-----PYTFPGGSGTVFARNTR---------RAA 389

Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV------------- 212
           ++W  E     +          +G++ SR EL++ L CK FKWYLE              
Sbjct: 390 EVWMDEYKNFYYAAVPSARNVPYGNIQSRLELKKRLNCKPFKWYLENVYPELRVPDHQDI 449

Query: 213 ---SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYA 266
              +      C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A
Sbjct: 450 AFGALQQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRA 507

Query: 267 GGDVI-LYPCHGSKGNQYFE 285
            G +I L  C  + G Q +E
Sbjct: 508 PGSLIKLQGCRENDGRQKWE 527


>gi|291290949|ref|NP_001167507.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Xenopus
           laevis]
 gi|83405263|gb|AAI10707.1| Unknown (protein for MGC:130697) [Xenopus laevis]
          Length = 622

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 108/310 (34%), Positives = 153/310 (49%), Gaps = 49/310 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  ++FE   P   G+  S      G FDW+
Sbjct: 270 CECFHGWLEPLLSRIAEDYTAVVSPDITTIDLNSFEFAKPVQYGKTHSR-----GNFDWS 324

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W AIPE E+ R KN   P+ TPT AGGLFSI KA+FE +G+YD   +IWGGEN+E+
Sbjct: 325 LTFGWEAIPEAEKLRRKNETYPIKTPTFAGGLFSISKAYFEHIGSYDEDMEIWGGENVEM 384

Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H   + P  T  ++     + + + +    Y 
Sbjct: 385 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---SFPKGTQVISRNQVRLAEVWMDD---YK 438

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             +     +  ++  +  FGDV+ R +L+ +L CK+F WYLE                  
Sbjct: 439 IIYYRRNDQAAKMVKEKSFGDVSKRLKLKADLHCKNFTWYLENIYPELFVPDRDPTYSGA 498

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CL--DYA 266
           V N+ +  C+D   +     KP+ +YPCH  GGNQ++  S H E+R + A   CL   Y 
Sbjct: 499 VKNEGAQKCLDVG-ENNHGGKPLIMYPCHGMGGNQYFEYSTHKELRHNIAKQLCLRSKYG 557

Query: 267 GGDVILYPCH 276
            G V L  C 
Sbjct: 558 PGQVELGECQ 567


>gi|189053556|dbj|BAG35722.1| unnamed protein product [Homo sapiens]
          Length = 578

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + R+ + VV P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 230 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P++ER R  +  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
             +       K  +GD++ RK LR  L CKSF WYL+       V  D   W G      
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495


>gi|195435185|ref|XP_002065582.1| GK14594 [Drosophila willistoni]
 gi|194161667|gb|EDW76568.1| GK14594 [Drosophila willistoni]
          Length = 635

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 107/310 (34%), Positives = 147/310 (47%), Gaps = 46/310 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E  ++WL+PLL+ +  + + VV P+I  I  D F+       L        GGFDWNL 
Sbjct: 295 VECNEQWLEPLLERVREDPTRVVCPVIDVISMDNFQYIGASADLR-------GGFDWNLI 347

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   ER  RH +    + TP +AGGLF IDKA+F KLG YD   D+WGGENLE+S
Sbjct: 348 FKWEYLSPAERSVRHNDPTTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEIS 407

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D   
Sbjct: 408 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKQ 462

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDWSG------------- 218
             +    + L+    FG++  R  L+  L CK FKWYLE V  D                
Sbjct: 463 HYYNA--VPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPEPQEIGQFRQDG 520

Query: 219 -MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGDVILY 273
             C+D+     D    VG++PCH  GGNQ W  SK GEI+ D+ CL       G  V+L 
Sbjct: 521 TECLDTMGHLID--GTVGIFPCHNTGGNQEWAYSKRGEIKHDDLCLTLVQFSRGSQVVLK 578

Query: 274 PCHGSKGNQY 283
            C  S+  ++
Sbjct: 579 SCDDSENQRW 588


>gi|47226381|emb|CAG09349.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 631

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 105/306 (34%), Positives = 150/306 (49%), Gaps = 43/306 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A+N + VVSP I  I  +TFE   P     +  +   G FDW+L 
Sbjct: 255 CECFNGWLEPLLARIAKNRTAVVSPDITTIDLNTFEFMKPSPYGQNHNR---GNFDWSLA 311

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E+KR K+   P+ TPT AGGLFSI K +F ++G+YD   +IWGGEN+E+SF
Sbjct: 312 FGWESLPDHEKKRRKDETYPIKTPTFAGGLFSISKDYFYQIGSYDKHMEIWGGENIEMSF 371

Query: 124 KF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           +          IP        R +  H   + P  T  ++     + + + +    Y   
Sbjct: 372 RVWQCGGQLEIIPCSIVGHVFRTKSPH---SFPKGTQVISRNQVRLAEVWMDD---YKEI 425

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
           F     +  +L+    FGD++ R + R  L CKSF WYL+                  V 
Sbjct: 426 FYRRNQQAAQLARDKAFGDISERLDFRVRLRCKSFSWYLKNIYPEAFIPDLNPLSFGSVK 485

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDV 270
           N     C+D A +  +  K + +YPCH  GGNQ++  S H EIR +   E CL  A G V
Sbjct: 486 NVGKDSCLD-AGENNEGGKKLIMYPCHGLGGNQYFEYSTHHEIRHNIQKELCLHGAAGAV 544

Query: 271 ILYPCH 276
            L  C 
Sbjct: 545 RLEECQ 550


>gi|432098371|gb|ELK28171.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Myotis davidii]
          Length = 633

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 107/303 (35%), Positives = 154/303 (50%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDMNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W A+P+ ER+R K+   P+ TPT AGGLFSI K +FE +GTYD   +IWGGEN+E+SF
Sbjct: 337 FGWEALPDHERQRRKDETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSG----- 218
               +  ++  +  FGD++ R E++  L CK+F WYL          +++   SG     
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSF 512

Query: 219 ---MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ LY CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 513 GQPLCLDVG-ENNQGSKPLILYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAPGPVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KTC 574


>gi|148356242|ref|NP_001038243.2| polypeptide N-acetylgalactosaminyltransferase 4 precursor [Danio
           rerio]
 gi|60416047|gb|AAH90692.1| WD repeat domain 51B, like [Danio rerio]
 gi|182890540|gb|AAI64662.1| Wdr51bl protein [Danio rerio]
          Length = 582

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 106/291 (36%), Positives = 147/291 (50%), Gaps = 37/291 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    W++PLL+ +A N + ++ P+I  I  +TFE          + +  +GGFDW L 
Sbjct: 235 CECVPGWIEPLLERIAENETTIICPVIDTIDWNTFEFYM------QTEEPMVGGFDWRLT 288

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WHA+PE +RK  K+  +P+ +PTMAGGLF++ KA+FE LGTYD G ++WGGENLELSF
Sbjct: 289 FQWHAVPEIDRKIRKSRIDPIRSPTMAGGLFAVSKAYFEYLGTYDMGMEVWGGENLELSF 348

Query: 124 KFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENL 180
           +  W      E     H     P   P   +  L +  +A    + TY   F        
Sbjct: 349 RV-WQCGGSLEIHPCSHVGHVFPKKAPYARSNFLQNTVRAAEVWMDTYKQHF----YNRN 403

Query: 181 ELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC----IDSACK 226
             + K  +GD++ R  LR  L CKSF+WYL+       V  D   W G      I S C 
Sbjct: 404 PPARKESYGDISERIVLRNRLQCKSFEWYLQNVYPGLHVPEDRPGWHGAVRSAGIHSECL 463

Query: 227 PTDM--HKPVG----LYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGG 268
             +   H P G    L+ CH QGGNQ++  +   EIR +   E C +   G
Sbjct: 464 DYNAPDHNPTGAHLSLFGCHGQGGNQYFEYTSQREIRFNSVTELCAEVQDG 514



 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 71/233 (30%), Positives = 96/233 (41%), Gaps = 95/233 (40%)

Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG----------- 171
             F WHA+PE +RK  K+  +P+ +PTMAGGLF++ KA+FE LGTYD G           
Sbjct: 287 LTFQWHAVPEIDRKIRKSRIDPIRSPTMAGGLFAVSKAYFEYLGTYDMGMEVWGGENLEL 346

Query: 172 -FDIWG-GENLEL----------------------------------------------S 183
            F +W  G +LE+                                              +
Sbjct: 347 SFRVWQCGGSLEIHPCSHVGHVFPKKAPYARSNFLQNTVRAAEVWMDTYKQHFYNRNPPA 406

Query: 184 FKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
            K  +GD++ R  LR  L CKSF+WYL+  N + G+ +                P  + G
Sbjct: 407 RKESYGDISERIVLRNRLQCKSFEWYLQ--NVYPGLHV----------------PEDRPG 448

Query: 244 GNQFWMMSKHGEIRR---DEACLDY-------AGGDVILYPCHGSKGNQYFEY 286
               W    HG +R       CLDY        G  + L+ CHG  GNQYFEY
Sbjct: 449 ----W----HGAVRSAGIHSECLDYNAPDHNPTGAHLSLFGCHGQGGNQYFEY 493


>gi|426373643|ref|XP_004053705.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Gorilla
           gorilla gorilla]
          Length = 578

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + R+ + VV P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 230 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P++ER R  +  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLRNTARAAEVWMDEYKE 392

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
             +       K  +GD++ RK LR  L CKSF WYL+       V  D   W G      
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495


>gi|348513278|ref|XP_003444169.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
           [Oreochromis niloticus]
          Length = 584

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 103/277 (37%), Positives = 140/277 (50%), Gaps = 34/277 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    W++PLL+ ++ N+S +V P+I  I  +TFE          + +  IGGFDW L 
Sbjct: 236 CECVPGWIEPLLERISENASTIVCPVIDTIDWNTFEF------YMQTDEPMIGGFDWRLT 289

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++PE ERKR K+  +P+ +PTMAGGLF++ KA+FE LGTYD G D+WGGENLELSF
Sbjct: 290 FQWHSVPEMERKRRKSRIDPIRSPTMAGGLFAVSKAYFEYLGTYDMGMDVWGGENLELSF 349

Query: 124 KFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENL 180
           +  W      E     H     P   P      L +  +A    + +Y   F        
Sbjct: 350 RV-WQCGGSLEIHPCSHVGHVFPKKAPYARPNFLQNTVRAAEVWMDSYKKHF----YNRN 404

Query: 181 ELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSGMC----IDSACK 226
             + K  +G+++ R  LR  L C SF+WYL          E    W G      I S C 
Sbjct: 405 PPARKEKYGNISERLLLREKLKCNSFEWYLKNIYPELHVPEDREGWHGAVRSSGIHSECL 464

Query: 227 PTDM--HKPVG----LYPCHKQGGNQFWMMSKHGEIR 257
             +   H P G    L+ CH QGGNQ++  +   EIR
Sbjct: 465 DYNAPEHSPTGSQLSLFGCHGQGGNQYFEYTSQKEIR 501



 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 69/234 (29%), Positives = 93/234 (39%), Gaps = 97/234 (41%)

Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG----------- 171
             F WH++PE ERKR K+  +P+ +PTMAGGLF++ KA+FE LGTYD G           
Sbjct: 288 LTFQWHSVPEMERKRRKSRIDPIRSPTMAGGLFAVSKAYFEYLGTYDMGMDVWGGENLEL 347

Query: 172 -FDIWG-GENLEL----------------------------------------------S 183
            F +W  G +LE+                                              +
Sbjct: 348 SFRVWQCGGSLEIHPCSHVGHVFPKKAPYARPNFLQNTVRAAEVWMDSYKKHFYNRNPPA 407

Query: 184 FKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYP-CHKQ 242
            K  +G+++ R  LR  L C SF+WYL+                        +YP  H  
Sbjct: 408 RKEKYGNISERLLLREKLKCNSFEWYLK-----------------------NIYPELHVP 444

Query: 243 GGNQFWMMSKHGEIRRD---EACLDY-------AGGDVILYPCHGSKGNQYFEY 286
              + W    HG +R       CLDY        G  + L+ CHG  GNQYFEY
Sbjct: 445 EDREGW----HGAVRSSGIHSECLDYNAPEHSPTGSQLSLFGCHGQGGNQYFEY 494


>gi|315221121|ref|NP_001186710.1| POC1B-GALNT4 protein isoform 1 [Homo sapiens]
          Length = 575

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + R+ + VV P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 227 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIG------EPMIGGFDWRLT 280

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P++ER R  +  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 281 FQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 340

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 341 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 389

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
             +       K  +GD++ RK LR  L CKSF WYL+       V  D   W G      
Sbjct: 390 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 449

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 450 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 492


>gi|334348070|ref|XP_001368069.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Monodelphis domestica]
          Length = 708

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 102/294 (34%), Positives = 150/294 (51%), Gaps = 53/294 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ + ++ S ++ P+I  I  +TF+     G      +  IGGFDW+L 
Sbjct: 360 CECNQGWLEPLLERIGQDESVIICPVIDTIDWNTFDFYMQEG------EPVIGGFDWHLT 413

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +PE ER+R ++  +P+ +P MAGGLF++ K +FE LGTYD+G ++WGGENLELSF
Sbjct: 414 FQWQPVPEHERRRWQSRTDPIKSPVMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSF 473

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS--GFDIWGGENLE 181
           +  W              A  +   +  G +F     +       ++    ++W  +  E
Sbjct: 474 RV-WQC----------GGALEIHPCSHVGHVFPKRAPYARPNFRQNTVRAAEVWMDDYKE 522

Query: 182 -------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
                  L+ K  +GDV+ RK LR+ L CKSF WYL+       V  D   W G      
Sbjct: 523 HFYNRNPLARKESYGDVSERKLLRKRLNCKSFDWYLKTVFPALRVPEDRPGWHGAIRSVG 582

Query: 221 IDSAC--------KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR---RDEACL 263
           I S C         PT+ H  + L+ CH QGGNQF+  +   E+R   + E CL
Sbjct: 583 ISSECLDYKTPERDPTEAH--LSLFGCHGQGGNQFFEYTLKKELRFSVQTELCL 634


>gi|348513276|ref|XP_003444168.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
           [Oreochromis niloticus]
          Length = 575

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 113/306 (36%), Positives = 151/306 (49%), Gaps = 57/306 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+P+L  +      VV P+I  I  +TF+       L  + +  IGGFDW L 
Sbjct: 223 CECHEGWLEPVLHRIKEEPKAVVCPVIDVIDWNTFQY------LGHAGEPQIGGFDWRLV 276

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH+IP+ E+KR ++  + + +PTMAGGLF++ K FF  LGTYD+G ++WGGENLE SF
Sbjct: 277 FTWHSIPDYEQKRRRSPVDVIRSPTMAGGLFAVRKDFFHYLGTYDTGMEVWGGENLEFSF 336

Query: 124 KFNWHAIPERERK--RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S  KA    L       ++W  E  E
Sbjct: 337 RI-WQCGGSLEVHPCSHVGHVFPKKAP------YSRSKA----LANSVRAAEVWLDEFKE 385

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSGM-------- 219
           + +  +       FGDVT R+ LR  LGCKSFKWYL+       V +D  GM        
Sbjct: 386 IYYHRNPHARLEAFGDVTERRMLREKLGCKSFKWYLDNIYPDIHVPHDRPGMFGMLKNRG 445

Query: 220 ----CIDSACKPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEI----RRDEACLDYAG 267
               C D    PTD +  VG    LY CH  G NQF+  S +GEI    R    C+  AG
Sbjct: 446 KTNYCFDY--NPTDENVVVGQRVILYLCHGMGQNQFFEYSVNGEICYNTREPAGCI--AG 501

Query: 268 GDVILY 273
            ++  Y
Sbjct: 502 DNISTY 507


>gi|440896822|gb|ELR48646.1| Polypeptide N-acetylgalactosaminyltransferase 4, partial [Bos
           grunniens mutus]
          Length = 566

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + ++ + V+ P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 218 CECNTGWLEPLLERIRKDETVVICPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 271

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R K+  EP  +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 272 FQWHSVPKHERDRRKSRIEPFRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 331

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 332 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 380

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
             +       K  +GD++ RK LR  L CKSF WYL+       V  D   W G    + 
Sbjct: 381 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFSTLHVPEDRPGWHGAIRSIG 440

Query: 221 IDSAC--------KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C         PT  +  + L+ CH QGGNQF+  + + EIR
Sbjct: 441 ISSECLDYNAPDNNPTSAN--LSLFGCHGQGGNQFFEYTSNKEIR 483


>gi|46877109|ref|NP_644678.2| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Mus
           musculus]
 gi|51315867|sp|Q6PB93.1|GALT2_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
           AltName: Full=Polypeptide GalNAc transferase 2;
           Short=GalNAc-T2; Short=pp-GaNTase 2; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 2;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 2; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 2
           soluble form
 gi|37590571|gb|AAH59818.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 [Mus musculus]
          Length = 570

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 148/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  +RWL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 226 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 278

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 279 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 338

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 339 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 389

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ LGCK FKWYL+                 + 
Sbjct: 390 EYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 449

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D + G +I
Sbjct: 450 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLI 507

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 508 RLQGCRENDSRQKWE 522


>gi|13650039|gb|AAK37548.1| polypeptide GalNAc transferase-T2 [Mus musculus]
          Length = 570

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 148/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  +RWL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 226 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 278

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 279 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 338

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 339 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 389

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ LGCK FKWYL+                 + 
Sbjct: 390 EYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 449

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D + G +I
Sbjct: 450 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLI 507

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 508 RLQGCRENNSKQKWE 522


>gi|332221068|ref|XP_003259680.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
           1 [Nomascus leucogenys]
          Length = 578

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 101/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + R+ + +V P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 230 CECNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P++ER R  +  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
             +       K  +GD++ RK LR  L CKSF WYL+       V  D   W G      
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIHSRG 452

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495


>gi|62148928|dbj|BAD93348.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase-4 [Rattus
           norvegicus]
          Length = 578

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 101/283 (35%), Positives = 145/283 (51%), Gaps = 46/283 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ ++R+ + +V P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 230 CECNTGWLEPLLERISRDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R  +  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKHERDRRTSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +FS    +     L       ++W  +  E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFSKRAPYARPNFLQNTAREAEVWMDDYKE 392

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
             +       K  + D++ RK LR  L CKSF WYL+       V  D   W G    M 
Sbjct: 393 HFYNRNPPARKETYDDISERKLLRERLQCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSMG 452

Query: 221 IDSACKPTDM--HKPVG----LYPCHKQGGNQFWMMSKHGEIR 257
           I S C   +   + P G    L+ CH QGGNQF+  + + EIR
Sbjct: 453 ISSECLDYNAPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495


>gi|22137798|gb|AAH36390.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Homo
           sapiens]
 gi|123981562|gb|ABM82610.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
           [synthetic construct]
 gi|123996387|gb|ABM85795.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
           [synthetic construct]
 gi|124000643|gb|ABM87830.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
           [synthetic construct]
 gi|157928222|gb|ABW03407.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
           [synthetic construct]
          Length = 578

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + R+ + VV P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 230 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIG------EPMIGGFDWRLT 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P++ER R  +  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
             +       K  +GD++ RK LR  L CKSF WYL+       V  D   W G      
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495


>gi|13938114|gb|AAH07172.1| Galnt2 protein, partial [Mus musculus]
          Length = 526

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 103/315 (32%), Positives = 148/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  +RWL+PLL+ +A + + VVSP+I  I  D F+          +     GGFDWNL 
Sbjct: 182 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 234

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 235 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 294

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 295 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 345

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ LGCK FKWYL+                 + 
Sbjct: 346 EYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 405

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D + G +I
Sbjct: 406 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLI 463

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 464 RLQGCRENDSRQKWE 478


>gi|34452725|ref|NP_003765.2| polypeptide N-acetylgalactosaminyltransferase 4 [Homo sapiens]
 gi|338817878|sp|Q8N4A0.2|GALT4_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
           AltName: Full=Polypeptide GalNAc transferase 4;
           Short=GalNAc-T4; Short=pp-GaNTase 4; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 4;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 4
 gi|119617834|gb|EAW97428.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Homo
           sapiens]
          Length = 578

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + R+ + VV P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 230 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIG------EPMIGGFDWRLT 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P++ER R  +  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
             +       K  +GD++ RK LR  L CKSF WYL+       V  D   W G      
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495


>gi|395515411|ref|XP_003761898.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
           [Sarcophilus harrisii]
          Length = 590

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 108/309 (34%), Positives = 155/309 (50%), Gaps = 53/309 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +    S VV P+I  I  +TFE       L +S    IGGFDW L 
Sbjct: 241 CECHDGWLEPLLERIHEEESAVVCPVIDVIDWNTFEY------LGNSGDPQIGGFDWRLV 294

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++PE+E+KR ++  + + +PTMAGGLF+++K +FE LG+YD+G ++WGGENLE SF
Sbjct: 295 FTWHSVPEKEQKRRRSKIDVIRSPTMAGGLFAVNKRYFEYLGSYDTGMEVWGGENLEFSF 354

Query: 124 KFNWHAIPERERK--RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S  KA    L       ++W  E  E
Sbjct: 355 RI-WQCGGSLEIHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDEFKE 403

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMCIDSA 224
           + +       K  +GD+T RKELR  L CK F+W+LE       +  D   + GM ++  
Sbjct: 404 IYYHRNMHARKEPYGDITERKELRDKLKCKDFRWFLENVYPELHIPEDRPGYFGMLVNRG 463

Query: 225 CKPT--DMHKP---------VGLYPCHKQGGNQFWMMSKHGEI----RRDEAC--LDYAG 267
                 D + P         V LY CH  G NQF+  + H E+    R+ EAC  +D   
Sbjct: 464 MADYCFDYNPPSESEITGNQVILYLCHGMGQNQFFEYTSHNELRYNTRQPEACAAVDVGT 523

Query: 268 GDVILYPCH 276
             + ++ C+
Sbjct: 524 DHLTMHLCY 532


>gi|426224267|ref|XP_004006295.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Ovis
           aries]
          Length = 582

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + ++ + V+ P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 234 CECNTGWLEPLLERIHKDETVVICPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 287

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R K+  EP  +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 288 FQWHSVPKHERDRRKSRIEPFRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 347

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 348 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 396

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
             +       K  +GD++ RK LR  L CKSF WYL+       V  D   W G    + 
Sbjct: 397 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFSTLHVPEDRPGWHGAIRSIG 456

Query: 221 IDSAC--------KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C         PT  +  + L+ CH QGGNQF+  + + EIR
Sbjct: 457 ISSECLDYNAPDNNPTSAN--LSLFGCHGQGGNQFFEYTSNKEIR 499


>gi|157074156|ref|NP_001096791.1| polypeptide N-acetylgalactosaminyltransferase 4 [Bos taurus]
 gi|154426082|gb|AAI51594.1| GALNT4 protein [Bos taurus]
 gi|296487968|tpg|DAA30081.1| TPA: polypeptide N-acetylgalactosaminyltransferase 4 [Bos taurus]
          Length = 578

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + ++ + V+ P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 230 CECNTGWLEPLLERIRKDETVVICPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R K+  EP  +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKHERDRRKSRIEPFRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSG----MC 220
             +       K  +GD++ RK LR  L CKSF WYL+       V  D   W G    + 
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFSTLHVPEDRPGWHGAIRSIG 452

Query: 221 IDSAC--------KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C         PT  +  + L+ CH QGGNQF+  + + EIR
Sbjct: 453 ISSECLDYNAPDNNPTSAN--LSLFGCHGQGGNQFFEYTSNKEIR 495


>gi|31418564|gb|AAH53063.1| Galnt2 protein [Mus musculus]
          Length = 536

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 103/315 (32%), Positives = 148/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  +RWL+PLL+ +A + + VVSP+I  I  D F+          +     GGFDWNL 
Sbjct: 192 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 244

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 245 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 304

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 305 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 355

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ LGCK FKWYL+                 + 
Sbjct: 356 EYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 415

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D + G +I
Sbjct: 416 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLI 473

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 474 RLQGCRENDSRQKWE 488


>gi|149043194|gb|EDL96726.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (predicted), isoform
           CRA_a [Rattus norvegicus]
          Length = 504

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 103/315 (32%), Positives = 148/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  +RWL+PLL+ +A + + VVSP+I  I  D F+          +     GGFDWNL 
Sbjct: 160 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 212

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 213 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 272

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 273 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 323

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ LGCK FKWYL+                 + 
Sbjct: 324 EFKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 383

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D + G +I
Sbjct: 384 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLI 441

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 442 RLQGCRENDSRQKWE 456


>gi|300797173|ref|NP_001180032.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Bos
           taurus]
 gi|296472282|tpg|DAA14397.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Bos
           taurus]
          Length = 571

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 105/315 (33%), Positives = 147/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  +RWL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 227 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLNCKPFKWYLENVYPELRVPDHQDIAFGAL 450

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 451 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 508

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523


>gi|74195843|dbj|BAE30483.1| unnamed protein product [Mus musculus]
          Length = 544

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 103/315 (32%), Positives = 148/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  +RWL+PLL+ +A + + VVSP+I  I  D F+          +     GGFDWNL 
Sbjct: 200 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 252

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 253 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 312

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 313 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 363

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ LGCK FKWYL+                 + 
Sbjct: 364 EYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 423

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D + G +I
Sbjct: 424 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLI 481

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 482 RLQGCRENDSRQKWE 496


>gi|197246167|gb|AAI68926.1| Galnt2 protein [Rattus norvegicus]
          Length = 569

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 148/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  +RWL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 225 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 277

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 278 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 337

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 338 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 388

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ LGCK FKWYL+                 + 
Sbjct: 389 EFKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 448

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D + G +I
Sbjct: 449 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLI 506

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 507 RLQGCRENDSRQKWE 521


>gi|402887191|ref|XP_003906986.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Papio
           anubis]
          Length = 578

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 101/285 (35%), Positives = 143/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + R+ + +V P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 230 CECNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R  +  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKHERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
             +       K  +GD++ RK LR  L CKSF WYL+       V  D   W G      
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSKG 452

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495


>gi|417403505|gb|JAA48553.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
           rotundus]
          Length = 633

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 105/303 (34%), Positives = 150/303 (49%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P     +  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDMNTFEFNKPSPYGINHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W A+P+ ER+R K+   P+ TPT AGGLFSI K +FE +GTYD   +IWGGEN+E+SF
Sbjct: 337 FGWEALPDHERQRRKDETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E++  L CK+F WYL                   + + 
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPEAYVPDLNPVISGYIKSV 512

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ LY CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 NAC 574


>gi|148679819|gb|EDL11766.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 [Mus musculus]
          Length = 548

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 148/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  +RWL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 204 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 256

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 257 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 316

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 317 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 367

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ LGCK FKWYL+                 + 
Sbjct: 368 EYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 427

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D + G +I
Sbjct: 428 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLI 485

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 486 RLQGCRENDSRQKWE 500


>gi|194225536|ref|XP_001494993.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Equus
           caballus]
          Length = 460

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 109/300 (36%), Positives = 144/300 (48%), Gaps = 57/300 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +    S VV P+I  I  +TFE       L +S +  IGGFDW L 
Sbjct: 110 CECHEGWLEPLLQRIHEEESAVVCPVIDVIDWNTFEY------LGNSGEPQIGGFDWRLV 163

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH +PERER R ++  + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 164 FTWHVVPERERLRMRSPTDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSF 223

Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S  KA    L       ++W     E
Sbjct: 224 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDGYKE 272

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPV 234
           L +  +       FGDVT RK+LR  L CK F+W+LE  N +            ++H P 
Sbjct: 273 LYYHRNPHARLEPFGDVTERKQLREKLRCKDFRWFLE--NVYP-----------ELHVPE 319

Query: 235 GLYPCHKQGGNQFWMMSKHGEIRRDEACLDY--------AGGDVILYPCHGSKGNQYFEY 286
               C       F M+   G     + C DY         G  V LY CHG   NQ+FEY
Sbjct: 320 DRPGC-------FGMLQNKG---LKDYCFDYNPPNENQITGHQVTLYLCHGMGQNQFFEY 369


>gi|350593559|ref|XP_003133495.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Sus
           scrofa]
          Length = 633

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 104/303 (34%), Positives = 152/303 (50%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +FE +GTYD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKDYFEYIGTYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E++R L CK+F WYL                   + + 
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKRRLQCKNFTWYLNNIYPEAYVPDLNPVISGYIKSV 512

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ LY CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSVQHEIRHNIQKELCLHAAQGVVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KTC 574


>gi|296212534|ref|XP_002752871.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
           [Callithrix jacchus]
          Length = 578

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 101/285 (35%), Positives = 142/285 (49%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + R+ + +V P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 230 CECNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R  +  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKHERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
             +       K  +GD++ RK LR  L CKSF WYL+       V  D   W G      
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLKCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  +   EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSKKEIR 495


>gi|344278311|ref|XP_003410938.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
           [Loxodonta africana]
          Length = 572

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 147/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  +RWL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 228 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 280

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK++FE+LG YD   D+WGGENLE+S
Sbjct: 281 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENLEIS 340

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 341 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 391

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 392 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 451

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D   G +I
Sbjct: 452 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRTPGSLI 509

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 510 KLQGCRENDSRQKWE 524


>gi|324520233|gb|ADY47590.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Ascaris suum]
          Length = 267

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 95/232 (40%), Positives = 125/232 (53%), Gaps = 42/232 (18%)

Query: 89  MAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTP 148
           MAGGLF+ID+ +FEKLGTYD GFDIWGGENLE+SFK  W      E          +   
Sbjct: 1   MAGGLFAIDRQYFEKLGTYDPGFDIWGGENLEISFKI-WMCGGRLE----------IVPC 49

Query: 149 TMAGGLFSIDKAFFEKLGTYDSG------FDIWGGENLELSFK------GDFGDVTSRKE 196
           +  G +F     +  + G            ++W  E  E+ ++      G++GDV+ RK 
Sbjct: 50  SHVGHVFRKKSPYKWRTGVNVLQRNNVRLAEVWLDEYKEIYYERINHKLGEYGDVSERKR 109

Query: 197 LRRNLGCKSFKWYL-----------------EVSNDWS-GMCIDSACKPTDMHKPVGLYP 238
           LR  L C SFKWYL                 E+ N  +   C+D       ++  V  YP
Sbjct: 110 LRERLKCHSFKWYLDNVFPDLFIPSKAIGKGEIRNRGNPKFCVDHEVGRNVVNDAVIPYP 169

Query: 239 CHKQGGNQFWMMSKHGEIRRDEACLDYAG-GDVILYPCHGSKGNQYFEYDYK 289
           CH  GGNQFW++SK GEIRRDE C+DY G G V+ Y CHGSKGNQ +EY+++
Sbjct: 170 CHLMGGNQFWLLSKEGEIRRDEYCIDYPGRGSVVTYECHGSKGNQLWEYNHE 221


>gi|405975887|gb|EKC40420.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
          Length = 653

 Score =  154 bits (388), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 108/308 (35%), Positives = 145/308 (47%), Gaps = 74/308 (24%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           WL+PLL  +A N + VV+P+I  I D +  L      + +   F I     N+ FNW  +
Sbjct: 343 WLEPLLARVAENHTRVVAPVIDMISDRS--LACGGNEIGNLGTFEIA----NMGFNWLTL 396

Query: 70  PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA 129
            + E+ +H   +EP  TPT+AGGLFSI++A+F K+GTYD G DIWGGENLE+SF+     
Sbjct: 397 NKTEKAKH-GQSEPWKTPTIAGGLFSINRAYFTKMGTYDHGMDIWGGENLEISFR----- 450

Query: 130 IPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYDSG------------- 171
                          VW   M GG   I         F  +  Y  G             
Sbjct: 451 ---------------VW---MCGGSLEIHPCSHVAHLFRSMSPYKWGKSFRDILRKNAVR 492

Query: 172 -FDIWGGENLELSFK------GDFGDVTSRKELRRNLGCKSFKWYL-------------- 210
             ++W  E   + ++      GD+GDV+ RK+LR  LGCKSF WYL              
Sbjct: 493 TAEVWMDEYKHIYYERLNYDLGDYGDVSERKDLRNRLGCKSFGWYLKTMLPDMKLPETAL 552

Query: 211 ---EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG 267
              EV N   GMC+D+    T         PCH QGGNQF+  + +G I RD ACL    
Sbjct: 553 YSGEVRNMEKGMCLDTM--GTTAGNKFQAIPCHHQGGNQFFRFTVNGHIERDSACLSDQD 610

Query: 268 GDVILYPC 275
           G ++   C
Sbjct: 611 GSLLYVLC 618


>gi|74203117|dbj|BAE26246.1| unnamed protein product [Mus musculus]
          Length = 618

 Score =  154 bits (388), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 103/315 (32%), Positives = 148/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  +RWL+PLL+ +A + + VVSP+I  I  D F+          +     GGFDWNL 
Sbjct: 229 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 281

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 282 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 341

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 342 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 392

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ LGCK FKWYL+                 + 
Sbjct: 393 EYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 452

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D + G +I
Sbjct: 453 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLI 510

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 511 RLQGCRENDSRQKWE 525


>gi|440891991|gb|ELR45390.1| Polypeptide N-acetylgalactosaminyltransferase 2, partial [Bos
           grunniens mutus]
          Length = 530

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 147/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  +RWL+PLL+ +A + + VVSP+I  I  D F+          +     GGFDWNL 
Sbjct: 186 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 238

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 239 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 298

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 299 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 349

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 350 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLNCKPFKWYLENVYPELRVPDHQDIAFGAL 409

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 410 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 467

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 468 KLQGCRENDSRQKWE 482


>gi|410897032|ref|XP_003962003.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Takifugu rubripes]
          Length = 624

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 106/306 (34%), Positives = 148/306 (48%), Gaps = 43/306 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N S VVSP I  I  +TFE   P     +  +   G FDW+L 
Sbjct: 274 CECFNGWLEPLLARIAENHSAVVSPDITTIDLNTFEFVKPSPYGQNHNR---GNFDWSLA 330

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +F ++G+YD   +IWGGEN+E+SF
Sbjct: 331 FGWESLPDHEKRRRKDETYPIKTPTFAGGLFSISKDYFYQIGSYDKHMEIWGGENIEMSF 390

Query: 124 KF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           +          IP        R +  H   + P  T  ++     + + + +    Y   
Sbjct: 391 RVWQCGGQLEIIPCSIVGHVFRTKSPH---SFPKGTQVISRNQVRLAEVWMDD---YKEI 444

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
           F     +  +L     FGD++ R +LR  L CKSF WYL+                  V 
Sbjct: 445 FYRRNQQAAQLVRDKAFGDISQRMDLRARLKCKSFSWYLKNIYPEAFIPDLNPLGFGSVK 504

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDV 270
           N     C+D A +  +  K V +YPCH  GGNQ++  S   EIR +   E CL  A G V
Sbjct: 505 NVGKDSCLD-AGENNEGGKRVIMYPCHGLGGNQYFEYSTRHEIRHNIQKELCLHGAAGAV 563

Query: 271 ILYPCH 276
            L  C 
Sbjct: 564 KLEECQ 569


>gi|149639508|ref|XP_001513185.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Ornithorhynchus anatinus]
          Length = 634

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 106/303 (34%), Positives = 154/303 (50%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P     +  +   G FDW+L 
Sbjct: 281 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFSKPSPYGNNHNR---GNFDWSLS 337

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++PE E++R K+   P+ TPT AGGLFSI K +FE +GTYD   +IWGGEN+E+SF
Sbjct: 338 FGWESLPEHEKQRRKDETYPIRTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSF 397

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   +   F 
Sbjct: 398 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---FKEIFY 453

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSG----- 218
               E  ++  +  FGD++ R ELR  L CK+F WYL          +++   SG     
Sbjct: 454 RRNTEAAKIVKQKAFGDLSKRLELRDRLQCKNFTWYLNTIYPEVYVPDLNPVLSGYIKSV 513

Query: 219 ---MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ +Y CH  GGNQ++  S+  EIR +   E CL  + G V L
Sbjct: 514 GRHVCLDVG-ENNQGTKPLIMYTCHGLGGNQYFEYSEQHEIRHNIQKELCLHASHGPVQL 572

Query: 273 YPC 275
             C
Sbjct: 573 KAC 575


>gi|427789065|gb|JAA59984.1| Putative polypeptide n-acetylgalactosaminyltransferase
           [Rhipicephalus pulchellus]
          Length = 626

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 104/321 (32%), Positives = 159/321 (49%), Gaps = 44/321 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+P+++++ ++ + VV P+I  I D T +        TSS  + IGGF+W  +
Sbjct: 259 CEATDHWLEPMVELIKKDRTTVVCPIIDVIDDKTLQYMG-----TSSDFYQIGGFNWKGE 313

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W   PE  RK  K+ A+P+ +PTMAGGLF+ID+ +F + G+YDS  + WGGENLE+SF
Sbjct: 314 FIWINTPEAWRKARKSKADPMRSPTMAGGLFAIDRKYFWESGSYDSEMEGWGGENLEMSF 373

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +      +    P            P   P+       I+ A   ++  +   +  +  +
Sbjct: 374 RIWMCGGSLVIAPCSHVGHIFRDYHPYKFPSNK-DTHGINTARLAEV--WMDNYKYYFYQ 430

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGMC 220
           N     K  FGD++ RK LR  L CKSFKWYL+                    N  +GMC
Sbjct: 431 NRPELRKISFGDISERKALRNKLQCKSFKWYLDNVYPNKFVPSEKVFAFGNARNPNTGMC 490

Query: 221 IDSACKPTDMHKPVGLYPCHK---QGGNQFWMMSKHGEIRRDEACL---------DYAGG 268
           +DS     D  +P+G+YPCHK    GGNQ    +   EIR++++C          D    
Sbjct: 491 LDSMSHNYDNTEPLGIYPCHKDTNSGGNQLVSYTWRHEIRKEDSCAELSSEPEKSDKTAR 550

Query: 269 DVILYPC-HGSKGNQYFEYDY 288
            V++ PC  G++  +   +D+
Sbjct: 551 KVMMAPCGEGAESEERQRWDH 571


>gi|363731636|ref|XP_419581.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Gallus
           gallus]
          Length = 566

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 107/320 (33%), Positives = 151/320 (47%), Gaps = 61/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 222 CECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 274

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK++FE+LG YD   D+WGGENLE+S
Sbjct: 275 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENLEIS 334

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RK+H     P   P  +G +F+ +              
Sbjct: 335 FRVWQCGGSLEIIPCSRVGHVFRKQH-----PYTFPGGSGTVFARNTR---------RAA 380

Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV------------- 212
           ++W  E     +          +G++ SR ELR+ L CK FKWYLE              
Sbjct: 381 EVWMDEYKNFYYAAVPSARNVPYGNIQSRMELRKRLSCKPFKWYLENVYPELRVPDHQDI 440

Query: 213 ---SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYA 266
              +      C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A
Sbjct: 441 AFGALQQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRA 498

Query: 267 GGDVI-LYPCHGSKGNQYFE 285
            G +I L  C  +   Q +E
Sbjct: 499 PGSLIKLQGCRENDSRQKWE 518


>gi|410968769|ref|XP_003990872.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 3 [Felis catus]
          Length = 633

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 104/303 (34%), Positives = 151/303 (49%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ ER+R K+   P+ TPT AGGLFSI K +FE +GTYD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHERQRRKDETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E++  L CK+F WYL                   + + 
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNTIYPEAYVPDLNPVISGYIKSI 512

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ LY CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQREIRHNIQKELCLHAAQGLVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 RAC 574


>gi|74004468|ref|XP_535940.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
           1 [Canis lupus familiaris]
          Length = 632

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 104/303 (34%), Positives = 151/303 (49%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 279 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 335

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ ER+R K+   P+ TPT AGGLFSI K +FE +GTYD   +IWGGEN+E+SF
Sbjct: 336 FGWESLPDHERQRRKDETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSF 395

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 396 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 451

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E++  L CK+F WYL                   + + 
Sbjct: 452 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNTIYPEAYVPDLNPVISGYIKSI 511

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ LY CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 512 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 570

Query: 273 YPC 275
             C
Sbjct: 571 RAC 573


>gi|1934912|emb|CAA69875.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
           sapiens]
          Length = 578

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 102/285 (35%), Positives = 143/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + R  + VV P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 230 CECNSGWLEPLLERIGRYETAVVCPVIDTIDWNTFEFYMQIG------EPMIGGFDWRLT 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P++ER R  +  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
             +       K  +GD++ RK LR  L CKSF WYL+       V  D   W G      
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495


>gi|301783121|ref|XP_002926975.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Ailuropoda melanoleuca]
 gi|281344477|gb|EFB20061.1| hypothetical protein PANDA_016676 [Ailuropoda melanoleuca]
          Length = 632

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 104/303 (34%), Positives = 151/303 (49%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 279 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 335

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ ER+R K+   P+ TPT AGGLFSI K +FE +GTYD   +IWGGEN+E+SF
Sbjct: 336 FGWESLPDHERQRRKDETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSF 395

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 396 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 451

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E++  L CK+F WYL                   + + 
Sbjct: 452 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNTIYPEAYVPDLNPVISGYIKSV 511

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ LY CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 512 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQHEIRHNIQRELCLHAAQGLVQL 570

Query: 273 YPC 275
             C
Sbjct: 571 RAC 573


>gi|417402857|gb|JAA48260.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
           rotundus]
          Length = 571

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 147/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 227 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK++FE+LG YD   D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENLEIS 339

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 450

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 451 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 508

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523


>gi|297692565|ref|XP_002823614.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pongo
           abelii]
          Length = 578

 Score =  153 bits (387), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 100/285 (35%), Positives = 144/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + R+ + +V P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 230 CECNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+++R R  +  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKQKRDRQISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
             +       K  +GD++ RK LR  L CKSF WYL+       V  D   W G      
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495


>gi|291391661|ref|XP_002712292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Oryctolagus cuniculus]
          Length = 633

 Score =  153 bits (387), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 104/303 (34%), Positives = 155/303 (51%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDMNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSG----- 218
               +  ++  +  FGD++ R E++  L CK+F WYL          E++   SG     
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKNRLQCKNFTWYLNTVYPEVYVPELNPVISGYIKTV 512

Query: 219 ---MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ LY CH  GGNQ++  S   EIR +   E CL  A G++ L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQNEIRHNIQKELCLHAAPGNLQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|301608341|ref|XP_002933751.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
           [Xenopus (Silurana) tropicalis]
          Length = 586

 Score =  153 bits (387), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 102/304 (33%), Positives = 145/304 (47%), Gaps = 36/304 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFP--PGRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   G++ S      G FDW+
Sbjct: 233 CECFHGWLEPLLSRVAEDHTAVVSPDITAINYNTFEFGKPVQQGKMNSR-----GNFDWS 287

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L FNW AIP  + K+ K+   P+ TPT AGGLFSI KA+FE +G+YD   +IWGGEN+E+
Sbjct: 288 LAFNWEAIPAADEKQRKDETYPIKTPTFAGGLFSISKAYFEHIGSYDEEMEIWGGENVEM 347

Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSGFDIW 175
           SF+          IP            P   P     +        E  +  Y   +   
Sbjct: 348 SFRVWQCGGKLEIIPCSVVGHVFRTKSPHTFPKGTQVILRNQVRLAEVWMDDYKVLYYRR 407

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWS 217
             +  +++ +  FGD++ R +L+ +L CK+F WYLE                  + N+ +
Sbjct: 408 NEQAAKIAKEKSFGDISKRLKLKADLQCKNFTWYLENIYPEMFVPDRDPTYYGAIKNEGT 467

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR---RDEACL--DYAGGDVIL 272
             CID         +   +YPCH  GGNQ++  S H E+R   + + CL   Y  G V L
Sbjct: 468 QNCIDVGENNNYGSQLPIMYPCHGMGGNQYFEYSTHKELRHNLKTQLCLCSKYEPGPVKL 527

Query: 273 YPCH 276
             C 
Sbjct: 528 VDCQ 531


>gi|395836156|ref|XP_003791031.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
           [Otolemur garnettii]
          Length = 571

 Score =  153 bits (387), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 105/315 (33%), Positives = 147/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 227 CECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 450

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+    TD    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 451 QQGTNCLDTLGHFTD--GVVGVYECHNAGGNQEWALTKEKAVKHIDLCLTVVDRAPGALI 508

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523


>gi|403272081|ref|XP_003927917.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Saimiri
           boliviensis boliviensis]
          Length = 578

 Score =  153 bits (387), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 101/285 (35%), Positives = 142/285 (49%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + R+ + +V P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 230 CECNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R  +  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKYERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARAAEVWMDEYKE 392

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
             +       K  +GD++ RK LR  L CKSF WYL+       V  D   W G      
Sbjct: 393 HFYNRNPPARKEAYGDISERKLLRERLKCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  +   EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSKKEIR 495


>gi|307183874|gb|EFN70488.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Camponotus
           floridanus]
          Length = 451

 Score =  153 bits (386), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 107/320 (33%), Positives = 154/320 (48%), Gaps = 56/320 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A + + VV P+I  I  DTF+       L        GGFDW+L 
Sbjct: 100 CECNADWLEPLLERVAEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWSLV 152

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  + + ER+ R K+  + + TP +AGGLF I+KA+FEKLG YD+  D+WGGENL + 
Sbjct: 153 FKWEYLSQAERQARQKDPTQAIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLGIV 212

Query: 123 FKFNWHAIPERE-------------------RKRHKNAAEPVWTPTMAGGLFSIDKAFFE 163
            +F+   I  R                    RKRH     P   P  +G +F+ +     
Sbjct: 213 IQFHVQKISFRVWQCGGSLEIIPCSRVGHVFRKRH-----PYSFPGGSGNVFARNTRRAA 267

Query: 164 KLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------- 212
           ++   D  +  +    + L+    +G++  R EL+R L CK F WYL+            
Sbjct: 268 EVWMDD--YKQFYYNAVPLARNIPYGNIQDRMELKRRLHCKPFSWYLKNVYPELVIPTSE 325

Query: 213 -----SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD--- 264
                S      C+DS     D +  VGLYPCH  GGNQ W ++K G I+  + CL    
Sbjct: 326 GGPGGSLKQGTACLDSMGHLLDGN--VGLYPCHNTGGNQEWGLTKDGLIKHHDLCLTLPV 383

Query: 265 YAGGDVILYP-CHGSKGNQY 283
           YA G  +L   C GS+  ++
Sbjct: 384 YAKGTTLLMQICDGSENQKW 403


>gi|170038563|ref|XP_001847118.1| polypeptide N-acetylgalactosaminyltransferase 5 [Culex
           quinquefasciatus]
 gi|167882317|gb|EDS45700.1| polypeptide N-acetylgalactosaminyltransferase 5 [Culex
           quinquefasciatus]
          Length = 531

 Score =  153 bits (386), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 79/156 (50%), Positives = 91/156 (58%), Gaps = 54/156 (34%)

Query: 185 KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSN------------- 214
           KGD+GDV+SRK+LR  LGCKSF+WYL                 EV N             
Sbjct: 327 KGDYGDVSSRKQLREELGCKSFRWYLDNIFPELFIPGEAVASGEVRNMGYGNRTCLDAPG 386

Query: 215 ------------------------DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMM 250
                                    WSG+CIDSA KP DMH P+G++PCH+ GGNQ+WM+
Sbjct: 387 GKKNLRKPVGLYPCHNQGGNQVANPWSGLCIDSAAKPEDMHTPLGIWPCHQAGGNQYWML 446

Query: 251 SKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
           SK GEIRRDEACLDYAG DVILYPCHGSKGNQY+ Y
Sbjct: 447 SKTGEIRRDEACLDYAGQDVILYPCHGSKGNQYWNY 482


>gi|327270185|ref|XP_003219870.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
           [Anolis carolinensis]
          Length = 592

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 101/295 (34%), Positives = 145/295 (49%), Gaps = 55/295 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +    S VV P+I  I  +TFE       L ++ +  IGGFDW L 
Sbjct: 240 CECHEEWLEPLLERIKEEPSAVVCPVIDVIDWNTFEY------LGNAGEPQIGGFDWRLV 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH +PERE+K+ ++  + + +PTMAGGLF+++K +F  LG+YD+G ++WGGENLE SF
Sbjct: 294 FTWHVVPEREQKQRRSKTDVIRSPTMAGGLFAVNKNYFSYLGSYDTGMEVWGGENLEFSF 353

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAF--FEKLGTYDSGFDIWGGENLE 181
           +  W              +  +   +  G +F     +   + L       ++W     E
Sbjct: 354 RI-WQC----------GGSLEIHPCSHVGHVFPKQAPYSRAKALANSVRAAEVWMDSYKE 402

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSG--------- 218
           L +  +       +GDVT R+ LR  L CK FKWYL+       V  D  G         
Sbjct: 403 LYYHRNPHARMEPYGDVTERRLLREKLKCKDFKWYLDNIYPELHVPEDRLGYFGMLKNKG 462

Query: 219 ---MCIDSACKPTDMHKPVG----LYPCHKQGGNQFWMMSKHGEIRRD----EAC 262
               C D    P + H   G    LYPCH  G NQF+  + + EIR +    EAC
Sbjct: 463 MANFCFDY--NPPNEHDITGHVVILYPCHGMGQNQFFEYTSYHEIRYNTRHPEAC 515


>gi|327290100|ref|XP_003229762.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Anolis carolinensis]
          Length = 634

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 102/300 (34%), Positives = 149/300 (49%), Gaps = 42/300 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N+++VVSP I++I  +TFE   P     S  +   G FDW+L 
Sbjct: 281 CECFYGWLEPLLARIAENNTYVVSPDISSIDLNTFEFSKPSPYGQSHNR---GNFDWSLS 337

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++PE E K+ K+   P+ TPT AGGLFSI K +F  +G+YD   +IWGGEN+E+SF
Sbjct: 338 FGWESLPEHESKKRKDETYPIKTPTFAGGLFSISKDYFYNIGSYDEEMEIWGGENIEMSF 397

Query: 124 KF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           +          IP        R +  H   + P  T  +      + + + ++   Y + 
Sbjct: 398 RVWQCGGQLEIIPCSVVGHVFRSKSPH---SFPKGTQVITRNQVRLAEVWMDE---YKNI 451

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKP-TDM 230
           F     E  ++  +  FGD++ R EL++ L CK FKWYL  SN +    +     P +  
Sbjct: 452 FYRRNTEAAKIVKQQTFGDISKRHELKQRLQCKDFKWYL--SNVYPEAYVPDLNPPLSGF 509

Query: 231 HKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEYDYKY 290
            K VG   C   G N                  ++ G  +I+Y CHG  GNQYFEY  ++
Sbjct: 510 LKNVGRRACLDVGEN------------------NHGGKPLIMYTCHGLGGNQYFEYSARH 551


>gi|449497211|ref|XP_002190803.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
           [Taeniopygia guttata]
          Length = 669

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 106/320 (33%), Positives = 151/320 (47%), Gaps = 61/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+          +     GGFDWNL 
Sbjct: 325 CECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 377

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK++FE+LG YD   D+WGGENLE+S
Sbjct: 378 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENLEIS 437

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RK+H     P   P  +G +F+ +              
Sbjct: 438 FRVWQCGGSLEIIPCSRVGHVFRKQH-----PYTFPGGSGTVFARNTR---------RAA 483

Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV------------- 212
           ++W  E     +          +G++ SR ELR+ L CK FKWYLE              
Sbjct: 484 EVWMDEYKNFYYAAVPSARNVPYGNIQSRMELRKRLSCKPFKWYLENVYPELRVPDHQDI 543

Query: 213 ---SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYA 266
              +      C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A
Sbjct: 544 AFGALQQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRA 601

Query: 267 GGDVI-LYPCHGSKGNQYFE 285
            G +I L  C  +   Q +E
Sbjct: 602 PGSLIKLQGCRENDSRQKWE 621


>gi|345798845|ref|XP_003434499.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Canis
           lupus familiaris]
          Length = 588

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 106/320 (33%), Positives = 150/320 (46%), Gaps = 61/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+          +     GGFDWNL 
Sbjct: 244 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 296

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 297 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 356

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   +P        RK+H     P   P  +G +F+ +              
Sbjct: 357 FRVWQCGGSLEIVPCSRVGHVFRKQH-----PYTFPGGSGTVFARNTR---------RAA 402

Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV------------- 212
           ++W  E     +          +G++ SR ELR+ L CK FKWYLE              
Sbjct: 403 EVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDI 462

Query: 213 ---SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYA 266
              +      C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A
Sbjct: 463 AFGALQQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRA 520

Query: 267 GGDVI-LYPCHGSKGNQYFE 285
            G VI L  C  +   Q +E
Sbjct: 521 PGSVIKLQGCRENDTRQKWE 540


>gi|426226648|ref|XP_004007451.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 6 [Ovis aries]
          Length = 792

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 104/314 (33%), Positives = 156/314 (49%), Gaps = 37/314 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 442 CECFHGWLEPLLARIAEDETVVVSPNIVTIDLNTFEFSKPVQRGRVQSR-----GNFDWS 496

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P RE++R K+   P+ +PT AGGLFSI KA+FE +GTYD+  +IWGGEN+E+
Sbjct: 497 LTFGWEVLPAREKQRRKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEM 556

Query: 122 SFKFNWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGF-DIWGG 177
           SF+  W    + E            T    T   G+  I +        +  G+ +I+  
Sbjct: 557 SFRV-WQCGGQLEIIPCSVVGHVFRTKSPHTFPKGINVIARNQVRLAEVWMDGYKEIFYR 615

Query: 178 ENL---ELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDW 216
            NL   +++ +  FGD++ R +LR  L C++F W+L+                  + N  
Sbjct: 616 RNLQAAQMAREKSFGDISERLQLRERLNCRNFSWFLDNIYPEMFVPDLKPTFFGALKNLG 675

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGGDVILY 273
              C+D   +  +  KP+ LY CH  GGNQ++  +   ++R + A   CL  + G + L 
Sbjct: 676 VDHCLDVG-ENNNGGKPLILYACHGLGGNQYFEYTTQRDLRHNIAKQLCLHASAGTLGLR 734

Query: 274 PCHGSKGNQYFEYD 287
            CH +  N     D
Sbjct: 735 SCHFTGKNSQVPKD 748


>gi|431895640|gb|ELK05066.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Pteropus alecto]
          Length = 367

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 104/313 (33%), Positives = 145/313 (46%), Gaps = 47/313 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 23  CECNDHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 75

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 76  FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 135

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 136 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 186

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-----------VSNDWSGM 219
           E     +          +G++ SR ELR+ L CK FKWYLE               +  +
Sbjct: 187 EYKNFYYAAVPSARNVPYGNIQSRLELRKTLACKPFKWYLENVYPELRVPDHQDIAFGAL 246

Query: 220 CIDSACKPTDMH---KPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI-L 272
              + C  T  H     VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I L
Sbjct: 247 QQGTNCLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLIKL 306

Query: 273 YPCHGSKGNQYFE 285
             C  +   Q +E
Sbjct: 307 QGCRENDSRQKWE 319


>gi|302565702|ref|NP_001181690.1| polypeptide N-acetylgalactosaminyltransferase 4 [Macaca mulatta]
 gi|380817542|gb|AFE80645.1| polypeptide N-acetylgalactosaminyltransferase 4 [Macaca mulatta]
          Length = 578

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 100/285 (35%), Positives = 143/285 (50%), Gaps = 50/285 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + R+ + +V P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 230 CECNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P+ ER R  +  +P+ +PTMAGGLF++ K +F+ LGTYD+G ++WGGENLELSF
Sbjct: 284 FQWHSVPKHERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSF 343

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W    + E          +   +  G +F     +     L       ++W  E  E
Sbjct: 344 RV-WQCGGKLE----------IHPCSHVGHVFPKRAPYARPNFLQNTARVAEVWMDEYKE 392

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
             +       K  +GD++ RK +R  L CKSF WYL+       V  D   W G      
Sbjct: 393 HFYNRNPPARKEAYGDISERKLIRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRG 452

Query: 221 IDSACKPTDMHKP--------VGLYPCHKQGGNQFWMMSKHGEIR 257
           I S C   D + P        + L+ CH QGGNQF+  + + EIR
Sbjct: 453 ISSEC--LDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIR 495


>gi|354468855|ref|XP_003496866.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
           [Cricetulus griseus]
 gi|344247257|gb|EGW03361.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Cricetulus
           griseus]
          Length = 535

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 102/313 (32%), Positives = 147/313 (46%), Gaps = 47/313 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  +RWL+PLL+ +A + + VVSP+I  I  D F+          +     GGFDWNL 
Sbjct: 191 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 243

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 244 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 303

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 304 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 354

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-----------VSNDWSGM 219
           E     +          +G++ SR ELR+ L CK FKWYL+               +  +
Sbjct: 355 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLDNVYPELRVPDHQDIAFGAL 414

Query: 220 CIDSACKPTDMH---KPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI-L 272
              + C  T  H     VG+Y CH  GGNQ W ++K   ++  + CL   D + G +I L
Sbjct: 415 QQGTNCLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLIRL 474

Query: 273 YPCHGSKGNQYFE 285
             C  +   Q +E
Sbjct: 475 QGCRENDSRQKWE 487


>gi|392347955|ref|XP_232988.5| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
           [Rattus norvegicus]
          Length = 579

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 107/304 (35%), Positives = 148/304 (48%), Gaps = 53/304 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +    S VV P+I  I  +TFE       L +S +  IGGFDW L 
Sbjct: 226 CECHEGWLEPLLQRIHEKESAVVCPVIDVIDWNTFEY------LGNSGEPQIGGFDWRLV 279

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH +P+RERK  ++  + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 280 FTWHVVPQRERKLMRSPIDVIRSPTMAGGLFAVSKRYFEYLGSYDTGMEVWGGENLEFSF 339

Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S  KA    L       ++W  +  E
Sbjct: 340 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDDFKE 388

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLEV-------------------SND 215
           L +  +       FGDVT RK+LR  L CK FKW+L+                    +  
Sbjct: 389 LYYHRNPQARLEPFGDVTERKKLRAKLQCKDFKWFLDTVYPELHVPEDRPGFFGMLENRG 448

Query: 216 WSGMCID---SACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI----RRDEACLDYAGG 268
             G C+D    +    + H+ V LY CH  G NQF+  +   EI    R+ EAC+    G
Sbjct: 449 LRGYCLDYNPPSENNVEGHQ-VLLYLCHGMGQNQFFEYTSRQEIRYNTRQPEACIAVEEG 507

Query: 269 DVIL 272
             +L
Sbjct: 508 KDVL 511


>gi|355767580|gb|EHH62635.1| hypothetical protein EGM_21033, partial [Macaca fascicularis]
          Length = 453

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 109 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 161

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 162 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 221

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 222 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 272

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 273 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 332

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 333 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 390

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 391 KLQGCRENDSRQKWE 405


>gi|441612314|ref|XP_004088076.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
           [Nomascus leucogenys]
          Length = 570

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 226 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 278

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 279 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 338

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 339 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 389

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 390 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 449

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 450 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 507

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 508 KLQGCRENDSRQKWE 522


>gi|109476381|ref|XP_001066416.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
           [Rattus norvegicus]
          Length = 576

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 107/304 (35%), Positives = 148/304 (48%), Gaps = 53/304 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +    S VV P+I  I  +TFE       L +S +  IGGFDW L 
Sbjct: 226 CECHEGWLEPLLQRIHEKESAVVCPVIDVIDWNTFEY------LGNSGEPQIGGFDWRLV 279

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH +P+RERK  ++  + + +PTMAGGLF++ K +FE LG+YD+G ++WGGENLE SF
Sbjct: 280 FTWHVVPQRERKLMRSPIDVIRSPTMAGGLFAVSKRYFEYLGSYDTGMEVWGGENLEFSF 339

Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S  KA    L       ++W  +  E
Sbjct: 340 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDDFKE 388

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLEV-------------------SND 215
           L +  +       FGDVT RK+LR  L CK FKW+L+                    +  
Sbjct: 389 LYYHRNPQARLEPFGDVTERKKLRAKLQCKDFKWFLDTVYPELHVPEDRPGFFGMLENRG 448

Query: 216 WSGMCID---SACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI----RRDEACLDYAGG 268
             G C+D    +    + H+ V LY CH  G NQF+  +   EI    R+ EAC+    G
Sbjct: 449 LRGYCLDYNPPSENNVEGHQ-VLLYLCHGMGQNQFFEYTSRQEIRYNTRQPEACIAVEEG 507

Query: 269 DVIL 272
             +L
Sbjct: 508 KDVL 511


>gi|4758412|ref|NP_004472.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Homo
           sapiens]
 gi|51315838|sp|Q10471.1|GALT2_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
           AltName: Full=Polypeptide GalNAc transferase 2;
           Short=GalNAc-T2; Short=pp-GaNTase 2; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 2;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 2; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 2
           soluble form
 gi|971461|emb|CAA59381.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase [Homo
           sapiens]
 gi|26996816|gb|AAH41120.1| GALNT2 protein [Homo sapiens]
 gi|119590317|gb|EAW69911.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
           CRA_c [Homo sapiens]
 gi|239740418|gb|ACS13744.1| polypeptide N-acetylgalactosaminyltransferase 2 [Homo sapiens]
 gi|307686451|dbj|BAJ21156.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 [synthetic
           construct]
          Length = 571

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 227 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 450

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 451 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 508

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523


>gi|158261119|dbj|BAF82737.1| unnamed protein product [Homo sapiens]
          Length = 571

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 227 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 450

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 451 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 508

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523


>gi|149730677|ref|XP_001496099.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Equus
           caballus]
          Length = 633

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 103/303 (33%), Positives = 150/303 (49%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDMNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ ER+R K+   P+ TPT AGGLFSI K +FE +GTYD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHERQRRKDETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R  ++  L CK+F WYL                   + + 
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFAIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSF 512

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ LY CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 513 GQSLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|148878418|gb|AAI46056.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Bos
           taurus]
 gi|296487792|tpg|DAA29905.1| TPA: polypeptide N-acetylgalactosaminyltransferase 6 [Bos taurus]
          Length = 622

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 103/316 (32%), Positives = 154/316 (48%), Gaps = 41/316 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDETVVVSPNIVTIDLNTFEFSKPVQRGRIQSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P RE++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWEVLPAREKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF-DIW 175
           SF+          IP            P    T   G+  I +        +  G+ +I+
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSP---HTFPKGINVIARNQVRLAEVWMDGYKEIF 443

Query: 176 GGENL---ELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSN 214
              NL   +++ +  FGD++ R +LR  L C +F W+L+                  + N
Sbjct: 444 YRRNLQAAQMAREKSFGDISERLQLRERLNCHNFSWFLDNVYPEMFVPDLKPTFFGALKN 503

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGGDVI 271
                C+D   +  +  KP+ LY CH  GGNQ++  +   ++R + A   CL  + G + 
Sbjct: 504 LGVDHCLDVG-ENNNGGKPLILYTCHGLGGNQYFEYTTQRDLRHNIAKQLCLHASAGTLG 562

Query: 272 LYPCHGSKGNQYFEYD 287
           L  CH +  N     D
Sbjct: 563 LRSCHFTGKNSQVPKD 578


>gi|27370010|ref|NP_766281.1| polypeptide N-acetylgalactosaminyltransferase 12 [Mus musculus]
 gi|51315979|sp|Q8BGT9.1|GLT12_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 12;
           AltName: Full=Polypeptide GalNAc transferase 12;
           Short=GalNAc-T12; Short=pp-GaNTase 12; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 12;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 12
 gi|26329325|dbj|BAC28401.1| unnamed protein product [Mus musculus]
 gi|26334957|dbj|BAC31179.1| unnamed protein product [Mus musculus]
 gi|33991661|gb|AAH56425.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 12 [Mus musculus]
 gi|52851351|dbj|BAD52068.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase [Mus musculus]
 gi|74140287|dbj|BAE33836.1| unnamed protein product [Mus musculus]
          Length = 576

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 107/304 (35%), Positives = 146/304 (48%), Gaps = 53/304 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +    S VV P+I  I  +TFE       L +S +  IGGFDW L 
Sbjct: 226 CECHEGWLEPLLQRIHEKESAVVCPVIDVIDWNTFEY------LGNSGEPQIGGFDWRLV 279

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH +P+RER+  ++  + + +PTMAGGLF++ K +F+ LG+YD+G ++WGGENLE SF
Sbjct: 280 FTWHVVPQRERQSMRSPIDVIRSPTMAGGLFAVSKRYFDYLGSYDTGMEVWGGENLEFSF 339

Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S  KA    L       ++W  E  E
Sbjct: 340 RI-WQCGGTLETHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDEFKE 388

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLEV-------------------SND 215
           L +  +       FGDVT RK+LR  L CK FKW+L+                    +  
Sbjct: 389 LYYHRNPQARLEPFGDVTERKKLRAKLQCKDFKWFLDTVYPELHVPEDRPGFFGMLQNRG 448

Query: 216 WSGMCIDSACKPTDMH---KPVGLYPCHKQGGNQFWMMSKHGEI----RRDEACLDYAGG 268
             G C+D    P + H     V LY CH  G NQF+  +   EI    R+ EAC+    G
Sbjct: 449 LRGYCLDYN-PPNENHVEGHQVLLYLCHGMGQNQFFEYTTRKEIRYNTRQPEACITVEDG 507

Query: 269 DVIL 272
              L
Sbjct: 508 KDTL 511


>gi|410342331|gb|JAA40112.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
           troglodytes]
 gi|410342333|gb|JAA40113.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
           troglodytes]
          Length = 576

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 232 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 284

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 285 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 344

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 345 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 395

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 396 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 455

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 456 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 513

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 514 KLQGCRENDSRQKWE 528


>gi|332812181|ref|XP_003308857.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Pan
           troglodytes]
 gi|410227516|gb|JAA10977.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
           troglodytes]
 gi|410264536|gb|JAA20234.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
           troglodytes]
 gi|410296424|gb|JAA26812.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
           troglodytes]
          Length = 576

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 232 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 284

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 285 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 344

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 345 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 395

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 396 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 455

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 456 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 513

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 514 KLQGCRENDSRQKWE 528


>gi|386780726|ref|NP_001248284.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Macaca
           mulatta]
 gi|384941838|gb|AFI34524.1| polypeptide N-acetylgalactosaminyltransferase 2 [Macaca mulatta]
 gi|387540526|gb|AFJ70890.1| polypeptide N-acetylgalactosaminyltransferase 2 [Macaca mulatta]
          Length = 571

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 227 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 450

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 451 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 508

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523


>gi|291402210|ref|XP_002717436.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
           [Oryctolagus cuniculus]
          Length = 571

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 227 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 450

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 451 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 508

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523


>gi|390477336|ref|XP_003735278.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 2 [Callithrix jacchus]
          Length = 571

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 105/317 (33%), Positives = 148/317 (46%), Gaps = 55/317 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 227 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGM----------- 219
           E     +          +G++ SR ELR+ L CK FKWYLE  N +  +           
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLE--NVYPELRVPDHQDIALG 448

Query: 220 -------CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGD 269
                  C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G 
Sbjct: 449 XLQQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGS 506

Query: 270 VI-LYPCHGSKGNQYFE 285
           +I L  C  +   Q +E
Sbjct: 507 LIKLQGCRENDSRQKWE 523


>gi|321476751|gb|EFX87711.1| hypothetical protein DAPPUDRAFT_306553 [Daphnia pulex]
          Length = 626

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 103/289 (35%), Positives = 146/289 (50%), Gaps = 38/289 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   V+ P+I  I D T E         S   F IG F W+  
Sbjct: 273 CEATLGWLEPLLQRIKEDKRAVLVPIIDVIDDKTLEYYH-----GSPESFQIGSFTWSGH 327

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP+RE KR  +   P  +PTMAGGLF+ID+ +F  LG+YD G D+WGGENLE+SF
Sbjct: 328 FTWMDIPKREIKRRGSRVGPTNSPTMAGGLFAIDRQYFWDLGSYDEGMDVWGGENLEMSF 387

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWG 176
           +      +   IP   R  H   +   +T         I+ A   +  +  Y   F +  
Sbjct: 388 RIWMCGGSLETIP-CSRVGHIFRSFHPYTFPGNKDTHGINTARVVEVWMDDYKELFYMHR 446

Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSG 218
           G+   +    D GD ++RK+LR++L CKSFKWYLE                  V ND  G
Sbjct: 447 GDLKTI----DIGDTSARKKLRKDLKCKSFKWYLENVLPDKFIMTEHSLGYGRVMNDAFG 502

Query: 219 --MCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLD 264
             +C+D+  +  D    +G YPCH Q   +Q + +SK G++RR+E+C +
Sbjct: 503 KQLCLDNLQRNEDQPYNLGQYPCHAQMAMSQVFALSKLGQLRREESCAE 551


>gi|402858708|ref|XP_003893834.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 2 [Papio anubis]
          Length = 571

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 227 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 450

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 451 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDXCLTVVDRAPGSLI 508

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523


>gi|426220977|ref|XP_004004688.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Ovis
           aries]
          Length = 633

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 103/303 (33%), Positives = 150/303 (49%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P+ E++R K+   P+ TPT AGGLFSI K +FE +GTYD   +IWGGEN+E+SF
Sbjct: 337 FGWETLPDHEKQRRKDETYPIKTPTFAGGLFSISKDYFEYIGTYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E++  L CK+F WYL                   + + 
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ LY CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQREIRHNIQKELCLHAAQGVVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|88192992|pdb|2FFU|A Chain A, Crystal Structure Of Human Ppgalnact-2 Complexed With Udp
           And Ea2
 gi|88192994|pdb|2FFV|A Chain A, Human Ppgalnact-2 Complexed With Manganese And Udp
 gi|88192995|pdb|2FFV|B Chain B, Human Ppgalnact-2 Complexed With Manganese And Udp
          Length = 501

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 157 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 209

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 210 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 269

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 270 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 320

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 321 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 380

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 381 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 438

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 439 KLQGCRENDSRQKWE 453


>gi|332265853|ref|XP_003281928.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
           2 [Nomascus leucogenys]
          Length = 571

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 227 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 450

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 451 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 508

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523


>gi|126307024|ref|XP_001369295.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
           [Monodelphis domestica]
          Length = 571

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 227 CECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYSFPGGSGTVFARNTR---------RAAEVWMD 390

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLNCKPFKWYLENVYPELRVPDHQDIAFGAL 450

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 451 QQGNNCLDTLGHFAD--GVVGVYECHNSGGNQEWALTKDKSVKHMDLCLTVVDRAPGSLI 508

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523


>gi|350592744|ref|XP_001927809.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Sus
           scrofa]
          Length = 571

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 227 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 279

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 280 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 339

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 340 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 390

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 391 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 450

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 451 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 508

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 509 KLQGCRENDSRQKWE 523


>gi|62751482|ref|NP_001015534.1| polypeptide N-acetylgalactosaminyltransferase 6 [Bos taurus]
 gi|75057892|sp|Q5EA41.1|GALT6_BOVIN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
           AltName: Full=Polypeptide GalNAc transferase 6;
           Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 6;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 6
 gi|59857821|gb|AAX08745.1| polypeptide N-acetylgalactosaminyltransferase 6 [Bos taurus]
          Length = 622

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 103/316 (32%), Positives = 154/316 (48%), Gaps = 41/316 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDETVVVSPNIVTIDLNTFEFSKPVQRGRVQSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P RE++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWEVLPAREKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF-DIW 175
           SF+          IP            P    T   G+  I +        +  G+ +I+
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSP---HTFPKGINVIARNQVRLAEVWMDGYKEIF 443

Query: 176 GGENL---ELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSN 214
              NL   +++ +  FGD++ R +LR  L C +F W+L+                  + N
Sbjct: 444 YRRNLQAAQMAREKSFGDISERLQLRERLNCHNFSWFLDNVYPEMFVPDLKPTFFGALKN 503

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGGDVI 271
                C+D   +  +  KP+ LY CH  GGNQ++  +   ++R + A   CL  + G + 
Sbjct: 504 LGVDHCLDVG-ENNNGGKPLILYTCHGLGGNQYFEYTTQRDLRHNIAKQLCLHASAGTLG 562

Query: 272 LYPCHGSKGNQYFEYD 287
           L  CH +  N     D
Sbjct: 563 LRSCHFTGKNSQVPKD 578


>gi|380798879|gb|AFE71315.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor, partial
           [Macaca mulatta]
          Length = 554

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 210 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 262

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 263 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 322

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 323 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 373

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 374 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 433

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 434 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 491

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 492 KLQGCRENDSRQKWE 506


>gi|431894865|gb|ELK04658.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Pteropus alecto]
          Length = 633

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 106/303 (34%), Positives = 152/303 (50%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P     +  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDMNTFEFNKPSPYGNNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ ER+R K+   P+ TPT AGGLFSI K +FE +GTYD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHERQRRKDETYPIKTPTFAGGLFSISKEYFEYIGTYDDEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + +    Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDD---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSG----- 218
               +  ++  +  FGD++ R E++  L CK+F WYL          +++   SG     
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSF 512

Query: 219 ---MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ LY CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSVQHEIRHNIQKELCLHAAQGLVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|332812183|ref|XP_001147638.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
           4 [Pan troglodytes]
          Length = 533

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 189 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 241

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 242 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 301

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 302 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 352

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 353 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 412

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 413 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 470

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 471 KLQGCRENDSRQKWE 485


>gi|312370886|gb|EFR19191.1| hypothetical protein AND_22918 [Anopheles darlingi]
          Length = 1204

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 102/305 (33%), Positives = 149/305 (48%), Gaps = 37/305 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL+ L+  +A+  + +  P I  I +DT  L        +S +++ G FDW L 
Sbjct: 222 CEVIEGWLEALVAHVAQRETMIAIPAIDWIHEDTLALN-----AQNSVRYY-GSFDWGLN 275

Query: 64  FNWHAIPER--ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           F W    +R  +     N A P  TPTMAGGLF+I ++FFE+LG YD G  I+GGEN+EL
Sbjct: 276 FQWRVRADRIMQPAMAGNPAAPYDTPTMAGGLFTIHRSFFERLGWYDEGMQIYGGENMEL 335

Query: 122 SFKFNWHA-----IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY-----DSG 171
           SFK  W       I    R  H       +   ++ G   + +        +     D  
Sbjct: 336 SFK-AWMCGGSMQIVGCSRVAHIQKRGHPYLRQLSDGFALVRRNSIRVAEVWLDEYADYF 394

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------VSNDWSGM 219
           ++ +GG     + +G FG++T R ELR+ L CK F+WYLE            V+      
Sbjct: 395 YETFGGR----ARRGSFGNLTERHELRQRLACKPFRWYLETVFPEQFDPSKAVARGEIRF 450

Query: 220 CIDSACKPTDMHKP--VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHG 277
             D+   P  +  P  + L  CH  GG+Q W ++  GE+ R++ CLDY G  + +  CHG
Sbjct: 451 ADDAKATPLCLDWPSLLSLVTCHGYGGHQLWYLTAKGEVTREDHCLDYDGELLSVVRCHG 510

Query: 278 SKGNQ 282
             GNQ
Sbjct: 511 LGGNQ 515



 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 102/321 (31%), Positives = 147/321 (45%), Gaps = 56/321 (17%)

Query: 4    CEVQKRWLQPLLDVLARNSSHVVS-PLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNL 62
            CE    WL+  LDV+AR+  H ++ P I  I +         G +++    + G   W L
Sbjct: 849  CECMVGWLEGQLDVVARDPRHTIALPTIDWIDEKNL------GLVSNKAPVYYGAMGWGL 902

Query: 63   QFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
             F W    +R  K  +N  EP  TP MAGGLF+I +  FE LG YD   D++GGEN+ELS
Sbjct: 903  DFQWRGRWDRVNK-PENKLEPFSTPVMAGGLFTIHRKLFEWLGWYDQQLDVYGGENIELS 961

Query: 123  FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSG-FDI 174
             K          +P       +    P +   +A  +   +     +  L  Y +  +D+
Sbjct: 962  LKAWMCGGQLLTVPCSRVAHIQKTGHP-YLLGLAKDVARTNSVRVAEVWLDQYAAVLYDL 1020

Query: 175  WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV------------------SNDW 216
            +GG      ++GDFGDVT RK+LRR L CKSF+WYLE                    N+ 
Sbjct: 1021 FGGPQ----YRGDFGDVTERKQLRRALHCKSFRWYLETVFPELAPALDKRPGHGRFENEA 1076

Query: 217  SGM------CI---DSACKPTDMHKPVGLYPCHK-QGGNQFWMMSKHGEIRRDEACLDYA 266
              M      C+    SA  PT       + PC       Q W+ +  GE+  +  CLDY 
Sbjct: 1077 LSMEGQPKHCLTAQSSAGLPT-------MEPCQAGSDARQHWLHNLFGELSNENRCLDYD 1129

Query: 267  GGDVILYPCHGSKGNQYFEYD 287
            G  + +Y CH ++GNQ + Y+
Sbjct: 1130 GSALRVYACHKARGNQEWRYN 1150


>gi|149758073|ref|XP_001496259.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Equus
           caballus]
          Length = 539

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 195 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 247

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 248 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 307

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 308 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 358

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 359 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 418

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 419 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 476

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 477 KLQGCRENDSRQKWE 491


>gi|344268030|ref|XP_003405867.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Loxodonta africana]
          Length = 633

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/303 (34%), Positives = 154/303 (50%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +FE +GTYD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSG----- 218
               +  ++  +  FGD++ R E++  L CK+F WYL          +++   SG     
Sbjct: 453 RRNTDAAKIVRQKSFGDLSKRFEIKHRLQCKNFTWYLNSVYPEVYVPDLNPVISGYIKSF 512

Query: 219 ---MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ +Y CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 513 GQHLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAPGPVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 RTC 574


>gi|119590314|gb|EAW69908.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
           CRA_a [Homo sapiens]
          Length = 508

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 189 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 241

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 242 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 301

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 302 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 352

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 353 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 412

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 413 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 470

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 471 KLQGCRENDSRQKWE 485


>gi|395531657|ref|XP_003767891.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
           [Sarcophilus harrisii]
          Length = 542

 Score =  151 bits (382), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 107/320 (33%), Positives = 150/320 (46%), Gaps = 61/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 198 CECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 250

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 251 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 310

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RK+H     P   P  +G +F+ +              
Sbjct: 311 FRVWQCGGSLEIIPCSRVGHVFRKQH-----PYSFPGGSGTVFARNTR---------RAA 356

Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV------------- 212
           ++W  E     +          +G++ SR ELR+ L CK FKWYLE              
Sbjct: 357 EVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDI 416

Query: 213 ---SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYA 266
              +      C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A
Sbjct: 417 AFGALQQGNNCLDTLGHFAD--GVVGVYECHNSGGNQEWALTKDKSVKHMDLCLTVVDRA 474

Query: 267 GGDVI-LYPCHGSKGNQYFE 285
            G +I L  C  +   Q +E
Sbjct: 475 PGSLIKLQGCRENDSRQKWE 494


>gi|397508104|ref|XP_003824510.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Pan
           paniscus]
          Length = 533

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 189 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 241

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 242 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 301

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 302 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 352

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 353 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 412

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 413 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 470

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 471 KLQGCRENDSRQKWE 485


>gi|348575518|ref|XP_003473535.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Cavia porcellus]
          Length = 531

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 101/315 (32%), Positives = 147/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+          +     GGFDWNL 
Sbjct: 187 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 239

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 240 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 299

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 300 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 350

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ LGC+ F+WYLE                 + 
Sbjct: 351 EYKNFYYAAVPSARNVPYGNIQSRLELRKRLGCRPFQWYLENVYPELRVPDHQDIAFGAL 410

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G ++
Sbjct: 411 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGALV 468

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 469 KLQGCRENDSRQKWE 483


>gi|221043222|dbj|BAH13288.1| unnamed protein product [Homo sapiens]
          Length = 533

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 189 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 241

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 242 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 301

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 302 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 352

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 353 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 412

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 413 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 470

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 471 KLQGCRENDSRQKWE 485


>gi|426334121|ref|XP_004028610.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Gorilla
           gorilla gorilla]
          Length = 533

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 189 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 241

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 242 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 301

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 302 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 352

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 353 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 412

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 413 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 470

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 471 KLQGCRENDSRQKWE 485


>gi|296490594|tpg|DAA32707.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Bos
           taurus]
 gi|440907905|gb|ELR57989.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Bos grunniens
           mutus]
          Length = 633

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 106/303 (34%), Positives = 153/303 (50%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P+ E++R K+   P+ TPT AGGLFSI K +FE +GTYD   +IWGGEN+E+SF
Sbjct: 337 FGWETLPDHEKQRRKDETYPIKTPTFAGGLFSISKDYFEYIGTYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSGMCIDS 223
               +  ++  +  FGD++ R E++  L CK+F WYL          +++   SG  I S
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGY-IKS 511

Query: 224 ACKPTDMH--------KPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
             +P  +         KP+ LY CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 512 VGRPLCLDVGENNQGGKPLILYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAALGAVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|119590315|gb|EAW69909.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
           CRA_b [Homo sapiens]
 gi|119590316|gb|EAW69910.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
           CRA_b [Homo sapiens]
          Length = 533

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 189 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 241

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 242 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 301

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 302 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 352

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 353 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 412

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 413 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 470

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 471 KLQGCRENDSRQKWE 485


>gi|351708624|gb|EHB11543.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Heterocephalus
           glaber]
          Length = 567

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 102/315 (32%), Positives = 147/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 223 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 275

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 276 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 335

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 336 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 386

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ LGC+ F+WYLE                 + 
Sbjct: 387 EYKNFYYAAVPSARNVPYGNIQSRLELRKRLGCQPFQWYLENVYPELRVPDHQDIAFGAL 446

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G ++
Sbjct: 447 QQGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGALV 504

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 505 KLQGCRENDSRQKWE 519


>gi|355559183|gb|EHH15963.1| hypothetical protein EGK_02147, partial [Macaca mulatta]
          Length = 530

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 186 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 238

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 239 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 298

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 299 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 349

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 350 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 409

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 410 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 467

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 468 KLQGCRENDSRQKWE 482


>gi|300797404|ref|NP_001179787.1| polypeptide N-acetylgalactosaminyltransferase 3 [Bos taurus]
          Length = 633

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 106/303 (34%), Positives = 153/303 (50%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P+ E++R K+   P+ TPT AGGLFSI K +FE +GTYD   +IWGGEN+E+SF
Sbjct: 337 FGWETLPDHEKQRRKDETYPIKTPTFAGGLFSISKDYFEYIGTYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSGMCIDS 223
               +  ++  +  FGD++ R E++  L CK+F WYL          +++   SG  I S
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGY-IKS 511

Query: 224 ACKPTDMH--------KPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
             +P  +         KP+ LY CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 512 VGRPLCLDVGENNQGGKPLILYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAALGAVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|332265851|ref|XP_003281927.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
           1 [Nomascus leucogenys]
          Length = 556

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 103/315 (32%), Positives = 146/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+          +     GGFDWNL 
Sbjct: 212 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 264

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 265 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 324

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 325 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 375

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 376 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 435

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 436 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 493

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 494 KLQGCRENDSRQKWE 508


>gi|417403183|gb|JAA48410.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
           rotundus]
          Length = 599

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFP--PGRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +  + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 264 CECFHGWLEPLLARITEDETAVVSPDIVTIDLNTFEFSKPVQKGRVHSR-----GNFDWS 318

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  ER+R K+  +P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 319 LTFGWETLPAHERQRRKDETDPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 378

Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + ++   Y 
Sbjct: 379 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTNVIARNQVRLAEVWMDE---YK 432

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     +  +++ +  FGD++ R +LR  L C++F WYL                   
Sbjct: 433 EIFYRRNIQAAKMAREKSFGDISERLQLREQLHCRNFSWYLHNIYPEMFVPDLKPTFYGA 492

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N     C+D   +     KP+ +YPCH  GGNQ++  +   ++R + A   CL  + G
Sbjct: 493 IKNLGIDQCLDVG-ENNRGGKPLIMYPCHSLGGNQYFEYTTQRDLRHNIAKQLCLHASAG 551

Query: 269 DVILYPCHGSKGNQYFEYD 287
            + L  C  +  N     D
Sbjct: 552 TLGLRGCQFTVKNSQVPKD 570


>gi|417412000|gb|JAA52417.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Desmodus rotundus]
          Length = 624

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFP--PGRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +  + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 274 CECFHGWLEPLLARITEDETAVVSPDIVTIDLNTFEFSKPVQKGRVHSR-----GNFDWS 328

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  ER+R K+  +P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 329 LTFGWETLPAHERQRRKDETDPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 388

Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + ++   Y 
Sbjct: 389 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTNVIARNQVRLAEVWMDE---YK 442

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     +  +++ +  FGD++ R +LR  L C++F WYL                   
Sbjct: 443 EIFYRRNIQAAKMAREKSFGDISERLQLREQLHCRNFSWYLHNIYPEMFVPDLKPTFYGA 502

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N     C+D   +     KP+ +YPCH  GGNQ++  +   ++R + A   CL  + G
Sbjct: 503 IKNLGIDQCLDVG-ENNRGGKPLIMYPCHSLGGNQYFEYTTQRDLRHNIAKQLCLHASAG 561

Query: 269 DVILYPCHGSKGNQYFEYD 287
            + L  C  +  N     D
Sbjct: 562 TLGLRGCQFTVKNSQVPKD 580


>gi|334330196|ref|XP_003341314.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 3-like [Monodelphis
           domestica]
          Length = 631

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 104/315 (33%), Positives = 154/315 (48%), Gaps = 40/315 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I   TFE   P    ++  +   G FDW+L 
Sbjct: 278 CECFYGWLEPLLSRIAENYTAVVSPDIASIDLTTFEFSKPSPYGSNHNR---GNFDWSLS 334

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +FE +GTYD    IWGGEN+E+SF
Sbjct: 335 FGWESLPDHEKQRRKDETYPIRTPTFAGGLFSISKKYFEYIGTYDEEMKIWGGENIEMSF 394

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   +   F 
Sbjct: 395 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---FKEIFY 450

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               E  ++  +  +GD++ R ++R  L CK+F WYL                   + N 
Sbjct: 451 RRNTEAAKIVKQKAYGDISKRLDIRHRLQCKNFTWYLNNIYPEIYVPDLNPVISGYIQNI 510

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR---RDEACLDYAGGDVIL 272
              +C+D   +     KP+ +Y CH  GGNQ++  S+  EIR   + E CL    G V +
Sbjct: 511 GRHLCLDVG-ENNQGGKPLIMYTCHFLGGNQYFEXSEQHEIRHSIQKELCLHALQGPVQM 569

Query: 273 YPCHGSKGNQYFEYD 287
             C   KG + F  D
Sbjct: 570 KAC-SYKGQKTFTVD 583


>gi|26338209|dbj|BAC32790.1| unnamed protein product [Mus musculus]
          Length = 570

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 103/314 (32%), Positives = 147/314 (46%), Gaps = 51/314 (16%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           E  +RWL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL F
Sbjct: 227 ECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLVF 279

Query: 65  NW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
            W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+SF
Sbjct: 280 KWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISF 339

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +      +   IP            P   P  +G +F+ +              ++W  E
Sbjct: 340 RVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMDE 390

Query: 179 NLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SND 215
                +          +G++ SR ELR+ LGCK FKWYL+                 +  
Sbjct: 391 YKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGALQ 450

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI- 271
               C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D + G +I 
Sbjct: 451 QGTNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLIR 508

Query: 272 LYPCHGSKGNQYFE 285
           L  C  +   Q +E
Sbjct: 509 LQGCRENDSRQKWE 522


>gi|301772392|ref|XP_002921627.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
           [Ailuropoda melanoleuca]
          Length = 622

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 100/296 (33%), Positives = 144/296 (48%), Gaps = 36/296 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELR--FPPGRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A   + VVSP I  I  +TFE     P GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEEETAVVSPDIVTIDLNTFEFSKPVPSGRIHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W A+P  E++R K+   P+ +PT AGGLFSI KA+FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWEALPAHEKQRRKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSGFDIW 175
           SF+          IP            P   P     +        E  + +Y   F   
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPHTFPKGISVIARNQVRLAEVWMDSYKEIFYRR 446

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPT--DMHKP 233
             +  +++ +  FGD++ R +LR  L C++F W+L  +N +  M +    KPT     + 
Sbjct: 447 NMQAAKMAQEKSFGDISERLKLREQLHCRNFSWFL--TNIYPEMFVPD-LKPTFYGAIRN 503

Query: 234 VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
           +G+  C   G N                  ++ G  +I+Y CHG  GNQYFEY  +
Sbjct: 504 LGINQCLDVGEN------------------NHGGKPLIMYTCHGLGGNQYFEYTTR 541


>gi|281348732|gb|EFB24316.1| hypothetical protein PANDA_010523 [Ailuropoda melanoleuca]
          Length = 621

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 100/296 (33%), Positives = 144/296 (48%), Gaps = 36/296 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELR--FPPGRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A   + VVSP I  I  +TFE     P GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEEETAVVSPDIVTIDLNTFEFSKPVPSGRIHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W A+P  E++R K+   P+ +PT AGGLFSI KA+FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWEALPAHEKQRRKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSGFDIW 175
           SF+          IP            P   P     +        E  + +Y   F   
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPHTFPKGISVIARNQVRLAEVWMDSYKEIFYRR 446

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPT--DMHKP 233
             +  +++ +  FGD++ R +LR  L C++F W+L  +N +  M +    KPT     + 
Sbjct: 447 NMQAAKMAQEKSFGDISERLKLREQLHCRNFSWFL--TNIYPEMFVPD-LKPTFYGAIRN 503

Query: 234 VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
           +G+  C   G N                  ++ G  +I+Y CHG  GNQYFEY  +
Sbjct: 504 LGINQCLDVGEN------------------NHGGKPLIMYTCHGLGGNQYFEYTTR 541


>gi|410975135|ref|XP_003993990.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Felis
           catus]
          Length = 653

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 105/318 (33%), Positives = 149/318 (46%), Gaps = 57/318 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+          +     GGFDWNL 
Sbjct: 309 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 361

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 362 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 421

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   +P        RK+H     P   P  +G +F+ +              
Sbjct: 422 FRVWQCGGSLEIVPCSRVGHVFRKQH-----PYTFPGGSGTVFARNTR---------RAA 467

Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-----------VSN 214
           ++W  E     +          +G++ SR ELR+ L CK FKWYLE              
Sbjct: 468 EVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDI 527

Query: 215 DWSGMCIDSACKPTDMH---KPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGG 268
            +  +   + C  T  H     VG+Y CH  GGNQ W ++K   ++  + CL   D   G
Sbjct: 528 AFGALQQGTNCLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRTPG 587

Query: 269 DVI-LYPCHGSKGNQYFE 285
            VI L  C  +   Q +E
Sbjct: 588 SVIKLQGCRENDSRQKWE 605


>gi|1575723|gb|AAB09579.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase-T3 [Mus
           musculus]
          Length = 633

 Score =  151 bits (381), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E+++ L CK+F WYL                   + + 
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYLNTIYPEAYVPDLNPVISGYIKSV 512

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ LY CH  GGNQ++  S   EIR +   E CL    G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQREIRHNIQKELCLHATQGVVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|403300209|ref|XP_003940844.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Saimiri
           boliviensis boliviensis]
          Length = 724

 Score =  151 bits (381), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 103/315 (32%), Positives = 145/315 (46%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+          +     GGFDWNL 
Sbjct: 380 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 432

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE LG YD   D+WGGENLE+S
Sbjct: 433 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEALGKYDMMMDVWGGENLEIS 492

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 493 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 543

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 544 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 603

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
                C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 604 QQGTNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 661

Query: 272 -LYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 662 KLQGCRENDSRQKWE 676


>gi|405973911|gb|EKC38600.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Crassostrea gigas]
          Length = 581

 Score =  150 bits (380), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 103/306 (33%), Positives = 144/306 (47%), Gaps = 51/306 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLLD +  + + VVSP+I  I  D FE       L        GGFDWNL 
Sbjct: 236 CECNVGWLEPLLDRIKGDRTRVVSPIIDVINMDNFEYIGASADLK-------GGFDWNLV 288

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE   KR  N  +P+ TP +AGGLFSI+K +FE+LG YD   D+WGGENLE+S
Sbjct: 289 FKWDYMTPEERNKRAGNPIQPIRTPMIAGGLFSIEKKWFEELGKYDRNMDVWGGENLEIS 348

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 349 FRVWQCHGSLEIIPCSRVGHVFRKQHPYTFPGGSGNVFARNTR---------RAAEVWMD 399

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
              E  +         +FGD++ R +LR+ L CK FKW+LE                 S 
Sbjct: 400 NYKEFYYAAVPSAKMVNFGDISERMDLRKRLSCKPFKWFLEHVYPELKVPGHQDQAFGSI 459

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG---GDVI 271
                C+D+     D    +G++PCH  GGNQ + ++K G IR  + C+   G   G V+
Sbjct: 460 QQDNNCMDTLGNFAD--GILGIFPCHFAGGNQEFSLTKEGFIRHLDLCVTLTGSMPGTVV 517

Query: 272 -LYPCH 276
            L+ C 
Sbjct: 518 KLFQCQ 523


>gi|162951828|ref|NP_056551.2| polypeptide N-acetylgalactosaminyltransferase 3 [Mus musculus]
 gi|341941092|sp|P70419.3|GALT3_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
           AltName: Full=Polypeptide GalNAc transferase 3;
           Short=GalNAc-T3; Short=pp-GaNTase 3; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 3;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 3
 gi|74183238|dbj|BAE22551.1| unnamed protein product [Mus musculus]
 gi|148695061|gb|EDL27008.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 [Mus musculus]
          Length = 633

 Score =  150 bits (380), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E+++ L CK+F WYL                   + + 
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYLNTIYPEAYVPDLNPVISGYIKSV 512

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ LY CH  GGNQ++  S   EIR +   E CL    G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQREIRHNIQKELCLHATQGVVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|345319818|ref|XP_001521442.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Ornithorhynchus anatinus]
          Length = 628

 Score =  150 bits (380), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 103/318 (32%), Positives = 148/318 (46%), Gaps = 57/318 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+          +     GGFDWNL 
Sbjct: 284 CECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 336

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK++FE+LG YD   D+WGGENLE+S
Sbjct: 337 FKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENLEIS 396

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   +P        RK+H     P   P  +G +F+ +              
Sbjct: 397 FRVWQCGGSLEIVPCSRVGHVFRKQH-----PYTFPGGSGTVFARNTR---------RAA 442

Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-----------VSN 214
           ++W  E     +          +G++ SR ELR+ L CK FKWYLE              
Sbjct: 443 EVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKRLSCKPFKWYLENVYPELRVPDHQDI 502

Query: 215 DWSGMCIDSACKPTDMH---KPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----G 267
            +  +   + C  T  H     VG+Y CH  GGNQ W ++K   ++  + CL       G
Sbjct: 503 AFGALQQGTNCLDTLGHFADGVVGVYECHNAGGNQEWALTKDRSVKHMDLCLTVVERTPG 562

Query: 268 GDVILYPCHGSKGNQYFE 285
             V L  C  +   Q +E
Sbjct: 563 ALVKLQGCRENDSRQKWE 580


>gi|157128332|ref|XP_001661405.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108872614|gb|EAT36839.1| AAEL011095-PA [Aedes aegypti]
          Length = 573

 Score =  150 bits (380), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 105/286 (36%), Positives = 141/286 (49%), Gaps = 42/286 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  + + VV P+I  I  DTF+       L        GGFDWNL 
Sbjct: 231 CECNVDWLEPLLIRVKEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWNLV 283

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   ER +R K+   P+ TP +AGGLF IDK +FEKLG YD+  DIWGGENLE+S
Sbjct: 284 FKWEYLSTAERHERQKDPTTPIRTPMIAGGLFVIDKVYFEKLGKYDTQMDIWGGENLEIS 343

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSG 171
           F+      +   IP        RKRH     P   P   +G +F+ +     ++   D  
Sbjct: 344 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGGSGNIFAKNTRRAAEVWMDD-- 396

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSA------- 224
           +  +    + L+    FGD+  R EL+  L CK FKWYL  +N +  + I          
Sbjct: 397 YKQYYYAAVPLAKNIPFGDIEERMELKERLQCKPFKWYL--ANVYPQLTIPEQQTKGSLR 454

Query: 225 ----CKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACL 263
               C  T  H     VGLY CH  GGNQ W ++K G+I+  + CL
Sbjct: 455 QGPYCMDTLGHLVDGIVGLYQCHDSGGNQDWAITKKGQIKHLDLCL 500


>gi|449666442|ref|XP_002161887.2| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 6-like [Hydra
           magnipapillata]
          Length = 591

 Score =  150 bits (380), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 109/322 (33%), Positives = 146/322 (45%), Gaps = 62/322 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  L  N    V P I  I    FE     G          G F W L 
Sbjct: 230 CEASFGWLEPLLARLQENPKLAVVPDIEVISFKNFEYSSEKGSYNR------GIFSWELM 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD---------SGFDIW 114
           FNW  +P RE+ R K  ++P+ +PTMAGGLF++++ +F + G YD              W
Sbjct: 284 FNWGPLPPREKMRRKYESDPIKSPTMAGGLFAMNRKYFFESGAYDRQNILGRXXXXLTYW 343

Query: 115 GGENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           GGEN+E+SF+          IP            P  +P  +    SI  A         
Sbjct: 344 GGENVEMSFRLWMCGEGIEIIPCSRVGHVFRERAPYKSPDGSTDHNSIRVA--------- 394

Query: 170 SGFDIWGGENLEL--SFKGDF-----GDVTSRKELRRNLGCKSFKWYLE----------- 211
              ++W  E  E+  SF+ +      GDV+ RK+LR +L CKSFKWYL+           
Sbjct: 395 ---EVWMDEFKEIFYSFRANLKPEQGGDVSERKKLREDLKCKSFKWYLQNIIPELEIPDK 451

Query: 212 -------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD 264
                  V N  +  C+D+  +     KP GLYPCHK G NQ+++ +K  EI  D  CLD
Sbjct: 452 YPYGRGDVKNLGTLSCLDTLAQNNQGGKP-GLYPCHKMGTNQYFIFTKKFEIWHDGLCLD 510

Query: 265 YAGGD----VILYPCHGSKGNQ 282
            +  D    V L+PCH   GNQ
Sbjct: 511 LSDSDLNAKVKLWPCHKQGGNQ 532



 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 43/73 (58%), Gaps = 4/73 (5%)

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD--EACLDYAGGDVILYPC 275
           G+C+D     +D++  V L+PCHKQGGNQ W  +K G I  +  + CL+  G  +++  C
Sbjct: 506 GLCLD--LSDSDLNAKVKLWPCHKQGGNQKWKHTKSGLIMHESRKKCLEGQGDQILIRAC 563

Query: 276 HGSKGNQYFEYDY 288
             +  NQ + +++
Sbjct: 564 DTNNANQRWLFEH 576


>gi|170038567|ref|XP_001847120.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
 gi|167882319|gb|EDS45702.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
          Length = 494

 Score =  150 bits (380), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 146/313 (46%), Gaps = 48/313 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL+  LDV+A +   +  P I  I ++T  L         + + + G FDW + 
Sbjct: 156 CEVIVGWLEAQLDVVAADPQTIAIPSIDWIHEETMALN------AQNSQLYFGSFDWTVN 209

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W +  E++ K  +N   P  TP MAGGLF+I++ FFE LG YD GF  +G EN+ELSF
Sbjct: 210 FQWKSRAEKKVK-PENPVAPFDTPVMAGGLFTINRTFFEHLGWYDEGFQTYGAENMELSF 268

Query: 124 KF----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           K      +  I    R  H       +  +  GG  ++ +             ++W  E 
Sbjct: 269 KTWMCGGFMKIVPCSRVAHIQKRGHPYLASSPGGFNAVKRNTVRLA-------EVWLDEY 321

Query: 180 LELSF--------KGDFGDVTSRKELRRNLGCKSFKWYLEV---------------SNDW 216
            E  +        +GDFGDV+SRK+LR  L C+ F+WY+E                    
Sbjct: 322 AEYYYESFGGRKNRGDFGDVSSRKKLRARLNCRPFRWYMETVFPEQFDPSKAVGRGQFRI 381

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCH 276
            G C+D   K       + +  CH  GG+Q W  +  GEI R++ C+D+    + +  CH
Sbjct: 382 GGGCLDWPTK-------LSVIGCHGLGGHQLWFFTADGEITREDHCMDFDSKKLEMIRCH 434

Query: 277 GSKGNQYFEYDYK 289
             KGNQ + ++ K
Sbjct: 435 KQKGNQMWVFEEK 447


>gi|324507788|gb|ADY43296.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Ascaris suum]
          Length = 580

 Score =  150 bits (379), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 105/310 (33%), Positives = 143/310 (46%), Gaps = 50/310 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE   +WL+PLL  +  N   VV+P+I  I  DTF        L        GGF+WNL 
Sbjct: 232 CECNVQWLEPLLARVKENPHAVVAPIIDVINMDTFNYVAASADLR-------GGFEWNLV 284

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +  + R  RH +   P+ TP +AGGLF I K +FE LGTYD   D+WGGENLELS
Sbjct: 285 FKWEYLSGKLRDDRHSHPTLPIKTPVIAGGLFMIRKDWFETLGTYDPDMDVWGGENLELS 344

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F  +              ++W  
Sbjct: 345 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGNVFQKNTR---------RAAEVWLD 395

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE----------------VSN 214
           +   L  K        DFGD++ R +L+  L CK+F WYL+                ++ 
Sbjct: 396 DYKMLYLKQVPSARFVDFGDISERLKLKEQLHCKNFTWYLKEVYPELKIPEREDGLYLTF 455

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACL-DYAGGDV 270
             +G+CIDS  K T  H PVG+Y CH  GGNQ W+  K     ++   + C+ D   G V
Sbjct: 456 KQAGLCIDSLGKQT-AHSPVGVYSCHGTGGNQEWVFDKQKGTLKNPFTKLCMSDSDIGVV 514

Query: 271 ILYPCHGSKG 280
            L  C  + G
Sbjct: 515 SLQKCETADG 524


>gi|357624672|gb|EHJ75362.1| hypothetical protein KGM_04161 [Danaus plexippus]
          Length = 771

 Score =  150 bits (379), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 103/326 (31%), Positives = 148/326 (45%), Gaps = 63/326 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL+PLL  ++     V++PLI  I   TFEL        ++ +F +GGF +   
Sbjct: 414 CEVNVDWLRPLLQRISHKRDAVLTPLIDVIDQSTFELE-------AAQQFQVGGFTFMGH 466

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +PERE++R  +   P W+PTMAGGLF+I++ ++ +LG YD     WGGENLE+SF
Sbjct: 467 FTWIEVPEREKRRRGSDIAPTWSPTMAGGLFAINRQYYWELGAYDEQMAGWGGENLEMSF 526

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP--TMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
           +          +P         A  P   P  T   G+ +   A            ++W 
Sbjct: 527 RIWQCGGTLETVPCSRVGHVFRAFHPYGLPAHTDTHGINTARMA------------EVWM 574

Query: 177 GENLELSF--------KGDFGDVTSRKELRRNLGCKSFKWYLE----------------- 211
            E  EL +            GDVT RK LR  L CKSF+WYL+                 
Sbjct: 575 DEYAELFYLNRPDLRKSPKIGDVTHRKILREKLKCKSFQWYLDNIYKEKFVPVRDVFGYG 634

Query: 212 -VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDY---- 265
              N  S MC+D+  +  +    +GLYPCH +    Q   +S  GE+R +E C +     
Sbjct: 635 RFMNPSSAMCLDTLQREGEA-TALGLYPCHSRLEPTQHLALSLAGELRDEEKCAEVQSPV 693

Query: 266 -----AGGDVILYPCHGSKGNQYFEY 286
                    V++  CHG    Q++ Y
Sbjct: 694 GSNENVSRRVLMVTCHGKHRGQHWRY 719



 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 64/199 (32%), Positives = 88/199 (44%), Gaps = 32/199 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFE--------------LRFPPGRLTS 49
           CEVQ+ WL+PLL  +      VV P+I  I    F                R    R+  
Sbjct: 344 CEVQEDWLRPLLQRIRDFPHAVVVPIIDVIESSNFYYSVQDPVIFQGLILARISGARIAR 403

Query: 50  SYKFFIGGFDWNLQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD- 108
                       +  +W   P  +R  HK  A  V TP +      ID++ FE       
Sbjct: 404 GDVLIFLDSHCEVNVDWLR-PLLQRISHKRDA--VLTPLID----VIDQSTFELEAAQQF 456

Query: 109 --SGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG 166
              GF   G         F W  +PERE++R  +   P W+PTMAGGLF+I++ ++ +LG
Sbjct: 457 QVGGFTFMG--------HFTWIEVPEREKRRRGSDIAPTWSPTMAGGLFAINRQYYWELG 508

Query: 167 TYDSGFDIWGGENLELSFK 185
            YD     WGGENLE+SF+
Sbjct: 509 AYDEQMAGWGGENLEMSFR 527


>gi|348580113|ref|XP_003475823.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
           [Cavia porcellus]
          Length = 622

 Score =  150 bits (379), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 102/313 (32%), Positives = 148/313 (47%), Gaps = 47/313 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELR--FPPGRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A N   VVSP I  I  +TFE     P GR+ S      G FDW 
Sbjct: 272 CECFHGWLEPLLARIAENKMAVVSPDIVTINLNTFEFSKPIPEGRIHSR-----GNFDWI 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W A+P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWEALPAHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + +    Y 
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMDD---YK 440

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     +  +++ +  FGD++ R +LR  L C +F W+L                   
Sbjct: 441 KIFYRRNLQAAKIAQEKSFGDISERLQLRERLHCHNFSWFLSNIYPEMFVPDLSPTFYGA 500

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N     C+D   +     KP+ +Y CH  GGNQ++  +   E+R + A   CL    G
Sbjct: 501 IKNLGINQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRELRHNVAKQLCLHARAG 559

Query: 269 DVILYPCHGSKGN 281
            + L  CH +  N
Sbjct: 560 TLGLRACHFTGKN 572


>gi|313241234|emb|CBY33515.1| unnamed protein product [Oikopleura dioica]
          Length = 603

 Score =  150 bits (379), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 100/312 (32%), Positives = 150/312 (48%), Gaps = 50/312 (16%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           E    WL+PLL  +A + S V  P+I+ I    F         ++S +  IGGFDW L F
Sbjct: 249 ECNNGWLEPLLQRIAEDDSVVAVPIISTIAWQDFAFHHS----SNSIEPQIGGFDWRLTF 304

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
            WH+IP+  + + K   +PV TPTMAGGLF++ + +F  +G+YD+G ++WGGENLE+SF+
Sbjct: 305 QWHSIPDEIKAKRKADTDPVPTPTMAGGLFAVSRQYFRSIGSYDTGMEVWGGENLEMSFR 364

Query: 125 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS--GFDIWGGE---- 178
             W              +  +   ++ G +F     +  K  T ++    ++W  +    
Sbjct: 365 V-WMC----------GGSLEIIPCSIVGHVFPKTAPYERKSFTPNTVRAVEVWLDDYKRH 413

Query: 179 ---NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSGM--------- 219
                 LS    +GD++ R  LR  L CKSF+WYLE       V  D  G          
Sbjct: 414 FYARNPLSKDEKYGDISERVNLRNGLECKSFQWYLENIYPDLPVPEDTPGQFGALHNKGS 473

Query: 220 ---CIDSACKPTDM-HKPVGLYPCHKQGGNQFWMMSKHGEIR---RDEACL---DYAGGD 269
              C+D      D+ H  VG + CH QGGNQF+  +  G +R   + E C+   D   G+
Sbjct: 474 PSRCLDYNPPENDLTHGVVGTFGCHGQGGNQFFEFNSKGHLRYTSQFELCIAKKDDNSGE 533

Query: 270 VILYPCHGSKGN 281
           +    C+G   N
Sbjct: 534 IAAVMCNGKNVN 545


>gi|27696612|gb|AAH43331.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 [Mus musculus]
          Length = 633

 Score =  150 bits (379), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 101/303 (33%), Positives = 150/303 (49%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P     +  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGNNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E+++ L CK+F WYL                   + + 
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYLNTIYPEAYVPDLNPVISGYIKSV 512

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ LY CH  GGNQ++  S   EIR +   E CL    G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQREIRHNIQKELCLHATQGVVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|62148926|dbj|BAD93347.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase-3 [Rattus
           norvegicus]
          Length = 633

 Score =  150 bits (379), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 100/303 (33%), Positives = 152/303 (50%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI + +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISRDYFEHIGSYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R+K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRNKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E+++ L CK+F WYL                   + + 
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYLNTIYPEVYVPDLNPVISGYIKSV 512

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ LY CH  GGNQ++  S   EIR +   E CL    G V L
Sbjct: 513 GQPLCLDVG-ENNQGDKPLILYTCHGLGGNQYFEYSAQREIRHNIQKELCLHATQGVVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|405966237|gb|EKC31544.1| Putative polypeptide N-acetylgalactosaminyltransferase 9, partial
           [Crassostrea gigas]
          Length = 513

 Score =  150 bits (378), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 99/324 (30%), Positives = 154/324 (47%), Gaps = 67/324 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  K WL+PLL+ +A +  +V  P   +I  DTFE +      +S    F+GGFD++L 
Sbjct: 145 CECTKGWLEPLLNEIADDYRNVAIPFTDSIDADTFEYKG-----SSLNYVFVGGFDFDLH 199

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P+RE+ R +   +P+W+PT  G   +I K FF++LG YD+   IWGGENLELSF
Sbjct: 200 FAWRVMPDREQNRRRLLTDPIWSPTHLGCCLAISKRFFDELGRYDNELQIWGGENLELSF 259

Query: 124 K-------------------------FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSID 158
           K                         ++W     R   R+      VW       +    
Sbjct: 260 KTWMCGGKMKIIPCSHVGHVFRHKMPYSWGKDGYRTFIRNSLRVAEVW-------MDQYK 312

Query: 159 KAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-------- 210
           + +++++  Y S  +I            D G+++SRK +R+ L CK F WYL        
Sbjct: 313 EVYYDRI--YYSQNEI------------DIGNISSRKAIRQRLHCKPFDWYLKNVYPELY 358

Query: 211 -----EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY 265
                + +   + MCIDS    +   KP+    C   G +Q W  ++   IRRDE CL +
Sbjct: 359 IPRDCKATGQINNMCIDSYTGGSFYGKPISARECIHLGTSQHWTWTRENTIRRDEGCLVF 418

Query: 266 AG-GDVILYPCHGSKGNQYFEYDY 288
            G   V++ PC  +  ++Y +++Y
Sbjct: 419 DGISRVLMGPC--ATLSKYLQWEY 440


>gi|321477075|gb|EFX88034.1| hypothetical protein DAPPUDRAFT_305669 [Daphnia pulex]
          Length = 553

 Score =  150 bits (378), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 104/313 (33%), Positives = 146/313 (46%), Gaps = 56/313 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +  + + +V P+I  I  D+F+       L        GGFDWNL 
Sbjct: 206 CECNEGWLEPLLARVVEDRTRIVCPVIDVIAMDSFQYIAASTELR-------GGFDWNLV 258

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +P  E+  R  +   P+ TP +AGGLF ID+ +F+KLG+YD   DIWGGENLE+S
Sbjct: 259 FKWELLPAEEKANRKTDPTIPIRTPMIAGGLFVIDRQYFQKLGSYDLQMDIWGGENLEIS 318

Query: 123 FKFNWHAIPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           F+  W      E           RK+H     P   P  +G +F+ +     ++   D  
Sbjct: 319 FR-TWQCGGRLEIVPCSRVGHVFRKQH-----PYSFPGGSGTIFARNTRRAAEVWMDD-- 370

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------VSNDWSG-- 218
           +  +    + ++    FG++T R  LR +L CK FKWY+E              D SG  
Sbjct: 371 YKKYYFAAVPMARTVTFGNITDRLALRNSLNCKPFKWYVENVYPELLKHLPTVRDPSGTN 430

Query: 219 --------MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL-----DY 265
                   +C D+  +    H  +GLY CH  GGNQ W    +G +R    CL      Y
Sbjct: 431 SGAIKYKSLCFDTYGRGAGSH--IGLYACHMTGGNQAWTY-LNGRLRHGSWCLAPPTPAY 487

Query: 266 AGGDVILYPCHGS 278
            G  VI  PC  S
Sbjct: 488 VGAQVITLPCSSS 500


>gi|397507787|ref|XP_003824367.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Pan
           paniscus]
          Length = 633

 Score =  150 bits (378), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E++  L CK+F WYL                   + + 
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ +Y CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 513 GQSLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KGC 574


>gi|170046214|ref|XP_001850669.1| polypeptide N-acetylgalactosaminyltransferase 2 [Culex
           quinquefasciatus]
 gi|167869055|gb|EDS32438.1| polypeptide N-acetylgalactosaminyltransferase 2 [Culex
           quinquefasciatus]
          Length = 576

 Score =  150 bits (378), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 104/286 (36%), Positives = 142/286 (49%), Gaps = 42/286 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  + + VV P+I  I  DTF+       L        GGFDWNL 
Sbjct: 234 CECNVDWLEPLLVRVQEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWNLV 286

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   ER +R K+   P+ TP +AGGLF IDKA+FEKLG YD+  DIWGGENLE+S
Sbjct: 287 FKWEYLSNAERHERQKDPTTPIRTPMIAGGLFVIDKAYFEKLGKYDTQMDIWGGENLEIS 346

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSG 171
           F+      +   IP        RKRH     P   P   +G +F+ +     ++   D  
Sbjct: 347 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGGSGNIFAKNTRRAAEVWMDD-- 399

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV--------------SNDWS 217
           +  +    + L+    FG++  R +L+  L CK+FKWYL+               S    
Sbjct: 400 YKQYYYAAVPLAKNIPFGNIDERLQLKEQLECKNFKWYLDNVYPQLTIPEQQTKGSLRQG 459

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL 263
             CID+     D    VGLY CH  GGNQ W ++K G+I+  + CL
Sbjct: 460 PYCIDTLGHLVD--GIVGLYHCHNSGGNQDWAITKSGQIKHLDLCL 503


>gi|432112638|gb|ELK35354.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Myotis davidii]
          Length = 416

 Score =  150 bits (378), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 101/319 (31%), Positives = 152/319 (47%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFP--PGRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +  + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 66  CECFHGWLEPLLARITEDETAVVSPDIVTIDLNTFEFSKPVQKGRVHSR-----GNFDWS 120

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 121 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 180

Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + +   +Y 
Sbjct: 181 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTNVIARNQVRLAEVWMD---SYK 234

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     E  +++ +  FGD++ R +LR  L C++F W+L                   
Sbjct: 235 EIFYRRNMEAAKMAQEKTFGDISERLQLREQLHCRNFSWFLHNIYPELFIPDLKPTFYGA 294

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N     C+D   K     KP+ +Y CH  GGNQ++  +   ++R + A   CL  + G
Sbjct: 295 IKNLGINQCLDVGEK-NHGGKPLIMYACHGLGGNQYFEYTTQRDLRHNIAKQLCLHASAG 353

Query: 269 DVILYPCHGSKGNQYFEYD 287
            + L  CH +  N     D
Sbjct: 354 TLGLRSCHFTGKNSQVPKD 372


>gi|332234083|ref|XP_003266237.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Nomascus leucogenys]
          Length = 633

 Score =  150 bits (378), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E++  L CK+F WYL                   + + 
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ +Y CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|402888519|ref|XP_003907606.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Papio
           anubis]
          Length = 633

 Score =  150 bits (378), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E++  L CK+F WYL                   + + 
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ +Y CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|75832150|ref|NP_001015032.2| polypeptide N-acetylgalactosaminyltransferase 3 [Rattus norvegicus]
 gi|74353669|gb|AAI01887.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Rattus
           norvegicus]
 gi|149022135|gb|EDL79029.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 [Rattus norvegicus]
          Length = 633

 Score =  150 bits (378), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 100/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI + +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISRDYFEHIGSYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E+++ L CK+F WYL                   + + 
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYLNTIYPEVYVPDLNPVISGYIKSV 512

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ LY CH  GGNQ++  S   EIR +   E CL    G V L
Sbjct: 513 GQPLCLDVG-ENNQGDKPLILYTCHGLGGNQYFEYSAQREIRHNIQKELCLHATQGVVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|297668747|ref|XP_002812581.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
           1 [Pongo abelii]
 gi|297668749|ref|XP_002812582.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
           2 [Pongo abelii]
 gi|297668751|ref|XP_002812583.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
           3 [Pongo abelii]
          Length = 633

 Score =  150 bits (378), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E++  L CK+F WYL                   + + 
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ +Y CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|109099998|ref|XP_001096023.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
           1 [Macaca mulatta]
 gi|297264195|ref|XP_002798936.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
           2 [Macaca mulatta]
 gi|355564937|gb|EHH21426.1| hypothetical protein EGK_04492 [Macaca mulatta]
 gi|355750584|gb|EHH54911.1| hypothetical protein EGM_04018 [Macaca fascicularis]
          Length = 633

 Score =  150 bits (378), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E++  L CK+F WYL                   + + 
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ +Y CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|189066640|dbj|BAG36187.1| unnamed protein product [Homo sapiens]
          Length = 633

 Score =  150 bits (378), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 105/303 (34%), Positives = 154/303 (50%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E +R K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEEQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSGMCIDS 223
               +  ++  +  FGD++ R E++  L CK+F WYL          +++   SG  I S
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGY-IKS 511

Query: 224 ACKPTDMH--------KPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
           A +P  +         KP+ +Y CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 512 AGQPLCLDVGENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|426372562|ref|XP_004053192.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Gorilla
           gorilla gorilla]
          Length = 622

 Score =  150 bits (378), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 101/319 (31%), Positives = 154/319 (48%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIPE-------RERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + +   +Y 
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     +  +++ +  FGD++ R +LR  L C +F WYL                   
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 500

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R + A   CL  + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559

Query: 269 DVILYPCHGSKGNQYFEYD 287
            + L  CH +  N +   D
Sbjct: 560 ALGLGSCHFTGKNSHVPKD 578


>gi|339244173|ref|XP_003378012.1| polypeptide N-acetylgalactosaminyltransferase 3 [Trichinella
           spiralis]
 gi|316973116|gb|EFV56743.1| polypeptide N-acetylgalactosaminyltransferase 3 [Trichinella
           spiralis]
          Length = 670

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 105/320 (32%), Positives = 150/320 (46%), Gaps = 53/320 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV   WL+PLL  ++ + + VV+P+I  I DDTF+         ++ +   GGF W + 
Sbjct: 227 VEVTDGWLEPLLSRISEDRTRVVAPVIDVISDDTFQY-------VTAAESTWGGFSWTMN 279

Query: 64  FNWHAIPERERKRH-KNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+    RE+KR  KN   P+ TPT+AGGLFSID+ +F  +G YD G  IWGGENLE+S
Sbjct: 280 FRWYQASAREQKRRGKNKTTPIRTPTIAGGLFSIDRKYFFDIGAYDEGMRIWGGENLEIS 339

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG  ++        G      ++W  E 
Sbjct: 340 FRV-WMCGGTLEINPCSHVGHVFRKQTPYTFEGGTSNV------IYGNARRTAEVWMDEY 392

Query: 180 LELSFK-------GDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSND 215
            E  +K          G+++ R  LR+ LGCKSFKWYL+                 + N+
Sbjct: 393 KEFYYKMTPSAMFAPLGNISDRIALRKRLGCKSFKWYLKNIYPESNIPPTYYSIGYIKNE 452

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQ-----FWMMSKHGEIRRDEACLDY---AG 267
            + +C+D+  +         L  CH  GGNQ      W  +    IR DE CL     A 
Sbjct: 453 KNDLCLDTMGRKASGSP--ALLTCHNSGGNQVLFMKVWSYTGTLNIRADELCLQASRKAD 510

Query: 268 GDVILYPCHGSKGNQYFEYD 287
             + L  C+  + +Q ++YD
Sbjct: 511 SPIFLQQCNNDE-SQIWDYD 529


>gi|196001847|ref|XP_002110791.1| hypothetical protein TRIADDRAFT_22565 [Trichoplax adhaerens]
 gi|190586742|gb|EDV26795.1| hypothetical protein TRIADDRAFT_22565 [Trichoplax adhaerens]
          Length = 556

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 97/313 (30%), Positives = 153/313 (48%), Gaps = 51/313 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL+PLL+ + ++ + VV P I +I  + F  ++ P  +        G F+W+L 
Sbjct: 206 CEVTIGWLEPLLNRIHQDRTTVVCPEIDSIDLNNFAYKYGPSGV------LRGTFNWDLS 259

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W   P  ER R  +A +P+ +PTMAGGLF+ID+ +F +LGTYD G +IWG EN+ELSF
Sbjct: 260 FKWSIAPTSERLRRTSATDPMRSPTMAGGLFAIDREYFLELGTYDRGLEIWGAENMELSF 319

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K          IP           +P  T   +  L SI    ++++       ++W  +
Sbjct: 320 KVWQCGGKLEIIPCSHVGHVFREVQPYDT---SVSLHSIANKNYQRVA------EVWMDD 370

Query: 179 NLELSFK-------GDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
             +  ++         FGD++   +LR+ L C+SF+WYL+                  V 
Sbjct: 371 YKKFFYQRHPYLTDQSFGDISENLKLRQRLKCRSFRWYLQNVFTDVILPNETAIATGKVR 430

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA----GGD 269
           N  S MC+D+  + ++    +GL PC+ Q     +  +   EI  ++ACLD +    G  
Sbjct: 431 NPISNMCLDTFGRTSNTF--LGLSPCNIQRDTMLFAYTSRKEISWNDACLDASFIMPGFK 488

Query: 270 VILYPCHGSKGNQ 282
           + +  CH   GNQ
Sbjct: 489 IQMAECHRIGGNQ 501


>gi|153266878|ref|NP_004473.2| polypeptide N-acetylgalactosaminyltransferase 3 [Homo sapiens]
 gi|209572629|sp|Q14435.2|GALT3_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
           AltName: Full=Polypeptide GalNAc transferase 3;
           Short=GalNAc-T3; Short=pp-GaNTase 3; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 3;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 3
 gi|62822129|gb|AAY14678.1| unknown [Homo sapiens]
 gi|109731077|gb|AAI13568.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Homo
           sapiens]
 gi|109731742|gb|AAI13566.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Homo
           sapiens]
 gi|119631729|gb|EAX11324.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 (GalNAc-T3), isoform
           CRA_b [Homo sapiens]
 gi|313883200|gb|ADR83086.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 (GalNAc-T3)
           [synthetic construct]
          Length = 633

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E++  L CK+F WYL                   + + 
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ +Y CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|109096689|ref|XP_001083664.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Macaca
           mulatta]
          Length = 641

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 102/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIPE-------RERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + +   +Y 
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     +  +++ +  FGD++ R +LR  L C SF WYL                   
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHSFSWYLHNVYPEMFVPDLTPTFYGA 500

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R + A   CL  + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559

Query: 269 DVILYPCHGSKGNQYFEYD 287
            + L  CH +  N     D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578


>gi|296204662|ref|XP_002749425.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Callithrix jacchus]
          Length = 633

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 103/303 (33%), Positives = 153/303 (50%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    +   +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDMNTFEFNKPSPYGSHHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSG----- 218
               +  ++  +  FGD++ R E++  L CK+F WYL          +++   SG     
Sbjct: 453 RRNTDAAKIVKQKTFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512

Query: 219 ---MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ +Y CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 513 GHPLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|313231736|emb|CBY08849.1| unnamed protein product [Oikopleura dioica]
          Length = 603

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 99/307 (32%), Positives = 149/307 (48%), Gaps = 50/307 (16%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           WL+PLL  +A + S V  P+I+ I    F         ++S +  IGGFDW L F WH+I
Sbjct: 254 WLEPLLQRIAEDDSVVAVPIISTIAWQDFGFHHS----SNSIEPQIGGFDWQLTFQWHSI 309

Query: 70  PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA 129
           P+  + + K   +PV TPTMAGGLF++ + +F  +G+YD+G ++WGGENLE+SF+  W  
Sbjct: 310 PDEIKAKRKADTDPVPTPTMAGGLFAVSRQYFRSIGSYDTGMEVWGGENLEMSFRV-WMC 368

Query: 130 IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS--GFDIWGGE-------NL 180
                       +  +   ++ G +F     +  K  T ++    ++W  +         
Sbjct: 369 ----------GGSLEIIPCSIVGHVFPKTAPYERKSFTPNTVRAVEVWLDDYKRHFYARN 418

Query: 181 ELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSGM------------CI 221
            LS    +GD++ R  LR  L CKSF+WYLE       V  D  G             C+
Sbjct: 419 PLSKDEKYGDISERVNLRNGLECKSFQWYLENIYPDLPVPEDTPGQFGALHNKGSPSRCL 478

Query: 222 DSACKPTDM-HKPVGLYPCHKQGGNQFWMMSKHGEIR---RDEACL---DYAGGDVILYP 274
           D      D+ H  VG + CH QGGNQF+  +  G +R   + E C+   D   G++    
Sbjct: 479 DYNPPENDLTHGVVGTFGCHGQGGNQFFEFNSKGHLRYTSQFELCIAKKDDNSGEIAAVM 538

Query: 275 CHGSKGN 281
           C+G   N
Sbjct: 539 CNGKNVN 545


>gi|354487360|ref|XP_003505841.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Cricetulus griseus]
          Length = 633

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 102/302 (33%), Positives = 149/302 (49%), Gaps = 37/302 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P     +  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGNNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI + +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISREYFEHIGSYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVS---------NDWSGMCIDSA 224
               +  ++  +  FGD++ R E+++ L CK+F WYL            N      I S 
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYLNTVYPEVYVPDLNPVISGYIKSV 512

Query: 225 CKPTDMH--------KPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVILY 273
            +P  +         KP+ LY CH  GGNQ++  S   EIR +   E CL    G V L 
Sbjct: 513 GQPLCLDVGENNQGGKPLILYTCHGLGGNQYFEYSAQREIRHNIQKELCLHATQGVVQLK 572

Query: 274 PC 275
            C
Sbjct: 573 AC 574


>gi|355564239|gb|EHH20739.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Macaca mulatta]
 gi|355762987|gb|EHH62101.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Macaca
           fascicularis]
 gi|380809242|gb|AFE76496.1| polypeptide N-acetylgalactosaminyltransferase 6 [Macaca mulatta]
          Length = 622

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 102/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + +   +Y 
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     +  +++ +  FGD++ R +LR  L C SF WYL                   
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHSFSWYLHNVYPEMFVPDLTPTFYGA 500

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R + A   CL  + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559

Query: 269 DVILYPCHGSKGNQYFEYD 287
            + L  CH +  N     D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578


>gi|1617312|emb|CAA63371.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           (GalNAc-T3) [Homo sapiens]
          Length = 633

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E++  L CK+F WYL                   + + 
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLRCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ +Y CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|402886019|ref|XP_003906439.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           1 [Papio anubis]
 gi|402886021|ref|XP_003906440.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           2 [Papio anubis]
          Length = 622

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 102/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + +   +Y 
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     +  +++ +  FGD++ R +LR  L C SF WYL                   
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHSFSWYLHNVYPEMFVPDLTPTFYGA 500

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R + A   CL  + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559

Query: 269 DVILYPCHGSKGNQYFEYD 287
            + L  CH +  N     D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578


>gi|256052108|ref|XP_002569620.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
          Length = 573

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 106/318 (33%), Positives = 150/318 (47%), Gaps = 53/318 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL+ LL  ++ N   +V P+I  I  DTFE      R         G FDW   
Sbjct: 223 CEVTIGWLETLLKHISENQKRIVCPIIDVISHDTFEYLLGSDRTW-------GTFDWQFN 275

Query: 64  FNWHAIPERERKRHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F+W  + +RE  R  +    P+ TPTMAGGLF+I + +F ++G YD   +IWGGEN+ELS
Sbjct: 276 FHWETVVDREIDRINDEHNVPLRTPTMAGGLFTITREYFYEIGAYDEDMEIWGGENIELS 335

Query: 123 FKFNWHA-----IPERERKRH--KNAAEPVWTPTMAGGLFSIDKAFFEK-----LGTYDS 170
           F+  W       I    R  H  + ++   W     GG+  I    F +     L  Y  
Sbjct: 336 FRV-WQCGGELLIDPCSRVGHVFRKSSPYTW----PGGVSHILHKNFVRTALVWLDQYSR 390

Query: 171 GFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDW------- 216
            + +     L +    D+GDVT RK+LR+ L CKSF+WYLE       +  D        
Sbjct: 391 FYFMLNPSALSV----DYGDVTKRKKLRQQLNCKSFRWYLEHIYPESSIPIDVIRLGEIR 446

Query: 217 --SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD------YAGG 268
             SG C+DS      + + VG+  CH QGGNQ + +++ G IR    C+D         G
Sbjct: 447 HKSGQCLDSLGHK--LGETVGVTHCHGQGGNQVFAITESGTIRVHAGCMDGGSSKSVGTG 504

Query: 269 DVILYPCHGSKGNQYFEY 286
            ++   C     +Q FE+
Sbjct: 505 ILVFKKCEKDSISQKFEF 522


>gi|156353877|ref|XP_001623135.1| predicted protein [Nematostella vectensis]
 gi|156209801|gb|EDO31035.1| predicted protein [Nematostella vectensis]
          Length = 454

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 102/306 (33%), Positives = 135/306 (44%), Gaps = 39/306 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  K WL PLL+ +A N    V P I  I   TF+ +           +  G F+W   
Sbjct: 155 CECNKGWLPPLLERIALNRRTAVCPTIDFIDHKTFQYK-------PMDPYIRGTFNWRFD 207

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +   A+   E  + ++  + V +P MAGGLF+I++ FF +LG YD G  IWGGE  E+SF
Sbjct: 208 YKERAVRPEEMAKRRDPTQEVKSPVMAGGLFAINREFFSELGQYDPGMFIWGGEQYEISF 267

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K          IP            P   P     L +  +     +  Y      W  +
Sbjct: 268 KLWQCGGQLENIPCSRVGHVYRHHVPYTYPKHDATLVNFRRVAEVWMDEYKD----WLYD 323

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE----------------VSNDWSGMCID 222
                   D+GD++ R  LR+ L CKSFKWYLE                V N    MC+D
Sbjct: 324 KRPEIKSVDYGDISDRIALRKRLKCKSFKWYLENVANDTVKTKLCACFQVRNQGKNMCLD 383

Query: 223 SACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD----YAGGDVILYPCHGS 278
           S  +  D H  VGL  CH  GGNQ +  +   E+R DE C D    + G  V  +PCH  
Sbjct: 384 SMGR-KDGH--VGLASCHNMGGNQAFQYTYIRELRTDETCFDVHESFPGAKVHFFPCHEM 440

Query: 279 KGNQYF 284
           KGNQ F
Sbjct: 441 KGNQEF 446


>gi|410964449|ref|XP_003988767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Felis
           catus]
          Length = 622

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 102/316 (32%), Positives = 153/316 (48%), Gaps = 41/316 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELR--FPPGRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE     P GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDETVVVSPDIVTIDLNTFEFSKPVPRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W A+P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWEALPAHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY-DSGFDIW 175
           SF+          IP            P    T   G+  I +        + DS  +I+
Sbjct: 387 SFRVWQCGGQMEIIPCSVVGHVFRTKSP---HTFPKGISVIARNQVRLAEVWMDSYKEIF 443

Query: 176 GGENLE---LSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSN 214
              NL+   ++ +  FGD++ R +L+  L C++F W+L                   + N
Sbjct: 444 YRRNLQAAKMAQEKSFGDISERLQLKERLHCRNFSWFLHNIYPEMFVPDLKPTFYGAIRN 503

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGGDVI 271
                C+D   +     KP+ +Y CH  GGNQ++  +   ++R + A   CL  + G + 
Sbjct: 504 LGVDQCLDVG-ENNHGGKPLIMYTCHGLGGNQYFEYTTQRDLRHNIAKQLCLHASAGTLG 562

Query: 272 LYPCHGSKGNQYFEYD 287
           L  CH +  N     D
Sbjct: 563 LRSCHFTGQNSQVPKD 578


>gi|395844920|ref|XP_003795196.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Otolemur garnettii]
          Length = 633

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 105/303 (34%), Positives = 154/303 (50%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P     +  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGGNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P++E++R K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDQEKQRRKDETYPIKTPTFAGGLFSISKKYFEYIGSYDDEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSGMCIDS 223
               +  ++  +  FGD++ R E++  L CK+F WYL          +++   SG  I S
Sbjct: 453 RRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGY-IKS 511

Query: 224 ACKPTDMH--------KPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
             KP  +         KP+ +Y CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 512 IGKPLCLDVGENNQGGKPLIMYTCHGLGGNQYFEYSSLREIRHNIQKELCLHAAKGPVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|114581503|ref|XP_515871.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Pan
           troglodytes]
 gi|410331347|gb|JAA34620.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Pan
           troglodytes]
          Length = 633

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 101/303 (33%), Positives = 151/303 (49%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E++  L CK+F WYL                   + + 
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ +Y CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KGC 574


>gi|410899503|ref|XP_003963236.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
           [Takifugu rubripes]
          Length = 618

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 99/287 (34%), Positives = 133/287 (46%), Gaps = 31/287 (10%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +    + VVSP I  I  ++F+   P     SS+ F  G FDW+L 
Sbjct: 266 CECFHGWLEPLLARIVEEPTAVVSPEITTIDLESFQFNKPA---PSSHAFNRGNFDWSLT 322

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IPE  RK  K+   PV TPT AGGLFSI K +FE +GTYD   +IWGGEN+E+SF
Sbjct: 323 FGWEQIPEAARKLRKDETCPVKTPTFAGGLFSILKTYFEHIGTYDDKMEIWGGENIEMSF 382

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSGFDIWGG 177
           +          IP            P   P     +        E  +  Y   F     
Sbjct: 383 RVWQCGGQLEIIPCSVVGHVFRTKSPHTFPKGTEVITRNQVRLAEVWMDDYKKIFYRRNK 442

Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGM 219
              +++ + ++GD++ R  LR  L CK+F WYL                   + N  S  
Sbjct: 443 NAAKMAKENNYGDISERLNLRERLHCKNFSWYLNTVYPEAFVPDLTPDRFGAIKNQGSKT 502

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACL 263
           C+D   +     KPV +Y CH  GGNQ++  S H E+R +   E CL
Sbjct: 503 CLDVG-ENNLGGKPVMMYTCHNMGGNQYFEYSSHKELRHNIGKELCL 548


>gi|332206188|ref|XP_003252173.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
           [Nomascus leucogenys]
          Length = 622

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + +   +Y 
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     +  +++ +  FGD++ R +LR  L C +F WYL                   
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 500

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R + A   CL  + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559

Query: 269 DVILYPCHGSKGNQYFEYD 287
            + L  CH +  N     D
Sbjct: 560 ALGLRSCHFTGKNSQVPKD 578


>gi|89365963|gb|AAI14506.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
           sapiens]
          Length = 622

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + +   +Y 
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     +  +++ +  FGD++ R +LR  L C +F WYL                   
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERPQLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 500

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R + A   CL  + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559

Query: 269 DVILYPCHGSKGNQYFEYD 287
            + L  CH +  N     D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578


>gi|357629476|gb|EHJ78219.1| hypothetical protein KGM_03405 [Danaus plexippus]
          Length = 353

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 103/313 (32%), Positives = 145/313 (46%), Gaps = 59/313 (18%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           E    WL+PLL  +  + + VV P+I  I  DTF+       L        GGFDWNL F
Sbjct: 24  ECNVHWLEPLLQRIKEDPTRVVCPVIDVISMDTFQYIGASADLR-------GGFDWNLVF 76

Query: 65  NWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
            W  + + ER  R  +  + + TP +AGGLFS+D+ +F KLG YD   D+WGGENLE+SF
Sbjct: 77  KWEYLSQAERGARLSDPTQVIRTPMIAGGLFSMDRKYFSKLGKYDMKMDVWGGENLEISF 136

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P  +G +F+ +              +
Sbjct: 137 RVWQCGGSLEIVPCSRVGHVFRKRH-----PYSFPGGSGAVFARNTR---------RAAE 182

Query: 174 IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE--------------V 212
           +W  +  EL ++        DFGD++ R  +R+ L CK F+WYLE              +
Sbjct: 183 VWMDDYKELYYRSQPLAKQVDFGDISERVSIRQRLHCKPFRWYLEHVYPELRVPTFGNSI 242

Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
           +      C+D+     D    V +YPCH  GGNQ W    +G IR    CL  +  D   
Sbjct: 243 AIKQGPRCLDTMGHQVD--GTVAMYPCHNTGGNQEWSFD-NGLIRHQSLCLGLSQEDSVT 299

Query: 270 VILYPCHGSKGNQ 282
           V+L  C  S  NQ
Sbjct: 300 VVLAVCDPSDHNQ 312


>gi|260823684|ref|XP_002606210.1| hypothetical protein BRAFLDRAFT_246892 [Branchiostoma floridae]
 gi|229291550|gb|EEN62220.1| hypothetical protein BRAFLDRAFT_246892 [Branchiostoma floridae]
          Length = 595

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 103/331 (31%), Positives = 151/331 (45%), Gaps = 64/331 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K+WL+PLL  +A + + VV P+I  I  DTFE    P           GGF+W L 
Sbjct: 237 CEVSKQWLEPLLARIAEDRTRVVCPIIDIINSDTFEYTASP--------LVRGGFNWGLH 288

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P++  +    AA P+ +PTMAGGLF+ID+ +F++LG YD G DIWGGENLE+SF
Sbjct: 289 FKWDQVPQQLLQGPDGAAAPINSPTMAGGLFAIDREYFDELGRYDEGMDIWGGENLEISF 348

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RKR +    P    TM+     +   +       D   D
Sbjct: 349 RIWMCGGTLEIIPCSRVGHVFRKR-RPYGSPNGEDTMSKNSLRMAHVWM------DEYKD 401

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
            +     E+  +  +GD++ R +LR  L C SFKWYL+                      
Sbjct: 402 QYFSLRPEMKTR-TYGDISDRLKLREKLNCHSFKWYLDNIYPELFVPGGDKLKQVGVGQL 460

Query: 212 -----------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR-RD 259
                      + +  SG+C+ S   P +    V +  C  +  NQ W ++   E++   
Sbjct: 461 PPRPKVIKKGHIKHLDSGLCLISQNGPNEKGSLVVVSECLSEDKNQVWYLTDQDELQLTG 520

Query: 260 EACLDYAGGDVILYP----CHGSKGNQYFEY 286
             CLD    D   +P    CHG+ G Q +++
Sbjct: 521 LLCLDVNENDPKSFPRIMKCHGTSGGQQWKF 551


>gi|403258871|ref|XP_003921965.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Saimiri
           boliviensis boliviensis]
          Length = 633

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 101/303 (33%), Positives = 149/303 (49%), Gaps = 39/303 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    +   +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSHHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P+ E++R K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWETLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
               +  ++  +  FGD++ R E++  L CK+F WYL                   + + 
Sbjct: 453 RRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSV 512

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ +Y CH  GGNQ++  S   EIR +   E CL  A G V L
Sbjct: 513 GQPLCLDVG-ENNQGGKPLIMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQL 571

Query: 273 YPC 275
             C
Sbjct: 572 KAC 574


>gi|348585909|ref|XP_003478713.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Cavia porcellus]
          Length = 633

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 105/307 (34%), Positives = 152/307 (49%), Gaps = 38/307 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP IA+I  +TFE   P    T+  +   G FDW+L 
Sbjct: 280 CECFYGWLEPLLARIADNYTAVVSPDIASIDLNTFEFNKPSPYGTNHNR---GNFDWSLS 336

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+ E++R K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF
Sbjct: 337 FGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSF 396

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +A     + + + ++   Y   F 
Sbjct: 397 RV-WQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFY 452

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVS---------NDWSGMCIDSA 224
               E  ++  +  FGD++ R  +R+ L CK+F WYL            N      I S 
Sbjct: 453 RRNTEAAKIVKQKTFGDLSKRFAIRKRLQCKNFTWYLNTVYPEVYVPDLNPVISGYIKSV 512

Query: 225 CKPTDMH--------KPVGLYPCHKQGGNQFWMMSKHGEIR---RDEACLDYAGGDVILY 273
            +P  +         KP+ LY CH  GGNQ++  S   EIR   + E CL +A  D++  
Sbjct: 513 GQPLCLDVGENNQGGKPLILYTCHGLGGNQYFEYSAQHEIRHSIQKELCL-HATSDLLQL 571

Query: 274 PCHGSKG 280
                KG
Sbjct: 572 KACAYKG 578


>gi|410210024|gb|JAA02231.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
           troglodytes]
 gi|410247040|gb|JAA11487.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
           troglodytes]
 gi|410351197|gb|JAA42202.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
           troglodytes]
          Length = 622

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIPE-------RERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + +   +Y 
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     +  +++ +  FGD++ R +LR  L C +F WYL                   
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLMPTFYGA 500

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R + A   CL  + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559

Query: 269 DVILYPCHGSKGNQYFEYD 287
            + L  CH +  N     D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578


>gi|149714568|ref|XP_001504374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Equus
           caballus]
          Length = 622

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 101/316 (31%), Positives = 153/316 (48%), Gaps = 41/316 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDETAVVSPDIVTIDLNTFEFSKPVQRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W A+P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LSFGWEALPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF-DIW 175
           SF+          IP            P    T   G+  I +        +  G+ +I+
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSP---HTFPKGISVIARNQVRLAEVWMDGYKEIF 443

Query: 176 GGENLE---LSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSN 214
              N++   ++ +  FGD++ R +LR  L C +F W+L+                  + N
Sbjct: 444 YRRNMQAAKMAQEKSFGDISERLQLRERLHCHNFSWFLQNIYPEMFVPDLKPTFYGAIKN 503

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGGDVI 271
                C+D   +     KP+ +Y CH  GGNQ++  +   ++R + A   CL  + G + 
Sbjct: 504 LGIDHCLDVG-ENNHGGKPLIMYTCHGLGGNQYFEYTTQRDLRHNIAKQLCLHASAGTLG 562

Query: 272 LYPCHGSKGNQYFEYD 287
           L  CH +  N     D
Sbjct: 563 LRSCHFTGKNSQVPKD 578


>gi|397479051|ref|XP_003810846.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           1 [Pan paniscus]
 gi|397479053|ref|XP_003810847.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           2 [Pan paniscus]
          Length = 622

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + +   +Y 
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     +  +++ +  FGD++ R +LR  L C +F WYL                   
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 500

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R + A   CL  + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559

Query: 269 DVILYPCHGSKGNQYFEYD 287
            + L  CH +  N     D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578


>gi|115298684|ref|NP_009141.2| polypeptide N-acetylgalactosaminyltransferase 6 [Homo sapiens]
 gi|51316028|sp|Q8NCL4.2|GALT6_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
           AltName: Full=Polypeptide GalNAc transferase 6;
           Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 6;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 6
 gi|37572269|gb|AAH35822.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
           sapiens]
 gi|119578594|gb|EAW58190.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
           sapiens]
 gi|123980642|gb|ABM82150.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6)
           [synthetic construct]
 gi|123995463|gb|ABM85333.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6)
           [synthetic construct]
          Length = 622

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + +   +Y 
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     +  +++ +  FGD++ R +LR  L C +F WYL                   
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 500

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R + A   CL  + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559

Query: 269 DVILYPCHGSKGNQYFEYD 287
            + L  CH +  N     D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578


>gi|297691860|ref|XP_002823292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           2 [Pongo abelii]
 gi|395744294|ref|XP_002823293.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           3 [Pongo abelii]
          Length = 622

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + +   +Y 
Sbjct: 387 SFRVWQCGGQMEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     +  +++ +  FGD++ R +LR  L C +F WYL                   
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 500

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R + A   CL  + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559

Query: 269 DVILYPCHGSKGNQYFEYD 287
            + L  CH +  N     D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578


>gi|444515344|gb|ELV10843.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Tupaia chinensis]
          Length = 614

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 104/296 (35%), Positives = 147/296 (49%), Gaps = 42/296 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFP--PGRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 264 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFSKPVQSGRVHSR-----GNFDWS 318

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++RHK+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 319 LTFGWETLPPHEKQRHKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 378

Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY-DSGFDIW 175
           SF+          IP            P    T   G+  I +        + DS   I+
Sbjct: 379 SFRVWQCGGQLEIIPCSVVGHVFRTKSP---HTFPKGINVIARNQVRLAEVWMDSYKQIF 435

Query: 176 GGENLE---LSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPT--DM 230
              NL+   ++ +  FGD++ R +LR  L C++F W+L   N +  M +    KPT    
Sbjct: 436 YRRNLQAAKMAQEKSFGDISERLKLRELLHCRNFSWFLH--NVYPEMFVPD-LKPTFYGA 492

Query: 231 HKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
            K +G+  C   G N                  ++ G  +I+Y CHG  GNQYFEY
Sbjct: 493 IKNLGINQCLDVGEN------------------NHGGKPLIMYACHGLGGNQYFEY 530


>gi|22760242|dbj|BAC11118.1| unnamed protein product [Homo sapiens]
          Length = 622

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIPE-------RERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + +   +Y 
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     +  +++ +  FGD++ R +LR  L C +F WYL                   
Sbjct: 441 KIFYRRNLQAAKMTQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 500

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R + A   CL  + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKRLCLHVSKG 559

Query: 269 DVILYPCHGSKGNQYFEYD 287
            + L  CH +  N     D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578


>gi|149032012|gb|EDL86924.1| rCG50623 [Rattus norvegicus]
          Length = 431

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 97/306 (31%), Positives = 148/306 (48%), Gaps = 43/306 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A + + VVSP I  I  +TF+   P  R  +  +   G FDW+L 
Sbjct: 81  CECFHGWLEPLLARIAEDKTAVVSPDIVTIDLNTFQFSKPMRRGKAHSR---GNFDWSLT 137

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +PE E++R K+   P+ +PT AGGLFSI KA+FE +GTYD+  +IWGGEN+E+SF
Sbjct: 138 FGWEMLPEHEKQRRKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSF 197

Query: 124 KF-----NWHAIPE-------RERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           +          IP        R +  H     P  T  +A     + + + +    Y   
Sbjct: 198 RVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMDD---YKKI 251

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
           F     +  +++ + +FGDV+ R  LR  L C +F WYL                   + 
Sbjct: 252 FYRRNLQAAKMAKENNFGDVSERLRLREQLHCHNFSWYLHNVYPEMFVPDLNPTFSGAIK 311

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDV 270
           N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R +   + CL  +G  +
Sbjct: 312 NLGTSQCLDVG-ENNRGGKPLIMYVCHNLGGNQYFEYTSQRDLRHNIGKQLCLHASGSTL 370

Query: 271 ILYPCH 276
            L  C 
Sbjct: 371 GLRNCQ 376


>gi|73996388|ref|XP_850161.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           2 [Canis lupus familiaris]
          Length = 622

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 99/313 (31%), Positives = 146/313 (46%), Gaps = 35/313 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDETVVVSPDIVTIDLNTFEFSKPVQRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W AIP  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWEAIPAHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSGFDIW 175
           SF+          IP            P   P     +        E  +  Y   F   
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPHTFPKGVSVIARNQVRLAEVWMDNYKEIFYRR 446

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWS 217
             +  +++ +  FGD++ R +LR  L C +F W+L                   + N   
Sbjct: 447 NMQAAKMAQEKSFGDISERLKLREQLHCHNFSWFLHNIYPEMFVPDLKPTLYGAIRNLGI 506

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVILYP 274
             C+D   +     KP+ +Y CH  GGNQ++  +   ++R +   + CL  + G + L  
Sbjct: 507 NQCLDVG-ENNHGGKPLIMYTCHGLGGNQYFEYTTQRDLRHNISKQLCLHASAGTLGLRS 565

Query: 275 CHGSKGNQYFEYD 287
           CH +  N     D
Sbjct: 566 CHFTGKNSQVPKD 578


>gi|348534088|ref|XP_003454535.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
           [Oreochromis niloticus]
          Length = 559

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 100/315 (31%), Positives = 143/315 (45%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 215 CECNDHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 267

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +   E+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 268 FKWDYMTQEQRRARQGNPIAPIKTPMIAGGLFVMDKEYFEQLGKYDMMMDVWGGENLEIS 327

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 328 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 378

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR E+++ L CK FKWYLE                 + 
Sbjct: 379 EYKNFYYAAVPSARNVPYGNIQSRLEMKKRLNCKPFKWYLENVYPELRVPDHQDIAFGAL 438

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY----AGGDV 270
              G C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL      AG  +
Sbjct: 439 QQGGNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRTAGSLI 496

Query: 271 ILYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 497 KLQGCRENDSRQKWE 511


>gi|291190646|ref|NP_001167159.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Salmo salar]
 gi|223648406|gb|ACN10961.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Salmo salar]
          Length = 560

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 99/315 (31%), Positives = 144/315 (45%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 216 CECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 268

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +   E+ R R  N   P+ TP +AGGLF +DK +FE LG YD   D+WGGENLE+S
Sbjct: 269 FKWDYMTVEQRRVRQGNPTAPIKTPMIAGGLFVMDKDYFELLGKYDMMMDVWGGENLEIS 328

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 329 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 379

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR E+++ LGC+ FKWYLE                 + 
Sbjct: 380 EFKNFYYAAVPSARNVPYGNIQSRMEMKKRLGCQPFKWYLENVYPELRVPDHQDIAFGAL 439

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY----AGGDV 270
              G C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL      AG  +
Sbjct: 440 QQGGNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRTAGSQI 497

Query: 271 ILYPCHGSKGNQYFE 285
            +  C  +   Q +E
Sbjct: 498 KMQGCRENDSRQKWE 512


>gi|157820305|ref|NP_001099666.1| polypeptide N-acetylgalactosaminyltransferase 2 [Rattus norvegicus]
 gi|149043195|gb|EDL96727.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (predicted), isoform
           CRA_b [Rattus norvegicus]
          Length = 473

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 102/303 (33%), Positives = 140/303 (46%), Gaps = 58/303 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  +RWL+PLL+ +A + + VVSP+I  I  D F+          +     GGFDWNL 
Sbjct: 160 CECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 212

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 213 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 272

Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 182
           F+                    VW     G L  I  +    +      +   GG     
Sbjct: 273 FR--------------------VW--QCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTV- 309

Query: 183 SFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------------SNDWSGMCIDSACK 226
                F  + SR ELR+ LGCK FKWYL+                 +      C+D+   
Sbjct: 310 -----FARIQSRLELRKKLGCKPFKWYLDNVYPELRVPDHQDIAFGALQQGTNCLDTLGH 364

Query: 227 PTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI-LYPCHGSKGNQ 282
             D    VG+Y CH  GGNQ W ++K   ++  + CL   D + G +I L  C  +   Q
Sbjct: 365 FAD--GVVGIYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRSPGSLIRLQGCRENDSRQ 422

Query: 283 YFE 285
            +E
Sbjct: 423 KWE 425


>gi|432852860|ref|XP_004067421.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Oryzias latipes]
          Length = 556

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 100/315 (31%), Positives = 143/315 (45%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 212 CECNDHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 264

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +   E+ R R  N   P+ TP +AGGLF +DK +FE LG YD   D+WGGENLE+S
Sbjct: 265 FKWDYMTLEQRRARQGNPIAPIKTPMIAGGLFVMDKEYFELLGKYDMMMDVWGGENLEIS 324

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 325 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 375

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR E+++ LGCK FKWYL+                 + 
Sbjct: 376 EYKNFYYAAVPSARNVPYGNIQSRLEMKKRLGCKPFKWYLDNVYPELRVPDHQDIAFGAL 435

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY----AGGDV 270
              G C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL      AG  +
Sbjct: 436 QQGGNCLDTLGHFAD--GVVGIYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRTAGSLI 493

Query: 271 ILYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 494 KLQGCRENDSRQKWE 508


>gi|5834600|emb|CAA69876.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
           sapiens]
 gi|300470331|dbj|BAJ10977.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
           N-acetylgalactosaminyltransferase 6 [Homo sapiens]
          Length = 622

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 101/319 (31%), Positives = 153/319 (47%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSIPKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIPE-------RERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + +   +Y 
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     +  +++ +  FGD++ R +LR  L C +F WYL                   
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 500

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R + A   CL  + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 559

Query: 269 DVILYPCHGSKGNQYFEYD 287
            + L  CH +  N     D
Sbjct: 560 ALGLGSCHFTGKNSQVPKD 578


>gi|148672125|gb|EDL04072.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 [Mus musculus]
          Length = 436

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 96/306 (31%), Positives = 147/306 (48%), Gaps = 43/306 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A + + VVSP I  I  +TF+   P  R  +  +   G FDW+L 
Sbjct: 86  CECFHGWLEPLLARIAEDKTAVVSPDIVTIDLNTFQFSRPVQRGKAHSR---GNFDWSLT 142

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +PE E++R K+   P+ +PT AGGLFSI KA+FE +GTYD+  +IWGGEN+E+SF
Sbjct: 143 FGWEMLPEHEKQRRKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSF 202

Query: 124 KF-----NWHAIPE-------RERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           +          IP        R +  H     P  T  +A     + + + +    Y   
Sbjct: 203 RVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMDD---YKKI 256

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
           F     +  ++  + +FGD++ R  LR  L C +F WYL                   + 
Sbjct: 257 FYRRNLQAAKMVQENNFGDISERLRLREQLRCHNFSWYLHNVYPEMFVPDLNPTFYGAIK 316

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDV 270
           N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R +   + CL  +G  +
Sbjct: 317 NLGTNQCLDVG-ENNRGGKPLIMYVCHNLGGNQYFEYTSQRDLRHNIGKQLCLHASGSTL 375

Query: 271 ILYPCH 276
            L  C 
Sbjct: 376 GLRSCQ 381


>gi|190358441|ref|NP_001121823.1| polypeptide N-acetylgalactosaminyltransferase 2 [Danio rerio]
          Length = 559

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 101/310 (32%), Positives = 148/310 (47%), Gaps = 41/310 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 215 CECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 267

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +   E+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 268 FKWDYMTLEQRRARQGNPIAPIKTPMIAGGLFVMDKDYFEELGKYDMMMDVWGGENLEIS 327

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +     ++   D  F  +  
Sbjct: 328 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDD--FKNFYY 385

Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGM------------------ 219
             +  +    +G++ SR E+++ LGCK FKWYLE  N +  +                  
Sbjct: 386 AAVPSARNVPYGNIQSRLEMKKRLGCKPFKWYLE--NVYPELRVPDHQDIAFGALQQGQN 443

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY----AGGDVILYPC 275
           C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL      AG  + L  C
Sbjct: 444 CLDTLGHFAD--GVVGVYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRTAGSQIKLQGC 501

Query: 276 HGSKGNQYFE 285
             +   Q +E
Sbjct: 502 RENDTRQKWE 511


>gi|260836667|ref|XP_002613327.1| hypothetical protein BRAFLDRAFT_118726 [Branchiostoma floridae]
 gi|229298712|gb|EEN69336.1| hypothetical protein BRAFLDRAFT_118726 [Branchiostoma floridae]
          Length = 545

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 107/294 (36%), Positives = 140/294 (47%), Gaps = 38/294 (12%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           W++PLL  +  N S+VV P+I  I D TFE     G + SS     GGF W L F+W  I
Sbjct: 202 WVEPLLHRIWENRSNVVMPIIEAIDDKTFEYH---GGVQSSRYAQRGGFSWELHFDWRVI 258

Query: 70  PERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF--- 125
           PE E KR K +   P+ +PTMAGGLFSIDK++F +LGTYD   D WGGENLELSFK    
Sbjct: 259 PEYEIKRWKGDETTPIRSPTMAGGLFSIDKSYFYELGTYDDKMDTWGGENLELSFKIWMC 318

Query: 126 --NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
                  P  +      ++ P   P+             E     DS  D++   N  + 
Sbjct: 319 GGTLEQPPCSKVGHVFRSSAPYSNPSGPKTFIRNTLRVVEVW--LDSYKDLFYALNPHMQ 376

Query: 184 FKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVSNDWSGMCIDSAC 225
            +  +GDV+ RK +R  L CKSF W+L                  E+ N     C+D+  
Sbjct: 377 GE-PYGDVSERKRIRERLQCKSFDWFLENIFPELPIPDKNVQGRGELKNLGGNKCMDTMG 435

Query: 226 KPTDMHKP-VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD---VILYPC 275
           +    H P  GLY CH  GGNQ +  +    I   E CL  +      + LYPC
Sbjct: 436 E----HAPYTGLYSCHGMGGNQVFSYTWKNVISYQERCLAVSRNKPDRISLYPC 485


>gi|196007338|ref|XP_002113535.1| hypothetical protein TRIADDRAFT_27318 [Trichoplax adhaerens]
 gi|190583939|gb|EDV24009.1| hypothetical protein TRIADDRAFT_27318 [Trichoplax adhaerens]
          Length = 455

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 110/313 (35%), Positives = 152/313 (48%), Gaps = 49/313 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV +RWL+PLL  +A+N + VVSP+I  I  DTF           S     GGF WNL 
Sbjct: 157 CEVNERWLEPLLSRVAQNETIVVSPIIDVIHMDTFNY-------IGSSADLKGGFGWNLN 209

Query: 64  FNWHAI-PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W ++  E + +R  +   P+ TP +AGGLFSI K +F K G YD G D+WGGENLE+S
Sbjct: 210 FKWDSMTSEEQSQRAAHPTRPIKTPMIAGGLFSISKNWFIKSGKYDMGMDVWGGENLEIS 269

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
            +      +   +P        RKRH     P   P   GG F   K        +  G+
Sbjct: 270 LRIWMCGGSLEIVPCSRVGHVFRKRH-----PYTFP--GGGGFVFAKNTRRAAEAWMDGY 322

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSND------W 216
             +  +    +    +GD++ R +LR  L C+SFKWY+          E  ND       
Sbjct: 323 AKFYYKREPGARGVPYGDISDRLKLREKLKCRSFKWYMRNVYPELNVPEGVNDKFGELRQ 382

Query: 217 SGMCIDS-ACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD--EACLDY-AGGDVIL 272
            G C+DS   KP D    V  +PCH  GGNQ W M+K  +IR +  + CL   + G+++ 
Sbjct: 383 GGKCLDSIGGKPGDR---VSTFPCHGGGGNQAWDMTKD-KIRNNFIQRCLTISSSGEIVA 438

Query: 273 YPCHGSKGNQYFE 285
            PC      Q ++
Sbjct: 439 DPCEDDNEKQIWQ 451


>gi|403296667|ref|XP_003939220.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           1 [Saimiri boliviensis boliviensis]
 gi|403296669|ref|XP_003939221.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           2 [Saimiri boliviensis boliviensis]
          Length = 622

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 101/319 (31%), Positives = 154/319 (48%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + +   +Y 
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTNVIARNQVRLAEVWMD---SYK 440

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     +  +++ +  FGD++ R +LR  L C +F WYL                   
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLKLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 500

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R + A   CL  + G
Sbjct: 501 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHASKG 559

Query: 269 DVILYPCHGSKGNQYFEYD 287
            + L+ CH +  N     D
Sbjct: 560 ALGLWNCHFTGKNSQVPKD 578


>gi|348518337|ref|XP_003446688.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
           [Oreochromis niloticus]
          Length = 598

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 103/316 (32%), Positives = 146/316 (46%), Gaps = 48/316 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WL PLL  +  + S VVSP+I  I  DTF        L        GGFDW+L 
Sbjct: 248 CEVNKDWLPPLLQRIKEDPSRVVSPVIDIINMDTFAYVAASADLR-------GGFDWSLH 300

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   +R R  +  +P+ TP +AGGLF ID+A+F  LG YD+  DIWGGEN E+SF
Sbjct: 301 FKWEQLSPEQRARRTDPTQPIKTPIIAGGLFVIDRAWFNHLGKYDTAMDIWGGENFEISF 360

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P   G   +  K        +   F 
Sbjct: 361 RVWQCGGSLEILPCSRVGHVFRKKH-----PYVFP--EGNANTYIKNTRRTAEVWMDDFR 413

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------SNDWSGM---- 219
           ++       +    +GD+ SR ELR+ L CKSFKWYL+           S+  SG+    
Sbjct: 414 LFYYSARPAARGKSYGDIRSRVELRKKLNCKSFKWYLDNVYPELKVPDDSDSQSGVIKQR 473

Query: 220 --CIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAGGD 269
             C++S          + L PC    G    NQ W+ +   +IR+ + CL     +    
Sbjct: 474 QNCLESRKVEGQEMPVLTLAPCTGTEGVPAINQEWVYTHGQQIRQQQHCLSVSTTFPASQ 533

Query: 270 VILYPCHGSKGNQYFE 285
           V+L PC+ + G Q ++
Sbjct: 534 VLLLPCNMADGKQRWQ 549


>gi|285026454|ref|NP_001165534.1| polypeptide N-acetylgalactosaminyltransferase 6 [Rattus norvegicus]
          Length = 622

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 97/306 (31%), Positives = 148/306 (48%), Gaps = 43/306 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A + + VVSP I  I  +TF+   P  R  +  +   G FDW+L 
Sbjct: 272 CECFHGWLEPLLARIAEDKTAVVSPDIVTIDLNTFQFSKPMRRGKAHSR---GNFDWSLT 328

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +PE E++R K+   P+ +PT AGGLFSI KA+FE +GTYD+  +IWGGEN+E+SF
Sbjct: 329 FGWEMLPEHEKQRRKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSF 388

Query: 124 KF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           +          IP        R +  H     P  T  +A     + + + +    Y   
Sbjct: 389 RVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMDD---YKKI 442

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
           F     +  +++ + +FGDV+ R  LR  L C +F WYL                   + 
Sbjct: 443 FYRRNLQAAKMAKENNFGDVSERLRLREQLHCHNFSWYLHNVYPEMFVPDLNPTFSGAIK 502

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDV 270
           N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R +   + CL  +G  +
Sbjct: 503 NLGTSQCLDVG-ENNRGGKPLIMYVCHNLGGNQYFEYTSQRDLRHNIGKQLCLHASGSTL 561

Query: 271 ILYPCH 276
            L  C 
Sbjct: 562 GLRNCQ 567


>gi|240120031|ref|NP_766039.2| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
 gi|240120034|ref|NP_001155239.1| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
 gi|240120036|ref|NP_001155240.1| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
 gi|51315988|sp|Q8C7U7.1|GALT6_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
           AltName: Full=Polypeptide GalNAc transferase 6;
           Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 6;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 6
 gi|26339910|dbj|BAC33618.1| unnamed protein product [Mus musculus]
 gi|74196150|dbj|BAE32989.1| unnamed protein product [Mus musculus]
 gi|74198297|dbj|BAE35316.1| unnamed protein product [Mus musculus]
 gi|111601267|gb|AAI19325.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 [Mus musculus]
 gi|111601271|gb|AAI19327.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 [Mus musculus]
          Length = 622

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 96/306 (31%), Positives = 147/306 (48%), Gaps = 43/306 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A + + VVSP I  I  +TF+   P  R  +  +   G FDW+L 
Sbjct: 272 CECFHGWLEPLLARIAEDKTAVVSPDIVTIDLNTFQFSRPVQRGKAHSR---GNFDWSLT 328

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +PE E++R K+   P+ +PT AGGLFSI KA+FE +GTYD+  +IWGGEN+E+SF
Sbjct: 329 FGWEMLPEHEKQRRKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSF 388

Query: 124 KF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           +          IP        R +  H     P  T  +A     + + + +    Y   
Sbjct: 389 RVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMDD---YKKI 442

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
           F     +  ++  + +FGD++ R  LR  L C +F WYL                   + 
Sbjct: 443 FYRRNLQAAKMVQENNFGDISERLRLREQLRCHNFSWYLHNVYPEMFVPDLNPTFYGAIK 502

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDV 270
           N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R +   + CL  +G  +
Sbjct: 503 NLGTNQCLDVG-ENNRGGKPLIMYVCHNLGGNQYFEYTSQRDLRHNIGKQLCLHASGSTL 561

Query: 271 ILYPCH 276
            L  C 
Sbjct: 562 GLRSCQ 567


>gi|167526997|ref|XP_001747831.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773580|gb|EDQ87218.1| predicted protein [Monosiga brevicollis MX1]
          Length = 658

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 112/320 (35%), Positives = 148/320 (46%), Gaps = 55/320 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+P++ ++  +   VV+P+I +I   T E       + +     +G FDW + 
Sbjct: 313 CEANLNWLEPIMALITEDRRTVVTPVIDSIDHHTMEYSKATQDVPA-----VGTFDWTMD 367

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNW A     R+   +A +PV +PTMAGGLF+++K +F +LG+YD   D WGGENLE+SF
Sbjct: 368 FNWKA---GVRRAGADATDPVDSPTMAGGLFAMEKNYFYELGSYDEKMDGWGGENLEMSF 424

Query: 124 KFNWH------AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--------LGTYD 169
           +  W         P          + P   P   GG  SI   F           +  Y 
Sbjct: 425 RI-WQCGGRLVTAPCSHVGHIFRDSHPYTVP---GG--SIHDTFLRNSMRVAEVWMDHYK 478

Query: 170 SGF-DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDWSGMCIDSACKP 227
             F D   G+N+      D GDV+ RKELR+ L C  FKWYL  V  D      +     
Sbjct: 479 QYFLDTRPGQNI-----IDAGDVSERKELRQRLQCHDFKWYLNTVLPDLFIPDANHIQHQ 533

Query: 228 TDMHKP---------------VGLYPCHKQGGNQFWMMSKHGEIR-RDEACLDYAGGD-- 269
             +H P                G+YPCH QG NQ WM S   EIR  D  CLD  G    
Sbjct: 534 GTLHTPDNICVDKMGQRNGGVAGVYPCHGQGTNQAWMYSITNEIRTHDSLCLDAWGSTLP 593

Query: 270 --VILYPCHGSKGNQYFEYD 287
             V L  CHG +GNQ + YD
Sbjct: 594 SPVHLGRCHGMRGNQEWRYD 613


>gi|296211689|ref|XP_002752525.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
           [Callithrix jacchus]
          Length = 622

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 103/324 (31%), Positives = 153/324 (47%), Gaps = 57/324 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFP--PGRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPIQRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGT-------------- 167
           SF+  W    + E          +   ++ G +F          GT              
Sbjct: 387 SFRV-WQCGGQLE----------IIPCSVVGHVFRTKSPHTFPKGTNVIARNQVRLAEVW 435

Query: 168 YDSGFDIWGGENLE---LSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------- 211
            DS   I+   NL+   ++ +  FGD++ R +LR  L C +F WYL              
Sbjct: 436 MDSFKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLTP 495

Query: 212 -----VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CL 263
                + N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R + A   CL
Sbjct: 496 TFYGAIKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCL 554

Query: 264 DYAGGDVILYPCHGSKGNQYFEYD 287
             + G + L  CH +  N     D
Sbjct: 555 HASNGALGLRNCHFTGKNSQVPKD 578


>gi|426256000|ref|XP_004021634.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Ovis
           aries]
          Length = 674

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 100/300 (33%), Positives = 141/300 (47%), Gaps = 48/300 (16%)

Query: 4   CEVQKRWLQPLLDVLARNS--SHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWN 61
           CE  +RWL+PLL+ +A  S  + VVSP+I  I  D F+          +     GGFDWN
Sbjct: 328 CECNERWLEPLLERVAEGSDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWN 380

Query: 62  LQFNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 120
           L F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE
Sbjct: 381 LVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLE 440

Query: 121 LSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           +SF+      +   +P            P   P  +G +F+ +              ++W
Sbjct: 441 ISFRVWQCGGSLEIVPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVW 491

Query: 176 GGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-----------VSNDWS 217
             E     +          +G++ SR ELR+ L CK FKWYLE               + 
Sbjct: 492 MDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLNCKPFKWYLENVYPELRVPDHQDIAFG 551

Query: 218 GMCIDSACKPTDMH---KPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI 271
            +   + C  T  H     VG+Y CH  GGNQ W ++K   ++  + CL   D A G +I
Sbjct: 552 ALQQGTNCLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRAPGSLI 611


>gi|291225677|ref|XP_002732827.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Saccoglossus kowalevskii]
          Length = 633

 Score =  147 bits (371), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 104/328 (31%), Positives = 147/328 (44%), Gaps = 65/328 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV  +WL+PLL+ +  +   VV P+I  I  DTFE +  P           GGF+W L 
Sbjct: 274 CEVSTQWLEPLLERIKFDPHTVVCPIIDIINADTFEYQQSP--------LVRGGFNWGLH 325

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  + K  ++  +PV +PTMAGGLF++D+ +F +LG YD G DIWGGENLE+SF
Sbjct: 326 FKWDTIPSSQFKGKEDYIKPVRSPTMAGGLFAMDRKYFHELGEYDDGMDIWGGENLEISF 385

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RKR +    P    TM+     +   + ++   Y   + 
Sbjct: 386 RIWQCGGTLEIIPCSRVGHVFRKR-RPYGSPNGEDTMSKNSLRVAHVWMDE---YKEHYF 441

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
               +N       D+GD++SR  LR  L C+SFKWYLE                      
Sbjct: 442 ELKKDNR----NKDYGDISSRLALRERLQCQSFKWYLENVYPEIRLPNQKVSYPVDVERR 497

Query: 212 ------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGE-IRR 258
                       + +  +G+C+ S    T     V L+ C  +     W  S   E + +
Sbjct: 498 QPVKAEIIKRGQIVHLLTGLCLTSENDFTQKGTLVVLHDCSDKDKQMIWSQSTSHEFLLK 557

Query: 259 DEACLDYAGGDVILYP----CHGSKGNQ 282
           D  CLD    D   +P    CHGS G+Q
Sbjct: 558 DSLCLDTPETDSKAFPRLMKCHGSGGSQ 585


>gi|157118275|ref|XP_001653147.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108875773|gb|EAT39998.1| AAEL008252-PA [Aedes aegypti]
          Length = 648

 Score =  147 bits (371), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 103/316 (32%), Positives = 151/316 (47%), Gaps = 48/316 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL+PLL+ +ARN + +  P +  I  DT  L     +L        G FDW   
Sbjct: 301 CEVG--WLEPLLNQVARNPTAIAIPSMDWIDGDTMTLDPQVSQL------IYGKFDWMGN 352

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W    +R + + K+  EP  +P M GGLF+I++  F  LG YD  F+ +G E+LELSF
Sbjct: 353 FQWGLRRDRRQPQAKHPMEPFDSPVMPGGLFAINRTLFAHLGWYDEQFETYGAEHLELSF 412

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPT------MAGGLFSIDKAFFEKLGTYDSGF 172
           K      +   +P       +    P  T T      +   L  + + + ++   Y   +
Sbjct: 413 KTWMCGGSMQIVPCSRVAHVQKPNHPYITKTSGSEDVIKRNLVRMAEVWMDEYALY--YY 470

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSND 215
           + +GG +     +GDFGDV+SRK+LR++L CKSF+WYLE                   N 
Sbjct: 471 ETFGGPDK----RGDFGDVSSRKQLRQHLNCKSFRWYLENVFPEQFDPSRAVGRGEFRNG 526

Query: 216 WSGM--CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILY 273
            +G   C+D            G+  CH +G +Q W  ++ GEI R + CLDY G  + + 
Sbjct: 527 ENGTDRCLDWPLA----RNQCGVTSCHGRGRHQMWYFTREGEITRKDHCLDYDGKTLEMN 582

Query: 274 PCHGSKGNQYFEYDYK 289
            CH   GNQ +EY  K
Sbjct: 583 RCHQMGGNQLWEYAEK 598


>gi|26324460|dbj|BAC25984.1| unnamed protein product [Mus musculus]
          Length = 622

 Score =  147 bits (370), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 97/307 (31%), Positives = 148/307 (48%), Gaps = 45/307 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A + + VVSP I  I  +TF+   P  R  +  +   G FDW+L 
Sbjct: 272 CECFHGWLEPLLARIAEDKTAVVSPDIVTIDLNTFQFSRPVQRGKAHSR---GNFDWSLT 328

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +PE E++R K+   P+ +PT AGGLFSI KA+FE +GTYD+  +IWGGEN+E+SF
Sbjct: 329 FGWEMLPEHEKQRRKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSF 388

Query: 124 KFNWHA------IP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS 170
           +  W        IP        R +  H     P  T  +A     + + + +    Y  
Sbjct: 389 RV-WQCGGQLGIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMDD---YKK 441

Query: 171 GFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------V 212
            F     +  ++  + +FGD++ R  LR  L C +F WYL                   +
Sbjct: 442 IFYRRNLQAAKMVQENNFGDISERLRLREQLRCHNFSWYLHNVYPEMFVPDLNPTFYGAI 501

Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGD 269
            N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R +   + CL  +G  
Sbjct: 502 KNLGTNQCLDVG-ENNRGGKPLIMYVCHNLGGNQYFEYTSQRDLRHNIGKQLCLHASGST 560

Query: 270 VILYPCH 276
           + L  C 
Sbjct: 561 LGLRSCQ 567


>gi|170587206|ref|XP_001898369.1| glycosyl transferase, group 2 family protein [Brugia malayi]
 gi|158594195|gb|EDP32781.1| glycosyl transferase, group 2 family protein [Brugia malayi]
          Length = 582

 Score =  147 bits (370), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 103/310 (33%), Positives = 147/310 (47%), Gaps = 50/310 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  N   VV+P+I  I  DTF+       L        GGF+WNL 
Sbjct: 236 CECNVNWLEPLLARVKENHRAVVAPVIDIIDKDTFKYVAASADLR-------GGFEWNLI 288

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +  + R  RH     P+ TP +AGGLF I K +FEKLGTYD   D+WGGENLELS
Sbjct: 289 FKWEYLLGKLRDDRHAQPTAPIRTPVIAGGLFMIQKDWFEKLGTYDEQMDVWGGENLELS 348

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P   G +F  +              ++W G
Sbjct: 349 FRVWLCGGSLEIIPCSRVGHVFRKQHPYTFPGGNGNVFQKNTR---------RAAEVWLG 399

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE----------------VSN 214
           +   L  +        +FGD+T+R +L++ L CK F WYL+                ++ 
Sbjct: 400 DYKYLYLRKVPSARYVNFGDITARLDLKKRLRCKDFDWYLKEIYPELAIPSKEQGRYLTF 459

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMS-KHGEIRRDEACL---DYAGGDV 270
               +CIDS  + T +   VG+Y CH  GGNQ W+++ K G ++   + L   D   G +
Sbjct: 460 RQGNVCIDSLGRHTALSS-VGIYRCHGTGGNQEWVLNDKFGVLKSPYSNLCITDDEKGTL 518

Query: 271 ILYPCHGSKG 280
           IL+ C+ ++G
Sbjct: 519 ILHYCNMTRG 528


>gi|395834931|ref|XP_003790440.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           1 [Otolemur garnettii]
 gi|395834933|ref|XP_003790441.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           2 [Otolemur garnettii]
          Length = 622

 Score =  147 bits (370), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 99/319 (31%), Positives = 152/319 (47%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELR--FPPGRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE     P GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFSKPIPRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWETLPTHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + +   +Y 
Sbjct: 387 SFRVWQCGGQMEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 440

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     +  +++ +  FGD++ R +LR  L C++F W+L                   
Sbjct: 441 MIFYRRNQQAAKMAQEKSFGDISERLQLRERLHCRNFSWFLNNVYPEMFVPDLMPTFYGA 500

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N     C+D   +     KP+ +Y CH  GGNQ++  +   ++R + A   CL  +  
Sbjct: 501 IKNLGINQCLDVG-ENNHGEKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHASVD 559

Query: 269 DVILYPCHGSKGNQYFEYD 287
            + L  CH +  N     D
Sbjct: 560 TLGLRSCHFTGKNSQVPKD 578


>gi|242024227|ref|XP_002432530.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
 gi|212517982|gb|EEB19792.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
          Length = 603

 Score =  147 bits (370), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 102/309 (33%), Positives = 143/309 (46%), Gaps = 46/309 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E    WL+PLL+ +  + + VV P+I  I  DTF+       L        GGFDWNL 
Sbjct: 259 VECNVNWLEPLLERVVEDKTRVVCPIIDVISMDTFQYIGASADLR-------GGFDWNLV 311

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   +R +R ++    + TP +AGGLF ID+ +F+ LG YD   D+WGGENLE+S
Sbjct: 312 FKWEYLTLDQRLRRQQDPTRAIKTPMIAGGLFVIDRLYFDTLGKYDMQMDVWGGENLEIS 371

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P  +G +F+ +     ++   D   
Sbjct: 372 FRVWQCGGSLEIIPCSRVGHVFRKRH-----PYTFPGGSGNVFARNTRRAAEVWMDDYKK 426

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPT---- 228
             +    L  S    FG++  R EL+R L CKSFKWYLE  N +  + I  +  P     
Sbjct: 427 YYYAAVPLAKSIP--FGNIDDRLELKRKLHCKSFKWYLE--NVYPELSIPHSTSPAFGSI 482

Query: 229 ------------DMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVILY 273
                        + + VGLY CH  GGNQ W M     I+  + CL   +Y  G ++L 
Sbjct: 483 RQRQLCLDTLGHSIEQTVGLYVCHDTGGNQEWGMEDDSYIKHHDLCLTIPNYVPGALVLM 542

Query: 274 PCHGSKGNQ 282
                  NQ
Sbjct: 543 RLCEDADNQ 551


>gi|47217176|emb|CAG11012.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 598

 Score =  147 bits (370), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 98/312 (31%), Positives = 142/312 (45%), Gaps = 51/312 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A + + VVSP+I  I  D F+          +     GGFDWNL 
Sbjct: 253 CECNAHWLEPLLERVAEDKTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLV 305

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +   ++ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 306 FKWDYMTLDQRRARQGNPIAPIKTPMIAGGLFVMDKEYFEQLGKYDMMMDVWGGENLEIS 365

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 366 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 416

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR EL++ +GCK FKWYLE                 + 
Sbjct: 417 EYKNFYYAAVPSARNVPYGNIQSRLELKKRVGCKPFKWYLENVYPELRVPDHQDIAFGAL 476

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY----AGGDV 270
              G C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL      AG  +
Sbjct: 477 QQGGNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRTAGSLI 534

Query: 271 ILYPCHGSKGNQ 282
            L  C  +   Q
Sbjct: 535 KLQGCRENDSRQ 546


>gi|47085989|ref|NP_998361.1| polypeptide N-acetylgalactosaminyltransferase 6 [Danio rerio]
 gi|45501175|gb|AAH67340.1| Zgc:77836 [Danio rerio]
          Length = 619

 Score =  146 bits (369), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 95/290 (32%), Positives = 137/290 (47%), Gaps = 31/290 (10%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +    + VVSP I  I  +TF+   P   + ++     G FDW+L 
Sbjct: 267 CECFHGWLEPLLARIVEEPTAVVSPEITTIDLNTFQFHKP---VATARAHNRGNFDWSLT 323

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP+ E  + K+   PV TPT AGGLFSI KA+FEK+GTYD   +IWGGEN+E+SF
Sbjct: 324 FGWEGIPDYENAKRKDETYPVKTPTFAGGLFSISKAYFEKIGTYDDKMEIWGGENVEMSF 383

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSGFDIWGG 177
           +          IP            P   P     +        E  +  Y   F     
Sbjct: 384 RVWQCGGQLEIIPCSVVGHVFRTKSPHTFPKGTEVITRNQVRLAEVWMDDYKLIFYRRSQ 443

Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGM 219
              +++ +  FGD++ R +LR +L CK+F WYL                   + N  +  
Sbjct: 444 SAAKMAKEKGFGDISDRLKLREDLQCKNFSWYLSNVYPEAFVPDLSPVKFGALKNRGAQQ 503

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYA 266
           C+D   +  +  KPV +Y CH  GGNQ++  + H E+R +   + CL  +
Sbjct: 504 CLDVG-ESNNGGKPVIMYTCHNMGGNQYFEYTSHKELRHNIGKQLCLQAS 552


>gi|301614636|ref|XP_002936794.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Xenopus (Silurana) tropicalis]
          Length = 625

 Score =  146 bits (369), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 97/304 (31%), Positives = 146/304 (48%), Gaps = 39/304 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A N + VVSP I  I  +TF+   P     +  +   G FDW L 
Sbjct: 272 CECYYGWLEPLLASIAENYTSVVSPDITGIDLNTFQFSNPSPYGNNHNR---GNFDWTLS 328

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P  E+ R K+   P+ TPT AGGLFSI KA+FE +G+YD   +IWGGEN+E+SF
Sbjct: 329 FGWESLPSSEKTRRKDETYPIKTPTFAGGLFSISKAYFEHIGSYDEQMEIWGGENIEMSF 388

Query: 124 KFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W    + E           R K+    P  T  +      + + + + L      F 
Sbjct: 389 RV-WQCGGQLEILPCSVVGHVFRSKSPHTFPKGTQVIVRNQVRLAEVWMDDLKEI---FY 444

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVSND 215
               E   +    ++GD++ R +LR  L CK+F WYL                  ++ N 
Sbjct: 445 RRNREAANIVKSKEYGDLSKRLDLRHRLQCKNFTWYLNNIYPEMYVPERHPLIHGDLKNV 504

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVIL 272
              +C+D   +     KP+ +Y CH  GGNQ++  +   EIR +   E CL  +   +++
Sbjct: 505 GRDLCLDVGGE-NHGDKPLIMYSCHGLGGNQYFEYTSKHEIRHNIQKELCLRPSHSSLVI 563

Query: 273 YPCH 276
            PC+
Sbjct: 564 KPCN 567


>gi|291389167|ref|XP_002711235.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
           [Oryctolagus cuniculus]
          Length = 622

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 99/308 (32%), Positives = 148/308 (48%), Gaps = 47/308 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFTGWLEPLLARIAEDETVVVSPDIVTIDLNTFEFSKPVQRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W A+P  E +R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWEAVPAHENRRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + +    Y 
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTNVIARNQVRLAEVWMD---NYK 440

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     +  +++ +  FGD++ R +LR  L C +F W+L                   
Sbjct: 441 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWFLHNVYPEMFVPDLNPTFYGA 500

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N   G C+D   +     KP+ +Y CH  GGNQ++  +   ++R + A   CL  +  
Sbjct: 501 IKNLGLGQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQKDLRHNIAKQLCLHASAS 559

Query: 269 DVILYPCH 276
            + L  CH
Sbjct: 560 TLGLRGCH 567


>gi|410912128|ref|XP_003969542.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Takifugu rubripes]
          Length = 558

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 99/315 (31%), Positives = 143/315 (45%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 214 CECNDHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 266

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +   ++ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 267 FKWDYMTLDQRRARQGNPIAPIKTPMIAGGLFVMDKEYFEQLGKYDMMMDVWGGENLEIS 326

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 327 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 377

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR EL++ +GCK FKWYLE                 + 
Sbjct: 378 EYKNFYYAAVPSARNVPYGNIQSRLELKKRVGCKPFKWYLENVYPELRVPDHQDIAFGAL 437

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY----AGGDV 270
              G C+D+     D    VG+Y CH  GGNQ W ++K   ++  + CL      A   +
Sbjct: 438 QQGGNCLDTLGHFAD--GVVGVYECHNAGGNQEWALTKDKSVKHMDLCLTVVDRTASSLI 495

Query: 271 ILYPCHGSKGNQYFE 285
            L  C  +   Q +E
Sbjct: 496 KLQGCRENDSRQKWE 510


>gi|5834643|emb|CAB55352.1| N-acetylgalactosaminyltransferase T-6 [Mus musculus]
          Length = 623

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 93/300 (31%), Positives = 146/300 (48%), Gaps = 43/300 (14%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           WL+PLL  +A + + VVSP I  I  +TF+   P  R  +  +   G FDW+L F W  +
Sbjct: 279 WLEPLLARIAEDKTPVVSPDIVTIDLNTFQFSRPVQRGKAHSR---GNFDWSLTFGWEML 335

Query: 70  PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF---- 125
           P+ E++R K+   P+ +PT AGGLFSI KA+FE +GTYD+  +IWGGEN+E+SF+     
Sbjct: 336 PQHEKQRRKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCG 395

Query: 126 -NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
                IP        R +  H     P  T  +A     + + + +    Y   F     
Sbjct: 396 GQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMDD---YKKIFYRRNL 449

Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGM 219
           +  ++  + +FGD++ R +LR  L C +F WYL                   + N  +  
Sbjct: 450 QAAKMVQENNFGDISERLQLREQLRCHNFSWYLHNVYPEMFVPDLNPTFYGAIKNLGTNQ 509

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVILYPCH 276
           C+D   +     KP+ +Y CH  GGNQ++  +   ++R +   + CL  +G  + L  C 
Sbjct: 510 CLDVG-ENNRGGKPLIMYVCHNLGGNQYFEYTSQRDLRHNIGKQLCLHASGSTLSLRSCQ 568


>gi|391348383|ref|XP_003748427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Metaseiulus occidentalis]
          Length = 648

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 102/318 (32%), Positives = 148/318 (46%), Gaps = 58/318 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + R+   VV P+I  I D T +     G      +F IGGF+W  +
Sbjct: 290 CETTPGWLEPLLEPIRRDRRAVVCPVIDIIDDKTLQYVAAEGD-----RFQIGGFNWKGE 344

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+WH IP   RK   + AEP+ +PTMAGGLF+I++ +F + G+YD   D WGGENLE+SF
Sbjct: 345 FSWHNIPAAWRKNRTSIAEPMRSPTMAGGLFAINREYFWESGSYDEEMDGWGGENLEMSF 404

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS-------GFDIWG 176
           +  W                 V  P    G    D   ++     D+         ++W 
Sbjct: 405 RI-WQC-----------GGHIVIAPCSHVGHIFRDYHPYKFPKGKDTNAINTKRAVEVWM 452

Query: 177 GE--------NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE----------------- 211
            E          EL+ K   GD+++RK  R    CKSFKWYL+                 
Sbjct: 453 DEFKKYFYQTRPELT-KMKVGDISARKAFREKNRCKSFKWYLDNVYPHKYLMEEHSQGFG 511

Query: 212 -VSNDWSGMCIDSACKPTDMHKPVGLYPCH---KQGGNQFWMMSKHGEIRRDEACLDYAG 267
            + N  + MC+D+  K  D    +G++ CH   ++  NQ   +S+ GE+RRD+ C   + 
Sbjct: 512 IIRNPHTNMCLDTYGKSEDEISDLGVFECHPIPEEATNQLLSLSRKGELRRDDVCAKVSW 571

Query: 268 GDVILYPCHGSKGNQYFE 285
            D    P   +KG    E
Sbjct: 572 VD----PFRRTKGKIVME 585


>gi|348521382|ref|XP_003448205.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
           [Oreochromis niloticus]
          Length = 620

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 96/303 (31%), Positives = 147/303 (48%), Gaps = 53/303 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +    + VVSP I++I  ++F+   P   + ++  +  G FDW+L 
Sbjct: 268 CECFHGWLEPLLARIVEEPTAVVSPEISSIDLNSFQFHKP---VATNRAYNRGNFDWSLT 324

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W AIPE  ++  K+   PV TPT AGGLF+I K +FE +GTYD   +IWGGEN+E+SF
Sbjct: 325 FGWEAIPEDAKRLRKDETYPVKTPTFAGGLFAISKKYFEHIGTYDDQMEIWGGENVEMSF 384

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG------FDIWGG 177
           +  W    + E          +   ++ G +F          GT           ++W  
Sbjct: 385 RV-WQCGGQLE----------IIPCSVVGHVFRTKSPHTFPKGTEVITRNQVRLAEVWMD 433

Query: 178 ENLELSFKGD-----------FGDVTSRKELRRNLGCKSFKWYL---------------- 210
           +  ++ ++ +           FGD+++R  LR  L CK+F WYL                
Sbjct: 434 DYKKIYYRRNKNAAIMASEHRFGDISARLNLRERLHCKNFSWYLNTVYPEIFIPDLNPEK 493

Query: 211 --EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDY 265
              + N  S MC+D A +     KP+ +Y CH  GGNQ++  + H E+R +   + CL  
Sbjct: 494 SGSIKNLGSNMCLD-AGENNQGGKPLIMYHCHNMGGNQYFEYTSHKELRHNIGKQLCLHA 552

Query: 266 AGG 268
           A G
Sbjct: 553 AVG 555


>gi|390347269|ref|XP_781402.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Strongylocentrotus purpuratus]
          Length = 749

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 92/277 (33%), Positives = 134/277 (48%), Gaps = 36/277 (12%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           WL+PLL  +  + ++VV P I  I   +FE          S    IG F+W ++F W+ I
Sbjct: 411 WLEPLLQRIHDDPTNVVCPAIDAIDATSFEY-------AGSGATIIGAFNWEMKFTWNGI 463

Query: 70  PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF---- 125
           PE E +R  + + P+ +P MAGGLFSIDK FF ++GTYD GFDIWG ENLELSFK     
Sbjct: 464 PEYEARRRDDESWPIRSPAMAGGLFSIDKDFFYRIGTYDPGFDIWGAENLELSFKIWMCG 523

Query: 126 -NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 184
            +   IP           +P   P      F  +      +   +   DI+     +L  
Sbjct: 524 GSLEIIPCSRVAHIFRKQQPYKFPDGNVKTFMRNTMRLVAVWVDEPYRDIFYSLKPQL-M 582

Query: 185 KGDFGDVTSRKELRRNLGCKSFKWYL------------------EVSNDWSGMCIDSACK 226
             ++GDV+ R +LR  L C  F+WYL                  +V N  + MC+DS  K
Sbjct: 583 GQEYGDVSDRIKLREELKCHDFQWYLDNVYPALKVPDTKVRARGDVRNAATSMCLDSMGK 642

Query: 227 PTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL 263
                  +G++PCH +G NQ + ++   +++    CL
Sbjct: 643 GV-----LGMFPCHGEGNNQAFTLTWDDQLKHKNKCL 674


>gi|390333619|ref|XP_785951.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Strongylocentrotus purpuratus]
          Length = 756

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 102/315 (32%), Positives = 151/315 (47%), Gaps = 50/315 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A + S+VV+P+I  I  +   L +     T +    IG FDW+L 
Sbjct: 404 CEASHGWLEPLLARIAEDRSNVVTPVIDVI--NAQNLAYEADNQTPA----IGVFDWSLT 457

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +I  R+    K +   P+ +PTMAGGLF+ID+++F + G YDSGF+IWG ENLE+S
Sbjct: 458 FRWQSIQRRDLPLLKHDPTHPIPSPTMAGGLFAIDRSYFIETGMYDSGFEIWGAENLEIS 517

Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS-----GFDIWGG 177
           FK  W      E          +   +  G +F     +   L  + S       ++W  
Sbjct: 518 FK-TWMCGGRIE----------ILPCSHVGHIFRKHAPYSNTLTDFISYNNKRLAEVWLD 566

Query: 178 ENLEL-------SFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVS 213
              E        + K + G+ T R ELR  LGC+SF+WYL                 EV 
Sbjct: 567 GYKEFFYFMSPSALKVNAGNYTDRVELRDRLGCRSFQWYLENVFPEGGWPGRNKIYGEVR 626

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY--AGGDVI 271
           +  +  C+D+  + T + +P+  + C     NQ WM ++  EI+    CLDY      + 
Sbjct: 627 HTATNWCLDTGGRTTPITEPMVAHRC-DNNVNQIWMYTEEQEIKHSSLCLDYDVTTMTLT 685

Query: 272 LYPCHGSKGNQYFEY 286
           L  CH   GNQ ++Y
Sbjct: 686 LMGCHQMGGNQLWDY 700


>gi|341894191|gb|EGT50126.1| CBN-GLY-4 protein [Caenorhabditis brenneri]
          Length = 584

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 103/299 (34%), Positives = 145/299 (48%), Gaps = 60/299 (20%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           E  ++WL+PLL  +A N   VV+P+I  I  D F        L        GGFDW L F
Sbjct: 240 ECNQKWLEPLLARIAENPKAVVAPIIDVINVDNFNYVGASADLR-------GGFDWTLVF 292

Query: 65  NWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
            W  + E+ R +RHKN   P+ +PTMAGGLF+I K +FE+LGTYD   ++WGGENLE+SF
Sbjct: 293 RWEFMNEQLRTERHKNPTAPIKSPTMAGGLFAISKEWFEELGTYDLDMEVWGGENLEMSF 352

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P  +G +F  +              +
Sbjct: 353 RVWQCGGSLEILPCSRVGHVFRKKH-----PYTFPGGSGNVFQKNTR---------RAAE 398

Query: 174 IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWY-------LEVSNDWSGM 219
           +W  E   +  K        +FGD+T R  +R  L CKSFKWY       LEV    +G 
Sbjct: 399 VWMDEYKAIYLKNVPSARFVNFGDITDRLAIRDRLQCKSFKWYLDTVYPQLEVPKKAAGK 458

Query: 220 ---------CIDSACKPTDMHKPVGLYPCHKQGGNQFWM---MSKHGEIRRDEACLDYA 266
                    C+DS  +  +  +  GL+ CH  GGNQ W+   ++K  +    + CLD+A
Sbjct: 459 SVQVKMGHHCLDSMARKEN--EAPGLFACHGTGGNQEWVFDHLTKTFKNAITQLCLDFA 515


>gi|291231066|ref|XP_002735481.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
            [Saccoglossus kowalevskii]
          Length = 2434

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 103/320 (32%), Positives = 147/320 (45%), Gaps = 61/320 (19%)

Query: 4    CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            CE  + W++PL+  +  N+  VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 2086 CECNQNWIEPLITKIQENNKAVVSPIIDVINMDNFQYVAASADLK-------GGFDWNLV 2138

Query: 64   FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
            F W +  P    KR  +    + TP +AGGLF+I K++FE+LG YD   D+WGGENLE+S
Sbjct: 2139 FKWDYMTPAERNKRKSDPIAAIRTPMIAGGLFAISKSWFEELGKYDMMMDVWGGENLEIS 2198

Query: 123  FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
            F+          IP        RK+H     P   P  +G +F+ +              
Sbjct: 2199 FRVWQCGGTLEIIPCSRVGHVFRKQH-----PYTFPGGSGNVFAKNTR---------RAA 2244

Query: 173  DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV------------- 212
            ++W  E  +  +          FG++ SR +LR+ L CKSF WYLE              
Sbjct: 2245 EVWMDEYKKYYYSAVPSSKNIAFGNIQSRLDLRKKLQCKSFGWYLENVYPELRIPDKKDI 2304

Query: 213  ---SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY---- 265
               +     +C+D+    TD    +GLY CH  GGNQ + ++K   IR  + CL      
Sbjct: 2305 AFGALQQGHLCMDTLGHFTD--GTLGLYECHNTGGNQEFALTKDKAIRHQDLCLTVMDHR 2362

Query: 266  AGGDVILYPCHGSKGNQYFE 285
              G + L+ C  S  NQ +E
Sbjct: 2363 PSGVIKLHGCSESNLNQKWE 2382


>gi|443721252|gb|ELU10645.1| hypothetical protein CAPTEDRAFT_228331 [Capitella teleta]
          Length = 512

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 95/294 (32%), Positives = 143/294 (48%), Gaps = 57/294 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A + + VVSP+I  I  D F+       L        GGF+WNL 
Sbjct: 168 CECNVHWLEPLLERVAEDPTRVVSPIIDVINMDNFQYVGASSNLK-------GGFNWNLV 220

Query: 64  FNWHAI-PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W ++ PE   +R  N   P+ TP +AGGLF IDK  FE++G YD   D+WGGENLE+S
Sbjct: 221 FKWDSLTPEEVTQRRGNPTAPIKTPMIAGGLFVIDKERFEEIGKYDMMMDVWGGENLEIS 280

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RK+H     P   P  +G +F+ +              
Sbjct: 281 FRVWQCHGSLEIIPCSRVGHVFRKQH-----PYTFPGGSGNVFARNTR---------RAA 326

Query: 173 DIWGGE-------NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS- 217
           ++W  E        +  +   +FGD+ SR ELR  L C+ F W+L+       V ++   
Sbjct: 327 EVWMDEYKSYYYAEVPSAKSVNFGDIRSRLELREKLKCRPFSWFLQNVYPSLIVPSEQDV 386

Query: 218 --------GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL 263
                    MC+D+      +   VG++ CH  GGNQ W+++K+ +I+  + C+
Sbjct: 387 QFGYIQQGSMCVDTLSNA--LGGKVGMFQCHNTGGNQEWVLTKNQKIKHLDLCI 438


>gi|194384516|dbj|BAG59418.1| unnamed protein product [Homo sapiens]
          Length = 603

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 100/319 (31%), Positives = 152/319 (47%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE     L+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 253 CECFHGRLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 307

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 308 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 367

Query: 122 SFKF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           SF+          IP        R +  H     P  T  +A     + + + +   +Y 
Sbjct: 368 SFRVWQCGGQLEIIPCSVVGHVFRTKSPH---TFPKGTSVIARNQVRLAEVWMD---SYK 421

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
             F     +  +++ +  FGD++ R +LR  L C +F WYL                   
Sbjct: 422 KIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGA 481

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGG 268
           + N  +  C+D   +     KP+ +Y CH  GGNQ++  +   ++R + A   CL  + G
Sbjct: 482 IKNLGTNQCLDVG-ENNRGGKPLIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKG 540

Query: 269 DVILYPCHGSKGNQYFEYD 287
            + L  CH +  N     D
Sbjct: 541 ALGLGSCHFTGKNSQVPKD 559


>gi|390364218|ref|XP_793815.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like,
           partial [Strongylocentrotus purpuratus]
          Length = 531

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 100/305 (32%), Positives = 146/305 (47%), Gaps = 50/305 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV   WL+PLL  LA + + VV P++  I  DTF     P  L        GGF+W  +
Sbjct: 192 VEVMIGWLEPLLARLASDRTIVVMPVVDEINKDTFNYNVVPEPLQR------GGFNWRFE 245

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           + W  IP  +++  K A  P+ +P M GGL ++D++FF +LG +D G ++WGGENLE S 
Sbjct: 246 YRWKPIPNYDKRPSKVA--PIKSPAMPGGLLTMDRSFFLELGGFDLGMEVWGGENLETSL 303

Query: 124 KF-----NWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           K      +   IP  R    +++ +     P    G   +D      +       ++W  
Sbjct: 304 KIWMCGGSIEIIPCSRVGHVYRDTS-----PYSFLGQNPLDIVEHNAMRV----VEVWTD 354

Query: 178 EN-------LELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EV 212
           E+       L +    DFGDV+ RK+LR +L C  F WYL                   +
Sbjct: 355 EHKHHFYDRLPMLKNRDFGDVSKRKKLRESLNCYDFNWYLTNVYPELYVPSSSSVLRQTI 414

Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY--AGGDV 270
           +N  S +CIDS  +     K +  + CH  GGN+++  +K GEIR DE CL+    G  V
Sbjct: 415 NNKGSKLCIDSNDQNGQAGKNLIGWHCHNLGGNEYFEETKAGEIRNDELCLEANSVGTHV 474

Query: 271 ILYPC 275
           IL PC
Sbjct: 475 ILNPC 479


>gi|344266859|ref|XP_003405496.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
           [Loxodonta africana]
          Length = 622

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 104/318 (32%), Positives = 153/318 (48%), Gaps = 45/318 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDETVVVSPDIITIDLNTFEFSKPVQRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWETVPLHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY-DSGFDIW 175
           SF+          IP            P    T   G+  I +        + D   +I+
Sbjct: 387 SFRVWQCGGQLEIIPCSVVGHVFRTKSP---HTFPKGINVIARNQVRLAEVWMDDYKEIF 443

Query: 176 GGENLE---LSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPT---- 228
              NL+   ++ +  FGD++ R +L+  L C +F W+L   N +  M +    KPT    
Sbjct: 444 YRRNLQAAKMAEEKSFGDISERLKLKEQLHCHNFSWFLH--NVYPEMFVPD-LKPTFYGA 500

Query: 229 --------------DMH--KPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CLDYAGGD 269
                         + H  KP+ +YPCH  GGNQ++  +   ++R + A   CL    G 
Sbjct: 501 IKSLGTDHCLDVGENNHGGKPLIMYPCHSLGGNQYFEYTTQRDLRHNIAKQLCLHANAGT 560

Query: 270 VILYPCHGSKGNQYFEYD 287
           + L  CH +  N     D
Sbjct: 561 LGLRSCHFTGKNSQVPKD 578


>gi|311246104|ref|XP_003122084.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Sus
           scrofa]
          Length = 541

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 89/218 (40%), Positives = 116/218 (53%), Gaps = 26/218 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +    S VV P+I  I  +TFE       + +S +  IGGFDW L 
Sbjct: 229 CECHEGWLEPLLQRIHEKESAVVCPVIDVIDWNTFEY------MGNSREPQIGGFDWRLV 282

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH +PERER R K+  + + +PTMAGGLF++ K +FE LG YD+G ++WGGENLE SF
Sbjct: 283 FTWHVVPERERLRMKSPIDVIRSPTMAGGLFAVSKKYFEYLGAYDTGMEVWGGENLEFSF 342

Query: 124 KFNWH--AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S  KA    L       ++W  E  E
Sbjct: 343 RI-WQCGGTLEIHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDEFKE 391

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLEV 212
           L +  +       FGDVT RK+LR  L CK FKW+LE 
Sbjct: 392 LYYHRNPHARLEPFGDVTERKQLRAKLQCKDFKWFLET 429


>gi|260841393|ref|XP_002613900.1| hypothetical protein BRAFLDRAFT_208719 [Branchiostoma floridae]
 gi|229299290|gb|EEN69909.1| hypothetical protein BRAFLDRAFT_208719 [Branchiostoma floridae]
          Length = 442

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 91/276 (32%), Positives = 139/276 (50%), Gaps = 45/276 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWN-L 62
           CE    WL+P L+ +ARN + V   ++ NI  DTF+  F   + T      +GG ++  L
Sbjct: 179 CECMHGWLEPQLETIARNYTTVPISVLDNILHDTFQYTFMDLQSTQ-----MGGINFKEL 233

Query: 63  QFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
            F W  IPE ER+R K+  +P+ +PTMAGG+FSI+K +FE LG YD+G ++WGGEN+E+S
Sbjct: 234 TFIWEPIPEHERRRQKSPVDPIRSPTMAGGIFSINKKYFEYLGAYDTGMEVWGGENIEMS 293

Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 182
           F+  W            +    V+ PT     +S   A+ + +       ++W  +  E+
Sbjct: 294 FRI-WQCGGTIVVLPCSHVGH-VFRPTSP---YSTGDAWKKLVHNNRRMAEVWMDDYKEI 348

Query: 183 SF-------KGDFGDVTSRKELRRNLGCKSFKWYL------------------------E 211
            +       K D GDVT RK LR+ L C+ F WYL                         
Sbjct: 349 YYRKHPEYRKYDMGDVTQRKLLRKGLHCRDFSWYLSHVFPTLYVPDIRPIAHGQVSHVTS 408

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQF 247
           +S++ +G+C+D         +P G++PCH +GG Q 
Sbjct: 409 ISSEQTGLCLDVI---KAGKEPAGVFPCHGKGGTQV 441


>gi|195425502|ref|XP_002061040.1| GK10658 [Drosophila willistoni]
 gi|194157125|gb|EDW72026.1| GK10658 [Drosophila willistoni]
          Length = 489

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 91/216 (42%), Positives = 115/216 (53%), Gaps = 20/216 (9%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +ARN + V SP I  I   TF+  +            +G FDWNL+
Sbjct: 268 CECTEGWLEPLLDRIARNRNTVASPTIDMIDPKTFQYNY------DGANDVLGVFDWNLE 321

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP RE KR  + AEP+ TPT+AGGLF+ID  FF  +GTYD GF+IWGG+NLELSF
Sbjct: 322 FYWIPIPLRELKRRNHFAEPIQTPTIAGGLFAIDLEFFRSVGTYDPGFNIWGGDNLELSF 381

Query: 124 KFNW------HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIW 175
           K  W        IP            P   P+    +   + A   +  L  Y   +   
Sbjct: 382 K-TWMCGGILEIIPCSHVGHIFRDDSPYEWPSSRAMMVESNLARLAEVWLDDYAKYYYER 440

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
            G N  L+      DV+ RK+LR  LGCKSFKWYL+
Sbjct: 441 SGGNKSLA-----TDVSDRKKLREKLGCKSFKWYLD 471


>gi|157114758|ref|XP_001652407.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108883560|gb|EAT47785.1| AAEL001146-PA [Aedes aegypti]
          Length = 552

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 105/308 (34%), Positives = 148/308 (48%), Gaps = 39/308 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+P LD +ARN + V  P I  +  D   L F    + +    + G  DW LQ
Sbjct: 211 CECTTGWLEPQLDRVARNPTTVAIPTIDWV--DEHNLAF----IANRSHIYYGACDWGLQ 264

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W    +R+ K  +N  EP  TP MAGGLFSI+K FF  +G YD G  I+GGEN+ELS 
Sbjct: 265 FGWRGRWDRKVK-PENKLEPFPTPIMAGGLFSINKTFFAHIGWYDEGLGIYGGENVELSL 323

Query: 124 KF-----NWHAIPERERKRHKNAAEP----VWTPTMAGGLFSIDKAFFEKLGTYDSGFDI 174
           K          IP       + A  P    V T  +  G   + + + ++       +D+
Sbjct: 324 KAWMCGGRLETIPCSRVGHIQKAGHPYLDGVKTDWVRVGSVRVAEVWMDQYA--QVVYDM 381

Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVS----NDWSGMCIDSACKPTDM 230
           +GG      F+G+FGDV+ RK+LR +L CKSFKWYLE +     D     +    K T++
Sbjct: 382 FGGP----EFRGNFGDVSDRKKLRESLNCKSFKWYLENAFPELEDPVSYGVGHG-KFTNL 436

Query: 231 HKPVGLYPCHKQGGNQF------------WMMSKHGEIRRDEACLDYAGGDVILYPCHGS 278
                  P +++ G  F            W+ +  GEI     CLDY G  + ++ CH  
Sbjct: 437 GVGKNFCPRYRKAGYTFRMEPCTDDDYQHWVHNMLGEISTSNVCLDYDGITLYMFECHKG 496

Query: 279 KGNQYFEY 286
           +GNQ + Y
Sbjct: 497 QGNQKWRY 504


>gi|301608339|ref|XP_002933739.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
           [Xenopus (Silurana) tropicalis]
          Length = 622

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 102/302 (33%), Positives = 144/302 (47%), Gaps = 33/302 (10%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A + + VVSP I  I  ++FE   P     +  +   G FDW+L 
Sbjct: 270 CECFHGWLEPLLSRIAEDHTAVVSPDIPIIDLNSFEFHKPVQYGKTHNR---GNFDWSLT 326

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W AIP  E++R K+   P+ TPT AGGLFSI KA+FE +G+YD   +IWGGENLE+SF
Sbjct: 327 FGWEAIPAAEKERRKDETYPIKTPTFAGGLFSISKAYFEHIGSYDEEMEIWGGENLEMSF 386

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSGFDIWGG 177
           +          IP            P   P     +F       E  +  Y   +     
Sbjct: 387 RVWQCGGQLEIIPCSVVGHVFRTKSPHTFPKGTQVIFRNLVRLAEVWMDDYKLLYYQRNE 446

Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGM 219
           +  ++  +  FGD++ R +L+ +L CK+F WYLE                  V N+ S  
Sbjct: 447 QAAKMVREKSFGDISKRLKLKADLQCKNFTWYLENIYPEMFVPDRDPTYYGKVKNEGSQN 506

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA---CL--DYAGGDVILYP 274
           C+D+  K     KP+ +  C+  GG Q++  S H E+R + A   CL   Y  G V L  
Sbjct: 507 CLDAGEK-NHGGKPLIMNLCNGMGGTQYFEYSTHKELRHNIAKQLCLRSKYVPGPVELGE 565

Query: 275 CH 276
           C 
Sbjct: 566 CQ 567


>gi|351697576|gb|EHB00495.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Heterocephalus
           glaber]
          Length = 622

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 99/298 (33%), Positives = 145/298 (48%), Gaps = 46/298 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELR--FPPGRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A +   VVSP I  I  DTFE     P GR+ S      G FDW+
Sbjct: 272 CECFYGWLEPLLARIAEDQVAVVSPDIVTINLDTFEFSKPIPGGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P +E++R ++   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWETLPAQEKQRREDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKFNWHAIPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS 170
           SF+  W      E           R R  +   P  T  ++     + + + +    Y  
Sbjct: 387 SFRV-WQCGGRLEIAPCSVVGHVFRSRSPHTF-PKGTSVISRNQVRLAEVWMDD---YKK 441

Query: 171 GFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPT-- 228
            F     +  +++ +  FGD++ R +LR  L C++F W+L   N +  M +     PT  
Sbjct: 442 IFYRRNLQAAKIAQEKSFGDISERLQLREQLHCRNFSWFLH--NIYPEMFVPD-LNPTFY 498

Query: 229 DMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEY 286
              K +G+  C   G N                  +  G  +I+Y CHG  GNQYFEY
Sbjct: 499 GAIKNLGINQCLDVGEN------------------NRGGKPLIMYSCHGLGGNQYFEY 538


>gi|113677422|ref|NP_001038460.1| polypeptide N-acetylgalactosaminyltransferase 14 [Danio rerio]
          Length = 554

 Score =  144 bits (362), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 98/316 (31%), Positives = 147/316 (46%), Gaps = 48/316 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WL PLL  +  + + V SP+I  I  DTF          ++     GGFDW+L 
Sbjct: 204 CEVNKDWLPPLLQRVKEDPTSVASPVIDIINMDTFAY-------VAASSDLRGGFDWSLH 256

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   +R +  +  EP+ TP +AGGLF ID+++F +LG YD+  DIWGGEN E+SF
Sbjct: 257 FKWEQLSAEKRAKRADPTEPIKTPIIAGGLFVIDRSWFNRLGKYDTAMDIWGGENFEISF 316

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   IP        RK+H     P   P   G   +  K        +   F 
Sbjct: 317 RVWMCGGSLEIIPCSRVGHVFRKKH-----PYIFP--EGNANTYIKNTRRTAEVWMDEFK 369

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------SNDWSGM---- 219
           ++       +    +GD+  R+ELR++L CKSFKWYL+           S+  SG+    
Sbjct: 370 LFYYSARPAARGKSYGDIHGRQELRKSLNCKSFKWYLDNVYPELKVPDDSDAKSGVIRQR 429

Query: 220 --CIDSACKPTDMHKPVGLYPC----HKQGGNQFWMMSKHGEIRRDEACLD----YAGGD 269
             C++S          + L PC         NQ W+ +   +IR+ + CL     +    
Sbjct: 430 QNCLESRVVEGQDLPVLTLAPCIITKETPAANQEWIYTHGQQIRQQQYCLSVSTTFPASQ 489

Query: 270 VILYPCHGSKGNQYFE 285
           ++L PC+ S G Q ++
Sbjct: 490 ILLMPCNISDGKQRWQ 505


>gi|260836359|ref|XP_002613173.1| hypothetical protein BRAFLDRAFT_114107 [Branchiostoma floridae]
 gi|229298558|gb|EEN69182.1| hypothetical protein BRAFLDRAFT_114107 [Branchiostoma floridae]
          Length = 539

 Score =  144 bits (362), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 143/315 (45%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+P+L+ +  + + VV P+I  I  D F+       L        GGFDWNL 
Sbjct: 194 CECNQHWLEPMLERVMEDRTRVVCPIIDVINMDNFQYVGASADLR-------GGFDWNLV 246

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   +R  R  +   P+ TP +AGGLF IDK++F++LG YD   D+WGGENLE+S
Sbjct: 247 FKWDYMTANQRNARRSDPIAPIRTPMIAGGLFMIDKSWFDELGKYDMMMDVWGGENLEIS 306

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 307 FRVWQCQGSLEIIPCSRVGHVFRKQHPYTFPGGSGNVFTRNTR---------RAAEVWMD 357

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E  E  +          FG++ SR ELR+ L CK F WYLE                 + 
Sbjct: 358 EYKEYYYAAVPSARNVPFGNIQSRLELRKKLSCKPFAWYLEHVYPELRIPDKKDVAFGAL 417

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG---GDVI 271
               +C+D+     D    VG+Y CH  GGNQ W ++K   IR  + CL       G+++
Sbjct: 418 QQGTLCMDTLGHFAD--GTVGVYECHGSGGNQEWALTKDKSIRHSDLCLTVVNQNPGELL 475

Query: 272 -LYPCHGSKGNQYFE 285
            L+ C      Q +E
Sbjct: 476 KLHGCQEKNTKQKWE 490


>gi|427789289|gb|JAA60096.1| Putative polypeptide n-acetylgalactosaminyltransferase
           [Rhipicephalus pulchellus]
          Length = 526

 Score =  144 bits (362), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 94/269 (34%), Positives = 137/269 (50%), Gaps = 34/269 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+P+++++ ++ + VV P+I  I D T +        TSS  + IGGF+W  +
Sbjct: 259 CEATDHWLEPMVELIKKDRTTVVCPIIDVIDDKTLQYMG-----TSSDFYQIGGFNWKGE 313

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W   PE  RK  K+ A+P+ +PTMAGGLF+ID+ +F + G+YDS  + WGGENLE+SF
Sbjct: 314 FIWINTPEAWRKARKSKADPMRSPTMAGGLFAIDRKYFWESGSYDSEMEGWGGENLEMSF 373

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +      +    P            P   P+       I+ A   ++  +   +  +  +
Sbjct: 374 RIWMCGGSLVIAPCSHVGHIFRDYHPYKFPS-NKDTHGINTARLAEV--WMDNYKYYFYQ 430

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGMC 220
           N     K  FGD++ RK LR  L CKSFKWYL+                    N  +GMC
Sbjct: 431 NRPELRKISFGDISERKALRNKLQCKSFKWYLDNVYPNKFVPSEKVFAFGNARNPNTGMC 490

Query: 221 IDSACKPTDMHKPVGLYPCHK---QGGNQ 246
           +DS     D  +P+G+YPCHK    GGNQ
Sbjct: 491 LDSMSHNYDNTEPLGIYPCHKDTNSGGNQ 519


>gi|390361781|ref|XP_790897.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
           partial [Strongylocentrotus purpuratus]
          Length = 521

 Score =  143 bits (361), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 104/322 (32%), Positives = 150/322 (46%), Gaps = 48/322 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLL+ +A N   +V P+I  I ++ F      G +        G FDW L 
Sbjct: 154 CEANYNWLPPLLERIALNRRRIVCPMIDVISNEDFHYESQAGDVMR------GAFDWELY 207

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFF-EKLGTYDSGFDIWGGENLELS 122
           +    I E E KR  + ++P  TP MAGGLF++D+ +F E+LG YD G +IWGGE  +LS
Sbjct: 208 YKRIPISEAENKRRSHESDPFRTPIMAGGLFAVDRKYFMEELGGYDEGLEIWGGEQYDLS 267

Query: 123 FKFNWHAIPERER---KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           FK  W    E E     R  +      + T+ GG   I+K     +  +    D WG   
Sbjct: 268 FKV-WMCGGEMEEIPCSRVGHIYRKFMSYTVPGGAGVINKNLLRVVEVW---MDEWGKYF 323

Query: 180 LELS--FKG-DFGDVTSRKELRRNLGCKSFKWYL----------------------EVSN 214
            E     KG D+GD++ +  LR  L CK+F W+L                       +++
Sbjct: 324 YERRPYLKGQDYGDISKQLALRERLQCKNFTWFLTEVAPDILQYYPPVEPEGGAKGHITH 383

Query: 215 DWSGMCIDSACKPTDMHKPVGLYP-CHKQGGNQFWMMSKHGEIR----RDEACLDYA--- 266
             +G C+  +    D  +     P   KQGG+QFW ++ H + R      + C+D+    
Sbjct: 384 TSTGKCLTLSQGGKDELRVQECNPRSMKQGGSQFWELTWHDDFRPSSKSRKQCVDFPYGR 443

Query: 267 -GGDVILYPCHGSKGNQYFEYD 287
            G + ILYPCH   GNQ + YD
Sbjct: 444 EGAEPILYPCHHGGGNQLWVYD 465


>gi|196001851|ref|XP_002110793.1| hypothetical protein TRIADDRAFT_11844 [Trichoplax adhaerens]
 gi|190586744|gb|EDV26797.1| hypothetical protein TRIADDRAFT_11844, partial [Trichoplax
           adhaerens]
          Length = 490

 Score =  143 bits (360), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 111/318 (34%), Positives = 154/318 (48%), Gaps = 65/318 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL+PLL+ +  N + VV P I  I D TF+ +F P  L        G F+W L 
Sbjct: 156 CEVTTGWLEPLLERIYLNETTVVCPEIDVIDDRTFQYQFGPPALMR------GVFNWQLY 209

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  E KR K+  +PVW+PTMAGGLF+I K FF++LGTYD  FD+WGGEN+E+SF
Sbjct: 210 FRWALIPPEEHKRRKSPIDPVWSPTMAGGLFAISKKFFKRLGTYDDQFDVWGGENMEISF 269

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG------TYDSGFDIWGG 177
           K  W    + E          +   +  G +F  ++ +  K G            ++W  
Sbjct: 270 K-AWLCGGKLE----------IVPCSRVGHVFRHNQPY--KFGGNFLSRNSQRVAEVWLD 316

Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE------------------V 212
           +  E  +       K +FG++  R EL++ L CK FKWYL+                  +
Sbjct: 317 DYKEFFYQVQPHLRKEEFGNIAERLELKKKLKCKPFKWYLQNIYTDVVLPNESSIAKGKL 376

Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMM----SKHGEIRRDEACLDYA-- 266
            N  S MC+D+  K  + +  + +YPC        W M    S   E+   E CLD +  
Sbjct: 377 KNPASNMCLDTMGKTANAY--MSIYPCANS-----WTMEMSYSILEELVVSELCLDVSDN 429

Query: 267 --GGDVILYPCHGSKGNQ 282
             G  + LY CHG  GNQ
Sbjct: 430 KDGARIQLYDCHGQGGNQ 447



 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 32/60 (53%), Gaps = 7/60 (11%)

Query: 234 VGLYPCHKQGGNQFWMMSKHGEIRRDEA--CLDYAGGDV-ILYPCHGSKGNQYFEYDYKY 290
           + LY CH QGGNQ W+   H +IR      CLD   G+  ++ PC G   +Q + +D  Y
Sbjct: 435 IQLYDCHGQGGNQLWL---HKKIRHPNTGKCLDRGSGNTPVMKPCSGGV-SQMWSFDTYY 490


>gi|355689583|gb|AER98881.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 [Mustela putorius
           furo]
          Length = 461

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 95/265 (35%), Positives = 129/265 (48%), Gaps = 44/265 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSND 215
               +       K D+GD++SR  LR  L CK F WYL                 E+ N 
Sbjct: 378 KNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNV 437

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCH 240
            +  C+D+  +  +  + VG++ CH
Sbjct: 438 ETNQCLDNMARKEN--EKVGIFNCH 460


>gi|156351115|ref|XP_001622369.1| hypothetical protein NEMVEDRAFT_v1g141560 [Nematostella vectensis]
 gi|156208888|gb|EDO30269.1| predicted protein [Nematostella vectensis]
          Length = 494

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 102/317 (32%), Positives = 142/317 (44%), Gaps = 58/317 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WLQPLL  + +N   VVSP+I  I  D F           +     GGFDW+L 
Sbjct: 149 CECNTDWLQPLLKRVVQNKKAVVSPIIDVINMDDFSY-------IGASADIKGGFDWSLH 201

Query: 64  FNWHAI-PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  + PE+++ R      P+ TP +AGGLF + K++FE++G YD+  DIWGGEN E+S
Sbjct: 202 FKWDNLTPEQKQSRRSTPIAPIKTPMIAGGLFVVTKSWFEEMGKYDTMMDIWGGENFEIS 261

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RKRH     P   P      +         +       
Sbjct: 262 FRTWQCGGSMEIIPCSRVGHVFRKRH-----PYTFPDGNANTY---------MKNTRRTA 307

Query: 173 DIWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLEV---------SNDW 216
           ++W  E     +          +G + SRKELR+ L CK FKWYL+          S D 
Sbjct: 308 EVWMDEYKRFYYAARPMARSALYGSIKSRKELRKRLQCKPFKWYLQNVYPELQIPDSQDV 367

Query: 217 S-------GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY-AGG 268
           S         C+D+    +     VG++ CH Q GNQ W ++K   +R  + CL   +GG
Sbjct: 368 SFGELKQGKSCLDTL--GSQAGGSVGMFDCHGQAGNQEWALTKKSTVRHLDLCLTLGSGG 425

Query: 269 DVILYPCHGSKGNQYFE 285
            V L  C      Q +E
Sbjct: 426 AVTLEGCRDGDPKQIWE 442


>gi|432865221|ref|XP_004070476.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
           [Oryzias latipes]
          Length = 621

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 94/288 (32%), Positives = 132/288 (45%), Gaps = 31/288 (10%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +    + VVSP I  I  + F    P   + ++  +  G FDW+L 
Sbjct: 269 CECFHGWLEPLLARIVEEPTAVVSPEITTIDLNNFNFNKP---IATNRAYNRGNFDWSLT 325

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W AIPE  R+  K+   PV TPT AGGLFSI K +FE +GTYD   +IWGGEN+E+SF
Sbjct: 326 FGWEAIPEEARRLRKDETYPVKTPTFAGGLFSISKKYFEHIGTYDDKMEIWGGENVEMSF 385

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSGFDIWGG 177
           +          IP            P   P     +        E  +  Y   +     
Sbjct: 386 RVWQCGGQLEIIPCSVVGHVFRTKSPHTFPKGTEVITRNQVRLAEVWMDDYKKIYYRRNK 445

Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGM 219
               ++ +  +GD++ R +LR +L CK+F WYL                   + N  S  
Sbjct: 446 NAAIMAQEKKYGDISDRLKLREDLHCKNFSWYLNTIYPEIFVPDLTPEKFGAIKNLGSDT 505

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLD 264
           C+D   +     KPV +Y CH  GGNQ++  S H E+R +   + CL 
Sbjct: 506 CLDVG-ENNQGGKPVIMYMCHNMGGNQYFEYSSHKELRHNIGKQLCLQ 552


>gi|449271781|gb|EMC82021.1| Polypeptide N-acetylgalactosaminyltransferase 12, partial [Columba
           livia]
          Length = 314

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 88/217 (40%), Positives = 118/217 (54%), Gaps = 26/217 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A   S VV P+I  I  +TFE       L ++ +  IGGFDW L 
Sbjct: 107 CECHEGWLEPLLARIAEEESAVVCPVIDVIDWNTFEY------LGNAGEPQIGGFDWRLV 160

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH+ PERE+KR K+  + + +PTMAGGLFS+ K +F+ LG+YD+G ++WGGENLE SF
Sbjct: 161 FTWHSTPEREQKRRKSKTDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSF 220

Query: 124 KFNWHAIPERERK--RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           +  W      E     H     P   P      +S  KA    L       ++W  E  E
Sbjct: 221 RI-WQCGGSLEIHPCSHVGHVFPKQAP------YSRSKA----LANSVRAAEVWMDEYKE 269

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE 211
           L +  +       +GDVT R+ LR  L CK FKW+LE
Sbjct: 270 LYYHRNPHARLEPYGDVTERRLLREKLKCKDFKWFLE 306


>gi|410916145|ref|XP_003971547.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
           [Takifugu rubripes]
          Length = 579

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 100/316 (31%), Positives = 149/316 (47%), Gaps = 48/316 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WL PLL  + ++ + VVSP+I  I  DTF          ++     GGFDW+L 
Sbjct: 229 CEVNKDWLPPLLQRIKQDPTRVVSPVIDIINMDTFAY-------VAASADLRGGFDWSLH 281

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   +R R  + A+P+ TP +AGGLF ID+++F  LG YD+  DIWGGEN E+SF
Sbjct: 282 FKWEQLSPEQRARRTDPAQPIKTPIIAGGLFVIDRSWFNHLGKYDTAMDIWGGENFEISF 341

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P   G   +  K        +   F 
Sbjct: 342 RVWQCGGSLEILPCSRVGHVFRKKH-----PYVFP--EGNANTYIKNTRRTAEVWMDDFS 394

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV----------SNDWSGM---- 219
           ++       +    +GD+  R ELR+ L CK+FKWYL+           S+  SG+    
Sbjct: 395 LFYYSARPAARGKSYGDIRGRLELRKKLKCKTFKWYLDNVYPELKVPDDSDSKSGVIKQR 454

Query: 220 --CIDSACKPTDMHKPVGLYPCH-KQGGN---QFWMMSKHGEIRRDEACLD----YAGGD 269
             C++S          + L PC   QG N   Q W+ +   +IR+ + CL     +    
Sbjct: 455 QNCLESQRVEGQELPVLTLAPCVGSQGVNAIKQEWVYTHGQQIRQQQHCLSLSTTFPASQ 514

Query: 270 VILYPCHGSKGNQYFE 285
           V+L PC+ + G Q ++
Sbjct: 515 VLLLPCNMADGKQRWQ 530


>gi|341878756|gb|EGT34691.1| CBN-GLY-9 protein [Caenorhabditis brenneri]
          Length = 579

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 96/316 (30%), Positives = 147/316 (46%), Gaps = 51/316 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+P++  ++   + +V P+I +I D T             +   +GGF W L 
Sbjct: 230 CEANHGWLEPIVQRISDERTAIVCPMIDSISDSTLAYH-------GDWSLSVGGFSWALH 282

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IPE E+KR K   + + +PTMAGGL + ++ +F ++G YD   DIWGGENLE+SF
Sbjct: 283 FTWEGIPEDEQKRRKKPTDYIRSPTMAGGLLAANREYFFEVGGYDEEMDIWGGENLEISF 342

Query: 124 KFNWHA------IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           + NW        IP         A  P +  T       +     ++L       ++W  
Sbjct: 343 R-NWMCGGSIEFIPCSHVGHIFRAGHP-YNMTGRNNNKDVHGTNSKRLA------EVWMD 394

Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE------------------V 212
           +   L +         D GD+TSR ELR+ L CKSFKW+L+                  +
Sbjct: 395 DYKRLYYMHREDLRTKDVGDLTSRHELRKRLNCKSFKWFLDNIAKGKFIMDEDVVAYGAL 454

Query: 213 SNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGN-QFWMMSKHGEIRRDEACLDYAGGD 269
               SG  MC D+  +   M + +G++ C  +G + Q   +S+ G +RR+  C     G+
Sbjct: 455 HTVVSGTRMCTDTLQRDEKMSQLLGVFHCQGKGSSPQLMSLSREGNLRRENTCASEENGN 514

Query: 270 VILYPCHGSKGNQYFE 285
           V +  C  SK  Q+ E
Sbjct: 515 VRMKTC--SKKAQFNE 528


>gi|324510655|gb|ADY44456.1| N-acetylgalactosaminyltransferase 9 [Ascaris suum]
          Length = 577

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 91/290 (31%), Positives = 142/290 (48%), Gaps = 44/290 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +    + V+ P+I  I  +T +          +    +GGF W+L 
Sbjct: 228 CEANEGWLEPLLARIKEKRTAVLCPIIDYISAETMQYS------GDANVNAVGGFWWSLH 281

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W +I + ER R K+A EPV +PTMAGGL + ++ +F ++G YD G DIWGGENLE+SF
Sbjct: 282 FRWDSIGKAERDRRKSAIEPVRSPTMAGGLLAANREYFLEVGGYDPGMDIWGGENLEISF 341

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +      +   IP         A  P +  T  GG   +     ++L       ++W  +
Sbjct: 342 RVWMCGGSIEFIPCSHVGHIFRAGHP-YNMTGPGGNLDVHGTNSKRLA------EVWMDD 394

Query: 179 NLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
              L +         D GD++ RK LR+ L CKSFKWYL+                  + 
Sbjct: 395 YKRLYYLHRPDLKTKDVGDLSERKALRKKLKCKSFKWYLDNVIPHKFIPDEGVVGYGALR 454

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGN-QFWMMSKHGEIRRDEAC 262
           N  SG+C+D+  +       +G++ C   G + Q + ++K G++RR+  C
Sbjct: 455 NPNSGLCLDTLQRDEKSTITLGIFACQTGGSSAQVFSLTKSGQLRREITC 504


>gi|390349674|ref|XP_003727260.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Strongylocentrotus purpuratus]
          Length = 379

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 100/307 (32%), Positives = 144/307 (46%), Gaps = 52/307 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV   WL+PLL  LA + + VV P++  I  DTF     P  L        GGF+W  +
Sbjct: 33  VEVMIGWLEPLLARLASDRTIVVMPVVDEINKDTFNYNVVPEPLQR------GGFNWRFE 86

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           + W  IP  +++  K A  P+ +P M GGL ++D++FF +LG +D G ++WGGENLE S 
Sbjct: 87  YRWKPIPNYDKRPSKVA--PIKSPAMPGGLLTMDRSFFLELGGFDLGMEVWGGENLETSL 144

Query: 124 KF-----NWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           K      +   IP  R    +++      +P    G   +D      +       ++W  
Sbjct: 145 KIWMCGGSIEIIPCSRVGHVYRDT-----SPYSFLGQNPLDIVEHNAMRV----VEVWTD 195

Query: 178 EN-------LELSFKGDFGDVTSRKELRRNLGCKSFKWYL-------------------- 210
           E+       L +    DFGDV+ RK+LR +L C  F WYL                    
Sbjct: 196 EHKYHFYDRLPMLKNRDFGDVSKRKKLRESLNCYDFNWYLANVYPELYVPSSSSVLRQTI 255

Query: 211 EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY--AGG 268
              N  S +CIDS  +     K +  + CH  GGN+++  +K GEIR DE CL+    G 
Sbjct: 256 NFQNKGSKLCIDSNDQNGQAGKNLIGWHCHNLGGNEYFEETKAGEIRNDELCLEANSVGT 315

Query: 269 DVILYPC 275
            VIL PC
Sbjct: 316 HVILNPC 322


>gi|443720685|gb|ELU10336.1| hypothetical protein CAPTEDRAFT_176696 [Capitella teleta]
          Length = 587

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 98/335 (29%), Positives = 151/335 (45%), Gaps = 60/335 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   W+QPLL  +  N   V  P+I  I  DTF     P           GGF+W L 
Sbjct: 217 CEVNVEWIQPLLSHIHGNHKRVAVPIIDIIDQDTFRYESSP--------LVRGGFNWGLF 268

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           + W  IPE   ++ ++  +P+ TPTMAGGLF++++ +F  LG YD+G D+WGGENLE+SF
Sbjct: 269 YRWDQIPESLLRKQEDYVKPIKTPTMAGGLFAMNRKYFNDLGRYDTGMDVWGGENLEISF 328

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +      + H +P            P  +P    G+ +I K        +   +  +  +
Sbjct: 329 RVWQCGGSMHILPCSRVGHIFRKRRPYGSPV---GVDTITKNSLRVAHVWMDEYIKYFFQ 385

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSG------------- 218
             + +   ++GDV+ RK LR  L C+SFKW+L+       + +D  G             
Sbjct: 386 VRKTADHAEYGDVSDRKALRNELQCQSFKWFLDNVYPEQTLPSDKEGGGLIAKGHNLIKK 445

Query: 219 ----------------MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR-RDEA 261
                           +C+ ++  P D    + L PC+     Q W  +  G +R     
Sbjct: 446 DPEVIRKAHLKHFSSTLCVVASRSPYDKKSLLELKPCNPNNKQQVWHETFEGSMRLMGVL 505

Query: 262 CLDY---AGGDVILYP----CHGSKGNQYFEYDYK 289
           CLD+   +GG    YP    CH S G+Q + + +K
Sbjct: 506 CLDFVDDSGGGNSPYPMLSKCHFSGGSQQWSWLHK 540


>gi|291243600|ref|XP_002741689.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Saccoglossus kowalevskii]
          Length = 524

 Score =  141 bits (356), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 95/287 (33%), Positives = 138/287 (48%), Gaps = 41/287 (14%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           WL+P+L  +  +  +VV+P+I  I    F          ++     GGF W +QF W  I
Sbjct: 178 WLEPMLQRIKEDRRNVVAPMIDGIDATKFSY--------AASNLIRGGFSWEMQFKWKPI 229

Query: 70  PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF---- 125
           P+ E KR K+   P+ +PTMAGGLF+IDK++F ++GTYD G +IWG ENLELSFK     
Sbjct: 230 PDYEMKRRKDETWPIRSPTMAGGLFAIDKSYFLEIGTYDPGLEIWGAENLELSFKIWMCG 289

Query: 126 -NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 184
            N   IP         A++P   P      F  +     ++   D   DI+    L+   
Sbjct: 290 GNLEMIPCSHVGHVFRASQPYKFPEGNIKTFMRNNMRVAEVWM-DEYKDIFYA--LKPQL 346

Query: 185 KG-DFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
           KG D+GDVT RKELR  L C  FKWYL+                 +++  + +     Q 
Sbjct: 347 KGEDYGDVTERKELRDRLQCHDFKWYLQ-----------------NIYPELPIPDLKVQA 389

Query: 244 GNQFWMMSKHGEIRRDEACLDYAGGDVI-LYPCHGSKGNQYFEYDYK 289
             +   + K G       C+D  G + +  +PCHG   NQ F + ++
Sbjct: 390 RGELRNLGKIG------YCMDTMGANAMCAHPCHGIGHNQMFSFSWQ 430


>gi|268580247|ref|XP_002645106.1| Hypothetical protein CBG16794 [Caenorhabditis briggsae]
          Length = 568

 Score =  141 bits (356), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 101/315 (32%), Positives = 155/315 (49%), Gaps = 44/315 (13%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           EV + WL+PL+  +A + + +++P+I NI D+ F   F  GR         GGF W L F
Sbjct: 227 EVSEGWLEPLISRVADDRTRIIAPIIDNISDEDFG--FSTGRTD-----LWGGFSWILSF 279

Query: 65  NWHAIPERERKRH-KNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
            W  +   + +R     AEP+ TPT+AGGLF+I++ +F ++G YD G ++WGGEN+E+SF
Sbjct: 280 KWFDMNGNDTQRLIAKKAEPIRTPTIAGGLFAINREYFYEMGAYDEGMEVWGGENVEISF 339

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
           +  W      E     +      T T     F+ +  F  +     +  ++W  E  E  
Sbjct: 340 RI-WMCGGSMEIHPCSHVGHVFRTKTPYS--FTKEVNFVIRRNQARTA-EVWMDEYKEFF 395

Query: 184 F-------KGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSNDWSGM 219
           F       K + GD+  RK LR  L CK FKWYL+                 + N  +G 
Sbjct: 396 FKMVPSAQKMEIGDLQERKSLRERLKCKPFKWYLKNVCSECHMPSEYHSLGAIVNKLNGK 455

Query: 220 CIDSACKPTDMHKPVGLYPC---HKQGGNQFWMMSKHGEIRRDEACL--DYAGGDVILYP 274
           C+D   +   +  P GL  C   H+Q GNQ W  + + EIR    CL  +  G ++ +  
Sbjct: 456 CVDRGGRV--LGGPPGLGTCIHSHEQQGNQVWSWTGNKEIRSQNFCLSSNKKGSELKIEM 513

Query: 275 CHGSKGNQYFEYDYK 289
           C+GS+ +Q FE++ K
Sbjct: 514 CNGSE-DQKFEFNRK 527


>gi|71896101|ref|NP_001026749.1| polypeptide N-acetylgalactosaminyltransferase 6 [Gallus gallus]
 gi|60098353|emb|CAH65007.1| hypothetical protein RCJMB04_1b1 [Gallus gallus]
          Length = 621

 Score =  141 bits (356), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 94/300 (31%), Positives = 146/300 (48%), Gaps = 33/300 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A   + VVSP I  I  +TFE   P   +    +   G FDW+L 
Sbjct: 271 CECFHGWLEPLLSRIAEEPTAVVSPDITTIDLNTFEFSKP---VQYGKQHSRGNFDWSLT 327

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P RER+R K+   P+ +PT AGGLF+I +++FE +G+YD   +IWGGEN+E+SF
Sbjct: 328 FGWEVVPPRERQRRKDETVPIKSPTFAGGLFAISRSYFEHIGSYDDQMEIWGGENVEMSF 387

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWG 176
           +          IP         +  P   P     + S ++    +  +  Y   F    
Sbjct: 388 RVWQCGGQLEIIPCSVVGHVFRSKSPHTFPK-GTQVISRNQVRLAEVWMDDYKEIFYRRN 446

Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSG 218
            +  +++ +  +GD+T R+ LR  L CK+F WYL+                  + N+ + 
Sbjct: 447 QQAAQMAREKTYGDITERRRLRERLHCKNFTWYLQNVYPEMFVPDLNPTSYGAIKNEGTN 506

Query: 219 MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVILYPC 275
            C+D   +     KP+ +YPCH  GGNQ++  +   ++R +   + CL    G V L  C
Sbjct: 507 SCLDVG-ENNHGGKPLIMYPCHGMGGNQYFEYTTQRDLRHNVGKQLCLRAGAGPVQLGEC 565


>gi|221130543|ref|XP_002162500.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Hydra magnipapillata]
          Length = 578

 Score =  141 bits (355), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 107/324 (33%), Positives = 150/324 (46%), Gaps = 59/324 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +  N   V SP+I  I  + F       +  SS     GGF WNL 
Sbjct: 233 CECNEMWLEPLLQAIKDNRKIVASPIIDVIGHEDF-------KYLSSSSDLRGGFGWNLN 285

Query: 64  FNWHAIPERERKRHKNAAEP-VWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +P     +H+      + +P +AGGLFSI K++FE+LG YD   D+WGGENLE+S
Sbjct: 286 FKWDFLPPNHLIKHQQDGTAFILSPVIAGGLFSIHKSWFEELGKYDPQMDVWGGENLEIS 345

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+        + IP        R RH     P   P   GG  ++    F+K        
Sbjct: 346 FRTWQCGGEMYIIPCSRVGHVFRDRH-----PYKFP---GGSMNV----FQK--NTRRAA 391

Query: 173 DIWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSND--- 215
           ++W  +  +  F          FGD+  R +LR++L CKSFKWYLE       V +D   
Sbjct: 392 EVWMDDYKKYYFAAVPSARYSLFGDIRDRLQLRKDLNCKSFKWYLENIYPELKVPDDDVI 451

Query: 216 ---WSGMCIDSACKPTDMH---KPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD 269
                   +   C  T  H   + +GL+PCH QGGNQ W  +K  +I+ +  CL      
Sbjct: 452 KYGQIKYKVSEDCLDTMGHIKGEGIGLFPCHGQGGNQDWSWTKSNQIKHESLCLSGISKK 511

Query: 270 ----VILYPCHGSKGNQYFEYDYK 289
               V + PC  +   Q ++YD K
Sbjct: 512 SEEIVRMVPCVATDNFQKWKYDEK 535


>gi|326508656|dbj|BAJ95850.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 637

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 104/305 (34%), Positives = 151/305 (49%), Gaps = 38/305 (12%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           WL+ LL  + ++ + VV P+I  I DD F        LT S     GGF+W L F W+ +
Sbjct: 294 WLEYLLYEVKKDRTAVVCPIIDVINDDDF------AYLTGS-DMTWGGFNWRLNFRWYPV 346

Query: 70  PERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWH 128
           P RE  +R+ + + P+ +PTMAGGLF+ID+ +F ++G YD G ++WGGENLE+SF+  W 
Sbjct: 347 PNREEVRRNYDHSLPLLSPTMAGGLFTIDRKYFYEIGAYDPGMEVWGGENLEMSFRV-WQ 405

Query: 129 AIPERERK--RHKNAAEPVWTP-TMAGG----LFSIDKAFFEKLGTYDSGFDIWGGENLE 181
              +       H        TP T  GG    +F  +K   E     D   D       E
Sbjct: 406 CGGKVLIHPCSHVGHVFRKQTPYTFPGGTGKVIFHNNKRLVEVW--LDKYKDFVYAIMPE 463

Query: 182 LSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCID----SACKPTD-------- 229
           L    D GDV+ R  LR  L CK F+WYL+     S M +D     A +  D        
Sbjct: 464 LK-NVDAGDVSERLALRERLQCKDFRWYLQNIYPESSMPVDFHHVGALRNQDHGCADSLG 522

Query: 230 ------MHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVI-LYPCHGSKGNQ 282
                 +++  G++PCH QGGNQ  + SK GE++ D+ C++ +    + L  C      Q
Sbjct: 523 YDSENGVNQNAGIFPCHNQGGNQIVVFSKSGELKFDDLCMEGSKNSAVKLQKCTEGNQKQ 582

Query: 283 YFEYD 287
            +EY+
Sbjct: 583 VWEYN 587


>gi|410955524|ref|XP_003984401.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Felis
           catus]
          Length = 552

 Score =  140 bits (354), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 101/320 (31%), Positives = 143/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  D F           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIISLDNFNY-------IESAAELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF +DK++FE LG YD+  DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFEYLGKYDTDMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR ELR+NL C+SFKWYLE       V ND S  
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNVESRLELRKNLHCQSFKWYLENVYPELRVPNDSSIQ 420

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 421 KGTIRQRQKCLESQRQRNTEIYNLRLSPCVKIKGEDAKSQIWAFTYTQQILQEELCLSVV 480

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 481 TIFPGAPVVLVLCKNGDDRQ 500


>gi|405977048|gb|EKC41520.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Crassostrea gigas]
          Length = 635

 Score =  140 bits (353), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 101/313 (32%), Positives = 149/313 (47%), Gaps = 58/313 (18%)

Query: 11  LQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIP 70
           L+P+L  +    + VV P++  I   T E     G       + +GGF W+L F W  +P
Sbjct: 294 LEPILSRIKEFPNSVVCPIVDAIDAHTLEYSKNGG-------YQVGGFSWSLHFTWRDVP 346

Query: 71  ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA- 129
            R+   H+   +PV +PTMAGGLF+ D+ FF ++G YD G D+WGGENLE+SF+  W   
Sbjct: 347 SRDLV-HRKYTDPVGSPTMAGGLFAADRKFFFEIGAYDPGMDVWGGENLEISFR-TWMCG 404

Query: 130 -----IPERERKRHKNAAEPVWTP--TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 182
                IP         A+ P   P      G+ S+  A            ++W  E   L
Sbjct: 405 GKLEFIPCSRVGHIFRASHPYTFPGNKDTHGINSMRLA------------EVWMDEYKRL 452

Query: 183 SFK-------GDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWS 217
            +         D+GD++ R ELR+ L CKSFKW+L+                  V N  S
Sbjct: 453 FYTHRKDLLGQDYGDISERVELRKRLNCKSFKWFLDNVYPEKFIPDENVHAWGMVRNPPS 512

Query: 218 GMCIDSACKPTDMHKPVGLYPCHK-QGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYP 274
            +C+D+  K       +G+Y C      N+ + +S + E+RR+EACL     GG V L  
Sbjct: 513 NLCLDTLQKDEKTVFDMGIYSCQNGASANEVFSLSINDELRREEACLTVVSEGGRVPLES 572

Query: 275 CHGSKGNQYFEYD 287
           C G+  NQ +++D
Sbjct: 573 CTGA-ANQKWKHD 584



 Score =  110 bits (275), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 67/196 (34%), Positives = 97/196 (49%), Gaps = 29/196 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+P+L  +  + + V+ P I  I  +T +          +  F +GGF W+L 
Sbjct: 219 CETNTGWLEPMLARIKEDRTAVLCPEIDLIDKNTLQY-------GGTGSFSVGGFWWSLH 271

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLF------------SIDKAFFE--KLGTYDS 109
           F+W  IPE E+KR  +   P+    +   +             +ID    E  K G Y  
Sbjct: 272 FSWRPIPEHEQKRRSSGIAPIRLEPILSRIKEFPNSVVCPIVDAIDAHTLEYSKNGGYQV 331

Query: 110 GFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD 169
           G   W       S  F W  +P R+   H+   +PV +PTMAGGLF+ D+ FF ++G YD
Sbjct: 332 GGFSW-------SLHFTWRDVPSRDLV-HRKYTDPVGSPTMAGGLFAADRKFFFEIGAYD 383

Query: 170 SGFDIWGGENLELSFK 185
            G D+WGGENLE+SF+
Sbjct: 384 PGMDVWGGENLEISFR 399


>gi|198415534|ref|XP_002121475.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
           2, partial [Ciona intestinalis]
          Length = 582

 Score =  140 bits (353), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 104/322 (32%), Positives = 145/322 (45%), Gaps = 63/322 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E  K WL+PLL  +A + + VV P+I  I  D FE       L        GGFDWNL 
Sbjct: 237 VECNKNWLEPLLQRIADDRTAVVCPIIDVINMDNFEYIGASADLR-------GGFDWNLV 289

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +   ER+ R  N   P+ TP +AGGLFS+DK++F +LG YD+  D+WGGENLE+S
Sbjct: 290 FKWDYMSSEERRSRAGNPTAPISTPMIAGGLFSMDKSYFNQLGKYDTAMDVWGGENLEIS 349

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+          IP        RK+H     P   P  +G +F+ +              
Sbjct: 350 FRVWQCGGRLEIIPCSRVGHVFRKQH-----PYTFPGGSGNVFTRNTR---------RAA 395

Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS- 217
           ++W  +  E  +          FG++ +R ++R    CK FKWYLE       V +  S 
Sbjct: 396 EVWMDDYKEYYYAAVPSAKLIPFGNIENRLQIRVRNQCKPFKWYLENVYPELRVPSKESV 455

Query: 218 ---------GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA-- 266
                      CID+     +    +GLY CH  GGNQ + M+K  +IR  + C      
Sbjct: 456 AFGSIKQGVNKCIDTLGHVQE--GSIGLYECHDSGGNQEFSMNKEMQIRHQDLCFTAGEG 513

Query: 267 ---GGDVILYPCHGSKGNQYFE 285
              G  + L  C  +   Q FE
Sbjct: 514 AREGSIIKLRHCDENNTMQKFE 535


>gi|268569766|ref|XP_002648333.1| C. briggsae CBR-GLY-4 protein [Caenorhabditis briggsae]
          Length = 523

 Score =  140 bits (352), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 101/322 (31%), Positives = 150/322 (46%), Gaps = 68/322 (21%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           E  ++WL+PLL  +A N   VV+P+I  I  D F        L        GGFDW L F
Sbjct: 171 ECNQKWLEPLLSRIAENPKAVVAPIIDVINVDNFNYVGASADLR-------GGFDWTLVF 223

Query: 65  NWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
            W  + E  RK RH +   P+ +PTMAGGLF+I K +FE+LGTYD   ++WGGENLE+SF
Sbjct: 224 RWEFMNEELRKDRHAHPTAPIKSPTMAGGLFAISKEWFEELGTYDLDMEVWGGENLEMSF 283

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H+         T  GG  ++    F+K        +
Sbjct: 284 RVWQCGGSLEILPCSRVGHVFRKKHQY--------TFPGGSGNV----FQK--NTRRAAE 329

Query: 174 IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV-------------- 212
           +W  E   +  K        ++GD+  R  +R  L CKSFKWYL+               
Sbjct: 330 VWMDEYKAIYLKNVPSARFVNYGDIGDRLAIRDRLQCKSFKWYLDTVYPQLASLTRNVSS 389

Query: 213 -SNDWS-------GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWM---MSKHGEIRRDEA 261
             + W         +C+DS  +  +  +   L+ CH  GGNQ W+   ++K  +    + 
Sbjct: 390 QKDAWQIAPMKIGHLCLDSMARKEN--EAPALFACHGTGGNQEWIFDDLTKTFKNAISQM 447

Query: 262 CLDYAG--GDVILYPCHGSKGN 281
           CLD++    DV++  C   + N
Sbjct: 448 CLDFSAEKKDVVMVKCENLRSN 469


>gi|307207692|gb|EFN85329.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Harpegnathos
           saltator]
          Length = 598

 Score =  140 bits (352), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 104/314 (33%), Positives = 145/314 (46%), Gaps = 45/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WLQPLL  +    + V+ P+I NI ++T E          +  F +GGF W+  
Sbjct: 240 CEVIKDWLQPLLQRIKEKRNAVLMPIIDNISEETLEYFHD----NEASFFQVGGFTWSGH 295

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  I + E K   +   P  +PTMAGGLF+ID+ +F ++G+YD   D WGGENLE+SF
Sbjct: 296 FTWINIQKHELKSRLSLISPTRSPTMAGGLFAIDRKYFWEVGSYDDKMDGWGGENLEMSF 355

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPT--MAGGLFSIDKAFFEKLGTYDSGFDIWG 176
           +          IP            P   P      G+ +   AF   +  Y   F +  
Sbjct: 356 RIWQCGGTLEIIPCSRVGHIFRNFHPYKFPNDKDTHGINTARLAFVW-MDEYKRLFLLHR 414

Query: 177 GENLELSFKGD---FGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
            E     FK     FGD++ R +LRR L CKSFKWYL+                  V   
Sbjct: 415 SE-----FKNKSSLFGDISERLKLRRKLKCKSFKWYLDNIYPEKFIPDEHAIAYGRVRLR 469

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCH-KQGGNQFWMMSKHGEIRRDEACLDYAGGD----- 269
              +C+D+  +  D    +GLY CH K   +QF+ +S  GE+RRD+ C      D     
Sbjct: 470 NRLLCLDNLQRDEDKPYNLGLYSCHSKLYPSQFFSLSNSGELRRDDNCARVNADDSRVHT 529

Query: 270 -VILYPCHGSKGNQ 282
            V +  C+  KG +
Sbjct: 530 QVEMSDCNNEKGGK 543


>gi|345782166|ref|XP_540140.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Canis
           lupus familiaris]
          Length = 552

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 98/320 (30%), Positives = 143/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  D F           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIISLDNFNY-------IESAAELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  + AEP+ TP +AGGLF +DK++F  LG YD+  DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPAEPIRTPIIAGGLFVMDKSWFNYLGKYDTDMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG++ SR +LR+NL C+SFKWYLE       + ND S  
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNIESRLDLRKNLQCQSFKWYLENVYPELRIPNDSSIQ 420

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 421 KGNIRQRQKCLESQRQKNTEIYDLRLSPCVKTKGKDAKSQIWAFTYTQQILQEELCLSVV 480

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 481 TVFPGAPVVLVVCKNGDDKQ 500


>gi|326434666|gb|EGD80236.1| polypeptide N-acetylgalactosaminyltransferase 13 [Salpingoeca sp.
           ATCC 50818]
          Length = 641

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 106/326 (32%), Positives = 155/326 (47%), Gaps = 67/326 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+P+LD++A N + VV+P+I  I   T E      + TS+    +G FDW L 
Sbjct: 299 CEANQGWLEPILDIIATNRTTVVTPVIDTIDHRTMEY----AKWTSNIPS-VGTFDWTLD 353

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNW +   R  ++     +P+ +PTMAGGLF+ID+ +F ++G+YD   D WGGEN+E+SF
Sbjct: 354 FNWKSGVLRPGQK---LTDPIDSPTMAGGLFAIDRDYFYEIGSYDEDMDGWGGENVEMSF 410

Query: 124 KFNWHAIPE-------------RERKRHKNAAEPVWTPTMAGGLFSID------KAFFEK 164
           +  W                  R+   +K   + +    M   +   +      K FF  
Sbjct: 411 RI-WQCGGRLVTAPCSHVGHIFRDTHPYKVPGKGIHHTFMKNSMRLAEVWMDDYKQFF-- 467

Query: 165 LGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW------- 216
              YD+       EN+      D GD+T RK LR  L CK FKWYL+ V  D        
Sbjct: 468 ---YDTKPK---RENI------DIGDLTKRKALRERLKCKPFKWYLKHVLPDLFVPDSEH 515

Query: 217 ----------SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRR-DEACLDY 265
                     +G+C+D            G++ CH +GGNQ WM + + EIR  D  CLD 
Sbjct: 516 VLHKGALRAGNGLCLDKMGHRAGGQ--AGVFSCHGEGGNQGWMYTVNDEIRTADSLCLDV 573

Query: 266 AGGD----VILYPCHGSKGNQYFEYD 287
                   + L  CH  +GNQ ++Y+
Sbjct: 574 YSSKFPAPIHLQRCHQKQGNQAWKYE 599


>gi|312094065|ref|XP_003147897.1| hypothetical protein LOAG_12336 [Loa loa]
          Length = 560

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 99/319 (31%), Positives = 145/319 (45%), Gaps = 46/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + +N   +  P+I  I  D +  R      +S  K + G F+W L 
Sbjct: 211 CEVNINWLPPLLAPIRQNRKVMTVPVIDGIDKDDWSYRI---VYSSVDKHYRGIFEWGLL 267

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP +E  R K+++EP  +PT AGGLF+I K +FE+LG YD G  IWGGE  ELSF
Sbjct: 268 YKETEIPAQELLRRKHSSEPFRSPTHAGGLFAISKKWFEELGYYDPGLQIWGGEQYELSF 327

Query: 124 KFNWHA------IPERERKRHKNAAEPV------WTPTMAGGLFSIDKAFFEKLGTYDSG 171
           K  W        IP         +  P         P ++  +  + K + ++   Y   
Sbjct: 328 KI-WQCGGGILFIPCSHVGHVYRSHMPYGFGKLSGKPVISTNMLRVIKTWMDEYEKYYYI 386

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL--------------------- 210
            +      L        GD++S+ +LR  L CKSF+WY+                     
Sbjct: 387 REPSAKHRLP-------GDISSQLKLRERLKCKSFEWYMEKVAYDVIVSYPLPPENHVWG 439

Query: 211 EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDV 270
           E  N  +G CID+  +   +   VG  PCH  GGNQ   ++K G++ + E C+   GG++
Sbjct: 440 EAKNHATGKCIDTIGQ--TIPGIVGAMPCHGYGGNQLIRLNKEGQLTQGEWCITPVGGNL 497

Query: 271 ILYPCHGSKGNQYFEYDYK 289
           +   C     +  F YD K
Sbjct: 498 VTKYCVKGTVDGPFAYDEK 516


>gi|393911317|gb|EFO16172.2| hypothetical protein LOAG_12336 [Loa loa]
          Length = 562

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 99/319 (31%), Positives = 145/319 (45%), Gaps = 46/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + +N   +  P+I  I  D +  R      +S  K + G F+W L 
Sbjct: 213 CEVNINWLPPLLAPIRQNRKVMTVPVIDGIDKDDWSYRI---VYSSVDKHYRGIFEWGLL 269

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP +E  R K+++EP  +PT AGGLF+I K +FE+LG YD G  IWGGE  ELSF
Sbjct: 270 YKETEIPAQELLRRKHSSEPFRSPTHAGGLFAISKKWFEELGYYDPGLQIWGGEQYELSF 329

Query: 124 KFNWHA------IPERERKRHKNAAEPV------WTPTMAGGLFSIDKAFFEKLGTYDSG 171
           K  W        IP         +  P         P ++  +  + K + ++   Y   
Sbjct: 330 KI-WQCGGGILFIPCSHVGHVYRSHMPYGFGKLSGKPVISTNMLRVIKTWMDEYEKYYYI 388

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL--------------------- 210
            +      L        GD++S+ +LR  L CKSF+WY+                     
Sbjct: 389 REPSAKHRLP-------GDISSQLKLRERLKCKSFEWYMEKVAYDVIVSYPLPPENHVWG 441

Query: 211 EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDV 270
           E  N  +G CID+  +   +   VG  PCH  GGNQ   ++K G++ + E C+   GG++
Sbjct: 442 EAKNHATGKCIDTIGQ--TIPGIVGAMPCHGYGGNQLIRLNKEGQLTQGEWCITPVGGNL 499

Query: 271 ILYPCHGSKGNQYFEYDYK 289
           +   C     +  F YD K
Sbjct: 500 VTKYCVKGTVDGPFAYDEK 518


>gi|72000997|ref|NP_001024216.1| Protein GLY-4, isoform a [Caenorhabditis elegans]
 gi|51316004|sp|Q8I136.2|GALT4_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
           Short=pp-GaNTase 4; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 4; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 4
 gi|3047189|gb|AAC13670.1| GLY4 [Caenorhabditis elegans]
 gi|11064525|emb|CAC14394.1| Protein GLY-4, isoform a [Caenorhabditis elegans]
          Length = 589

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 99/301 (32%), Positives = 140/301 (46%), Gaps = 62/301 (20%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           E  ++WL+PLL  +A N   VV+P+I  I  D F        L        GGFDW L F
Sbjct: 243 ECNQKWLEPLLARIAENPKAVVAPIIDVINVDNFNYVGASADLR-------GGFDWTLVF 295

Query: 65  NWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
            W  + E+ RK RH +   P+ +PTMAGGLF+I K +F +LGTYD   ++WGGENLE+SF
Sbjct: 296 RWEFMNEQLRKERHAHPTAPIRSPTMAGGLFAISKEWFNELGTYDLDMEVWGGENLEMSF 355

Query: 124 KFNWHAIPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           +  W      E           RK+H     P   P  +G +F  +              
Sbjct: 356 RV-WQCGGSLEIMPCSRVGHVFRKKH-----PYTFPGGSGNVFQKNTR---------RAA 400

Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-------------- 211
           ++W  E   +  K        +FGD+T R  +R  L CKSFKWYLE              
Sbjct: 401 EVWMDEYKAIYLKNVPSARFVNFGDITDRLAIRDRLQCKSFKWYLENVYPQLEIPRKTPG 460

Query: 212 --VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYA 266
                    +C+DS  +  +   P GL+ CH  GGNQ W+  +  +  ++   + CLD++
Sbjct: 461 KSFQMKIGNLCLDSMAR-KESEAP-GLFGCHGTGGNQEWVFDQLTKTFKNAISQLCLDFS 518

Query: 267 G 267
            
Sbjct: 519 S 519


>gi|260794623|ref|XP_002592308.1| hypothetical protein BRAFLDRAFT_206872 [Branchiostoma floridae]
 gi|229277524|gb|EEN48319.1| hypothetical protein BRAFLDRAFT_206872 [Branchiostoma floridae]
          Length = 374

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 83/219 (37%), Positives = 114/219 (52%), Gaps = 32/219 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLLD + +N SHVV+P+I  I   TFE R             + GFDW L 
Sbjct: 158 CECNIGWLEPLLDRIVQNRSHVVTPVIDVIDFKTFEYRHLA-------IIQVRGFDWRLI 210

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP    KR   + +P+ +PTMAGGLF+IDK +F  LG YD+G +IWGGENLELSF
Sbjct: 211 FRWEKIPASYEKRRGLSVDPILSPTMAGGLFAIDKEYFHHLGLYDTGMEIWGGENLELSF 270

Query: 124 KF-----NWHAIP-------ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           +          +P        R+R  ++ + E          L  + + + ++   Y   
Sbjct: 271 RIWQCGGTLEIMPCSRVGHVFRQRFPYQTSTE-----VTTRNLMRVAEVWMDQYKEY--- 322

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL 210
              +   +++   K  FGDVT R+ELRR L C+ F WYL
Sbjct: 323 --FYQIRHIK---KKSFGDVTERQELRRRLQCRDFHWYL 356


>gi|350582569|ref|XP_003481303.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
           [Sus scrofa]
          Length = 552

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 98/320 (30%), Positives = 144/320 (45%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF+          S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFDY-------IESATELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF +DK++F+ LG YD+  DIWGGEN E+SF
Sbjct: 255 FQWEQLTPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFDYLGKYDTDMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360

Query: 174 IWGGE-------NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E       +   + +  FG++ SR +LRRNL C+SFKWYLE       +  D S  
Sbjct: 361 VWMDEYKQYYYASRPFALERPFGNIESRLDLRRNLQCQSFKWYLENVYPELRIPKDSSIQ 420

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 421 KGNIRQRQKCLESQKQKDQEISNLRLSPCVKIEGKDAKSQIWAFTYTQQILQEELCLSVI 480

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 481 TLFPGAPVVLVLCKNGDDRQ 500


>gi|390347275|ref|XP_003726736.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Strongylocentrotus purpuratus]
          Length = 507

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 102/315 (32%), Positives = 146/315 (46%), Gaps = 49/315 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD + RNS+ VVSP I +I D  F   F  G   +      GGF W  +
Sbjct: 145 CECTEGWLEPLLDCINRNSTRVVSPAIDSISDTDFSYTFIRGIART------GGFSWFPE 198

Query: 64  FNWHAIPERERKRH-KNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W   P+RE KR  ++AA P+ TPT+AGGLF+ID+ FF+ LG YD    +WG ENLELS
Sbjct: 199 FMWTHAPQREMKRVWQDAATPLRTPTIAGGLFAIDRKFFKSLGYYDPELHVWGSENLELS 258

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           FK      +   +P   R  H   ++P +     G   ++       L       ++WGG
Sbjct: 259 FKVWQCGGSLEVVP-CSRVGHVFRSKPPY--DFPGNPETV------LLRNNKRVLEVWGG 309

Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VS 213
           +   L +         D GD++SR  +R  L CK+F+WYLE                   
Sbjct: 310 QIKHLFYGLTPEYQAVDAGDISSRIRIRDELKCKNFEWYLENVYPENILPLNFQALGRFM 369

Query: 214 NDWSGMCID--SACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD-- 269
           N+   +CID   A     M   + +  C +    Q +  +   ++R D  C+    GD  
Sbjct: 370 NEGVNLCIDVLHATDGRRMGAHLAVNACREGALAQTFSWNDLSQLRHDRFCITAVEGDNH 429

Query: 270 VILYPCHGSKGNQYF 284
           V+L  C     N+  
Sbjct: 430 VMLLECQDVHYNRLL 444


>gi|224047294|ref|XP_002195048.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
           [Taeniopygia guttata]
          Length = 552

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 103/321 (32%), Positives = 143/321 (44%), Gaps = 64/321 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WL PLL  +  + + VVSP+I  I  DTF          ++     GGFDW+L 
Sbjct: 202 CEVNKDWLLPLLQRIKEDPTRVVSPVIDIINLDTFAY-------VAASSDLRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ +  +  EP+ TP +AGGLF IDKA+F  LG YDS  DIWGGEN E+SF
Sbjct: 255 FKWEQLSPEQKAKRLDPTEPIKTPIIAGGLFVIDKAWFNHLGKYDSAMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   IP        RK+H     P   P      +         +       +
Sbjct: 315 RVWMCGGSLEIIPCSRVGHVFRKKH-----PYVFPEGNANTY---------IKNTKRTAE 360

Query: 174 IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE----------VSNDW 216
           +W  E  +  +          FG++ SR ELR+ L C SFKWYLE           S   
Sbjct: 361 VWMDEFKQYYYAARPAAQGRPFGNIQSRVELRKKLKCHSFKWYLENVYPELRIPKESLYQ 420

Query: 217 SGM------CIDSACKPTDMHKPV-GLYPCHKQGG----NQFWMMSKHGEIRRDEACLD- 264
           +G+      C++S  K  D   P+  L PC+   G     Q W  + +  IR+ + CL  
Sbjct: 421 TGIIRQRQSCLESH-KSEDQEFPILSLTPCNSSKGIVPKAQEWTYTYNHHIRQQQLCLSV 479

Query: 265 ---YAGGDVILYPCHGSKGNQ 282
              + G  V+L PC      Q
Sbjct: 480 YTLFPGSQVLLSPCKEGDNKQ 500


>gi|197099330|ref|NP_001124852.1| polypeptide N-acetylgalactosaminyltransferase 14 [Pongo abelii]
 gi|55726129|emb|CAH89838.1| hypothetical protein [Pongo abelii]
          Length = 552

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGHANTY---------IKNTKRTAE 360

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 420

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 421 KGNIRQRQKCLESQRQSNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 480

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 481 TLFPGAPVVLVLCKNGDDRQ 500


>gi|426335177|ref|XP_004029109.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           1 [Gorilla gorilla gorilla]
          Length = 552

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 420

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 421 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQIWAFTYTQQILQEELCLSVI 480

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 481 TLFPGAPVVLVLCKNGDDRQ 500


>gi|327274386|ref|XP_003221958.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Anolis carolinensis]
          Length = 608

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 104/329 (31%), Positives = 144/329 (43%), Gaps = 66/329 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNELWLQPLLTPIRESRKTVVCPVIDIISADTLTY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E +  + A  P+ +PTMAGGLF++D+ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELEGPEGATAPIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISF 359

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RKR +    P    TMA     +   +       D   D
Sbjct: 360 RIWMCGGKLLIIPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWM------DEYKD 412

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
            +     EL  + ++G++T R ELR+ L CKSFKWYL+                      
Sbjct: 413 QYFALRPELRMR-NYGNITDRVELRKKLNCKSFKWYLDNIYPEMQISGSNAKVQPPLFFN 471

Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK-HGEIR 257
                        + +  S  C+ +   P+     V +  C     NQ W+ ++ H  I 
Sbjct: 472 KGQKRPKTLQRGRLRHLQSDKCLVAQGHPSQKGGLVVVRECDYSDQNQVWLYNEDHELIL 531

Query: 258 RDEACLDYAGGDVI----LYPCHGSKGNQ 282
            +  CLD +         L  CHGS G+Q
Sbjct: 532 NNLLCLDVSETRTSDPPRLMKCHGSGGSQ 560


>gi|153792142|ref|NP_001093363.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 16 [Xenopus laevis]
 gi|148744516|gb|AAI42582.1| LOC100101309 protein [Xenopus laevis]
          Length = 563

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 97/319 (30%), Positives = 140/319 (43%), Gaps = 62/319 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  + + VVSP+I  I  D F        L        GGFDW+L 
Sbjct: 222 CEVNNEWLQPLLQRVKDDHTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 274

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +    + TP +AGG+F IDK++F +LG YD+  DIWGGEN ELSF
Sbjct: 275 FKWEQIPIEQKMSRTDPTSSIRTPVIAGGIFVIDKSWFNQLGKYDTQMDIWGGENFELSF 334

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P         D      +       +
Sbjct: 335 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYEFP---------DGNALTYIKNTKRTVE 380

Query: 174 IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-------------VS 213
           +W  E  +  ++         +G V  R ELR+ L CKSF+WYL+             +S
Sbjct: 381 VWMDEYKQYYYQARPSAIGKSYGSVADRAELRKKLSCKSFQWYLQNVYPELKVPEKEVIS 440

Query: 214 N--DWSGMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA- 266
                 G C++S  + T  + P+ L  C     +    Q W +S +  IR+ + CL  + 
Sbjct: 441 GLIKQGGNCLESQTRDTTGNNPIMLTQCKGSANSAPAAQEWALSDNV-IRQQDRCLTISS 499

Query: 267 ---GGDVILYPCHGSKGNQ 282
              G  V++ PC+     Q
Sbjct: 500 FSTGALVMMEPCNQKDSRQ 518


>gi|426335179|ref|XP_004029110.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           2 [Gorilla gorilla gorilla]
          Length = 532

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 182 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 234

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 235 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 294

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 295 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 340

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 341 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 400

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 401 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQIWAFTYTQQILQEELCLSVI 460

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 461 TLFPGAPVVLVLCKNGDDRQ 480


>gi|307173963|gb|EFN64693.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Camponotus
           floridanus]
          Length = 597

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 99/288 (34%), Positives = 140/288 (48%), Gaps = 39/288 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WLQPLL  +  N + V+ P+I NI ++T E          +  F +GGF W+  
Sbjct: 240 CEVIKDWLQPLLQRIKDNKNAVLMPIIDNISEETLEYFHD----NEASFFQVGGFTWSGH 295

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  I + E +   +   P  +PTMAGGLF+I++ +F ++G+YD   D WGGENLE+SF
Sbjct: 296 FTWINIQKHEVESRPSPISPTRSPTMAGGLFAINRKYFWEIGSYDDKMDGWGGENLEMSF 355

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPT--MAGGLFSIDKAFFEKLGTYDSGFDIWG 176
           +          IP            P   P      G+ +   AF    G Y   F +  
Sbjct: 356 RIWQCGGTLEIIPCSRVGHIFRNFHPYKFPNDKDTHGINTARLAFVWMDG-YKRLFLLHR 414

Query: 177 GENLELSFKGD---FGDVTSRKELRRNLGCKSFKWYLE------------------VSND 215
            E     FK +   FGDV+ R ELR+ L CKSFKWYL+                  V   
Sbjct: 415 SE-----FKDNPKLFGDVSERLELRKRLKCKSFKWYLDNIYPEKFIPDEDAVAYGRVRLR 469

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCH-KQGGNQFWMMSKHGEIRRDEAC 262
              +C+D+  +  D    +GLY CH K   +QF+ +S  GE+R+D++C
Sbjct: 470 NKPLCLDNLQQEEDKPYNLGLYTCHSKLYPSQFFSLSNAGELRKDDSC 517


>gi|426335181|ref|XP_004029111.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           3 [Gorilla gorilla gorilla]
          Length = 517

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 167 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 219

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 220 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 279

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 280 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 325

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 326 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 385

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 386 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQIWAFTYTQQILQEELCLSVI 445

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 446 TLFPGAPVVLVLCKNGDDRQ 465


>gi|426335183|ref|XP_004029112.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           4 [Gorilla gorilla gorilla]
          Length = 557

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 96/315 (30%), Positives = 138/315 (43%), Gaps = 52/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 207 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 259

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 260 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 319

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +      +   +P            P   P      +         +       ++W  E
Sbjct: 320 RVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTY---------IKNTKRTAEVWMDE 370

Query: 179 NLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS------- 217
             +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S       
Sbjct: 371 YKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQKGNIR 430

Query: 218 --GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAG 267
               C++S  +       + L PC K  G    +Q W  +   +I ++E CL     + G
Sbjct: 431 QRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQIWAFTYTQQILQEELCLSVITLFPG 490

Query: 268 GDVILYPCHGSKGNQ 282
             V+L  C      Q
Sbjct: 491 APVVLVLCKNGDDRQ 505


>gi|387017710|gb|AFJ50973.1| Polypeptide N-acetylgalactosaminyltransferase 11-like [Crotalus
           adamanteus]
          Length = 608

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 103/329 (31%), Positives = 143/329 (43%), Gaps = 66/329 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNEMWLQPLLTPIQESRRTVVCPVIDIISADTLTY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E +  + A  P+ +PTMAGGLF++D+ +F  LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLLEMEGPEQATAPIKSPTMAGGLFAMDREYFNALGQYDSGMDIWGGENLEISF 359

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RKR +    P    TMA     +   +       D   +
Sbjct: 360 RIWMCGGKLVIIPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWM------DEYKE 412

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
            +     EL  + ++G++T R ELR+ L CKSFKWYL+                      
Sbjct: 413 QYFALRPELRTR-NYGNITDRVELRKKLNCKSFKWYLDNVYPEMQISGPNAKVQPPIFFN 471

Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK-HGEIR 257
                        + +  +  C+ +   P+     V +  C     NQ WM ++ H  I 
Sbjct: 472 KGQKRPKLLQQGRLYHLQTNKCLVAQSNPSQKGGLVVVKECDYSNKNQIWMYNEDHELIL 531

Query: 258 RDEACLDYAGGDVI----LYPCHGSKGNQ 282
            +  CLD +         L  CHGS G+Q
Sbjct: 532 NNLLCLDVSETRTSDPPRLMKCHGSGGSQ 560


>gi|156397428|ref|XP_001637893.1| predicted protein [Nematostella vectensis]
 gi|156225009|gb|EDO45830.1| predicted protein [Nematostella vectensis]
          Length = 398

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 92/272 (33%), Positives = 130/272 (47%), Gaps = 47/272 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  N S VV+P I  I   TF      G          G F+W L 
Sbjct: 145 CEANLGWLEPLLARIGENRSIVVTPDIEVIDLRTFGYTHEHGANNR------GIFNWELT 198

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IPE ER+R K+ ++P+ +PTMAGGLF+IDK++F ++G+YD+    WGGEN+E+SF
Sbjct: 199 FKWRGIPEYERRRRKSDSDPIRSPTMAGGLFAIDKSYFYEIGSYDTEMSFWGGENVEISF 258

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG----FDIWGGE- 178
           +  W              +  +   +  G +F   + +    G  D       ++W  + 
Sbjct: 259 RI-WMC----------GGSLEIIPCSKVGHVFRESQPYKIGEGAIDRNNMRLAEVWMDDY 307

Query: 179 -----NLELSFKG-DFGDVTSRKELRRNLGCKSFKWYL------------------EVSN 214
                 +    KG D+GDV+ RK LR  L CKSFKWYL                  E+ N
Sbjct: 308 KKIFYAMRPQLKGKDYGDVSGRKALRERLMCKSFKWYLDNVISELAIPDLYPIGRGEIRN 367

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQ 246
             +  C+D+  K     +P G+Y CH  G NQ
Sbjct: 368 LGTNTCLDTLAKNEAGGEP-GMYMCHGMGNNQ 398


>gi|60498976|ref|NP_078848.2| polypeptide N-acetylgalactosaminyltransferase 14 isoform 1 [Homo
           sapiens]
 gi|51316071|sp|Q96FL9.1|GLT14_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 14;
           AltName: Full=Polypeptide GalNAc transferase 14;
           Short=GalNAc-T14; Short=pp-GaNTase 14; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 14;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 14
 gi|14714999|gb|AAH10659.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 (GalNAc-T14) [Homo
           sapiens]
 gi|21749654|dbj|BAC03634.1| unnamed protein product [Homo sapiens]
 gi|28268674|dbj|BAC56889.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 [Homo sapiens]
 gi|37182635|gb|AAQ89118.1| RRLT2434 [Homo sapiens]
 gi|119620891|gb|EAX00486.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
           isoform CRA_a [Homo sapiens]
 gi|325463357|gb|ADZ15449.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 (GalNAc-T14)
           [synthetic construct]
 gi|345500006|emb|CAA70505.4| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 14 [Homo
           sapiens]
          Length = 552

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKESSIQ 420

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 421 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 480

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 481 TLFPGAPVVLVLCKNGDDRQ 500


>gi|417403257|gb|JAA48441.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
           rotundus]
          Length = 608

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 109/348 (31%), Positives = 149/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNTMWLQPLLATIQEDRRTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  E      A  P+ +PTMAGGLF++++ +F++LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLIPPSELGGPGGATAPIKSPTMAGGLFAMNRDYFDELGRYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGRDTMA 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELRR LGCKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLRTRSYGNISERVELRRKLGCKSFKWYLDNIYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +G C+ +  +P+     V L  C    
Sbjct: 457 ISGPNAKPQQPLFINRGPKRPKVLQRGRLYHLQTGKCLVAQGRPSQKGGLVVLKACDYSD 516

Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ + +H  +  +  CLD     +     L  CHGS G+Q + +
Sbjct: 517 PNQVWIYNEEHELVLSNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|170038569|ref|XP_001847121.1| N-acetyl galactosaminyl transferase [Culex quinquefasciatus]
 gi|167882320|gb|EDS45703.1| N-acetyl galactosaminyl transferase [Culex quinquefasciatus]
          Length = 541

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 102/308 (33%), Positives = 145/308 (47%), Gaps = 39/308 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WLQP LD +AR+ + +  P I  I  D   L F   +    Y    G  DW  Q
Sbjct: 202 CECTLGWLQPQLDRVARDPTTIAVPTIDWI--DEHNLAFVSNKSLGYY----GATDWGFQ 255

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W    +R + +  N  EP  TP MAGGLFSI++ FF  LG YD GF+I+GGEN+ELS 
Sbjct: 256 FAWRGRWDR-KVQPANKLEPFPTPIMAGGLFSINRTFFGHLGWYDEGFEIYGGENVELSL 314

Query: 124 KF-----NWHAIPERERKRHKNAAEPVW----TPTMAGGLFSIDKAFFEKLGTYDSGFDI 174
           K          +P       + A  P      T  +      + + + ++       +D+
Sbjct: 315 KAWMCGGRIETVPCSRVGHVQKAGHPYLRVETTDWVRINTVRVAEVWLDQYA--QVVYDM 372

Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSGMCIDSA 224
           +GG      F+G+FGDV+SRK+LR +L C SF+WYL          E      G  I+ A
Sbjct: 373 FGGPQ----FRGNFGDVSSRKKLRESLKCHSFRWYLDNVFPELDDPEGRGVGHGEVINLA 428

Query: 225 CKPTD-MHKPV-----GLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGS 278
              T  +  P      GL  C+     Q W+ +  GE+  +  C+D+ G  + +Y CH  
Sbjct: 429 AGATRCLQYPTAEGTFGLERCNGD-SRQHWVYNMLGELSTNNTCVDFTGTALAMYKCHKM 487

Query: 279 KGNQYFEY 286
           +GNQ + Y
Sbjct: 488 RGNQEWRY 495


>gi|332227139|ref|XP_003262748.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           1 [Nomascus leucogenys]
          Length = 552

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 420

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 421 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 480

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 481 TLFPGAPVVLVLCKNGDDRQ 500


>gi|449268007|gb|EMC78887.1| Polypeptide N-acetylgalactosaminyltransferase 14, partial [Columba
           livia]
          Length = 514

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 104/322 (32%), Positives = 142/322 (44%), Gaps = 65/322 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WL PLL  +  + + VVSP+I  I  DTF          ++     GGFDW+L 
Sbjct: 163 CEVNKDWLLPLLQRIKEDPTRVVSPVIDIINLDTFAY-------VAASSDLRGGFDWSLH 215

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ +  +  EP+ TP +AGGLF IDKA+F  LG YDS  DIWGGEN E+SF
Sbjct: 216 FKWEQLSPEQKAKRLDPTEPIKTPIIAGGLFMIDKAWFNHLGKYDSAMDIWGGENFEISF 275

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   IP        RK+H     P   P      +         +       +
Sbjct: 276 RVWMCGGSLEIIPCSRVGHVFRKKH-----PYVFPEGNANTY---------IKNTKRTAE 321

Query: 174 IWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYL----------EVSNDW 216
           +W  E     +          +G+V SR ELR+ L C SFKWYL          E S   
Sbjct: 322 VWMDEFKRYYYAARPAAQGRPYGNVQSRVELRKRLKCHSFKWYLENVYPELRIPEESLYQ 381

Query: 217 SGM------CIDSACKPTDMHKPV-GLYPCHKQGGN-----QFWMMSKHGEIRRDEACLD 264
           +GM      C++S  K  D   PV  L PC    G      Q W  + + ++R+ + CL 
Sbjct: 382 TGMIRQRQSCLESH-KSEDQEFPVLSLNPCTGSKGTTAATAQEWTYTYNHQVRQQQLCLS 440

Query: 265 ----YAGGDVILYPCHGSKGNQ 282
               + G  V+L PC      Q
Sbjct: 441 VYTLFPGSQVLLSPCKEGDNKQ 462


>gi|297265736|ref|XP_002799240.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Macaca
           mulatta]
          Length = 517

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 167 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 219

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 220 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 279

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 280 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 325

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 326 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 385

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 386 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 445

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 446 TLFPGAPVVLVLCKNGDDRQ 465


>gi|397513817|ref|XP_003827204.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           3 [Pan paniscus]
          Length = 517

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 167 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 219

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 220 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 279

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 280 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 325

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 326 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 385

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 386 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 445

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 446 TLFPGAPVVLVLCKNGDDRQ 465


>gi|109102562|ref|XP_001105195.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           5 [Macaca mulatta]
          Length = 552

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 420

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 421 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 480

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 481 TLFPGAPVVLVLCKNGDDRQ 500


>gi|397513815|ref|XP_003827203.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           2 [Pan paniscus]
          Length = 532

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 182 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 234

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 235 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 294

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 295 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 340

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 341 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 400

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 401 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 460

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 461 TLFPGAPVVLVLCKNGDDRQ 480


>gi|332227141|ref|XP_003262749.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           2 [Nomascus leucogenys]
          Length = 532

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 182 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 234

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 235 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 294

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 295 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 340

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 341 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 400

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 401 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 460

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 461 TLFPGAPVVLVLCKNGDDRQ 480


>gi|397513813|ref|XP_003827202.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           1 [Pan paniscus]
          Length = 552

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 420

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 421 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 480

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 481 TLFPGAPVVLVLCKNGDDRQ 500


>gi|297265738|ref|XP_001104879.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           2 [Macaca mulatta]
          Length = 532

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 182 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 234

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 235 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 294

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 295 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 340

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 341 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 400

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 401 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 460

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 461 TLFPGAPVVLVLCKNGDDRQ 480


>gi|62630154|gb|AAX88899.1| unknown [Homo sapiens]
          Length = 452

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 102 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 154

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 155 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 214

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 215 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 260

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 261 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKESSIQ 320

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 321 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 380

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 381 TLFPGAPVVLVLCKNGDDRQ 400


>gi|359465585|ref|NP_001240756.1| polypeptide N-acetylgalactosaminyltransferase 14 isoform 3 [Homo
           sapiens]
 gi|119620894|gb|EAX00489.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
           isoform CRA_d [Homo sapiens]
 gi|193783719|dbj|BAG53701.1| unnamed protein product [Homo sapiens]
          Length = 532

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 182 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 234

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 235 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 294

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 295 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 340

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 341 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKESSIQ 400

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 401 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 460

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 461 TLFPGAPVVLVLCKNGDDRQ 480


>gi|355565588|gb|EHH22017.1| hypothetical protein EGK_05198 [Macaca mulatta]
          Length = 557

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 96/315 (30%), Positives = 138/315 (43%), Gaps = 52/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 207 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 259

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 260 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 319

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +      +   +P            P   P      +         +       ++W  E
Sbjct: 320 RVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTY---------IKNTKRTAEVWMDE 370

Query: 179 NLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS------- 217
             +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S       
Sbjct: 371 YKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQKGNIR 430

Query: 218 --GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAG 267
               C++S  +       + L PC K  G    +Q W  +   +I ++E CL     + G
Sbjct: 431 QRQKCLESQRQNNQETANLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVITLFPG 490

Query: 268 GDVILYPCHGSKGNQ 282
             V+L  C      Q
Sbjct: 491 APVVLVLCKNGDDRQ 505


>gi|109102570|ref|XP_001104659.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           1 [Macaca mulatta]
          Length = 557

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 96/315 (30%), Positives = 138/315 (43%), Gaps = 52/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 207 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 259

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 260 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 319

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +      +   +P            P   P      +         +       ++W  E
Sbjct: 320 RVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTY---------IKNTKRTAEVWMDE 370

Query: 179 NLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS------- 217
             +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S       
Sbjct: 371 YKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQKGNIR 430

Query: 218 --GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAG 267
               C++S  +       + L PC K  G    +Q W  +   +I ++E CL     + G
Sbjct: 431 QRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVITLFPG 490

Query: 268 GDVILYPCHGSKGNQ 282
             V+L  C      Q
Sbjct: 491 APVVLVLCKNGDDRQ 505


>gi|359465583|ref|NP_001240755.1| polypeptide N-acetylgalactosaminyltransferase 14 isoform 2 [Homo
           sapiens]
 gi|10434341|dbj|BAB14227.1| unnamed protein product [Homo sapiens]
 gi|119620892|gb|EAX00487.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
           isoform CRA_b [Homo sapiens]
          Length = 557

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 96/315 (30%), Positives = 138/315 (43%), Gaps = 52/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 207 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 259

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 260 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 319

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +      +   +P            P   P      +         +       ++W  E
Sbjct: 320 RVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTY---------IKNTKRTAEVWMDE 370

Query: 179 NLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS------- 217
             +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S       
Sbjct: 371 YKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKESSIQKGNIR 430

Query: 218 --GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAG 267
               C++S  +       + L PC K  G    +Q W  +   +I ++E CL     + G
Sbjct: 431 QRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVITLFPG 490

Query: 268 GDVILYPCHGSKGNQ 282
             V+L  C      Q
Sbjct: 491 APVVLVLCKNGDDRQ 505


>gi|71896287|ref|NP_001025547.1| polypeptide N-acetylgalactosaminyltransferase 1 [Xenopus (Silurana)
           tropicalis]
 gi|60649677|gb|AAH90583.1| galnt1 protein [Xenopus (Silurana) tropicalis]
          Length = 452

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 86/219 (39%), Positives = 113/219 (51%), Gaps = 25/219 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R + +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRRGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE 211
               +       K D+GD+++R  LR  L CK F WYLE
Sbjct: 378 KNFFYIISPGVTKVDYGDISTRVGLRHKLQCKPFSWYLE 416


>gi|441661684|ref|XP_004091530.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
           [Nomascus leucogenys]
          Length = 535

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 185 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 237

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 238 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 297

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 298 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 343

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 344 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 403

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 404 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 463

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 464 TLFPGAPVVLVLCKNGDDRQ 483


>gi|119620893|gb|EAX00488.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
           isoform CRA_c [Homo sapiens]
          Length = 519

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 169 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 221

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 222 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 281

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 282 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 327

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 328 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKESSIQ 387

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 388 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 447

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 448 TLFPGAPVVLVLCKNGDDRQ 467


>gi|397513819|ref|XP_003827205.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           4 [Pan paniscus]
          Length = 557

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 96/315 (30%), Positives = 138/315 (43%), Gaps = 52/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 207 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 259

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 260 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 319

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +      +   +P            P   P      +         +       ++W  E
Sbjct: 320 RVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTY---------IKNTKRTAEVWMDE 370

Query: 179 NLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS------- 217
             +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S       
Sbjct: 371 YKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQKGNIR 430

Query: 218 --GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAG 267
               C++S  +       + L PC K  G    +Q W  +   +I ++E CL     + G
Sbjct: 431 QRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVITLFPG 490

Query: 268 GDVILYPCHGSKGNQ 282
             V+L  C      Q
Sbjct: 491 APVVLVLCKNGDDRQ 505


>gi|296224175|ref|XP_002757934.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
           [Callithrix jacchus]
          Length = 552

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 420

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 421 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 480

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 481 TLFPGAPVVLALCKNGDDRQ 500


>gi|403307061|ref|XP_003944030.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
           [Saimiri boliviensis boliviensis]
          Length = 552

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 143/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 255 FHWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 420

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 421 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 480

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 481 TLFPGAPVVLALCKNGDDRQ 500


>gi|355751232|gb|EHH55487.1| hypothetical protein EGM_04701, partial [Macaca fascicularis]
          Length = 516

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 96/315 (30%), Positives = 138/315 (43%), Gaps = 52/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 166 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 218

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 219 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 278

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +      +   +P            P   P      +         +       ++W  E
Sbjct: 279 RVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTY---------IKNTKRTAEVWMDE 329

Query: 179 NLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS------- 217
             +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S       
Sbjct: 330 YKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQKGNIR 389

Query: 218 --GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAG 267
               C++S  +       + L PC K  G    +Q W  +   +I ++E CL     + G
Sbjct: 390 QRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVITLFPG 449

Query: 268 GDVILYPCHGSKGNQ 282
             V+L  C      Q
Sbjct: 450 APVVLVLCKNGDDRQ 464


>gi|221042368|dbj|BAH12861.1| unnamed protein product [Homo sapiens]
          Length = 517

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 167 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 219

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 220 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 279

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 280 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 325

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 326 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKESSIQ 385

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 386 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQKILQEELCLSVI 445

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 446 TLFPGAPVVLVLCKNGDDRQ 465


>gi|113931290|ref|NP_001039091.1| polypeptide N-acetylgalactosaminyltransferase-like 1 [Xenopus
           (Silurana) tropicalis]
 gi|89268082|emb|CAJ83416.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Xenopus
           (Silurana) tropicalis]
 gi|111305589|gb|AAI21348.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Xenopus
           (Silurana) tropicalis]
 gi|134026192|gb|AAI35810.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Xenopus
           (Silurana) tropicalis]
          Length = 562

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 98/319 (30%), Positives = 141/319 (44%), Gaps = 62/319 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  + + VVSP+I  I  D F        L        GGFDW+L 
Sbjct: 221 CEVNNEWLQPLLQRVKDDHTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 273

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +    + TP +AGG+F IDK++F +LG YD+  DIWGGEN ELSF
Sbjct: 274 FKWEQIPIEQKMSRTDPTSSIRTPVIAGGIFVIDKSWFNQLGKYDTQMDIWGGENFELSF 333

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P         D      +       +
Sbjct: 334 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYEFP---------DGNALTYIKNTKRTVE 379

Query: 174 IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-------------VS 213
           +W  E  +  ++         +G V  R ELR+ L CKSF+WYL+             +S
Sbjct: 380 VWMDEYKQYYYQARPSAIGKSYGSVADRVELRKKLSCKSFQWYLQNVYPELKIPEKEVIS 439

Query: 214 N--DWSGMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA- 266
                 G C++S  + T  + PV L  C     +    Q W +S++  I++ + CL  + 
Sbjct: 440 GLIKQGGNCMESQTRDTTGNIPVMLTQCKGSANSAPAAQEWALSENV-IKQQDRCLTISS 498

Query: 267 ---GGDVILYPCHGSKGNQ 282
              G  V+L PC+     Q
Sbjct: 499 FSTGALVMLEPCNQKDSRQ 517


>gi|426223372|ref|XP_004005849.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Ovis
           aries]
          Length = 552

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 100/315 (31%), Positives = 145/315 (46%), Gaps = 49/315 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFNY-------IESASELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF +DK++F  LG YD+  DIWGGEN E+SF
Sbjct: 255 FQWEQLTPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFYYLGKYDTDMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   IP        RK+H     P   P   G   +  K        +   + 
Sbjct: 315 RVWMCGGSLEIIPCSRVGHVFRKKH-----PYVFPD--GNANTYIKNTKRTAEVWMDEYK 367

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS--------- 217
            +   +   + +  FG++ SR  LR+NL C+SFKWYLE       V  D S         
Sbjct: 368 QYYYASRPFALERPFGNIESRLNLRKNLQCQSFKWYLENVYPELRVPKDSSIHKGSIRQR 427

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAGGD 269
             C+++  +       + L PC K  G    +Q W  +   +I ++E CL     + G  
Sbjct: 428 QKCLEAQKQKDQEISSLKLSPCVKTEGKDAKSQIWAFTYTQQILQEELCLSVITLFPGAP 487

Query: 270 VILYPC-HGSKGNQY 283
           V+L  C +G K  Q+
Sbjct: 488 VVLVLCKNGDKRQQW 502


>gi|291386971|ref|XP_002709979.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
           [Oryctolagus cuniculus]
          Length = 551

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 102/313 (32%), Positives = 138/313 (44%), Gaps = 48/313 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 201 CEVNKDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDWSLH 253

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD+  DIWGGEN E+SF
Sbjct: 254 FHWEQLSPEQKARRLDPTEPIRTPVIAGGLFVIDKAWFDYLGKYDTDMDIWGGENFEISF 313

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   IP        RK+H  A     T T         + + +    Y     
Sbjct: 314 RVWMCRGSLEIIPCSRVGHVFRKKHPYAFPNGNTNTYIKNTKRTAEVWMDDYKQYYYAAR 373

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWSGM------- 219
            +  E         FG++ SR  LR NL C+ FKWYLE       +  D S +       
Sbjct: 374 PFALER-------PFGNIRSRVMLRANLQCQDFKWYLENVYPELRIPKDSSILKGSIRQR 426

Query: 220 --CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLD----YAGGD 269
             C+ S  +       + L PC K  G     Q W  +   +I ++E CL     + G  
Sbjct: 427 HKCLASQKQNNQGSPNLKLRPCVKFKGEESKAQVWAFTYTQQIIQEELCLSVVTLFPGAP 486

Query: 270 VILYPCHGSKGNQ 282
           VIL  C      Q
Sbjct: 487 VILAVCKNGDEKQ 499


>gi|307198758|gb|EFN79561.1| Polypeptide N-acetylgalactosaminyltransferase 35A [Harpegnathos
           saltator]
          Length = 606

 Score =  137 bits (344), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 93/302 (30%), Positives = 144/302 (47%), Gaps = 38/302 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV + W++PLL  +A + + V  P+I  I  DTF+    P           GGF+W L 
Sbjct: 234 IEVNEVWIEPLLSRIAHSKTIVAMPVIDIINADTFQYTGSP--------LVRGGFNWGLH 285

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P    K+  +  +P+ +PTMAGGLF+ID+ +F K+G YD+G D+WGGENLE+SF
Sbjct: 286 FKWDNLPIGTLKQEDDFVKPIKSPTMAGGLFAIDREYFTKIGEYDTGMDVWGGENLEISF 345

Query: 124 KF-----NWHAIPER------ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           +      N   IP         R+R   + +P    TM      +   + ++   Y    
Sbjct: 346 RIWMCGGNIELIPCSRVGHVFRRRRPYGSDDP--QDTMLKNSLRVAHVWLDEYKDY---- 399

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDM-- 230
                  L    K DFGD++ R+ LR+ L CK+F WYL+V      +  D+  +  D   
Sbjct: 400 ------FLRNVRKIDFGDISERQALRQRLKCKTFGWYLKVVYPELTLPDDTERRLKDKWS 453

Query: 231 ---HKPVGLYPCHKQG-GNQFWM-MSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
               +PV  +   K+   +Q+ + +S      + E  +   G  +IL PC   K   ++E
Sbjct: 454 KLDQRPVQPWHSRKRNYTDQYQIRLSNSALCIQSEKDIKTKGSRLILMPCLRIKSQMWYE 513

Query: 286 YD 287
            D
Sbjct: 514 TD 515


>gi|402890489|ref|XP_003908519.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
           [Papio anubis]
          Length = 551

 Score =  137 bits (344), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 99/320 (30%), Positives = 142/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 201 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 253

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 254 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 313

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 314 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 359

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG+V SR +LR+NL C+SFKWYLE       +  + S  
Sbjct: 360 VWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQ 419

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 420 KGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVI 479

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 480 TLFPGAPVVLVLCKNGDDRQ 499


>gi|363731300|ref|XP_419370.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Gallus
           gallus]
          Length = 552

 Score =  137 bits (344), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 100/321 (31%), Positives = 145/321 (45%), Gaps = 64/321 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WL PLL  +  + + VVSP+I  I  DTF          ++     GGFDW+L 
Sbjct: 202 CEVNKDWLLPLLQRIKEDPTRVVSPVIDIINLDTFAY-------VAASSDLRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ +  +  +P+ TP +AGGLF IDKA+F  LG YD+  DIWGGEN E+SF
Sbjct: 255 FKWEQLSPEQKAKRLDPTKPIKTPIIAGGLFVIDKAWFNHLGKYDNAMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   IP        RK+H     P   P      +         +       +
Sbjct: 315 RVWMCGGSLEIIPCSRVGHVFRKKH-----PYVFPEGNANTY---------IKNTKRTAE 360

Query: 174 IWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-------VSNDW--- 216
           +W  E  +  +          +G++ SR ELR+ L C SFKWYLE       +  +    
Sbjct: 361 VWMDEFKQYYYAARPAAQGRPYGNIQSRVELRKRLKCHSFKWYLENVYPELRIPEELLYQ 420

Query: 217 SGM------CIDSACKPTDMHKPV-GLYPCHKQGGN----QFWMMSKHGEIRRDEACLD- 264
           +GM      C++S  K  D   P+  L PC    G     Q W  + + ++R+ + CL  
Sbjct: 421 TGMIRQRQSCLESH-KSEDQELPILSLNPCITSKGTSATAQEWTYTYNHQVRQQQLCLSV 479

Query: 265 ---YAGGDVILYPCHGSKGNQ 282
              + G  V+L PC  S   Q
Sbjct: 480 YTLFPGSPVLLSPCKESDNKQ 500


>gi|194210168|ref|XP_001915003.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Equus
           caballus]
          Length = 609

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 106/351 (30%), Positives = 148/351 (42%), Gaps = 96/351 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL V+  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 249 CEVNVMWLQPLLAVIQEDRRMVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 300

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++ + +F +LG YDSG DIWGGENLE+SF
Sbjct: 301 FKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMSRRYFSELGQYDSGMDIWGGENLEISF 360

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 361 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 397

Query: 177 GENLELSF------------------KGDFGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L++                     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 398 HNSLRLAYVWLDEYKEQYFSLRPDLRTKSYGNISERVELRKKLGCKSFKWYLDNIYPEMQ 457

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 458 ISGPNAKPQQPIFINRGPKRPKVLQRGRLCHLQTNKCLVAQSRPSQKGSLVVLKACDYGD 517

Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEYDYK 289
            NQ W+ + +H  +  +  CLD     +     L  CHGS G+Q + +  K
Sbjct: 518 PNQVWIYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTFGKK 568


>gi|440907821|gb|ELR57918.1| Polypeptide N-acetylgalactosaminyltransferase 14, partial [Bos
           grunniens mutus]
          Length = 509

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 99/315 (31%), Positives = 145/315 (46%), Gaps = 49/315 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 159 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFNY-------IESASELRGGFDWSLH 211

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF +DK++F  LG YD+  DIWGGEN E+SF
Sbjct: 212 FQWEQLTPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFYYLGKYDTDMDIWGGENFEISF 271

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P   G   +  K        +   + 
Sbjct: 272 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYIFPD--GNANTYIKNTKRTAEVWMDEYK 324

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS--------- 217
            +   +   + +  FG++ SR  LR+NL C+SFKWYLE       V  D S         
Sbjct: 325 QYYYASRPFALERPFGNIESRLNLRKNLQCQSFKWYLENVYPELRVPKDSSIHKGSIRQR 384

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAGGD 269
             C+++  +       + L PC K  G    +Q W  +   +I ++E CL     + G  
Sbjct: 385 QKCLEAQKQKDQEISNLKLSPCVKTEGKDAKSQIWAFTYTQQILQEELCLSVITLFPGAP 444

Query: 270 VILYPC-HGSKGNQY 283
           V+L  C +G K  Q+
Sbjct: 445 VVLVLCKNGDKRQQW 459


>gi|268572569|ref|XP_002641355.1| C. briggsae CBR-GLY-9 protein [Caenorhabditis briggsae]
          Length = 579

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 93/316 (29%), Positives = 145/316 (45%), Gaps = 51/316 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+P++  ++   + +V P+I +I D T             +   +GGF W L 
Sbjct: 230 CEANHGWLEPIVQRISDERTAIVCPMIDSISDSTLAYH-------GDWSLSVGGFSWALH 282

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P+ E KR     + + +PTMAGGL + ++ +F ++G YD   DIWGGENLE+SF
Sbjct: 283 FTWEGLPDEELKRRTKVTDYIRSPTMAGGLLAANREYFFEVGGYDEEMDIWGGENLEISF 342

Query: 124 KFNWHA------IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           + NW        IP         A  P +  T       +     ++L       ++W  
Sbjct: 343 R-NWMCGGSIEFIPCSHVGHIFRAGHP-YNMTGRNNNKDVHGTNSKRLA------EVWMD 394

Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE------------------V 212
           +   L +         D GD+T+R ELR+ L CKSFKW+L+                  +
Sbjct: 395 DYKRLYYMHREDLRTKDVGDLTARHELRKRLNCKSFKWFLDNIAKGKFIMDEDVLAYGAL 454

Query: 213 SNDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGN-QFWMMSKHGEIRRDEACLDYAGGD 269
               SG  MC D+  +   M + +G++ C  +G + Q   +SK G +RR+  C     G+
Sbjct: 455 HTVVSGTRMCTDTLQRDEKMSQLLGVFHCQGKGSSPQLMSLSKEGYLRRENTCAAEENGN 514

Query: 270 VILYPCHGSKGNQYFE 285
           V +  C  SK  Q+ E
Sbjct: 515 VRMKAC--SKRAQFNE 528


>gi|221042448|dbj|BAH12901.1| unnamed protein product [Homo sapiens]
          Length = 527

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 107/348 (30%), Positives = 149/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 167 CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 218

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E  R + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 219 FKWDLVPLSELGRAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 278

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 279 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 315

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 316 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 375

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 376 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 435

Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ ++  E+  +   CLD     +     L  CHGS G+Q + +
Sbjct: 436 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 483


>gi|156375693|ref|XP_001630214.1| predicted protein [Nematostella vectensis]
 gi|156217230|gb|EDO38151.1| predicted protein [Nematostella vectensis]
          Length = 575

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 100/315 (31%), Positives = 143/315 (45%), Gaps = 46/315 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  K WL+PLL  +  +   +VSP+I  I  DTF+       L SS     GGF WNL 
Sbjct: 231 CECNKNWLEPLLLRIKESPKTIVSPIIDVINLDTFDY------LGSSADLR-GGFGWNLN 283

Query: 64  FNWHAIPER-ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +P     +R      P+ +P +AGGLFS+ K +FE LG YD   D+WGGENLE+S
Sbjct: 284 FKWDFLPPHILAERQGKPTLPIKSPVIAGGLFSVAKKWFETLGKYDMQMDVWGGENLEIS 343

Query: 123 FKFNWHA------IPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           F+  W        IP        R RH     P   P   G +    K     +  +   
Sbjct: 344 FR-TWQCGGAMEIIPCSRVGHVFRNRH-----PYQFP--GGSMNVFQKNTRRAVEVWMDD 395

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------EVSNDWSGM 219
           +  +    +  +    +GD+  R ELRR L C+ FKWY+            E +  +  +
Sbjct: 396 YKRYYYAAVPYAKNTPYGDIEERVELRRKLRCRPFKWYVQNVYPELKLPSDESTKSFGEI 455

Query: 220 CIDSACKPTDMH---KPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD----VIL 272
              + C  T  H   + +GL+ CH  GGNQ W ++K   ++ +  CL    G     V L
Sbjct: 456 KQGNQCVDTLGHMRGQTIGLFECHGAGGNQMWSLTKSSLLKHETMCLGVNDGKATEPVQL 515

Query: 273 YPCHGSKGNQYFEYD 287
             C  +   Q++EY+
Sbjct: 516 LDCDENNSMQHWEYE 530


>gi|351702714|gb|EHB05633.1| Polypeptide N-acetylgalactosaminyltransferase 14 [Heterocephalus
           glaber]
          Length = 553

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 100/320 (31%), Positives = 140/320 (43%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL+PLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 203 CEVNRDWLEPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 255

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 256 FRWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 315

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 316 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 361

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG++ SR  LRRNL C+SFKWYLE       V  D S  
Sbjct: 362 VWMDEYKQYYYAARPFALERPFGNIESRLNLRRNLQCQSFKWYLENVYPELSVPQDSSIQ 421

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G     Q W  +   +I ++E CL   
Sbjct: 422 KGNIRQRQKCLESQKQNNQEIPNLRLSPCVKLKGEEAKAQGWAFTYTQQIIQEELCLSVV 481

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 482 TLFPGAPVVLVLCKNGDERQ 501


>gi|193784963|dbj|BAG54116.1| unnamed protein product [Homo sapiens]
          Length = 608

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 107/348 (30%), Positives = 149/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E  R + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGRAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 457 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516

Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ ++  E+  +   CLD     +     L  CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|340727930|ref|XP_003402286.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
           [Bombus terrestris]
          Length = 643

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 91/291 (31%), Positives = 135/291 (46%), Gaps = 16/291 (5%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV KRW++PLL  +A++ + V  P+I  I  DTF+    P           GGF+W L 
Sbjct: 271 IEVNKRWIEPLLSQIAQSKTIVAMPIIDIINPDTFQYTGSP--------LVRGGFNWGLH 322

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P       ++  +P+ +PTMAGGLF++D+ +F KLG YD+G DIWGGENLE+SF
Sbjct: 323 FKWDNVPVGTFAHDEDFIKPIKSPTMAGGLFAMDRKYFTKLGEYDAGMDIWGGENLEISF 382

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
           +  W      E                 G     D      L       D +    L+  
Sbjct: 383 RI-WMCGGSIELIPCSRVGHVFRRRRPYGTFDQHDTMLKNSLRVAHVWLDEYKDYFLKNV 441

Query: 184 FKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDM-----HKPVGLYP 238
            K D+GD++ R  LR+ L CK+F WYL V      +  D+  +  D       KP+  + 
Sbjct: 442 QKVDYGDISERLNLRKRLKCKNFAWYLNVVYPELALPDDNKNRLKDKWAKIEQKPIQPWH 501

Query: 239 CHKQG-GNQFWM-MSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEYD 287
             K+   +Q+ + +S      + E  +   G  +IL PC   K   ++E D
Sbjct: 502 SRKRNYTDQYQIRLSNSALCIQSEKDIKTKGSKLILAPCLRIKSQMWYETD 552


>gi|153792095|ref|NP_071370.2| polypeptide N-acetylgalactosaminyltransferase 11 [Homo sapiens]
 gi|51316030|sp|Q8NCW6.2|GLT11_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
           AltName: Full=Polypeptide GalNAc transferase 11;
           Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 11;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 11
 gi|5630076|gb|AAD45821.1|AC006017_1 N-acetylgalactosaminyltransferase; similar to Q10473 (PID:g1709559)
           [Homo sapiens]
 gi|51105934|gb|EAL24518.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Homo
           sapiens]
 gi|119574361|gb|EAW53976.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
           isoform CRA_b [Homo sapiens]
 gi|189442406|gb|AAI67834.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
           [synthetic construct]
 gi|345500003|emb|CAC79625.3| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
           sapiens]
          Length = 608

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 107/348 (30%), Positives = 149/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E  R + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGRAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 457 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516

Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ ++  E+  +   CLD     +     L  CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|351712481|gb|EHB15400.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Heterocephalus
           glaber]
          Length = 399

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 87/219 (39%), Positives = 112/219 (51%), Gaps = 25/219 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  + ++   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 191 CECTVGWLEPLLTRIKQDRRTVVCPIIDVISDDTFEC-------MAGSDMTYGGFNWKLN 243

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGG FSID+ +F+++GTYD+G DIWG ENLE+S
Sbjct: 244 FRWYLVPQREMDRRKGDRTLPVRTPTMAGGCFSIDRDYFQEIGTYDAGMDIWGRENLEIS 303

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 304 FRI-WQCGGTLEIVTCSHVGHVFQKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 356

Query: 180 LELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE 211
               +       K D+GDV+SR  LR  L CK F WYLE
Sbjct: 357 KNFFYIISPGVTKVDYGDVSSRLGLRHKLQCKPFSWYLE 395


>gi|10437774|dbj|BAB15105.1| unnamed protein product [Homo sapiens]
          Length = 608

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 107/348 (30%), Positives = 149/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E  R + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGRAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 457 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516

Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ ++  E+  +   CLD     +     L  CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLVKCHGSGGSQQWTF 564


>gi|260793003|ref|XP_002591503.1| hypothetical protein BRAFLDRAFT_105269 [Branchiostoma floridae]
 gi|229276709|gb|EEN47514.1| hypothetical protein BRAFLDRAFT_105269 [Branchiostoma floridae]
          Length = 618

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 70/181 (38%), Positives = 105/181 (58%), Gaps = 24/181 (13%)

Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 182
             F W  IP+ ER R K+  +PV +PTMAGGLF+IDK +FE +GTYD+G D+WGGENLE+
Sbjct: 385 LTFTWGLIPDYERSRRKSPVDPVRSPTMAGGLFAIDKWYFEHIGTYDAGMDVWGGENLEM 444

Query: 183 SFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGMCIDSA 224
           SF+  +GDV++R +L+  L CK FKW+++                  + N  S +C DS 
Sbjct: 445 SFRERYGDVSARLDLKDKLHCKPFKWFMQTIMPDMYVPEDRPGRSGALRNSASNLCFDSE 504

Query: 225 CKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD----EACLDYAGGDVILYPCHGSKG 280
                  +P  ++ CH  GGNQ++ ++   E R +    E C++  GG+ ++   H + G
Sbjct: 505 GAENAGKRPT-MWGCHGMGGNQYFELNSREEFRHNTGGKEMCVEAQGGEFVVL-MHCTSG 562

Query: 281 N 281
           N
Sbjct: 563 N 563


>gi|391347961|ref|XP_003748222.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Metaseiulus occidentalis]
          Length = 658

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 89/294 (30%), Positives = 137/294 (46%), Gaps = 52/294 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + R+   VV P+I  I   T +     G      +F IGGF+W  +
Sbjct: 300 CETTPGWLEPLLEPIRRDRRAVVCPVIDVIDYRTLQYVAAEGD-----RFQIGGFNWRGE 354

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH IP   R+   + AEP+ +PTMAGGLF+I++ +F + G+YD   D WGGENLE+SF
Sbjct: 355 FTWHNIPSAWRRNRVSVAEPMRSPTMAGGLFAINREYFWESGSYDEEMDGWGGENLEMSF 414

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS-------GFDIWG 176
           +  W                 V  P    G    D   ++  G  D+         ++W 
Sbjct: 415 RI-WQC-----------GGHIVIAPCSHVGHIFRDYQPYKIPGGKDTNAINTKRAVEVWM 462

Query: 177 GENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
            E  +  ++          GD+++R+  R    CK FKWYL+                  
Sbjct: 463 DEFKKYIYQARPELKKIRIGDISARRAFRELNRCKPFKWYLDNVYPHKYLIEEDSQGFGI 522

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCH---KQGGNQFWMMSKHGEIRRDEAC 262
           V N  + MC+D+  K       +G++ CH   ++  NQ   +S+ GE+R+++ C
Sbjct: 523 VRNPLTNMCLDTYGKARGKTSDLGIFECHPIPEEATNQLLSLSRKGELRQEDLC 576


>gi|297682043|ref|XP_002818744.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11,
           partial [Pongo abelii]
          Length = 587

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 149/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E +  + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELRGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRSDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 457 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLCHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516

Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ ++  E+  +   CLD     +     L  CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|10438776|dbj|BAB15338.1| unnamed protein product [Homo sapiens]
          Length = 379

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 101/333 (30%), Positives = 147/333 (44%), Gaps = 66/333 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 19  CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 70

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E  R + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 71  FKWDLVPLSELGRAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 130

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RKR +    P    TM      +   + ++    +  F 
Sbjct: 131 RIWMCGGKLFIIPCSRVGHIFRKR-RPYGSPEGQDTMTHNSLRLAHVWLDEY--KEQYFS 187

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
           +      +L  K  +G+++ R ELR+ LGCKSFKWYL+                      
Sbjct: 188 L----RPDLKTK-SYGNISERVELRKKLGCKSFKWYLDNVYPEMQISGSHAKPQQPIFVN 242

Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRR 258
                        + +  +  C+ +  +P+     V L  C     NQ W+ ++  E+  
Sbjct: 243 RGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSDPNQIWIYNEEHELVL 302

Query: 259 DE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
           +   CLD     +     L  CHGS G+Q + +
Sbjct: 303 NSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 335


>gi|410909548|ref|XP_003968252.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Takifugu rubripes]
          Length = 580

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 99/328 (30%), Positives = 143/328 (43%), Gaps = 65/328 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL+PLL  +  +   VV P+I  I  DT  L + P  +        GGF+W L 
Sbjct: 221 CEVNQMWLEPLLASIHEDRRTVVCPVIDIISADT--LSYSPSPIVR------GGFNWGLH 272

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E K  K   +P+ +PTMAGGLF+I++ +F ++G YD+G DIWGGENLE+SF
Sbjct: 273 FKWDPVPPSELKSPKGPVDPIRSPTMAGGLFAINRKYFNEMGQYDAGMDIWGGENLEISF 332

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RKR +    P    TMA     +   + ++   Y   + 
Sbjct: 333 RIWMCGGQLLIIPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWMDE---YKEQYL 388

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
               E  E     D+GD++ R  LR  L C+SF+WYL+                      
Sbjct: 389 SMRPELRE----RDYGDISDRVALRERLQCRSFRWYLDNVYPEMQTVSNGNKHPPLFINK 444

Query: 212 ------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGE-IRR 258
                       + N  +  C+ +  + +     V L PC  Q   Q W   + G+ +  
Sbjct: 445 DLKRPKVLQRGRLHNRATNRCLVAQGRASQKGGAVVLRPCDPQDPEQEWAYDEEGQLVLA 504

Query: 259 DEACLDYAGGDVI----LYPCHGSKGNQ 282
              CLD +         L  CHGS G+Q
Sbjct: 505 GLLCLDVSEVRTFDPPRLMKCHGSGGSQ 532


>gi|270008661|gb|EFA05109.1| hypothetical protein TcasGA2_TC015209 [Tribolium castaneum]
          Length = 565

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 105/324 (32%), Positives = 150/324 (46%), Gaps = 74/324 (22%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    W++PLL  + +  + V+ P+I  I  +T  L +     TS   + +GGF W+  
Sbjct: 218 CEATTDWMEPLLSRIEQEPTAVLVPIIDVIEANT--LAYSTNGDTS---YQVGGFSWSGH 272

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  I + E  +HK    PV +PTMAGGLF+ID+ FF ++G+YD   D WGGENLE+SF
Sbjct: 273 FTWIDI-QNEEDKHK--LTPVKSPTMAGGLFAIDRKFFWEIGSYDEQMDGWGGENLEMSF 329

Query: 124 K----------------------FNWHAIPERERKRHKNAAE--PVWTPTMAGGLFSIDK 159
           +                      F+ ++ P+ +     N A    VW        F    
Sbjct: 330 RIWQCGGRLETVPCSRVGHIFRDFHPYSFPDNKDTHGINTARLAHVWMDDYKRFFFMYQP 389

Query: 160 AFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------- 211
           A                 EN  +      GD+T RK+LR+ L CKSFKWYLE        
Sbjct: 390 AL----------------ENNPV-----VGDLTHRKQLRQKLRCKSFKWYLENVYPEKFI 428

Query: 212 ----------VSNDWSGMCIDSACKPTDMHKPVGLYPCHK-QGGNQFWMMSKHGEIRRDE 260
                     V ND+ GMC+D      D   P+GLY CH     +Q++ ++  GE+R++ 
Sbjct: 429 PDENVYAHGQVQNDY-GMCLDDLQLGEDKIGPLGLYQCHPYLAMSQYFSLNFKGELRKEN 487

Query: 261 ACLDYAG-GDVILYPCHGSKGNQY 283
            C +  G  +V L  CHG K  Q+
Sbjct: 488 FCAETFGVREVQLTECHGHKREQF 511


>gi|348574564|ref|XP_003473060.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
           [Cavia porcellus]
          Length = 552

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 135/315 (42%), Gaps = 52/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 255 FRWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +      +   +P            P   P      +         +       ++W  E
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTY---------IKNTKRTAEVWMDE 365

Query: 179 NLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS------- 217
             +        + +  FG++ SR  LRRNL C SFKWYLE       V  D S       
Sbjct: 366 YKQYYYAARPFALERPFGNIESRLNLRRNLQCHSFKWYLENVYPELSVPQDSSIQKGNIR 425

Query: 218 --GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAG 267
               C++S          + L PC K  G    +Q W  +   +I ++E CL     + G
Sbjct: 426 QRQKCLESQKHNNQEIPNLRLSPCVKLKGEEAKSQGWAFTYTQQIIQEELCLSVITLFPG 485

Query: 268 GDVILYPCHGSKGNQ 282
             V+L  C      Q
Sbjct: 486 APVVLVLCKNGDERQ 500


>gi|345483668|ref|XP_001601037.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Nasonia vitripennis]
          Length = 587

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 98/288 (34%), Positives = 140/288 (48%), Gaps = 39/288 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
           CEV K+WL+PLL  +    + VV+P+I NI ++TFE        +    FF +GGF W+ 
Sbjct: 228 CEVTKQWLEPLLQRIKEKKNAVVTPIIDNISEETFEYSH-----SDEPSFFQVGGFTWSG 282

Query: 63  QFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
            F W  I E + K   +A  PV +PTMAGGLF+I++ +F  +G+YD   + WGGENLE+S
Sbjct: 283 HFTWINIQEADLKSKTSAISPVKSPTMAGGLFAINRKYFWDIGSYDDKMEGWGGENLEMS 342

Query: 123 FKFNWH------AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
           F+  W        IP            P   P M      I+ A    +   D    ++ 
Sbjct: 343 FRI-WQCGGVLETIPCSRVGHVFRNFLPYKFP-MDKDTHGINTARLANVWM-DDYKRLYY 399

Query: 177 GENLELSFKGDF-GDVTSRKELRRNLGCKSFKWYLE------------------VSNDWS 217
               E   K +  GD+  R  LR  L CKSFKWYL+                  V     
Sbjct: 400 LHREEYKDKPELIGDIKERVNLREKLKCKSFKWYLDNVYPEKFIPDENVQAFGRVQVQKG 459

Query: 218 GMCIDSACKPTDMHKP--VGLYPCHKQG-GNQFWMMSKHGEIRRDEAC 262
            +C+D+     D  KP  +G+Y CH Q   +Q++ +SK GE+RR++ C
Sbjct: 460 NLCLDNL--QNDEEKPYNLGVYECHSQLFPSQYFSLSKVGELRREDTC 505


>gi|432096766|gb|ELK27344.1| Polypeptide N-acetylgalactosaminyltransferase 14, partial [Myotis
           davidii]
          Length = 507

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 98/320 (30%), Positives = 141/320 (44%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 157 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFSY-------IESATELRGGFDWSLH 209

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ +  + +EP+ TP +AGGLF +DK++F  LG YD   DIWGGEN E+SF
Sbjct: 210 FQWEQLSPEQKAQRLDPSEPIRTPIIAGGLFVMDKSWFNFLGKYDMDMDIWGGENFEMSF 269

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 270 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 315

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FGD+ SR +LRR L C+SFKWYLE       V  D S  
Sbjct: 316 VWMDEYKQYFYAARPFALERPFGDIESRLDLRRKLRCQSFKWYLENVYPELRVPKDSSIQ 375

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 376 KGPIRQRQKCLESQRQKNQEVSNLKLRPCVKIKGEDAKSQIWAFTYTQQIIQEELCLSVI 435

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 436 TFFPGAPVVLVLCKNGDDKQ 455


>gi|300794826|ref|NP_001179661.1| polypeptide N-acetylgalactosaminyltransferase 14 [Bos taurus]
 gi|296482443|tpg|DAA24558.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 (GalNAc-T14) [Bos
           taurus]
          Length = 552

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 99/315 (31%), Positives = 144/315 (45%), Gaps = 49/315 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFNY-------IESASELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF +DK++F  LG YD   DIWGGEN E+SF
Sbjct: 255 FQWEQLTPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFYYLGKYDMDMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P   G   +  K        +   + 
Sbjct: 315 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYIFPD--GNANTYIKNTKRTAEVWMDEYK 367

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS--------- 217
            +   +   + +  FG++ SR  LR+NL C+SFKWYLE       V  D S         
Sbjct: 368 QYYYASRPFALERPFGNIESRLNLRKNLQCQSFKWYLENVYPELRVPKDSSIHKGSIRQR 427

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAGGD 269
             C+++  +       + L PC K  G    +Q W  +   +I ++E CL     + G  
Sbjct: 428 QKCLEAQKQKDQEISNLKLSPCVKTEGKDAKSQIWAFTYTQQILQEELCLSVITLFPGAP 487

Query: 270 VILYPC-HGSKGNQY 283
           V+L  C +G K  Q+
Sbjct: 488 VVLVLCKNGDKRQQW 502


>gi|344288741|ref|XP_003416105.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
           [Loxodonta africana]
          Length = 552

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 97/313 (30%), Positives = 140/313 (44%), Gaps = 48/313 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YDS  DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDSEMDIWGGENFEMSF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   IP        RK+H        T T         + + ++   Y     
Sbjct: 315 RVWMCGGSLEIIPCSRVGHVFRKKHPYIFPDGNTNTYIKNTKRTAEVWMDEYKQYYYAAR 374

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGM--------- 219
            +  E         FG++ +R  LR+NL C+SF+WYL     E+S     +         
Sbjct: 375 PFALER-------PFGNIENRLSLRKNLQCESFQWYLKNVYPELSIPKDSLIQKGNIRQR 427

Query: 220 --CIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD----YAGGD 269
             C+++  +       + L PC K  G    +Q W  +   +I ++E CL     + G  
Sbjct: 428 QKCLETQKRKNQEIPNLKLSPCIKIKGEEAKSQVWAFTYTQQILQEELCLSVITFFPGAP 487

Query: 270 VILYPCHGSKGNQ 282
           V+L  C      Q
Sbjct: 488 VVLVLCKNGDDRQ 500


>gi|349605004|gb|AEQ00388.1| Polypeptide N-acetylgalactosaminyltransferase 3-like protein,
           partial [Equus caballus]
          Length = 337

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 94/282 (33%), Positives = 136/282 (48%), Gaps = 38/282 (13%)

Query: 25  VVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERERKRHKNAAEPV 84
           VVSP IA+I  +TFE   P    ++  +   G FDW+L F W ++P+ ER+R K+   P+
Sbjct: 4   VVSPDIASIDMNTFEFNKPSPYRSNHNR---GNFDWSLSFGWESLPDHERQRRKDETYPI 60

Query: 85  WTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERK-------- 136
            TPT AGGLFSI K +FE +GTYD   +IWGGEN+E+SF+  W    + E          
Sbjct: 61  KTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFRV-WQCGGQLEIMPCSVVGHV 119

Query: 137 -RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSR 194
            R K+    P  T  +A     +    +  +  Y   F     +  ++  +  FGD++ R
Sbjct: 120 FRSKSPHSFPKGTQVIARNQVRLAGEVW--MDEYKEIFYRRNTDAAKIVKQKSFGDLSKR 177

Query: 195 KELRRNLGCKSFKWYLE------------------VSNDWSGMCIDSACKPTDMHKPVGL 236
             ++  L CK+F WYL                   + +    +C+D   +     KP+ L
Sbjct: 178 FAIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSFGQSLCLDVG-ENNQGGKPLIL 236

Query: 237 YPCHKQGGNQFWMMSKHGEIRRD---EACLDYAGGDVILYPC 275
           Y CH  GGNQ++  S   EIR +   E CL  A G V L  C
Sbjct: 237 YTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQLKAC 278



 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 45/103 (43%), Positives = 63/103 (61%), Gaps = 5/103 (4%)

Query: 85  WTPTMAGGLFSIDKAFFE--KLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAA 142
           +T  ++  + SID   FE  K   Y S  +     N + S  F W ++P+ ER+R K+  
Sbjct: 1   YTAVVSPDIASIDMNTFEFNKPSPYRSNHN---RGNFDWSLSFGWESLPDHERQRRKDET 57

Query: 143 EPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 185
            P+ TPT AGGLFSI K +FE +GTYD   +IWGGEN+E+SF+
Sbjct: 58  YPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFR 100


>gi|444727591|gb|ELW68073.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Tupaia chinensis]
          Length = 554

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 73/178 (41%), Positives = 95/178 (53%), Gaps = 40/178 (22%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 115 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 167

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK++FE+LG YD   D+WGGENL   
Sbjct: 168 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENL--- 224

Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENL 180
                                        GGLF +DK++FE+LG YD   D+WGGENL
Sbjct: 225 -----------------------------GGLFVMDKSYFEELGKYDMMMDVWGGENL 253



 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 52/182 (28%), Positives = 86/182 (47%), Gaps = 20/182 (10%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLT---SSYKFFIGGFDWN 61
           E +   L+ ++ VL ++  H++  +I  + DD    R    R+    ++    +   D +
Sbjct: 57  EARSALLRTVVSVLKKSPPHLIKEII--LVDDYSNDRLMRSRVRGADAAQAKVLTFLDSH 114

Query: 62  LQFNWH---AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 118
            + N H    + ER  +       P+           ID    +      +  D+ GG +
Sbjct: 115 CECNEHWLEPLLERVAEDRTRVVSPI-----------IDVINMDNFQYVGASADLKGGFD 163

Query: 119 LELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
             L FK+++   PE+ R R  N   P+ TP +AGGLF +DK++FE+LG YD   D+WGGE
Sbjct: 164 WNLVFKWDYMT-PEQRRSRQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGE 222

Query: 179 NL 180
           NL
Sbjct: 223 NL 224


>gi|357606408|gb|EHJ65055.1| putative UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           [Danaus plexippus]
          Length = 389

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 91/231 (39%), Positives = 120/231 (51%), Gaps = 58/231 (25%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYK-FFIGGFDWNL 62
           CE  + WL+PLL+ L  N   V SP+I +I  +TFE       ++ + K  +IGGF+WNL
Sbjct: 149 CECTEGWLEPLLERLVENPKIVASPVIDHIDPNTFEY------ISQNPKDIYIGGFNWNL 202

Query: 63  QFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           +F W +I   E KR +N   P+ TPT+AGGLF+IDK FF  +G YD GFD+WGGENLELS
Sbjct: 203 KFIWRSI---EYKR-ENFLLPIKTPTIAGGLFAIDKEFFYSIGYYDEGFDVWGGENLELS 258

Query: 123 FK-----------------------FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDK 159
           FK                       F ++   E  ++     AE VW    A       K
Sbjct: 259 FKVWMCGGSLEIVPCSHVGHIFRENFPYYTSGETFKRNAARLAE-VWLDDYA-------K 310

Query: 160 AFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL 210
            F+E++G  D                   GDVT++KELR+ L CKSF WYL
Sbjct: 311 IFYERIGNADVS----------------LGDVTAQKELRKKLKCKSFNWYL 345


>gi|189240187|ref|XP_975207.2| PREDICTED: similar to AGAP008229-PA [Tribolium castaneum]
          Length = 575

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 105/305 (34%), Positives = 146/305 (47%), Gaps = 49/305 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+ LL V+ ++ + VV P+I  I DDTF           S++   G F+WNLQ
Sbjct: 219 CECTTGWLEALLSVIKQDRTAVVCPVIDIINDDTFAY-------VKSFELHWGAFNWNLQ 271

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +  RE K  KN A +P  TPTMAGGLF+ID+ +F ++G YD G +IWGGENLE+S
Sbjct: 272 FRWFTLGGRELKLRKNDATQPFNTPTMAGGLFAIDREYFFEMGAYDDGMNIWGGENLEMS 331

Query: 123 FKFNWHA-----IPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG-FDIW 175
           F+  W       I    R  H    + P   P   GG   I+K  F  L        D W
Sbjct: 332 FRI-WQCGGKVQIAPCSRVGHLFRKSSPYSFP---GG---INKTLFSNLARVARVWMDDW 384

Query: 176 GGENLELSFKGDF----GDVTSRKELRRNLGCKSFKWYLE-----------------VSN 214
                + +   D      +VTSR ELRR   CK F+WYL+                 + N
Sbjct: 385 ARFYFKFNEPADRIKNEQNVTSRIELRRKHKCKGFEWYLDNVWPQHFFPKDDRFFGRIRN 444

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEA-CLDYAGGD 269
               MC+    K    ++P+G+       G+    + ++M+K G I  D++ CLD A   
Sbjct: 445 LGQNMCLIKPQKKVVSNQPMGIAKIDMCLGDEVILEMFVMTKEGFIMTDDSICLD-APEK 503

Query: 270 VILYP 274
           V++ P
Sbjct: 504 VVIGP 508


>gi|449270901|gb|EMC81545.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Columba livia]
          Length = 608

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 102/333 (30%), Positives = 146/333 (43%), Gaps = 66/333 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNEMWLQPLLTPIREDRRTVVCPVIDIISADTLTY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E +  + A  P+ +PTMAGGLF++D+ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELEGPEGATAPIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISF 359

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RKR +    P    TMA     +   +       D   +
Sbjct: 360 RIWMCGGRLLIIPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWM------DEYKE 412

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
            +     EL  + ++G++T R ELR+ L CKSFKWYL+                      
Sbjct: 413 QYFALRPELRMR-NYGNITDRVELRKRLNCKSFKWYLDNIYPEMQISGPNAKAPQPVFIN 471

Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK-HGEIR 257
                        + +  +  C+ +   P+     V +  C     NQ W+ ++ H  I 
Sbjct: 472 RAQKRPKIIQRGRLYHLQTNKCLVAQGHPSQKGGLVVVRECDYNDPNQVWIYNEDHELIL 531

Query: 258 RDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
            +  CLD     +     L  CHGS G+Q + +
Sbjct: 532 NNLLCLDVSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|270011650|gb|EFA08098.1| hypothetical protein TcasGA2_TC005702 [Tribolium castaneum]
          Length = 607

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 104/302 (34%), Positives = 143/302 (47%), Gaps = 48/302 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+ LL V+ ++ + VV P+I  I DDTF           S++   G F+WNLQ
Sbjct: 251 CECTTGWLEALLSVIKQDRTAVVCPVIDIINDDTFAY-------VKSFELHWGAFNWNLQ 303

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +  RE K  KN A +P  TPTMAGGLF+ID+ +F ++G YD G +IWGGENLE+S
Sbjct: 304 FRWFTLGGRELKLRKNDATQPFNTPTMAGGLFAIDREYFFEMGAYDDGMNIWGGENLEMS 363

Query: 123 FKFNWHA-----IPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG-FDIW 175
           F+  W       I    R  H    + P   P   GG   I+K  F  L        D W
Sbjct: 364 FRI-WQCGGKVQIAPCSRVGHLFRKSSPYSFP---GG---INKTLFSNLARVARVWMDDW 416

Query: 176 GGENLELSFKGDF----GDVTSRKELRRNLGCKSFKWYLE-----------------VSN 214
                + +   D      +VTSR ELRR   CK F+WYL+                 + N
Sbjct: 417 ARFYFKFNEPADRIKNEQNVTSRIELRRKHKCKGFEWYLDNVWPQHFFPKDDRFFGRIRN 476

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEA-CLDYAGGD 269
               MC+    K    ++P+G+       G+    + ++M+K G I  D++ CLD     
Sbjct: 477 LGQNMCLIKPQKKVVSNQPMGIAKIDMCLGDEVILEMFVMTKEGFIMTDDSICLDAPEKV 536

Query: 270 VI 271
           VI
Sbjct: 537 VI 538


>gi|241746527|ref|XP_002414286.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
 gi|215508140|gb|EEC17594.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
          Length = 493

 Score =  134 bits (338), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 90/276 (32%), Positives = 128/276 (46%), Gaps = 43/276 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL PLL  +  +   VV P+I  I  ++F       +   +     GGF+WNL 
Sbjct: 212 CECNQGWLPPLLRRVKEDPRRVVCPVIDVINLESF-------KYFGASSDLRGGFNWNLV 264

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +  +ER+ R  N   P+ TP +AGGLF +D+A FE+LG YD+  DIWGGENLELS
Sbjct: 265 FKWEFLSNKEREERANNPTLPIRTPMIAGGLFVVDRAQFERLGAYDTAMDIWGGENLELS 324

Query: 123 FKFNWHAIPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           F+  W      E           RK+H     P   P  +G +F+           +   
Sbjct: 325 FR-AWQCGGSLEILPCSRVGHVFRKQH-----PYSFPGGSGNVFARQANTRRAAEVWMDD 378

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE----------------VSND 215
           +  +    + ++     G V  R  LR++LGC SF+WYL+                 S  
Sbjct: 379 YKKYYYATVPVARNVPMGSVEERLNLRKSLGCHSFQWYLDNVYPELKVPAAGGERLASLR 438

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMS 251
              MC+D+         PVGL+ CH  GGNQ W ++
Sbjct: 439 QGQMCLDTLGGSEG--NPVGLFTCHGSGGNQQWSLA 472


>gi|345323153|ref|XP_001510349.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
           [Ornithorhynchus anatinus]
          Length = 479

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 94/320 (29%), Positives = 140/320 (43%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WL PLL  +  + + VVSP+I  I  DTF          ++     GGFDW+L 
Sbjct: 129 CEVNKDWLLPLLQRIKEDPTRVVSPVIDIINLDTFAY-------VAASSDLRGGFDWSLH 181

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ +  +  +P+ TP +AGGLF IDK++F  LG YD+  DIWGGEN E+SF
Sbjct: 182 FKWEQLSPEQKAKRTDPTQPIKTPIIAGGLFVIDKSWFNHLGKYDTAMDIWGGENFEISF 241

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          +P        RK+H     P   P      +         +       +
Sbjct: 242 RVWMCGGTLEIVPCSRVGHVFRKKH-----PYVFPEGNANTY---------IKNTKRTAE 287

Query: 174 IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------SNDW 216
           +W  E  +  +          +GD+ SR EL+++L C+ FKWYLE           S   
Sbjct: 288 VWMDEFKQYYYAARPAAQGRPYGDIQSRVELKKSLKCRPFKWYLETVYPELRIPEESLAQ 347

Query: 217 SGM------CIDSACKPTDMHKPVGLYPC----HKQGGNQFWMMSKHGEIRRDEACLD-- 264
           +G+      C++S          + L PC     +  G Q W  +   +IR+ + CL   
Sbjct: 348 TGIIRQRQKCLESQRLEGQEFPALILSPCITSKGEASGTQEWTYTFAQQIRQQQLCLSVH 407

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+  PC    G Q
Sbjct: 408 TLFPGSQVLFSPCKEEDGKQ 427


>gi|395539756|ref|XP_003771832.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 11 [Sarcophilus
           harrisii]
          Length = 970

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 103/329 (31%), Positives = 143/329 (43%), Gaps = 66/329 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WLQPLL  +  +   VV P+I  I  DT         + SS     GGF+W L 
Sbjct: 610 CEVNKMWLQPLLVPIHEDHRTVVCPVIDIISADTL--------MYSSSPIVRGGFNWGLH 661

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 662 FKWDLVPFSELGGPEGAIAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISF 721

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RKR +    P    TM      +   +       D   +
Sbjct: 722 RIWMCGGKLFIIPCSRVGHIFRKR-RPYGSPEGQDTMTHNSLRLAHVWL------DEYKE 774

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
            +     EL  K  +G+++ R ELR+ LGCKSFKWYL+                      
Sbjct: 775 QYFSLRPELKLK-SYGNISERVELRKKLGCKSFKWYLDNIYPEMQLSGPNAKPQQPVFIN 833

Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMS-KHGEIR 257
                        + +  +  C+ +   P+     V L  C     NQ W+ + +H  I 
Sbjct: 834 RGPKRPKILQRGRLYHLQTNKCLAAQGHPSQKGGLVVLKVCDYSDPNQVWIYNEEHELIL 893

Query: 258 RDEACLDY----AGGDVILYPCHGSKGNQ 282
            +  CLD     +     L  CHGS G+Q
Sbjct: 894 NNLLCLDMSETRSSDPPRLMKCHGSGGSQ 922



 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 81/218 (37%), Positives = 111/218 (50%), Gaps = 26/218 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WLQPLL  +  +   VV P+I  I  DT         + SS     GGF+W+L 
Sbjct: 248 CEVNKMWLQPLLVPIHEDHRTVVCPVIDIISADTL--------MYSSSPIVCGGFNWDLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  +    + A  P+ +P MAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPFSKLGGPEGAIAPIKSPAMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISF 359

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RKR +    P    TM      +   +       D   +
Sbjct: 360 RIWMCGGKLFIIPCSRVGHIFRKR-RPYGSPEGQDTMTNNSLRMAHVWL------DEYKE 412

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
            +     EL  K  +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 413 QYFSLRPELKLK-SYGNISERVELRKKLGCKSFKWYLD 449


>gi|291243602|ref|XP_002741690.1| PREDICTED: polypeptide GalNAc transferase 5-like [Saccoglossus
           kowalevskii]
          Length = 753

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 93/285 (32%), Positives = 135/285 (47%), Gaps = 36/285 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  + + VV+P +  I D TF   F     T      IGGF W + 
Sbjct: 397 CECNIGWLEPLLSEIVNDRTTVVAPNLDVISDKTFGYTFIKPEQT-----MIGGFGWLVD 451

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+++P+RER R  N  + P+ TPT+AGGLF+ID  +F ++G YD GFD WG ENLELS
Sbjct: 452 FKWYSLPKRERLRVNNDMSRPLRTPTIAGGLFAIDADYFHRIGLYDPGFDTWGAENLELS 511

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+          +P         ++ P           +I K     +  +      +  
Sbjct: 512 FRVWQCGGTLEIVPCSHVGHVFRSSIPYKYKDNKNPGLTIAKNNMRLMDVWMDDLKYFFL 571

Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSNDWSGMC 220
             L    + +FGD + RK+LR NL CK FKWYLE                 + +  SG C
Sbjct: 572 AILPHYAEQEFGDTSERKQLRSNLKCKDFKWYLENIYPENTMPMQYQILGHIKHVESGEC 631

Query: 221 IDSACKPTDMHKPVGLYPCHKQGG--NQFWMMSKHGEIRRDEACL 263
           ++ + K  D + P  + PC   GG  ++  M +K   ++ D  CL
Sbjct: 632 LEMSRK--DGNTP-AIQPC---GGHFDEVLMYTKQSNLQHDYLCL 670


>gi|443683118|gb|ELT87486.1| hypothetical protein CAPTEDRAFT_155466 [Capitella teleta]
          Length = 644

 Score =  134 bits (337), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 101/303 (33%), Positives = 141/303 (46%), Gaps = 43/303 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL PLL  +  + + +V PL+  I   TFE R     L        G FDWNLQ
Sbjct: 299 CECAEGWLPPLLLAIEADRTKIVCPLVDVIEFQTFEYRAAKEELH-------GAFDWNLQ 351

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +PE E KR  + A+ +  PT+ GGLF++D+ +F+++G+YDSG DIWG ENLELSF
Sbjct: 352 FIWKDLPEHEMKRRTSPADNIRAPTIIGGLFAVDRLYFKRIGSYDSGMDIWGSENLELSF 411

Query: 124 KFNWHA-----IPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEK----LGTYDSGFD 173
           +  W       I    R  H      P   P   GG  +I           L  Y   F 
Sbjct: 412 RV-WMCGGSLEISPCSRVGHVFRTRIPYGFPN--GGKRTIRNNAMRAAEVWLDDYKKFF- 467

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------EVSNDWSGMCIDS 223
            +  +N+         DV  R +LRR L CKSF+WYL          E  +++ G  I S
Sbjct: 468 -YASQNITRRLTT-VEDVVVRVDLRRKLKCKSFQWYLDNVIPEAVLPEDEDEYFGQ-IQS 524

Query: 224 ACKPT------DMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA-CLDYAGGDVILYPCH 276
              P+      D H  + L  C     +Q + ++    ++RD+  C D  G D+I   C 
Sbjct: 525 LASPSKCLEFKDNH--LTLSHCKSMKESQMFHLTNQQLLKRDDVTCFDVNGRDLITRDCE 582

Query: 277 GSK 279
            S+
Sbjct: 583 ISQ 585


>gi|118085566|ref|XP_418541.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Gallus
           gallus]
          Length = 608

 Score =  134 bits (337), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 102/329 (31%), Positives = 144/329 (43%), Gaps = 66/329 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNEMWLQPLLTPIKEDRRTVVCPVIDIISADTLTY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E +  + A  P+ +PTMAGGLF++D+ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELEGPEGATAPIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISF 359

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RKR +    P    TMA     +   +       D   +
Sbjct: 360 RIWMCGGRLLIIPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWM------DEYKE 412

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
            +     EL  + ++G++T R ELR+ L CKSFKWYL+                      
Sbjct: 413 QYFALRPELRTR-NYGNITDRVELRKRLNCKSFKWYLDNIYPEMQVSGPNAKAPQPVFIN 471

Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK-HGEIR 257
                        + +  +  C+ +   P+     V +  C     NQ W+ ++ H  I 
Sbjct: 472 RAQKRPKIIQRGRLYHLQTNKCLVAQGHPSQKGGLVVVRECDYNDQNQVWVYNEDHELIL 531

Query: 258 RDEACLDY----AGGDVILYPCHGSKGNQ 282
            +  CLD     +     L  CHGS G+Q
Sbjct: 532 NNLLCLDVSETRSSDPPRLMKCHGSGGSQ 560


>gi|157114760|ref|XP_001652408.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108883561|gb|EAT47786.1| AAEL001151-PA [Aedes aegypti]
          Length = 592

 Score =  134 bits (337), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 97/304 (31%), Positives = 144/304 (47%), Gaps = 35/304 (11%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           WL+ LLD +ARNS+ +  P I  I +    LR      T +   + G +DW+L F W   
Sbjct: 247 WLEALLDPVARNSTTIAIPTIDWIDEHDMHLR------TENAPSYYGAYDWDLNFGWWGR 300

Query: 70  PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW-- 127
             R  K  +N  EP  TP MAGGLF+I ++FFE+LG YD GFDI+G EN+ELS K +W  
Sbjct: 301 WSRINK-PENKMEPFETPAMAGGLFAITRSFFERLGWYDEGFDIYGIENIELSMK-SWIC 358

Query: 128 ----HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDS-GFDIWGGENLE 181
                 +P       +    P    T    + +      E  +  Y    FDI+G     
Sbjct: 359 GGKMVTVPCSRVAHIQKTGHPYLIQTKKDVVRANSLRLAEVWMDEYKQIIFDIYGLPRYP 418

Query: 182 LSFKGDFGDVTSRKELRRNLGCKSFKWYLEVS--------------NDWSGMCI--DSAC 225
           +    + GDV+ RK++R    CK+FK+Y++ +               +   M +  D+  
Sbjct: 419 VE---EIGDVSHRKQIREKAKCKTFKYYVQAAFPEMNNPMVEGAFHGEVKNMALGNDTCL 475

Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
           +       V +  C  Q   QFW  + + E+   + CLDY G  + +Y CH S+GNQ ++
Sbjct: 476 EYQLDTNTVRMATCDHQETGQFWAHNYYQELNSHKHCLDYTGDTMGVYGCHRSRGNQAWQ 535

Query: 286 YDYK 289
           Y  K
Sbjct: 536 YVKK 539


>gi|432097047|gb|ELK27545.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Myotis davidii]
          Length = 558

 Score =  134 bits (337), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 107/318 (33%), Positives = 147/318 (46%), Gaps = 71/318 (22%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 233 CEVNVMWLQPLLAAIREDRRTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 284

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E +  + A  P+ +PTMAGGLF++++++F +LG YDSG DIWGGENLE+SF
Sbjct: 285 FKWDLVPLSELEGPEGATAPIKSPTMAGGLFAMNRSYFSELGQYDSGMDIWGGENLEISF 344

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 345 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 381

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLEVSNDWSG 218
             +L L             S + D     +G+V+ R ELR+ LGCKSFKWYL+  + +  
Sbjct: 382 HNSLRLAHVWLDEYKEQYFSLRPDLRTRSYGNVSERVELRKKLGCKSFKWYLD--SIYPE 439

Query: 219 MCIDSA-CKPTDMHKPVGL-----YPCHKQGGNQFWMMSKHGEIRRDEACLDY----AGG 268
           M I     KP    +P+ +      P   Q G  +    +H  +  +  CLD     +  
Sbjct: 440 MQISGPNAKP---QQPIFINRGPKRPKILQRGRIWIYNEEHELVLSNLLCLDMSETRSSD 496

Query: 269 DVILYPCHGSKGNQYFEY 286
              L  CHGS G+Q + +
Sbjct: 497 PPRLMKCHGSGGSQQWTF 514


>gi|109068965|ref|XP_001105286.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           6 [Macaca mulatta]
 gi|355561195|gb|EHH17881.1| hypothetical protein EGK_14364 [Macaca mulatta]
          Length = 608

 Score =  134 bits (337), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGEAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNIYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 457 ISGPHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516

Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ ++  E+  +   CLD     +     L  CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|118403595|ref|NP_001072369.1| polypeptide N-acetylgalactosaminyltransferase 14 [Xenopus
           (Silurana) tropicalis]
 gi|111305707|gb|AAI21473.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 [Xenopus (Silurana)
           tropicalis]
          Length = 555

 Score =  134 bits (336), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 97/320 (30%), Positives = 139/320 (43%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WL PLL  +  + + VVSP+I  I  DTF          ++     GGFDW+L 
Sbjct: 203 CEVNKDWLPPLLHRIKEDPTRVVSPVIDIINLDTFAY-------IAASSDLRGGFDWSLH 255

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ +  +  EP+ TP +AGGLF I+K++F  LG YD+  DIWGGEN E+SF
Sbjct: 256 FKWEQLSAEQKAKRLDPTEPIKTPVIAGGLFVIEKSWFNHLGKYDTAMDIWGGENFEISF 315

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   IP        RK+H     P   P      +         +       +
Sbjct: 316 RVWMCGGSLEIIPCSRVGHVFRKKH-----PYVFPEGNANTY---------IKNTKRTAE 361

Query: 174 IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE----------VSNDW 216
           +W  E     +          +GD+  R  LRR L C+SFKWYLE           S   
Sbjct: 362 VWMDEFKNHYYAARPAAQGRPYGDIQKRLSLRRTLKCRSFKWYLENVYPELQIPAESLSK 421

Query: 217 SGM------CIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
           SG+      CI+S          + L PC    G    +Q W+ ++  +I +   C+   
Sbjct: 422 SGIIRQRQRCIESQKTEGPEPPSLNLVPCSSLKGVSPQSQEWVYTQVQQISQGPLCMSVH 481

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L PC    G Q
Sbjct: 482 TLFPGTQVVLLPCREGDGKQ 501


>gi|380786043|gb|AFE64897.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
 gi|383411811|gb|AFH29119.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
 gi|384942402|gb|AFI34806.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
          Length = 608

 Score =  134 bits (336), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGEAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNIYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 457 ISGPHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516

Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ ++  E+  +   CLD     +     L  CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|345326650|ref|XP_003431069.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 4-like
           [Ornithorhynchus anatinus]
          Length = 580

 Score =  134 bits (336), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 100/283 (35%), Positives = 136/283 (48%), Gaps = 46/283 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + RN + VV P+I  I  +TFE     G      +  IGGFDW L 
Sbjct: 232 CECGPGWLEPLLERIGRNETAVVCPVIDTIDWNTFEFYMQTG------EPMIGGFDWRLT 285

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +PERER+R ++  +P+ +PTMAGGLF++ K +FE LGTYD G ++WGGENLELSF
Sbjct: 286 FQWQTVPERERRRRRSRIDPIPSPTMAGGLFAVGKKYFEYLGTYDMGMEVWGGENLELSF 345

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWGGENLE 181
           +  W                 +   +  G +F     +     L       ++W     E
Sbjct: 346 RV-WQC----------GGTLEILPCSHVGHVFPKRAPYARPSFLRNTARAAEVWMDGYKE 394

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE-------VSND---WSGMC---- 220
             +       K  + D++ R  LR  L C+SF W  E       V  D   W G      
Sbjct: 395 HFYNRNPPARKESYWDLSERTSLREXLNCRSFDWLPENVLPRIHVPEDRPGWHGAVRSAG 454

Query: 221 IDSACKPTDM--HKPVG----LYPCHKQGGNQFWMMSKHGEIR 257
           I S C   +   H P G    L+ CH QGGNQF+  + + EIR
Sbjct: 455 ISSECLDYNAPEHNPTGARLSLFGCHGQGGNQFFEYTSNREIR 497


>gi|355748155|gb|EHH52652.1| hypothetical protein EGM_13122 [Macaca fascicularis]
          Length = 608

 Score =  134 bits (336), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGEAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNIYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 457 ISGPHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516

Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ ++  E+  +   CLD     +     L  CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|241998138|ref|XP_002433712.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
 gi|215495471|gb|EEC05112.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
          Length = 653

 Score =  134 bits (336), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 77/220 (35%), Positives = 109/220 (49%), Gaps = 28/220 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL+PLL+ +  N + V  P+I  I  DTFE    P           GGF+W L 
Sbjct: 286 CEVNVGWLEPLLERIRANRATVTCPIIDIINADTFEYTASP--------IVRGGFNWGLH 337

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W + P    ++ + A  P+ +PTMAGGLF++D+ FF +LG YD G DIWGGENLE+SF
Sbjct: 338 FKWESPPAGLARKGRGAIAPIPSPTMAGGLFAMDRKFFHRLGEYDDGMDIWGGENLEISF 397

Query: 124 KF-----NWHAIPERER----KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY--DSGF 172
           +          IP        +R +    P    T+      +   + +    Y   +  
Sbjct: 398 RIWMCGGQLEIIPCSRVGHVFRRRRPYGSPNGEDTLTKNSLRVAHVWMDDYKKYYFQTRS 457

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV 212
           D+ G           +GD+TSR  LR+ LGC+SF WY++ 
Sbjct: 458 DVVGKP---------YGDITSRVALRKRLGCRSFDWYMKT 488


>gi|195172682|ref|XP_002027125.1| GL20074 [Drosophila persimilis]
 gi|194112938|gb|EDW34981.1| GL20074 [Drosophila persimilis]
          Length = 597

 Score =  134 bits (336), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 96/308 (31%), Positives = 142/308 (46%), Gaps = 44/308 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
           CE    W +PLL  +  + + V+ P+I  I  + F+        T+ YK F +GGF WN 
Sbjct: 243 CEGNVGWCEPLLHRIKESRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 296

Query: 63  QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
            F+W  +PERE++R +   +      P ++PTMAGGLF++D+ +F ++G+YD   D WGG
Sbjct: 297 HFDWINLPEREKQRQRRECKQEREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGG 356

Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           ENLE+SF+          IP            P   P        I+ A    L   D  
Sbjct: 357 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEF 414

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
            +I+     +L F  D GDVT R  LR+ L CKSF WYL                  +V 
Sbjct: 415 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFAWYLKNIYPEKFVPNADVVGWGKVK 474

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
           +  S +C+D   +  +    VGLYPC K    +Q +  +    +R + +C      D   
Sbjct: 475 SVSSNLCLDDLLQNNEKPYNVGLYPCGKVLQKSQLFSFTNSQVLRNELSCATVQHSDSPP 534

Query: 270 --VILYPC 275
             V++ PC
Sbjct: 535 YRVVMVPC 542


>gi|126341064|ref|XP_001364304.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
           [Monodelphis domestica]
          Length = 609

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 102/329 (31%), Positives = 143/329 (43%), Gaps = 66/329 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WLQPLL  +  +   VV P+I  I  DT         + SS     GGF+W L 
Sbjct: 249 CEVNKMWLQPLLVPIQEDRRTVVCPVIDIISADTL--------MYSSSPIVRGGFNWGLH 300

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E +  + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 301 FKWDLVPFSELEGPEGAIAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISF 360

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RKR +    P    TM      +   +       D   +
Sbjct: 361 RIWMCGGKLFIIPCSRVGHIFRKR-RPYGSPEGQDTMTYNSLRLAHVWL------DEYKE 413

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
            +     EL  K  +G+++ R  LR+ LGCKSFKWYL+                      
Sbjct: 414 QYFSLRPELKLK-SYGNISERIALRKKLGCKSFKWYLDNIYPEMQLSGPNAKPQQPVFIN 472

Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMS-KHGEIR 257
                        + +  +  C+ +   P+     V L  C     NQ W+ + +H  I 
Sbjct: 473 RGPKRPKILQRGRLYHLQTNKCLAAQGHPSQKGGLVVLRVCDYSDPNQVWIYNEEHELIL 532

Query: 258 RDEACLDY----AGGDVILYPCHGSKGNQ 282
            +  CLD     +     L  CHGS G+Q
Sbjct: 533 NNLLCLDMSETRSSDPPRLMKCHGSGGSQ 561


>gi|125810093|ref|XP_001361353.1| GA20875 [Drosophila pseudoobscura pseudoobscura]
 gi|54636528|gb|EAL25931.1| GA20875 [Drosophila pseudoobscura pseudoobscura]
          Length = 597

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 96/308 (31%), Positives = 142/308 (46%), Gaps = 44/308 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
           CE    W +PLL  +  + + V+ P+I  I  + F+        T+ YK F +GGF WN 
Sbjct: 243 CEGNVGWCEPLLHRIKESRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 296

Query: 63  QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
            F+W  +PERE++R +   +      P ++PTMAGGLF++D+ +F ++G+YD   D WGG
Sbjct: 297 HFDWINLPEREKQRQRRECKQEREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGG 356

Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           ENLE+SF+          IP            P   P        I+ A    L   D  
Sbjct: 357 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEF 414

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
            +I+     +L F  D GDVT R  LR+ L CKSF WYL                  +V 
Sbjct: 415 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFAWYLKNIYPEKFVPNADVVGWGKVK 474

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
           +  S +C+D   +  +    VGLYPC K    +Q +  +    +R + +C      D   
Sbjct: 475 SVSSNLCLDDLLQNNEKPYNVGLYPCGKVLQKSQLFSFTNSQVLRNELSCATVQHSDSPP 534

Query: 270 --VILYPC 275
             V++ PC
Sbjct: 535 YRVVMVPC 542


>gi|426337572|ref|XP_004032775.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3, partial
           [Gorilla gorilla gorilla]
          Length = 413

 Score =  133 bits (335), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 87/265 (32%), Positives = 134/265 (50%), Gaps = 36/265 (13%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           WL+PLL  +A N + VVSP IA+I  +TFE   P    ++  +   G FDW+L F W ++
Sbjct: 157 WLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNR---GNFDWSLSFGWESL 213

Query: 70  PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA 129
           P+ E++R K+   P+ TPT AGGLFSI K +FE +G+YD   +IWGGEN+E+SF+  W  
Sbjct: 214 PDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRV-WQC 272

Query: 130 IPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
             + E           R K+    P  T  +A     + + + ++   Y   F     + 
Sbjct: 273 GGQLEIMPCSVVGHVFRSKSPHSFPKGTQVIARNQVRLAEVWMDE---YKEIFYRRNTDA 329

Query: 180 LELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDWSGMCI 221
            ++  +  FGD++ R E++  L CK+F WYL                   + +    +C+
Sbjct: 330 AKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPEVYVPDLNPVISGYIKSVGQPLCL 389

Query: 222 DSACKPTDMHKPVGLYPCHKQGGNQ 246
           D   +     KP+ +Y CH  GGNQ
Sbjct: 390 DVG-ENNQGGKPLIMYTCHGLGGNQ 413


>gi|312383497|gb|EFR28562.1| hypothetical protein AND_03374 [Anopheles darlingi]
          Length = 874

 Score =  133 bits (335), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 98/319 (30%), Positives = 145/319 (45%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + R+ + +  P+I  I   TFE R     + +    + G F+W + 
Sbjct: 243 CEVNTNWLPPLLAPIHRDRTVMTVPIIDGIDHKTFEYR----PVYADGHHYRGIFEWGML 298

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +  + +P RE+KR K+ +EP  +PT AGGLF+I++ FF  LG YDSG  +WGGEN ELSF
Sbjct: 299 YKENEVPRREQKRRKHDSEPYRSPTHAGGLFAINRKFFLDLGAYDSGLLVWGGENFELSF 358

Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K  W      E     R  H       + P   G L S  K     +  Y    + W  E
Sbjct: 359 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDE 413

Query: 179 NLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYL-------------------- 210
             +  F          D GD++ +  L+  L CKSF+WY+                    
Sbjct: 414 PYKEYFYTREPLAQYLDMGDISEQLALKERLQCKSFQWYMDNVAYDVLDKYPMLPANLFW 473

Query: 211 -EVSNDWSGMCIDSACK-PTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGG 268
            E+ N     C+D+  + P  +   +GL  CH QG NQ   ++  G++   E C++    
Sbjct: 474 GELQNTGMEKCVDALGRQPPAI---IGLQVCHGQGHNQLIRLNAAGQLGVGERCIEAYNA 530

Query: 269 DVILYPCHGSKGNQYFEYD 287
           D+ L  C     +  ++YD
Sbjct: 531 DIKLAFCRLGTVDGPWQYD 549


>gi|170593939|ref|XP_001901721.1| glycosyl transferase, group 2 family protein [Brugia malayi]
 gi|158590665|gb|EDP29280.1| glycosyl transferase, group 2 family protein [Brugia malayi]
          Length = 645

 Score =  133 bits (335), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 97/318 (30%), Positives = 150/318 (47%), Gaps = 48/318 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + +N   +  P+I  I  + +  R   G   S+ K + G F+W L 
Sbjct: 296 CEVNVNWLPPLLAPIRQNRKVMTVPVIDGIDKNDWSYRIVYG---SADKHYRGIFEWGLL 352

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    +  +E  R K+ +EP  +PT AGGLF+I+K +FE+LG YD G  IWGGE  ELSF
Sbjct: 353 YKETELSSQELLRRKHNSEPFRSPTHAGGLFAINKKWFEELGYYDPGLQIWGGEQYELSF 412

Query: 124 KFNWHA------IPERERKRHKNAAEPV------WTPTMAGGLFSIDKAFFEKLGTYDSG 171
           K  W        +P         +  P         P ++  +  + K + ++   YD  
Sbjct: 413 KI-WQCGGGILFVPCSHVGHVYRSHMPYGFGKLSGKPVISTNMLRVIKTWMDE---YDKY 468

Query: 172 FDIWGGENLELSFKGDF-GDVTSRKELRRNLGCKSFKWYL-------------------- 210
           + I      E S +    G+++S+ +LR++L CKSFKWY+                    
Sbjct: 469 YYI-----REPSARHRLPGNISSQLKLRKSLKCKSFKWYMEKVAYDVVVSYPFPPENHVW 523

Query: 211 -EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD 269
            E  N  +G CID+  +P  +   VG  PCH  GGNQ   +++ G++ + E C+    G+
Sbjct: 524 GEAKNHATGKCIDTMGRP--VPGIVGATPCHGYGGNQLIRLNRKGQLAQGEWCITAVHGN 581

Query: 270 VILYPCHGSKGNQYFEYD 287
           +I   C     +  F Y+
Sbjct: 582 LITNHCIKGTVDGPFTYN 599


>gi|402586829|gb|EJW80766.1| glycosyltransferase [Wuchereria bancrofti]
          Length = 409

 Score =  133 bits (334), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 97/318 (30%), Positives = 146/318 (45%), Gaps = 48/318 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + +N   +  P+I  I  + +  R   G +   Y+   G F+W L 
Sbjct: 60  CEVNVNWLPPLLAPIRQNRKIMTVPVIDGIDKNDWSYRIVYGSVDKHYR---GIFEWGLL 116

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    +  +E  R K+ +EP  +PT AGGLF+I+K +FE+LG YD G  IWGGE  ELSF
Sbjct: 117 YKETELSSQELLRRKHNSEPFRSPTHAGGLFAINKKWFEELGYYDPGLQIWGGEQYELSF 176

Query: 124 KFNWHA------IPERERKRHKNAAEPV------WTPTMAGGLFSIDKAFFEKLGTYDSG 171
           K  W        +P         +  P         P ++  +  + K + ++   YD  
Sbjct: 177 KI-WQCGGGILFVPCSHVGHVYRSHMPYGFGKLSGKPVISTNMLRVIKTWMDE---YDKY 232

Query: 172 FDIWGGENLELSFKGDF-GDVTSRKELRRNLGCKSFKWYL-------------------- 210
           + I      E S K    G+++S+ +LR +L CKSFKWY+                    
Sbjct: 233 YYI-----REPSAKHRLPGNISSQLKLRESLKCKSFKWYMEKVAYDVIVSYPFPPENHVW 287

Query: 211 -EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD 269
            E  N  +G CID+  +P      VG  PCH  GGNQ   ++  G++ + E C+    G+
Sbjct: 288 GEAKNHATGKCIDTMGRPVP--GIVGATPCHGYGGNQLIRLNMKGQLAQGEWCITAVHGN 345

Query: 270 VILYPCHGSKGNQYFEYD 287
           +I   C     +  F Y+
Sbjct: 346 LITNHCIKGTVDGPFTYN 363


>gi|260817709|ref|XP_002603728.1| hypothetical protein BRAFLDRAFT_126865 [Branchiostoma floridae]
 gi|229289050|gb|EEN59739.1| hypothetical protein BRAFLDRAFT_126865 [Branchiostoma floridae]
          Length = 501

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 90/305 (29%), Positives = 145/305 (47%), Gaps = 50/305 (16%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           WL+PLL  + +N+S V  P+I +I   TF             KF  GGF W+L F W  +
Sbjct: 152 WLEPLLARIRKNNSTVACPVIDHIDTKTFAY--------EQLKFLAGGFTWDLNFMWIYV 203

Query: 70  PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF---- 125
            + E  R K+A +PV  P MAGGLF+I K +F+ +G YD   +I+GGEN+E+SF+     
Sbjct: 204 NKEEMARRKSAIDPVRCPVMAGGLFAIYKDYFQHIGAYDQAMEIYGGENVEMSFRVWQCG 263

Query: 126 -NWHAIPERERKRHKNAAEP---VWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
                +P       +   +P   V +         ++KA   ++   +    ++  E   
Sbjct: 264 GRIETVPCSRVGHIERTDKPYLYVRSNDTKDINIEVNKARVAEVWMDEYKRYLYAREPQL 323

Query: 182 LSFKGDFGDVTSRKELRRNLGCKSFKWYL---------------------EVSNDWSGMC 220
            +    +GD++ R+ LR+ LGC+SF+WY+                     E+ N  +G+C
Sbjct: 324 KNI--SYGDISERQALRKRLGCQSFQWYMENVYPDRLEQTVENGYYRAWGELRNLQAGLC 381

Query: 221 IDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKG 280
           +D         + VGL+ CH QGG QF+ + +     + +A      GD+    C G++G
Sbjct: 382 LDLMDG-----RGVGLWDCHGQGGQQFFALRRP---EKRKALQTIGTGDM---QCMGTEG 430

Query: 281 NQYFE 285
            + FE
Sbjct: 431 TERFE 435


>gi|354468358|ref|XP_003496633.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
           [Cricetulus griseus]
          Length = 541

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 98/320 (30%), Positives = 140/320 (43%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 191 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDWSLH 243

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++    +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 244 FQWEQLSPEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISF 303

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   IP        RK+H     P   P      +         +       +
Sbjct: 304 RVWMCGGSLEIIPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 349

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG++ +R  LR+NL C++FKWYLE       V  D S  
Sbjct: 350 VWMDEYKQYYYAARPFALERPFGNIENRLNLRKNLHCQTFKWYLENVYPELRVPPDSSIQ 409

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 410 KGNIRQRQKCLESQKQKKQETPHLRLSPCTKVKGEEAKSQVWAFTYTQQIIQEELCLSVV 469

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 470 TLFPGAPVVLVLCKNGDERQ 489


>gi|332243650|ref|XP_003270991.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           2 [Nomascus leucogenys]
          Length = 527

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 167 CEVNVMWLQPLLAAIREDQHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 218

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 219 FKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 278

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 279 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 315

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 316 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 375

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 376 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 435

Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ ++  E+  +   CLD     +     L  CHGS G+Q + +
Sbjct: 436 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 483


>gi|224044641|ref|XP_002188932.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
           [Taeniopygia guttata]
          Length = 608

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 102/333 (30%), Positives = 145/333 (43%), Gaps = 66/333 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNEMWLQPLLAPIREDPRTVVCPVIDIISADTLTY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E +  + A  P+ +PTMAGGLF++D+ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLAELEGPEGATAPIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISF 359

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RKR +    P    TMA     +   +       D   +
Sbjct: 360 RIWMCGGRLLIIPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWM------DEYKE 412

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
            +     EL  +  +G++T R ELR+ L CKSFKWYL+                      
Sbjct: 413 QYFALRPELRTRS-YGNITDRVELRKRLNCKSFKWYLDNIYPEMQISGPNAKAPQPVFIN 471

Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK-HGEIR 257
                        + +  +  C+ +   P+     V +  C     NQ W+ ++ H  I 
Sbjct: 472 RAQKRPKIIQRGRLYHLQTNKCLVAQGHPSQKGGLVVVRECDYNDQNQVWIYNEDHELIL 531

Query: 258 RDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
            +  CLD     +     L  CHGS G+Q + +
Sbjct: 532 NNLLCLDVSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|355689604|gb|AER98888.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 [Mustela putorius
           furo]
          Length = 306

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 79/221 (35%), Positives = 110/221 (49%), Gaps = 29/221 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGFDWNL 
Sbjct: 94  CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFDWNLV 146

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 147 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEIS 206

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   +P            P   P  +G +F+ +              ++W  
Sbjct: 207 FRVWQCGGSLEIVPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 257

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
           E     +          +G++ SR ELR+ L CK FKWYLE
Sbjct: 258 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLE 298


>gi|332243648|ref|XP_003270990.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           1 [Nomascus leucogenys]
          Length = 608

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVMWLQPLLAAIREDQHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 457 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516

Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ ++  E+  +   CLD     +     L  CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|426358557|ref|XP_004046575.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           3 [Gorilla gorilla gorilla]
          Length = 527

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 167 CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 218

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 219 FKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 278

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 279 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 315

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 316 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 375

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 376 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNRCLVAQGRPSQKGGLVVLKACDYSD 435

Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ ++  E+  +   CLD     +     L  CHGS G+Q + +
Sbjct: 436 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 483


>gi|431895736|gb|ELK05155.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Pteropus alecto]
          Length = 608

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 107/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVMWLQPLLAAIQEDRRTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YD G DIWGGENLE+SF
Sbjct: 300 FKWDLVPLPEPGGPEGATAPIKSPTMAGGLFAMNRDYFSELGQYDRGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLRTRSYGNISERVELRKKLGCKSFKWYLDNIYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +G C+ +  +P+     V L  C    
Sbjct: 457 VSGPNAKPQQPIFINRGPKRPKVLQRGRLYHLQTGKCLVAQGRPSQKGGLVVLKACDYSD 516

Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ + +H  I  +  CLD     +     L  CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELILSNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|426358553|ref|XP_004046573.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           1 [Gorilla gorilla gorilla]
 gi|426358555|ref|XP_004046574.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           2 [Gorilla gorilla gorilla]
          Length = 608

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 457 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNRCLVAQGRPSQKGGLVVLKACDYSD 516

Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ ++  E+  +   CLD     +     L  CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|332870119|ref|XP_003318977.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Pan
           troglodytes]
          Length = 527

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 167 CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 218

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 219 FKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 278

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 279 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 315

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 316 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 375

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 376 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 435

Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ ++  E+  +   CLD     +     L  CHGS G+Q + +
Sbjct: 436 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 483


>gi|114616856|ref|XP_001143140.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           3 [Pan troglodytes]
 gi|114616860|ref|XP_001143304.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           4 [Pan troglodytes]
 gi|410221964|gb|JAA08201.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
           troglodytes]
 gi|410256658|gb|JAA16296.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
           troglodytes]
 gi|410301646|gb|JAA29423.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
           troglodytes]
 gi|410301648|gb|JAA29424.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
           troglodytes]
 gi|410348810|gb|JAA41009.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
           troglodytes]
          Length = 608

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 457 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516

Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ ++  E+  +   CLD     +     L  CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|402865473|ref|XP_003896947.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Papio
           anubis]
          Length = 608

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNIYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 457 ISGPHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516

Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ ++  E+  +   CLD     +     L  CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|395838351|ref|XP_003792079.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
           [Otolemur garnettii]
          Length = 608

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 147/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVMWLQPLLAAIREDQQTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F  LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGEEGATAPIKSPTMAGGLFAMNRQYFHDLGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNIYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 457 ISGPHAKPQQPIFINRGLSRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYGD 516

Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ + +H  +  +  CLD     +     L  CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|397469939|ref|XP_003806595.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           1 [Pan paniscus]
 gi|397469941|ref|XP_003806596.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           2 [Pan paniscus]
          Length = 608

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVMWLQPLLATIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 457 ISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516

Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ ++  E+  +   CLD     +     L  CHGS G+Q + +
Sbjct: 517 PNQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|58865788|ref|NP_001012109.1| polypeptide N-acetylgalactosaminyltransferase 14 [Rattus
           norvegicus]
 gi|50926091|gb|AAH79128.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 (GalNAc-T14)
           [Rattus norvegicus]
 gi|149050682|gb|EDM02855.1| rCG61782, isoform CRA_b [Rattus norvegicus]
          Length = 552

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 98/320 (30%), Positives = 139/320 (43%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++    +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 255 FQWEQLSVEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RK+H     P   P      +         +       +
Sbjct: 315 RVWMCGGGLEIIPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG++ +R  LR+NL C++FKWYLE       V  D S  
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNIENRLNLRKNLHCQTFKWYLENVYPELRVPPDSSIQ 420

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       + L PC K  G+    Q W  +   +I ++E CL   
Sbjct: 421 KGNIRQRQKCLESQKQKNQETPHLRLSPCAKVKGDRAKSQVWAFTYTQQIIQEELCLSVV 480

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 481 TLFPGAPVVLVLCKNGDERQ 500


>gi|350400046|ref|XP_003485719.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
           [Bombus impatiens]
          Length = 643

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 89/289 (30%), Positives = 133/289 (46%), Gaps = 16/289 (5%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV KRW++PLL  +A + + +  P+I  I  DTF+    P           GGF+W L 
Sbjct: 271 IEVNKRWIEPLLSQIAHSKTIIAMPVIDIINPDTFQYTGSP--------LVRGGFNWGLH 322

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P       ++  +P+ +PTMAGGLF++D+ +F KLG YD+G DIWGGENLE+SF
Sbjct: 323 FKWDNVPVGTFAHDEDFIKPIKSPTMAGGLFAMDRKYFTKLGEYDAGMDIWGGENLEISF 382

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
           +  W      E                 G     D      L       D +    L+  
Sbjct: 383 RI-WMCGGSIELIPCSRVGHVFRRRRPYGTFDQHDTMLKNSLRVAHVWLDEYKDYFLKNV 441

Query: 184 FKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDM-----HKPVGLYP 238
            K D+GD++ R  LR+ L CK+F WYL V      +  D+  +  D       KP+  + 
Sbjct: 442 QKVDYGDISERLNLRKRLKCKNFAWYLNVVYPELALPDDNKNRLKDKWAKIEQKPIQPWH 501

Query: 239 CHKQG-GNQFWM-MSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFE 285
             K+   +Q+ + +S      + E  +   G  +IL PC   K   ++E
Sbjct: 502 SRKRNYTDQYQIRLSNSALCIQSEKDIKTKGSKLILAPCLRIKSQMWYE 550


>gi|47228512|emb|CAG05332.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 595

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 102/343 (29%), Positives = 145/343 (42%), Gaps = 95/343 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  + ++   VV P+I  I  DT  L + P  +        GGF+W L 
Sbjct: 236 CEVNQMWLQPLLAPIRQDRRTVVCPVIDIISADT--LSYSPSPIVR------GGFNWGLH 287

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E K  +    P+ +PTMAGGLF+I++ +F ++G YD+G DIWGGENLE+SF
Sbjct: 288 FKWDPVPPAELKSPQGPVGPIRSPTMAGGLFAINRKYFNEIGQYDAGMDIWGGENLEISF 347

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 348 R--------------------IW---MCGGQLFIIPCSRVGHIFRKRRPYGSPGGQDTMA 384

Query: 177 GENLELSF------------------KGDFGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L+                   + D+GD+  R  LR+ L C+SF+WYL+       
Sbjct: 385 HNSLRLAHVWMDEYKEQYLSMRPDLRQRDYGDIGERVALRKRLQCRSFRWYLDTVYPEMQ 444

Query: 212 ---------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGG 244
                                      + N  +  C+ +  + +     V L PC  Q  
Sbjct: 445 TVAGGNKHQPLFINKDLKRPKVLQRGRLRNLATNRCLVAQGRASQKGGVVVLRPCDPQDP 504

Query: 245 NQFWMMSKHGE-IRRDEACLDYAGGDVI----LYPCHGSKGNQ 282
            Q W   + G+ +     CLD +         L  CHGS G+Q
Sbjct: 505 EQEWAYDEEGQLVLAGLLCLDVSEVRTFDPPRLMKCHGSGGSQ 547


>gi|195455372|ref|XP_002074693.1| GK23025 [Drosophila willistoni]
 gi|194170778|gb|EDW85679.1| GK23025 [Drosophila willistoni]
          Length = 599

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 99/321 (30%), Positives = 148/321 (46%), Gaps = 45/321 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
           CE    W +PLL  +  + + V+ P+I  I  + F+        T+ YK F +GGF WN 
Sbjct: 244 CEGNVGWCEPLLQRIKESRTSVLVPIIDVIDANDFQYS------TNGYKAFQVGGFQWNG 297

Query: 63  QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
            F+W  +PERE++R +   +      P ++PTMAGGLF+ID+ +F ++G+YD   D WGG
Sbjct: 298 HFDWVNLPEREKQRQRRECDQAREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGG 357

Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           ENLE+SF+          IP            P   P        I+ A    L   D  
Sbjct: 358 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDDY 415

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
            +I+     +L F  D GDVT R  LR+ L CKSF WYL                  +V 
Sbjct: 416 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFDWYLKNVYPEKFVPNKNVQYWGKVR 475

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
              + +C+D   +  +    +GLYPC K    +Q +  +    +R + +C      D   
Sbjct: 476 AVNANLCLDDLLQNNEKPFNLGLYPCGKTLQKSQLFSYTNSQVLRNELSCATVQHSDSPP 535

Query: 270 --VILYPCHGS-KGNQYFEYD 287
             V++ PC  S K N  ++Y+
Sbjct: 536 RRVVMVPCSESDKFNDQWKYE 556


>gi|296488074|tpg|DAA30187.1| TPA: polypeptide N-acetylgalactosaminyltransferase 11-like [Bos
           taurus]
          Length = 605

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 99/333 (29%), Positives = 143/333 (42%), Gaps = 67/333 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 246 CEVNVLWLQPLLAAIREDRRTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 297

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 298 FKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISF 357

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RKR +    P    TM      +   + ++   Y S   
Sbjct: 358 RIWMCGGKLFIIPCSRVGHIFRKR-RPYGSPEGQDTMTHNSLRLAHVWLDEYKQYFSLRP 416

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
                N        +G+++ R ELR+ L CKSFKWYL+                      
Sbjct: 417 DLRTRN--------YGNISERVELRKKLDCKSFKWYLDNIYPEMQISGPNVKPQQPIFIN 468

Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMS-KHGEIR 257
                        + +  +  C+ +  +P++    V L  C     NQ W+ + +H  + 
Sbjct: 469 RGPKRPKVLQRGRLYHLQTNKCLVAQGRPSEKGGLVVLKACDYSDPNQVWIYNEEHELVL 528

Query: 258 RDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
            +  CLD     +     L  CHGS G+Q + +
Sbjct: 529 NNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 561


>gi|156364641|ref|XP_001626455.1| predicted protein [Nematostella vectensis]
 gi|156213331|gb|EDO34355.1| predicted protein [Nematostella vectensis]
          Length = 512

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 97/338 (28%), Positives = 152/338 (44%), Gaps = 74/338 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   V  P+I  I  DTFE         SS     GGF+W L 
Sbjct: 150 CEVNINWLQPLLQHIHDDQKAVACPVIDVISSDTFEY--------SSSPMVRGGFNWGLH 201

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP     + ++  +P+ +PTMAGGLF++D+ +F +LG YDSG DIWG ENLE+SF
Sbjct: 202 FTWEPIPPSLLVKPEDYVKPIRSPTMAGGLFAVDREYFTQLGKYDSGMDIWGAENLEISF 261

Query: 124 KF-----NWHAIPERER----KRHKNAAEPVWTPTMAGGLFSIDKAFFE--KLGTYDSGF 172
           +      +   +P        +R +         TM+     + + + +  K   Y    
Sbjct: 262 RIWMCGGSLDILPCSRVGHLFRRFRPYGSDSKGDTMSRNSMRLAEVWLDGYKKYFYQIRH 321

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWY----------------------- 209
           D+ G +         FGD++ R +LR++L CKSF+WY                       
Sbjct: 322 DLEGKK---------FGDISQRIKLRKSLQCKSFEWYLKNIYPELKPPGQPGGGAFYPID 372

Query: 210 ----------------LEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKH 253
                           L+ S D  G C+DS   P++      ++ C +   ++FW +++ 
Sbjct: 373 RRPQVVIWKGKVICIQLQTSFD-DGYCLDSPGHPSEKKASAVIHQC-ESTKSRFWSLNED 430

Query: 254 GEIRRDE-ACLDYAGGD----VILYPCHGSKGNQYFEY 286
           GE++ +   CL+ +G      + L  CH   G Q +++
Sbjct: 431 GELKIESLLCLEASGYQSKLGLRLMKCHAQGGGQQWKF 468


>gi|260809642|ref|XP_002599614.1| hypothetical protein BRAFLDRAFT_217836 [Branchiostoma floridae]
 gi|229284894|gb|EEN55626.1| hypothetical protein BRAFLDRAFT_217836 [Branchiostoma floridae]
          Length = 432

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 88/289 (30%), Positives = 134/289 (46%), Gaps = 61/289 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDW-NL 62
           CE    WL+PLL+ ++ N + V  P++  I  + F   F  G L+      +G  D  +L
Sbjct: 160 CECMYGWLEPLLERISLNHTVVPWPVLDMIQHNDFAYLFHGGVLS------VGSVDLVDL 213

Query: 63  QFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           +FNWHA+P++E +  K+  +P+ +PTM GG+FSI K +FE LG YD G +IWGGEN+ELS
Sbjct: 214 RFNWHAVPQKEFRARKSIIDPIRSPTMPGGVFSIHKKYFEYLGGYDDGMEIWGGENIELS 273

Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 182
           F+  W                 +   +  G +F +          Y +  D W   N  L
Sbjct: 274 FRVIWQC----------GGTIELVPCSHVGHVFRVTSP-------YSAPVDKWMKNNKRL 316

Query: 183 S------FKG------------DFGDVTSRKELRRNLGCKSFKWYLEV------------ 212
           +      +K             + G+V  RK LR+ L C  F WY++             
Sbjct: 317 AEVWMDDYKNVIYRKHPDYKTVETGNVMPRKVLRKALHCHDFSWYVQNVYPNLYVPDVRP 376

Query: 213 ----SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR 257
                   +G C+D+     +  K   L+ CH  GGNQ+W  ++ GE+R
Sbjct: 377 VAYGQVRMTGKCLDAVSPEKEQPK---LFGCHGLGGNQYWEFTRAGEVR 422


>gi|194220840|ref|XP_001500424.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Equus
           caballus]
          Length = 539

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 95/320 (29%), Positives = 140/320 (43%), Gaps = 62/320 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  D F           S     GGFDW+L 
Sbjct: 189 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDNFNY-------IESATELRGGFDWSLH 241

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ +  + AEP+ TP +AGGLF ++K++F+ LG YD   DIWGGEN E+SF
Sbjct: 242 FQWEQLSPEQKAQRLDPAEPIRTPVIAGGLFVMNKSWFDYLGKYDMDMDIWGGENFEISF 301

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P      +         +       +
Sbjct: 302 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTVE 347

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG++ SR +LR  L C+SFKWYLE       +  D S  
Sbjct: 348 VWMDEYKQYYYAARPFALERPFGNIDSRVDLRSTLLCQSFKWYLENVYPELRIPKDSSIQ 407

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGG----NQFWMMSKHGEIRRDEACLD-- 264
                    C++S  +       V L PC K  G    +Q W  +   +I ++E CL   
Sbjct: 408 KGNIRQRQKCLESQKQDNQKISNVKLSPCVKSKGEDTMSQIWAFTYTQQIIQEELCLSVI 467

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 468 TVFPGAPVVLVLCKNEDDKQ 487


>gi|431904511|gb|ELK09894.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Pteropus alecto]
          Length = 557

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 99/314 (31%), Positives = 140/314 (44%), Gaps = 50/314 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKISRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYL+         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLDNVYPELTVPVKEVLPGIIKQGV 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
            C++S  + T  +  +G+  C     N    Q W  S H  I++ E CL         G 
Sbjct: 440 NCLESQGQDTAGNFLLGVGICRGSAKNPLASQAWTFSDH-LIQQQEKCLTATSTSISPGS 498

Query: 269 DVILYPCHGSKGNQ 282
            VIL  C+  +G Q
Sbjct: 499 PVILQACNPREGRQ 512


>gi|301759363|ref|XP_002915525.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Ailuropoda melanoleuca]
 gi|281339844|gb|EFB15428.1| hypothetical protein PANDA_003531 [Ailuropoda melanoleuca]
          Length = 608

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  + ++   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVMWLQPLLAAIQQDQRTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELRR LGCKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLRTKSYGNISERVELRRKLGCKSFKWYLDNIYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 457 ISGPNAKPQQPIFINRGPKRPKVLQRGRLYHLQTDKCLVAQGRPSQKGGLVVLKACDYSD 516

Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
             Q W+ + +H  +  +  CLD     +     L  CHGS G+Q + +
Sbjct: 517 PGQIWIYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|19922324|ref|NP_611043.1| GalNAc-T1, isoform A [Drosophila melanogaster]
 gi|24653878|ref|NP_725472.1| GalNAc-T1, isoform B [Drosophila melanogaster]
 gi|51315876|sp|Q6WV20.2|GALT1_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
           Short=pp-GaNTase 1; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 1; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 1
 gi|10121393|gb|AAG13184.1|AF218236_1 polypeptide N-acetylgalactosaminyltransferase [Drosophila
           melanogaster]
 gi|7303062|gb|AAF58130.1| GalNAc-T1, isoform B [Drosophila melanogaster]
 gi|21064373|gb|AAM29416.1| RE14585p [Drosophila melanogaster]
 gi|21645385|gb|AAM70974.1| GalNAc-T1, isoform A [Drosophila melanogaster]
 gi|220947986|gb|ACL86536.1| GalNAc-T1-PA [synthetic construct]
          Length = 601

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 97/324 (29%), Positives = 150/324 (46%), Gaps = 45/324 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
           CE    W +PLL  +  + + V+ P+I  I  + F+        T+ YK F +GGF WN 
Sbjct: 247 CEGNIGWCEPLLQRIKESRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 300

Query: 63  QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
            F+W  +PERE++R +   +      P ++PTMAGGLF+ID+ +F ++G+YD   D WGG
Sbjct: 301 HFDWINLPEREKQRQRRECKQEREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGG 360

Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           ENLE+SF+          IP            P   P        I+ A    L   D  
Sbjct: 361 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 418

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
            +I+     +L F  D GDVT R  LR+ L CKSF+WYL                  +V 
Sbjct: 419 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFEWYLKNIYPEKFVPTKDVQGWGKVH 478

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
              S +C+D   +  +     GLYPC K    +Q +  +    +R + +C      +   
Sbjct: 479 AVNSNICLDDLLQNNEKPYNAGLYPCGKVLQKSQLFSFTNTNVLRNELSCATVQHSESPP 538

Query: 270 --VILYPC-HGSKGNQYFEYDYKY 290
             V++ PC    + N+ + Y++++
Sbjct: 539 YRVVMVPCMENDEFNEQWRYEHQH 562


>gi|34042906|gb|AAQ56699.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           [Drosophila melanogaster]
          Length = 601

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 97/324 (29%), Positives = 150/324 (46%), Gaps = 45/324 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
           CE    W +PLL  +  + + V+ P+I  I  + F+        T+ YK F +GGF WN 
Sbjct: 247 CEGNIGWCEPLLQRIKESRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 300

Query: 63  QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
            F+W  +PERE++R +   +      P ++PTMAGGLF+ID+ +F ++G+YD   D WGG
Sbjct: 301 HFDWINLPEREKQRQRRECKQEREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGG 360

Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           ENLE+SF+          IP            P   P        I+ A    L   D  
Sbjct: 361 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 418

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
            +I+     +L F  D GDVT R  LR+ L CKSF+WYL                  +V 
Sbjct: 419 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFEWYLKNIYPEKFVPTKDVQGWGKVH 478

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
              S +C+D   +  +     GLYPC K    +Q +  +    +R + +C      +   
Sbjct: 479 AVNSNICLDDLLQNNEKPYNAGLYPCGKVLQKSQLFSFTNTNVLRNELSCATVQHSESPP 538

Query: 270 --VILYPC-HGSKGNQYFEYDYKY 290
             V++ PC    + N+ + Y++++
Sbjct: 539 YRVVMVPCMENDEFNEQWRYEHQH 562


>gi|449274705|gb|EMC83783.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Columba livia]
          Length = 502

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 98/312 (31%), Positives = 133/312 (42%), Gaps = 48/312 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L        GGFDW+L 
Sbjct: 161 CEVNSEWLQPMLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 213

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  + + TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 214 FKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSF 273

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  K        +   + 
Sbjct: 274 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYDFP--EGNALTYIKNTKRTAEVWMDEYK 326

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSN---------------DWSG 218
            +  E    +    FG V  R E RR L CKSF+WYLE                     G
Sbjct: 327 QYYYEARPSAIGKSFGSVAERVEQRRKLNCKSFQWYLENVYPELKIPEKELIPGIIKQGG 386

Query: 219 MCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA----GGDV 270
            C++S  + T  +  VG+  C     N    Q W+ S    IR+ + CL       G  +
Sbjct: 387 NCLESQAQDTTGNTLVGMGNCKGTVSNPPVTQEWVFS-DPLIRQQDKCLSITSFSMGSHI 445

Query: 271 ILYPCHGSKGNQ 282
            L  C+   G Q
Sbjct: 446 TLEACNQKDGRQ 457


>gi|156407314|ref|XP_001641489.1| predicted protein [Nematostella vectensis]
 gi|156228628|gb|EDO49426.1| predicted protein [Nematostella vectensis]
          Length = 353

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 75/211 (35%), Positives = 106/211 (50%), Gaps = 11/211 (5%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WLQPLL  +  + + V  P+I  I    F     P  +       IGGF W++Q
Sbjct: 131 CEANVDWLQPLLSRIHSDRTIVAVPVIDIISSTNFMYSGTPSAV-------IGGFSWDMQ 183

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F WH++P   +   K+   P+ TPTMAGGLFSID+ +F + G+YD G D+WGGENLE+SF
Sbjct: 184 FTWHSLPNNRQSERKDRTAPIRTPTMAGGLFSIDRKYFFESGSYDEGMDVWGGENLEMSF 243

Query: 124 KFNWHAIPERER---KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENL 180
           +  W    + E     R  +     +  +  GG   +       +  +   ++ +     
Sbjct: 244 RI-WQCGGKLEILPCSRVGHVFRTRFPYSFPGGYSEVSVNLARVVHVWMDEYNQYVYMKR 302

Query: 181 ELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
                  +GD+TSR  LR  L CKSFKWYLE
Sbjct: 303 PDLQSLKYGDITSRVALRNKLKCKSFKWYLE 333


>gi|195488539|ref|XP_002092358.1| GE11714 [Drosophila yakuba]
 gi|194178459|gb|EDW92070.1| GE11714 [Drosophila yakuba]
          Length = 601

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 97/324 (29%), Positives = 150/324 (46%), Gaps = 45/324 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
           CE    W +PLL  +  + + V+ P+I  I  + F+        T+ YK F +GGF WN 
Sbjct: 247 CEGNIGWCEPLLQRIKESRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 300

Query: 63  QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
            F+W  +PERE++R +   +      P ++PTMAGGLF+ID+ +F ++G+YD   D WGG
Sbjct: 301 HFDWINLPEREKQRQRRECKHDREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGG 360

Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           ENLE+SF+          IP            P   P        I+ A    L   D  
Sbjct: 361 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 418

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
            +I+     +L F  D GDVT R  LR+ L CKSF+WYL                  +V 
Sbjct: 419 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFEWYLKNIYPEKFVPTKDVQGWGKVH 478

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
              S +C+D   +  +     GLYPC K    +Q +  +    +R + +C      +   
Sbjct: 479 ALNSNICLDDLLQNNEKPYNAGLYPCGKVLQKSQLFSFTNTNVLRNELSCATVQHSESPP 538

Query: 270 --VILYPC-HGSKGNQYFEYDYKY 290
             V++ PC    + N+ + Y++++
Sbjct: 539 YRVVMVPCMENDEFNEQWRYEHQH 562


>gi|432950788|ref|XP_004084611.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 11-like [Oryzias
           latipes]
          Length = 574

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 98/328 (29%), Positives = 141/328 (42%), Gaps = 65/328 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  + ++   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 215 CEVNQDWLQPLLAPIQKDRRTVVCPIIDIISADTLTY--------SSSPIVRGGFNWGLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + AA P+ +PTMAGGLF++++ +F +LG YD G DIWGGENLE+SF
Sbjct: 267 FKWDPVPPSEISGPEGAAGPIRSPTMAGGLFAMNREYFNELGRYDPGMDIWGGENLEISF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RKR +    P    TMA     +   + ++   Y   + 
Sbjct: 327 RIWMCGGQLLIIPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWMDE---YKEQYL 382

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
               E    S    +GD++ R  LR+ L C+SF+WYL+                      
Sbjct: 383 SLRPELRNRS----YGDISERVALRKRLQCRSFRWYLDTVYPEMQAVASGNRPPPLFVNK 438

Query: 212 ------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGE-IRR 258
                       + N   G C+ +  + +     V + PC  +   Q W   + G+ +  
Sbjct: 439 GLKRPKVLQRGRLRNLAVGRCLTAQGRASQKGGAVVVRPCDPRDPEQEWSYDEEGQLVLA 498

Query: 259 DEACLDYAGGDVI----LYPCHGSKGNQ 282
              CLD +         L  CHGS G+Q
Sbjct: 499 GLLCLDVSEVRTFDPPRLMKCHGSGGSQ 526


>gi|432111808|gb|ELK34851.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Myotis davidii]
          Length = 539

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 93/297 (31%), Positives = 135/297 (45%), Gaps = 47/297 (15%)

Query: 20  RNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNW-HAIPERERKRHK 78
           ++ + VVSP+I  I  D F+          +     GGFDWNL F W +  PE+ R R  
Sbjct: 223 QDRTRVVSPIIDVINMDNFQY-------VGASADLKGGFDWNLVFKWDYMTPEQRRARQG 275

Query: 79  NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF-----NWHAIPER 133
           N   P+ TP +AGGLF +DK++FE+LG YD   D+WGGENLE+SF+      +   IP  
Sbjct: 276 NPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCS 335

Query: 134 ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKG------- 186
                     P   P  +G +F+ +              ++W  E     +         
Sbjct: 336 RVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMDEYKNFYYAAVPSARNV 386

Query: 187 DFGDVTSRKELRRNLGCKSFKWYLE-----------VSNDWSGMCIDSACKPTDMH---K 232
            +G++ SR ELR+ L CK F+WYLE               +  +   + C  T  H    
Sbjct: 387 PYGNIQSRLELRKKLSCKPFRWYLENVYPELRVPDHQDIAFGALQQGTNCLDTLGHFADG 446

Query: 233 PVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI-LYPCHGSKGNQYFE 285
            VG+Y CH  GGNQ W ++K   ++  + CL   D   G +I L  C  +   Q +E
Sbjct: 447 VVGVYECHNAGGNQEWALTKEKSVKHMDLCLTVVDRTPGSLIKLQGCRENDSRQKWE 503


>gi|324505926|gb|ADY42538.1| N-acetylgalactosaminyltransferase 7 [Ascaris suum]
          Length = 640

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 97/311 (31%), Positives = 141/311 (45%), Gaps = 34/311 (10%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + RN   +  P+I  I   T+  R   G   S+ + F G F+W L 
Sbjct: 289 CEVNINWLPPLLAPIRRNRKVMTVPVIDGIDMHTWSYRRVYG---SADRHFRGIFEWGLL 345

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    I + E +R K  +EP  +PT AGGLF+IDK +FE+LG YD G  IWGGE  ELSF
Sbjct: 346 YKETEITKEEARRRKYNSEPFRSPTHAGGLFAIDKKWFEELGYYDPGLQIWGGEQYELSF 405

Query: 124 KFNWHA------IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           K  W        +P         +  P     ++G    I       + T+   ++ +  
Sbjct: 406 KI-WQCGGGILFVPCSHVGHVYRSHMPYGFGKLSGKPV-ISTNMVRVIKTWMDEYEKYYY 463

Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL---------------------EVSNDW 216
                +     GD++++ ELR+ L CKSFKWY+                     E  N  
Sbjct: 464 IREPSAKHRSPGDISAQLELRKRLHCKSFKWYMEKVAYDVVYSYPFLPENHVWGEAKNLQ 523

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCH 276
           +  CID+  +   +   VG  PCH  GGNQ   ++K G++ + E C+   G  +    C 
Sbjct: 524 TSKCIDTMGRA--IPGIVGATPCHGYGGNQLIRLNKKGQLTQGEWCMTPLGNQLQTGHCA 581

Query: 277 GSKGNQYFEYD 287
               +  F+YD
Sbjct: 582 KGTVDGPFQYD 592


>gi|194755004|ref|XP_001959782.1| GF13042 [Drosophila ananassae]
 gi|190621080|gb|EDV36604.1| GF13042 [Drosophila ananassae]
          Length = 599

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 97/324 (29%), Positives = 152/324 (46%), Gaps = 45/324 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
           CE    W +PLL  +  + + V+ P+I  I  + F+        T+ YK F +GGF WN 
Sbjct: 245 CEGNIGWCEPLLQRIKESRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 298

Query: 63  QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
            F+W  +PERE++R +   +      P ++PTMAGGLF++D+ +F ++G+YD   D WGG
Sbjct: 299 HFDWINLPEREKQRQRRECKQQREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGG 358

Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           ENLE+SF+          IP            P   P        I+ A    L   D  
Sbjct: 359 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 416

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------VSNDW---- 216
            +I+     +L F  D GDVT R  LR+ L CK+F+WYL+             N W    
Sbjct: 417 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKNFEWYLKNIYPEKFVPTHNVNAWGKVQ 476

Query: 217 ---SGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
                +C+D   +  +    VGLYPC K    +Q +  +K   +R + +C      +   
Sbjct: 477 AVSGNLCLDDLLQNNEKPYNVGLYPCGKTLQKSQLFSFTKSQVLRNELSCATVQHSESPP 536

Query: 270 --VILYPC-HGSKGNQYFEYDYKY 290
             V++ PC    + N+ ++Y+ ++
Sbjct: 537 YRVVMVPCLENDEFNEQWKYERQH 560


>gi|3047207|gb|AAC13679.1| GLY9 [Caenorhabditis elegans]
          Length = 579

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 89/315 (28%), Positives = 143/315 (45%), Gaps = 49/315 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+P++  ++   + +V P+I +I D+T             +    GGF W L 
Sbjct: 230 CEANHGWLEPIVQRISDERTAIVCPMIDSISDNTLAYH-------GDWSLSTGGFSWALH 282

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  + E E+KR     + + +PTMAGGL + ++ +F ++G YD   DIWGGENLE+SF
Sbjct: 283 FTWEGLSEEEQKRRTKPTDYIRSPTMAGGLLAANREYFFEVGGYDEEMDIWGGENLEISF 342

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +      +   IP         A  P +  T       +     ++L       ++W  +
Sbjct: 343 RAWMCGGSIEFIPCSHVGHIFRAGHP-YNMTGRNNNKDVHGTNSKRLA------EVWMDD 395

Query: 179 NLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
              L +         D GD+T+R ELR+ L CK FKW+L+                  + 
Sbjct: 396 YKRLYYMHREDLRTKDVGDLTARHELRKRLNCKPFKWFLDNIAKGKFIMDEDVVAYGALH 455

Query: 214 NDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGN-QFWMMSKHGEIRRDEACLDYAGGDV 270
              SG  MC D+  +   M + +G++ C  +G + Q   +SK G +RR+  C     G++
Sbjct: 456 TVVSGTRMCTDTLQRDEKMSQLLGVFHCQGKGSSPQLMSLSKEGNLRRENTCASEENGNI 515

Query: 271 ILYPCHGSKGNQYFE 285
            +  C  SK  Q+ E
Sbjct: 516 RMKTC--SKKAQFNE 528


>gi|73979014|ref|XP_539924.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Canis
           lupus familiaris]
          Length = 608

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 105/348 (30%), Positives = 147/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVMWLQPLLAAIQEDQQTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLRTKSYGNISERVELRKKLGCKSFKWYLDNIYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 457 ISGPNAKPQQPIFINRGPKRPKILQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516

Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
             Q W+ + +H  +  +  CLD     +     L  CHGS G+Q + +
Sbjct: 517 PTQIWIYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|167519663|ref|XP_001744171.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777257|gb|EDQ90874.1| predicted protein [Monosiga brevicollis MX1]
          Length = 607

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 96/322 (29%), Positives = 148/322 (45%), Gaps = 61/322 (18%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           EV K WL+P++  +  +  HVV P+I +I  D+F              +  GG D  L F
Sbjct: 256 EVSKGWLEPMMARINEDRKHVVMPIIDSIDPDSF-------------NYMRGGLDI-LGF 301

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
           +W    +    R +   EP+ +P MAGGLFS+D+ +F  LG YD G  ++GGE LE+SF+
Sbjct: 302 SWGMGQKSIGSRRRTRVEPMPSPIMAGGLFSMDRKYFFDLGGYDPGMKLYGGEELEISFR 361

Query: 125 F-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
                     IP   R  H       W     G ++++      K        ++W  E 
Sbjct: 362 IWQCGGTLECIP-CSRVGHVFRTGAYWK----GQVYTVPGHVIVK--NKLRAAEVWMDEY 414

Query: 180 LELSFK--------GDFGDVTSRKELRRNLGCKSFKWYL--------------------E 211
            E+  +         D GD+++ +E+RR   CK FKW+L                    E
Sbjct: 415 KEVVQRVMPPLPRGMDLGDLSAMQEIRRKFQCKPFKWFLKNVYPEMFVPNDEESIEASGE 474

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD----EACLDYAG 267
           + N  +  C D+        K +G+YPCH   G Q +++SK G++R      + CLD   
Sbjct: 475 IRNPQTNACFDTLGASHQGAK-IGVYPCHHSHGTQEFVLSKAGDVRVAAMDFDNCLDRGN 533

Query: 268 GD--VILYPCHGSKGNQYFEYD 287
           GD  V ++PCH + GNQ +++D
Sbjct: 534 GDGSVGIWPCHQTGGNQAWKWD 555


>gi|71994065|ref|NP_001022876.1| Protein GLY-9, isoform a [Caenorhabditis elegans]
 gi|51316113|sp|Q9U2C4.1|GALT9_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 9;
           AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 9; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 9; Short=pp-GaNTase 9
 gi|6018409|emb|CAB57897.1| Protein GLY-9, isoform a [Caenorhabditis elegans]
          Length = 579

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 89/315 (28%), Positives = 143/315 (45%), Gaps = 49/315 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+P++  ++   + +V P+I +I D+T             +    GGF W L 
Sbjct: 230 CEANHGWLEPIVQRISDERTAIVCPMIDSISDNTLAYH-------GDWSLSTGGFSWALH 282

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  + E E+KR     + + +PTMAGGL + ++ +F ++G YD   DIWGGENLE+SF
Sbjct: 283 FTWEGLSEEEQKRRTKPTDYIRSPTMAGGLLAANREYFFEVGGYDEEMDIWGGENLEISF 342

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +      +   IP         A  P +  T       +     ++L       ++W  +
Sbjct: 343 RAWMCGGSIEFIPCSHVGHIFRAGHP-YNMTGRNNNKDVHGTNSKRLA------EVWMDD 395

Query: 179 NLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
              L +         D GD+T+R ELR+ L CK FKW+L+                  + 
Sbjct: 396 YKRLYYMHREDLRTKDVGDLTARHELRKRLNCKPFKWFLDNIAKGKFIMDEDVVAYGALH 455

Query: 214 NDWSG--MCIDSACKPTDMHKPVGLYPCHKQGGN-QFWMMSKHGEIRRDEACLDYAGGDV 270
              SG  MC D+  +   M + +G++ C  +G + Q   +SK G +RR+  C     G++
Sbjct: 456 TVVSGTRMCTDTLQRDEKMSQLLGVFHCQGKGSSPQLMSLSKEGNLRRENTCASEENGNI 515

Query: 271 ILYPCHGSKGNQYFE 285
            +  C  SK  Q+ E
Sbjct: 516 RMKTC--SKKAQFNE 528


>gi|344273523|ref|XP_003408571.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1-like
           [Loxodonta africana]
          Length = 555

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 98/314 (31%), Positives = 141/314 (44%), Gaps = 52/314 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKISRTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGM-------------- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE  N +  +              
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLE--NVYPELTVPEKEVLPGTIKQ 437

Query: 220 ---CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA----GG 268
              C++S  + T     +G+  C     N    Q W+ S H  I++   CL  A    G 
Sbjct: 438 GVNCLESQGQDTAGDTLLGMGICRGSAKNPVAAQEWLFSDH-LIQQQGKCLAAAFPSPGA 496

Query: 269 DVILYPCHGSKGNQ 282
            V L  C+  +G+Q
Sbjct: 497 LVALQACNSKEGSQ 510


>gi|410953276|ref|XP_003983298.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           2 [Felis catus]
          Length = 527

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 147/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 167 CEVNVLWLQPLLAAIREDPRTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 218

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 219 FKWDLVPLSELGGPEGATAPIRSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISF 278

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 279 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 315

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELRR LGCKSFKWYL+       
Sbjct: 316 HNSLRLAHVWLDEYKEQYFSLRPDLRTKSYGNISERVELRRKLGCKSFKWYLDNIYPEMQ 375

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 376 ISGPNAKPQQPIFINRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 435

Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
             Q W+ + +H  +  +  CLD     +     L  CHGS G+Q + +
Sbjct: 436 PGQVWIYNEEHELVLNNLLCLDVSETRSSDPPRLMKCHGSGGSQQWTF 483


>gi|410953274|ref|XP_003983297.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           1 [Felis catus]
          Length = 608

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 147/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVLWLQPLLAAIREDPRTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGPEGATAPIRSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELRR LGCKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLRTKSYGNISERVELRRKLGCKSFKWYLDNIYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 457 ISGPNAKPQQPIFINRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYSD 516

Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
             Q W+ + +H  +  +  CLD     +     L  CHGS G+Q + +
Sbjct: 517 PGQVWIYNEEHELVLNNLLCLDVSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|444724231|gb|ELW64842.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Tupaia chinensis]
          Length = 654

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 147/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVLWLQPLLAAIREDRRTVVCPVIDIISADTLAY--------SSSPAVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E      A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELAGAGGATAPIKSPTMAGGLFAMNRQYFSELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGQLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTRSYGNISERVELRKRLGCKSFKWYLDNVYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 457 IPGPNARPQQPVFVHRGPKRPRVLLRGRLYHLQTSRCLVAQGRPSQKGGLVVLKACDYGD 516

Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ + +H  +  +  CLD     +     L  CHGS G+Q + +
Sbjct: 517 PNQVWVYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|296210174|ref|XP_002751861.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
           [Callithrix jacchus]
          Length = 607

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 105/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 247 CEVNVMWLQPLLAAIREDQHTVVCPVIDIISADTLAY--------SSSPIVRGGFNWGLH 298

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 299 FRWDLVPLSELGGAEGATTPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISF 358

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 359 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 395

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+       
Sbjct: 396 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERIELRKKLGCKSFKWYLDNIYPEMQ 455

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 456 TSGPHAKPQQPIFVNKGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYTD 515

Query: 244 GNQFWMMSKHGEIRRDE-ACLDY----AGGDVILYPCHGSKGNQYFEY 286
            +Q W+ ++  E+  +   CLD     +     L  CHGS G+Q + +
Sbjct: 516 PDQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 563


>gi|403276614|ref|XP_003929989.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
           [Saimiri boliviensis boliviensis]
          Length = 566

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 95/323 (29%), Positives = 133/323 (41%), Gaps = 88/323 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVMWLQPLLAAIREDQHTVVCPVIDIISADTLAY--------SSSPIVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++D+ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FRWDLVPLSELGGAEGATTPIKSPTMAGGLFAMDRQYFHELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
           +         E+                   FS+      K                   
Sbjct: 360 RVILFFCVLNEQ------------------YFSLRPDLKTK------------------- 382

Query: 184 FKGDFGDVTSRKELRRNLGCKSFKWYLE-------------------------------- 211
               +G+++ R ELR+ LGCKSFKWYL+                                
Sbjct: 383 ---SYGNISERVELRKKLGCKSFKWYLDNIYPEMQISGPHAKPQQPIFVNRGPKRPKVLQ 439

Query: 212 ---VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE-ACLDY-- 265
              + +  S  C+ +  +P+     V L  C     NQ W+ ++  E+  +   CLD   
Sbjct: 440 RGRLRHLQSNTCLVAQGRPSQKGGLVVLKACDYTDPNQIWIYNEEHELVLNSLLCLDMSE 499

Query: 266 --AGGDVILYPCHGSKGNQYFEY 286
             +     L  CHGS G+Q + +
Sbjct: 500 TRSSDPPRLMKCHGSGGSQQWTF 522


>gi|334310655|ref|XP_001378662.2| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1-like
           [Monodelphis domestica]
          Length = 563

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 96/312 (30%), Positives = 134/312 (42%), Gaps = 48/312 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L        GGFDW+L 
Sbjct: 222 CEVNSEWLQPMLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 274

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDKA+F  LG YD+  DIWGGEN ELSF
Sbjct: 275 FKWEQIPIEQKMSRTDPTQPIRTPVIAGGIFVIDKAWFNHLGKYDTQMDIWGGENFELSF 334

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  K        +   + 
Sbjct: 335 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYDFP--EGNALTYIKNTKRTAEVWMDEYK 387

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSN---------------DWSG 218
            +  E    +    FG +  R+E R+ + CKSF+WYLE                      
Sbjct: 388 QYYYEARPSAIGKSFGSIADREEQRKKMNCKSFQWYLENVYPELKIPEKEMIPGIIKQGT 447

Query: 219 MCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDY----AGGDV 270
           +C++S  + T  +  V +  C     N    Q W+ S    IR+ + CL       G  V
Sbjct: 448 ICLESQGQDTAGNNLVVMGSCKGTSNNPSMTQEWVFSD-PLIRQQDKCLAITSFSTGSQV 506

Query: 271 ILYPCHGSKGNQ 282
            L  C+   G Q
Sbjct: 507 TLEACNQKDGRQ 518


>gi|170038571|ref|XP_001847122.1| N-acetyl galactosaminyl transferase [Culex quinquefasciatus]
 gi|167882321|gb|EDS45704.1| N-acetyl galactosaminyl transferase [Culex quinquefasciatus]
          Length = 560

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 95/303 (31%), Positives = 143/303 (47%), Gaps = 39/303 (12%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           WL+ LLD +ARN   +  P I  I +    LR      T +   + G +DW+L F W   
Sbjct: 214 WLEALLDPVARNWMTIAIPTIDWIDEHDMHLR------TENAPTYYGAYDWDLNFGWWGR 267

Query: 70  PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNW-- 127
             R  K+ +N  EP  TP MAGGLF+I++ FFE LG YD GF+I+G EN+ELS K +W  
Sbjct: 268 WSRV-KQPQNKLEPFETPAMAGGLFAINRTFFELLGWYDEGFEIYGIENIELSMK-SWIC 325

Query: 128 ----HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK-LGTYDSG-FDIWGGENLE 181
                 +P       +    P         + +      E  +  Y    FDI+G     
Sbjct: 326 GGKMLTVPCSRVAHIQKTGHPYLMKANKDVVRANSLRLAEVWMDEYKQVIFDIYGLPRYP 385

Query: 182 LSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVSNDWSGMCIDS 223
           +    + GDV+SRKE+RR   CK+F++Y+                  EV N  + +  D+
Sbjct: 386 VE---EVGDVSSRKEVRRKANCKTFRYYIETAYPEMKNPLIEGAFRGEVKN--AALGNDT 440

Query: 224 ACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQY 283
                     VG+  C     +QFW+ + + E+   + CLDY G ++ ++ CH  +GNQ 
Sbjct: 441 CLTYHAATNTVGMASCDHAEKSQFWVHNYYQELNSYKHCLDYTGSELGVFGCHRGRGNQA 500

Query: 284 FEY 286
           ++Y
Sbjct: 501 WQY 503


>gi|348568069|ref|XP_003469821.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Cavia porcellus]
          Length = 608

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 99/333 (29%), Positives = 144/333 (43%), Gaps = 66/333 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNEMWLQPLLATIRGDPHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E      A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGEDGATAPIKSPTMAGGLFAMNRQYFNELGQYDSGMDIWGGENLEISF 359

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RKR +    P    TM      +   +       D   D
Sbjct: 360 RIWMCGGKLFIIPCSRVGHIFRKR-RPYGSPEGQDTMTHNSLRLAHVWL------DEYKD 412

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
            +     +L  K  +G+++ R ELR+ LGC+SFKWYL+                      
Sbjct: 413 QYFSLRPDLKTK-SYGNISERVELRKRLGCRSFKWYLDNIYPEMQVQGPNAKAQQPVFVN 471

Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMS-KHGEIR 257
                        + +  +  C+ +  +P+     V L  C  +   Q W+ + +H  + 
Sbjct: 472 RGPKRPRVLRRGRLYHFQTNKCLVAQGRPSQKGSLVVLKACDYRDPAQVWIYNEEHELVL 531

Query: 258 RDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
            +  CLD     +     L  CHGS G+Q + +
Sbjct: 532 NNLLCLDVSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|328783898|ref|XP_003250361.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Apis
           mellifera]
          Length = 603

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 97/297 (32%), Positives = 138/297 (46%), Gaps = 40/297 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A+N + VVSP+I  I DDTF         T S++   G F+W+L 
Sbjct: 250 CECTVGWLEPLLEAVAKNRTRVVSPVIDIINDDTFSY-------TRSFELHWGAFNWDLH 302

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +  R  K R +N  EP  TP MAGGLFS+++ +F +LG+YD+   IWGGENLELS
Sbjct: 303 FRWLTLNGRLLKERRENIVEPFRTPAMAGGLFSMNRDYFFELGSYDNQMKIWGGENLELS 362

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY--DSGFDIW 175
           F+      +    P          + P    T  GG+  I      ++     D   + +
Sbjct: 363 FRVWQCGGSIEIAPCSHVGHLFRKSSPY---TFPGGVGEILYGNLARVALVWMDEWAEFY 419

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSNDWSG 218
              N E +   D   + SR ELR+ L CK+F+WYL+                 + +  S 
Sbjct: 420 FKFNAEAARLRDKQTIRSRLELRKKLQCKNFEWYLDNIWPEHFFPKDDRFFGRIVHILSK 479

Query: 219 MCIDSACKPTDMHKPVGLYPCH----KQGGNQFWMMSKHGEIRRDEA-CLDYAGGDV 270
            CI          +P G    H    +   NQ ++M+  G I  DE+ CLD    D 
Sbjct: 480 KCIMRPSAKGTYSQPSGYAILHSCVPRPLLNQMFVMTADGIIMTDESVCLDAPENDT 536


>gi|395849607|ref|XP_003797413.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
           [Otolemur garnettii]
          Length = 558

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 99/315 (31%), Positives = 139/315 (44%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEVLPGIIKQGV 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     N    Q W+ S H  I++   CL          G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513


>gi|339242863|ref|XP_003377357.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
           spiralis]
 gi|316973849|gb|EFV57398.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
           spiralis]
          Length = 383

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 85/225 (37%), Positives = 119/225 (52%), Gaps = 35/225 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLLD +A +    V+P+I  I D+TF+ +            + GGF+WNLQ
Sbjct: 155 CECTEGWLEPLLDRIAFDRKIAVAPVIDVINDETFQYQ-------KGIDVYRGGFNWNLQ 207

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W++ P  E KR  N    PV TPT+AGGLFSID+ FF ++G YD    IWGGENLE+S
Sbjct: 208 FRWYSSPPSELKRRGNDVTHPVRTPTIAGGLFSIDRQFFFEIGAYDKEMKIWGGENLEMS 267

Query: 123 FKF-----NWHAIP-------ERERKRHK----NAAEPVWTPTMAGGLFSIDKAFFEKLG 166
           F+          IP        R++  H     N+A      T+   L  + + + ++  
Sbjct: 268 FRIWQCGGQLEIIPCSHVGHVFRKKSPHDFPRGNSAR-----TLTTNLVRVAEVWMDE-- 320

Query: 167 TYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
            + S F I       +S   +  DV+ RKELR+ L CKSF WYL+
Sbjct: 321 -WKSLFYIISSAAKNIS---EIIDVSERKELRKRLKCKSFAWYLD 361


>gi|313233395|emb|CBY24510.1| unnamed protein product [Oikopleura dioica]
          Length = 679

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 107/327 (32%), Positives = 149/327 (45%), Gaps = 70/327 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E  + WL+PLL  +  + + VVSP+I  I  D F        L        GGF+W+L 
Sbjct: 333 VEANEGWLEPLLGRIHESRTAVVSPIIDVIGMDDFHYVGASADLK-------GGFNWDLV 385

Query: 64  FNWHAIPERERKRHKNA-AEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  + E+ER+  + A   P+ TP +AGGLFSIDK +F +LG YD   D+WGGENLE+S
Sbjct: 386 FKWDYMSEQERRERRRAPTSPIRTPMIAGGLFSIDKNWFHELGEYDMDMDVWGGENLEIS 445

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+          IP        RK+H     P   P  +G +F+ +              
Sbjct: 446 FRVWQCHGTLEIIPCSRVGHVFRKKH-----PYTFPGGSGNVFAKNTR---------RAA 491

Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-------------- 211
           ++W  E  E  F          FGD++ R E+R  L CKSF W+LE              
Sbjct: 492 EVWMDEYKEFYFAAVPSAKMVKFGDISKRTEVRERLQCKSFSWFLENVYPELRIPNKDAI 551

Query: 212 ----VSNDWSGM--CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHG-EIRRDEACLD 264
               VS    G+  CI +    T     +G+Y CH  GGNQ + ++K G E R ++ C+ 
Sbjct: 552 GWGAVSQTNKGLEECIGN----THGGGTLGMYRCHGDGGNQEFTLTKEGKEFRHNDLCIG 607

Query: 265 Y-----AGGDVILYPCHGSKGNQYFEY 286
           Y      G  V    CH    +Q +EY
Sbjct: 608 YNAKEPVGNPVKFNTCH-QMSHQRWEY 633


>gi|380030377|ref|XP_003698825.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Apis florea]
          Length = 595

 Score =  130 bits (328), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 97/297 (32%), Positives = 138/297 (46%), Gaps = 40/297 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A+N + VVSP+I  I DDTF         T S++   G F+W+L 
Sbjct: 242 CECTVGWLEPLLEAVAKNRTRVVSPVIDIINDDTFSY-------TRSFELHWGAFNWDLH 294

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +  R  K R +N  EP  TP MAGGLFS+++ +F +LG+YD+   IWGGENLELS
Sbjct: 295 FRWLTLNGRLLKERRENIVEPFRTPAMAGGLFSMNRDYFFELGSYDNQMKIWGGENLELS 354

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY--DSGFDIW 175
           F+      +    P          + P    T  GG+  I      ++     D   + +
Sbjct: 355 FRVWQCGGSIEIAPCSHVGHLFRKSSPY---TFPGGVGEILYGNLARVALVWMDEWAEFY 411

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSNDWSG 218
              N E +   D   + SR ELR+ L CK+F+WYL+                 + +  S 
Sbjct: 412 FKFNAEAARLRDKQTIRSRLELRKKLQCKNFEWYLDNIWPEHFFPKDDRFFGRIVHILSK 471

Query: 219 MCIDSACKPTDMHKPVGLYPCH----KQGGNQFWMMSKHGEIRRDEA-CLDYAGGDV 270
            CI          +P G    H    +   NQ ++M+  G I  DE+ CLD    D 
Sbjct: 472 KCIMRPSAKGTYSQPSGYAILHSCVPRPLLNQMFVMTTDGIIMTDESVCLDAPENDT 528


>gi|118097436|ref|XP_414578.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Gallus
           gallus]
          Length = 611

 Score =  130 bits (327), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 105/327 (32%), Positives = 144/327 (44%), Gaps = 61/327 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F      G  T +     G FDW + 
Sbjct: 245 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDHF------GYETQAGDAMRGAFDWEMY 298

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD+G +IWGGE  E+SF
Sbjct: 299 YKRIPIPPELQKL--DPSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISF 356

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPT---MAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   PT   +A  L  + + + ++   Y       
Sbjct: 357 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPTGVSLARNLKRVAEVWMDEYAEY---IYQR 413

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDVT++KELR NL CKSFKW++                      E+ 
Sbjct: 414 RPEYRHLS----AGDVTAQKELRNNLNCKSFKWFMNEVAWDLPKFYPPVEPPAAAWGEIR 469

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
           N  +G+C+D+  K   +  P+ L  C K  G   W        S   +IR  +       
Sbjct: 470 NVGTGLCVDT--KHGSLGSPLRLESCVKDRGEAAWNNVQVFTFSWREDIRPGDPQHTKKF 527

Query: 262 CLDYA--GGDVILYPCHGSKGNQYFEY 286
           C D       V LY CHG KGNQ + Y
Sbjct: 528 CFDAISHSSPVTLYDCHGMKGNQLWRY 554


>gi|440895697|gb|ELR47827.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Bos grunniens
           mutus]
          Length = 606

 Score =  130 bits (327), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 105/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 246 CEVNVLWLQPLLAAIREDRQTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 297

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 298 FKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISF 357

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 358 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 394

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ L CKSFKWYL+       
Sbjct: 395 HNSLRLAHVWLDEYKEQYFSLRPDLRTRNYGNISERVELRKKLDCKSFKWYLDNIYPEMQ 454

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P++    V L  C    
Sbjct: 455 ISGPNVKPQQPIFINRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSEKGGLVVLKACDYSD 514

Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ + +H  +  +  CLD     +     L  CHGS G+Q + +
Sbjct: 515 PNQVWIYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 562


>gi|344276552|ref|XP_003410072.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Loxodonta africana]
          Length = 527

 Score =  130 bits (326), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 88/233 (37%), Positives = 115/233 (49%), Gaps = 56/233 (24%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  +   VV P+I  I  DT         L SS     GGF+W L 
Sbjct: 248 CEVNEMWLQPLLAAVREDPHTVVCPVIDIISADTL--------LYSSSPIVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPFDELGGPEGATAPIKSPTMAGGLFAMNRHYFSELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE 211
             +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTRSYGNISERVELRKKLGCKSFKWYLD 449


>gi|313246954|emb|CBY35800.1| unnamed protein product [Oikopleura dioica]
          Length = 696

 Score =  130 bits (326), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 107/327 (32%), Positives = 149/327 (45%), Gaps = 70/327 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E  + WL+PLL  +  + + VVSP+I  I  D F        L        GGF+W+L 
Sbjct: 350 VEANEGWLEPLLGRIHESRTAVVSPIIDVIGMDDFHYVGASADLK-------GGFNWDLV 402

Query: 64  FNWHAIPERERKRHKNA-AEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  + E+ER+  + A   P+ TP +AGGLFSIDK +F +LG YD   D+WGGENLE+S
Sbjct: 403 FKWDYMSEQERRERRRAPTSPIRTPMIAGGLFSIDKNWFHELGEYDMDMDVWGGENLEIS 462

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+          IP        RK+H     P   P  +G +F+ +              
Sbjct: 463 FRVWQCHGTLEIIPCSRVGHVFRKKH-----PYTFPGGSGNVFAKNTR---------RAA 508

Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-------------- 211
           ++W  E  E  F          FGD++ R E+R  L CKSF W+LE              
Sbjct: 509 EVWMDEYKEFYFAAVPSAKMVKFGDISKRTEVRERLQCKSFSWFLENVYPELRIPNKDAI 568

Query: 212 ----VSNDWSGM--CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHG-EIRRDEACLD 264
               VS    G+  CI +    T     +G+Y CH  GGNQ + ++K G E R ++ C+ 
Sbjct: 569 GWGAVSQTNKGLEECIGN----THGGGTLGMYRCHGDGGNQEFTLTKEGKEFRHNDLCIG 624

Query: 265 Y-----AGGDVILYPCHGSKGNQYFEY 286
           Y      G  V    CH    +Q +EY
Sbjct: 625 YNAKEPVGNPVKFNTCH-QMSHQRWEY 650


>gi|426228257|ref|XP_004008230.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Ovis
           aries]
          Length = 606

 Score =  130 bits (326), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 105/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 246 CEVNVLWLQPLLAAIREDRRAVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 297

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 298 FKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISF 357

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 358 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 394

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ L CKSFKWYL+       
Sbjct: 395 HNSLRLAHVWLDEYKEQYFSLRPDLRTRNYGNISERVELRKKLDCKSFKWYLDNIYPEMQ 454

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P++    V L  C    
Sbjct: 455 ISGPNVKPQQPIFINRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSEKGGLVVLKACDYSD 514

Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ + +H  +  +  CLD     +     L  CHGS G+Q + +
Sbjct: 515 PNQVWIYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 562


>gi|155371981|ref|NP_001094597.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Bos taurus]
 gi|151554939|gb|AAI47930.1| GALNTL1 protein [Bos taurus]
 gi|296482974|tpg|DAA25089.1| TPA: polypeptide N-acetylgalactosaminyltransferase-like 1 [Bos
           taurus]
          Length = 557

 Score =  130 bits (326), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 97/314 (30%), Positives = 141/314 (44%), Gaps = 50/314 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   F 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEFK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYL+         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLDNVYPELTVPVKEVLPGIIKQGT 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
            C++S  + T  +  +G+  C     N    Q W+ + H  I++   CL         G 
Sbjct: 440 NCLESQGQDTAGNFQLGMGICRGSAKNPPAAQAWLFTDH-LIQQQGKCLAATSTSVSPGS 498

Query: 269 DVILYPCHGSKGNQ 282
            V+L  C+  +G Q
Sbjct: 499 LVVLQACNPREGRQ 512


>gi|440897357|gb|ELR49068.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Bos grunniens mutus]
          Length = 557

 Score =  130 bits (326), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 97/314 (30%), Positives = 141/314 (44%), Gaps = 50/314 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   F 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEFK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYL+         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLDNVYPELTVPVKEVLPGIIKQGT 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
            C++S  + T  +  +G+  C     N    Q W+ + H  I++   CL         G 
Sbjct: 440 NCLESQGQDTAGNFQLGMGICRGSAKNPPAAQAWLFTDH-LIQQQGKCLAATSTSVSPGS 498

Query: 269 DVILYPCHGSKGNQ 282
            V+L  C+  +G Q
Sbjct: 499 LVVLQACNPREGRQ 512


>gi|326928540|ref|XP_003210435.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Meleagris gallopavo]
          Length = 562

 Score =  130 bits (326), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 105/328 (32%), Positives = 144/328 (43%), Gaps = 62/328 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F      G  T +     G FDW + 
Sbjct: 195 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDHF------GYETQAGDAMRGAFDWEMY 248

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD+G +IWGGE  E+SF
Sbjct: 249 YKRIPIPPELQKL--DPSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISF 306

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPT---MAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   PT   +A  L  + + + ++   Y       
Sbjct: 307 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPTGVSLARNLKRVAEVWMDEYAEY---IYQR 363

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDVT++KELR NL CKSFKW++                      E+ 
Sbjct: 364 RPEYRHLS----AGDVTAQKELRNNLNCKSFKWFMSEVAWDLPKFYPPVEPPAAAWGEIR 419

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW-------MMSKHGEIRRDEA----- 261
           N  +G+C+D+  K   +  P+ L  C K  G   W         S   +IR  +      
Sbjct: 420 NVGTGLCVDT--KHGSLGSPLRLESCVKDRGEAAWNNVQVTXTFSWREDIRPGDPQHTKK 477

Query: 262 -CLDYA--GGDVILYPCHGSKGNQYFEY 286
            C D       V LY CHG KGNQ + Y
Sbjct: 478 FCFDAISHSSPVTLYDCHGMKGNQLWRY 505


>gi|358412070|ref|XP_870404.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           3 [Bos taurus]
 gi|359064998|ref|XP_002687097.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Bos
           taurus]
          Length = 606

 Score =  130 bits (326), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 105/348 (30%), Positives = 148/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 246 CEVNVLWLQPLLAAIREDRRTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 297

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 298 FKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISF 357

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 358 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 394

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ L CKSFKWYL+       
Sbjct: 395 HNSLRLAHVWLDEYKEQYFSLRPDLRTRNYGNISERVELRKKLDCKSFKWYLDNIYPEMQ 454

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P++    V L  C    
Sbjct: 455 ISGPNVKPQQPIFINRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSEKGGLVVLKACDYSD 514

Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
            NQ W+ + +H  +  +  CLD     +     L  CHGS G+Q + +
Sbjct: 515 PNQVWIYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 562


>gi|390341984|ref|XP_003725567.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Strongylocentrotus purpuratus]
          Length = 654

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 75/217 (34%), Positives = 108/217 (49%), Gaps = 22/217 (10%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV ++WL+PLL+ +  +S  VV P+I  I  DTF     P           GGF+W + 
Sbjct: 286 CEVNEQWLEPLLERIKADSHTVVCPIIDIINHDTFAYTASP--------LVKGGFNWGMH 337

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  I  R+    ++  +P+ +PTMAGGLF++++ +F KLG YD G DIWGGENLE+SF
Sbjct: 338 FKWDTIRSRQLVGKEDYVKPIESPTMAGGLFAMNREYFHKLGDYDEGMDIWGGENLEISF 397

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +          +P            P  +P         D      +   +   D +   
Sbjct: 398 RIWQCGGKLEIVPCSRVGHVFRKRRPYGSPNRQ------DTTTKNAVRVAEVWMDEYKEH 451

Query: 179 NLELSFKG---DFGDVTSRKELRRNLGCKSFKWYLEV 212
             ++  K    D+GD++SR  LR  L CKSFKWYL+ 
Sbjct: 452 FYQVQPKAKNIDYGDISSRVALREELKCKSFKWYLDT 488


>gi|357602062|gb|EHJ63261.1| putative n-acetylgalactosaminyltransferase [Danaus plexippus]
          Length = 499

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 93/295 (31%), Positives = 136/295 (46%), Gaps = 45/295 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + R+   +  P+I  I   TFE R P      +Y+   G F+W + 
Sbjct: 146 CEVNVNWLPPLLAPIYRDYKIMTVPVIDGIDHKTFEYR-PVYSHGINYR---GIFEWGML 201

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +  + +P+RE   HK+ +EP  +PT AGGLF+I++ +F ++G YD G  +WGGEN ELSF
Sbjct: 202 YKENEVPDREASLHKHKSEPYKSPTHAGGLFAINRNYFLEIGAYDPGLLVWGGENFELSF 261

Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K  W      E     R  H   A   + P   G L    K     +  Y    + W  E
Sbjct: 262 KI-WQCGGSIEWVPCSRVGHVYRA---FMPYSFGNLAKNRKGSLITI-NYKRVIETWFDE 316

Query: 179 NLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYLE------------------- 211
             +  F          D GD++ +  LR  L CKSF WY+E                   
Sbjct: 317 EHKEFFYTREPMARFLDMGDISEQVALRDKLNCKSFSWYMENVAYDVYDKFPKLPKNVHW 376

Query: 212 --VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD 264
             V N   G+C+D+  K    +  +G+  CH  G NQ + +++ G++   E CL+
Sbjct: 377 GMVKNKAIGLCLDTMGKAAPSY--IGIQSCHGAGNNQLYRLNEAGQLGVGERCLE 429


>gi|158289457|ref|XP_311182.4| AGAP000656-PA [Anopheles gambiae str. PEST]
 gi|157018524|gb|EAA06901.4| AGAP000656-PA [Anopheles gambiae str. PEST]
          Length = 598

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 97/327 (29%), Positives = 144/327 (44%), Gaps = 63/327 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + R+ + +  P+I  I   TFE R     + +    + G F+W + 
Sbjct: 245 CEVNTNWLPPLLAPIHRDRTVMTVPIIDGIDHKTFEYR----PVYADGHHYRGIFEWGML 300

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +  + +P RE+KR K+ +EP  +PT AGGLF+I++ FF +LG YDSG  +WGGEN ELSF
Sbjct: 301 YKENEVPRREQKRRKHDSEPYRSPTHAGGLFAINRKFFLELGAYDSGLLVWGGENFELSF 360

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAF----FEKLGTYDSG--FDIWGG 177
           K  W      E           W P    G   + + F    F KL     G    I   
Sbjct: 361 KI-WQCGGSIE-----------WVPCSRVG--HVYRGFMPYNFGKLANKKKGPLITINYK 406

Query: 178 ENLELSFKG----------------DFGDVTSRKELRRNLGCKSFKWYL----------- 210
             +E  F G                D GD++ +  L+  L CKSF+WY+           
Sbjct: 407 RVIETWFDGPYKEYFYTREPLARFLDMGDISEQLALKERLQCKSFQWYMDNVAYDVLDKY 466

Query: 211 ----------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDE 260
                     E+ N     C+D+  +       +GL  CH QG NQ   ++  G++   E
Sbjct: 467 PMLPANVKWGELQNVGKEKCVDALGRQPP--AVIGLQQCHGQGHNQLIRLNGAGQLGVGE 524

Query: 261 ACLDYAGGDVILYPCHGSKGNQYFEYD 287
            C++    ++ L  C     +  ++YD
Sbjct: 525 RCIEAYNSEIKLAFCRLGTVDGPWQYD 551


>gi|47228720|emb|CAG07452.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 611

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 97/302 (32%), Positives = 136/302 (45%), Gaps = 59/302 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +    + VVSP I  I  +TF+   P   + SS+ +  G FDW L 
Sbjct: 266 CECFHGWLEPLLARIVEEPTAVVSPEITTIDLETFQFNKP---VASSHAYNRGNFDWGLT 322

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IPE  RK  K+   PV TPT AGGLFSI K++FE +GTYD   +IWGGEN+E+SF
Sbjct: 323 FGWEQIPEAARKLRKDETYPVKTPTFAGGLFSILKSYFEHIGTYDDKMEIWGGENIEMSF 382

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
           +  W    + E          +   ++ G +F           T+  G D+     + L+
Sbjct: 383 RV-WQCGGQLE----------IIPCSVVGHVFRTKSPH-----TFPKGTDVITRNQVRLA 426

Query: 184 ------FKGDF------------GDVTSRK-----------ELRRNLGCKSFK------- 207
                 +K  F             D+T  K            L R+   K+         
Sbjct: 427 EVWMDDYKKIFYRRNRNAENMAKEDLTPEKYGAVRHTFLSITLERSSFLKNVTPLFIFDP 486

Query: 208 WYLEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLD 264
           +  ++ N  S  C+D   +     KPV +Y CH  GGNQ++  S H E+R +   E CL 
Sbjct: 487 YVAQIQNQGSKTCLDVG-ENNKGGKPVIMYQCHNMGGNQYFEYSSHNELRHNIGKEFCLH 545

Query: 265 YA 266
            A
Sbjct: 546 AA 547


>gi|308487864|ref|XP_003106127.1| CRE-GLY-6 protein [Caenorhabditis remanei]
 gi|308254701|gb|EFO98653.1| CRE-GLY-6 protein [Caenorhabditis remanei]
          Length = 693

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 79/215 (36%), Positives = 114/215 (53%), Gaps = 15/215 (6%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  K WL+PLL  +  N   V  P+I  I D+TF+ +          + F GGF+WNLQ
Sbjct: 254 CECTKGWLEPLLTRIKLNRKAVPCPVIDIINDNTFQYQ-------KGIEMFRGGFNWNLQ 306

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P    K+H  +   P+ +PTMAGGLFSID+ +FE+LG YD G DIWGGENLE+S
Sbjct: 307 FRWYGMPTEMAKQHLLDPTGPIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMS 366

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+          +P          + P   P  + G   ++         +   +  +  
Sbjct: 367 FRIWQCGGRVEILPCSHVGHVFRKSSPHDFPGKSSGKV-LNANLLRVAEVWMDEWKYYFY 425

Query: 178 ENLELSFK-GDFGDVTSRKELRRNLGCKSFKWYLE 211
           +   ++F+  +  DV+ R ELR+ L CKSFKWYL+
Sbjct: 426 KIAPVAFRMRESIDVSERVELRKKLNCKSFKWYLQ 460


>gi|149634819|ref|XP_001513114.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
           [Ornithorhynchus anatinus]
          Length = 608

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 102/336 (30%), Positives = 143/336 (42%), Gaps = 66/336 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNAMWLQPLLVPIREDRRTVVCPVIDIIGADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E      A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGPGRATAPIKSPTMAGGLFAMNREYFRELGQYDSGMDIWGGENLEISF 359

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RKR +    P    TMA     +   +       D   +
Sbjct: 360 RIWMCGGQLFIIPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWM------DEYKE 412

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------- 211
            +     EL  +  +G+++ R  LR+ LGCKSFKWYL+                      
Sbjct: 413 QYFALRPELRLR-SYGNISERVTLRKKLGCKSFKWYLDTVYPEMQISGPNARPQPPAFVN 471

Query: 212 -------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMS-KHGEIR 257
                        + +  +  C+ +   P+     V L  C     NQ W+ + +H  I 
Sbjct: 472 RGPKRPRILQRGRLYHLQTNKCLAAQGHPSQKGGRVVLKECDYGDLNQVWIYNEEHELIL 531

Query: 258 RDEACLDY----AGGDVILYPCHGSKGNQYFEYDYK 289
            +  CLD     +     L  CHGS G+Q + +  K
Sbjct: 532 NNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTFGRK 567


>gi|224051278|ref|XP_002200509.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
           [Taeniopygia guttata]
          Length = 570

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 97/314 (30%), Positives = 134/314 (42%), Gaps = 52/314 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L        GGFDW+L 
Sbjct: 229 CEVNSEWLQPMLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 281

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  + + TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 282 FKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSF 341

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  K        +   + 
Sbjct: 342 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYDFP--EGNALTYIKNTKRTAEVWMDEYK 394

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGM-------------- 219
            +  E    +    FG V  R ELR  L CKSF+WYLE  N +  +              
Sbjct: 395 QYYYEARPSAIGKSFGSVADRVELRHKLNCKSFQWYLE--NVYPELKIPEKELIPGIIRQ 452

Query: 220 ---CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA----GG 268
              C++S  +    +   G+  C     N    Q W+ S    IR+ + CL  A    G 
Sbjct: 453 GENCLESQAQDITGNVLAGMGNCKGTVNNPPVTQEWIFSDPS-IRQQDKCLSIASFSTGS 511

Query: 269 DVILYPCHGSKGNQ 282
            + L  C+   G Q
Sbjct: 512 QITLEACNQKDGRQ 525


>gi|254910954|ref|NP_082140.2| polypeptide N-acetylgalactosaminyltransferase 14 [Mus musculus]
 gi|115527999|gb|AAI17801.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 [Mus musculus]
          Length = 550

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 98/320 (30%), Positives = 140/320 (43%), Gaps = 64/320 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++    +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 255 FQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RK+H     P   P      +         +       +
Sbjct: 315 RVWMCGGGLEIIPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG++ +R  LR+NL C++FKWYLE       V  D S  
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNIENRLNLRKNLHCQTFKWYLENVYPELRVPPDSSIQ 420

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLD-- 264
                    C++S  +     + + L PC K  G+    Q W  +   +I ++E CL   
Sbjct: 421 KGNIRQRQKCLES--QKQKKQEILRLSPCAKVKGDGAKSQVWAFTYTQQIIQEELCLSVV 478

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 479 TLFPGAPVVLALCKNGDERQ 498


>gi|52851353|dbj|BAD52069.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase [Mus musculus]
          Length = 550

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 95/315 (30%), Positives = 136/315 (43%), Gaps = 54/315 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++    +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 255 FQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +          IP            P   P      +         +       ++W  E
Sbjct: 315 RVWMCGGGLEIIPCSRVGHVFRKKHPYVFPDGNANTY---------IKNTKRTAEVWMDE 365

Query: 179 NLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS------- 217
             +        + +  FG++ +R  LR+NL C++FKWYLE       V  D S       
Sbjct: 366 YKQYYYAARPFALERPFGNIENRLNLRKNLHCQTFKWYLENVYPELRVPPDSSIQKGNIR 425

Query: 218 --GMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLD----YAG 267
               C++S  +     + + L PC K  G+    Q W  +   +I ++E CL     + G
Sbjct: 426 QRQKCLES--QKQKKQEILRLSPCAKVKGDGAKSQVWAFTYTQQIIQEELCLSVVTLFPG 483

Query: 268 GDVILYPCHGSKGNQ 282
             V+L  C      Q
Sbjct: 484 APVVLALCKNGDERQ 498


>gi|157134100|ref|XP_001663146.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108870595|gb|EAT34820.1| AAEL012972-PA [Aedes aegypti]
          Length = 600

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 90/311 (28%), Positives = 142/311 (45%), Gaps = 48/311 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE   +WL+PLL+ + ++S+ V+ P+I     D  E +           F IGGF W+  
Sbjct: 243 CECMHQWLEPLLERIKQSSTSVLVPII-----DVIEAKNFYYSTNGVTDFQIGGFTWDGH 297

Query: 64  FNWHAIPERERKRHKN-------AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
           F+WH + +RE++R K        A  P ++PTMAGGLF+I + +F ++G+YD   D WGG
Sbjct: 298 FDWHDVTQREKERQKRECPEKDMAICPTYSPTMAGGLFAISRDYFWEIGSYDEQMDGWGG 357

Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPT--MAGGLFSIDKAFFEKLGTYD 169
           ENLE+SF+          IP            P   P      G+ ++  A        D
Sbjct: 358 ENLEMSFRVWQCGGTLETIPCSRIGHIFRDFHPYSFPNDRDTHGINTVRMATV----WMD 413

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
              D+      +L    + GDVT R+ LR  L CKSF WY++                  
Sbjct: 414 DYIDLLYLNRPDLRDHPEVGDVTHRRVLREKLRCKSFDWYMKNVYPEKFIPTRNVRAYGR 473

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQ--GGNQFWMMSKHGEIRRDEACLDYAGGD 269
           VS+    +C+D+  +  D    +G+Y C +     +Q   ++K G +R + +C      +
Sbjct: 474 VSSLAENLCLDTLQQNADKPWNLGIYTCFRTEVSASQLMSLTKRGVLRTERSCATVQDNN 533

Query: 270 -----VILYPC 275
                V++ PC
Sbjct: 534 AETRFVVMIPC 544


>gi|417402722|gb|JAA48197.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
           rotundus]
          Length = 557

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 97/314 (30%), Positives = 138/314 (43%), Gaps = 50/314 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L        GGFDW+L 
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEVLPGIIKQGV 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
            C++S  + T  +  +G+  C     N    Q W+ S H  I++   CL         G 
Sbjct: 440 NCLESQGQDTAGNFLLGVGICRGSAKNPPAPQAWLFSDH-LIQQQGKCLTATSTSVSPGS 498

Query: 269 DVILYPCHGSKGNQ 282
            V L  C+  +G Q
Sbjct: 499 PVTLQACNLREGRQ 512


>gi|148706466|gb|EDL38413.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14, isoform CRA_b [Mus
           musculus]
          Length = 551

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 98/320 (30%), Positives = 140/320 (43%), Gaps = 64/320 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 203 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDWSLH 255

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++    +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 256 FQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISF 315

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RK+H     P   P      +         +       +
Sbjct: 316 RVWMCGGGLEIIPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 361

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG++ +R  LR+NL C++FKWYLE       V  D S  
Sbjct: 362 VWMDEYKQYYYAARPFALERPFGNIENRLNLRKNLHCQTFKWYLENVYPELRVPPDSSIQ 421

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLD-- 264
                    C++S  +     + + L PC K  G+    Q W  +   +I ++E CL   
Sbjct: 422 KGNIRQRQKCLES--QKQKKQEILRLSPCAKVKGDGAKSQVWAFTYTQQIIQEELCLSVV 479

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 480 TLFPGAPVVLALCKNGDERQ 499


>gi|403264517|ref|XP_003924524.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
           [Saimiri boliviensis boliviensis]
          Length = 558

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 99/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V SR E R+ + CKSF+WYLE         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVASRIEQRKKMNCKSFRWYLENVYPELTVPVKEVLPGIIKQGM 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     N    Q W+ S H  I++   CL          G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498

Query: 268 GDVILYPCHGSKGNQ 282
             V L  C+  +G Q
Sbjct: 499 SPVTLQMCNPREGKQ 513


>gi|157133631|ref|XP_001662949.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108870752|gb|EAT34977.1| AAEL012823-PA [Aedes aegypti]
          Length = 600

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 90/311 (28%), Positives = 142/311 (45%), Gaps = 48/311 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE   +WL+PLL+ + ++S+ V+ P+I     D  E +           F IGGF W+  
Sbjct: 243 CECMHQWLEPLLERIKQSSTSVLVPII-----DVIEAKNFYYSTNGVTDFQIGGFTWDGH 297

Query: 64  FNWHAIPERERKRHKN-------AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
           F+WH + +RE++R K        A  P ++PTMAGGLF+I + +F ++G+YD   D WGG
Sbjct: 298 FDWHDVTQREKERQKRECPEKDMAICPTYSPTMAGGLFAISRDYFWEIGSYDEQMDGWGG 357

Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPT--MAGGLFSIDKAFFEKLGTYD 169
           ENLE+SF+          IP            P   P      G+ ++  A        D
Sbjct: 358 ENLEMSFRVWQCGGTLETIPCSRIGHIFRDFHPYSFPNDRDTHGINTVRMATV----WMD 413

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
              D+      +L    + GDVT R+ LR  L CKSF WY++                  
Sbjct: 414 DYIDLLYLNRPDLRDHPEVGDVTHRRVLREKLRCKSFDWYMKNVYPEKFIPTRNVRAYGR 473

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQ--GGNQFWMMSKHGEIRRDEACLDYAGGD 269
           VS+    +C+D+  +  D    +G+Y C +     +Q   ++K G +R + +C      +
Sbjct: 474 VSSLAENLCLDTLQQNADKPWNLGIYTCFRTEVSASQLMSLTKRGVLRTERSCATVQDNN 533

Query: 270 -----VILYPC 275
                V++ PC
Sbjct: 534 AETRFVVMIPC 544


>gi|124487253|ref|NP_001074890.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Mus musculus]
 gi|341940755|sp|Q9JJ61.2|GLTL1_MOUSE RecName: Full=Putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1;
           AltName: Full=Polypeptide GalNAc transferase-like
           protein 1; Short=GalNAc-T-like protein 1;
           Short=pp-GaNTase-like protein 1; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase-like
           protein 1; AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
 gi|52851357|dbj|BAD52071.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase [Mus musculus]
 gi|74218446|dbj|BAE23810.1| unnamed protein product [Mus musculus]
 gi|115527273|gb|AAI10635.1| Galntl1 protein [Mus musculus]
 gi|115528977|gb|AAI25016.1| Galntl1 protein [Mus musculus]
          Length = 558

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 140/315 (44%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNVEWLQPMLQRVMEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMDCKSFRWYLENVYPELTVPVKEVLPGVIKQGV 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     +    Q W+ S H  I++   CL          G
Sbjct: 440 NCLESQGQNTAGDLLLGMGICRGSAKSPPPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 499 SPVILQTCNPKEGKQ 513


>gi|148230993|ref|NP_001087490.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
           [Xenopus laevis]
 gi|51261644|gb|AAH80006.1| MGC81846 protein [Xenopus laevis]
          Length = 603

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 82/217 (37%), Positives = 106/217 (48%), Gaps = 24/217 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  N   VV P+I  I  DT         + SS     GGF+W L 
Sbjct: 243 CEVNEMWLQPLLAPIRENPKTVVCPVIDIISSDTL--------IYSSSPVVRGGFNWGLH 294

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    +    P  +PTMAGGLF +D+ +F  LG YDSG DIWGGENLE+SF
Sbjct: 295 FKWDPVPLSELGGPEGYTAPFRSPTMAGGLFVMDREYFNTLGHYDSGMDIWGGENLEISF 354

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP----TMAGGLFSIDKAFFEKLGTYDSGFDI 174
           +      +   +P            P  +P    TMA     +   +       D   D 
Sbjct: 355 RIWMCGGSLLIVPCSRVGHIFRKRRPYGSPGGHDTMAYNSLRLAHVWM------DEYKDQ 408

Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
           +     EL  K D+GD++ R  LR+ L CKSFKWYL+
Sbjct: 409 YFALRPELRNK-DYGDISERLALRKRLKCKSFKWYLD 444


>gi|307183924|gb|EFN70514.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Camponotus
           floridanus]
          Length = 471

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 108/320 (33%), Positives = 148/320 (46%), Gaps = 46/320 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + +N++ VVSP+I  I DDTF         T S++   G F+W+L 
Sbjct: 155 CECTVGWLEPLLEAIGKNATRVVSPVIDIINDDTFSY-------TRSFELHWGAFNWDLH 207

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +  R  K R  N  EP  TP MAGGLFS++K +F KLG+YD    IWGGENLELS
Sbjct: 208 FRWLTLNGRLLKERRDNIIEPFRTPAMAGGLFSMNKDYFFKLGSYDDEMRIWGGENLELS 267

Query: 123 FKFNWHAIPERERK--RHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTY--DSGFDIWGG 177
           F+  W      E     H        +P T  GG+  I      ++     D   D +  
Sbjct: 268 FR-TWQCGGSVEIAPCSHVGHLFRKSSPYTFPGGVGDILYGNLARVALVWMDQWADFYFK 326

Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGMCIDSACKPT 228
            N E +       + SR  LR  L CKSF+WYLE           + + G  + +A K  
Sbjct: 327 FNPEAAKLRYKQQIRSRLALREKLQCKSFEWYLENVWPEHFFPTDDRFFGKIVHAATKRC 386

Query: 229 DMHKPVGLYPCHKQGGN-------------QFWMMSKHGEIRRDEA-CLDYAGGD----- 269
            M +P       +  GN             Q ++M+K+G I  DE+ CLD    D     
Sbjct: 387 LM-RPTAKSLYAQPSGNAILHSCIPRPILGQMFVMTKNGVIMTDESVCLDAPERDMQQRT 445

Query: 270 --VILYPCHGSKGNQYFEYD 287
             V +  C G +  Q ++YD
Sbjct: 446 PKVKIMACSG-RERQRWQYD 464


>gi|351695439|gb|EHA98357.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Heterocephalus
           glaber]
          Length = 608

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 105/344 (30%), Positives = 147/344 (42%), Gaps = 96/344 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL V+  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVMWLQPLLAVVHGDPHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E     +A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSELGGADSATAPIKSPTMAGGLFAMNRQYFNELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGC+SFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCQSFKWYLDNIYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C  + 
Sbjct: 457 ILGPNAKAQQPVFVNRGPKRPRVLQRGRLYHFQTNKCLVAQGRPSQKGGLVVLKACDYED 516

Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQ 282
             Q W+ + +H  +  +  CLD     +     L  CHGS G+Q
Sbjct: 517 PAQVWIYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQ 560


>gi|297298138|ref|XP_001104403.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Macaca
           mulatta]
          Length = 558

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL P+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTIPVKEALPGIIKQGP 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     N    Q W+ S H  I++   CL          G
Sbjct: 440 NCLESQGQSTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513


>gi|345304811|ref|XP_001505904.2| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1-like
           [Ornithorhynchus anatinus]
          Length = 555

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 99/314 (31%), Positives = 136/314 (43%), Gaps = 52/314 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNSEWLQPLLQRVKEDYTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  + + TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  K        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYDFP--EGNALTYIKNTKRAAEVWMDDYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGM-------------- 219
            +  E    +    FG V  R E R+ + CKSF+WYLE  N +  +              
Sbjct: 380 QYYYEARPSAIGKAFGSVAERVEQRQKMNCKSFQWYLE--NVYPELKVPEKEPAPGIIRQ 437

Query: 220 ---CIDSACKPTDMHKPVGLYPCHKQ----GGNQFWMMSKHGEIRRDEACLDY----AGG 268
              C++S  +      P G+  C        G Q W+ S    IR+ + CL      AG 
Sbjct: 438 GASCLESRGRDAAGDSPAGVGGCRGTAGGPAGTQEWVFS-DPLIRQQDQCLSITSFSAGS 496

Query: 269 DVILYPCHGSKGNQ 282
            V L  C+   G Q
Sbjct: 497 QVTLERCNQKDGRQ 510


>gi|426233584|ref|XP_004010796.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1 [Ovis
           aries]
          Length = 557

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 96/314 (30%), Positives = 141/314 (44%), Gaps = 50/314 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYL+         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLDNVYPELTVPVKEVLPGIIKQGT 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
            C++S  + T  +  +G+  C     N    Q W+ + H  I++   CL         G 
Sbjct: 440 NCLESQGQDTAGNFQLGMGICRGSAKNPPAAQAWLFTDH-LIQQQGKCLAATSTSVSPGS 498

Query: 269 DVILYPCHGSKGNQ 282
            V+L  C+  +G Q
Sbjct: 499 LVVLQACNPREGRQ 512


>gi|354478256|ref|XP_003501331.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
           [Cricetulus griseus]
 gi|344235668|gb|EGV91771.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Cricetulus
           griseus]
          Length = 608

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 87/233 (37%), Positives = 112/233 (48%), Gaps = 56/233 (24%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL ++  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E      A  P+ +PTMAGGLF++++ +F  LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPVSELGGADGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE 211
             +L L             S + D     FG+++ R ELR+ LGC+SFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSFGNISERVELRKKLGCQSFKWYLD 449


>gi|324503401|gb|ADY41481.1| N-acetylgalactosaminyltransferase 6 [Ascaris suum]
          Length = 927

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 78/215 (36%), Positives = 111/215 (51%), Gaps = 15/215 (6%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  K WL+PLL  +  N   VV P+I  I D TF  +          + F GGF+WNLQ
Sbjct: 255 CECTKGWLEPLLARIKENRKAVVCPVIDVINDRTFAYQ-------KGIELFRGGFNWNLQ 307

Query: 64  FNWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+A+P +  + R  +   P+ +PTMAGGLFSIDK +FE+LG YD G +IWGGEN+E+S
Sbjct: 308 FRWYAVPPDIVKGRANDPTMPIQSPTMAGGLFSIDKRYFEELGAYDPGMEIWGGENIEIS 367

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKAFFEKLGTYDSGFDIWG 176
           F+          +P          A P   P  + G + + +     ++   +  +  + 
Sbjct: 368 FRIWQCGGRIEILPCSHVGHIFRKASPHDFPGKSSGKILNSNLLRVAEVWMDEWKYLFYK 427

Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
                L  +    DV+ R ELR+ L CK F WYL+
Sbjct: 428 TAPQALQMRSSI-DVSERIELRKRLQCKDFNWYLQ 461


>gi|242020636|ref|XP_002430758.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
 gi|212515955|gb|EEB18020.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
          Length = 623

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 95/315 (30%), Positives = 148/315 (46%), Gaps = 61/315 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV   WLQPLL  +  +  +VV P+I  I  DTF+         SS     GGF+W L 
Sbjct: 255 IEVNVNWLQPLLSRIVDSKKNVVVPIIDIINADTFKY--------SSSPLVRGGFNWGLH 306

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P+   K +++  +P+ +PTMAGGLF+I++A+F++LG YD+G +IWGGENLE+SF
Sbjct: 307 FKWENLPKSTLKSNEDFVKPILSPTMAGGLFAINRAYFKELGEYDNGMNIWGGENLEISF 366

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      N   IP        RKR +    P    TM      +               +
Sbjct: 367 RIWMCGGNLELIPCSRVGHVFRKR-RPYGSPNGEDTMMRNSLRVA--------------N 411

Query: 174 IWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACK 226
           +W  +  E  +K         FGD++ R +LR+ L C SF+WYL+  N +  + +     
Sbjct: 412 VWMDDYKEFFYKQHPEGKTFPFGDISDRLKLRKKLHCHSFEWYLQ--NIYPELIL----- 464

Query: 227 PTDMHKPVGL-YPCHKQGGNQFWMMSKHG-----EIR--------RDEACLDYAGGDVIL 272
           P+D  +   + +   +Q   Q W + K       ++R          E  + + G  +IL
Sbjct: 465 PSDNEQKSKIKWNALEQQKFQPWHLRKRNYTAQFQLRLFNTSLCVTSERDVKHKGSPLIL 524

Query: 273 YPCHGSKGNQYFEYD 287
            PC   K   +++ D
Sbjct: 525 SPCLRRKTQVWYQTD 539


>gi|148670721|gb|EDL02668.1| mCG7620, isoform CRA_b [Mus musculus]
          Length = 667

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 140/315 (44%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 323 CEVNVEWLQPMLQRVMEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 375

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 376 FKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 435

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 436 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 488

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 489 QYYYEARPSAIGKAFGSVATRIEQRKKMDCKSFRWYLENVYPELTVPVKEVLPGVIKQGV 548

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     +    Q W+ S H  I++   CL          G
Sbjct: 549 NCLESQGQNTAGDLLLGMGICRGSAKSPPPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 607

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 608 SPVILQTCNPKEGKQ 622


>gi|8918932|dbj|BAA97985.1| unnamed protein product [Mus musculus]
          Length = 558

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 140/315 (44%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNVEWLQPMLQRVMEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDLTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMDCKSFRWYLENVYPELTVPVKEVLPGVIKQGV 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     +    Q W+ S H  I++   CL          G
Sbjct: 440 NCLESQGQNTAGDLLLGMGICRGSAKSPPPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 499 SPVILQTCNPKEGKQ 513


>gi|391342179|ref|XP_003745400.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Metaseiulus occidentalis]
          Length = 610

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 96/320 (30%), Positives = 144/320 (45%), Gaps = 44/320 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  + + VV P+I  I DDTF           S++   G  +W + 
Sbjct: 253 CECTTGWLEPLLQRIKEDRTRVVCPIIDIIHDDTFAY-------VKSFELHWGAINWEMH 305

Query: 64  FNWHAI-PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ + P   ++RH + +EP  TP MAGGLFSIDK +F ++G YD   DIWGGEN+E+S
Sbjct: 306 FRWYPVGPHVLKQRHGDPSEPFKTPVMAGGLFSIDKEYFYEMGAYDEQMDIWGGENVEMS 365

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVW--TPTMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           F+      +   +P          + P     P   GG+   + A   ++   D   + +
Sbjct: 366 FRIWQCGGSLEIVPCSHVGHVFRRSSPYTFPHPKGVGGILFSNLARVAEVWM-DDWAEFY 424

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSG 218
              N E        DV  RK LR  L CK F WYL                 ++ N  + 
Sbjct: 425 FNMNTEAKKLRSTMDVAKRKALRDRLHCKPFSWYLTNVWPENFFPSENRFFGKIRNRAAE 484

Query: 219 MCIDSACKPTDMHKPVG---LYPCH-KQGGNQFWMMSKHGEIRRDEA-CLD----YAGGD 269
            C       +  H+P+G   L  C       Q+++M+  G +  DE+ CLD    Y   +
Sbjct: 485 KCFGRPVSKS-YHQPIGKVKLEDCAVTHYARQYFVMTGEGYLMTDESVCLDSPEGYEDTN 543

Query: 270 VILYPCHGSKGNQYFEYDYK 289
           V++  C G +  Q + +D K
Sbjct: 544 VVMIACQGIQ-RQKWRFDVK 562


>gi|157106440|ref|XP_001649323.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108879843|gb|EAT44068.1| AAEL004538-PA [Aedes aegypti]
          Length = 596

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 95/319 (29%), Positives = 145/319 (45%), Gaps = 47/319 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + R+ + +  P+I  I   TFE R     + +    + G F+W + 
Sbjct: 243 CEVNTNWLPPLLAPIYRDRTVMTVPVIDGIDHKTFEYR----PVYADGHHYRGIFEWGML 298

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +  + +P RE+KR K+ +EP  +PT AGGLF+I++ FF ++G YD G  +WGGEN ELSF
Sbjct: 299 YKENEVPRREQKRRKHDSEPYKSPTHAGGLFAINREFFLEIGAYDPGLLVWGGENFELSF 358

Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K  W      E     R  H       + P   G L +  K     +  Y    + W  E
Sbjct: 359 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLANKKKGPLITI-NYKRVIETWFDE 413

Query: 179 NLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYL-------------------- 210
             +  F          D GD++ +  L+  L CKSF+WY+                    
Sbjct: 414 QYKEYFYTREPLARFLDMGDISEQLALKERLQCKSFQWYMDNVAYDVLDKYPALPANLFW 473

Query: 211 -EVSNDWSGMCIDSACK-PTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGG 268
            E+ N  +  C+D+  + P  M   +GL  CH QG NQ   ++  G++   E C++    
Sbjct: 474 GELKNSGTEKCVDALGRQPPAM---IGLQHCHGQGHNQLIRLNAAGQLGVGERCIEADNM 530

Query: 269 DVILYPCHGSKGNQYFEYD 287
            + L  C     +  ++YD
Sbjct: 531 GIKLAFCRMGTVDGPWQYD 549


>gi|380016857|ref|XP_003692388.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like,
           partial [Apis florea]
          Length = 556

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 90/295 (30%), Positives = 134/295 (45%), Gaps = 20/295 (6%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV K+W++PLL  +  + +    P+I  I  DTF+    P           GGF+W L 
Sbjct: 184 IEVNKQWIEPLLSRIVYSKTITAMPVIDIINPDTFQYTGSP--------LVRGGFNWGLH 235

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P       ++  +P+ +PTMAGGLF++++ +F KLG YD+G DIWGGENLE+SF
Sbjct: 236 FKWDNVPIGTFVHDEDFVKPIKSPTMAGGLFAMNREYFTKLGEYDAGMDIWGGENLEISF 295

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
           +  W      E                 G     D      L       D +    L+  
Sbjct: 296 RI-WMCGGSIELIPCSRVGHVFRKRRPYGAYDQHDTMLKNSLRVAHVWLDEYKDYFLQNI 354

Query: 184 FKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDM-----HKPVGLYP 238
            K D+GD+T R  LR+ L CK+F WYL+V      +  D+  +  D       KP+   P
Sbjct: 355 KKIDYGDITERINLRKRLACKNFAWYLKVVYPELTLPDDNKNRLKDKWAKIEQKPIQ--P 412

Query: 239 CHKQGGN---QFWM-MSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
            H +  N   Q+ + +S      + E  +   G  +IL PC   K   ++E D +
Sbjct: 413 WHSKKRNYTDQYQIRLSNSTLCIQSEKDIKTKGSKLILAPCLRIKSQMWYETDKR 467


>gi|291235412|ref|XP_002737638.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Saccoglossus kowalevskii]
          Length = 497

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 92/295 (31%), Positives = 138/295 (46%), Gaps = 55/295 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PL+  +  +    V P+I  I D +F           + +   GGF W LQ
Sbjct: 174 CECTEGWLEPLVSRIGDDRKTRVQPIIDIIDDRSFAY-------IGASESNSGGFTWQLQ 226

Query: 64  FNWHAIPERERKRHKNAAEPVW-------TPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
             W  IPE E+ R  +  + +        TPTMAGGLFSI+K +FEK+G YD+G D+WGG
Sbjct: 227 HQWVRIPEYEQNRRVSEYDNIRQVTLFHRTPTMAGGLFSINKTYFEKMGAYDTGMDVWGG 286

Query: 117 ENLELSFKF-----NWHAIPERE----RKRHKNAAEPVWT-PTMAGGLFSIDKAFFEKLG 166
           EN+E+SF+          IP        +R+   + P  + PT+      + + + +   
Sbjct: 287 ENIEMSFRIWMCGGKIEIIPCSRIGHVYRRYIPYSFPNGSDPTIYRNAMRVAEVWMDHYK 346

Query: 167 TYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL---------------- 210
            +      +     +L    D+GDV+ R ELRR LGC +F WYL                
Sbjct: 347 KF------FYATQTKLHMV-DYGDVSDRLELRRKLGCHNFTWYLKNIIPEMILPVDDANY 399

Query: 211 --EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL 263
             E+ ND +G+C+DSA       KP+ +  C       F + S H ++R  + CL
Sbjct: 400 FGEIRNDATGLCLDSASG-----KPLRVDICAATSDQIFTLTSDH-QLRIGKECL 448


>gi|242020557|ref|XP_002430719.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
 gi|212515909|gb|EEB17981.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
          Length = 511

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 99/319 (31%), Positives = 145/319 (45%), Gaps = 48/319 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  K WL+PLL  ++ +   VV P+I  I DDTF           S++   G F+WNL 
Sbjct: 163 CECTKGWLEPLLVRVSEDRKKVVCPVIDIINDDTFAY-------VRSFELHWGAFNWNLH 215

Query: 64  FNWHAIPERERKRHKN-AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +   E K+ KN   EP  TP MAGGLF+I + +F ++G YD    IWGGENLE+S
Sbjct: 216 FRWYTLGTTEIKKRKNDVTEPFPTPAMAGGLFAIRRDYFYEIGAYDEQMKIWGGENLEMS 275

Query: 123 FKFNWHA------IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY--DSGFDI 174
           F+  W        +P          + P    T  GG+  I  A   ++     D   + 
Sbjct: 276 FR-GWQCGGSVEIVPCSHVGHLFRKSSPY---TFPGGVGEILHANLARVALVWMDEWQEF 331

Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSNDWS 217
           +   N E + + D   V +R +LR  L CKSF+WYL+                 + +   
Sbjct: 332 FFKFNPEAARQRDKQSVRARIQLRSRLKCKSFEWYLDNVWPQHFFPKNDRFFGLIKSASD 391

Query: 218 GMCIDSACKPTDMHKPVG---LYPCHKQGGNQFWMMSKHGEIRRDEA-CLDY-----AGG 268
             C+     P   ++P G   L PC K+    F++ +K  ++  DE+ CLD         
Sbjct: 392 NKCLTRPHGPPSTNQPTGVVTLTPC-KETLEHFFVYTKFSDVMTDESVCLDLLDKNEMKA 450

Query: 269 DVILYPCHGSKGNQYFEYD 287
            V +  C GS   ++  YD
Sbjct: 451 KVKVMACSGSPRQKWM-YD 468


>gi|50510795|dbj|BAD32383.1| mKIAA1130 protein [Mus musculus]
          Length = 655

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 140/315 (44%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 311 CEVNVEWLQPMLQRVMEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 363

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 364 FKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 423

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 424 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 476

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 477 QYYYEARPSAIGKAFGSVATRIEQRKKMDCKSFRWYLENVYPELTVPVKEVLPGVIKQGV 536

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     +    Q W+ S H  I++   CL          G
Sbjct: 537 NCLESQGQNTAGDLLLGMGICRGSAKSPPPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 595

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 596 SPVILQTCNPKEGKQ 610


>gi|348573294|ref|XP_003472426.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1 [Cavia
           porcellus]
          Length = 556

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 97/313 (30%), Positives = 138/313 (44%), Gaps = 49/313 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNVEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDKA+F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKAWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMDCKSFRWYLENVYPELTVPVKEVLPGIIKQGL 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA-----GGD 269
            C+++  + T     +G+  C     N    Q W+ + H  I++   CL        G  
Sbjct: 440 NCLETQGQDTAGDFLLGMGICRGSAKNPPPAQAWLFTDH-LIQQQGRCLAATSVSPPGSP 498

Query: 270 VILYPCHGSKGNQ 282
           VIL  C+  +  Q
Sbjct: 499 VILQVCNSKESKQ 511


>gi|345803601|ref|XP_537492.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Canis lupus
           familiaris]
          Length = 557

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 97/314 (30%), Positives = 141/314 (44%), Gaps = 50/314 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYL+         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLDNVYPELTVPVKEVLPGIIKQGV 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
            C++S  + +  +  +G+  C     N    Q W+ S H  I++   CL         G 
Sbjct: 440 NCLESQGQDSAGNFLLGMGICRGSAKNPPAPQAWLFSDH-LIQQQGKCLTATSTSITPGS 498

Query: 269 DVILYPCHGSKGNQ 282
            VIL  C+  +G Q
Sbjct: 499 LVILQVCNPREGRQ 512


>gi|297695402|ref|XP_002824932.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Pongo abelii]
          Length = 558

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL P+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIKQGV 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     N    Q W+ S H  I++   CL          G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513


>gi|68534728|gb|AAH98578.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
 gi|158260513|dbj|BAF82434.1| unnamed protein product [Homo sapiens]
          Length = 558

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL P+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIKQGV 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     N    Q W+ S H  I++   CL          G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513


>gi|341896063|gb|EGT51998.1| CBN-GLY-6 protein [Caenorhabditis brenneri]
          Length = 617

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 100/299 (33%), Positives = 140/299 (46%), Gaps = 55/299 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  K WL+PLL  +  N   V  P+I  I D+TF+ +          + F GGF+WNLQ
Sbjct: 254 CECTKGWLEPLLTRIKLNRKAVPCPVIDIINDNTFQYQ-------KGIEMFRGGFNWNLQ 306

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P    K H  +   P+ +PTMAGGLFSID+ +FE+LG YD G DIWGGENLE+S
Sbjct: 307 FRWYGMPSSMAKEHLLDPTGPIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMS 366

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGG------LFSIDKAFFEKLGTYDSG 171
           F+          +P          + P   P  + G      L  + + + ++   Y   
Sbjct: 367 FRIWQCGGRVEILPCSHVGHVFRKSSPHDFPGKSSGKILNANLLRVAEVWMDEWKYY--- 423

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSN 214
              +    +    +    DV+ R ELR+ L CKSFKWYL+                 +SN
Sbjct: 424 --FYKLAPVAYRMRQSI-DVSERVELRKKLNCKSFKWYLQNVFKDHFLPTPLDKFGRISN 480

Query: 215 DWSGMCIDSACKPTDM----HKPVGLYPCHKQGGN--QFWMMSKHGEIRRDE-ACLDYA 266
             S  C  +A +P D     H+ +G  PC   G +  Q W+ +    IR DE  CL   
Sbjct: 481 --SNYC--AAFRPGDTGPKNHRLLGA-PC-TMGFDLWQLWLYTGDSRIRTDEHLCLSVV 533


>gi|291397404|ref|XP_002715111.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
           [Oryctolagus cuniculus]
          Length = 608

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 104/348 (29%), Positives = 148/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVLWLQPLLAAIREDRHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E+   + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPLSEQGGAEGATAPIKSPTMAGGLFAMNRLYFNELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ LGC+SFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCQSFKWYLDNIYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 457 VPGPNAKAQQPVFFNRGPKRPKVLRRGRLYHFQTNKCLVAQGRPSQKGGLVVLKACDYGD 516

Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
            +Q W  + +H  +  +  CLD     +     L  CHGS G+Q + +
Sbjct: 517 PDQVWFYNEEHELVLHNLLCLDVSETRSSDPPRLMKCHGSGGSQQWAF 564


>gi|332228990|ref|XP_003263671.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
           [Nomascus leucogenys]
          Length = 558

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL P+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIKQGV 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     N    Q W+ S H  I++   CL          G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513


>gi|404434384|ref|NP_001258248.1| polypeptide N-acetylgalactosaminyltransferase 11 [Rattus
           norvegicus]
 gi|404501473|ref|NP_955425.2| polypeptide N-acetylgalactosaminyltransferase 11 [Rattus
           norvegicus]
 gi|149031397|gb|EDL86387.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
           isoform CRA_b [Rattus norvegicus]
          Length = 609

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 86/233 (36%), Positives = 113/233 (48%), Gaps = 56/233 (24%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL ++  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 249 CEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 300

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  +     +A  P+ +PTMAGGLF++++ +F  LG YDSG DIWGGENLE+SF
Sbjct: 301 FKWDLVPVSDLGGADSATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISF 360

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 361 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 397

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE 211
             +L L             S + D     FG+++ R ELR+ LGC+SFKWYL+
Sbjct: 398 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSFGNISERVELRKKLGCQSFKWYLD 450


>gi|270265820|ref|NP_065743.2| putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Homo sapiens]
 gi|270265827|ref|NP_001161840.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Homo sapiens]
 gi|332842578|ref|XP_522885.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Pan
           troglodytes]
 gi|51316024|sp|Q8N428.2|GLTL1_HUMAN RecName: Full=Putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1;
           AltName: Full=Polypeptide GalNAc transferase-like
           protein 1; Short=GalNAc-T-like protein 1;
           Short=pp-GaNTase-like protein 1; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase-like
           protein 1; AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
 gi|51490858|emb|CAD44534.1| polypeptide N-acetylgalactosaminyltransferase 16 [Homo sapiens]
 gi|112180422|gb|AAH36812.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
 gi|112818460|gb|AAI22546.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
 gi|119601392|gb|EAW80986.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1, isoform CRA_a
           [Homo sapiens]
 gi|119601394|gb|EAW80988.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1, isoform CRA_a
           [Homo sapiens]
 gi|164691113|dbj|BAF98739.1| unnamed protein product [Homo sapiens]
 gi|410265456|gb|JAA20694.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Pan
           troglodytes]
          Length = 558

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL P+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIKQGV 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     N    Q W+ S H  I++   CL          G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513


>gi|291167742|ref|NP_001094333.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Rattus norvegicus]
          Length = 558

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 140/315 (44%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNVEWLQPMLQRVMEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMDCKSFRWYLENVYPELTVPVKEVLPGVIKQGV 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     +    Q W+ S H  I++   CL          G
Sbjct: 440 NCLESQGQNTAGDLLLGMGICRGSAKSPPPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498

Query: 268 GDVILYPCHGSKGNQ 282
             V+L  C+  +G Q
Sbjct: 499 SPVVLQSCNPKEGKQ 513


>gi|189217666|ref|NP_001121278.1| uncharacterized protein LOC100158361 [Xenopus laevis]
 gi|115528277|gb|AAI24896.1| LOC100158361 protein [Xenopus laevis]
          Length = 600

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 100/328 (30%), Positives = 135/328 (41%), Gaps = 64/328 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  N   VV P+I  I  DT         + S      GGF+W L 
Sbjct: 240 CEVNEMWLQPLLAPIRENPKTVVCPVIDIISADTL--------IYSQSPVVRGGFNWGLH 291

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    +    P  +PTMAGGLF++D+ +F  LG YDSG DIWGGENLE+SF
Sbjct: 292 FKWDPVPLSELGGPEGFTAPFRSPTMAGGLFAMDREYFNTLGQYDSGMDIWGGENLEISF 351

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP----TMAGGLFSIDKAFFEKLGTYDSGFDI 174
           +      +   +P            P  +P    TMA     +   +       D   D 
Sbjct: 352 RIWMCGGSLLIVPCSRVGHIFRKRRPYGSPGGHDTMAHNSLRLAHVWM------DEYKDQ 405

Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE----------------------- 211
           +     EL  + DFGD+  R  LR+ L CKSFKWYL+                       
Sbjct: 406 YFALRPELRNR-DFGDIRDRLTLRKRLNCKSFKWYLDNIYPEMQVSGPNAKPQPPVFINK 464

Query: 212 ------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMS-KHGEIRR 258
                       + N  +  C+ +   P+     V +  C      Q W  + +H  I  
Sbjct: 465 GQKRPKILQRGRLINMQTNKCLVAQGHPSQKGGLVVVKDCDFNDSEQVWSYNEEHELILS 524

Query: 259 DEACLDY----AGGDVILYPCHGSKGNQ 282
           +  CLD     +     L  CHGS G+Q
Sbjct: 525 NLLCLDMSETRSSDPPRLMKCHGSGGSQ 552


>gi|158300689|ref|XP_320549.4| AGAP011984-PA [Anopheles gambiae str. PEST]
 gi|157013282|gb|EAA00339.4| AGAP011984-PA [Anopheles gambiae str. PEST]
          Length = 585

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 105/323 (32%), Positives = 146/323 (45%), Gaps = 55/323 (17%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           EV   WL PLL+ +A +    V P I  I  DTF+ R       S  +   G FDW  +F
Sbjct: 221 EVNTNWLPPLLEPIAEDYRTCVCPFIDVIAHDTFQYR-------SQDEGKRGAFDW--KF 271

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
            +  +P        +  +P  +P MAGGLF+I   FF +LG YD G DIWGGE  ELSFK
Sbjct: 272 YYKRLPLLPGDL-DDPTKPFNSPVMAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFK 330

Query: 125 FNWHA------IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
             W         P            P   P    G+  + + F      +   +  +  E
Sbjct: 331 I-WQCGGRLVDAPCSRVGHVYRGYAPFGNPR---GVNFVVRNFKRVAEVWMDEYSQFLYE 386

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV-----------------------SND 215
                 K D GD+++++ELR  L CK FKW+LEV                       S  
Sbjct: 387 RNPQFAKTDPGDLSAQRELRERLQCKPFKWFLEVVAPDLLVRYPPRDPQPFASGRVQSVA 446

Query: 216 WSGMCIDSACKPTDMHKPVGLYPC-----HKQGGNQFWMMSKHGEI--RRDEACLDYA-- 266
              +C+DS        +P+GLY C     H Q  NQF+ +S H +I  R ++ CLD A  
Sbjct: 447 NPRLCLDSLNH--QAKEPIGLYACAFNKTHPQ-NNQFFTLSYHRDIRVRSNDKCLDAAKL 503

Query: 267 GGDVILYPCHGSKGNQYFEYDYK 289
             +++L+ CH S+GNQ + YDY+
Sbjct: 504 NDEIVLFSCHESQGNQMWRYDYE 526


>gi|51315700|sp|Q6P6V1.1|GLT11_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
           AltName: Full=Polypeptide GalNAc transferase 11;
           Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 11;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 11
 gi|38303875|gb|AAH62004.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
           [Rattus norvegicus]
          Length = 608

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 86/233 (36%), Positives = 113/233 (48%), Gaps = 56/233 (24%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL ++  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  +     +A  P+ +PTMAGGLF++++ +F  LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPVSDLGGADSATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE 211
             +L L             S + D     FG+++ R ELR+ LGC+SFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKTKSFGNISERVELRKKLGCQSFKWYLD 449


>gi|443298648|gb|AGC81884.1| N-acetylgalactosaminyltransferase, partial [Bombyx mori]
          Length = 499

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 92/306 (30%), Positives = 142/306 (46%), Gaps = 45/306 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + R+   +  P+I  I  +TFE R P  +  ++Y+   G F+W + 
Sbjct: 146 CEVNVNWLPPLLAPIYRDYRTMTVPVIDGIDYNTFEYR-PVYQHGTNYR---GIFEWGML 201

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +  + +P+RE   HK+ +EP  +PT AGGLF+I++ +F ++G YD G  +WGGEN ELSF
Sbjct: 202 YKENEVPDREAHLHKHKSEPYKSPTHAGGLFAINRRYFLEIGAYDPGLLVWGGENFELSF 261

Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K  W      E     R  H   A   + P   G L    K     +  Y    + W  E
Sbjct: 262 KI-WQCGGSIEWVPCSRVGHVYRA---FMPYTFGNLAKNRKGSLITI-NYKRVIETWFDE 316

Query: 179 NLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYLE------------------- 211
             +  F          D GD++ +  L+  L CKSF W++E                   
Sbjct: 317 EHKEYFYTREPMARFLDMGDISEQVALKERLKCKSFGWFMENVAYDVYDKFPKLPKNVHW 376

Query: 212 --VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGD 269
             V N  +G C+D+  K    +  +G   CH  G +Q + +++ G++   E C++  G +
Sbjct: 377 GMVKNKATGACLDTMGKAAPAY--IGTSSCHGMGNSQLFRLNEAGQLGVGERCVETDGDN 434

Query: 270 VILYPC 275
           V    C
Sbjct: 435 VKQAIC 440


>gi|327263882|ref|XP_003216746.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
           [Anolis carolinensis]
          Length = 536

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 75/216 (34%), Positives = 113/216 (52%), Gaps = 11/216 (5%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A   + VVSP I  I  +TFE   P   +    +   G FDW+L 
Sbjct: 271 CECFHGWLEPLLSRIAEEPTAVVSPDITTIDLNTFEFSKP---IQYGKQHSRGNFDWSLT 327

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W AIP+ E++R K+   P+ TPT AGGLF+I KA+FE +G+YD   +IWGGEN+E+SF
Sbjct: 328 FGWEAIPQHEKERRKDETYPIKTPTFAGGLFAISKAYFEHVGSYDDQMEIWGGENVEMSF 387

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEK--LGTYDSGFDIWG 176
           +          IP         +  P   P     + S ++    +  +  Y   F    
Sbjct: 388 RVWQCGGQLEIIPCSVVGHVFRSKSPHTFPK-GTQVISRNQVRLAEVWMDDYKEIFYRRN 446

Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV 212
            +  +++ +  +GD++ R +L+  L CK+F WYL+ 
Sbjct: 447 QQASQMAREKTYGDLSDRLDLKERLHCKNFTWYLQT 482


>gi|296215364|ref|XP_002754093.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
           [Callithrix jacchus]
          Length = 558

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 136/315 (43%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------VSNDWSG 218
            +  E    +    FG V SR E R+ + CKSF+WYLE               V      
Sbjct: 380 QYYYEARPSAIGKAFGSVASRIEQRKKMNCKSFRWYLENVYPELTVPVKEVLPVIIKQGM 439

Query: 219 MCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     N    Q W+ S H  I++   CL          G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498

Query: 268 GDVILYPCHGSKGNQ 282
             V L  C+  +G Q
Sbjct: 499 SPVTLQMCNPREGKQ 513


>gi|449474909|ref|XP_002194974.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Taeniopygia guttata]
          Length = 555

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 101/331 (30%), Positives = 141/331 (42%), Gaps = 69/331 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F      G  T +     G FDW + 
Sbjct: 189 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDHF------GYETQAGDAMRGAFDWEMY 242

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD+G +IWGGE  E+SF
Sbjct: 243 YKRIPIPPELQK--PDPSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISF 300

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K          IP            P   PT      ++ +             ++W  E
Sbjct: 301 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPTGVSLARNLKRV-----------AEVWMDE 349

Query: 179 NLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL--------------------- 210
             E  ++          GDV ++KELR NL CKSFKW++                     
Sbjct: 350 YAEFIYQRRPEYRHLSAGDVAAQKELRNNLNCKSFKWFMNEVAWDLPKFYPPVEPPAAAW 409

Query: 211 -EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA-- 261
            E+ N  +G+C+D+  K   +  P+ L  C K  G   W        S   +IR  +   
Sbjct: 410 GEIRNVGTGLCVDT--KHGALGSPLRLENCVKDRGEAAWNNVQVFTFSWREDIRPGDPQH 467

Query: 262 ----CLDYA--GGDVILYPCHGSKGNQYFEY 286
               C D       V LY CHG KGNQ + Y
Sbjct: 468 TKKFCFDAISHSSPVTLYDCHGMKGNQLWRY 498


>gi|62122367|dbj|BAD93178.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 16 [Homo sapiens]
 gi|119601393|gb|EAW80987.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1, isoform CRA_b
           [Homo sapiens]
 gi|168269696|dbj|BAG09975.1| polypeptide N-acetylgalactosaminyltransferase-like protein 1
           [synthetic construct]
          Length = 542

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL P+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIKQGV 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     N    Q W+ S H  I++   CL          G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513


>gi|410214072|gb|JAA04255.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Pan
           troglodytes]
 gi|410214074|gb|JAA04256.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Pan
           troglodytes]
 gi|410295440|gb|JAA26320.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Pan
           troglodytes]
 gi|410295442|gb|JAA26321.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Pan
           troglodytes]
 gi|410336845|gb|JAA37369.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Pan
           troglodytes]
          Length = 558

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 136/315 (43%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL P+L  +  + + VVSP+I  I  D F        L        GGFDW+L 
Sbjct: 214 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIKQGV 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     N    Q W+ S H  I++   CL          G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513


>gi|149031398|gb|EDL86388.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
           isoform CRA_c [Rattus norvegicus]
          Length = 560

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 80/218 (36%), Positives = 109/218 (50%), Gaps = 26/218 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL ++  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 200 CEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 251

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  +     +A  P+ +PTMAGGLF++++ +F  LG YDSG DIWGGENLE+SF
Sbjct: 252 FKWDLVPVSDLGGADSATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISF 311

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RKR +    P    TM      +   +       D   +
Sbjct: 312 RIWMCGGKLFIIPCSRVGHIFRKR-RPYGSPEGQDTMTHNSLRLAHVWL------DEYKE 364

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
            +     +L  K  FG+++ R ELR+ LGC+SFKWYL+
Sbjct: 365 QYFSLRPDLKTKS-FGNISERVELRKKLGCQSFKWYLD 401


>gi|402876549|ref|XP_003902024.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1 [Papio
           anubis]
          Length = 558

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL P+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTIPVKEALPGIIKQGP 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     N    Q W+ S H  I++   CL          G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513


>gi|380786811|gb|AFE65281.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Macaca mulatta]
          Length = 558

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL P+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTIPVKEALPGIIKQGP 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     N    Q W+ S H  I++   CL          G
Sbjct: 440 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 498

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 499 SPVILQMCNPREGKQ 513


>gi|21450297|ref|NP_659157.1| polypeptide N-acetylgalactosaminyltransferase 11 [Mus musculus]
 gi|51316059|sp|Q921L8.1|GLT11_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
           AltName: Full=Polypeptide GalNAc transferase 11;
           Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 11;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 11
 gi|15030306|gb|AAH11428.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 [Mus musculus]
 gi|18204499|gb|AAH21504.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 [Mus musculus]
 gi|21529335|emb|CAC79626.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Mus
           musculus]
 gi|21707973|gb|AAH34185.1| Galnt11 protein [Mus musculus]
 gi|23274082|gb|AAH36143.1| Galnt11 protein [Mus musculus]
 gi|23274085|gb|AAH36145.1| Galnt11 protein [Mus musculus]
 gi|33321872|gb|AAQ06668.1| UDP-GalNAc:polypeptide N-Acetylgalactosaminyltransferase T11 [Mus
           musculus]
 gi|74149639|dbj|BAE36442.1| unnamed protein product [Mus musculus]
 gi|148671131|gb|EDL03078.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11, isoform CRA_b [Mus
           musculus]
          Length = 608

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 86/233 (36%), Positives = 112/233 (48%), Gaps = 56/233 (24%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL ++  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E      A  P+ +PTMAGGLF++++ +F  LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPVSELGGPDGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF +  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFILPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE 211
             +L L             S + D     FG+++ R ELR+ LGC+SFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKNKSFGNISERVELRKKLGCQSFKWYLD 449


>gi|148671130|gb|EDL03077.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11, isoform CRA_a [Mus
           musculus]
          Length = 529

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 86/233 (36%), Positives = 112/233 (48%), Gaps = 56/233 (24%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL ++  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 169 CEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 220

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E      A  P+ +PTMAGGLF++++ +F  LG YDSG DIWGGENLE+SF
Sbjct: 221 FKWDLVPVSELGGPDGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISF 280

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF +  +     F K   Y S  G D   
Sbjct: 281 R--------------------IW---MCGGKLFILPCSRVGHIFRKRRPYGSPEGQDTMT 317

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE 211
             +L L             S + D     FG+++ R ELR+ LGC+SFKWYL+
Sbjct: 318 HNSLRLAHVWLDEYKEQYFSLRPDLKNKSFGNISERVELRKKLGCQSFKWYLD 370


>gi|26352932|dbj|BAC40096.1| unnamed protein product [Mus musculus]
          Length = 608

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 86/233 (36%), Positives = 112/233 (48%), Gaps = 56/233 (24%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL ++  +   VV P+I  I  DT           SS     GGF+W L 
Sbjct: 248 CEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTLAY--------SSSPVVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E      A  P+ +PTMAGGLF++++ +F  LG YDSG DIWGGENLE+SF
Sbjct: 300 FKWDLVPVSELGGPDGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF +  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFILPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE 211
             +L L             S + D     FG+++ R ELR+ LGC+SFKWYL+
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLKNKSFGNISERVELRKKLGCQSFKWYLD 449


>gi|397507535|ref|XP_003824250.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1 [Pan
           paniscus]
          Length = 529

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL P+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 185 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 237

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 238 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 297

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 298 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 350

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 351 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIKQGV 410

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     N    Q W+ S H  I++   CL          G
Sbjct: 411 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 469

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 470 SPVILQMCNPREGKQ 484


>gi|355693388|gb|EHH27991.1| hypothetical protein EGK_18322, partial [Macaca mulatta]
          Length = 499

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL P+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 155 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 207

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 208 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 267

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 268 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 320

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 321 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTIPVKEALPGIIKQGP 380

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     N    Q W+ S H  I++   CL          G
Sbjct: 381 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 439

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 440 SPVILQMCNPREGKQ 454


>gi|328792011|ref|XP_624873.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
           [Apis mellifera]
          Length = 637

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 89/294 (30%), Positives = 134/294 (45%), Gaps = 20/294 (6%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           EV ++W++PLL  +  + +    P+I  I  DTF+    P           GGF+W L F
Sbjct: 266 EVNRQWIEPLLSRIVYSKTITAMPVIDIINPDTFQYTGSP--------LVRGGFNWGLHF 317

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
            W  +P       ++  +P+ +PTMAGGLF++++ +F KLG YD+G DIWGGENLE+SF+
Sbjct: 318 KWDNVPIGTFVHDEDFVKPIKSPTMAGGLFAMNREYFTKLGEYDAGMDIWGGENLEISFR 377

Query: 125 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 184
             W      E                 G     D      L       D +    L+   
Sbjct: 378 I-WMCGGSIELIPCSRVGHVFRKRRPYGAYDQHDTMLKNSLRVAHVWLDEYKDYFLQNIK 436

Query: 185 KGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDM-----HKPVGLYPC 239
           K D+GD+T R  LR+ L CK+F WYL+V      +  D+  +  D       KP+   P 
Sbjct: 437 KIDYGDITERINLRKRLACKNFAWYLKVVYPELTLPDDNKNRLKDKWAKIEQKPIQ--PW 494

Query: 240 HKQGGN---QFWM-MSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
           H +  N   Q+ + +S      + E  +   G  +IL PC   K   ++E D +
Sbjct: 495 HSKKRNYTDQYQIRLSNSTLCIQSEKDIKTKGSKLILAPCLRIKSQMWYETDKR 548


>gi|6329812|dbj|BAA86444.1| KIAA1130 protein [Homo sapiens]
          Length = 575

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL P+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 247 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 300 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 359

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 360 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 412

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 413 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIKQGV 472

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     N    Q W+ S H  I++   CL          G
Sbjct: 473 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 531

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 532 SPVILQMCNPREGKQ 546


>gi|241133788|ref|XP_002404588.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
 gi|215493637|gb|EEC03278.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
          Length = 459

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 100/313 (31%), Positives = 142/313 (45%), Gaps = 59/313 (18%)

Query: 18  LARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAIPERERKRH 77
           + R ++ VV P+I  I D+TF           S++   G F+W L F W  + ERE KR 
Sbjct: 117 ITRQATVVVCPVIDIINDETFAY-------VRSFEMHWGAFNWELHFRWFPVGEREHKRR 169

Query: 78  K-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF-----NWHAIP 131
             NA  P  TP MAGGLFSID+ +F ++G YD   DIWGGEN+E+SF+      +   +P
Sbjct: 170 SGNATAPFRTPVMAGGLFSIDRGYFYEMGAYDDQMDIWGGENMEISFRIWQCGGSVEVVP 229

Query: 132 ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKGDFG-- 189
                       P   P   G    +    F  L    +   +W  E     F  + G  
Sbjct: 230 CSHVGHLFRRTSPYTFPNPGG----VGSVLFSNLARVAA---VWMDEWAAFYFNMNRGEK 282

Query: 190 -----DVTSRKELRRNLGCKSFKWYL-----------------EVSNDWSGMCIDSACKP 227
                DVT+RK+LR  L CKSFKWYL                 +V N  SG C     +P
Sbjct: 283 RHMLQDVTARKKLREKLQCKSFKWYLKNIWPENFLPNDNIFFGKVRNKKSGKCF---VRP 339

Query: 228 T--DMHKPVGLYPCHKQG----GNQFWMMSKHGEIRRDEA-CLDY----AGGDVILYPCH 276
           +  + H+PVG     +        Q ++ ++ G I+ DE+ CLD     A  +V++  C+
Sbjct: 340 SSKNYHQPVGRVVLEECALTYYAMQHFVFTEEGFIKTDESICLDSPESKADTNVVMIACN 399

Query: 277 GSKGNQYFEYDYK 289
             +  Q + YD K
Sbjct: 400 DLQ-RQKWRYDPK 411


>gi|444509912|gb|ELV09433.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Tupaia chinensis]
          Length = 566

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 222 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 274

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 275 FKWEQIPLDQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 334

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 335 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 387

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 388 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEVLPGIMKQGV 447

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  +       +G+  C     N    Q W+ S H  I++   CL          G
Sbjct: 448 NCLESQGQSPAGDFLLGMGICRGSAKNPPSAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 506

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 507 SPVILQVCNPREGKQ 521


>gi|432107114|gb|ELK32537.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Myotis davidii]
          Length = 518

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 99/316 (31%), Positives = 139/316 (43%), Gaps = 52/316 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 171 CEVNTEWLQPLLQRVQEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 223

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 224 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 283

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 284 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 336

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V SR E R+ + CKSF+WYLE         V      +     
Sbjct: 337 QYYYEARPSAIGKAFGSVASRIEQRKKMNCKSFRWYLENVYPELTVPVKEVLPSIIKQGV 396

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA-------- 266
            C++S  + T  +  +G+  C     N    Q W+ S H  I++   CL           
Sbjct: 397 NCLESQGQDTAGNFLLGVGTCRGSAKNPPAPQAWLFSDH-LIQQQGKCLTATSTSASISP 455

Query: 267 GGDVILYPCHGSKGNQ 282
           G  V L  C+  +G Q
Sbjct: 456 GSPVGLQTCNPREGKQ 471


>gi|311275138|ref|XP_003134591.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Sus
           scrofa]
          Length = 608

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 104/348 (29%), Positives = 147/348 (42%), Gaps = 96/348 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +  +   VV P+I  I  DT      P           GGF+W L 
Sbjct: 248 CEVNVLWLQPLLAAIREDRHTVVCPVIDIISADTLAYSASP--------VVRGGFNWGLH 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E +  + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE+SF
Sbjct: 300 FRWDLVPLSELEGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFDIWG 176
           +                    +W   M GG LF I  +     F K   Y S  G D   
Sbjct: 360 R--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTMT 396

Query: 177 GENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------- 211
             +L L             S + D     +G+++ R ELR+ L CKSFKWYL+       
Sbjct: 397 HNSLRLAHVWLDEYKEQYFSLRPDLRTRSYGNISERVELRKKLDCKSFKWYLDNIYPEMQ 456

Query: 212 ----------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQG 243
                                       + +  +  C+ +  +P+     V L  C    
Sbjct: 457 VSGPNAKPQQPIFINRGPKRPKVLQRGRLYHLQTNKCLAAQGRPSQKGGLVVLKACDYGD 516

Query: 244 GNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
            +Q W+ + +H  I  +  CLD     +     L  CHGS G+Q + +
Sbjct: 517 PDQIWIYNEEHELILNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 564


>gi|410962531|ref|XP_003987822.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1,
           partial [Felis catus]
          Length = 553

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 96/314 (30%), Positives = 141/314 (44%), Gaps = 50/314 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 210 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 262

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 263 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 322

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 323 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFPE--GNALTYIRNTKRTAEVWMDEYK 375

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYL+         V     G+     
Sbjct: 376 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLDNVYPELTVPVKEVLPGIIKQGV 435

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
            C++S  + +  +  +G+  C     N    Q W+ S H  I++   CL         G 
Sbjct: 436 NCLESQGQDSAGNFLLGMGICRGSAKNPPAPQAWLFSDH-LIQQQGKCLTATSTSITPGS 494

Query: 269 DVILYPCHGSKGNQ 282
            V+L  C+  +G Q
Sbjct: 495 LVVLQVCNPREGRQ 508


>gi|195129477|ref|XP_002009182.1| GI11401 [Drosophila mojavensis]
 gi|193920791|gb|EDW19658.1| GI11401 [Drosophila mojavensis]
          Length = 673

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 106/334 (31%), Positives = 147/334 (44%), Gaps = 71/334 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E    WL PLL+ +A+N    V P I  I    F  R       +  +   G FDW+  
Sbjct: 302 VEANYNWLPPLLEPIAQNKRTAVCPFIDVIDHSNFNYR-------AQDEGARGAFDWDFF 354

Query: 64  FN-WHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           +     +PE      K+ AEP  +P MAGGLF+I   FF +LG YD G DIWGGE  ELS
Sbjct: 355 YKRLPLLPED----LKHPAEPFKSPVMAGGLFAISAEFFWELGGYDEGLDIWGGEQYELS 410

Query: 123 FKF------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
           FK        + A   R    ++     V  P     L             Y    ++W 
Sbjct: 411 FKIWMCGGQMYDAPCSRIGHIYRGPRNHVSNPRGGDYLHK----------NYKRVAEVWM 460

Query: 177 GENLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYLE------VSN-------D 215
            E  +  + G        D GD+T++K +R  L CKSFKW++E      + N       D
Sbjct: 461 DEYKQYLYNGADGVYERIDAGDLTAQKAIRTKLKCKSFKWFMENVAFDLIKNYPPIDPPD 520

Query: 216 WSG----------MCIDSACKPTDMHKPVGLYPCH----KQGGNQFWMMSKHGEIR--RD 259
           ++           +C+D+  KP   H  VG+Y C     K    Q+W +S   ++R  R 
Sbjct: 521 YASGAIQNVGDPTLCVDTLSKPR--HNRVGIYSCARNLVKPQRTQYWSLSWKRDLRLHRK 578

Query: 260 EACLDY----AGGDVILYPCHGSKGNQYFEYDYK 289
           + CLD     A   V L+ CHG +GNQY+ YDY+
Sbjct: 579 KDCLDVQIWDANAPVWLWDCHGQQGNQYWYYDYR 612


>gi|312370888|gb|EFR19193.1| hypothetical protein AND_22920 [Anopheles darlingi]
          Length = 812

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 96/309 (31%), Positives = 147/309 (47%), Gaps = 41/309 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV   WL+ LLDV+A N + +  P I  +  D + +++   +       FIG +DW+L 
Sbjct: 459 VEVTIGWLEALLDVVAHNWTTIAIPTIDWV--DEYNMKYKDDKA----PIFIGAYDWDLN 512

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W       +K++ N   P  TP MAGGLF+I++ FFE+LG YD GFDI+G EN+ELS 
Sbjct: 513 FGWWG-RWSMKKKYDNKMVPFDTPAMAGGLFTINRTFFERLGWYDEGFDIYGIENIELSM 571

Query: 124 KFNWH------AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG--FDIW 175
           K +W        +P       + A  P  T      + +      E       G  +DI+
Sbjct: 572 K-SWMCGGKMVTVPCSRVGHIQKAGHPYLTRETKDVVRANSIRLAEVWMDEYKGIIYDIY 630

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVSNDWS 217
           G  +     + +FG V  RK +R+  GC+ F++YL                  EV N   
Sbjct: 631 GIPHYS---EEEFGSVEHRKAIRQKAGCQPFRYYLENAFPEMHNPMVPGAFRGEVHNGAL 687

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHG 277
           G       + TD    +G+ PC     +QFW  + + E+   + C+D  G ++ +Y CH 
Sbjct: 688 GNGTCLTYRGTD--NFLGMAPCDHLEKSQFWTHNYYQELNSYQNCID--GPNLAVYRCHK 743

Query: 278 SKGNQYFEY 286
           S+GNQ ++Y
Sbjct: 744 SRGNQAWKY 752



 Score =  110 bits (275), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 75/255 (29%), Positives = 124/255 (48%), Gaps = 33/255 (12%)

Query: 54  FIGGFDWNLQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDI 113
           +IG +DW+L F W       +K++ N   P  TP MAGGLF+I++ FFE+LG YD GFDI
Sbjct: 11  YIGAYDWDLNFGWWG-RWSMKKKYDNKMVPFDTPAMAGGLFTINRTFFERLGWYDEGFDI 69

Query: 114 WGGENLELSFKFNWH------AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGT 167
           +G EN+ELS K +W        +P       +    P +       +   +     ++  
Sbjct: 70  YGIENIELSMK-SWMCGGKMVTVPCSRVAHIQKVGHP-YLRNEKKDVVRANSIRLAEVWM 127

Query: 168 YDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV------SNDWSG--- 218
            +    I+    +    + +FG V +RK +R   GC+ F++Y+E       S D +G   
Sbjct: 128 DEYKHVIFDIHGIPHYLEEEFGSVENRKAIRERAGCRDFRYYIENAFPEMHSPDVAGAFR 187

Query: 219 -----------MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG 267
                      MC++   + TD    +G+ PC  +  +QFW  + + E+     C+D+ G
Sbjct: 188 GEVHSVVLGVTMCLEY--RHTDSF--LGMGPCDGKQRSQFWTHNYYEELNSYRYCIDFTG 243

Query: 268 GDVILYPCHGSKGNQ 282
            ++ ++ CH S+GNQ
Sbjct: 244 SNLGVFGCHRSRGNQ 258


>gi|194225134|ref|XP_001495036.2| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1-like
           [Equus caballus]
          Length = 619

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 95/315 (30%), Positives = 139/315 (44%), Gaps = 52/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F          ++     GGFDW+L 
Sbjct: 276 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNFAY-------LAASAILRGGFDWSLH 328

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 329 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 388

Query: 124 KF-----NWHAIPERE-----RKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           +      +   +P        RKRH  N  E        G   +  +        +   +
Sbjct: 389 RVWMCGGSLEIVPCSRVGHVFRKRHPYNFPE--------GNALTYIRNTKRTAEVWMDEY 440

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM---- 219
             +  E    +    FG V +R E R+ + CKSF+WYL+         V     G+    
Sbjct: 441 KQYYYEARPSAIGKAFGSVATRIEQRKKMSCKSFRWYLDNVYPELTVPVKEVLPGIIKQG 500

Query: 220 --CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------G 267
             C++S  + T  +  +G+  C     N    Q W+ S H  I++   CL         G
Sbjct: 501 VNCLESQGQDTAGNFLLGMGICRGSVKNPPAPQAWLFSDH-LIQQQGKCLTATSTSVSPG 559

Query: 268 GDVILYPCHGSKGNQ 282
             V L  C+  +G Q
Sbjct: 560 SLVTLQVCNPREGRQ 574


>gi|426377334|ref|XP_004055422.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
           [Gorilla gorilla gorilla]
          Length = 598

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 138/315 (43%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL P+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 254 CEVNTEWLPPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 306

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 307 FKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 366

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 367 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 419

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 420 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIKQGV 479

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     N    Q W+ S H  I++   CL          G
Sbjct: 480 NCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 538

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 539 SPVILQMCNPREGKQ 553


>gi|268574330|ref|XP_002642142.1| C. briggsae CBR-GLY-6 protein [Caenorhabditis briggsae]
          Length = 617

 Score =  127 bits (319), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 81/218 (37%), Positives = 114/218 (52%), Gaps = 21/218 (9%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  K WL+PLL  +  N   V  P+I  I D+TF+ +          + F GGF+WNLQ
Sbjct: 254 CECTKGWLEPLLTRIKLNRKAVPCPVIDIINDNTFQYQ-------KGIEMFRGGFNWNLQ 306

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P    K+H  +   P+ +PTMAGGLFSID+ +FE+LG YD G DIWGGENLE+S
Sbjct: 307 FRWYGMPSSMAKQHLLDPTGPIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMS 366

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+          +P          + P   P  + G   +  A   ++   +   D W  
Sbjct: 367 FRIWQCGGRVEILPCSHVGHVFRKSSPHDFPGKSSG--KVLNANLLRVA--EVWMDEWKY 422

Query: 178 ENLELSFKG----DFGDVTSRKELRRNLGCKSFKWYLE 211
              +++ +        DV+ R ELR+ L CKSFKWYL+
Sbjct: 423 YFYKIAPQAYRMRPSIDVSERVELRKTLNCKSFKWYLQ 460


>gi|195028169|ref|XP_001986949.1| GH20244 [Drosophila grimshawi]
 gi|193902949|gb|EDW01816.1| GH20244 [Drosophila grimshawi]
          Length = 599

 Score =  127 bits (319), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 92/308 (29%), Positives = 139/308 (45%), Gaps = 44/308 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
           CE  + W +PLL  +  + + V+ P+I  I  D+ + ++     T+ YK F +GGF WN 
Sbjct: 244 CEANEGWCEPLLQRIKDSRTSVLVPIIDVI--DSVDFQYS----TNGYKSFQVGGFQWNG 297

Query: 63  QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
            F+W  +PERE+ R            P ++PTMAGGLF++D+ +F ++G+YD   D WGG
Sbjct: 298 HFDWVNLPEREKLRQSRECNQPREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGG 357

Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           ENLE+SF+          IP            P   P        I+ A    L   D  
Sbjct: 358 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 415

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
            +++     +L F  D GDVT R  LR+ L CKSF WYL+                  V 
Sbjct: 416 INVFFLNRPDLKFHPDIGDVTHRVVLRKKLRCKSFDWYLQNVYPEKFVPNKNVKAWGRVR 475

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
           +    +CID      +    +GLYPC K    +Q +  +    +R + +C          
Sbjct: 476 SVHDNLCIDDLLNNNEKPYNLGLYPCGKTLQHSQLFSFTNSQVLRNELSCATVQHSSSPP 535

Query: 270 --VILYPC 275
             +++ PC
Sbjct: 536 YRIVMVPC 543


>gi|405951291|gb|EKC19216.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Crassostrea
           gigas]
          Length = 613

 Score =  127 bits (319), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 83/266 (31%), Positives = 130/266 (48%), Gaps = 34/266 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL+PLL  ++ + + VV P+I  I  DT E +  P           GGF+W L 
Sbjct: 242 CEVNTDWLEPLLLRISHDPTTVVVPVIDIINHDTMEYQQSP--------LVRGGFNWGLH 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+W  +P+ E+      ++P+ +PTMAGGLF++ + +F  LG YD G DIWGGENLE+SF
Sbjct: 294 FSWDRLPDNEKNDPDLGSKPILSPTMAGGLFAMKRDYFHHLGEYDLGMDIWGGENLEISF 353

Query: 124 KF-----NWHAIPE-------RERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           +          IP        R+R+ + N   P    T       +   + +K   Y   
Sbjct: 354 RIWMCGGKLEIIPCSRVGHIFRKRRPYGN---PKGRDTFLKNSLRVANVWMDKYKEY--- 407

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMH 231
                 +    +   D+GD++ R  LR++L CKSFKWYL+  + +  + +    KP++  
Sbjct: 408 ----FLKQRPQAQVVDYGDISDRISLRKHLSCKSFKWYLD--HVYPELSLPGDVKPSN-- 459

Query: 232 KPVGLYPCHKQGGNQFWMMSKHGEIR 257
           K     P       +  ++ +HG I+
Sbjct: 460 KSSHHQPMKSNDKKKKPVIVRHGRIK 485


>gi|312371733|gb|EFR19844.1| hypothetical protein AND_21714 [Anopheles darlingi]
          Length = 637

 Score =  127 bits (319), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 107/322 (33%), Positives = 149/322 (46%), Gaps = 55/322 (17%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           EV   WL PLL+ +A +    V P I  I  DTF+ R       +  +   G FDW  +F
Sbjct: 252 EVNNNWLPPLLEPIAEDYRTCVCPFIDVIAHDTFQYR-------AQDEGKRGAFDW--KF 302

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
            +  +P        +  +P  +P MAGGLF+I   FF +LG YD G DIWGGE  ELSFK
Sbjct: 303 YYKRLPLLPGDL-DDPTKPFNSPVMAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFK 361

Query: 125 FNWHA------IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
             W         P            P   P    G+  + + F      +   +  +  E
Sbjct: 362 I-WQCGGRLVDAPCSRVGHVYRGYAPFGNPR---GVNFVVRNFKRVAEVWMDEYAKFLYE 417

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-EVSNDW-------------SG------ 218
              L  K D GD+T+++ELR  L C+ FKW+L E++ D              SG      
Sbjct: 418 RNPLFEKTDPGDLTAQRELRERLQCRPFKWFLEEIAPDLLIRYPVREPQPFASGRVQSVA 477

Query: 219 ---MCIDSACKPTDMHKPVGLYPC-----HKQGGNQFWMMSKHGEI--RRDEACLDYA-- 266
              +C+DS        +P+GLY C     H Q  NQF+ +S H +I  R ++ CLD +  
Sbjct: 478 DRRLCLDSLNH--QAKQPIGLYTCASNQTHPQ-NNQFFTLSFHRDIRVRSNDKCLDASRL 534

Query: 267 GGDVILYPCHGSKGNQYFEYDY 288
             +VIL+ CH S+GNQ + YDY
Sbjct: 535 NDEVILFSCHESQGNQMWRYDY 556


>gi|194882801|ref|XP_001975498.1| GG20529 [Drosophila erecta]
 gi|190658685|gb|EDV55898.1| GG20529 [Drosophila erecta]
          Length = 601

 Score =  127 bits (319), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 97/324 (29%), Positives = 150/324 (46%), Gaps = 45/324 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
           CE    W +PLL  +  + + V+ P+I  I  + F+        T+ YK F +GGF WN 
Sbjct: 247 CEGNIGWCEPLLQRIKESRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 300

Query: 63  QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
            F+W  +PERE++R +          P ++PTMAGGLF+ID+ +F ++G+YD   D WGG
Sbjct: 301 HFDWINLPEREKQRQRRECRQQREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGG 360

Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           ENLE+SF+          IP            P   P        I+ A    L   D  
Sbjct: 361 ENLEMSFRIWQCGGTIETIPCSRVGHVFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 418

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
            +I+     +L F  D GDVT R  LR+ L CKSF+WYL                  +V 
Sbjct: 419 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFEWYLKNIYPEKFVPTKDVQGWGKVH 478

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
              S +C+D   +  +     GLYPC K    +Q +  +    +R + +C      +   
Sbjct: 479 ALNSNICLDDLLQNNEKPYNAGLYPCGKVLQKSQLFSFTNTNVLRNELSCATVQHSESPP 538

Query: 270 --VILYPC-HGSKGNQYFEYDYKY 290
             V++ PC    + N+++ Y++++
Sbjct: 539 YRVVMVPCMENDEFNEHWRYEHQH 562


>gi|383848548|ref|XP_003699911.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Megachile rotundata]
          Length = 604

 Score =  127 bits (318), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 105/323 (32%), Positives = 147/323 (45%), Gaps = 52/323 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A+N + VVSP+I  I DDTF         T S++   G F+W+L 
Sbjct: 251 CECTVGWLEPLLEAVAKNKTRVVSPVIDIINDDTFSY-------TRSFELHWGAFNWDLH 303

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +  R  K R +N  EP  TP MAGGLFS+++ +F +LG+YD    IWGGENLELS
Sbjct: 304 FRWLTLNGRLLKERRENIVEPFRTPAMAGGLFSMNRDYFFELGSYDDQMKIWGGENLELS 363

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY--DSGFDIW 175
           F+      +    P          + P    T  GG+  I      ++     D   + +
Sbjct: 364 FRVWQCGGSVEIAPCSHVGHLFRKSSPY---TFPGGVGEILYGNLARVALVWMDEWAEFY 420

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDW------------------- 216
              N E S       V +R  LR+ L CKSF+WYL+  N W                   
Sbjct: 421 FKFNAEASRLRHKQPVRARLALRKRLQCKSFEWYLD--NVWPEHFFPKNDRFFGRIVHVS 478

Query: 217 SGMCIDSACKPTDMHKPVG---LYPC-HKQGGNQFWMMSKHGEIRRDEA-CLDYAGGD-- 269
           +  CI          +P G   L  C  +   NQ ++M+K G +  DE+ CLD    D  
Sbjct: 479 TKKCIMRPTAKGTYSQPSGYALLESCIPRPVLNQMFVMTKSGIVMTDESICLDAPDRDTQ 538

Query: 270 -----VILYPCHGSKGNQYFEYD 287
                V +  C  S+  Q ++YD
Sbjct: 539 HKTPRVKIMAC-SSQSRQNWQYD 560


>gi|340378190|ref|XP_003387611.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Amphimedon queenslandica]
          Length = 512

 Score =  127 bits (318), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 94/297 (31%), Positives = 134/297 (45%), Gaps = 39/297 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  ++++ + VVSP+I  I  DTF+       L        GGFDW+L 
Sbjct: 178 CECNIGWLEPLLHRVSQDRTIVVSPIIDVISMDTFDYIGASSELR-------GGFDWSLH 230

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W      +R + K+  EP+ TP +AGGLFSI++  F + G YD   DIWGGEN E+SF
Sbjct: 231 FKWDGFTPAQRAKRKSPIEPIKTPMIAGGLFSINRQRFIETGKYDDQMDIWGGENFEISF 290

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   IP        RKRH     P   P   G   +  K        +   + 
Sbjct: 291 RTWMCGGSLEIIPCSRVGHVFRKRH-----PYVFP--GGNAMTYMKNTKRAAEVWMDNYK 343

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------VSNDWSGMCID 222
            +       +   D G + SR  LR+ L C +F WY++            +N    +  +
Sbjct: 344 DYYYSARPSAKGRDMGSIKSRVALRKRLNCTTFDWYMKNVYPELSVPSSTNNKHGKLKQN 403

Query: 223 SACKPTDMHK---PVGLYPCHK-QGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPC 275
           + C  T  H+   PVGL  C + + G Q W ++  G IR    CL+  G  V L  C
Sbjct: 404 NLCLDTLGHQAGEPVGLQDCQQSRQGYQDWSIAMKGLIRHLNLCLEARGQIVHLQYC 460


>gi|395828928|ref|XP_003787614.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
           [Otolemur garnettii]
          Length = 678

 Score =  127 bits (318), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 77/222 (34%), Positives = 110/222 (49%), Gaps = 32/222 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRIKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 255 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 314

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF-------DIWG 176
           +  W              +  +   +  G +F     +    G  ++         ++W 
Sbjct: 315 RV-WMC----------GGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAEVWM 363

Query: 177 GENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
            E  +        + +  FG++ SR +LR+NL C+SFKWYLE
Sbjct: 364 DEYKQYYYAARPFALERPFGNIESRLDLRKNLRCQSFKWYLE 405


>gi|344235750|gb|EGV91853.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Cricetulus griseus]
          Length = 797

 Score =  127 bits (318), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 139/315 (44%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 198 CEVNIEWLQPMLQRVMEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 250

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 251 FKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 310

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 311 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFPE--GNALTYIRNTKRTAEVWMDEYK 363

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE               G+     
Sbjct: 364 QYYYEARPSAIGKAFGSVATRIEQRKKMDCKSFRWYLENVYPELTVPAKEVLPGVIKQGV 423

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     +    Q W+ S H  I++   CL          G
Sbjct: 424 NCLESQGQNTAGDLLLGMGICRGSAKSPPPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 482

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 483 SPVILQVCNPKEGKQ 497


>gi|321469963|gb|EFX80941.1| hypothetical protein DAPPUDRAFT_224457 [Daphnia pulex]
          Length = 498

 Score =  127 bits (318), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 74/222 (33%), Positives = 112/222 (50%), Gaps = 34/222 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + W+QPL+  +  N + VV+P+I  I  DTF+    P           GGF+W L 
Sbjct: 122 CEVNREWVQPLIARIQENRTFVVTPIIDIINSDTFQYTSSP--------LVRGGFNWGLH 173

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+   K +++  +P+ +PTMAGGLF+I++ +F  +G YD+G ++WGGENLE+SF
Sbjct: 174 FKWDSLPDDTLKTNEDFVKPILSPTMAGGLFAIEREYFFDIGEYDAGMNVWGGENLEISF 233

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS--GFDIWG 176
           +          IP            P  +P              E   TY+S     +W 
Sbjct: 234 RIWMCGGRLEIIPCSRVGHVFRRRRPYGSPN------------GEDTMTYNSLRAAHVWL 281

Query: 177 GENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE 211
            + +E  F          +GDV  R+ LRR + C+SF WYL+
Sbjct: 282 DDYIEHFFHVRPDARHVSYGDVGPRQRLRRLMKCQSFDWYLK 323


>gi|354472196|ref|XP_003498326.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
           [Cricetulus griseus]
          Length = 513

 Score =  126 bits (317), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 139/315 (44%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 169 CEVNIEWLQPMLQRVMEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 221

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 222 FKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 281

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 282 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 334

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE               G+     
Sbjct: 335 QYYYEARPSAIGKAFGSVATRIEQRKKMDCKSFRWYLENVYPELTVPAKEVLPGVIKQGV 394

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     +    Q W+ S H  I++   CL          G
Sbjct: 395 NCLESQGQNTAGDLLLGMGICRGSAKSPPPAQAWLFSDH-LIQQQGKCLAATSTLMSSPG 453

Query: 268 GDVILYPCHGSKGNQ 282
             VIL  C+  +G Q
Sbjct: 454 SPVILQVCNPKEGKQ 468


>gi|391346326|ref|XP_003747427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Metaseiulus occidentalis]
          Length = 622

 Score =  126 bits (317), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 95/304 (31%), Positives = 146/304 (48%), Gaps = 35/304 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + W++PLL  +    S VV P+I  +  DTF   FP    +S  +   GGFDWNL 
Sbjct: 259 CECNEGWIEPLLARIRDEPSKVVCPVIDVLSMDTFGY-FPA---SSDLR---GGFDWNLV 311

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  I  +     + A +P+ TP MAGGLF+I K  FE+LG+YD+  DIWG ENLE+SF
Sbjct: 312 FKWEFITSKP----ELATDPIKTPAMAGGLFAITKKEFERLGSYDTQMDIWGAENLEMSF 367

Query: 124 KFNWHA-----IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +  W       I    R  H    +  +T    G      +        +   +  +  E
Sbjct: 368 RV-WQCGSGIEILPCSRVGHVFRKQHPYTFPGGGSGKVFARNSRRAAEVWMDDYKKYYYE 426

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWY-------LEVSNDWSG------MCIDSAC 225
            +  +    +GD++ R +LR  L CKSF+WY       L++ ++  G       C+D+  
Sbjct: 427 QVPAAKSVAYGDISERLKLREKLRCKSFEWYMKNVYPELKLPSNVHGYVRQNNRCLDTLG 486

Query: 226 KPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACL---DYAGGDVI-LYPCHGSKGN 281
             +D    V +YPCH  GGNQ + ++K+  +   + C+     AG  ++ L  C+G    
Sbjct: 487 AISD-GSTVHVYPCHYLGGNQDFRLAKNHLLMVHDMCVSLGSLAGQQLVKLRTCNGENSQ 545

Query: 282 QYFE 285
           ++  
Sbjct: 546 KWVR 549


>gi|301763305|ref|XP_002917071.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1-like
           [Ailuropoda melanoleuca]
          Length = 555

 Score =  126 bits (317), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 95/314 (30%), Positives = 142/314 (45%), Gaps = 50/314 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 212 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 264

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 265 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 324

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 325 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 377

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + C+SF+WYL+         V     G+     
Sbjct: 378 QYYYEARPSAIGKAFGSVATRIEQRKKMNCRSFRWYLDNVYPELTVPVKEVLPGIIKQGV 437

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
            C++S  + +  +  +G+  C     N    Q W+ S H  I++   CL  +      G 
Sbjct: 438 NCLESQGQDSAGNFLLGMGICRGSAKNPPAPQAWLFSDH-LIQQQGKCLTVSSTSVTPGS 496

Query: 269 DVILYPCHGSKGNQ 282
            V+L  C+  +G Q
Sbjct: 497 LVLLQGCNPREGRQ 510


>gi|71987795|ref|NP_001022646.1| Protein GLY-6, isoform c [Caenorhabditis elegans]
 gi|3047201|gb|AAC13676.1| GLY6c [Caenorhabditis elegans]
 gi|14530525|emb|CAC42318.1| Protein GLY-6, isoform c [Caenorhabditis elegans]
          Length = 562

 Score =  126 bits (317), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 80/218 (36%), Positives = 111/218 (50%), Gaps = 21/218 (9%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  K WL+PLL  +  N   V  P+I  I D+TF+ +          + F GGF+WNLQ
Sbjct: 254 CECTKGWLEPLLTRIKLNRKAVPCPVIDIINDNTFQYQ-------KGIEMFRGGFNWNLQ 306

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P    K+H  +   P+ +PTMAGGLFSI++ +FE+LG YD G DIWGGENLE+S
Sbjct: 307 FRWYGMPTAMAKQHLLDPTGPIESPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMS 366

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+          +P          + P   P  + G           L   +   D W  
Sbjct: 367 FRIWQCGGRVEILPCSHVGHVFRKSSPHDFPGKSSG----KVLNTNLLRVAEVWMDDWKH 422

Query: 178 ENLELSFKG----DFGDVTSRKELRRNLGCKSFKWYLE 211
              +++ +        DV+ R ELR+ L CKSFKWYL+
Sbjct: 423 YFYKIAPQAHRMRSSIDVSERVELRKKLNCKSFKWYLQ 460


>gi|350416150|ref|XP_003490858.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Bombus impatiens]
          Length = 604

 Score =  126 bits (317), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 106/323 (32%), Positives = 147/323 (45%), Gaps = 52/323 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A+N + VVSP+I  I DDTF         T S++   G F+W+L 
Sbjct: 251 CECTVGWLEPLLEAVAKNRTRVVSPVIDIINDDTFSY-------TRSFELHWGAFNWDLH 303

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +  R  K R +N  EP  TP MAGGLFS+++ +F +LG+YD    IWGGENLELS
Sbjct: 304 FRWLTLNGRLLKERRENIVEPFRTPAMAGGLFSMNRNYFFELGSYDDQMKIWGGENLELS 363

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY--DSGFDIW 175
           F+      +    P          + P    T  GG+  I      ++     D   + +
Sbjct: 364 FRVWQCGGSIEIAPCSHVGHLFRKSSPY---TFPGGVGEILYGNLARVALVWMDEWAEFY 420

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDW------------------- 216
              N E +   D   V  R ELR+ L CK+F+WYL  +N W                   
Sbjct: 421 FKFNTEAARLRDKQPVRGRLELRKRLQCKNFEWYL--NNIWPEHFFPKDDRFFGRILHIS 478

Query: 217 SGMCIDSACKPTDMHKPVG---LYPCHKQGG-NQFWMMSKHGEIRRDEA-CLDYAGGD-- 269
           S  CI          +P G   L  C  +   +Q ++M+K G I  DE+ CLD    D  
Sbjct: 479 SNKCIMRPTAKGTYSQPSGYAVLETCLPRPILSQMFVMTKDGIIMTDESVCLDAPDHDTQ 538

Query: 270 -----VILYPCHGSKGNQYFEYD 287
                V +  C G+   Q + YD
Sbjct: 539 HKTPKVKIMACSGN-DRQKWRYD 560


>gi|281349386|gb|EFB24970.1| hypothetical protein PANDA_005243 [Ailuropoda melanoleuca]
          Length = 553

 Score =  126 bits (317), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 95/314 (30%), Positives = 142/314 (45%), Gaps = 50/314 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 210 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 262

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 263 FKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 322

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 323 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 375

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + C+SF+WYL+         V     G+     
Sbjct: 376 QYYYEARPSAIGKAFGSVATRIEQRKKMNCRSFRWYLDNVYPELTVPVKEVLPGIIKQGV 435

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
            C++S  + +  +  +G+  C     N    Q W+ S H  I++   CL  +      G 
Sbjct: 436 NCLESQGQDSAGNFLLGMGICRGSAKNPPAPQAWLFSDH-LIQQQGKCLTVSSTSVTPGS 494

Query: 269 DVILYPCHGSKGNQ 282
            V+L  C+  +G Q
Sbjct: 495 LVLLQGCNPREGRQ 508


>gi|427797631|gb|JAA64267.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Rhipicephalus pulchellus]
          Length = 641

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 94/314 (29%), Positives = 136/314 (43%), Gaps = 62/314 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  +  N   +  P+I  I  DTFE R     +    + F G F+W + 
Sbjct: 290 CEVGINWLPPLLAPIRANRRAMTVPVIDGIDKDTFEYR----PVYHGRQHFRGIFEWGML 345

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP+ E KR K  +EP  +PT AGGLF+I++ +F +LG YD G  +WGGEN ELSF
Sbjct: 346 YKEIEIPDEEIKRRKYHSEPYKSPTHAGGLFAINRKYFLELGGYDPGLLVWGGENFELSF 405

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-------LFSIDKAFFEKLG-----TYDSG 171
           K  W                  W P    G        +S  K   ++ G      Y   
Sbjct: 406 KI-WQC-----------GGMIYWVPCSRVGHVYRGFMPYSFGKLAQKRKGPLITVNYKRV 453

Query: 172 FDIWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-------------- 210
            ++W  E  E       L+   D GD+  +  LR  L CKSF+W++              
Sbjct: 454 VEVWMDEYKEYFYTREPLATYYDAGDLKQQLALREKLKCKSFRWFMKNVAYDVLKNFPLL 513

Query: 211 -------EVSNDWSGMCIDSACKPTDMHKP--VGLYPCHKQGGNQFWMMSKHGEIRRDEA 261
                  E+ +D +  C+D+       H P    L  CH  GGNQ + ++  G++   E 
Sbjct: 514 PRNLYWGEIRHDATDQCLDA----MGAHPPSTAALTACHGTGGNQVFRLNAEGQLGLGER 569

Query: 262 CLDYAGGDVILYPC 275
           C+D +   + +  C
Sbjct: 570 CMDASSHSMDVVYC 583


>gi|291410883|ref|XP_002721722.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 1,
           partial [Oryctolagus cuniculus]
          Length = 499

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 97/315 (30%), Positives = 138/315 (43%), Gaps = 51/315 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 155 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 207

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F IDKA+F  LG YD+  DIWGGEN ELSF
Sbjct: 208 FKWEQIPLEQKITRTDPTRPIRTPVIAGGIFVIDKAWFNHLGKYDAQMDIWGGENFELSF 267

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 268 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 320

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CKSF+WYLE         V     G+     
Sbjct: 321 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEVLPGIIKQGV 380

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL-------DYAG 267
            C++S  + T     +G+  C     +    Q W+ + H  I++   CL          G
Sbjct: 381 NCLESQGQNTAGDFLLGMGICRGSAKSPPPAQAWLFTDH-LIQQQGKCLAATSTLMSSPG 439

Query: 268 GDVILYPCHGSKGNQ 282
             V L  C+  +G Q
Sbjct: 440 SPVTLQVCNPREGKQ 454


>gi|427797629|gb|JAA64266.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Rhipicephalus pulchellus]
          Length = 641

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 94/314 (29%), Positives = 136/314 (43%), Gaps = 62/314 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  +  N   +  P+I  I  DTFE R     +    + F G F+W + 
Sbjct: 290 CEVGINWLPPLLAPIRANRRAMTVPVIDGIDKDTFEYR----PVYHGRQHFRGIFEWGML 345

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP+ E KR K  +EP  +PT AGGLF+I++ +F +LG YD G  +WGGEN ELSF
Sbjct: 346 YKEIEIPDEEIKRRKYHSEPYKSPTHAGGLFAINRKYFLELGGYDPGLLVWGGENFELSF 405

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-------LFSIDKAFFEKLG-----TYDSG 171
           K  W                  W P    G        +S  K   ++ G      Y   
Sbjct: 406 KI-WQC-----------GGMIYWVPCSRVGHVYRGFMPYSFGKLAQKRKGPLITVNYKRV 453

Query: 172 FDIWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-------------- 210
            ++W  E  E       L+   D GD+  +  LR  L CKSF+W++              
Sbjct: 454 VEVWMDEYKEYFYTREPLATYYDAGDLKQQLALREKLKCKSFRWFMKNVAYDVLKNFPLL 513

Query: 211 -------EVSNDWSGMCIDSACKPTDMHKP--VGLYPCHKQGGNQFWMMSKHGEIRRDEA 261
                  E+ +D +  C+D+       H P    L  CH  GGNQ + ++  G++   E 
Sbjct: 514 PRNLYWGEIRHDATDQCLDA----MGAHPPSTAALTACHGTGGNQVFRLNAEGQLGLGER 569

Query: 262 CLDYAGGDVILYPC 275
           C+D +   + +  C
Sbjct: 570 CMDASSHSMDVVYC 583


>gi|345497732|ref|XP_001601595.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Nasonia vitripennis]
          Length = 610

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 101/326 (30%), Positives = 145/326 (44%), Gaps = 54/326 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +++N + VVSP+I  I DDTF         T S++   G F+W+L 
Sbjct: 254 CECTAGWLEPLLEAISKNRTRVVSPVIDIINDDTFSY-------TRSFELHWGAFNWDLH 306

Query: 64  FNWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +     R+R +N  +P  TP MAGGLFS+D+ +F +LG+YD    IWGGENLELS
Sbjct: 307 FRWLMLNGALLRERRENIVDPFKTPAMAGGLFSMDREYFFELGSYDEHMRIWGGENLELS 366

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTP-----TMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +    P          + P   P      + G L  +   + ++ G +   F
Sbjct: 367 FRVWQCGGSVEIAPCSHVGHIFRKSSPYTFPGGVDEILYGNLARVALVWMDEWGKFYFNF 426

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSND 215
                 N +     D   + SR ELR  L CKSF+WYL+                 + + 
Sbjct: 427 ------NPQAQRVRDKQQIRSRLELRERLKCKSFEWYLDNVWPDHFFPKDDRFFGYILHP 480

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHK----QGGNQFWMMSKHGEIRRDEA-CLDYAGGD- 269
            +  C+          +P G             +Q ++M K G I  DE+ CLD    D 
Sbjct: 481 SNKKCLMRPMSKGAYSQPSGFVAYQDCIVPPNLSQMFVMRKDGVIMTDESVCLDAPEKDN 540

Query: 270 ------VILYPCHGSKGNQYFEYDYK 289
                 V L  C G   +Q +EYD K
Sbjct: 541 RHEKPKVKLMACSGF-ASQKWEYDEK 565


>gi|270006170|gb|EFA02618.1| hypothetical protein TcasGA2_TC008338 [Tribolium castaneum]
          Length = 613

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 107/329 (32%), Positives = 146/329 (44%), Gaps = 70/329 (21%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           E    WL PLL+ +A++    V P I  I  +TFE R       +  +   G FDW  +F
Sbjct: 251 EANVNWLPPLLEPIAQDYKTCVCPFIDVIQYETFEYR-------AQDEGARGAFDW--EF 301

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
            +  +P       ++  EP  +P MAGGLF+I + FF +LG YD G DIWGGE  ELSFK
Sbjct: 302 FYKRLPLLPEDL-EHPTEPFKSPVMAGGLFAISRKFFWELGGYDEGLDIWGGEQYELSFK 360

Query: 125 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-------TYDSGFDIWGG 177
             W                 V  P    G      A F   G        Y    ++W  
Sbjct: 361 I-WQC-----------GGLMVDAPCSRVGHIYRKYAPFPNPGKGDFVGRNYRRVAEVWMD 408

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-VSNDW------------- 216
           E  E  +K        D GD+T +K LR  L CK FKW++E V+ D              
Sbjct: 409 EYAEYLYKRRPHYRDIDPGDLTKQKALREKLHCKPFKWFMEKVAFDLPLKYPPIEPGDFG 468

Query: 217 ---------SGMCIDSACKPTDMHKPVGLYPCHK---QGGNQFWMMSKHGEIRR--DEAC 262
                      +C+DS  K  D  + +GL  C K   + G Q + ++ H ++R      C
Sbjct: 469 VGEIRNLAAPELCVDSGHK--DRDQVIGLAECVKGTNKNGEQNFALTWHKDLRVKGKTLC 526

Query: 263 LDYAG----GDVILYPCHGSKGNQYFEYD 287
           LD +      D++LYPCHGS+GNQY+ YD
Sbjct: 527 LDVSDPNDKADIVLYPCHGSQGNQYWRYD 555


>gi|268370155|ref|NP_001161257.1| polypeptide GalNAc transferase 6-like [Tribolium castaneum]
          Length = 591

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 107/331 (32%), Positives = 147/331 (44%), Gaps = 70/331 (21%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           E    WL PLL+ +A++    V P I  I  +TFE R       +  +   G FDW  +F
Sbjct: 229 EANVNWLPPLLEPIAQDYKTCVCPFIDVIQYETFEYR-------AQDEGARGAFDW--EF 279

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
            +  +P       ++  EP  +P MAGGLF+I + FF +LG YD G DIWGGE  ELSFK
Sbjct: 280 FYKRLPLLPEDL-EHPTEPFKSPVMAGGLFAISRKFFWELGGYDEGLDIWGGEQYELSFK 338

Query: 125 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-------TYDSGFDIWGG 177
             W                 V  P    G      A F   G        Y    ++W  
Sbjct: 339 I-WQC-----------GGLMVDAPCSRVGHIYRKYAPFPNPGKGDFVGRNYRRVAEVWMD 386

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-VSNDW------------- 216
           E  E  +K        D GD+T +K LR  L CK FKW++E V+ D              
Sbjct: 387 EYAEYLYKRRPHYRDIDPGDLTKQKALREKLHCKPFKWFMEKVAFDLPLKYPPIEPGDFG 446

Query: 217 ---------SGMCIDSACKPTDMHKPVGLYPCHK---QGGNQFWMMSKHGEIRR--DEAC 262
                      +C+DS  K  D  + +GL  C K   + G Q + ++ H ++R      C
Sbjct: 447 VGEIRNLAAPELCVDSGHK--DRDQVIGLAECVKGTNKNGEQNFALTWHKDLRVKGKTLC 504

Query: 263 LDYAG----GDVILYPCHGSKGNQYFEYDYK 289
           LD +      D++LYPCHGS+GNQY+ YD +
Sbjct: 505 LDVSDPNDKADIVLYPCHGSQGNQYWRYDVE 535


>gi|108935842|sp|Q8BVG5.2|GLT14_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 14;
           AltName: Full=Polypeptide GalNAc transferase 14;
           Short=GalNAc-T14; Short=pp-GaNTase 14; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 14;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 14
          Length = 550

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 97/320 (30%), Positives = 139/320 (43%), Gaps = 64/320 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++    +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 255 FQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RK+H     P   P      +         +       +
Sbjct: 315 RVWMCGGGLEIIPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG++ +R  LR+NL C++FKW LE       V  D S  
Sbjct: 361 VWMDEYKQYYYAARPFALERPFGNIENRLNLRKNLHCQTFKWNLENVYPELRVPPDSSIQ 420

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLD-- 264
                    C++S  +     + + L PC K  G+    Q W  +   +I ++E CL   
Sbjct: 421 KGNIRQRQKCLES--QKQKKQEILRLSPCAKVKGDGAKSQVWAFTYTQQIIQEELCLSVV 478

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 479 TLFPGAPVVLALCKNGDERQ 498


>gi|348539520|ref|XP_003457237.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Oreochromis niloticus]
          Length = 619

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 96/332 (28%), Positives = 141/332 (42%), Gaps = 73/332 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  + ++   VV P+I  I  DT  L + P  +        GGF+W L 
Sbjct: 224 CEVNQAWLQPLLAPIQKDHRTVVCPVIDIISADT--LAYSPSPIVR------GGFNWGLH 275

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    + A+ P+ +PTMAGGLF++++ +F +LG YD+G DIWGGENLE+SF
Sbjct: 276 FKWDPVPPSELSGPEGASGPIRSPTMAGGLFAMNRKYFNELGQYDAGMDIWGGENLEISF 335

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP----TMAGGLFSIDKAFFEKLGTYDSGFDI 174
           +          IP            P  +P    TMA     +   + +           
Sbjct: 336 RIWMCGGQLFIIPCSRVGHIFRKRRPYGSPGGHDTMAHNSLRLAHVWMD----------- 384

Query: 175 WGGENLELSFKGD-----FGDVTSRKELRRNLGCKSFKWYLE------------------ 211
            G +   LS + +     +GD+  R  LR+ L C SF+WYL+                  
Sbjct: 385 -GYKEQYLSLRPELRNRSYGDIGERVALRKRLQCHSFRWYLDTVYPEMQTAANGNKQQPL 443

Query: 212 ----------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGE 255
                           + N     C+ +  + +     V L PC  +  +Q W   + G+
Sbjct: 444 FINKGLKRPKVLQRGRLRNLAIRRCLVAQGRASQKGGAVVLRPCDPRDPDQDWAYDEEGQ 503

Query: 256 -IRRDEACLDYAGGDVI----LYPCHGSKGNQ 282
            I     CLD +         L  CHGS G+Q
Sbjct: 504 LILAGLLCLDVSEVRTFDPPRLMKCHGSGGSQ 535


>gi|26347119|dbj|BAC37208.1| unnamed protein product [Mus musculus]
          Length = 550

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 97/320 (30%), Positives = 139/320 (43%), Gaps = 64/320 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 202 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDWSLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++    +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 255 FQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          IP        RK+H     P   P      +         +       +
Sbjct: 315 RVWMCGGGLEIIPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKRTAE 360

Query: 174 IWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDWS-- 217
           +W  E  +        + +  FG++ +R  LR+NL C++FKW LE       V  D S  
Sbjct: 361 VWMDEYKQYYYAARPFALERHFGNIENRLNLRKNLHCQTFKWNLENVYPELRVPPDSSIQ 420

Query: 218 -------GMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLD-- 264
                    C++S  +     + + L PC K  G+    Q W  +   +I ++E CL   
Sbjct: 421 KGNIRQRQKCLES--QKQKKQEILRLSPCAKVKGDGAKSQVWAFTYTQQIIQEELCLSVV 478

Query: 265 --YAGGDVILYPCHGSKGNQ 282
             + G  V+L  C      Q
Sbjct: 479 TLFPGAPVVLALCKNGDERQ 498


>gi|307186272|gb|EFN71935.1| Polypeptide N-acetylgalactosaminyltransferase 35A [Camponotus
           floridanus]
          Length = 667

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 91/304 (29%), Positives = 143/304 (47%), Gaps = 42/304 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV + W++PLL  +A + + V  P+I  I  DTF+    P           GGF+W L 
Sbjct: 295 IEVNEIWIEPLLSRIAYSKTIVPMPVIDIINADTFQYTGSP--------LVRGGFNWGLH 346

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P    K   +  +P+ +PTMAGGLF+ID+ +F K+G YD+G D+WGGENLE+SF
Sbjct: 347 FKWDNLPIGTLKHENDFVKPIKSPTMAGGLFAIDREYFIKIGEYDTGMDVWGGENLEISF 406

Query: 124 KF-----NWHAIPER------ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           +      +   IP         R+R   + +P    TM      +   + ++   Y    
Sbjct: 407 RIWMCGGSIELIPCSRVGHVFRRRRPYGSDDP--HDTMLKNSLRVAHVWMDEYKDY---- 460

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDM-- 230
                  L+ +   D+GD++ R  LR+ L CK+F WYL+V      +  D+  +  D   
Sbjct: 461 ------FLKNAKAIDYGDISERLALRQKLECKTFDWYLKVVYPELTLPDDTEKRLKDKWS 514

Query: 231 ---HKPVGLYPCHKQGGN---QFWM-MSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQY 283
               +P  + P H +  N   Q+ + +S      + E  +   G  +IL PC   K   +
Sbjct: 515 KLEQRP--MQPWHSRKRNYTDQYQIRLSNSVLCIQSEKDIKTKGSKLILMPCLRIKSQMW 572

Query: 284 FEYD 287
           +E D
Sbjct: 573 YETD 576


>gi|395504161|ref|XP_003756425.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
           [Sarcophilus harrisii]
          Length = 563

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 95/314 (30%), Positives = 135/314 (42%), Gaps = 52/314 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L        GGFDW+L 
Sbjct: 222 CEVNSEWLQPMLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 274

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 275 FKWEQIPIEQKMSRTDPTQPIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSF 334

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  K        +   + 
Sbjct: 335 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYDFP--EGNALTYIKNTKRTAEVWMDEYK 387

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGM-------------- 219
            +  E    +    FG +  R+E R+ + CKSF+WYLE  N +  +              
Sbjct: 388 QYYYEARPSAIGKSFGSIADREEQRKKMNCKSFQWYLE--NVYPELKIPEKEVIPGIIKQ 445

Query: 220 ---CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDY----AGG 268
              C++S  + T  +  V +  C     N    Q W+ S    IR+ + CL       G 
Sbjct: 446 GTNCLESQGQDTAGNNLVVMGGCKGTSNNPLMTQEWVFS-DPVIRQQDKCLSITSFSTGS 504

Query: 269 DVILYPCHGSKGNQ 282
            V L  C+     Q
Sbjct: 505 QVTLEVCNQKDDRQ 518


>gi|195334637|ref|XP_002033984.1| GM21620 [Drosophila sechellia]
 gi|194125954|gb|EDW47997.1| GM21620 [Drosophila sechellia]
          Length = 601

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 96/324 (29%), Positives = 149/324 (45%), Gaps = 45/324 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
           CE    W +PLL  +  + + V+ P+I  I  + F+        T+ YK F +GGF WN 
Sbjct: 247 CEGNIGWCEPLLQRIKESRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 300

Query: 63  QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
            F+W  +PERE++R +          P ++PTMAGGLF+ID+ +F ++G+YD   D WGG
Sbjct: 301 HFDWINLPEREKQRQRRECRQEREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGG 360

Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           ENLE+SF+          IP            P   P        I+ A    L   D  
Sbjct: 361 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 418

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
            +I+     +L F  D GDVT R  LR+ L CKSF+WYL                  +V 
Sbjct: 419 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFEWYLKNIYPEKFVPTKDVQGWGKVH 478

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
              + +C+D   +  +     GLYPC K    +Q +  +    +R + +C      +   
Sbjct: 479 AVNANLCLDDLLQNNEKPYNAGLYPCGKVLQKSQLFSFTNTNALRNELSCATVQHSESPP 538

Query: 270 --VILYPC-HGSKGNQYFEYDYKY 290
             V++ PC    + N+ + Y++++
Sbjct: 539 YRVVMVPCMENDEFNEQWRYEHQH 562


>gi|241651003|ref|XP_002411252.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
 gi|215503882|gb|EEC13376.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
          Length = 478

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 98/314 (31%), Positives = 140/314 (44%), Gaps = 62/314 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  +  N   +  P+I  I  DTFE R     +    + F G F+W + 
Sbjct: 173 CEVGINWLPPLLAPIRANRYTMTVPVIDGIDKDTFEYR----PVYHGGQHFRGIFEWGML 228

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IPE E KR K  +EP  +PT AGGLF+ID+ +F KLG YD G  +WGGEN ELSF
Sbjct: 229 YKEIEIPEEEIKRRKYHSEPYKSPTHAGGLFAIDRKYFLKLGGYDPGLLVWGGENFELSF 288

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-------LFSIDKAFFEKLG-----TYDSG 171
           K  W                  W P    G        +S  K   ++ G      Y   
Sbjct: 289 KI-WQC-----------GGSIYWVPCSRVGHVYRGFMPYSFGKLAHKRKGPIVTVNYKRV 336

Query: 172 FDIWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-------------- 210
            ++W  E  E       ++   D GD++ + ELR++LGCK F W++              
Sbjct: 337 VEVWMDEYKEYFYTREPMARHYDPGDLSGQLELRQSLGCKGFDWFMKNVSYDVLKNFPLL 396

Query: 211 -------EVSNDWSGMCIDSACKPTDMHKP--VGLYPCHKQGGNQFWMMSKHGEIRRDEA 261
                  E+    +G C+D+     + H P  V +  CH  GGNQ + ++  G++   E 
Sbjct: 397 PRNIHWGEIRTMVTGQCLDT----MNAHPPSTVSVSSCHGTGGNQIFRLNAEGQLGVGER 452

Query: 262 CLDYAGGDVILYPC 275
           C+D +   + L  C
Sbjct: 453 CVDASSHSMQLVFC 466


>gi|158286701|ref|XP_565317.3| AGAP006881-PA [Anopheles gambiae str. PEST]
 gi|157020594|gb|EAL41927.3| AGAP006881-PA [Anopheles gambiae str. PEST]
          Length = 587

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 96/334 (28%), Positives = 139/334 (41%), Gaps = 71/334 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSH---VVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDW 60
           CE    WL+PLL+++A N  +   V  P I  + + T  L+        +     G FDW
Sbjct: 225 CECLAGWLEPLLELVASNQENRKVVAVPTIDWLNETTLALQ------VGASSGLYGAFDW 278

Query: 61  NLQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 120
           NL F W    +R +   +N  EP  TP MAGGLF I+KAFF +LG YD G  ++GGEN+E
Sbjct: 279 NLSFQWRPRYDRLQAPQENLLEPFDTPVMAGGLFCIEKAFFAQLGWYDPGLQVYGGENME 338

Query: 121 LSFKF-----NWHAIP------------------ERERK---RHKNAAEPVWTPTMAGGL 154
           LSFK          +P                   +ER    R+      VW    A  L
Sbjct: 339 LSFKVWMCGGAIRTVPCSHVAHIQKRNNPYIGSYTKERDLTMRNSLRVAEVWMDEYAEFL 398

Query: 155 FSIDKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE--- 211
           + +   +   L +  S        N+ L          +R++LR  LGCKSF+WYL+   
Sbjct: 399 YRLHPDYRALLASRTSH----SLSNVNLD---------ARRQLRSELGCKSFRWYLQHVF 445

Query: 212 ----------------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGE 255
                             N+   +C+    +     + + L  CH  GG Q W   K GE
Sbjct: 446 PEQDDPSEAQAAGWIRHENEAGQLCLTWPMR----DRSLALLHCHGLGGQQIWFHRKTGE 501

Query: 256 IRRDEACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
           I R+  CL     +V +  C     +  + + Y+
Sbjct: 502 IAREGHCLGVDSAEVTIALCSSEGSSGAYRWLYR 535


>gi|355689592|gb|AER98884.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 [Mustela putorius
           furo]
          Length = 609

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 105/351 (29%), Positives = 149/351 (42%), Gaps = 99/351 (28%)

Query: 4   CEVQKRWL---QPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDW 60
           CEV   WL   QPLL  + ++   VV P+I  I  DT           SS     GGF+W
Sbjct: 248 CEVNVMWLMWLQPLLAAIQQDRRTVVCPVIDIISADTLAY--------SSSPVVRGGFNW 299

Query: 61  NLQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 120
            L F W  +P  E    + A  P+ +PTMAGGLF++++ +F +LG YDSG DIWGGENLE
Sbjct: 300 GLHFKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLE 359

Query: 121 LSFKFNWHAIPERERKRHKNAAEPVWTPTMAGG-LFSIDKA----FFEKLGTYDS--GFD 173
           +SF+                    +W   M GG LF I  +     F K   Y S  G D
Sbjct: 360 ISFR--------------------IW---MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD 396

Query: 174 IWGGENLEL-------------SFKGD-----FGDVTSRKELRRNLGCKSFKWYLE---- 211
                +L L             S + D     +G+++ R ELR+ LGCKSFKWYL+    
Sbjct: 397 TMTHNSLRLAHVWLDDYKEQYFSLRPDLRTKSYGNISERVELRKKLGCKSFKWYLDNIYP 456

Query: 212 -------------------------------VSNDWSGMCIDSACKPTDMHKPVGLYPCH 240
                                          + +  +  C+ +  +P+     V L  C 
Sbjct: 457 EMQISGPNAKPQQPIFINRGPKRPKILQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACD 516

Query: 241 KQGGNQFWMMS-KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEY 286
               +Q W+ + +H  +  +  CLD     +     L  CHGS G+Q + +
Sbjct: 517 YSDPSQIWIYNEEHELVLNNLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF 567


>gi|156537099|ref|XP_001602659.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Nasonia
           vitripennis]
          Length = 583

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 97/330 (29%), Positives = 143/330 (43%), Gaps = 65/330 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  +A ++  +  P+I  I   TFE R     +      + G F+W + 
Sbjct: 230 CEVNVNWLPPLLSPIAEDNKVMTVPIIDGIDHKTFEYR----PVYQEGHLYRGIFEWGML 285

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +  + +P+RE K  K+ +EP  +PT AGGLF+I++ +F  LG YD G  +WGGEN ELSF
Sbjct: 286 YKENELPQREAKTRKHNSEPYRSPTHAGGLFAINREYFLSLGGYDEGLLVWGGENFELSF 345

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGL-------FSIDKAFFEKLG-----TYDSG 171
           K  W                 +W P    G        ++  K   +K G      Y   
Sbjct: 346 KI-WQC-----------GGSILWVPCSHVGHVYRGFMPYNFGKLAQKKKGPLITINYKRV 393

Query: 172 FDIWGGENLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL------------- 210
            + W  E  +        L+   D GD+T + E +R  GCKSF+W++             
Sbjct: 394 IETWFDEKHKEFFYTREPLARLLDHGDITEQLEFKRRKGCKSFQWFMDNIAYDVLDKFPE 453

Query: 211 --------EVSNDWSGMCIDSACKPTDMHKPVGLYP---CHKQGGNQFWMMSKHGEIRRD 259
                   E+ N  + MC+D     T  H P  L     CH  G NQ   ++  G++   
Sbjct: 454 LPPNIHWGEMKNVATQMCLD-----TMGHAPPNLMATSHCHGFGNNQLIRLNAKGQLGVG 508

Query: 260 EACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
           E C++  G  V L  C     +  ++YD K
Sbjct: 509 ERCVEADGQGVKLAFCRLGTVDGPWQYDEK 538


>gi|327277504|ref|XP_003223504.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Anolis carolinensis]
          Length = 612

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 101/327 (30%), Positives = 144/327 (44%), Gaps = 61/327 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F      G  T +     G FDW + 
Sbjct: 246 CEANVNWLPPLLDRIARNHKTIVCPMIDVIDHDHF------GYETQAGDAMRGAFDWEMY 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD+G +IWGGE  E+SF
Sbjct: 300 YKRIPIPPELQK--PDPSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISF 357

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPT---MAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   PT   +A  L  + + + ++   Y       
Sbjct: 358 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPTGVSLARNLKRVAEVWMDEYAEY---IYQR 414

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDV ++KELR NL CKSF+W++                      E+ 
Sbjct: 415 RPEYRHLS----AGDVATQKELRSNLNCKSFRWFMNEVAWDLRKFYPPVEPPAAAWGEIH 470

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
           N  + +C+D+  K   +  P+ +  C K  G   W        S   +IR  +       
Sbjct: 471 NVGTSLCVDT--KHGALGSPLKIETCVKSRGEAAWNNVQVFTFSWREDIRPGDPQHTKKF 528

Query: 262 CLDYAGGD--VILYPCHGSKGNQYFEY 286
           C D    +  V LY CHG KGNQ ++Y
Sbjct: 529 CFDAVSHNSPVTLYDCHGMKGNQLWKY 555


>gi|71987784|ref|NP_001022644.1| Protein GLY-6, isoform a [Caenorhabditis elegans]
 gi|51315809|sp|O61394.1|GALT6_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 6;
           AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 6; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 6; Short=pp-GaNTase 6
 gi|3047197|gb|AAC13674.1| GLY6a [Caenorhabditis elegans]
 gi|3878104|emb|CAA19707.1| Protein GLY-6, isoform a [Caenorhabditis elegans]
          Length = 618

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 80/218 (36%), Positives = 111/218 (50%), Gaps = 21/218 (9%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  K WL+PLL  +  N   V  P+I  I D+TF+ +          + F GGF+WNLQ
Sbjct: 254 CECTKGWLEPLLTRIKLNRKAVPCPVIDIINDNTFQYQ-------KGIEMFRGGFNWNLQ 306

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P    K+H  +   P+ +PTMAGGLFSI++ +FE+LG YD G DIWGGENLE+S
Sbjct: 307 FRWYGMPTAMAKQHLLDPTGPIESPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMS 366

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+          +P          + P   P  + G           L   +   D W  
Sbjct: 367 FRIWQCGGRVEILPCSHVGHVFRKSSPHDFPGKSSGKVLNTNL----LRVAEVWMDDWKH 422

Query: 178 ENLELSFKG----DFGDVTSRKELRRNLGCKSFKWYLE 211
              +++ +        DV+ R ELR+ L CKSFKWYL+
Sbjct: 423 YFYKIAPQAHRMRSSIDVSERVELRKKLNCKSFKWYLQ 460


>gi|10436305|dbj|BAB14795.1| unnamed protein product [Homo sapiens]
          Length = 457

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 80/218 (36%), Positives = 109/218 (50%), Gaps = 24/218 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  + + VV P+I  I  DTF           S     GGFDW+L 
Sbjct: 169 CEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY-------IESASELRGGFDWSLH 221

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   ++ R  +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E+SF
Sbjct: 222 FQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISF 281

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P   G   +  K        +   + 
Sbjct: 282 RVWMCGGSLEIVPCSRVGHVFRKKH-----PYVFPD--GNANTYIKNTKRTAEVWMDEYK 334

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
            +       + +  FG+V SR +LR+NL C+SFKWYLE
Sbjct: 335 RYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLE 372


>gi|312068074|ref|XP_003137043.1| polypeptide N-acetylgalactosaminyltransferase [Loa loa]
          Length = 547

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 77/222 (34%), Positives = 114/222 (51%), Gaps = 29/222 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  K W++PLL  +  N   VV P+I  I + TF  +          + F GGF+WNLQ
Sbjct: 230 CECTKGWMEPLLARIKENRKAVVCPVIDVINERTFAYQ-------KGIELFRGGFNWNLQ 282

Query: 64  FNWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+A+P E  + R  +  +P+ +PTMAGGLFSID+ +FE++GTYD   +IWGGEN+E+S
Sbjct: 283 FRWYALPPEMIKSRSNDPTKPIISPTMAGGLFSIDRKYFEEIGTYDHEMNIWGGENIEIS 342

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGG------LFSIDKAFFE--KLGTYD 169
            +          +P          A P   P+   G      L  + + + +  K   Y 
Sbjct: 343 LRVWQCGGRIEILPCSHVGHVFRRASPHDFPSHKSGTILNSNLLRVAEVWMDEWKFHFYR 402

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
           +   ++           +  DV+ R ELR+ L CKSFKW+L+
Sbjct: 403 TAPQVYKMR--------ETVDVSDRVELRKRLHCKSFKWFLD 436


>gi|71987788|ref|NP_001022645.1| Protein GLY-6, isoform b [Caenorhabditis elegans]
 gi|3047199|gb|AAC13675.1| GLY6b [Caenorhabditis elegans]
 gi|14530524|emb|CAC42317.1| Protein GLY-6, isoform b [Caenorhabditis elegans]
          Length = 617

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 80/218 (36%), Positives = 111/218 (50%), Gaps = 21/218 (9%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  K WL+PLL  +  N   V  P+I  I D+TF+ +          + F GGF+WNLQ
Sbjct: 254 CECTKGWLEPLLTRIKLNRKAVPCPVIDIINDNTFQYQ-------KGIEMFRGGFNWNLQ 306

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P    K+H  +   P+ +PTMAGGLFSI++ +FE+LG YD G DIWGGENLE+S
Sbjct: 307 FRWYGMPTAMAKQHLLDPTGPIESPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMS 366

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+          +P          + P   P  + G           L   +   D W  
Sbjct: 367 FRIWQCGGRVEILPCSHVGHVFRKSSPHDFPGKSSGKVLNTNL----LRVAEVWMDDWKH 422

Query: 178 ENLELSFKG----DFGDVTSRKELRRNLGCKSFKWYLE 211
              +++ +        DV+ R ELR+ L CKSFKWYL+
Sbjct: 423 YFYKIAPQAHRMRSSIDVSERVELRKKLNCKSFKWYLQ 460


>gi|393911417|gb|EFO27036.2| polypeptide N-acetylgalactosaminyltransferase [Loa loa]
          Length = 597

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 77/222 (34%), Positives = 114/222 (51%), Gaps = 29/222 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  K W++PLL  +  N   VV P+I  I + TF  +          + F GGF+WNLQ
Sbjct: 219 CECTKGWMEPLLARIKENRKAVVCPVIDVINERTFAYQ-------KGIELFRGGFNWNLQ 271

Query: 64  FNWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+A+P E  + R  +  +P+ +PTMAGGLFSID+ +FE++GTYD   +IWGGEN+E+S
Sbjct: 272 FRWYALPPEMIKSRSNDPTKPIISPTMAGGLFSIDRKYFEEIGTYDHEMNIWGGENIEIS 331

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGG------LFSIDKAFFE--KLGTYD 169
            +          +P          A P   P+   G      L  + + + +  K   Y 
Sbjct: 332 LRVWQCGGRIEILPCSHVGHVFRRASPHDFPSHKSGTILNSNLLRVAEVWMDEWKFHFYR 391

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
           +   ++           +  DV+ R ELR+ L CKSFKW+L+
Sbjct: 392 TAPQVYKMR--------ETVDVSDRVELRKRLHCKSFKWFLD 425


>gi|392923087|ref|NP_001256888.1| Protein GLY-4, isoform c [Caenorhabditis elegans]
 gi|255068800|emb|CBA11615.1| Protein GLY-4, isoform c [Caenorhabditis elegans]
          Length = 480

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 83/227 (36%), Positives = 111/227 (48%), Gaps = 41/227 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E  ++WL+PLL  +A N   VV+P+I  I  D F        L        GGFDW L 
Sbjct: 242 IECNQKWLEPLLARIAENPKAVVAPIIDVINVDNFNYVGASADLR-------GGFDWTLV 294

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  + E+ RK RH +   P+ +PTMAGGLF+I K +F +LGTYD   ++WGGENLE+S
Sbjct: 295 FRWEFMNEQLRKERHAHPTAPIRSPTMAGGLFAISKEWFNELGTYDLDMEVWGGENLEMS 354

Query: 123 FKFNWHAIPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           F+  W      E           RK+H     P   P  +G +F  +             
Sbjct: 355 FRV-WQCGGSLEIMPCSRVGHVFRKKH-----PYTFPGGSGNVFQKNTR---------RA 399

Query: 172 FDIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
            ++W  E   +  K        +FGD+T R  +R  L CKSFKWYLE
Sbjct: 400 AEVWMDEYKAIYLKNVPSARFVNFGDITDRLAIRDRLQCKSFKWYLE 446


>gi|72000999|ref|NP_507850.2| Protein GLY-4, isoform b [Caenorhabditis elegans]
 gi|27151758|emb|CAB81985.3| Protein GLY-4, isoform b [Caenorhabditis elegans]
          Length = 453

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 83/227 (36%), Positives = 111/227 (48%), Gaps = 41/227 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E  ++WL+PLL  +A N   VV+P+I  I  D F        L        GGFDW L 
Sbjct: 242 IECNQKWLEPLLARIAENPKAVVAPIIDVINVDNFNYVGASADLR-------GGFDWTLV 294

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  + E+ RK RH +   P+ +PTMAGGLF+I K +F +LGTYD   ++WGGENLE+S
Sbjct: 295 FRWEFMNEQLRKERHAHPTAPIRSPTMAGGLFAISKEWFNELGTYDLDMEVWGGENLEMS 354

Query: 123 FKFNWHAIPERE-----------RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           F+  W      E           RK+H     P   P  +G +F  +             
Sbjct: 355 FRV-WQCGGSLEIMPCSRVGHVFRKKH-----PYTFPGGSGNVFQKNTR---------RA 399

Query: 172 FDIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
            ++W  E   +  K        +FGD+T R  +R  L CKSFKWYLE
Sbjct: 400 AEVWMDEYKAIYLKNVPSARFVNFGDITDRLAIRDRLQCKSFKWYLE 446


>gi|195172039|ref|XP_002026809.1| GL27027 [Drosophila persimilis]
 gi|194111748|gb|EDW33791.1| GL27027 [Drosophila persimilis]
          Length = 567

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 89/292 (30%), Positives = 141/292 (48%), Gaps = 41/292 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + R+ + +  P+I  I    FE R  P   T ++  F G F+W + 
Sbjct: 238 CEVNLNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYR--PVYGTDNH--FRGIFEWGML 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +  + +P RE++R  + +EP  +PT AGGLF+I++ +F +LG YD G  +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRTHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSID-KAFFEKLGTYDSGFDIWGGENLEL 182
           K  W      ++K+              G L +I+ K   E    +D     +      L
Sbjct: 354 KI-WQCGGSIDKKK--------------GPLITINYKRVIETW--FDDTHKEYFYTREPL 396

Query: 183 SFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGM-----------CIDSACK 226
           +   D GD+T +  L++ LGCKSF+W++     +V + + G+                C 
Sbjct: 397 ARYLDMGDITEQLALKKRLGCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASDGCL 456

Query: 227 PTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPC 275
            +  H+P   +GL  CH  G NQ   ++  G++   E C++     + L  C
Sbjct: 457 DSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGIKLAVC 508


>gi|195583656|ref|XP_002081633.1| GD11122 [Drosophila simulans]
 gi|194193642|gb|EDX07218.1| GD11122 [Drosophila simulans]
          Length = 601

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 96/324 (29%), Positives = 149/324 (45%), Gaps = 45/324 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
           CE    W +PLL  +  + + V+ P+I  I  + F+        T+ YK F +GGF WN 
Sbjct: 247 CEGNIGWCEPLLQRIKESRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 300

Query: 63  QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
            F+W  +PERE++R +          P ++PTMAGGLF+ID+ +F ++G+YD   D WGG
Sbjct: 301 HFDWINLPEREKQRQRRECRQEREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGG 360

Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           ENLE+SF+          IP            P   P        I+ A    L   D  
Sbjct: 361 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 418

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
            +I+     +L F  D GDVT R  LR+ L CKSF+WYL                  +V 
Sbjct: 419 INIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFEWYLKNIYPEKFVPTKDVQGWGKVH 478

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
              + +C+D   +  +     GLYPC K    +Q +  +    +R + +C      +   
Sbjct: 479 AVNANICLDDLLQNNEKPYNAGLYPCGKVLQKSQLFSFTNTNVLRNELSCATVQHSESPP 538

Query: 270 --VILYPC-HGSKGNQYFEYDYKY 290
             V++ PC    + N+ + Y++++
Sbjct: 539 YRVVMVPCMENDEFNEQWRYEHQH 562


>gi|156544564|ref|XP_001602677.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
           [Nasonia vitripennis]
          Length = 637

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/208 (36%), Positives = 106/208 (50%), Gaps = 9/208 (4%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV K WL+PLL  ++ + + V  P+I  I  DTF+         SS     GGF+W L 
Sbjct: 269 IEVNKMWLEPLLARISHSRTIVPMPVIDIINADTFQY--------SSSPLVRGGFNWGLH 320

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P       ++  +P+ +PTMAGGLF++D+ +F +LG YD+G D+WGGENLE+SF
Sbjct: 321 FKWDSLPIGTLSLEQDFVKPIKSPTMAGGLFAMDRKYFFELGEYDAGMDVWGGENLEISF 380

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
           +  W      E                 GG    D      L       D +    L+  
Sbjct: 381 RI-WMCGGSIELIPCSRVGHVFRRRRPYGGNDQQDTMLKNSLRVAYVWMDQYKKYFLKNV 439

Query: 184 FKGDFGDVTSRKELRRNLGCKSFKWYLE 211
            K D+GD+T R++LR+ L CK F WYLE
Sbjct: 440 KKIDYGDITERQQLRQKLHCKDFAWYLE 467


>gi|170056949|ref|XP_001864263.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
 gi|167876550|gb|EDS39933.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
          Length = 608

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 108/327 (33%), Positives = 145/327 (44%), Gaps = 67/327 (20%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           EV   WL PLL+ +A++    V PLI  I  DTFE R       S  +   G FDW  +F
Sbjct: 245 EVNVNWLPPLLEPIAQDYRTCVCPLIDVIVHDTFEYR-------SQDEGKRGAFDW--KF 295

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
            +  +P R      +  EP  +P MAGGLF+I   FF +LG YD G DIWGGE  ELSFK
Sbjct: 296 YYKRLPLRPGDL-DDPTEPFESPIMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFK 354

Query: 125 FNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGT------YDSGFDIWGG 177
             W                 V  P +  G ++     F    G       +    ++W  
Sbjct: 355 I-WQC-----------GGRMVDAPCSRVGHVYRGYSPFPNPRGVNFVTRNFKRVAEVWMD 402

Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE------------------V 212
           E  +  +       K + GD+T +K LR  L CK FKW+LE                   
Sbjct: 403 EYKQFLYERNPQFDKTNPGDLTKQKALRERLKCKPFKWFLEEVAPDLLVRYPLREPLPFA 462

Query: 213 SNDWSGMCIDSACKPTDMHK---PVGLYPC-----HKQGGNQFWMMSKHGEIRRD--EAC 262
           S     +     C  T  HK   P+G++ C     H Q  NQF+ ++ + +IR    E C
Sbjct: 463 SGRVQSVANPKLCLDTLNHKAKEPIGVFGCAPNKTHPQ-NNQFFTLTYYRDIRAASVEKC 521

Query: 263 LDYAGGD--VILYPCHGSKGNQYFEYD 287
           LD +  D  VIL+ CH S+GNQ + YD
Sbjct: 522 LDASSDDAEVILFNCHESQGNQLWRYD 548


>gi|363734723|ref|XP_003641443.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 isoform 2
           [Gallus gallus]
          Length = 557

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 94/312 (30%), Positives = 132/312 (42%), Gaps = 48/312 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L        GGFDW+L 
Sbjct: 216 CEVNSEWLQPMLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 268

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  + + TP +AGG+F I+K++F  LG YD+  DIWGGEN ELSF
Sbjct: 269 FKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVINKSWFNHLGKYDTQMDIWGGENFELSF 328

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  K        +   + 
Sbjct: 329 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYDFP--EGNALTYIKNTKRTAEVWMDEYK 381

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSN---------------DWSG 218
            +  E    +    +G +  R E RR L CKSF+WYLE                     G
Sbjct: 382 QYYYEARPSAIGKSYGSIADRVEQRRKLNCKSFQWYLEKVYPELKVPEKDLIPGIIRQGG 441

Query: 219 MCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDY----AGGDV 270
            C++S  + T  +   G+  C     N    Q W+ S    IR+ + CL       G  +
Sbjct: 442 NCLESWAQDTTGNTLAGIGNCKGTVNNPPVTQEWVFS-DPLIRQQDKCLSITSFSTGSHI 500

Query: 271 ILYPCHGSKGNQ 282
            L  C+   G Q
Sbjct: 501 TLEACNQKDGRQ 512


>gi|307215388|gb|EFN90069.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Harpegnathos
           saltator]
          Length = 493

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 99/324 (30%), Positives = 150/324 (46%), Gaps = 54/324 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + +N++ ++SP+I  I D+TF         T S++   G F+W+L 
Sbjct: 139 CECTVGWLEPLLEAVGKNATRIISPVIDIINDNTFSY-------TRSFELHWGAFNWDLH 191

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +  R  K R ++  EP  TP MAGGLFS+++ +F +LG+YD    IWGGENLELS
Sbjct: 192 FRWLTLNGRLLKERRESIVEPFRTPAMAGGLFSMNRNYFFQLGSYDDQMRIWGGENLELS 251

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTP-----TMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +    P          + P   P      + G L  +   + ++   +   F
Sbjct: 252 FRAWQCGGSIEIAPCSHVGHLFRKSSPYTFPGGVGDILYGNLVRVASVWMDQWAEFYFKF 311

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSND 215
           +    E   L +K     V SR  LR  L CKSF+WYLE                 V + 
Sbjct: 312 N---PEAARLRYK---QQVRSRLALREKLQCKSFEWYLENVWPEHFFPTDDRFFGRVIHA 365

Query: 216 WSGMCIDSACKPTDMHKPVG---LYPC-HKQGGNQFWMMSKHGEIRRDEA-CLDYAGGD- 269
            +  C+          +P G   L+ C  +   +Q ++M+K+G I  DE+ CLD    D 
Sbjct: 366 TTNRCLMRPTAKGSYTQPSGHAVLHSCIPRPMLSQMFVMTKNGVIMTDESVCLDAPERDT 425

Query: 270 ------VILYPCHGSKGNQYFEYD 287
                 V +  C G +  Q ++YD
Sbjct: 426 QQKTPKVKIMACSG-RDRQKWQYD 448


>gi|47216191|emb|CAG01225.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 586

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 94/325 (28%), Positives = 138/325 (42%), Gaps = 79/325 (24%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP++  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 253 CEVNTDWLQPMIQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 305

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F +DK++F +LG YD+  DIWGGEN ELSF
Sbjct: 306 FKWEQIPIEQKMARSDPTQPIRTPVIAGGIFVMDKSWFNRLGQYDTHMDIWGGENFELSF 365

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--------- 169
           +                    VW   M GG   I         F K   Y+         
Sbjct: 366 R--------------------VW---MCGGSLEILPCSRVGHVFRKRHPYEFPEGNALTY 402

Query: 170 -----SGFDIWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLEVSNDWS 217
                   ++W  E  +  +          FG +T R  LR+ L CK F+WY+E  N + 
Sbjct: 403 IRNTRRAAEVWMDEYKQYYYSARPSAQGKAFGSITDRVSLRKKLNCKPFRWYME--NVYP 460

Query: 218 GMCIDSACKPTDMHKP------------VGLYPCHKQGGN----QFWMMSKHGEIRRDEA 261
            + +      T + +             +GL  C   G N    Q W + +   IR+ + 
Sbjct: 461 ELRVPEQEAVTSVLRQGGLCLEARGAEWLGLAECRGVGTNRPQSQRWELIE-PLIRQQDL 519

Query: 262 CLDYA----GGDVILYPCHGSKGNQ 282
           CL  +    G  V + PC+  +  Q
Sbjct: 520 CLAISAFSPGSKVKMEPCNAKEARQ 544


>gi|363734725|ref|XP_001231965.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 isoform 1
           [Gallus gallus]
          Length = 563

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 94/312 (30%), Positives = 132/312 (42%), Gaps = 48/312 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L        GGFDW+L 
Sbjct: 222 CEVNSEWLQPMLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 274

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  + + TP +AGG+F I+K++F  LG YD+  DIWGGEN ELSF
Sbjct: 275 FKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVINKSWFNHLGKYDTQMDIWGGENFELSF 334

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  K        +   + 
Sbjct: 335 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYDFP--EGNALTYIKNTKRTAEVWMDEYK 387

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSN---------------DWSG 218
            +  E    +    +G +  R E RR L CKSF+WYLE                     G
Sbjct: 388 QYYYEARPSAIGKSYGSIADRVEQRRKLNCKSFQWYLEKVYPELKVPEKDLIPGIIRQGG 447

Query: 219 MCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDY----AGGDV 270
            C++S  + T  +   G+  C     N    Q W+ S    IR+ + CL       G  +
Sbjct: 448 NCLESWAQDTTGNTLAGIGNCKGTVNNPPVTQEWVFS-DPLIRQQDKCLSITSFSTGSHI 506

Query: 271 ILYPCHGSKGNQ 282
            L  C+   G Q
Sbjct: 507 TLEACNQKDGRQ 518


>gi|340711409|ref|XP_003394268.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Bombus terrestris]
          Length = 604

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 105/323 (32%), Positives = 145/323 (44%), Gaps = 52/323 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ +A+N + VVSP+I  I DDTF         T S++   G F+W+L 
Sbjct: 251 CECTVGWLEPLLEAVAKNRTRVVSPVIDIINDDTFSY-------TRSFELHWGAFNWDLH 303

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +  R  K R +N  EP  TP MAGGLFS+++ +F +LG+YD    IWGGENLELS
Sbjct: 304 FRWLTLNGRLLKERRENIVEPFRTPAMAGGLFSMNRNYFFELGSYDDQMKIWGGENLELS 363

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY--DSGFDIW 175
           F+      +    P          + P    T  GG+  I      ++     D   + +
Sbjct: 364 FRVWQCGGSIEIAPCSHVGHLFRKSSPY---TFPGGVGEILYGNLARVALVWMDEWAEFY 420

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDW------------------- 216
              N E +   D   V  R ELR+ L CK+F+WYL  +N W                   
Sbjct: 421 FKFNTEAARLRDKQPVRGRLELRKRLQCKNFEWYL--NNIWPEHFFPKDDRFFGRILHIS 478

Query: 217 SGMCIDSACKPTDMHKPVG---LYPCHKQGG-NQFWMMSKHGEIRRDEA-CLDYAGGD-- 269
           S  CI          +P G   L  C  +   +Q ++M+  G I  DE+ CLD    D  
Sbjct: 479 SNKCIMRPTAKGTYSQPSGYAVLETCLPRPILSQMFVMTTDGIIMTDESVCLDAPDHDTQ 538

Query: 270 -----VILYPCHGSKGNQYFEYD 287
                V +  C G    Q + YD
Sbjct: 539 HKTPKVKIMACSG-HSRQKWRYD 560


>gi|148706465|gb|EDL38412.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14, isoform CRA_a [Mus
           musculus]
          Length = 515

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 98/323 (30%), Positives = 141/323 (43%), Gaps = 67/323 (20%)

Query: 4   CEVQKRWLQPLL---DVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDW 60
           CEV + WLQPLL     + ++ + VV P+I  I  DTF           S     GGFDW
Sbjct: 164 CEVNRDWLQPLLHRVKEVLQDYTRVVCPVIDIINLDTFNY-------IESASELRGGFDW 216

Query: 61  NLQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 120
           +L F W  +   ++    +  EP+ TP +AGGLF IDKA+F+ LG YD   DIWGGEN E
Sbjct: 217 SLHFQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFE 276

Query: 121 LSFKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS 170
           +SF+          IP        RK+H     P   P      +         +     
Sbjct: 277 ISFRVWMCGGGLEIIPCSRVGHVFRKKH-----PYVFPDGNANTY---------IKNTKR 322

Query: 171 GFDIWGGENLE-------LSFKGDFGDVTSRKELRRNLGCKSFKWYLE-------VSNDW 216
             ++W  E  +        + +  FG++ +R  LR+NL C++FKWYLE       V  D 
Sbjct: 323 TAEVWMDEYKQYYYAARPFALERPFGNIENRLNLRKNLHCQTFKWYLENVYPELRVPPDS 382

Query: 217 S---------GMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACL 263
           S           C++S  +     + + L PC K  G+    Q W  +   +I ++E CL
Sbjct: 383 SIQKGNIRQRQKCLES--QKQKKQEILRLSPCAKVKGDGAKSQVWAFTYTQQIIQEELCL 440

Query: 264 D----YAGGDVILYPCHGSKGNQ 282
                + G  V+L  C      Q
Sbjct: 441 SVVTLFPGAPVVLALCKNGDERQ 463


>gi|326920610|ref|XP_003206562.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1-like
           [Meleagris gallopavo]
          Length = 509

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 95/312 (30%), Positives = 132/312 (42%), Gaps = 48/312 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L        GGFDW+L 
Sbjct: 168 CEVNSEWLQPMLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 220

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  + + TP +AGG+F I+K++F  LG YD+  DIWGGEN ELSF
Sbjct: 221 FKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVINKSWFNHLGKYDTQMDIWGGENFELSF 280

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  K        +   + 
Sbjct: 281 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYDFP--EGNALTYIKNTKRTAEVWMDEYK 333

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV--------SNDW-------SG 218
            +  E    +    +G +  R E RR L CKSF+WYLE           D         G
Sbjct: 334 QYYYEARPSAIGKSYGSIADRVEQRRKLNCKSFQWYLEKVYPELKVPEKDLIPGIIRQGG 393

Query: 219 MCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDY----AGGDV 270
            C++S  + T  +   G+  C     N    Q W  S    IR+ + CL       G  +
Sbjct: 394 NCLESWAQDTTGNTLAGIGNCKGTVNNPPVTQEWAFS-DPLIRQQDKCLSITSFSTGSQI 452

Query: 271 ILYPCHGSKGNQ 282
            L  C+   G Q
Sbjct: 453 TLEACNQKDGRQ 464


>gi|348533009|ref|XP_003453998.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Oreochromis niloticus]
          Length = 600

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 102/331 (30%), Positives = 141/331 (42%), Gaps = 69/331 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +A+N   +V P+I  I  D F      G  T +     G FDW + 
Sbjct: 235 CEANVNWLPPLLDRIAQNRKAIVCPMIDVIDHDNF------GYDTQAGDAMRGAFDWEMY 288

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   ++   + +EP  +P MAGGLF++D+ +F +LG YD+G +IWGGE  E+SF
Sbjct: 289 YKRIPIPPEMQR--DDPSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISF 346

Query: 124 KFNWHAIPERERKRHKNAAEPV--WTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           K  W      E             + P    G  S+ K             ++W  E  E
Sbjct: 347 KL-WMCGGRMEDIPCSRVGHIYRKYVPYKVPGGISLAKNL-------KRVAEVWMDEYAE 398

Query: 182 LSFKG-------DFGDVTSRKELRRNLGCKSFKWYL----------------------EV 212
             ++          GD+T++KELR  L CKSFKW++                      E+
Sbjct: 399 YVYQRRPEYRHLSAGDMTAQKELRTRLNCKSFKWFMNEVAWDLPKHYPPVEPPAAAWGEI 458

Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI-----RRD-------- 259
            N  SGMC++   K      P+ L  C K  G   W    HG++     R D        
Sbjct: 459 QNVGSGMCME--VKHFVSGSPIRLENCVKGRGEVGW---SHGQVLTFGWREDIRVGDPMH 513

Query: 260 --EACLDYA--GGDVILYPCHGSKGNQYFEY 286
             + C D       V LY CHG KGNQ + Y
Sbjct: 514 TRKLCFDAVSHSSPVTLYDCHGMKGNQLWRY 544


>gi|391345232|ref|XP_003746894.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Metaseiulus occidentalis]
          Length = 585

 Score =  124 bits (311), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 101/302 (33%), Positives = 138/302 (45%), Gaps = 46/302 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV +RWLQPLL  + +N + V  P+I  I  DTFE  + P  L        GGF+W + 
Sbjct: 223 VEVNERWLQPLLVPIQQNQTTVTCPVIDIINADTFE--YSPSPLVK------GGFNWGMH 274

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P+   K  K    P+ +PTMAGGLF+I K  F +LG YD G D+WGGENLELSF
Sbjct: 275 FRWDNLPKGYFKSEKERIAPLPSPTMAGGLFAIHKDEFRRLGEYDWGMDVWGGENLELSF 334

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKR    A      T+A     +   + +    Y     
Sbjct: 335 RIWMCGGSLKIMPCSRVGHVFRKRRPYGASN-GEDTLAKNSLRVANVWMDDYKKY----- 388

Query: 174 IWGGENLELSFKG-DFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHK 232
                 +    K  DFGD+++R ELR  L CKSF WYL+  N +  + + S         
Sbjct: 389 ---YYRMRPDLKDIDFGDISARVELRNRLKCKSFDWYLK--NIYPDLQLPS--------N 435

Query: 233 PVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA------GGDVILYPCH-GSKGNQYFE 285
             GL   +     Q  M  K  +IR D+ C+         GG  +L  C   SK   +FE
Sbjct: 436 RTGLRNVNLYKRKQPTMTGKF-QIRVDKLCVQSQDSIFRRGGAFVLQKCDPHSKKQMWFE 494

Query: 286 YD 287
            +
Sbjct: 495 TE 496


>gi|170039457|ref|XP_001847550.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
 gi|167863027|gb|EDS26410.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
          Length = 619

 Score =  124 bits (311), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 107/327 (32%), Positives = 145/327 (44%), Gaps = 67/327 (20%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           EV   WL PLL+ +A++    V PLI  I  DTFE R       S  +   G FDW  +F
Sbjct: 256 EVNVNWLPPLLEPIAQDYRTCVCPLIDVIVHDTFEYR-------SQDEGKRGAFDW--KF 306

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
            +  +P R      +  EP  +P MAGGLF+I   FF +LG YD G DIWGGE  ELSFK
Sbjct: 307 YYKRLPLRPGDL-DDPTEPFESPIMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFK 365

Query: 125 FNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGT------YDSGFDIWGG 177
             W                 V  P +  G ++     F    G       +    ++W  
Sbjct: 366 I-WQC-----------GGRMVDAPCSRVGHVYRGYSPFPNPRGVNFVTRNFKRVAEVWMD 413

Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE------------------V 212
           E  +  +       K + GD+T +K LR  L CK FKW+LE                   
Sbjct: 414 EYKQFLYERNPQFDKTNPGDLTKQKALREKLKCKPFKWFLEEVAPDLLVRYPLREPLPFA 473

Query: 213 SNDWSGMCIDSACKPTDMHK---PVGLYPC-----HKQGGNQFWMMSKHGEIRRD--EAC 262
           S     +     C  T  HK   P+G++ C     H Q  NQF+ ++ + +IR    E C
Sbjct: 474 SGRVQSVANPKLCLDTLNHKAKEPIGVFGCAPNKTHPQ-NNQFFTLTYYRDIRAASVEKC 532

Query: 263 LDYAG--GDVILYPCHGSKGNQYFEYD 287
           LD +    +VIL+ CH S+GNQ + YD
Sbjct: 533 LDASSDNAEVILFNCHESQGNQLWRYD 559


>gi|349732170|ref|NP_001231847.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1-like [Sus
           scrofa]
          Length = 557

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 95/314 (30%), Positives = 140/314 (44%), Gaps = 50/314 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 214 CEVNTEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 266

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 267 FKWEQIPLEQKIAWTDPTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSF 326

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  +        +   + 
Sbjct: 327 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYNFP--EGNALTYIRNTKRTAEVWMDEYK 379

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDWSGM----- 219
            +  E    +    FG V +R E R+ + CK+F+WYLE         V      +     
Sbjct: 380 QYYYEARPSAIGKAFGSVATRIEQRKKMNCKTFRWYLENVYPELTVPVKEVLPSIIKQGA 439

Query: 220 -CIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRRDEACLDYA------GG 268
            C+++  + T  +  +G+  C     N    Q W+ S H  I++   CL         G 
Sbjct: 440 NCLETQGQDTAGNFLLGMGICRGSAKNPPAAQAWLFSDH-LIQQQGKCLAATSTSISPGS 498

Query: 269 DVILYPCHGSKGNQ 282
            V+L  C+  +G Q
Sbjct: 499 LVVLQGCNPREGRQ 512


>gi|157107410|ref|XP_001649764.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108884050|gb|EAT48275.1| AAEL000639-PA [Aedes aegypti]
          Length = 613

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 105/326 (32%), Positives = 150/326 (46%), Gaps = 65/326 (19%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           EV   WL PL++ +A +    V P I  I  DTF+ R       +  +   G FDW  +F
Sbjct: 250 EVNVNWLPPLIEPIAEDYRTCVCPFIDVIAHDTFQYR-------AQDEGKRGAFDW--KF 300

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
            +  +P R +    +  EP  +P MAGGLF+I   FF +LG YD G DIWGGE  ELSFK
Sbjct: 301 LYKRLPLRAQD-MVDPTEPFESPIMAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFK 359

Query: 125 FNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGT------YDSGFDIWGG 177
             W                 V  P +  G ++     F    GT      +    ++W  
Sbjct: 360 V-WQC-----------GGRMVDAPCSRVGHVYRGYAPFPNPRGTNFVTRNFKRVAEVWMD 407

Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-EVSNDW------------- 216
           E  +  +       + D GD+T +K LR  L CK FKW+L EV+ D              
Sbjct: 408 EYKQFLYERNPQFDQTDAGDLTKQKALRERLQCKPFKWFLEEVAPDLVVRYPLRDPKPFA 467

Query: 217 SGMCIDSA----CKPTDMHK---PVGLYPCHKQ----GGNQFWMMSKHGEIRRD--EACL 263
           SG    +A    C  +  HK   P+G++ C         NQF+ ++ + +IR    + CL
Sbjct: 468 SGRVQSAANPKLCLDSMNHKAKEPIGVFSCAANRTYPQNNQFFTLTYYRDIRVSSVDKCL 527

Query: 264 DYA--GGDVILYPCHGSKGNQYFEYD 287
           D +  G +VIL+ CH S+GNQ ++YD
Sbjct: 528 DASSDGSEVILFNCHESQGNQLWQYD 553



 Score = 40.0 bits (92), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 23/90 (25%), Positives = 43/90 (47%), Gaps = 11/90 (12%)

Query: 205 SFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRR 258
           +  +Y ++       C+D++   ++    V L+ CH+  GNQ W       M +HG+  R
Sbjct: 511 TLTYYRDIRVSSVDKCLDASSDGSE----VILFNCHESQGNQLWQYDTETQMIRHGKPTR 566

Query: 259 DEACLDYAGGDVILYPCHGSKGNQYFEYDY 288
           ++ CLD     V++  C   K  Q +E+ +
Sbjct: 567 NQ-CLDLVERKVVVSKCDHRKKTQRWEWGF 595


>gi|17561826|ref|NP_503512.1| Protein GLY-7 [Caenorhabditis elegans]
 gi|51315810|sp|O61397.1|GALT7_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 7;
           AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 7; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 7; Short=pp-GaNTase 7
 gi|3047203|gb|AAC13677.1| GLY7 [Caenorhabditis elegans]
 gi|373219860|emb|CCD70652.1| Protein GLY-7 [Caenorhabditis elegans]
          Length = 601

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 92/314 (29%), Positives = 141/314 (44%), Gaps = 37/314 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + RN   +  P+I  I  +++E R   G   + +    G F+W L 
Sbjct: 252 CEVNTNWLPPLLAPIKRNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHS---GIFEWGLL 308

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    I ERE    K+ ++P  +PT AGGLF+I++ +F++LG YD G  IWGGE  ELSF
Sbjct: 309 YKETQITERETAHRKHNSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSF 368

Query: 124 KFNWHA------IPERERKRHKNAAEPVWTPTMAGG-LFSIDKAFFEKLGTYDSGFDIWG 176
           K  W        +P         +  P      +G  + SI+      + T+   +  + 
Sbjct: 369 KI-WQCGGGIVFVPCSHVGHVYRSHMPYSFGKFSGKPVISIN--MMRVVKTWMDDYSKYY 425

Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL---------------------EVSND 215
                 +   + GD++++  LR  L CKSFKWY+                     E  N 
Sbjct: 426 LTREPQATNVNPGDISAQLALRDKLQCKSFKWYMENVAYDVLKSYPMLPPNDVWGEARNP 485

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPC 275
            +G C+D   +   +  P+G   CH  GGNQ   ++  G++ + E CL   G  +    C
Sbjct: 486 ATGKCLD---RMGGIPGPMGATGCHGYGGNQLIRLNVQGQMAQGEWCLTANGIRIQANHC 542

Query: 276 HGSKGNQYFEYDYK 289
                N ++ YD K
Sbjct: 543 VKGTVNGFWSYDRK 556


>gi|432098984|gb|ELK28470.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Myotis davidii]
          Length = 501

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 99/307 (32%), Positives = 135/307 (43%), Gaps = 53/307 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  + ++   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 212 CECTVGWLEPLLARIKQDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 264

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F+++GTYD+G DIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324

Query: 123 FKFNWH--AIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSGFDIWGGE- 178
           F+  W      E     H        TP T  GG   I      +L       ++W  E 
Sbjct: 325 FRI-WQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLA------EVWMDEF 377

Query: 179 -NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWS----------GMCIDSACKP 227
            N       D G       +   L C S    + V   +S           +C+D +   
Sbjct: 378 KNFFYIISPDIG------RIEHWLYCDSLHGGMLVFQVFSYTANKEIRTDDLCLDVS--- 428

Query: 228 TDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEACLDYAGGDVILYP----CHG 277
             ++ PV +  CH   GNQ W      +  +H        CLD A  +    P    C G
Sbjct: 429 -KLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVN---SNQCLDKATEEDSQVPSIRDCSG 484

Query: 278 SKGNQYF 284
            +  Q+ 
Sbjct: 485 GRSQQWL 491



 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 72/215 (33%), Positives = 108/215 (50%), Gaps = 45/215 (20%)

Query: 107 YDSGFDI-WGGENLELSFKFNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEK 164
           Y +G D+ +GG N +L+F+  W+ +P+RE  R K +   PV TPTMAGGLFSID+ +F++
Sbjct: 248 YMAGSDMTYGGFNWKLNFR--WYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQE 305

Query: 165 LGTYDSGFDIWGGENLELS-----------------------------FKGDFGDVTSRK 195
           +GTYD+G DIWGGENLE+S                             F G  G + ++ 
Sbjct: 306 IGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKN 365

Query: 196 ELR-RNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHG 254
             R   +    FK +  + +   G  I+       +H  + ++        Q +  + + 
Sbjct: 366 NRRLAEVWMDEFKNFFYIISPDIGR-IEHWLYCDSLHGGMLVF--------QVFSYTANK 416

Query: 255 EIRRDEACLDYA--GGDVILYPCHGSKGNQYFEYD 287
           EIR D+ CLD +   G V +  CH  KGNQ +EYD
Sbjct: 417 EIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYD 451


>gi|313234048|emb|CBY19624.1| unnamed protein product [Oikopleura dioica]
          Length = 827

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 93/297 (31%), Positives = 134/297 (45%), Gaps = 67/297 (22%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    W +PLL+ +A + + V++P+I  I   TF      G  T +   F G F WNL 
Sbjct: 541 CECFPGWAEPLLERIAEDPTRVMTPVIEVIDAGTFRT----GE-TKTANIFKGVFGWNLV 595

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           FNW  A   +        A P+ +PTMAGGLF++DKA+F  LGTYD    IWGGENLE+S
Sbjct: 596 FNWIEAYGPKNPYTSAYEARPIRSPTMAGGLFTMDKAYFNWLGTYDEEMKIWGGENLEMS 655

Query: 123 FK-------FNW-------HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY 168
           F+       FN+       H   E+    H    EP+   ++                  
Sbjct: 656 FRVTNCSEFFNYHLCRFGCHVFREKSPYSHPGGEEPIMRNSIRVA--------------- 700

Query: 169 DSGFDIWGGENLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYLE--------V 212
               D+W  E  E+ F+         D G+++SR +LR NL C+ F WY+E         
Sbjct: 701 ----DVWLDEFKEVYFRRGAPILKNIDPGNMSSRIQLRENLQCQPFSWYMENVLPELDYT 756

Query: 213 SND---WSGMCIDSACKPTDMH---------KPVGLYPCHKQGGNQFWMMSKHGEIR 257
            ND   ++G  I  A   T+             + ++PCH   GNQ++  +   +IR
Sbjct: 757 MNDDLIFAGEIISQARNRTNRQCFDSTGKDNAQIQIFPCHGLLGNQYYEYTNIKDIR 813


>gi|427778457|gb|JAA54680.1| Putative polypeptide n-acetylgalactosaminyltransferase
           [Rhipicephalus pulchellus]
          Length = 568

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 75/221 (33%), Positives = 108/221 (48%), Gaps = 36/221 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL+P+L  +  N + V  P+I  I  DTFE    P           GGF+W L 
Sbjct: 205 CEVNVGWLEPMLARIGANRTTVTCPVIDIINADTFEYSASP--------IVRGGFNWGLH 256

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W + P    +  + A +P+ +PTMAGGLF++D+ +F +LG YD G DIWGGENLE+SF
Sbjct: 257 FKWESPPRL--RGPQQAIDPIPSPTMAGGLFAMDRQYFHELGEYDDGMDIWGGENLEISF 314

Query: 124 KF-----NWHAIPERER----KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDI 174
           +          +P        +R +    P    T+      +   + ++  TY      
Sbjct: 315 RIWMCGGRLEILPCSRVGHVFRRRRPYGSPSGEDTLTKNSLRVAHVWMDEYKTY------ 368

Query: 175 WGGENLELSFKGD-----FGDVTSRKELRRNLGCKSFKWYL 210
                  L  + D     +GDV++RKELR+ L C SF WY+
Sbjct: 369 ------YLQTRRDARNQWYGDVSARKELRKRLKCHSFDWYM 403


>gi|158286608|ref|XP_308833.4| AGAP006925-PA [Anopheles gambiae str. PEST]
 gi|157020549|gb|EAA04096.4| AGAP006925-PA [Anopheles gambiae str. PEST]
          Length = 622

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 90/320 (28%), Positives = 141/320 (44%), Gaps = 66/320 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYK-FFIGGFDWNL 62
           CE   +WL+PLL+ +  + + V+ P+I  I    F         T+ Y  F IGGF W+ 
Sbjct: 264 CECMVQWLEPLLERIKESPTSVLVPIIDVIEAKNFYYS------TNDYNDFQIGGFTWDG 317

Query: 63  QFNWHAIPERERKRHKNAAE-------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 115
            F+WH + +RER+R K           P ++PTMAGGLF+I + +F  +G+YD   D WG
Sbjct: 318 HFDWHDVTKRERERQKRECAEKDLEICPTYSPTMAGGLFAIARDYFWDIGSYDEQMDGWG 377

Query: 116 GENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPT--MAGGLFSIDKAFFEKLGTY 168
           GENLE+SF+          IP            P   P      G+ ++  A        
Sbjct: 378 GENLEMSFRVWQCGGTLETIPCSRIGHIFRDFHPYSFPNDRDTHGINTVRMAI------- 430

Query: 169 DSGFDIWGGENLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYLE--------- 211
                +W  + +EL +          + GDVT RK LR  L CKSF WY++         
Sbjct: 431 -----VWMDDYVELLYLNRPDLKDHPELGDVTHRKVLREKLHCKSFDWYMKNVYPEKFIP 485

Query: 212 ---------VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQ--GGNQFWMMSKHGEIRRDE 260
                    +++    +C+D+  +  D    +G+Y C K     +Q + ++K   +R + 
Sbjct: 486 TRNVRAFGRLASQADNLCLDTLQQNADKPWNLGIYTCFKPEVSASQLFSLTKRNVLRNER 545

Query: 261 ACLDYAGGD-----VILYPC 275
           +C            V++ PC
Sbjct: 546 SCATVQASKSESKFVVMIPC 565


>gi|427794265|gb|JAA62584.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Rhipicephalus pulchellus]
          Length = 591

 Score =  124 bits (310), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 75/221 (33%), Positives = 108/221 (48%), Gaps = 36/221 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL+P+L  +  N + V  P+I  I  DTFE    P           GGF+W L 
Sbjct: 223 CEVNVGWLEPMLARIGANRTTVTCPVIDIINADTFEYSASP--------IVRGGFNWGLH 274

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W + P    +  + A +P+ +PTMAGGLF++D+ +F +LG YD G DIWGGENLE+SF
Sbjct: 275 FKWESPPRL--RGPQQAIDPIPSPTMAGGLFAMDRQYFHELGEYDDGMDIWGGENLEISF 332

Query: 124 KF-----NWHAIPERER----KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDI 174
           +          +P        +R +    P    T+      +   + ++  TY      
Sbjct: 333 RIWMCGGRLEILPCSRVGHVFRRRRPYGSPSGEDTLTKNSLRVAHVWMDEYKTY------ 386

Query: 175 WGGENLELSFKGD-----FGDVTSRKELRRNLGCKSFKWYL 210
                  L  + D     +GDV++RKELR+ L C SF WY+
Sbjct: 387 ------YLQTRRDARNQWYGDVSARKELRKRLKCHSFDWYM 421


>gi|332021082|gb|EGI61469.1| Polypeptide N-acetylgalactosaminyltransferase 35A [Acromyrmex
           echinatior]
          Length = 580

 Score =  124 bits (310), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 90/304 (29%), Positives = 141/304 (46%), Gaps = 42/304 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV + W++PLL  +A + + +  P+I  I  DTF+    P           GGF+W L 
Sbjct: 208 IEVNEIWIEPLLSRIAYSRNIIPMPVIDIINADTFQYTGSP--------LVRGGFNWGLH 259

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P        +  +P+ +PTMAGGLF+ID+ +F K+G YD G DIWGGENLE+SF
Sbjct: 260 FKWDNLPIGTLNHDVDFVKPIKSPTMAGGLFAIDREYFTKMGEYDIGMDIWGGENLEISF 319

Query: 124 KF-----NWHAIPER------ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           +      +   IP         R+R   + +P    TM      +   + ++   Y    
Sbjct: 320 RIWMCGGSIELIPCSRVGHVFRRRRPYGSDDP--QDTMLKNSLRVAHVWMDEYKDY---- 373

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDM-- 230
                  L+ +   D+GD++ R  LR+ L CK+F WYL+V      +  D+  +  D   
Sbjct: 374 ------FLKNAKTIDYGDISERLALRQKLKCKTFGWYLKVVYPELTLPDDTERRLKDKWA 427

Query: 231 ---HKPVGLYPCHKQGGN---QFWM-MSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQY 283
               +P  + P H +  N   Q+ + +S      + E  +   G  +IL PC   K   +
Sbjct: 428 KLDQRP--MQPWHSRKRNYTDQYQIRLSNTALCIQSEKDIKTKGSKLILMPCLRVKSQMW 485

Query: 284 FEYD 287
           +E D
Sbjct: 486 YETD 489


>gi|225007540|ref|NP_001070030.2| polypeptide N-acetylgalactosaminyltransferase 11 [Danio rerio]
          Length = 590

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 97/305 (31%), Positives = 138/305 (45%), Gaps = 47/305 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  N   VV P+I  I  DT  L + P  +        GGF+W L 
Sbjct: 234 CEVNEAWLQPLLTPIKENRKTVVCPVIDIISADT--LVYTPSPIVR------GGFNWGLH 285

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E      A   + +PTMAGGLF++D+ +F +LG YD G DIWGGENLE+SF
Sbjct: 286 FKWDPVPMSELNSPDGA---IRSPTMAGGLFAMDRNYFYELGQYDRGMDIWGGENLEISF 342

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          +P        RKR +    P    TMA     +   +       D   +
Sbjct: 343 RIWMCGGQLLIVPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWM------DDYKE 395

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKP 233
            +     EL  + D+GD++ R  +R+ L C SFKWYL+  N +  M + S  KP      
Sbjct: 396 QYFALRPELRNR-DYGDISERVSIRKRLQCHSFKWYLD--NIYPEMQVSSPHKPQQ---- 448

Query: 234 VGLYPCHKQGGNQFWMMSKHGEIRR--DEACL------DYAGGDVILYPCHGSKGNQYFE 285
               P     G +   + + G +R    + CL         GG V++  C      Q + 
Sbjct: 449 ----PVFINKGLKRPKVLQRGRLRNLLADKCLVAQGRPSQKGGAVVVKDCDPQDPEQEWA 504

Query: 286 YDYKY 290
           YD ++
Sbjct: 505 YDEEH 509


>gi|391332245|ref|XP_003740546.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
           N-acetylgalactosaminyltransferase 10-like [Metaseiulus
           occidentalis]
          Length = 590

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 100/329 (30%), Positives = 142/329 (43%), Gaps = 70/329 (21%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           E    WL PLLD +ARN   VV P I  I  +TF  R       S  +   G FDW L +
Sbjct: 235 EANVNWLPPLLDPIARNRRTVVCPFIDVIHYETFAYR-------SQDEGARGAFDWELYY 287

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
               +   + KR     EP  +P MAGGLF+ID+++F +LG YD G D+WGGE  ELSFK
Sbjct: 288 KRLPLLSEDLKR---PTEPFRSPVMAGGLFAIDRSYFWELGGYDEGLDVWGGEQYELSFK 344

Query: 125 FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-------TYDSGFDIWGG 177
             W               +    P    G      A F   G        Y    ++W  
Sbjct: 345 I-WQC-----------GGQMFDAPCSRVGHIYRKFAPFPNPGIGDFVGRNYRRVAEVWMD 392

Query: 178 ENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE------------------- 211
           E  E  +          +GDV+ +K LR+ L CK FKW++E                   
Sbjct: 393 EYKEFLYNRRPHYRTLGYGDVSKQKALRKKLKCKPFKWFMETVAFDQPLRYPPVEPPDFA 452

Query: 212 ---VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIR--RDEAC 262
              + N  +  C+D+  K  +  K   L  C    G+    Q ++++ H ++R  +   C
Sbjct: 453 WGAIRNVGADKCLDTKFK--EQGKRFSLETCISSNGDVSGEQNFVLTWHKDLRPAKRNVC 510

Query: 263 LDYAGGD----VILYPCHGSKGNQYFEYD 287
            D + G+    V+L+ CHG  GNQ F+Y+
Sbjct: 511 FDVSSGEKKAPVVLWTCHGMHGNQLFKYN 539


>gi|115313271|gb|AAI24298.1| Zgc:153274 [Danio rerio]
          Length = 590

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 97/305 (31%), Positives = 138/305 (45%), Gaps = 47/305 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  N   VV P+I  I  DT  L + P  +        GGF+W L 
Sbjct: 234 CEVNEAWLQPLLTPIKENRKTVVCPVIDIISADT--LVYTPSPIVR------GGFNWGLH 285

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E      A   + +PTMAGGLF++D+ +F +LG YD G DIWGGENLE+SF
Sbjct: 286 FKWDPVPMSELNSPDGA---IRSPTMAGGLFAMDRNYFYELGQYDRGMDIWGGENLEISF 342

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +          +P        RKR +    P    TMA     +   +       D   +
Sbjct: 343 RIWMCGGQLLIVPCSRVGHIFRKR-RPYGSPGGQDTMAHNSLRLAHVWM------DDYKE 395

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKP 233
            +     EL  + D+GD++ R  +R+ L C SFKWYL+  N +  M + S  KP      
Sbjct: 396 QYFALRPELRNR-DYGDISERVSIRKRLQCHSFKWYLD--NIYPEMQVSSPHKPQQ---- 448

Query: 234 VGLYPCHKQGGNQFWMMSKHGEIRR--DEACL------DYAGGDVILYPCHGSKGNQYFE 285
               P     G +   + + G +R    + CL         GG V++  C      Q + 
Sbjct: 449 ----PVFINKGLKRPKVLQRGRLRNLLADKCLVAQGRPSQKGGAVVVKDCDPQDPEQEWA 504

Query: 286 YDYKY 290
           YD ++
Sbjct: 505 YDEEH 509


>gi|410914862|ref|XP_003970906.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Takifugu rubripes]
          Length = 600

 Score =  123 bits (309), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 100/331 (30%), Positives = 142/331 (42%), Gaps = 69/331 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +A+N   +V P+I  I  D F      G  T +     G FDW + 
Sbjct: 235 CEANVNWLPPLLDRIAQNRKSIVCPMIDVIDHDNF------GYDTQAGDAMRGAFDWEMY 288

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   ++   + ++P  +P MAGGLF++D+ +F +LG YD+G +IWGGE  E+SF
Sbjct: 289 YKRIPIPAEMQR--DDPSQPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISF 346

Query: 124 KFNWHAIPERERKRHKNAAEPV--WTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           K  W      E             + P    G  S+ K             ++W  E  E
Sbjct: 347 KV-WMCGGRMEDIPCSRVGHIYRKYVPYKVPGGISLAKNL-------KRVAEVWMDEYAE 398

Query: 182 LSFKG-------DFGDVTSRKELRRNLGCKSFKWYL----------------------EV 212
             ++          GD+T +KELR  LGCK+FKW++                      E+
Sbjct: 399 YVYQRRPEYRHLSAGDMTPQKELRSRLGCKNFKWFMSNVAWDLPKHYPPVEPPAAAWGEI 458

Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI-----RRD-------- 259
            N  SG+C++   K      P+ L  C K  G   W    HG++     R D        
Sbjct: 459 QNVGSGLCME--IKHFVSGSPIRLENCVKSRGEVGW---SHGQVLTFGWREDIRVGDPMH 513

Query: 260 --EACLDYAGGD--VILYPCHGSKGNQYFEY 286
             + C D    +  V LY CHG KGNQ + Y
Sbjct: 514 TRKVCFDAVSHNSPVTLYDCHGMKGNQLWRY 544


>gi|332030446|gb|EGI70134.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Acromyrmex
           echinatior]
          Length = 595

 Score =  123 bits (309), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 94/297 (31%), Positives = 139/297 (46%), Gaps = 40/297 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL+ + +N++ +V+P+I  I D+TF         T S++   G F+W+L 
Sbjct: 260 CECTIGWLEPLLEAVGKNATRIVAPVIDIINDNTFSY-------TRSFELHWGAFNWDLH 312

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +  R  K R  N  EP  TP MAGGLFS+++ +F KLG+YD    IWGGENLELS
Sbjct: 313 FRWLTLNGRLLKERRDNIVEPFRTPAMAGGLFSMNRDYFFKLGSYDDQMRIWGGENLELS 372

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLF--SIDKAFFEKLGTYDSGFDIW 175
           F+      +    P          + P   P   G +   ++ +     +  +   +  +
Sbjct: 373 FRAWQCGGSIEIAPCSHVGHLFRKSSPYTFPGGVGDILYGNLARVALVWMDQWAEFYFKF 432

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-----------------VSNDWSG 218
             E   L +K     V SR  LR  L CKSF+WYLE                 V +  + 
Sbjct: 433 NPEAARLRYK---QQVRSRLALREKLQCKSFEWYLENVWPEHFFPTDDRFFGRVVHAGTK 489

Query: 219 MCIDSACKPTDMHKPVG---LYPC-HKQGGNQFWMMSKHGEIRRDEA-CLDYAGGDV 270
            CI          +P G   L+ C  +   +Q ++M+K+G I  DE+ CLD    D+
Sbjct: 490 KCIMRPAAKGSYGQPSGNAVLHSCIPRPMLSQMFVMTKNGVIMTDESVCLDAPERDM 546


>gi|170051778|ref|XP_001861920.1| polypeptide N-acetylgalactosaminyltransferase 12 [Culex
           quinquefasciatus]
 gi|167872876|gb|EDS36259.1| polypeptide N-acetylgalactosaminyltransferase 12 [Culex
           quinquefasciatus]
          Length = 601

 Score =  123 bits (309), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 88/311 (28%), Positives = 139/311 (44%), Gaps = 48/311 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE   +WL+PLL+ +  + + V+ P+I     D  E +           F IGGF W+  
Sbjct: 244 CECMPQWLEPLLERIRESRTSVLVPII-----DVIEAKNFFYSTNGFTDFQIGGFTWDGH 298

Query: 64  FNWHAIPERERKRHKN-------AAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
           F+WH + +RE++R K        A  P ++PTMAGGLF+I + +F ++G+YD   D WGG
Sbjct: 299 FDWHDVTQREKERQKRECSEKDVAICPTYSPTMAGGLFAISRDYFWEIGSYDEQMDGWGG 358

Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPT--MAGGLFSIDKAFFEKLGTYD 169
           ENLE+SF+          IP            P   P      G+ ++  A        D
Sbjct: 359 ENLEMSFRVWQCGGTLETIPCSRIGHIFRDFHPYSFPNDRDTHGINTVRMATV----WMD 414

Query: 170 SGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
              D+      +L    + GDVT R+ LR  L CKSF WY++                  
Sbjct: 415 DYIDLLYLNRPDLRDHPEVGDVTHRRVLREKLRCKSFDWYMKNVYPEKFIPTRNVRAFGR 474

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQ--GGNQFWMMSKHGEIRRDEACLDYAGGD 269
           V++    +C+D+  +  D    +G+Y C +     +Q   ++K G +R + +C       
Sbjct: 475 VTSLAENLCLDTLQQNADKPWNLGIYTCFRTEVSASQLMSLTKRGVLRTERSCATVQDNK 534

Query: 270 -----VILYPC 275
                V++ PC
Sbjct: 535 ADTRYVVMIPC 545


>gi|195384663|ref|XP_002051034.1| GJ22477 [Drosophila virilis]
 gi|194145831|gb|EDW62227.1| GJ22477 [Drosophila virilis]
          Length = 598

 Score =  123 bits (309), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 90/308 (29%), Positives = 136/308 (44%), Gaps = 44/308 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
           CE    W +PLL  +  + + V+ P+I  I  + F+        T+ YK F +GGF WN 
Sbjct: 243 CEANVGWCEPLLQRIKDSRTSVLVPIIDVIDANDFQYS------TNGYKSFQVGGFQWNG 296

Query: 63  QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
            F+W  + ERE+ R            P ++PTMAGGLF++D+ +F ++G+YD   D WGG
Sbjct: 297 HFDWVNLSEREKLRQSRECSQPREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGG 356

Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           ENLE+SF+          IP            P   P        I+ A    L   D  
Sbjct: 357 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 414

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
            +++     +L F  D GDVT R  LR+ L CKSF WYL+                  + 
Sbjct: 415 INVFFLNRPDLKFHADIGDVTHRVMLRKKLRCKSFDWYLKNVYPEKFVPNKNVKAWGRIK 474

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
              + +C D      +    +GLYPC K+   +Q +  +K   +R + +C          
Sbjct: 475 AVHANLCADDLLSNNEKPYNLGLYPCGKELQKSQLFSYTKSQVLRNEISCATVQHSSSPP 534

Query: 270 --VILYPC 275
             +++ PC
Sbjct: 535 YRIVMVPC 542


>gi|355689613|gb|AER98891.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 [Mustela putorius
           furo]
          Length = 302

 Score =  123 bits (309), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 82/249 (32%), Positives = 121/249 (48%), Gaps = 36/249 (14%)

Query: 58  FDWNLQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 117
           FDW L F W ++P+ E++R K+   P+ TPT AGGLFSI K +FE +GTYD   +IWGGE
Sbjct: 1   FDWILSFGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGE 60

Query: 118 NLELSFKFNWHAIPERERK---------RHKNAAE-PVWTPTMAGGLFSIDKAFFEKLGT 167
           N+E+SF+  W    + E           R K+    P  T  +A     + + + ++   
Sbjct: 61  NIEMSFRV-WQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQVIARNQVRLAEVWMDE--- 116

Query: 168 YDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------- 211
           Y   F     +  ++  +  FGD++ R E++  L CK+F WYL                 
Sbjct: 117 YKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNTIYPEAYVPDLNPVIS 176

Query: 212 --VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD---EACLDYA 266
             + +    +C+D   +     KP+ LY CH  GGNQ++  S   EIR +   E CL  A
Sbjct: 177 GYIKSVGQPLCLDVG-ENNQGGKPLILYTCHGLGGNQYFEYSAQHEIRHNIQRELCLHAA 235

Query: 267 GGDVILYPC 275
            G V L  C
Sbjct: 236 QGLVQLRAC 244


>gi|410914790|ref|XP_003970870.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
           partial [Takifugu rubripes]
          Length = 552

 Score =  123 bits (309), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 100/331 (30%), Positives = 143/331 (43%), Gaps = 69/331 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +A+N   +V P+I  I  D F      G  T +     G FDW + 
Sbjct: 187 CEANVNWLPPLLDRIAQNRKTIVCPMIDVIDHDNF------GYETQAGDAMRGAFDWEMY 240

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K  ++ +EP  +P MAGGLF++D+ +F +LG YD+G +IWGGE  E+SF
Sbjct: 241 YKRIPIPLELQK--EDPSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISF 298

Query: 124 KFNWHAIPERERKRHKNAAEPV--WTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           K  W      E             + P    G  S+ +             ++W  E  E
Sbjct: 299 KV-WMCGGRMEDTPCSRVGHIYRKYVPYKVPGGVSLARNL-------KRVAEVWMDEYAE 350

Query: 182 LSFKGD-------FGDVTSRKELRRNLGCKSFKWYL----------------------EV 212
             ++          GD+  +K+LR  L CKSFKW++                      E+
Sbjct: 351 YIYQRRPEYRHLAAGDMAVQKDLRSQLNCKSFKWFMTKVAWDLPKHYPPVEPPAAAWGEI 410

Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI-----RRD-------- 259
            N  SGMC+++  K      PV +  C K  G   W    HG++     R D        
Sbjct: 411 RNVASGMCLET--KHFASGSPVRMESCLKGRGEGGW---SHGQVFTFGWREDIRVGDPMH 465

Query: 260 --EACLDYAGGD--VILYPCHGSKGNQYFEY 286
             + C D    +  V LY CHG KGNQ++ Y
Sbjct: 466 TKKVCFDAVSNNSPVTLYDCHGMKGNQFWHY 496


>gi|348510947|ref|XP_003443006.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1-like
           [Oreochromis niloticus]
          Length = 567

 Score =  123 bits (309), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 99/328 (30%), Positives = 141/328 (42%), Gaps = 85/328 (25%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP++  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 229 CEVNTDWLQPMIQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 281

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  + + TP +AGG+F +D+++F  LG YD+  DIWGGEN ELSF
Sbjct: 282 FKWEQIPIEQKMARSDPTQAIRTPVIAGGIFVMDRSWFNHLGQYDTHMDIWGGENFELSF 341

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--------- 169
           +                    VW   + GG   I         F K   YD         
Sbjct: 342 R--------------------VW---LCGGSLEILPCSRVGHVFRKRHPYDFPEGNALTY 378

Query: 170 -----SGFDIWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE------ 211
                   ++W  E  +  +          FG VT R  LRR L CK F+WY+E      
Sbjct: 379 IKNTRRAAEVWMDEYKQYYYSARPSAQGKAFGSVTDRLALRRKLNCKPFRWYMENVYPEL 438

Query: 212 -------VSN--DWSGMCIDSACKPTDMHKPVGLYPCHKQGGN----QFWMMSKHGEIRR 258
                  VS+     G+C+++  + TD    +GL  C   G N    Q W + +   IR+
Sbjct: 439 RVPEQEAVSSVLKQGGLCLET--RGTD---GLGLAECRGLGANRPQSQRWELIE-PLIRQ 492

Query: 259 DEACLDY----AGGDVILYPCHGSKGNQ 282
            + CL      AG  V + PC+  +  Q
Sbjct: 493 QDLCLAISAFTAGSKVKMEPCNTKEPRQ 520


>gi|292623437|ref|XP_001339749.3| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1-like
           [Danio rerio]
          Length = 567

 Score =  123 bits (309), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 95/322 (29%), Positives = 133/322 (41%), Gaps = 74/322 (22%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP++  +  + S VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 230 CEVNTDWLQPMIQRVKEDHSRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 282

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  +P+ TP +AGG+F I+K +F  LG YD+  DIWGGEN ELSF
Sbjct: 283 FKWEQIPIEQKMARNDPTQPIRTPVIAGGIFVIEKGWFNHLGQYDTHMDIWGGENFELSF 342

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--------- 169
           +                    VW   M GG   I         F K   YD         
Sbjct: 343 R--------------------VW---MCGGSLEILPCSRVGHVFRKRHPYDFPEGNALTY 379

Query: 170 -----SGFDIWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYL------- 210
                   ++W  +  +  +          FG +  R  L+R L C SF+WYL       
Sbjct: 380 IKNTRRAAEVWMDDYKQYYYAARPSAQGKAFGSIADRLALKRKLNCNSFRWYLENVYPEL 439

Query: 211 ---EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQ---GGNQFWMMSKHGEIRRDEACLD 264
              E    +S +     C  +     +GL  C        +Q W + +  +IR+ + CL 
Sbjct: 440 KIPEQEEAYSLLKQGGLCLESHGTDSLGLAECRSTPSIPASQKWTLIE-PQIRQHDLCLA 498

Query: 265 Y----AGGDVILYPCHGSKGNQ 282
                AG  V L PC+  +  Q
Sbjct: 499 ITAFTAGSKVRLEPCNIKESRQ 520


>gi|402594510|gb|EJW88436.1| hypothetical protein WUBG_00649 [Wuchereria bancrofti]
          Length = 612

 Score =  123 bits (309), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 84/237 (35%), Positives = 117/237 (49%), Gaps = 49/237 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  K W++PLL  +  N   VV P+I  I + TF  +          + F GGF+WNLQ
Sbjct: 201 CECTKGWMEPLLARIKENRKAVVCPVIDIINERTFAYQ-------KGIELFRGGFNWNLQ 253

Query: 64  FNWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+A+P E  + R  +  +P+ +PTMAGGLFSID+ +FE++GTYD   DIWGGEN+E+S
Sbjct: 254 FRWYALPPEMIKSRSDDPTKPIISPTMAGGLFSIDRKYFEEIGTYDHEMDIWGGENIEIS 313

Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD-----SGF 172
            +           K  K     VW     GG   I         F +   +D     SG 
Sbjct: 314 LRL----------KLLKKNCFLVW---QCGGRVEILPCSHVGHVFRRTSPHDFPGRKSGT 360

Query: 173 ----------DIWGGE-------NLELSFK-GDFGDVTSRKELRRNLGCKSFKWYLE 211
                     ++W  E           ++K  +  DV+ R ELR+ L CKSFKW+L+
Sbjct: 361 ILNSNLLRVAEVWMDEWKFHFYRTAPQAYKMRETVDVSDRVELRKRLHCKSFKWFLD 417


>gi|241622516|ref|XP_002407424.1| pp-GalNAc-transferase, putative [Ixodes scapularis]
 gi|215500988|gb|EEC10482.1| pp-GalNAc-transferase, putative [Ixodes scapularis]
          Length = 471

 Score =  123 bits (308), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 103/330 (31%), Positives = 145/330 (43%), Gaps = 72/330 (21%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           E    WL PLL+ +A++   VV P I  I  +TF  R       +  +   G FDW L +
Sbjct: 116 EANTNWLPPLLEPIAKDYRTVVCPFIDVIDYETFAYR-------AQDEGARGSFDWELYY 168

Query: 65  N-WHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
                +P+   K      EP  +P MAGGLF+I + +F +LG YD G D+WGGE  ELSF
Sbjct: 169 KRLPLLPDDLAK----PTEPFKSPVMAGGLFAISRKYFWELGGYDEGLDVWGGEQYELSF 224

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-------TYDSGFDIWG 176
           K  W                 V  P    G      A F   G        Y    ++W 
Sbjct: 225 KI-WQC-----------GGTMVDAPCSRVGHIYRKFAPFPNPGIGDFVGRNYRRVAEVWM 272

Query: 177 GENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE------------------ 211
            E  E  +         D GD+T++K LR+ L CKSFKW++E                  
Sbjct: 273 DEYKEHLYHRRPHYRHLDPGDLTAQKALRKRLNCKSFKWFMEQVAFDQPSKYPALLTPVA 332

Query: 212 ----VSNDWSGMCIDSACKPTDMHKPVGLYPCHK----QGGNQFWMMSKHGEIR--RDEA 261
               V N+ SG+CID+  K    ++   L PC K    + G Q  +++ H ++R  +   
Sbjct: 333 HWPQVRNEESGLCIDTQFK--GQNERFSLAPCLKDQRGRSGEQQLVLTWHKDVRPAKRSV 390

Query: 262 CLDYAGGD----VILYPCHGSKGNQYFEYD 287
           C D +  D    V+L+ CHG  GNQ ++YD
Sbjct: 391 CFDVSSSDVHAPVMLWSCHGMHGNQLWKYD 420


>gi|321473823|gb|EFX84789.1| hypothetical protein DAPPUDRAFT_209135 [Daphnia pulex]
          Length = 521

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 94/323 (29%), Positives = 145/323 (44%), Gaps = 55/323 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  +  + + +  PLI  I  + FE R     +      F G F+W + 
Sbjct: 169 CEVGLNWLPPLLYPIYLDRTTMTVPLIDGIDHENFEYR----PVYQGETNFRGVFEWGML 224

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +  + +PERE +     +EP   PT AGGLF+I++A+F ++G YD G  +WGGEN ELSF
Sbjct: 225 YKENEVPEREAQSRTYNSEPYKAPTHAGGLFAINRAYFLEIGAYDPGLLVWGGENFELSF 284

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGG-------LFSIDKAFFEKLGT-----YDSG 171
           K  W               + +W P    G        ++  K    K G+     Y   
Sbjct: 285 KI-WQC-----------GGKILWVPCSRVGHVYRGFMPYTFGKLAANKKGSLITINYKRV 332

Query: 172 FDIWGGENLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYL-EVSND------- 215
            ++W  +  +  F          D G++T + E+++ L CKSF W++ EV+ D       
Sbjct: 333 IEVWFDDKYKEFFYTREPTARFLDMGNITQQLEMKKRLNCKSFAWFMEEVAYDVLDKYPE 392

Query: 216 ------WSGMCIDSA--CKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLD 264
                 W  +   +A  C  T  H+P   +G+  CH  G NQ + ++K G++   E C++
Sbjct: 393 LPANLHWGELRNTAARQCLDTMGHQPPSLMGISHCHGFGNNQLFRLNKAGQLGVGERCVN 452

Query: 265 YAGGDVILYPCHGSKGNQYFEYD 287
                V L  C        +EYD
Sbjct: 453 ADSQGVKLVVCRLGSVEGPWEYD 475


>gi|268370157|ref|NP_001161259.1| polypeptide GalNAc transferase 6-like [Nasonia vitripennis]
          Length = 615

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 108/330 (32%), Positives = 149/330 (45%), Gaps = 71/330 (21%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           E    WL PLL+ +A++    V P I  I  +TFE R       +  +   G FDW L +
Sbjct: 244 EANVNWLPPLLEPIAKDYKTCVCPFIDVIAYETFEYR-------AQDEGARGAFDWELYY 296

Query: 65  N-WHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
                +PE      KN +EP  +P MAGGLF+I   FF +LG YD G DIWGGE  ELSF
Sbjct: 297 KRLPLLPED----LKNPSEPFKSPVMAGGLFAISAKFFWELGGYDPGLDIWGGEQYELSF 352

Query: 124 KFNWHAIPER-----ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-TYDSGFDIWGG 177
           K  W    +       R  H     P + P    G F         LG  Y    ++W  
Sbjct: 353 KI-WQCGGQMYDAPCSRVGHIYRKFPPF-PNPGRGDF---------LGKNYKRVAEVWMD 401

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-------------VSNDWS 217
           E  +  ++        D GD+T +K LR  L CKSFKW++E               +D++
Sbjct: 402 EYADFIYRRRPHLRAMDPGDLTEQKALRDKLKCKSFKWFMENIAFDLVEVYPPIEPDDFA 461

Query: 218 ----------GMCIDSACKPTDMHKPVGLYPCHKQG----GNQFWMMSKHGEIR--RDEA 261
                      +C+D+  K  D  + + +  C K      G Q + ++ H +IR  R   
Sbjct: 462 YGEMRNIGVPNLCLDAKGKGKD--EEIAVDYCQKDTPKIKGEQEFQLTWHKDIRPNRRTE 519

Query: 262 CLDYAGGD----VILYPCHGSKGNQYFEYD 287
           CLD + GD    V LYPCHG +GNQ + Y+
Sbjct: 520 CLDVSRGDDKSPVTLYPCHGKQGNQLWRYN 549


>gi|221041542|dbj|BAH12448.1| unnamed protein product [Homo sapiens]
          Length = 360

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 88/272 (32%), Positives = 122/272 (44%), Gaps = 48/272 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL+ +A + + VVSP+I  I  D F+       L        GGF  NL 
Sbjct: 108 CECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASADLK-------GGFG-NLV 159

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +  PE+ R R  N   P+ TP +AGGLF +DK +FE+LG YD   D+WGGENLE+S
Sbjct: 160 FKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEIS 219

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F+ +              ++W  
Sbjct: 220 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGTVFARNTR---------RAAEVWMD 270

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEV----------------SN 214
           E     +          +G++ SR ELR+ L CK FKWYLE                 + 
Sbjct: 271 EYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGAL 330

Query: 215 DWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQ 246
                C+D+     D    VG+Y CH  G  Q
Sbjct: 331 QQGTNCLDTLGHFAD--GVVGVYECHVAGLRQ 360


>gi|348533011|ref|XP_003453999.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Oreochromis niloticus]
          Length = 587

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 103/330 (31%), Positives = 144/330 (43%), Gaps = 67/330 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +A+N   +V P+I  I  D F      G  T +     G FDW + 
Sbjct: 222 CEANVNWLPPLLDRIAQNRKTIVCPMIDVIDHDNF------GYETQAGDAMRGAFDWEMY 275

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + +EP  +P MAGGLF++D+ +F +LG YD+G +IWGGE  E+SF
Sbjct: 276 YKRIPIPTELQK--DDPSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISF 333

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   P   ++A  L  + + + ++   Y       
Sbjct: 334 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPGGVSLARNLKRVAEVWMDEYAEY---IYQR 390

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GD+T +KELR  L CK+FKW++                      E+ 
Sbjct: 391 RPEYRHLS----AGDMTVQKELRNRLNCKNFKWFMSEVAWDLPKHYPPVEPPAAAWGEIR 446

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI-----RRD--------- 259
           N  S MC++S  K      P+ L  C K  G+  W    HG++     R D         
Sbjct: 447 NVGSSMCMES--KHFVSGSPIRLENCVKGRGDVSW---SHGQVFTFGWREDIRVGDPMHT 501

Query: 260 -EACLDYAGGD--VILYPCHGSKGNQYFEY 286
            + C D    +  V LY CHG KGNQ + Y
Sbjct: 502 KKVCFDAISHNSPVTLYDCHGMKGNQLWRY 531


>gi|341881851|gb|EGT37786.1| hypothetical protein CAEBREN_30257 [Caenorhabditis brenneri]
 gi|341887866|gb|EGT43801.1| CBN-GLY-7 protein [Caenorhabditis brenneri]
          Length = 601

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 91/312 (29%), Positives = 141/312 (45%), Gaps = 37/312 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + RN   +  P+I  I  +++E R   G   + +    G F+W L 
Sbjct: 252 CEVNTNWLPPLLAPIKRNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHS---GIFEWGLL 308

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    I ERE    K++++P  +PT AGGLF+I++ +F++LG YD G  IWGGE  ELSF
Sbjct: 309 YKETQITERETAHRKHSSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSF 368

Query: 124 KFNWHA------IPERERKRHKNAAEPVWTPTMAGG-LFSIDKAFFEKLGTYDSGFDIWG 176
           K  W        +P         +  P      +G  + SI+      + T+   ++ + 
Sbjct: 369 KI-WQCGGGIVFVPCSHVGHVYRSHMPYGFGKFSGKPVISIN--MMRVVKTWMDDYEKYY 425

Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------VSND 215
                 +   + GD++++  LR  L CKSFKWY+E                       N 
Sbjct: 426 LTREPQAAHVNPGDISAQLALRDKLQCKSFKWYMENVAYDVLKSYPLLPPNDVWGGAQNP 485

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPC 275
            +G C+D   +   +  P+G   CH  GGNQ   ++  G++ + E CL   G  +    C
Sbjct: 486 ATGKCLD---RMGGIPGPLGASGCHGYGGNQLLRLNVQGQLAQGEWCLTANGIRIQANHC 542

Query: 276 HGSKGNQYFEYD 287
                N  + YD
Sbjct: 543 VKGSVNGNWVYD 554


>gi|291241093|ref|XP_002740445.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine: polypeptide
           N-acetylgalactosaminyltransferase 7-like [Saccoglossus
           kowalevskii]
          Length = 594

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 87/320 (27%), Positives = 143/320 (44%), Gaps = 58/320 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  +A+N + V  P+I  I +  + +R        S +   GGFDW+L 
Sbjct: 245 CEVGINWLPPLLSPIAQNRTTVTVPIIDVIDNMDYTMRS-----QGSGELSRGGFDWSLY 299

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    + + E ++   ++EP  +P MAGGLF++ + +F +LG YD G ++WGGEN ELSF
Sbjct: 300 WKHLPMSKEETRKRSLSSEPYRSPAMAGGLFAMARDYFFELGAYDPGLEVWGGENFELSF 359

Query: 124 KFNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSI-DKAFFE---------KLGTYDSGF 172
           K  W                 +W P +  G ++ I  K  +           L  Y    
Sbjct: 360 KI-WQC-----------GGSMLWVPCSHVGHVYRILGKVPYRAPNATMTQWSLRNYRRVV 407

Query: 173 DIWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYL--------------- 210
           ++W  +  E  ++         FGD++ + E +    CK+F W++               
Sbjct: 408 EVWMDDYKEFFYRSKPESQLLHFGDISKQLEFKTKHNCKNFDWFMKEVAPDLLAVYPVPA 467

Query: 211 ------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD 264
                 E+ ++ + +C+D+          +G+  CH QGGNQ + +++  E R  E CL 
Sbjct: 468 ANQAWGEIKSNTNKVCVDTMGNREG--GTIGISGCHGQGGNQLFRITEDHEFRIHELCLY 525

Query: 265 YAGGDVILYPCHGSKGNQYF 284
               +V L  C G     +F
Sbjct: 526 EIYSEVKLRRCDGKSKYSWF 545


>gi|327281948|ref|XP_003225707.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1-like
           [Anolis carolinensis]
          Length = 574

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 76/218 (34%), Positives = 103/218 (47%), Gaps = 24/218 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L        GGFDW+L 
Sbjct: 233 CEVNSEWLQPMLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLR-------GGFDWSLH 285

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +  + + TP +AGG+F IDK++F  LG YD+  DIWGGEN ELSF
Sbjct: 286 FKWEQIPIEQKLSRTDPTQSIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSF 345

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RKRH     P   P   G   +  K        +   + 
Sbjct: 346 RVWMCGGSLEIVPCSRVGHVFRKRH-----PYDFP--EGNALTYIKNTKRTAEVWMDEYK 398

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
            +  E    +    FG +  R + RR L CKSF+WYLE
Sbjct: 399 QYYYEARPSAIGKSFGSIADRVDQRRKLNCKSFQWYLE 436


>gi|328700065|ref|XP_003241139.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
           [Acyrthosiphon pisum]
          Length = 588

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 76/218 (34%), Positives = 118/218 (54%), Gaps = 26/218 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV   W+QPLL  +  N + +++P+I  I  DTF+ +  P           GGF+W L 
Sbjct: 222 VEVNTDWIQPLLTRVRDNRTQIIAPIIDIIQPDTFDYKSSP--------LVRGGFNWGLH 273

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++P+      K+  +P+ TPT+AGGLF++D+ +F ++G YDSG +IWGGENLELSF
Sbjct: 274 FKWDSLPKGTLVTDKDFVKPIKTPTIAGGLFAVDREYFNEIGQYDSGMNIWGGENLELSF 333

Query: 124 KFNWHA-----IPERER-----KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +  W       I    R     ++H+  + P    TMA     +   + +    +     
Sbjct: 334 RV-WMCGGSLYIEPCSRVGHVFRQHRPYSAPNNEDTMARNSLRLANVWMDDFKKF----- 387

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
            +  + ++L  + D+GDV+ RK LR  LGC +F+WYLE
Sbjct: 388 -FISKRMDL-LRLDYGDVSERKALRTKLGCNNFEWYLE 423


>gi|344288103|ref|XP_003415790.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           protein 2 [Loxodonta africana]
          Length = 640

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 90/300 (30%), Positives = 134/300 (44%), Gaps = 38/300 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A + S VVSP+I  I   TF+  +P      S     G  DWNL 
Sbjct: 288 CECHQGWLEPLLSRIAGDRSRVVSPVIDVIDWKTFQY-YP------SEALQRGVLDWNLD 340

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+W  +PE E+K  ++   P+ +P + GG+ +ID+ +F+  G YD    +WGGENLELS 
Sbjct: 341 FHWEPLPEHEKKALQSPISPIRSPVVPGGVVAIDRHYFQNTGAYDPLMSLWGGENLELSL 400

Query: 124 KF-----NWHAIP-ERERKRHKNA-AEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
           K      +   +P  R    ++N  A PV        L +  +     L ++   F    
Sbjct: 401 KTWLCGGSVEILPCSRVGHVYRNQDAHPVL--DQEATLQNKIRIAETWLASFKETFYKHS 458

Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVSNDWSG 218
            E   LS + +  D T R +L+R LGC+ F W+L                  ++ N   G
Sbjct: 459 PEAFSLS-QAEKPDCTERLQLQRRLGCRMFHWFLANIYPELYPSEHMPRFSGKLHNTGLG 517

Query: 219 MCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR---RDEACLDYAGGDVILYPC 275
            C D   +   +  P+ L+PC+     Q    +   EIR       C    G  V+L  C
Sbjct: 518 FCADCQAEGDTLGCPMMLFPCNDNRKQQHLQHTSRKEIRFGSPQHLCFGVRGAQVVLQNC 577


>gi|312087698|ref|XP_003145574.1| glycosyl transferase [Loa loa]
 gi|307759263|gb|EFO18497.1| glycosyl transferase [Loa loa]
          Length = 520

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 86/290 (29%), Positives = 137/290 (47%), Gaps = 46/290 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL+PLL  +  N S V+ P+I +I  +T        RL +     +GGF W+L 
Sbjct: 169 CEVSEGWLEPLLARIKENRSVVLCPIIDHISAETLAYS-GSDRLAN-----VGGFWWSLH 222

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +PE       +  +P+ +PTMAGGLF++D+ +F ++G YD   DIWGGENLE+SF
Sbjct: 223 FRWDPLPEE--YYGIDPTKPIRSPTMAGGLFAVDRLYFFEVGGYDPKMDIWGGENLEISF 280

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +          IP         A  P +  T  G    +     ++L       ++W  +
Sbjct: 281 RVWMCGGGIEFIPCSHVGHIFRAGHP-YNMTGPGNNEDVHGTNSKRLA------EVWMDD 333

Query: 179 NLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE------------------VS 213
                +       + + GD++ R+ LR+ L CKSFKWYLE                  + 
Sbjct: 334 YKRFYYIHRSDLKEKNVGDLSERRALRKKLKCKSFKWYLENVAKNKFILDENVAAFGALR 393

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHK-QGGNQFWMMSKHGEIRRDEAC 262
           N  SG C+D+  +       + ++PC   +   Q + ++  G++RR+  C
Sbjct: 394 NPSSGFCLDTLQQDEKEAVSLAVFPCQNGKSEAQIFSLTNDGKLRRELTC 443


>gi|351714167|gb|EHB17086.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Heterocephalus
           glaber]
          Length = 330

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 61/123 (49%), Positives = 78/123 (63%), Gaps = 8/123 (6%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 152 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 204

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE++GTYD+G DIWGGENLE+S
Sbjct: 205 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEIS 264

Query: 123 FKF 125
           F+ 
Sbjct: 265 FRI 267



 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 46/81 (56%), Positives = 63/81 (77%), Gaps = 4/81 (4%)

Query: 107 YDSGFDI-WGGENLELSFKFNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEK 164
           Y +G D+ +GG N +L+F+  W+ +P+RE  R K +   PV TPTMAGGLFSID+ +FE+
Sbjct: 188 YMAGSDMTYGGFNWKLNFR--WYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEE 245

Query: 165 LGTYDSGFDIWGGENLELSFK 185
           +GTYD+G DIWGGENLE+SF+
Sbjct: 246 IGTYDAGMDIWGGENLEISFR 266


>gi|125980684|ref|XP_001354365.1| GA19561 [Drosophila pseudoobscura pseudoobscura]
 gi|54642673|gb|EAL31418.1| GA19561 [Drosophila pseudoobscura pseudoobscura]
          Length = 591

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 92/304 (30%), Positives = 142/304 (46%), Gaps = 41/304 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + R+ + +  P+I  I    FE R  P   T ++  F G F+W + 
Sbjct: 238 CEVNLNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYR--PVYGTDNH--FRGIFEWGML 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +  + +P RE++R  + +EP  +PT AGGLF+I++ +F +LG YD G  +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353

Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K  W      E     R  H       + P   G L S  K     +  Y    + W  +
Sbjct: 354 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 408

Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGM------ 219
             +        L+   D GD+T +  L++ LGCKSF+W++     +V + + G+      
Sbjct: 409 THKEYFYTREPLARYLDMGDITEQLALKKRLGCKSFQWFMDHIAYDVYDKFPGLPANLHW 468

Query: 220 -----CIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVI 271
                     C  +  H+P   +GL  CH  G NQ   ++  G++   E C++     + 
Sbjct: 469 GELRSVASDGCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGIK 528

Query: 272 LYPC 275
           L  C
Sbjct: 529 LAVC 532


>gi|157107408|ref|XP_001649763.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108884049|gb|EAT48274.1| AAEL000646-PA [Aedes aegypti]
          Length = 582

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 111/332 (33%), Positives = 150/332 (45%), Gaps = 73/332 (21%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELR-FPPGRLTSSYKFFIGGFDWNLQ 63
           EV   WL PL++ +A N    V P I  I  DTFE +    GR         G FDW  +
Sbjct: 219 EVNVNWLPPLIEPIAENYRTCVCPYIDGIAHDTFEYKPQSEGRR--------GAFDW--K 268

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F +  +P R + +  +  EP  +P MAGGLF+I   FF +LG YD   DIWGGE  ELSF
Sbjct: 269 FLYKRLPLRPQDQ-TDPTEPFDSPIMAGGLFAISAKFFWELGGYDEELDIWGGEQYELSF 327

Query: 124 KFNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGT------YDSGFDIWG 176
           K  W                 V  P +  G ++     F    GT      +    ++W 
Sbjct: 328 KI-WQC-----------GGRMVDAPCSHVGHVYRGLAPFPNPRGTNFVTRNFKRVAEVWM 375

Query: 177 GENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-EVSNDW------------ 216
            E  +  F       K D GD+T +K LR  L CK FKW+L EV+ D             
Sbjct: 376 DEYKQFLFERNPEYDKTDAGDLTKQKALRERLQCKPFKWFLEEVAPDLLLKYPLRDPKPF 435

Query: 217 -SG---------MCIDSACKPTDMHKPVGLYPC-----HKQGGNQFWMMSKHGEIR--RD 259
            SG         +C+DS        +P+G++ C     H Q  NQF+ +S   +IR    
Sbjct: 436 ASGRVQSLANPILCLDSLNHKE--KEPIGVFSCAANKTHPQ-SNQFFTLSYFRDIRVASV 492

Query: 260 EACLDYA--GGDVILYPCHGSKGNQYFEYDYK 289
           + CLD A  G +V L+ CH  +GNQ ++YD K
Sbjct: 493 DKCLDAASEGSEVRLFNCHEIQGNQLWQYDMK 524


>gi|260814835|ref|XP_002602119.1| hypothetical protein BRAFLDRAFT_125760 [Branchiostoma floridae]
 gi|229287425|gb|EEN58131.1| hypothetical protein BRAFLDRAFT_125760 [Branchiostoma floridae]
          Length = 1164

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 69/180 (38%), Positives = 94/180 (52%), Gaps = 24/180 (13%)

Query: 48   TSSYKFFIGGFDWNLQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY 107
            +SS     GGFDW + F W+ +P+ E  R K    P+ +PTMAGGLFSI K FFE+LGTY
Sbjct: 885  SSSGHMTRGGFDWRMHFRWNTVPDYEMARRKMEKAPIRSPTMAGGLFSIHKMFFEELGTY 944

Query: 108  DSGFDIWGGENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFF 162
            D G +IWGGENLELSFK          +P          ++P   P   GG+ ++ +   
Sbjct: 945  DPGLEIWGGENLELSFKTWMCGGTLEILPCSRVGHIFRQSQPYRFP--GGGMQTVQRNSL 1002

Query: 163  EKLGTYDSGFDIWGGENLELSFKG----------DFGDVTSRKELRRNLGCKSFKWYLEV 212
              +        +W  E    +F             +GDV+ R++LR  LGCKSF+WYL+ 
Sbjct: 1003 RVV-------QVWMDERHRKAFYAVNPELKDMNISYGDVSERRQLRDRLGCKSFQWYLDT 1055



 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 52/120 (43%), Positives = 64/120 (53%), Gaps = 11/120 (9%)

Query: 69  IPERE---RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF 125
           +PER    R R + AA      +  G    +D  +         GFD W          F
Sbjct: 850 LPERAGLIRARLRGAAVRRLLESKGGISLHLDHLYSSSGHMTRGGFD-W-------RMHF 901

Query: 126 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 185
            W+ +P+ E  R K    P+ +PTMAGGLFSI K FFE+LGTYD G +IWGGENLELSFK
Sbjct: 902 RWNTVPDYEMARRKMEKAPIRSPTMAGGLFSIHKMFFEELGTYDPGLEIWGGENLELSFK 961


>gi|260787295|ref|XP_002588689.1| hypothetical protein BRAFLDRAFT_248153 [Branchiostoma floridae]
 gi|229273857|gb|EEN44700.1| hypothetical protein BRAFLDRAFT_248153 [Branchiostoma floridae]
          Length = 415

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 79/216 (36%), Positives = 111/216 (51%), Gaps = 25/216 (11%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           WL+PLLD +  + + VV P I  + + TF           + +   GGFDW L F W ++
Sbjct: 190 WLEPLLDRIREDRTRVVCPSIDRVNEATFAYEV-------ANENVRGGFDWELFFQWVSL 242

Query: 70  PERERKRHKNAA---EPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF- 125
           P  E KR  +     E + +PTMAGGLFSID+ FF +LG YD GF IWGGENLELSFK  
Sbjct: 243 PAVEAKRRTHNVFQHEVIRSPTMAGGLFSIDRGFFYELGGYDPGFQIWGGENLELSFKIW 302

Query: 126 ----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY------DSGFDIW 175
               +   +P          ++P +  + A  +  +      +L            + + 
Sbjct: 303 MCGGSLEILPCSRVGHVFRKSQP-YNYSNATSIMEVVHHNNVRLAEVWLDEYKKIYYALH 361

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
            G  +EL+     GD++ RK LR NLGC+SF+WYLE
Sbjct: 362 PGVEVELA---KMGDISERKLLRENLGCRSFQWYLE 394


>gi|196006600|ref|XP_002113166.1| hypothetical protein TRIADDRAFT_27135 [Trichoplax adhaerens]
 gi|190583570|gb|EDV23640.1| hypothetical protein TRIADDRAFT_27135, partial [Trichoplax
           adhaerens]
          Length = 491

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 72/214 (33%), Positives = 112/214 (52%), Gaps = 16/214 (7%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV ++W +PLL+ +  N   +VSP++ NI  +TFE +          +   GGFDW+L 
Sbjct: 146 CEVNQQWAEPLLEQIVLNPKAIVSPVLDNIDMNTFEYQ-------EGTEDVRGGFDWSLT 198

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  + E    +  +   P+ TPT+AGG++++ K +F  LG YD G  IWGGENLELSF
Sbjct: 199 FRWDYMTEAMINQRIDPTSPIKTPTIAGGIYAVSKQWFNDLGEYDMGQKIWGGENLELSF 258

Query: 124 KFNW------HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           +  W        IP            P   P  AG  +   +     +  +   + ++  
Sbjct: 259 R-AWMCGGFMKIIPCSRVGHVFRLQHPYIFPEGAGRTYY--RNLRRVVEVWLDEYKVYFY 315

Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
           +  ++    D+G+V SRK+LR+ L C++FKWYL+
Sbjct: 316 QIRKIIKSIDYGNVKSRKQLRKRLHCQTFKWYLD 349


>gi|390332219|ref|XP_781199.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like
           [Strongylocentrotus purpuratus]
          Length = 606

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 88/310 (28%), Positives = 140/310 (45%), Gaps = 36/310 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  +A N +  V P+I  I  D  + R  P       +   GGFDW+L 
Sbjct: 257 CEVGVNWLPPLLTPIAVNRTTAVCPIIDVI--DNMDYRVYPQGTGDQDR---GGFDWSLY 311

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    +P+ E+ R ++A+EP  +P MAGGLF++D+ +F +LG YD G +IWGGEN ELSF
Sbjct: 312 WKHLPVPQFEKSRRQHASEPYRSPAMAGGLFAMDRKYFFELGAYDEGLEIWGGENFELSF 371

Query: 124 KFNWHA------IP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
           K  W        +P  R    ++   +  ++      L   ++     +  +   +  + 
Sbjct: 372 KI-WMCGGSLLWVPCSRVGHVYRILGKVPYSAPNGSMLILSERNLRRVVEVWFDDYKEYF 430

Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL---------------------EVSND 215
             +   S     G++  +   R    CKSF W++                     E+   
Sbjct: 431 YRSKPESLLVSTGNIEKQLAFREKFHCKSFGWFMKEIAPDIIEKYPLPHANKYWGEIRTK 490

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPC 275
              +C+DS          VG+  CH  GGNQ + ++++G++R  + C      +V L  C
Sbjct: 491 KGSLCVDSMGSKDGGR--VGMSYCHGAGGNQLFRVTENGQLRIHDQCAYDHYKEVRLRRC 548

Query: 276 HGSKGNQYFE 285
            GS G   F+
Sbjct: 549 GGSGGGWSFD 558


>gi|157117587|ref|XP_001658839.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108875983|gb|EAT40208.1| AAEL008037-PA [Aedes aegypti]
          Length = 662

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 92/306 (30%), Positives = 141/306 (46%), Gaps = 46/306 (15%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           EV   W++PLL  +  N + +  P+I  I  DTF        + SS     GGF+W L F
Sbjct: 295 EVNVDWVEPLLQRIKTNKTILAMPVIDIINSDTF--------IYSSSPLVRGGFNWGLHF 346

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
            W  +P+    +  +   P  +PTMAGGLF++D+ +F+ LG YD G D+WGGENLE+SF+
Sbjct: 347 KWDNLPKGTLAKESDFVGPFQSPTMAGGLFAVDRQYFKDLGEYDMGMDVWGGENLEISFR 406

Query: 125 FNWHA-----------IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
             W             I    RKR +    P  + TM      + + + +    Y     
Sbjct: 407 -TWQCGGSIELVPCSRIGHVFRKR-RPYGSPDGSDTMIRNSLRLSRVWMDDYIKY----- 459

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTD--MH 231
               EN   + K D GD+T R +LR+ L CKSF+WYL+  N +  + +    K TD  + 
Sbjct: 460 --FLENQPQAKKVDPGDLTDRHDLRKRLNCKSFEWYLK--NIYPQLKLPGE-KTTDSNVS 514

Query: 232 KPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY----------AGGDVILYPCHGSKGN 281
           +P    P H +  N  ++ S    +     C+             G  ++L+PC   K  
Sbjct: 515 QP-KFQPWHSRKRN--YISSFQIRLSNSSLCVTTESAKEKSLWKKGSHLVLHPCLRVKAQ 571

Query: 282 QYFEYD 287
            ++E +
Sbjct: 572 MWYETE 577


>gi|195338421|ref|XP_002035823.1| GM15572 [Drosophila sechellia]
 gi|194129703|gb|EDW51746.1| GM15572 [Drosophila sechellia]
          Length = 604

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 91/306 (29%), Positives = 146/306 (47%), Gaps = 42/306 (13%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           EV ++WL+PLL ++   ++ +  P+I  I  DTFE  + P  L        GGF+W L F
Sbjct: 249 EVNQQWLEPLLRLIKSENATLAVPVIDLINADTFE--YTPSPLVR------GGFNWGLHF 300

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
            W  +PE   K  ++   P  +PTMAGGLF++++ +F+ LG YD   DIWGGEN+E+SF+
Sbjct: 301 RWENLPEGTLKVPEDFRGPFRSPTMAGGLFAVNRKYFQHLGEYDMAMDIWGGENIEISFR 360

Query: 125 FNWHA------IPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
             W        +P        RKR +    P    TM      +   + ++   Y     
Sbjct: 361 -AWQCGGAIKIVPCSRVGHIFRKR-RPYTSPDGANTMLKNSLRLAHVWMDQYKDY----- 413

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGMCIDSACKPT 228
               E +  ++  D+GD++ R +LR  L C+ F WYL     E+    +G  + +A    
Sbjct: 414 YLKHEKVPKAY--DYGDISDRLKLRERLQCRDFAWYLKNVYPELHLRLTGTELCAAVVAP 471

Query: 229 DMH------KPVGLYPCHKQGGNQFWMMSKHGEIRRDE-ACLDYAG-GDVILYPCHGSKG 280
            +         + L  C ++  NQ W  ++  EI  D+  CL+ +G   V +  CH   G
Sbjct: 472 KVKGFWKKGSSLQLQTC-RRTPNQLWYETEKAEIVLDKLLCLEASGDAQVTVNKCHEMLG 530

Query: 281 NQYFEY 286
           +Q + +
Sbjct: 531 DQQWRH 536


>gi|334348942|ref|XP_001380115.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           protein 2-like [Monodelphis domestica]
          Length = 642

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 89/292 (30%), Positives = 130/292 (44%), Gaps = 33/292 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  K WL+PLL  +A + S +VSP+I  I    F+          S     G FDW L 
Sbjct: 288 CECHKGWLEPLLSRIAGDRSRLVSPIIDVIDWKNFQYYH-------SMDLQRGVFDWELN 340

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+W  +PE+ERK  ++   P+ +P + GG+ +ID+ +F+  G YDS   IWG ENLELS 
Sbjct: 341 FHWRPLPEQERKMRQSPISPIRSPVLPGGVLAIDRHYFQNTGAYDSLMSIWGSENLELSI 400

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +      +   IP            P  +P     L +  +     LG++   F     +
Sbjct: 401 RVWLCGGSVEIIPCSRVGHVYRHQPPNASPDPEAALKNKIRIVETWLGSFKDTFYQHSPK 460

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYP 238
              LS + +  D + R +L+R LGC++F W+L            +   P        LYP
Sbjct: 461 AFSLS-QAEKQDCSERLQLQRRLGCRTFHWFL------------ANLSPE-------LYP 500

Query: 239 C-HKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYFEYDYK 289
             HK G +     S  G      +     GG V+L PC  S+  Q+ EY  K
Sbjct: 501 SEHKPGFSGKLYSSGVGSCAECVSGQGLPGGWVMLSPCSDSRQPQHLEYTSK 552


>gi|402586218|gb|EJW80156.1| glycosyltransferase, partial [Wuchereria bancrofti]
          Length = 448

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 81/226 (35%), Positives = 112/226 (49%), Gaps = 39/226 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  N   VV+P+I  I  DTF+       L        GGF+WNL 
Sbjct: 236 CECNVNWLEPLLARVKENHRAVVAPVIDIIDKDTFKYIAASADLR-------GGFEWNLI 288

Query: 64  FNWHAIPERERK-RHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +  + R  RH     P+ TP +AGGLF I K +FEKLGTYD   D+WGGENLELS
Sbjct: 289 FKWEYLLGKLRDDRHAQPTAPIRTPVIAGGLFMIQKDWFEKLGTYDEEMDVWGGENLELS 348

Query: 123 FKF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           F+      +   IP        RK+H          T  GG  ++ +    ++       
Sbjct: 349 FRVWLCGGSLEIIPCSRVGHVFRKQHPY--------TFPGGSSNVFQKNTRRVA------ 394

Query: 173 DIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
           ++W G+   L  +        +FGD+T+R +L++ L CK F WYL+
Sbjct: 395 EVWLGDYKHLYLRKVPSARYVNFGDITARLDLKKRLHCKDFDWYLK 440


>gi|449267121|gb|EMC78087.1| Polypeptide N-acetylgalactosaminyltransferase 10, partial [Columba
           livia]
          Length = 560

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 103/338 (30%), Positives = 143/338 (42%), Gaps = 72/338 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F      G  T +     G FDW + 
Sbjct: 183 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDHF------GYETQAGDAMRGAFDWEMY 236

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD+G +IWGGE  E+SF
Sbjct: 237 YKRIPIPPELQKL--DPSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISF 294

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPT---MAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   PT   +A  L  + + + ++   Y       
Sbjct: 295 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPTGVSLARNLKRVAEVWMDEYAEY---IYQR 351

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------------- 210
             E   LS     GDV ++KELR NL CKSFKW++                         
Sbjct: 352 RPEYRHLS----AGDVAAQKELRNNLNCKSFKWFMNEVAWDLPKFYPPVEPPAAAWGEAR 407

Query: 211 --------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEI 256
                   ++ N  +G+C+D+  K   +  P+ L  C K  G   W        S   +I
Sbjct: 408 DSATSLLFQIRNVGTGLCVDT--KHGALGSPLRLENCVKDRGEAAWNNVQVFTFSWREDI 465

Query: 257 RRDEA------CLDYA--GGDVILYPCHGSKGNQYFEY 286
           R  +       C D       V LY CHG KGNQ + Y
Sbjct: 466 RPGDPQHTKKFCFDAISHSSPVTLYDCHGMKGNQLWRY 503


>gi|170591418|ref|XP_001900467.1| Polypeptide N-acetylgalactosaminyltransferase [Brugia malayi]
 gi|158592079|gb|EDP30681.1| Polypeptide N-acetylgalactosaminyltransferase, putative [Brugia
           malayi]
          Length = 575

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 77/212 (36%), Positives = 112/212 (52%), Gaps = 23/212 (10%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  K W++PLL  +  N   VV P+I  I D TF  +        S + F GGF+WNLQ
Sbjct: 185 CECTKGWMEPLLARIKENRKAVVCPVIDIINDRTFAYQ-------KSIELFRGGFNWNLQ 237

Query: 64  FNWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+A+P E  + R  +  +P+ +PTMAGGLFSID+ +FE++GTYD   DIWGGEN+E+S
Sbjct: 238 FRWYALPSEMIKSRSDDPTKPIISPTMAGGLFSIDRKYFEEIGTYDHEMDIWGGENIEIS 297

Query: 123 FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE---N 179
            +  +  +P            P   P    G  +I  +   ++       ++W  E   +
Sbjct: 298 LRV-FEILPCSHVGHVFRRTSPHDFPGRKSG--TILNSNLLRVA------EVWMDEWKFH 348

Query: 180 LELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
              +    FG V +    R+ L CKSFKW+L+
Sbjct: 349 FYRTAPRRFGCVVNS---RKRLHCKSFKWFLD 377


>gi|383862333|ref|XP_003706638.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
           [Megachile rotundata]
          Length = 637

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 92/305 (30%), Positives = 139/305 (45%), Gaps = 41/305 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV K W++PLL  +A + + V  P+I  I  DTF+    P           GGF+W L 
Sbjct: 266 IEVNKMWIEPLLSRIAHSKTIVAMPVIDIINADTFQYTASP--------LVRGGFNWGLH 317

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P +     ++  +P+ +PTMAGGLF++D+ +F +LG YD+G D+WGGENLE+SF
Sbjct: 318 FKWEQLPTK-LVHDEDFIKPIKSPTMAGGLFAMDREYFVELGEYDAGMDVWGGENLEISF 376

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   IP        RKR    A+      +   L    +  +  L  Y   + 
Sbjct: 377 RIWMCGGSIELIPCSRVGHVFRKRRPYGADDKHDTMLKNSL----RVAYVWLDEYKHYY- 431

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGMCIDSACKPT 228
                 L+   K D+GD+T R  LR+ L CK F WY+     E++               
Sbjct: 432 ------LKDVNKIDYGDITDRLNLRQKLKCKDFAWYVKEVYPELTFPDDDKKRLKDKWAR 485

Query: 229 DMHKPVGLYPCHKQGGN---QFWM-MSKHGEIRRDEACLDYAGGDVILYPCHGSKGNQYF 284
              KP  + P H +  N   Q+ + +S      + E  +   G  +IL PC   K   ++
Sbjct: 486 IEQKP--MQPWHSRKRNYTDQYQIRLSNTALCIQSEKDIKTKGAKLILMPCLRIKSQMWY 543

Query: 285 EYDYK 289
           E D K
Sbjct: 544 ETDKK 548


>gi|313228070|emb|CBY23220.1| unnamed protein product [Oikopleura dioica]
          Length = 467

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 91/287 (31%), Positives = 133/287 (46%), Gaps = 47/287 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A   +  V+P+I NI D+ FE+R      T+ +   IG F W + 
Sbjct: 190 CEAISGWLEPLLQRVAEKPNVAVTPVILNIRDNDFEIR-----ATAPHNVQIGIFTWGMT 244

Query: 64  FNWHAIPERERKRH--KNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           F W     R+R  +   N+ + V +PTMAGGLF+I++ +F   G+YD     WGGENLE+
Sbjct: 245 FTWERYFWRKRLNNVKNNSTKCVPSPTMAGGLFAINREYFYYSGSYDEQMHGWGGENLEM 304

Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
           SF+           P  +         P   P  A G ++++              D+W 
Sbjct: 305 SFRLWQCGGGIETHPCSQVGHVFRTHSPYKIPEGAEG-YNLNMRRL---------VDVWL 354

Query: 177 GENLELSFK------GDFGDVTSRKELRRNLGCKSFKWYLE---------VSNDW----- 216
            E  EL +       GD GD++ R  L+  L CKSF WY++          +ND      
Sbjct: 355 DEFKELYYSRSGGVWGDEGDISERLALKEKLQCKSFAWYMDNVATSIDYFFANDTRTGFL 414

Query: 217 --SGMCIDSACKPTDM--HKPVGLYPCH-KQGGNQFWMMSKHGEIRR 258
             +G C+D    P  +     VG YPCH + GGNQ  M ++   + R
Sbjct: 415 HSNGHCLDVGNLPMPLAPQNDVGTYPCHFEVGGNQVVMFTRLKGLTR 461


>gi|242005043|ref|XP_002423384.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
 gi|212506428|gb|EEB10646.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
          Length = 573

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 110/334 (32%), Positives = 141/334 (42%), Gaps = 77/334 (23%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFP-PGRLTSSYKFFIGGFDWNLQ 63
           E    WL PLL+ +A N    V P I  I  DTFE R    GR         G FDW  +
Sbjct: 220 EANVNWLPPLLEPIAENYKTCVCPFIDVIAHDTFEYRAQDEGRR--------GAFDW--E 269

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F +  +P       K+  EP  +P MAGGLF+I   FF +LG YD G  IWGGE  ELSF
Sbjct: 270 FFYKRLPLLPEDL-KHPTEPFQSPVMAGGLFAISAKFFWELGGYDEGLAIWGGEQYELSF 328

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-------TYDSGFDIWG 176
           K  W               + V  P    G      A F   G        Y    ++W 
Sbjct: 329 KI-WQC-----------GGKMVDAPCSRVGHIYRKFAPFPNPGIGDFVGKNYRRVAEVWM 376

Query: 177 GENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-VSNDW------------ 216
            E  E  +K        D GD+T +K +R  L CK FKW++E ++ D             
Sbjct: 377 DEYAEYLYKRRPHYRNIDPGDLTVQKAVRERLNCKPFKWFIENIAFDLPLKYPPIEPPDL 436

Query: 217 ----------SGMCIDSACK-PTDMHKPVGLYPCHKQ------GGNQFWMMSKHGEIRRD 259
                      G+C+D+  K P D     GL PC K          Q+++++ H +IR  
Sbjct: 437 AEGEIRSIADPGLCVDTERKEPEDT---FGLKPCEKNFKSKNTRTEQYFILTWHEDIRPK 493

Query: 260 --EACLDYAGGD----VILYPCHGSKGNQYFEYD 287
               C D +  D    V LY CHG KGNQY+ YD
Sbjct: 494 GRNVCWDVSSIDNKASVNLYKCHGMKGNQYWHYD 527


>gi|260800261|ref|XP_002595052.1| hypothetical protein BRAFLDRAFT_125761 [Branchiostoma floridae]
 gi|229280294|gb|EEN51063.1| hypothetical protein BRAFLDRAFT_125761 [Branchiostoma floridae]
          Length = 941

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 85/269 (31%), Positives = 119/269 (44%), Gaps = 81/269 (30%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           WL+PLLD + RN + V  P I  I D+TF          ++ +   GGF+W ++F+W ++
Sbjct: 643 WLEPLLDRIGRNRTTVPCPSIDRINDNTFGYE-------AANENMRGGFNWGMKFDWVSL 695

Query: 70  PERERKRHK----NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF 125
           P  E  R      +  E + +PTMAGGLFSID+ FF +LG YD GF IWG ENLE+SFK 
Sbjct: 696 PPGEDDRRYQDIWSQNEIIKSPTMAGGLFSIDRRFFWELGGYDPGFQIWGAENLEISFKD 755

Query: 126 NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 185
            ++A+         NA                                            
Sbjct: 756 IFYALNPHVENEIANA-------------------------------------------- 771

Query: 186 GDFGDVTSRKELRRNLGCKSFKWYL------------------EVSNDWSGMCIDSACKP 227
              GDV+ RK +R  LGCKSF+WY+                  EV N    +C+D+    
Sbjct: 772 ---GDVSDRKRMREQLGCKSFQWYIDHVYPEITIPDLRAKARGEVKNRAMSLCLDAV--- 825

Query: 228 TDMHKPVGLYPCHKQGGNQFWMMSKHGEI 256
               + VG Y CH +GG Q + +    +I
Sbjct: 826 --YGEKVGAYFCHGEGGQQSFTLRMDDKI 852


>gi|170582702|ref|XP_001896248.1| glycosyl transferase, group 2 family protein [Brugia malayi]
 gi|158596593|gb|EDP34915.1| glycosyl transferase, group 2 family protein [Brugia malayi]
          Length = 520

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 87/296 (29%), Positives = 137/296 (46%), Gaps = 58/296 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL+PLL  +    S V+ P+I +I  +T           +     +GGF W+L 
Sbjct: 169 CEVGEGWLEPLLARIKDKRSAVLCPIINHISAETLTYS------ANDRPTNVGGFSWSLH 222

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P+       +  EP+ +PTMAGGL ++D+++F ++G YD   DIWGGENLE+SF
Sbjct: 223 FLWDPMPKE--YFDADPTEPIRSPTMAGGLLAVDRSYFFEVGGYDPKMDIWGGENLEMSF 280

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
           +  W      E           + P    G    D   +  +G  D+  D+ G  +  L+
Sbjct: 281 RV-WMCGGSIE-----------FIPCSHVGHIFRDGHPYNMIGPGDNK-DVHGTNSKRLA 327

Query: 184 -----------------FKG-DFGDVTSRKELRRNLGCKSFKWYLE-------------- 211
                             KG D GD++ R+ LR+ L CKSFKWYL+              
Sbjct: 328 EVWMDDYKKFYYIHRLDLKGKDVGDLSERRALRQKLRCKSFKWYLQNVAKNKFVLDENVA 387

Query: 212 ----VSNDWSGMCIDSACKPTDMHKPVGLYPCHK-QGGNQFWMMSKHGEIRRDEAC 262
               + N  SG+C+D+  +  D   P+ ++ C   +   Q + ++  G +RR+  C
Sbjct: 388 AFGALRNPSSGLCLDTLQRNEDEVIPLCVFSCQNGKSQTQIFSLTNDGILRRELTC 443


>gi|332243646|ref|XP_003270989.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5
           [Nomascus leucogenys]
          Length = 443

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 75/220 (34%), Positives = 106/220 (48%), Gaps = 31/220 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL+PLL  +A++   VV PLI  I D T E +  P           G FDWNLQ
Sbjct: 230 CEVNRVWLEPLLHAIAKDPKVVVCPLIDVIDDRTLEYKPSP--------VVRGTFDWNLQ 281

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   E    +   +P+W+P M+GG+F+I + +F ++G YD   D WGGENLELS 
Sbjct: 282 FKWDNVFSYEMDGPEGPTKPIWSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGGENLELSL 341

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +          IP   R  H +  +     T+   +             Y     +W  E
Sbjct: 342 RIWMCGGQLFIIP-CSRVGHISKKQTGKPSTIISAM----------THNYLRLVHVWLDE 390

Query: 179 NLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE 211
             E  F          +G++ +R ELR+ LGCKSF+WYL+
Sbjct: 391 YKEQFFLRKPGLKYVTYGNIRARVELRKRLGCKSFQWYLD 430


>gi|402593617|gb|EJW87544.1| glycosyltransferase [Wuchereria bancrofti]
          Length = 520

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 88/287 (30%), Positives = 135/287 (47%), Gaps = 40/287 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL+PLL  +    S V+ P+I +I  +T           +     +GGF W+L 
Sbjct: 169 CEVGEGWLEPLLARIKDKRSAVLCPIINHISPETLTYS------ANDRPAHVGGFWWSLH 222

Query: 64  FNWHAIPERERKRHKNA--AEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           F W  +P    K + +A   EP+ +PTMAGGL ++D+ +F ++G YD   DIWGGENLE+
Sbjct: 223 FRWDPMP----KEYSDADPTEPIRSPTMAGGLLAVDRLYFFEVGGYDPEMDIWGGENLEM 278

Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY--DSGFDI 174
           SF+      +   IP         A  P +     G    +     ++L     D     
Sbjct: 279 SFRVWMCGGSVEFIPCSHVGHIFRAGHP-YNMIGPGNNKDVHGTNSKRLAEVWMDDYKKF 337

Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE------------------VSNDW 216
           +    L+L  K D GD++ RK LR+ L CKSFKWYLE                  + N  
Sbjct: 338 YYIHRLDLKEK-DVGDLSERKALRQKLKCKSFKWYLENVAKNKFVLDENVAAFGSLRNPS 396

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHK-QGGNQFWMMSKHGEIRRDEAC 262
           S +C+D+  +      P+ ++PC   +   Q + ++  G +RR+  C
Sbjct: 397 SELCLDTLQRDEGEAIPLSVFPCQNGKSEAQIFSLTNDGILRRELTC 443


>gi|195429102|ref|XP_002062603.1| GK16570 [Drosophila willistoni]
 gi|194158688|gb|EDW73589.1| GK16570 [Drosophila willistoni]
          Length = 679

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 107/331 (32%), Positives = 146/331 (44%), Gaps = 65/331 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E    WL PLL+ +A N    V P I  I    F  R       +  +   G FDW  +
Sbjct: 307 VEANYNWLPPLLEPIAINERTAVCPFIDVIDHSNFNYR-------AQDEGARGAFDW--E 357

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F +  +P       K+ +EP  +P MAGGLF+I   FF +LG YD G DIWGGE  ELSF
Sbjct: 358 FFYKRLPLLPEDL-KHPSEPFKSPVMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSF 416

Query: 124 KF------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFE------KLGTYDSG 171
           K        + A   R    ++     V +P     L    K   E      K   YD G
Sbjct: 417 KIWMCGGEMYDAPCSRVGHIYRGPRNHVPSPRTGDYLHKNYKRVAEVWMDEYKKYLYDHG 476

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-EVSNDW-------------- 216
             I+         + D GD+T++K +R  L CKSFKW++ EV+ D               
Sbjct: 477 DGIYD--------RVDAGDLTAQKAIRTKLKCKSFKWFMEEVAFDLMKSYPPIDPPDYAS 528

Query: 217 --------SGMCIDSACKPTDMHKPVGLYPC----HKQGGNQFWMMSKHGEI--RRDEAC 262
                   S +C+D+       H  +G+Y C     K   NQF+ +S   ++  RR + C
Sbjct: 529 GAIQNVGDSSLCVDTHG--LRKHNRMGVYSCAEDLQKPQRNQFFQLSWKRDLRQRRKKDC 586

Query: 263 LDY----AGGDVILYPCHGSKGNQYFEYDYK 289
           LD     A   V L+ CHG +GNQY+ YDY+
Sbjct: 587 LDVQIWDANAPVWLWDCHGQQGNQYWFYDYR 617


>gi|195377912|ref|XP_002047731.1| GJ13596 [Drosophila virilis]
 gi|194154889|gb|EDW70073.1| GJ13596 [Drosophila virilis]
          Length = 675

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 106/330 (32%), Positives = 146/330 (44%), Gaps = 65/330 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E    WL PLLD +A+N    V P I  I    F  R       +  +   G FDW+  
Sbjct: 307 VEANYNWLPPLLDPIAQNKRAAVCPFIDVIDHSNFNYR-------AQDEGARGAFDWD-- 357

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F +  +P       K+ ++P  +P MAGGLF+I + FF +LG YD G DIWGGE  ELSF
Sbjct: 358 FFYKRLPLLPEDL-KHPSDPFKSPVMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSF 416

Query: 124 KF------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFE------KLGTYDSG 171
           K        + A   R    ++   + V  P     L    K   E      K   Y+ G
Sbjct: 417 KIWMCGGEMYDAPCSRVGHIYRGPRQGVKNPRSGDYLHKNYKRVAEVWMDEYKNYLYNHG 476

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW-------------- 216
             I+           D GD+T++K +R  L CKSFKW++E V+ D               
Sbjct: 477 DGIYDN--------VDPGDLTAQKAIRTKLKCKSFKWFMENVAFDLMKSYPPVDPPDYAS 528

Query: 217 --------SGMCIDSACKPTDMHKPVGLYPCH----KQGGNQFWMMS--KHGEIRRDEAC 262
                   + +CID+  +    H  VG+Y C     K    Q+W +S  +   +RR + C
Sbjct: 529 GAIQNVGDNTLCIDTLGRVR--HNRVGMYRCAIDLVKPQRTQYWSLSWKRDLRLRRKKDC 586

Query: 263 LDY----AGGDVILYPCHGSKGNQYFEYDY 288
           LD     A   V L+ CHG +GNQY+ YDY
Sbjct: 587 LDVQIWDANAPVWLWDCHGQQGNQYWFYDY 616


>gi|195120520|ref|XP_002004772.1| GI19414 [Drosophila mojavensis]
 gi|193909840|gb|EDW08707.1| GI19414 [Drosophila mojavensis]
          Length = 604

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 91/308 (29%), Positives = 137/308 (44%), Gaps = 44/308 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFF-IGGFDWNL 62
           CE  + W +PLL  +  + + V+ P+I  I    F+        T+ YK F +GGF W+ 
Sbjct: 249 CEANEGWCEPLLQRIKESRTSVLVPIIDVIDAKDFQYS------TNGYKSFQVGGFQWSG 302

Query: 63  QFNWHAIPERERKRHKNAAE------PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 116
            F+W  +PERE++R            P ++PTMAGGLF++D+ +F ++G+YD   D WGG
Sbjct: 303 HFDWVNLPEREKQRQLRECSQPREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGG 362

Query: 117 ENLELSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG 171
           ENLE+SF+          IP            P   P        I+ A    L   D  
Sbjct: 363 ENLEMSFRIWQCGGTIETIPCSRVGHIFRDFHPYKFPN-DRDTHGINTARM-ALVWMDEY 420

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVS 213
            +++     +L F  D GDVT R  LR+ L CKSF+WYL                  +V 
Sbjct: 421 INVFFLNRPDLKFHPDIGDVTHRVVLRKKLRCKSFEWYLKNVYPEKFVPNMNVKAWGKVK 480

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQ-GGNQFWMMSKHGEIRRDEACLDYAGGD--- 269
              S +C+D      +    +GLY C K    +Q +  +    +R + +C          
Sbjct: 481 AVNSNLCLDDLLNNNEKPYNLGLYACGKALQKSQLFSYTNSLVLRNELSCATVQHSSSPP 540

Query: 270 --VILYPC 275
             V++ PC
Sbjct: 541 HRVVMVPC 548


>gi|260790280|ref|XP_002590171.1| hypothetical protein BRAFLDRAFT_90906 [Branchiostoma floridae]
 gi|229275360|gb|EEN46182.1| hypothetical protein BRAFLDRAFT_90906 [Branchiostoma floridae]
          Length = 1466

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 94/325 (28%), Positives = 145/325 (44%), Gaps = 83/325 (25%)

Query: 4    CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIG-GFDWNL 62
             E    WL+PL+D +AR+   VVSP I  I  DTF   +    L  ++ + +G GFD   
Sbjct: 1113 VECNTGWLEPLVDRIARDRKTVVSPGIDWIHGDTFAYDYGIDTLRVTWGWNLGFGFDHEH 1172

Query: 63   QFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
               W  + E E+       +PV +P + GGLF+ID+ +F ++G YD G + WGGE+ E+S
Sbjct: 1173 AERWVQLSEDEQ------VKPVRSPMLLGGLFAIDRQYFREIGMYDPGLEYWGGEHFEIS 1226

Query: 123  FKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG------TYDSG----F 172
            FK                     W   M GG  SI+     ++G      TY +G     
Sbjct: 1227 FK--------------------AW---MCGG--SIEVLPCSRVGHVWGKKTYSTGNMTLH 1261

Query: 173  DIWGGENLELS------------------FKGDFGDVTSRKELRRNLGCKSFKWYL---- 210
            D     N+ ++                   K  FGD++ R+ LR  L CK F+WYL    
Sbjct: 1262 DWASRNNMRVAEVWMDHYKVHYYIRRPYLMKRKFGDISDRRRLRERLQCKDFRWYLDNAF 1321

Query: 211  --------------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI 256
                          +V N+ +  C+D   KP    + + ++PCH   G QF+ ++   ++
Sbjct: 1322 PDLYIPDDIPGRYGQVRNNGTNTCLDWTSKP---QRELEMFPCHHGLGTQFFELTGQNQL 1378

Query: 257  RRDEACLDYA--GGDVILYPCHGSK 279
            R + +CL+    G DV+L  C  S+
Sbjct: 1379 RDERSCLEARDDGSDVMLVTCGRSE 1403


>gi|427784527|gb|JAA57715.1| Putative polypeptide n-acetylgalactosaminyltransferase
           [Rhipicephalus pulchellus]
          Length = 612

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 101/330 (30%), Positives = 144/330 (43%), Gaps = 72/330 (21%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           E    WL PLL+ +A++   VV P I  I  +TF  R       +  +   G FDW L +
Sbjct: 257 EANVNWLPPLLEPIAKDYRTVVCPFIDVIDYETFAYR-------AQDEGARGSFDWELYY 309

Query: 65  N-WHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
                +PE       N  EP  +P MAGGLF+I + +F +LG YD G D+WGGE  ELSF
Sbjct: 310 KRLPLLPED----LANPTEPFKSPVMAGGLFAISRRYFWELGGYDEGLDVWGGEQYELSF 365

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-------TYDSGFDIWG 176
           K  W                 V  P    G      A F   G        Y    ++W 
Sbjct: 366 KI-WQC-----------GGTMVDAPCSRVGHIYRKFAPFPNPGIGDFVGRNYRRVAEVWM 413

Query: 177 GENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL------------------- 210
            E  E  +         + GD+T++KELR+ L CKSFKW++                   
Sbjct: 414 DEYKEYLYMRRPHYRNLEPGDLTAQKELRKRLNCKSFKWFMENVAFDQPSKYPAIEPPDY 473

Query: 211 ---EVSNDWSGMCIDSACKPTDMHKPVGLYPCHK----QGGNQFWMMSKHGEIR--RDEA 261
              E+ ++ S +CID+  K    ++   L  C +    Q G Q  +++ H +IR  +   
Sbjct: 474 AWGEIRHEKSSLCIDTQFK--GQNERFSLEKCIRDHRDQSGEQHLVLTWHKDIRPQKRTV 531

Query: 262 CLDYAGGD----VILYPCHGSKGNQYFEYD 287
           C D +  +    V+L+ CHG  GNQ F+YD
Sbjct: 532 CFDVSSSEPRAPVVLWSCHGMHGNQLFKYD 561


>gi|432936506|ref|XP_004082149.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1-like
           [Oryzias latipes]
          Length = 533

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 78/234 (33%), Positives = 104/234 (44%), Gaps = 56/234 (23%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP++  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 195 CEVNTDWLQPMIQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 247

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ++    +   P+ TP +AGG+F +DK++F  LG YD+  DIWGGEN ELSF
Sbjct: 248 FKWEQIPIEQKMARSDPTLPIRTPVIAGGIFVMDKSWFNHLGQYDTHMDIWGGENFELSF 307

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--------- 169
           +                    VW   M GG   I         F K   YD         
Sbjct: 308 R--------------------VW---MCGGSLEILPCSRVGHVFRKRHPYDFPEGNALTY 344

Query: 170 -----SGFDIWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
                   ++W  E  +  +          FG +T R  LRR L CK F+WY+E
Sbjct: 345 IKNTRRAAEVWMDEYKQFYYSARPSAQGKAFGSITERLSLRRKLNCKPFRWYME 398


>gi|432901709|ref|XP_004076908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Oryzias latipes]
          Length = 677

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 102/330 (30%), Positives = 142/330 (43%), Gaps = 67/330 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +A N   +V P+I  I  D F      G  T +     G FDW + 
Sbjct: 312 CEANINWLPPLLDRIALNRKTIVCPMIDVIDHDNF------GYETQAGDAMRGAFDWEMY 365

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + +EP  +P MAGGLF++D+ +F +LG YD+G +IWGGE  E+SF
Sbjct: 366 YKRIPIPAELQK--NDPSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISF 423

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   P   ++A  L  + + + ++   Y       
Sbjct: 424 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPGGVSLARNLKRVAEVWMDEYAEYVYQRR-- 481

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDV ++KELR  L CKSFKW++                      EV 
Sbjct: 482 -PEYRHLS----AGDVAAQKELRSTLNCKSFKWFMKEVAWDLPKHYPPVEPPAAAWGEVR 536

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI-----RRD--------- 259
           +  SG+C++S  K      P+ L  C K   +  W    HG++     R D         
Sbjct: 537 SAASGLCLES--KHFVSGTPIRLESCVKGRADVSW---GHGQVFTFGWREDIRVGDPMHT 591

Query: 260 -EACLDYAG--GDVILYPCHGSKGNQYFEY 286
            + C D       V LY CHG +GNQ + Y
Sbjct: 592 KKVCFDAVSHHSPVTLYDCHGMRGNQLWRY 621


>gi|260812139|ref|XP_002600778.1| hypothetical protein BRAFLDRAFT_127524 [Branchiostoma floridae]
 gi|229286068|gb|EEN56790.1| hypothetical protein BRAFLDRAFT_127524 [Branchiostoma floridae]
          Length = 561

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 101/326 (30%), Positives = 139/326 (42%), Gaps = 65/326 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLL+ +A N   +V P I  I  D F         T +     G FDW + 
Sbjct: 195 CEANVNWLPPLLEPIALNKKTIVCPNIDVIDKDDFHYE------TQAGDAMRGAFDWEMY 248

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP+    ++ + ++P  +P MAGGLF++D+ +FE+LG YD G DIWGGE  ELSF
Sbjct: 249 YKRIPIPDE--IKNPDPSDPFESPVMAGGLFAVDREYFEELGGYDPGLDIWGGEQYELSF 306

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSG------FDIWGG 177
           K  W                 V  P    G        ++     + G       ++W  
Sbjct: 307 KV-WQC-----------GGRMVDAPCSRVGHVYRKFVPYKVPAGVNLGKNLKRVAEVWMD 354

Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLEVS----------------- 213
           E  E  +       K D GD++ + +LR  L CK FKW+++V                  
Sbjct: 355 EYKEHLYKRRPHLRKTDMGDISGQLQLRERLKCKPFKWFMKVVAPDIILHYPPVEPEPAA 414

Query: 214 -----NDWSGMCIDSACKPTDMHKPVGLYPCHKQG----GNQFWMMSKHGEIRRD--EAC 262
                N  S +CIDS  K       V L  C K G    G Q + MS H +IR      C
Sbjct: 415 SGEIWNKASNLCIDS--KHGGGQAEVRLDQCVKGGGIMNGEQNFHMSWHNDIRPKGRTFC 472

Query: 263 LD--YAGGDVILYPCHGSKGNQYFEY 286
            D    GG +IL+ CH   GNQ++ Y
Sbjct: 473 FDAQMKGGTLILFACHQMLGNQHWLY 498


>gi|47221376|emb|CAF97294.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 675

 Score =  120 bits (302), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 102/331 (30%), Positives = 144/331 (43%), Gaps = 69/331 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +A+N   +V P+I  I  D F      G  T +     G FDW + 
Sbjct: 312 CEANVNWLPPLLDRIAQNRKTIVCPMIDVIDHDNF------GYETQAGDAMRGAFDWEMY 365

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K  ++ +EP  +P MAGGLF++D+ +F +LG YD+G +IWGGE  E+SF
Sbjct: 366 YKRIPIPPELQK--EDPSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISF 423

Query: 124 KFNW------HAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDI 174
           K  W        IP            P   P   ++A  L  + + + ++   Y      
Sbjct: 424 KV-WMCGGCMEDIPCSRVGHIYRKYVPYKVPGGVSLARNLKRVAEVWMDEYAEY---IYQ 479

Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EV 212
              E   LS     GD  ++K+LR  L CKSFKW++                      E+
Sbjct: 480 RRPEYRHLS----AGDTAAQKDLRSQLNCKSFKWFMTKVAWDLSKHYPPVEPPAAAWGEI 535

Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI-----RRD-------- 259
            N  S MC+++  K      PV +  C K  G   W    HG++     R D        
Sbjct: 536 RNVGSSMCLET--KHFVSGSPVWMESCLKGRGEVGW---NHGQVFTFGWREDIRVGDPMH 590

Query: 260 --EACLDYAGGD--VILYPCHGSKGNQYFEY 286
             + C D    +  V LY CHG KGNQ + Y
Sbjct: 591 TKKVCFDAVSNNSPVTLYDCHGMKGNQLWRY 621


>gi|195447414|ref|XP_002071203.1| GK25256 [Drosophila willistoni]
 gi|194167288|gb|EDW82189.1| GK25256 [Drosophila willistoni]
          Length = 587

 Score =  120 bits (302), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 90/311 (28%), Positives = 140/311 (45%), Gaps = 55/311 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + R+ + +  P+I  I    FE R  P   T ++  F G F+W + 
Sbjct: 234 CEVNLNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYR--PVYGTDNH--FRGIFEWGML 289

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +  + +P RE++R  + +EP  +PT AGGLF+I++ +F +LG YD G  +WGGEN ELSF
Sbjct: 290 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 349

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGL-------FSIDKAFFEKLG-----TYDSG 171
           K  W                  W P    G        ++  K   +K G      Y   
Sbjct: 350 KI-WQC-----------GGSIEWVPCSRVGHVYRGFMPYNFGKLANKKKGPLITINYKRV 397

Query: 172 FDIWGGENLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSG 218
            + W  E  +        L+   D GD+T +  L++ L CKSF+W++     +V + + G
Sbjct: 398 IETWFDETHKEYFYTREPLARYLDMGDITEQLALKKRLNCKSFQWFMDHIAYDVYDKFPG 457

Query: 219 M-----------CIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLD 264
           +                C  +  H+P   +GL  CH  G NQ   ++  G++   E C++
Sbjct: 458 LPANLHWGELRSVASDGCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCIE 517

Query: 265 YAGGDVILYPC 275
                + L  C
Sbjct: 518 ADRQGIKLAVC 528


>gi|115497708|ref|NP_001069909.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
           5 [Bos taurus]
 gi|83405338|gb|AAI11261.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 5 [Bos taurus]
 gi|440895696|gb|ELR47826.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
           5 [Bos grunniens mutus]
          Length = 448

 Score =  120 bits (301), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 109/217 (50%), Gaps = 25/217 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WL+PLL+ +A++   VV PLI  I  D   L + P  +        G F+W L+
Sbjct: 235 CEVNKVWLEPLLNAIAKDPKMVVCPLIDVI--DYMTLEYQPSPIVR------GAFNWRLE 286

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   E +  +    P+ +P MAGG+F+I++ +F ++G YD G ++WGGENLELS 
Sbjct: 287 FKWDHVLSYEIEGPEGPTTPIRSPAMAGGIFAINRHYFNEIGQYDKGMNLWGGENLELSL 346

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAF-FEKLGTYDSGFDIWGG 177
           +        + IP   R  H N              F I K   +  L    +  D + G
Sbjct: 347 RIWMCGGQLYVIP-CSRVGHINRQH-------VTNRFEIMKVVEYNNLRLVHTWLDEYKG 398

Query: 178 E---NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
           +            +G+++ R ELR+ LGCKSF+WYL+
Sbjct: 399 QFFLRRPALKSAAYGNISERVELRKRLGCKSFQWYLD 435


>gi|291244621|ref|XP_002742193.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 7-like
           [Saccoglossus kowalevskii]
          Length = 634

 Score =  120 bits (301), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 88/307 (28%), Positives = 132/307 (42%), Gaps = 51/307 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLL  + +N   VV PL+  +  D F      G    +     G F+W+  
Sbjct: 292 CECSPNWLPPLLSRIKQNRKAVVCPLVDAVDADNF------GYAPQADGMARGVFNWDFF 345

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP +E  R +  +EP  +P MAGGLF++ ++FF  +G YD+G DIWGGE  E+SF
Sbjct: 346 YKRIPIPPKEANRRERNSEPYRSPVMAGGLFALSRSFFFDIGGYDNGLDIWGGEQYEISF 405

Query: 124 KFNWHA------IP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
           K  W        +P  R    ++    P   P    G+  ++K +           ++W 
Sbjct: 406 KI-WMCGGILEFVPCSRVGHIYRRGGIPYSYPQSDDGISIVNKNYLRVA-------EVWM 457

Query: 177 GENLELSFK-------GDFGDVTSRKELRRNLGCKSFKWYL------------------- 210
            E  E  ++         +GD+T + + R+      FKW++                   
Sbjct: 458 DEYKEYFYRMKPELRGKPYGDITEQVQFRQEHCPHDFKWFMDEVAYDITERFPLISKNIG 517

Query: 211 --EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGG 268
             EV    S  C+DS  +       V LY CH  GG+Q   +++ GE R +E CL   G 
Sbjct: 518 WGEVRGVGSSKCVDSMGRSPSGK--VALYGCHGYGGSQLLRLNEGGEFRVNEECLYTDGS 575

Query: 269 DVILYPC 275
            V L  C
Sbjct: 576 TVKLERC 582


>gi|357622639|gb|EHJ74065.1| putative N-acetylgalactosaminyltransferase [Danaus plexippus]
          Length = 646

 Score =  120 bits (301), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 105/359 (29%), Positives = 146/359 (40%), Gaps = 109/359 (30%)

Query: 4   CEVQKRWLQPLLDVLA--------RNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFI 55
            EV   WL PLL  L+        R S   V+P+I  I  DTFE    P           
Sbjct: 275 IEVNVDWLPPLLTRLSEGVDGVNVRFSPRAVTPIIDVINADTFEYTSSP--------LVR 326

Query: 56  GGFDWNLQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 115
           GGF+W L F W  +P+   K  ++  +P+ +PTMAGGLF+I + +F K+G YDSG ++WG
Sbjct: 327 GGFNWGLHFKWDNLPKGTLKDDEDFIKPIRSPTMAGGLFAIYREYFNKIGKYDSGMNLWG 386

Query: 116 GENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYDS 170
           GENLE+SF+                    +W   M GG+  +         F K   Y +
Sbjct: 387 GENLEISFR--------------------IW---MCGGVLELCPCSRVGHVFRKRRPYGA 423

Query: 171 GFD-----------IWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE- 211
           G D           +W  E +    + +        GD++ R ELR++L CKSFKWYLE 
Sbjct: 424 GEDYMLRNSMRMARVWMDEYVNKVIEQNPSAAHVSIGDISERVELRKSLKCKSFKWYLEN 483

Query: 212 ------------------VSND--------W-----------------SGMCIDSACKPT 228
                               ND        W                 + +CI SA    
Sbjct: 484 VYPELETGEDTAARKRIAALNDPEKNKFQPWHSRKRNYTDSYQIRLRNTSLCIQSAKDIK 543

Query: 229 DMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA-CLDYAGGDVILYPCHGSKGNQYFEY 286
               P+ L  C +   NQ W  +  GE+      CLD A    I+  CH   G Q +++
Sbjct: 544 SKGSPLLLAGCTRT-INQMWFETDRGELVLGRTLCLD-ANTSPIIAKCHELGGTQEWKH 600


>gi|296488205|tpg|DAA30318.1| TPA: polypeptide N-acetylgalactosaminyltransferase-like 5 [Bos
           taurus]
          Length = 447

 Score =  120 bits (301), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 109/217 (50%), Gaps = 25/217 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WL+PLL+ +A++   VV PLI  I  D   L + P  +        G F+W L+
Sbjct: 235 CEVNKVWLEPLLNAIAKDPKMVVCPLIDVI--DYMTLEYQPSPIVR------GAFNWRLE 286

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   E +  +    P+ +P MAGG+F+I++ +F ++G YD G ++WGGENLELS 
Sbjct: 287 FKWDHVLSYEIEGPEGPTTPIRSPAMAGGIFAINRHYFNEIGQYDKGMNLWGGENLELSL 346

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAF-FEKLGTYDSGFDIWGG 177
           +        + IP   R  H N              F I K   +  L    +  D + G
Sbjct: 347 RIWMCGGQLYVIP-CSRVGHINRQH-------VTNRFEIMKVVEYNNLRLVHTWLDEYKG 398

Query: 178 E---NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
           +            +G+++ R ELR+ LGCKSF+WYL+
Sbjct: 399 QFFLRRPALKSAAYGNISERVELRKRLGCKSFQWYLD 435


>gi|170065987|ref|XP_001868085.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
 gi|167862691|gb|EDS26074.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
          Length = 639

 Score =  120 bits (301), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 76/218 (34%), Positives = 108/218 (49%), Gaps = 28/218 (12%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           EV   W++PLL  +  N + +  P+I  I  DTF     P           GGF+W L F
Sbjct: 272 EVNVDWIEPLLQRIKVNRTILAMPVIDIINSDTFAYTSSP--------LVRGGFNWGLHF 323

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
            W  +P+    +  +   P  +PTMAGGLF++D+ +F++LG YD G D+WGGENLE+SF+
Sbjct: 324 KWDNLPKGSLAKETDFVGPFQSPTMAGGLFAMDRKYFKELGEYDMGMDVWGGENLEISFR 383

Query: 125 FNWHA-----------IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
             W             I    RKR +    P  T TM      + + + +    Y     
Sbjct: 384 -AWQCGGSIELLPCSRIGHVFRKR-RPYGSPDGTDTMIRNSLRLARVWMDDYIKY----- 436

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
               EN   + K D GD++ R+ELR  L CKSF+WYL+
Sbjct: 437 --FFENQPHANKLDAGDLSERQELRNRLNCKSFEWYLK 472


>gi|332027983|gb|EGI68034.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Acromyrmex
           echinatior]
          Length = 597

 Score =  120 bits (301), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 104/314 (33%), Positives = 147/314 (46%), Gaps = 45/314 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WLQPLL  +  N + V+ P+I NI ++T E          ++ F +GGF W+  
Sbjct: 239 CEVIKDWLQPLLQRIKDNKNAVLMPIIDNISEETLEYFHD----NEAFFFQVGGFTWSGH 294

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  I + E +   +   P  +PTMAGGLF+I++ +F ++G+YD   D WGGENLE+SF
Sbjct: 295 FTWITIQKHEVESRFSPISPTRSPTMAGGLFAINRKYFWEIGSYDDKMDGWGGENLEISF 354

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPT--MAGGLFSIDKAFFEKLGTYDSGFDIWG 176
           +          IP            P   P      G+ +   AF   +  Y   F +  
Sbjct: 355 RIWQCGGTLEIIPCSRVGHIFRNFHPYKFPNDKDTHGINTARLAFVW-MDEYKRLFLLHR 413

Query: 177 GE---NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVSND 215
            E   N EL      GD++ R +LR+ L CKSFKWYL                   V   
Sbjct: 414 SEFKDNPEL-----IGDISERLKLRKKLKCKSFKWYLNNVYPEKFIPDENAIAYGRVRLR 468

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCH-KQGGNQFWMMSKHGEIRRDEAC---LDYAGG--- 268
              +C+D+     D    +GLY CH K   +QF+ +SK GE+RR+E C   LD   G   
Sbjct: 469 NRRLCLDNLQHDDDKPYNLGLYNCHTKLYPSQFFSLSKSGELRREETCGRILDTDSGPYA 528

Query: 269 DVILYPCHGSKGNQ 282
            + +  C   KG +
Sbjct: 529 QIEMSDCSNEKGGK 542


>gi|224496010|ref|NP_001139074.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Danio rerio]
          Length = 600

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 93/324 (28%), Positives = 142/324 (43%), Gaps = 55/324 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +A+N   +V P+I  I  + F      G    +     G FDW + 
Sbjct: 234 CEANINWLPPLLDQIAQNPKTIVCPMIDVIDHNHF------GYEAQAGDAMRGAFDWEMY 287

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +    + ++P  +P MAGGLF++++ +F +LG YD+G +IWGGE  E+SF
Sbjct: 288 YKRIPIPPELQG--PDPSDPYQSPVMAGGLFAVNRQWFWELGGYDTGLEIWGGEQFEISF 345

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K      + + +P            P   P+      ++ +     +  Y         E
Sbjct: 346 KVWMCGGSMYDVPCSRVGHIYRKYVPYKVPSGTSLARNLKRVAETWMDEYTEYIYQRRPE 405

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVSNDW 216
              LS     GD+T++KELR++L CK FKWY+                      E+ N  
Sbjct: 406 YRHLS----TGDLTAQKELRKHLKCKDFKWYMNTVAWDLPKYYPPVEPLPAAWGEIRNAA 461

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIRRDEA------CLD 264
           SG+C+DS    T     + L  C K+G  + W   +        +IR  +       C D
Sbjct: 462 SGLCVDSKHGSTGTE--LRLDNCLKEGAERTWAHEQIFTFGWREDIRPGDPLHTRKFCFD 519

Query: 265 YAGGD--VILYPCHGSKGNQYFEY 286
               +  + LY CHG KGNQ++ Y
Sbjct: 520 AISQNSPITLYDCHGMKGNQHWSY 543


>gi|301607546|ref|XP_002933365.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           6-like isoform 1 [Xenopus (Silurana) tropicalis]
          Length = 600

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 99/324 (30%), Positives = 136/324 (41%), Gaps = 55/324 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL+ +A N   +V P+I  I  + F      G    +     G FDW + 
Sbjct: 234 CEVNVNWLPPLLNQIALNHKTIVCPMIDVIDHNHF------GYEAQAGDAMRGAFDWEMY 287

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   ++   + +EP  +P MAGGLF++D+ +F +LG YD G +IWGGE  ELSF
Sbjct: 288 YKRIPIPPELQR--TDPSEPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYELSF 345

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K          +P            P   PT      ++ +     +  Y         E
Sbjct: 346 KVWMCGGEMFDVPCSRVGHIYRKYVPYKVPTGTSLARNLKRVAETWMDEYAEYIYQRRPE 405

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVSNDW 216
              LS     GD++S+KELR++L CK FKWY+                      E+ N  
Sbjct: 406 YRHLS----TGDISSQKELRKHLKCKDFKWYMSEVAWDVPKFYPPVEPPPASWGEIRNVA 461

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIRRDEA------CLD 264
           S +CIDS    T     + L  C K G  + W   +        +IR  E       C D
Sbjct: 462 SNLCIDSKHGATGTE--LRLDTCVKDGSERTWSHEQLFTFGWREDIRPGEPLHTRKFCFD 519

Query: 265 YA--GGDVILYPCHGSKGNQYFEY 286
                  V LY CHG KGNQ + Y
Sbjct: 520 SISHSSPVTLYDCHGMKGNQQWSY 543


>gi|195039904|ref|XP_001990971.1| GH12336 [Drosophila grimshawi]
 gi|193900729|gb|EDV99595.1| GH12336 [Drosophila grimshawi]
          Length = 591

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 92/309 (29%), Positives = 140/309 (45%), Gaps = 51/309 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + R+ + +  P+I  I   TFE R     +  S   F G F+W + 
Sbjct: 238 CEVNLNWLPPLLAPIYRDRTVMTVPIIDGIDHKTFEYR----PVYGSDNHFRGIFEWGML 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +  + +P RE++R  + +EP  +PT AGGLF+I++ +F +LG YD G  +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353

Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K  W      E     R  H       + P   G L S  K     +  Y    + W  +
Sbjct: 354 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 408

Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-------------------- 210
             +        L+   D GD++ +  L++ L CKSF+W++                    
Sbjct: 409 THKEFFYTREPLARYLDMGDISEQLALKKRLNCKSFQWFMDNIAYDVVDKFPALPANLHW 468

Query: 211 -EVSNDWSGMCIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA 266
            E+ +  S  C+DS       H+P   +GL  CH  G NQ   ++  G++   E C++  
Sbjct: 469 GELRSVASDGCLDSMG-----HQPPAIMGLSYCHGGGNNQLVRLNAVGQLGVGERCVEAD 523

Query: 267 GGDVILYPC 275
              + L  C
Sbjct: 524 RQGIKLAVC 532


>gi|301607548|ref|XP_002933366.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           6-like isoform 2 [Xenopus (Silurana) tropicalis]
          Length = 601

 Score =  120 bits (300), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 99/324 (30%), Positives = 136/324 (41%), Gaps = 55/324 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL+ +A N   +V P+I  I  + F      G    +     G FDW + 
Sbjct: 235 CEVNVNWLPPLLNQIALNHKTIVCPMIDVIDHNHF------GYEAQAGDAMRGAFDWEMY 288

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   ++   + +EP  +P MAGGLF++D+ +F +LG YD G +IWGGE  ELSF
Sbjct: 289 YKRIPIPPELQR--TDPSEPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYELSF 346

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K          +P            P   PT      ++ +     +  Y         E
Sbjct: 347 KVWMCGGEMFDVPCSRVGHIYRKYVPYKVPTGTSLARNLKRVAETWMDEYAEYIYQRRPE 406

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVSNDW 216
              LS     GD++S+KELR++L CK FKWY+                      E+ N  
Sbjct: 407 YRHLS----TGDISSQKELRKHLKCKDFKWYMSEVAWDVPKFYPPVEPPPASWGEIRNVA 462

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIRRDEA------CLD 264
           S +CIDS    T     + L  C K G  + W   +        +IR  E       C D
Sbjct: 463 SNLCIDSKHGATGTE--LRLDTCVKDGSERTWSHEQLFTFGWREDIRPGEPLHTRKFCFD 520

Query: 265 YA--GGDVILYPCHGSKGNQYFEY 286
                  V LY CHG KGNQ + Y
Sbjct: 521 SISHSSPVTLYDCHGMKGNQQWSY 544


>gi|322787059|gb|EFZ13283.1| hypothetical protein SINV_13249 [Solenopsis invicta]
          Length = 540

 Score =  120 bits (300), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 106/327 (32%), Positives = 145/327 (44%), Gaps = 65/327 (19%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           E    WL PLL+ +AR+    V P I  I  +TFE R       +  +   G FDW L +
Sbjct: 178 EANVNWLPPLLEPIARDYKTCVCPFIDVIAYETFEYR-------AQDEGARGAFDWELYY 230

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
               +   + KR    AEP  +P MAGGLF+I   FF +LG YD G DIWGGE  ELSFK
Sbjct: 231 KRLPLLPEDLKR---PAEPFKSPIMAGGLFAISTKFFWELGGYDPGLDIWGGEQYELSFK 287

Query: 125 FNWHAIPER-----ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-TYDSGFDIWGGE 178
             W    +       R  H     P + P    G F         LG  Y    ++W  E
Sbjct: 288 I-WQCGGQMYDAPCSRVGHIYRKFPPF-PNPGRGDF---------LGKNYKRVAEVWMDE 336

Query: 179 NLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL--------------------- 210
             E  +K        D GD++ +K LR  L CKSF W++                     
Sbjct: 337 YAEYIYKRRPHLRTLDPGDLSEQKALRTKLHCKSFNWFMKNIAFDLVEVYPPIEPDDFAF 396

Query: 211 -EVSN-DWSGMCIDSACKPTD--MHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA--CLD 264
            E+ N   + +C+DS  +  D  +   + +    K  G Q + ++ H +IR  +   CLD
Sbjct: 397 GEIRNMGATELCLDSKKRKRDEVVVMDICMKDDPKMSGEQEFRLTWHKDIRPKDRTDCLD 456

Query: 265 YAGGD----VILYPCHGSKGNQYFEYD 287
            + G+    V LYPCHG +GNQ + YD
Sbjct: 457 VSRGEEKAPVSLYPCHGKQGNQLWRYD 483


>gi|332839183|ref|XP_001147578.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           5 [Pan troglodytes]
          Length = 638

 Score =  120 bits (300), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 97/313 (30%), Positives = 142/313 (45%), Gaps = 60/313 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWN 61
           CE    WL+PLL  +A + + VVSP I  I  +TFE   P   GR+ S      G FDW+
Sbjct: 272 CECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSR-----GNFDWS 326

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           L F W  +P  E++R K+   P+ +PT AGGLFSI K++FE +GTYD+  +IWGGEN+E+
Sbjct: 327 LTFGWETLPPHEKQRRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEM 386

Query: 122 SFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
           SF+  W    + E          +   ++ G +F           T+  G  +     + 
Sbjct: 387 SFRV-WQCGGQLE----------IIPCSVVGHVFRTKSPH-----TFPKGTSVIARNQVR 430

Query: 182 LSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYPCHK 241
           L+    + D   +   RRNL  ++ K   EV  + S   +D      + + P  L+    
Sbjct: 431 LA--EVWMDSYKKIFYRRNL--QAAKMAQEVRGNGSRRGLDE-----EKYGPQTLFMGLM 481

Query: 242 QGG---NQFWMMSK------------------HGEIRR--DEACLDY-----AGGDVILY 273
            G    +  W +                    +G I+      CLD       G  +I+Y
Sbjct: 482 AGTHLISTIWRLPSPSGTFYPERFVPDLTPTFYGAIKNLGTNQCLDVGENNRGGKPLIMY 541

Query: 274 PCHGSKGNQYFEY 286
            CHG  GNQYFEY
Sbjct: 542 SCHGLGGNQYFEY 554


>gi|195397828|ref|XP_002057530.1| GJ18184 [Drosophila virilis]
 gi|194141184|gb|EDW57603.1| GJ18184 [Drosophila virilis]
          Length = 625

 Score =  120 bits (300), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 96/336 (28%), Positives = 145/336 (43%), Gaps = 71/336 (21%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           EV ++WL+PLL ++   ++ +  P+I  I  DTFE  + P  L        GGF+W L F
Sbjct: 243 EVNRQWLEPLLRLVHAENATLAVPVIDLINADTFE--YTPSPLVR------GGFNWGLHF 294

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
            W  +PE   K  ++   P  +PTMAGGLF++ + +F+ +G YD   DIWGGEN+E+SF+
Sbjct: 295 RWENLPEGTLKVPEDFKGPFRSPTMAGGLFAVSRLYFQHIGEYDMAMDIWGGENIEISFR 354

Query: 125 FNWHA------IPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
             W        +P        RKR    A P    TM      +   + +K   Y    +
Sbjct: 355 V-WQCGGAIKIVPCSRVGHIFRKRRPYTA-PDGANTMLKNSLRLAHVWMDKYKDYYLKHE 412

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE--------------------VS 213
                  ++    DFGD+++R +LR  L CK F WYL+                    V 
Sbjct: 413 -------KVPKDYDFGDISARLQLRERLHCKDFDWYLKHVYPELRVPGDESKKPAVAPVF 465

Query: 214 NDW---------------SGMCIDSACKPTDMH------KPVGLYPCHKQGGNQFWMMSK 252
             W               SG  + +A     +         + L  C  +  NQ W  ++
Sbjct: 466 QPWHSRKRNYLDSFQLRLSGTQLCAAVVAPKVKGFWKKGSSLTLQNCRTRAANQMWYETE 525

Query: 253 HGEIRRDE-ACLDYAGGD-VILYPCHGSKGNQYFEY 286
             EI  D+  CL+ A    VI+  CH   G+Q + +
Sbjct: 526 KSEIILDKLLCLEAAADTLVIINKCHEMLGDQQWRH 561


>gi|432901498|ref|XP_004076865.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Oryzias latipes]
          Length = 607

 Score =  120 bits (300), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 99/335 (29%), Positives = 143/335 (42%), Gaps = 77/335 (22%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD + +N   +V P+I  I  D F      G  T +     G FDW + 
Sbjct: 242 CEANVNWLPPLLDRIVQNRKTIVCPMIDVIDHDNF------GYDTQAGDAMRGAFDWEMY 295

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   R    +  EP  +P MAGGLF++D+ +F +LG YD+G +IWGGE  E+SF
Sbjct: 296 YKRIPIPAEMRT--DDPTEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISF 353

Query: 124 KF-----NWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           K          IP  R    ++      + P    G  S+ K             ++W  
Sbjct: 354 KVWMCGGRMEDIPCSRVGHIYRK-----YVPYKVPGGISLAKNL-------KRVAEVWMD 401

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL-------------------- 210
           E  E  ++          GD++++KELR +L CK+F+W++                    
Sbjct: 402 EYAEYVYQRRPEYRHLSAGDMSAQKELRSHLNCKNFRWFMEEVAWDLPKHYPPVEPPAAA 461

Query: 211 --EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI-----RRD---- 259
             E+ +  SGMC++   K      P+ L  C K  G+  W    HG++     R D    
Sbjct: 462 WGEIRSVGSGMCME--IKHFVSGSPIRLESCVKGRGDVSW---SHGQVLTFGWREDIRVG 516

Query: 260 ------EACLDYAG--GDVILYPCHGSKGNQYFEY 286
                 + C D       V LY CHG KGNQ + Y
Sbjct: 517 DPMHTRKVCFDAVSHHSPVTLYDCHGMKGNQLWRY 551


>gi|326437922|gb|EGD83492.1| hypothetical protein PTSG_04099 [Salpingoeca sp. ATCC 50818]
          Length = 699

 Score =  120 bits (300), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 99/317 (31%), Positives = 146/317 (46%), Gaps = 56/317 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +A++ + VV P I  I   T +  +  G  +S      G F W L 
Sbjct: 367 CEANLGWLEPLLAWMAKDKTRVVCPTIDRISAQTMD--YVGGGASSR-----GTFHWTLD 419

Query: 64  FNW-HAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W +A+    R+  +  A+P+ +PTMAGGLF I++ +F +LGTYD G D WGGENLE+S
Sbjct: 420 FTWEYAV----RQHGETPADPIKSPTMAGGLFGINRDYFYELGTYDMGMDGWGGENLEMS 475

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      + H IP            P   P       ++++ F +         ++W  
Sbjct: 476 FRIWQCGGSLHIIPCSRVGHIFRDWHPYAIPNS-----TVNETFLKNSIRL---AEVWMD 527

Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCI---DSACKP 227
           E  ++ +         DFGDV+ RK LR  LGCKSFKWYL+  N   G  I   D     
Sbjct: 528 EYKDIFYDIKPSARSVDFGDVSERKALREKLGCKSFKWYLD--NVVPGKLIPNSDVVLHK 585

Query: 228 TDMHKPVGL----------YPCHKQG----GNQFWMMSKHGEIRR--DEACLDYAGGDVI 271
             +   + +          YPCH  G       FW ++ + E+R   D     +    V+
Sbjct: 586 GQVRNSLNICMDKGAGSLAYPCHTPGVHSTSQAFW-LTVYKEVRHVWDLCLTSHDNKRVM 644

Query: 272 LYPCHGSKGNQYFEYDY 288
           L  C     ++ +EYD+
Sbjct: 645 LSTC--GPNSRKWEYDH 659


>gi|55742075|ref|NP_001006904.1| polypeptide N-acetylgalactosaminyltransferase 11 [Xenopus
           (Silurana) tropicalis]
 gi|49522064|gb|AAH75106.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
           [Xenopus (Silurana) tropicalis]
          Length = 563

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 83/217 (38%), Positives = 107/217 (49%), Gaps = 24/217 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WLQPLL  +  N   VV P+I  I  DT         + SS     GGF+W L 
Sbjct: 203 CEVNEMWLQPLLAPIKENPRTVVCPVIDIISADTL--------IYSSSPVVRGGFNWGLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P  E    +  + P  +PTMAGGLF++D+ +F  LG YDSG DIWGGENLE+SF
Sbjct: 255 FKWDPVPLAELGGPEGFSAPFRSPTMAGGLFAMDREYFNMLGQYDSGMDIWGGENLEISF 314

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP----TMAGGLFSIDKAFFEKLGTYDSGFDI 174
           +      +   +P            P  +P    TMA     +   +       D   D 
Sbjct: 315 RIWMCGGSLLIVPCSRVGHIFRKRRPYGSPGGHDTMAHNSLRLAHVWM------DEYKDQ 368

Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
           +     EL  + DFGD+  R  LRR L CKSFKWYL+
Sbjct: 369 YFALRPELRNR-DFGDIRERLALRRRLNCKSFKWYLD 404


>gi|195400935|ref|XP_002059071.1| GJ15190 [Drosophila virilis]
 gi|194141723|gb|EDW58140.1| GJ15190 [Drosophila virilis]
          Length = 591

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 92/309 (29%), Positives = 140/309 (45%), Gaps = 51/309 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + R+ + +  P+I  I   +FE R     +  S   F G F+W + 
Sbjct: 238 CEVNLNWLPPLLAPIYRDRTVMTVPIIDGIDHKSFEYR----PVYGSDTHFRGIFEWGML 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +  + +P RE++R  + +EP  +PT AGGLF+I++ +F +LG YD G  +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353

Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K  W      E     R  H       + P   G L S  K     +  Y    + W  +
Sbjct: 354 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 408

Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-------------------- 210
             +        L+   D GD+T +  L++ L CKSF+W++                    
Sbjct: 409 THKEFFYTREPLARYLDMGDITEQLALKKRLNCKSFQWFMDNIAYDVVDKFPALPANLHW 468

Query: 211 -EVSNDWSGMCIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA 266
            E+ +  S  C+DS       H+P   +GL  CH  G NQ   ++  G++   E C++  
Sbjct: 469 GELRSVASDGCLDSMG-----HQPPAIMGLSYCHGGGNNQLVRLNAVGQLGVGERCVEAD 523

Query: 267 GGDVILYPC 275
              + L  C
Sbjct: 524 RQGIKLAIC 532


>gi|126303658|ref|XP_001380711.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
           [Monodelphis domestica]
          Length = 552

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 74/213 (34%), Positives = 103/213 (48%), Gaps = 14/213 (6%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WL PLL  +  + + VV P+I  I  DTF          SS     GGFDW L 
Sbjct: 202 CEVNKDWLLPLLHRIKEDPTRVVCPVIDIINRDTFAY-------VSSSPDMRGGFDWTLH 254

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +  RE+    +  +P+ TP ++GGLF ++K++F  LG YD+  DIWGGEN E+SF
Sbjct: 255 FKWEELTLREKALRVDPIQPIETPIISGGLFVMNKSWFNHLGKYDAAMDIWGGENFEISF 314

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +      +   +P            P   P   G L +  K        +   F  +   
Sbjct: 315 RVWMCGGSLEILPCSRVGHVFRKKHPYTFP--EGNLNTYIKNTKRTAEVWMDEFKHYFYA 372

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
              ++    FG++ SR ELR+ L C +FKWYLE
Sbjct: 373 ARPVAQGRPFGNIQSRVELRKRLKCHTFKWYLE 405


>gi|16198165|gb|AAL13889.1| LD36616p [Drosophila melanogaster]
          Length = 486

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/304 (29%), Positives = 141/304 (46%), Gaps = 41/304 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + R+ + +  P+I  I    FE R  P   T ++  F G F+W + 
Sbjct: 133 CEVNTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYR--PVYGTDNH--FRGIFEWGML 188

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +  + +P RE++R  + +EP  +PT AGGLF+I++ +F +LG YD G  +WGGEN ELSF
Sbjct: 189 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 248

Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K  W      E     R  H       + P   G L S  K     +  Y    + W  +
Sbjct: 249 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 303

Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGM------ 219
             +        L+   D GD++ +  L++ L CKSF+W++     +V + + G+      
Sbjct: 304 THKEYFYTREPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHW 363

Query: 220 -----CIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVI 271
                     C  +  H+P   +GL  CH  G NQ   ++  G++   E C++     + 
Sbjct: 364 GELRSVASDGCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGIK 423

Query: 272 LYPC 275
           L  C
Sbjct: 424 LAVC 427


>gi|21552985|gb|AAM62412.1|AF493067_1 UDP-N-acetylgalactosamine: polypeptide
           N-acetylgalactosaminyltransferase 2 [Drosophila
           melanogaster]
          Length = 591

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/304 (29%), Positives = 141/304 (46%), Gaps = 41/304 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + R+ + +  P+I  I    FE R  P   T ++  F G F+W + 
Sbjct: 238 CEVNTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYR--PVYGTDNH--FRGIFEWGML 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +  + +P RE++R  + +EP  +PT AGGLF+I++ +F +LG YD G  +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353

Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K  W      E     R  H       + P   G L S  K     +  Y    + W  +
Sbjct: 354 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 408

Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGM------ 219
             +        L+   D GD++ +  L++ L CKSF+W++     +V + + G+      
Sbjct: 409 THKEYFYTREPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHW 468

Query: 220 -----CIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVI 271
                     C  +  H+P   +GL  CH  G NQ   ++  G++   E C++     + 
Sbjct: 469 GELRSVASDGCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGIK 528

Query: 272 LYPC 275
           L  C
Sbjct: 529 LAVC 532


>gi|195481361|ref|XP_002101619.1| GE15519 [Drosophila yakuba]
 gi|194189143|gb|EDX02727.1| GE15519 [Drosophila yakuba]
          Length = 591

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/304 (29%), Positives = 141/304 (46%), Gaps = 41/304 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + R+ + +  P+I  I    FE R  P   T ++  F G F+W + 
Sbjct: 238 CEVNTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYR--PVYGTDNH--FRGIFEWGML 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +  + +P RE++R  + +EP  +PT AGGLF+I++ +F +LG YD G  +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353

Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K  W      E     R  H       + P   G L S  K     +  Y    + W  +
Sbjct: 354 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 408

Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGM------ 219
             +        L+   D GD++ +  L++ L CKSF+W++     +V + + G+      
Sbjct: 409 THKEYFYTREPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHW 468

Query: 220 -----CIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVI 271
                     C  +  H+P   +GL  CH  G NQ   ++  G++   E C++     + 
Sbjct: 469 GELRSVASDGCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGIK 528

Query: 272 LYPC 275
           L  C
Sbjct: 529 LAVC 532


>gi|195345467|ref|XP_002039290.1| GM22807 [Drosophila sechellia]
 gi|194134516|gb|EDW56032.1| GM22807 [Drosophila sechellia]
          Length = 591

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/304 (29%), Positives = 141/304 (46%), Gaps = 41/304 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + R+ + +  P+I  I    FE R  P   T ++  F G F+W + 
Sbjct: 238 CEVNTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYR--PVYGTDNH--FRGIFEWGML 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +  + +P RE++R  + +EP  +PT AGGLF+I++ +F +LG YD G  +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353

Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K  W      E     R  H       + P   G L S  K     +  Y    + W  +
Sbjct: 354 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 408

Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGM------ 219
             +        L+   D GD++ +  L++ L CKSF+W++     +V + + G+      
Sbjct: 409 THKEYFYTREPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHW 468

Query: 220 -----CIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVI 271
                     C  +  H+P   +GL  CH  G NQ   ++  G++   E C++     + 
Sbjct: 469 GELRSVASDGCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGIK 528

Query: 272 LYPC 275
           L  C
Sbjct: 529 LAVC 532


>gi|194892500|ref|XP_001977673.1| GG18114 [Drosophila erecta]
 gi|190649322|gb|EDV46600.1| GG18114 [Drosophila erecta]
          Length = 591

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/304 (29%), Positives = 141/304 (46%), Gaps = 41/304 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + R+ + +  P+I  I    FE R  P   T ++  F G F+W + 
Sbjct: 238 CEVNTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYR--PVYGTDNH--FRGIFEWGML 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +  + +P RE++R  + +EP  +PT AGGLF+I++ +F +LG YD G  +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353

Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K  W      E     R  H       + P   G L S  K     +  Y    + W  +
Sbjct: 354 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 408

Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGM------ 219
             +        L+   D GD++ +  L++ L CKSF+W++     +V + + G+      
Sbjct: 409 THKEYFYTREPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHW 468

Query: 220 -----CIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVI 271
                     C  +  H+P   +GL  CH  G NQ   ++  G++   E C++     + 
Sbjct: 469 GELRSVASDGCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGIK 528

Query: 272 LYPC 275
           L  C
Sbjct: 529 LAVC 532


>gi|119508144|gb|ABL75647.1| IP16941p [Drosophila melanogaster]
          Length = 245

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 85/240 (35%), Positives = 115/240 (47%), Gaps = 67/240 (27%)

Query: 78  KNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKR 137
           ++ AEPV++PTMAGGLFSID+ FF++LGTYDSGFDIWGGENLELSFK  W          
Sbjct: 22  ESTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFK-TW---------- 70

Query: 138 HKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFDIWGGENLELSF------ 184
                       M GG   I         F K   Y   SG ++    ++ L+       
Sbjct: 71  ------------MCGGTLEIVPCSHVGHIFRKRSPYKWRSGVNVLKKNSVRLAEVWMDEY 118

Query: 185 -----------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDW 216
                      KGD+GDV+ R++LR +L CKSFKWYL                 E++N  
Sbjct: 119 SQYYYHRIGNDKGDWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFIPGDSVAHGEIANVP 178

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQ--GGNQFWMMSKHGEIRRDEACLDYAGGDVILYP 274
           +GMC+D+  K ++   PV +Y C ++   G++F  MS   +      C       V   P
Sbjct: 179 NGMCLDAKEK-SEEETPVSIYECKRKIRRGHRFPFMSAMAKEEISTGCSARRAKSVATTP 237


>gi|24643052|ref|NP_573301.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2, isoform A
           [Drosophila melanogaster]
 gi|24643054|ref|NP_728178.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2, isoform B
           [Drosophila melanogaster]
 gi|51316019|sp|Q8MV48.2|GALT7_DROME RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 7;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 7; Short=pp-GaNTase 7;
           AltName: Full=dGalNAc-T2
 gi|7293476|gb|AAF48851.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2, isoform A
           [Drosophila melanogaster]
 gi|22832507|gb|AAN09470.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2, isoform B
           [Drosophila melanogaster]
 gi|34043004|gb|AAQ56704.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           [Drosophila melanogaster]
 gi|54650858|gb|AAV37008.1| LD01328p [Drosophila melanogaster]
 gi|220950352|gb|ACL87719.1| GalNAc-T2-PA [synthetic construct]
          Length = 591

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/304 (29%), Positives = 141/304 (46%), Gaps = 41/304 (13%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + R+ + +  P+I  I    FE R  P   T ++  F G F+W + 
Sbjct: 238 CEVNTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYR--PVYGTDNH--FRGIFEWGML 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +  + +P RE++R  + +EP  +PT AGGLF+I++ +F +LG YD G  +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353

Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K  W      E     R  H       + P   G L S  K     +  Y    + W  +
Sbjct: 354 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 408

Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-----EVSNDWSGM------ 219
             +        L+   D GD++ +  L++ L CKSF+W++     +V + + G+      
Sbjct: 409 THKEYFYTREPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHW 468

Query: 220 -----CIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVI 271
                     C  +  H+P   +GL  CH  G NQ   ++  G++   E C++     + 
Sbjct: 469 GELRSVASDGCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGIK 528

Query: 272 LYPC 275
           L  C
Sbjct: 529 LAVC 532


>gi|291397402|ref|XP_002715124.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Oryctolagus cuniculus]
          Length = 439

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 73/218 (33%), Positives = 114/218 (52%), Gaps = 27/218 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WL+PLL V+A++   VV P+I  I + T E +  P           G F+W LQ
Sbjct: 226 CEVNKVWLEPLLSVIAKDPHTVVCPIIDVIDEMTLEYKPSP--------IVRGTFNWMLQ 277

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   E +  +  A+P+ +P+MAGG+F+I + +F+++G YD   D+WGGEN+E+S 
Sbjct: 278 FKWDNVFSYEMEGPEGPAKPIRSPSMAGGIFAIHRHYFKEIGQYDKDMDLWGGENVEISL 337

Query: 124 KF-----NWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           +          IP  R     + + EP    T A     + + +   + T+   +     
Sbjct: 338 RIWMCGGQLFIIPCSRVGHITRKSPEPNLAVTKA-----VTRNYLRLVHTWLDEYK---- 388

Query: 178 ENLELSFKG----DFGDVTSRKELRRNLGCKSFKWYLE 211
           E   L   G     +G+++ R ELR+ LGCKSF+WYL+
Sbjct: 389 EQFFLHRPGLRSIPYGNISERVELRKRLGCKSFQWYLD 426


>gi|196001845|ref|XP_002110790.1| hypothetical protein TRIADDRAFT_23005 [Trichoplax adhaerens]
 gi|190586741|gb|EDV26794.1| hypothetical protein TRIADDRAFT_23005 [Trichoplax adhaerens]
          Length = 519

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 90/314 (28%), Positives = 134/314 (42%), Gaps = 54/314 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL+PLL  +  + + VV+P I  I   +F   + P  L        G FDWNL+
Sbjct: 170 CEVNVGWLEPLLRRVNEDPTVVVTPEIDLIDASSFRYLYGPSGLIR------GVFDWNLK 223

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  IP  ER   K+  E V +PTM G +F+ID+ FF+ +G YDS  + W  E+LE+SF
Sbjct: 224 FKWKVIPREERLARKSPIESVRSPTMGGDIFAIDRKFFQSIGKYDSQVETWEVEHLEISF 283

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF-DIWGG 177
           +          IP     +   + +P   P          ++F + LG       +IW  
Sbjct: 284 RIWLCGGKIEIIPCSHVGQVLRSFQPYQPP----------QSFDDYLGKNSQRIAEIWLD 333

Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYL------------------EV 212
           +  E  +       +   GD+T+    R+ LGCK+F+WYL                  ++
Sbjct: 334 DYKEFYYQRYPHLRQNFLGDITAELRQRQKLGCKNFRWYLNNVFTDAVFPNESVMAEGKI 393

Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDY----AGG 268
            N  S  C+  A K       V L  C     +  +  +   EI  +  CLD      G 
Sbjct: 394 RNPASANCLMVAGKTNSY---VRLITCVHDTSSMIFRFTIRREIEINGKCLDANRSKRGS 450

Query: 269 DVILYPCHGSKGNQ 282
            + L  CH  + +Q
Sbjct: 451 KIQLVDCHRMRDSQ 464


>gi|363730187|ref|XP_418741.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 2 [Gallus gallus]
          Length = 638

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 93/331 (28%), Positives = 132/331 (39%), Gaps = 73/331 (22%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE QK WL+PLL  L+ N + VVSP+I  I   TF+          S     G FDW L 
Sbjct: 287 CECQKGWLEPLLARLSSNRNSVVSPIIDVIDWKTFQYYH-------SVSLHRGVFDWKLD 339

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+W  +PE E K  ++   P+ +P +AG + ++D+ +F+ +G YDS   +WG ENLELS 
Sbjct: 340 FHWEPVPEHEEKVRQSPTSPIRSPAVAGAVVAMDRHYFQNIGAYDSDMTMWGAENLELSI 399

Query: 124 KFNW-------------------HAIP-----ERERKRHKNAAEPVWTPTMAGGLFSIDK 159
           +  W                   H IP     E    R+K      W  +     +  D 
Sbjct: 400 R-TWLCGGSVEIIPCSRVGHVYRHHIPHAFSYEEAIVRNKIRIAETWLDSFKENFYKNDT 458

Query: 160 AFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL--------- 210
             F                   L  K +  D + R +L++ LGC+SF+W++         
Sbjct: 459 VAF-------------------LISKAEKPDCSERLQLQKRLGCRSFQWFITNVYPELSR 499

Query: 211 ---------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEA 261
                    ++ N  +G C D           + L PC       F   SK  EIR   A
Sbjct: 500 PEDAPRLSGKLYNTGAGFCADYRPGMALADGSIKLSPCTNSLTQHFEYNSKK-EIRVGSA 558

Query: 262 ---CLDYAGGDVILYPCHGSKGNQYFEYDYK 289
              CLD   G VI   C     N    +D +
Sbjct: 559 LLFCLDVRHGKVIPQNCTKETDNSEQHWDVQ 589


>gi|198426119|ref|XP_002128247.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
           6 [Ciona intestinalis]
          Length = 627

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 92/321 (28%), Positives = 142/321 (44%), Gaps = 78/321 (24%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+P+L+ +A +++ VV P+I  I  DTF +      LT++     G   W+L 
Sbjct: 278 CECAPHWLEPMLERIAEDNTRVVCPVIEVIDADTFAMS-----LTTARSVQTGILSWSLG 332

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           FNW        +  KN  E + + TMAGGLF++ + +F  LG+YD+   +WGGEN+E+S 
Sbjct: 333 FNWAPRKINPGQPIKND-EALTSATMAGGLFAMSRKYFYHLGSYDNDMLVWGGENIEMSL 391

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYD--SGFD--- 173
           +                    +W   M GG   I         F K   Y    G D   
Sbjct: 392 R--------------------IW---MCGGSLEIHPCSHVGHVFRKRAPYSHPGGSDVIT 428

Query: 174 --------IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE------- 211
                   +W  E  E  +K        + GD+T+R +LR +L C++F+WY+        
Sbjct: 429 HNNKRVAEVWLDEYKEQYYKRVPRARAVEAGDLTARIKLRHDLKCRNFQWYITNIYPALY 488

Query: 212 --------VSNDW------SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR 257
                      +W      S  C+DSA     +   + +Y CH  G NQ + ++++GEIR
Sbjct: 489 ATPKEDILKGGEWHNKDRDSKYCLDSANPDGKVGVKMTMYVCHGMGVNQDFDLTRNGEIR 548

Query: 258 RD---EACLDYAGGDVILYPC 275
                E CL  +G  ++ Y C
Sbjct: 549 HSYSKELCLQPSGNSIVTYDC 569


>gi|308506779|ref|XP_003115572.1| CRE-GLY-7 protein [Caenorhabditis remanei]
 gi|308256107|gb|EFP00060.1| CRE-GLY-7 protein [Caenorhabditis remanei]
          Length = 601

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 90/312 (28%), Positives = 139/312 (44%), Gaps = 37/312 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + +N   +  P+I  I  +++E R   G   + +    G F+W L 
Sbjct: 252 CEVNTNWLPPLLAPIKQNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHS---GIFEWGLL 308

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    I ERE    K+ ++P  +PT AGGLF+I++ +F++LG YD G  IWGGE  ELSF
Sbjct: 309 YKETQITERESAHRKHNSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSF 368

Query: 124 KFNWHA------IPERERKRHKNAAEPVWTPTMAGG-LFSIDKAFFEKLGTYDSGFDIWG 176
           K  W        +P         +  P      +G  + SI+      + T+   +  + 
Sbjct: 369 KI-WQCGGGIVFVPCSHVGHVYRSHMPYGFGKFSGKPVISIN--MMRVVKTWMDDYSKYY 425

Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL---------------------EVSND 215
                 +   + GD++++  LR  L CKSFKWY+                     E  N 
Sbjct: 426 LTREPQAAHVNPGDISAQLALRDKLQCKSFKWYMENVAYDVLKSYPLLPPNDVWGEARNP 485

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPC 275
            +G C+D   +   +  P+G   CH  GGNQ   ++  G++ + E CL   G  +    C
Sbjct: 486 ATGKCLD---RMGGIPGPLGASGCHGYGGNQLIRLNVQGQMAQGEWCLTANGIRIQANHC 542

Query: 276 HGSKGNQYFEYD 287
                +  F YD
Sbjct: 543 VKGSVSGNFVYD 554


>gi|328794283|ref|XP_001122865.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like,
           partial [Apis mellifera]
          Length = 372

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 61/122 (50%), Positives = 79/122 (64%), Gaps = 8/122 (6%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A + + VV P+I  I DDTFE   P   +T       GGF+W L 
Sbjct: 258 CECTEGWLEPLLSRIAEDRTTVVCPIIDVISDDTFEY-IPASDMT------WGGFNWKLN 310

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W+ + +RE  +R  +   P+ TPTMAGGLFSIDK +F +LG YD G DIWGGENLE+S
Sbjct: 311 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 370

Query: 123 FK 124
           F+
Sbjct: 371 FR 372



 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 41/73 (56%), Positives = 54/73 (73%), Gaps = 3/73 (4%)

Query: 114 WGGENLELSFKFNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
           WGG N +L+F+  W+ + +RE  +R  +   P+ TPTMAGGLFSIDK +F +LG YD G 
Sbjct: 302 WGGFNWKLNFR--WYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGM 359

Query: 173 DIWGGENLELSFK 185
           DIWGGENLE+SF+
Sbjct: 360 DIWGGENLEMSFR 372


>gi|71996085|ref|NP_001022948.1| Protein GLY-11, isoform a [Caenorhabditis elegans]
 gi|51315905|sp|Q7K755.2|GLT11_CAEEL RecName: Full=Putative polypeptide
           N-acetylgalactosaminyltransferase 11; Short=pp-GaNTase
           11; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 11; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 11
 gi|3980030|emb|CAA22098.1| Protein GLY-11, isoform a [Caenorhabditis elegans]
          Length = 605

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 76/221 (34%), Positives = 109/221 (49%), Gaps = 33/221 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL PLLD + +N   VV P+I  I  D   +++    + +      GG +W + 
Sbjct: 258 CEVNEEWLPPLLDQIKQNRRRVVCPIIDII--DAITMKYVESPVCT------GGVNWAMT 309

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W        +   N   P+ +PTMAGGLF+IDK +F ++G+YD G D+WG EN+E+S 
Sbjct: 310 FKWDYPHRSYFEDPMNYVNPLKSPTMAGGLFAIDKEYFFEIGSYDEGMDVWGAENVEISV 369

Query: 124 KFNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSG----FDIWGGE 178
           +  W               E +  P +  G +F   + +  K  +          +W  E
Sbjct: 370 RI-WTC-----------GGELLIMPCSRVGHIFRRQRPYGIKTDSMGKNSVRLARVWLDE 417

Query: 179 NLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYLE 211
            LE  F+         D+GD+TSR  LRRNL CK FKWYLE
Sbjct: 418 YLENFFEARPNYRTFTDYGDLTSRISLRRNLQCKPFKWYLE 458


>gi|390348396|ref|XP_787966.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like
           [Strongylocentrotus purpuratus]
          Length = 403

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 90/320 (28%), Positives = 138/320 (43%), Gaps = 50/320 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLL  +A N + VV P + +I  D FE R      +       G  DW+  
Sbjct: 44  CECSPNWLVPLLTEIALNRTTVVCPTVDSISADNFEYR------SQGDGLCRGAMDWD-- 95

Query: 64  FNWHAIP---ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 120
           F +  IP    R+R   K  +EP  +P MAGGLF++D+ FF +LG YD G  IWGGEN E
Sbjct: 96  FWYKRIPVDLSRQRLGLKYQSEPYDSPMMAGGLFALDREFFFELGGYDPGLQIWGGENFE 155

Query: 121 LSFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           +SFK      +   +P            P   P       S+    + ++       ++W
Sbjct: 156 ISFKAWMCGGSLKFVPCSRVGHVYRKGVPYTYPDSGVPGVSVIHMNYMRVA------EVW 209

Query: 176 GGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL------------------ 210
             E  E  +          +GD+  +   R++   KSFKW++                  
Sbjct: 210 LDEFKEFFYTSRPDLRGKPYGDIGEQIRFRKHHCPKSFKWFMEEVAFDSLEKFPPPQPNQ 269

Query: 211 ---EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAG 267
              E+ +D +GMC+DS           G+Y CH  GGNQ + ++  G+I  ++ C    G
Sbjct: 270 AWGEIKSDHTGMCVDSMGHQATAGGEAGVYYCHGMGGNQRFRLTGPGQIMFNDYCFYVDG 329

Query: 268 GDVILYPCHGSKGNQYFEYD 287
             V +  C+  +   ++ +D
Sbjct: 330 SRVRIDKCNKVQWPSFWVHD 349


>gi|345781283|ref|XP_853759.2| PREDICTED: LOW QUALITY PROTEIN:
           UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 5 [Canis lupus
           familiaris]
          Length = 559

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 76/227 (33%), Positives = 108/227 (47%), Gaps = 45/227 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQPLL  +A++S  VV PLI  I   T E +  P           G F+W+L 
Sbjct: 244 CEVNTAWLQPLLHAIAKDSKMVVCPLIDVIDSMTLEYQSSP--------VVRGAFNWHLD 295

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++   E    +    P+ +P MAGG+F+I++ +F ++G YD G D+WG ENLELS 
Sbjct: 296 FKWDSVYSYEMDGPEGPTRPIRSPAMAGGIFAINRHYFNEIGQYDKGMDLWGAENLELSL 355

Query: 124 KF-----NWHAIP-----ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS--G 171
           +          IP        ++R  N  E V                  K  TY++   
Sbjct: 356 RIWMCGGQLFIIPCSRVGHISKQRFSNQPELV------------------KAMTYNNLRL 397

Query: 172 FDIWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE 211
             +W  E  E  F          +G+++ R ELR+ LGCKSF+WYL+
Sbjct: 398 VHVWLDEYKEQFFLQQPGLKSVAYGNISERVELRKRLGCKSFQWYLD 444


>gi|393912281|gb|EFO21646.2| glycosyl transferase [Loa loa]
          Length = 470

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 80/247 (32%), Positives = 114/247 (46%), Gaps = 29/247 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  N   VV+P+I  I  DTF+       L        GGF+WNL 
Sbjct: 236 CECNVNWLEPLLARVKENHRTVVAPVIDVIDRDTFKYVAASADLR-------GGFEWNLV 288

Query: 64  FNWHAIPERER-KRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  +  + R +RH     P+ TP +AGGLF I K +FEKLGTYD   DIWGGENLELS
Sbjct: 289 FKWEYLTGKLRDERHARPTAPIRTPVIAGGLFMIQKDWFEKLGTYDEEMDIWGGENLELS 348

Query: 123 FKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           F+      +   IP            P   P  +G +F  +              ++W G
Sbjct: 349 FRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGSGNVFQKNTR---------RAAEVWLG 399

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDM 230
           +   L  +        +FGD+T+R +L++    + F+   E +       ++ A K +D+
Sbjct: 400 DYKHLYLRKVPSARYVNFGDITARLDLKKKFALQGFRLVFERNLSGVDDSLERARKISDI 459

Query: 231 HKPVGLY 237
                LY
Sbjct: 460 QTGKSLY 466


>gi|338724473|ref|XP_001495495.2| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5-like
           [Equus caballus]
          Length = 448

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 77/223 (34%), Positives = 111/223 (49%), Gaps = 37/223 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WL+PLL  +A++   VV PLI  I  D   L++ P  +        G F+W+LQ
Sbjct: 235 CEVNKVWLEPLLLAIAKDPKMVVCPLIDVI--DYMTLKYKPSPVVR------GAFNWHLQ 286

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   E    +    P+ +P MAGG+F+ID+ +F ++G YD   ++WGGENLELS 
Sbjct: 287 FKWDNVFSYEMDGPEGPIAPIRSPAMAGGIFAIDRQYFNEIGRYDKDMNLWGGENLELSL 346

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFE------KLGTYDS--GFDIW 175
           +  W               +    P    G   IDK   E      K  TY++     +W
Sbjct: 347 RI-WMC-----------GGQLFVLPCSRVG--HIDKQRIENKREYLKAMTYNNLRMVHVW 392

Query: 176 GGENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLE 211
             E+ E  F          +G+++ R ELR+ LGCKSF+WYL+
Sbjct: 393 LDEHKEQVFLRRPGLKSVAYGNISERVELRKRLGCKSFQWYLD 435


>gi|426228255|ref|XP_004008229.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5 [Ovis
           aries]
          Length = 448

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 75/223 (33%), Positives = 111/223 (49%), Gaps = 35/223 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WL+PLL+ +A++   VV PLI  I  D   L + P  +        G F+W+L+
Sbjct: 235 CEVNKVWLEPLLNAIAKDPKMVVCPLIDVI--DYMTLEYQPSPIVR------GAFNWHLE 286

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   E +  +    P+ +P MAGG+F+I + +F ++G YD G ++WGGENLELS 
Sbjct: 287 FKWDHVLSYEIEGPEGPTTPIRSPAMAGGIFAISRNYFNEIGQYDKGMNLWGGENLELSL 346

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDS--GFDIWG 176
           +        + IP   R  H N                 + +   K+  Y+S     IW 
Sbjct: 347 RIWMCGGQLYVIP-CSRVGHINRQH------------MTNDSEIMKVVEYNSLRLAHIWL 393

Query: 177 GENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLEV 212
            E  E  F          +G+++ R ELR+ LGCKSF+WYL+ 
Sbjct: 394 DEYKEEFFLRRPALKSAAYGNISERVELRKRLGCKSFQWYLDT 436


>gi|194766810|ref|XP_001965517.1| GF22410 [Drosophila ananassae]
 gi|190619508|gb|EDV35032.1| GF22410 [Drosophila ananassae]
          Length = 591

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 91/307 (29%), Positives = 139/307 (45%), Gaps = 47/307 (15%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + R+ + +  P+I  I    FE R   G  T     F G F+W + 
Sbjct: 238 CEVNLNWLAPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTETH----FRGIFEWGML 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +  + +P RE++R  + +EP  +PT AGGLF+I++ +F +LG YD G  +WGGEN ELSF
Sbjct: 294 YKENEVPRREQRRRSHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSF 353

Query: 124 KFNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K  W      E     R  H       + P   G L S  K     +  Y    + W  +
Sbjct: 354 KI-WQCGGSIEWVPCSRVGHVYRG---FMPYNFGKLASKKKGPLITI-NYKRVIETWFDD 408

Query: 179 NLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL-------------------- 210
             +        L+   D GD++ +  L++ L CKSF+W++                    
Sbjct: 409 THKEYFYTREPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHW 468

Query: 211 -EVSNDWSGMCIDS-ACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGG 268
            E+ +  S  C+DS   +P  +   +GL  CH  G NQ   ++  G++   E C++    
Sbjct: 469 GELRSVASDGCLDSMGLQPPAI---MGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ 525

Query: 269 DVILYPC 275
            + L  C
Sbjct: 526 GIKLAVC 532


>gi|393910679|gb|EFO20658.2| glycosyl transferase [Loa loa]
          Length = 601

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 78/231 (33%), Positives = 103/231 (44%), Gaps = 54/231 (23%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV +RWL+PLLD +  +   VV P+I  I  +T +    P           GG  W+L 
Sbjct: 247 CEVNERWLEPLLDRIVTDRHTVVCPIIDIIDANTLKYIESP--------ICKGGMSWSLA 298

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P       K    PV +PTMAGGLF+IDK +F+KLG YD G +IWG EN+E+S 
Sbjct: 299 FKWDYLPSSYFDEPKQYVRPVKSPTMAGGLFAIDKKYFDKLGQYDRGMEIWGAENVEISL 358

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYDSGFD----- 173
           +                    +W   M GG   I         F +   Y  G D     
Sbjct: 359 R--------------------IW---MCGGRLEIIPCSRIGHIFRQRRPYGFGIDSMGHN 395

Query: 174 ------IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
                 IW  E ++  +         D GD+   K LR+ L CKSF WYL+
Sbjct: 396 AARTANIWLDEYIDQFYAARPNLRGIDIGDIKEMKALRKKLHCKSFFWYLQ 446


>gi|268555252|ref|XP_002635614.1| C. briggsae CBR-GLY-7 protein [Caenorhabditis briggsae]
          Length = 601

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 90/312 (28%), Positives = 139/312 (44%), Gaps = 37/312 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL  + +N   +  P+I  I  +++E R   G   + +    G F+W L 
Sbjct: 252 CEVNTNWLPPLLAPIKQNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHS---GIFEWGLL 308

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    I ERE    K+ ++P  +PT AGGLF+I++ +F++LG YD G  IWGGE  ELSF
Sbjct: 309 YKETQITERESGHRKHTSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSF 368

Query: 124 KFNWHA------IPERERKRHKNAAEPVWTPTMAGG-LFSIDKAFFEKLGTYDSGFDIWG 176
           K  W        +P         +  P      +G  + SI+      + T+   +  + 
Sbjct: 369 KI-WQCGGGIVFVPCSHVGHVYRSHMPYGFGKFSGKPVISIN--MMRVVKTWMDDYSKYY 425

Query: 177 GENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL---------------------EVSND 215
                 +   + GD++++  LR  L CKSFKWY+                     E  N 
Sbjct: 426 LTREPQAAHVNPGDISAQLALRDKLQCKSFKWYMENVAYDVLQSYPLLPPNDVWGEARNP 485

Query: 216 WSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYPC 275
            +G C+D   +   +  P+G   CH  GGNQ   ++  G++ + E CL   G  +    C
Sbjct: 486 ATGKCLD---RMGGIPGPLGASGCHGYGGNQLIRLNVQGQMAQGEWCLTANGIRIQANHC 542

Query: 276 HGSKGNQYFEYD 287
                N  + YD
Sbjct: 543 VKGTVNGNWIYD 554


>gi|312082359|ref|XP_003143412.1| glycosyl transferase [Loa loa]
          Length = 599

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 78/231 (33%), Positives = 103/231 (44%), Gaps = 54/231 (23%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV +RWL+PLLD +  +   VV P+I  I  +T +    P           GG  W+L 
Sbjct: 245 CEVNERWLEPLLDRIVTDRHTVVCPIIDIIDANTLKYIESP--------ICKGGMSWSLA 296

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +P       K    PV +PTMAGGLF+IDK +F+KLG YD G +IWG EN+E+S 
Sbjct: 297 FKWDYLPSSYFDEPKQYVRPVKSPTMAGGLFAIDKKYFDKLGQYDRGMEIWGAENVEISL 356

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYDSGFD----- 173
           +                    +W   M GG   I         F +   Y  G D     
Sbjct: 357 R--------------------IW---MCGGRLEIIPCSRIGHIFRQRRPYGFGIDSMGHN 393

Query: 174 ------IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
                 IW  E ++  +         D GD+   K LR+ L CKSF WYL+
Sbjct: 394 AARTANIWLDEYIDQFYAARPNLRGIDIGDIKEMKALRKKLHCKSFFWYLQ 444


>gi|395507115|ref|XP_003757873.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
           [Sarcophilus harrisii]
          Length = 633

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 76/218 (34%), Positives = 107/218 (49%), Gaps = 24/218 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WL PLL  +  + + VV P+I  I  DTF          SS     GGFDW L 
Sbjct: 311 CEVNKDWLLPLLHRIKEDPTRVVCPVIDIINRDTFAY-------VSSSPDMRGGFDWTLH 363

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +  RE+    +  +P+ TP ++GGLF ++K++F  LG YD+  DIWGGEN E+SF
Sbjct: 364 FKWEELSLREKALRVDPIQPIKTPIISGGLFVMNKSWFNHLGKYDAAMDIWGGENFEISF 423

Query: 124 KF-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
           +      +   +P        RK+H     P   P   G L +  K        +   F 
Sbjct: 424 RVWMCGGSLEILPCSRVGHVFRKKH-----PYTFPE--GNLNTYIKNTKRTAEVWMDEFK 476

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
            +      ++    FG++ +R ELR+ L C +FKWYLE
Sbjct: 477 HYFYAARPVAQGRPFGNIQARVELRKRLKCHTFKWYLE 514


>gi|194669011|ref|XP_001788574.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Bos
           taurus]
          Length = 652

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 96/324 (29%), Positives = 136/324 (41%), Gaps = 55/324 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 289 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 342

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 343 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 400

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K          IP            P   P       ++ +     +  Y         E
Sbjct: 401 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEHIYQRRPE 460

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVSNDW 216
              LS     GDVT++K+LR +L CKSFKW++                      E+ N  
Sbjct: 461 YRHLS----AGDVTAQKKLRSSLNCKSFKWFMTKIAWDLPQFYPPVEPPAAAWGEIRNVG 516

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------CLD 264
           +G+C D+  K   +  P+ L  C +  G   W        +   +IR  +       C D
Sbjct: 517 TGLCADT--KHGALGSPLRLESCIRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKFCFD 574

Query: 265 YAG--GDVILYPCHGSKGNQYFEY 286
                  V LY CH  KGNQ ++Y
Sbjct: 575 AVSHTSPVTLYDCHSMKGNQLWKY 598


>gi|313230315|emb|CBY08019.1| unnamed protein product [Oikopleura dioica]
          Length = 589

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 101/345 (29%), Positives = 146/345 (42%), Gaps = 80/345 (23%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            EV   WL PLL  ++ +   VV P+I  I ++ F+    PG          G FDW L 
Sbjct: 248 VEVSTNWLPPLLHPISLDRKTVVCPMIDIIDNENFQYVTQPGDAMR------GAFDWELY 301

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP    KR K+ +EP  +P MAGGLF+I++ +F ++G YD G +IWGGE  ELSF
Sbjct: 302 YKRIPIPNE--KRPKDPSEPFESPVMAGGLFAIERNYFYEIGLYDEGLEIWGGEQYELSF 359

Query: 124 KFNWHA---IPERERKRHKNAAE---PVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           K  W     I +    R  +      P   P   G  ++           Y    ++W  
Sbjct: 360 KV-WMCGGRILDSPCSRIGHIYRKFVPYTIPNNGGPNYN-----------YKRVAEVWMD 407

Query: 178 ENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWY---------------LEVSND 215
           E  E  +       K D GD++  K LR+ L CKSF WY               L  S  
Sbjct: 408 EYAEFFYRRRPYVRKIDAGDLSKAKALRKELKCKSFDWYIKNVIPDLVQYYPPILPPSAA 467

Query: 216 W-------SGMCID-----------SACKPTD-----------MHKPVGLYPCHKQGGNQ 246
           W       S +CID           S C+  +             K +  + CH Q GNQ
Sbjct: 468 WGRLKHVVSNLCIDPQVKKGSQVVVSQCQTPEGAVRTCLDASYRSKSILTWDCHNQHGNQ 527

Query: 247 FWMMSKHGEIR-RDEACLDYAGGDVILYPCHGSKGNQYFEYDYKY 290
            W   +   I    + C   A G +++ PC  S G++ FE+++++
Sbjct: 528 LWKYFEKQLIHPSSKKCATVASGALLMMPC--SPGDRLFEWEWEH 570


>gi|297477445|ref|XP_002689374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Bos
           taurus]
 gi|296485129|tpg|DAA27244.1| TPA: polypeptide N-acetylgalactosaminyltransferase 10-like [Bos
           taurus]
          Length = 620

 Score =  118 bits (295), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 96/324 (29%), Positives = 136/324 (41%), Gaps = 55/324 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 257 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 310

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 311 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 368

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K          IP            P   P       ++ +     +  Y         E
Sbjct: 369 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEHIYQRRPE 428

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVSNDW 216
              LS     GDVT++K+LR +L CKSFKW++                      E+ N  
Sbjct: 429 YRHLS----AGDVTAQKKLRSSLNCKSFKWFMTKIAWDLPQFYPPVEPPAAAWGEIRNVG 484

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------CLD 264
           +G+C D+  K   +  P+ L  C +  G   W        +   +IR  +       C D
Sbjct: 485 TGLCADT--KHGALGSPLRLESCIRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKFCFD 542

Query: 265 YAG--GDVILYPCHGSKGNQYFEY 286
                  V LY CH  KGNQ ++Y
Sbjct: 543 AVSHTSPVTLYDCHSMKGNQLWKY 566


>gi|15207811|dbj|BAB62930.1| hypothetical protein [Macaca fascicularis]
          Length = 373

 Score =  118 bits (295), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 73/220 (33%), Positives = 104/220 (47%), Gaps = 31/220 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL+PLL  +A++   VV PLI  I D T E +  P           G FDWNLQ
Sbjct: 160 CEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLEYKPSP--------VVRGAFDWNLQ 211

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   E    +   +P+ +P M+GG+F+I + +F ++G YD   D WGGENLELS 
Sbjct: 212 FKWDNVFSYEMDGPEGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGGENLELSL 271

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +          IP   R  H +  +   T  +              +  Y     +W  E
Sbjct: 272 RIWMCGGQLFIIP-CSRVGHISKKQTRKTSAIISA----------TIHNYLRLVHVWLDE 320

Query: 179 NLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
             E  F          +G++  R +LR+ LGCKSF+WYL+
Sbjct: 321 YKEQFFLRKPGLKYVTYGNIHERVQLRKRLGCKSFQWYLD 360


>gi|195115752|ref|XP_002002420.1| GI12891 [Drosophila mojavensis]
 gi|193912995|gb|EDW11862.1| GI12891 [Drosophila mojavensis]
          Length = 622

 Score =  118 bits (295), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 94/335 (28%), Positives = 144/335 (42%), Gaps = 69/335 (20%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           EV ++WL+PLL ++   +S +  P+I  I  DTFE  + P  L        GGF+W L F
Sbjct: 242 EVNRQWLEPLLRLVKAENSTLAVPVIDLINADTFE--YTPSPLVR------GGFNWGLHF 293

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
            W  +PE   K  ++   P  +PTMAGGLF++++ +F+ +G YD   DIWGGEN+E+SF+
Sbjct: 294 RWENLPEGTLKVPEDFKGPFRSPTMAGGLFAVNRLYFQHIGEYDMAMDIWGGENIEISFR 353

Query: 125 F-----NWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDI 174
                 +   +P        RKR    A P    TM      +   + +K   +    + 
Sbjct: 354 VWQCGGSIKIVPCSRVGHIFRKRRPYTA-PDGANTMLKNSLRLAHVWMDKYKEFYLKHE- 411

Query: 175 WGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE--------------------VSN 214
                 +++   D+GD+++R +LR  L CK F WYL+                    V  
Sbjct: 412 ------KVAKDYDYGDISARLQLRERLHCKDFGWYLKHVYPELRLPGDESKKSGAAPVFQ 465

Query: 215 DWSG------------MCIDSACKPTDMHKPVG---------LYPCHKQGGNQFWMMSKH 253
            W              +     C      K  G         L  C  +  NQ W  ++ 
Sbjct: 466 PWHSRKRNYLDSFQLRLAGTQLCAAVVAPKVKGFWKKGSSLTLQICKPRAPNQMWYETEK 525

Query: 254 GEIRRDEA-CLDYAGGD-VILYPCHGSKGNQYFEY 286
            EI  D+  CL+ A    VI+  CH   G+Q + +
Sbjct: 526 SEIILDKLFCLEAAADTLVIINKCHEMLGDQQWRH 560


>gi|347971791|ref|XP_003436799.1| AGAP004375-PB [Anopheles gambiae str. PEST]
 gi|333469031|gb|EGK97157.1| AGAP004375-PB [Anopheles gambiae str. PEST]
          Length = 585

 Score =  118 bits (295), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 73/219 (33%), Positives = 112/219 (51%), Gaps = 24/219 (10%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL+PL D LA + + ++SP+I  I   TFE R    RL        GGFDW+L 
Sbjct: 218 CEVNRGWLEPLHDRLAIDPTAILSPVIDIIDPHTFEYRANSARLR-------GGFDWSLH 270

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  I E E   R  + + P ++P ++GG+F + K+ F++LG +D G DIWGGE+LE+S
Sbjct: 271 FRWLPIAEEEFEHRRHDESLPFYSPAISGGIFIVAKSLFQQLGGFDPGMDIWGGESLEMS 330

Query: 123 FK-----FNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
            K      +   +P        R++H  + +P       G   +  +        +   F
Sbjct: 331 LKAWMCGAHVEVVPCSRIGHVFRRKHPFSFQP------DGSHLTYLRNTKRVALVWMDEF 384

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
             +  E    +   D G +T ++ELRR+L C+ F WYL+
Sbjct: 385 KNFFYETRPEAVAVDAGSITEQQELRRSLNCRKFSWYLQ 423


>gi|395838452|ref|XP_003792129.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5
           [Otolemur garnettii]
          Length = 869

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 76/224 (33%), Positives = 106/224 (47%), Gaps = 37/224 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WL+PLL  +A++   VV PLI  I + T E R  P           G FDW L+
Sbjct: 416 CEVNKGWLEPLLYSIAKDHKMVVCPLIDVIDETTLEYRASP--------VVRGAFDWELK 467

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   E        +P+ +P MAGG+F+I + +F ++G YD G D+WGGENLELS 
Sbjct: 468 FKWDNVFSYEMDGPDRPIKPIRSPAMAGGIFAIYRHYFNEIGQYDKGMDLWGGENLELSL 527

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD--------IW 175
           +  W               +    P    G   I K  F+++      F         +W
Sbjct: 528 RI-WMC-----------GGQLFIIPCSRVG--HITKKQFKEVSAITRAFTRNSLRMVHVW 573

Query: 176 GGENLELSF-------KGDFGDVTSRKELRRNLGCKSFKWYLEV 212
             E  E  F          +G+++ R ELR+ LGCKSF+WYL+ 
Sbjct: 574 LDEYKEQFFLRKPGLRSIAYGNISERVELRKRLGCKSFQWYLDT 617


>gi|16769916|gb|AAL29177.1| SD10722p [Drosophila melanogaster]
          Length = 666

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 102/345 (29%), Positives = 139/345 (40%), Gaps = 91/345 (26%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E    WL PLL+ +A N    V P I  I    F  R       +  +   G FDW  +
Sbjct: 298 VEANYNWLPPLLEPIALNKRTAVCPFIDVIDHTNFHYR-------AQDEGARGAFDW--E 348

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F +  +P       K+ A+P  +P MAGGLF+I + FF +LG YD G DIWGGE  ELSF
Sbjct: 349 FFYKRLPLLPEDL-KHPADPFKSPIMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSF 407

Query: 124 KF-----------------------NWHAIPERERKRHKN--AAEPVWTPTMAGGLFSID 158
           K                        N    P +    HKN      VW       L+S  
Sbjct: 408 KIWMCGGEMYDAPCSRIGHIYRGPRNHQPSPRKGDYLHKNYKRVAEVWMDEYKNYLYSHG 467

Query: 159 KAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-EVSNDW- 216
              +E +                     D GD+T +K +R  L CKSFKW++ EV+ D  
Sbjct: 468 DGLYESV---------------------DPGDLTEQKAIRTKLNCKSFKWFMKEVAFDLM 506

Query: 217 ---------------------SGMCIDSACKPTDMHKPVGLYPC----HKQGGNQFWMMS 251
                                  +C+D+  +    H  +G+Y C          QFW +S
Sbjct: 507 KTYPPVDPPSYAMGALQNVGNQNLCLDTLGR--KKHNKMGMYACADNIKTPQRTQFWELS 564

Query: 252 --KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEYDYKY 290
             +   +RR + CLD     A   V L+ CH   GNQY+ YDY++
Sbjct: 565 WKRDLRLRRKKECLDVQIWDANAPVWLWDCHSQGGNQYWYYDYRH 609


>gi|350400167|ref|XP_003485756.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Bombus
           impatiens]
          Length = 582

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 94/325 (28%), Positives = 139/325 (42%), Gaps = 55/325 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELR--FPPGRLTSSYKFFIGGFDWN 61
           CEV   WL PLL  +A + + +  P+I  I   TFE R  +  G L      + G F+W 
Sbjct: 229 CEVNVNWLPPLLAPIAVDRTVMTVPIIDGIDHKTFEYRPVYQEGHL------YRGIFEWG 282

Query: 62  LQFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLEL 121
           + +  + +P RE+K     + P  +PT AGGLF+I++ +F  LG YD G  +WGGEN EL
Sbjct: 283 MLYKENELPAREKKSRPYNSMPYKSPTHAGGLFAINREYFLSLGGYDDGLLVWGGENFEL 342

Query: 122 SFKF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 176
           SFK      N   +P      H       + P   G L    K     +  Y    + W 
Sbjct: 343 SFKIWQCGGNILWVP----CSHVGHVYRGFMPYTFGKLAQKKKGPLITI-NYKRVVETWF 397

Query: 177 GENLE--------LSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------ 210
            +  +        L+   D GD++ + E +R   CKSF+WY+                  
Sbjct: 398 DDKYKEFFYTREPLAQLLDHGDISEQLEFKRRKRCKSFQWYMENVAYDVFDKFPELPPNI 457

Query: 211 ---EVSNDWSGMCIDSACKPTDMHKPVGLYP---CHKQGGNQFWMMSKHGEIRRDEACLD 264
              E+ N  +GMC+D+       H P  L     CH  G NQ   ++  G++   E C+ 
Sbjct: 458 HWGELRNIATGMCLDTMS-----HSPPSLMATTDCHGFGNNQLIRLNAKGQLGVGERCIS 512

Query: 265 YAGGDVILYPCHGSKGNQYFEYDYK 289
             G  V    C     +  ++YD K
Sbjct: 513 ADGQGVKFVFCRLGTVDGPWQYDEK 537


>gi|193683588|ref|XP_001951150.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Acyrthosiphon
           pisum]
          Length = 588

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 94/301 (31%), Positives = 137/301 (45%), Gaps = 35/301 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PL+  +AR+   +  P+I  I  +T+E R     +      F G F+W + 
Sbjct: 233 CEVGYNWLPPLIAPIARDRKIMTVPVIDGIDHNTWEYR----PVYEKDHLFRGIFEWGML 288

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP +E ++    +EP  +PT AGGLF+ID+ +F +LG YD G  +WGGEN ELSF
Sbjct: 289 YKEIEIPAQEERKRIYKSEPYKSPTHAGGLFAIDRNYFLELGAYDPGLLVWGGENFELSF 348

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTY------DSGFDIWGG 177
           K  W      E          V+   M      + K     L TY      ++ FD    
Sbjct: 349 KI-WQCGGSIEWVPCSRVGH-VYRGFMPYNFGELGKKVKGPLITYNYKRVIETWFDNKHK 406

Query: 178 E----NLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSND-------------WSGM 219
           E       L+   D GD++ + EL+  L CK F W++E V+ D             W  +
Sbjct: 407 EFFYTREPLARYLDMGDISKQLELKDKLQCKDFSWFMENVAYDVYTKFPELPPNLYWGEL 466

Query: 220 --CIDSACKPTDMHKP---VGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGGDVILYP 274
                + C  T  H+P   VGL  CH QG NQ + ++  G++   E C+     +V L  
Sbjct: 467 RNIGKTTCLDTRGHQPPSLVGLELCHGQGNNQLFRLNTKGQLSVGERCIFADRQNVKLVV 526

Query: 275 C 275
           C
Sbjct: 527 C 527


>gi|402865469|ref|XP_003896945.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5 [Papio
           anubis]
          Length = 475

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 73/220 (33%), Positives = 104/220 (47%), Gaps = 31/220 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL+PLL  +A++   VV PLI  I D T E +  P           G FDWNLQ
Sbjct: 262 CEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLEYKPSP--------VVRGAFDWNLQ 313

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   E    +   +P+ +P M+GG+F+I + +F ++G YD   D WGGENLELS 
Sbjct: 314 FKWDNVFSYEMDGPEGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGGENLELSL 373

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +          IP   R  H +  +   T  +              +  Y     +W  E
Sbjct: 374 RIWMCGGQLFIIP-CSRVGHISKKQTRKTSAIISA----------TIHNYLRLVHVWLDE 422

Query: 179 NLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE 211
             E  F          +G++  R +LR+ LGCKSF+WYL+
Sbjct: 423 YKEQFFLRKPGLKYVTYGNIHERVQLRKRLGCKSFQWYLD 462


>gi|51316066|sp|Q95JX4.2|GLTL5_MACFA RecName: Full=Putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5;
           AltName: Full=Polypeptide GalNAc transferase 15;
           Short=GalNAc-T15; Short=pp-GaNTase 15; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 15;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 15
 gi|15207881|dbj|BAB62965.1| hypothetical protein [Macaca fascicularis]
          Length = 443

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 73/220 (33%), Positives = 104/220 (47%), Gaps = 31/220 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL+PLL  +A++   VV PLI  I D T E +  P           G FDWNLQ
Sbjct: 230 CEVNRVWLEPLLHAIAKDPKMVVRPLIDVIDDRTLEYKPSP--------VVRGAFDWNLQ 281

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   E    +   +P+ +P M+GG+F+I + +F ++G YD   D WGGENLELS 
Sbjct: 282 FKWDNVFSYEMDGPEGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGGENLELSL 341

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +          IP   R  H +  +   T  +              +  Y     +W  E
Sbjct: 342 RIWMCGGQLFIIP-CSRVGHISKKQTRKTSAIISA----------TIHNYLRLVHVWLDE 390

Query: 179 NLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE 211
             E  F          +G++  R +LR+ LGCKSF+WYL+
Sbjct: 391 YKEQFFLRKPGLKYVTYGNIHERVQLRKRLGCKSFQWYLD 430


>gi|195167889|ref|XP_002024765.1| GL22638 [Drosophila persimilis]
 gi|194108170|gb|EDW30213.1| GL22638 [Drosophila persimilis]
          Length = 676

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 104/330 (31%), Positives = 143/330 (43%), Gaps = 65/330 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E    WL PLL+ +A+N    V P I  I   TF  R       +  +   G FDW  +
Sbjct: 307 VEANYNWLPPLLEPIAKNKRTAVCPFIDVIDHATFNYR-------AQDEGARGAFDW--E 357

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F +  +P  +    K  A+P  +P MAGGLF+I + FF +LG YD G DIWGGE  ELSF
Sbjct: 358 FYYKRLPLLDEDL-KYPADPFKSPVMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSF 416

Query: 124 KF------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFE------KLGTYDSG 171
           K        + A   R    ++     V +P     L    K   E      K   YD  
Sbjct: 417 KIWMCGGEMYDAPCSRIGHIYRGPRNHVPSPRKGDYLHRNYKRVAEVWMDEYKNYLYDHA 476

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-EVSNDWSG------------ 218
             I+         + D GD+T +  +R+ L CKSFKW++ EV+ D               
Sbjct: 477 DGIYD--------RIDAGDLTEQMAIRKKLKCKSFKWFMEEVAFDLINSYPPVDPPTFAL 528

Query: 219 ----------MCIDSACKPTDMHKPVGLYPCHKQ----GGNQFWMMS--KHGEIRRDEAC 262
                     +CID+  +    HK +G+Y C +        QFW +S  +   +RR + C
Sbjct: 529 GAIQNVGDKRLCIDTMGRRK--HKRMGVYACAEDLKVPQKTQFWELSWKRDLRLRRKKEC 586

Query: 263 LDY----AGGDVILYPCHGSKGNQYFEYDY 288
           LD         V L+ CH   GNQY+ YDY
Sbjct: 587 LDVQIWTVNAPVWLWDCHLQGGNQYWSYDY 616


>gi|347971789|ref|XP_001237517.3| AGAP004375-PA [Anopheles gambiae str. PEST]
 gi|333469030|gb|EAU76847.3| AGAP004375-PA [Anopheles gambiae str. PEST]
          Length = 575

 Score =  117 bits (294), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 73/219 (33%), Positives = 112/219 (51%), Gaps = 24/219 (10%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL+PL D LA + + ++SP+I  I   TFE R    RL        GGFDW+L 
Sbjct: 208 CEVNRGWLEPLHDRLAIDPTAILSPVIDIIDPHTFEYRANSARLR-------GGFDWSLH 260

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  I E E   R  + + P ++P ++GG+F + K+ F++LG +D G DIWGGE+LE+S
Sbjct: 261 FRWLPIAEEEFEHRRHDESLPFYSPAISGGIFIVAKSLFQQLGGFDPGMDIWGGESLEMS 320

Query: 123 FK-----FNWHAIPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF 172
            K      +   +P        R++H  + +P       G   +  +        +   F
Sbjct: 321 LKAWMCGAHVEVVPCSRIGHVFRRKHPFSFQP------DGSHLTYLRNTKRVALVWMDEF 374

Query: 173 DIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
             +  E    +   D G +T ++ELRR+L C+ F WYL+
Sbjct: 375 KNFFYETRPEAVAVDAGSITEQQELRRSLNCRKFSWYLQ 413


>gi|125977364|ref|XP_001352715.1| GA15243 [Drosophila pseudoobscura pseudoobscura]
 gi|54641464|gb|EAL30214.1| GA15243 [Drosophila pseudoobscura pseudoobscura]
          Length = 676

 Score =  117 bits (294), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 104/330 (31%), Positives = 143/330 (43%), Gaps = 65/330 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E    WL PLL+ +A+N    V P I  I   TF  R       +  +   G FDW  +
Sbjct: 307 VEANYNWLPPLLEPIAKNKRTAVCPFIDVIDHATFNYR-------AQDEGARGAFDW--E 357

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F +  +P  +    K  A+P  +P MAGGLF+I + FF +LG YD G DIWGGE  ELSF
Sbjct: 358 FYYKRLPLLDEDL-KYPADPFKSPVMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSF 416

Query: 124 KF------NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFE------KLGTYDSG 171
           K        + A   R    ++     V +P     L    K   E      K   YD  
Sbjct: 417 KIWMCGGEMYDAPCSRIGHIYRGPRNHVPSPRKGDYLHRNYKRVAEVWMDEYKNYLYDHA 476

Query: 172 FDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-EVSNDWSG------------ 218
             I+         + D GD+T +  +R+ L CKSFKW++ EV+ D               
Sbjct: 477 DGIYD--------RIDAGDLTEQMAIRKKLKCKSFKWFMEEVAFDLINSYPPVDPPTFAL 528

Query: 219 ----------MCIDSACKPTDMHKPVGLYPCHKQ----GGNQFWMMS--KHGEIRRDEAC 262
                     +CID+  +    HK +G+Y C +        QFW +S  +   +RR + C
Sbjct: 529 GAIQNVGDKRLCIDTMGRRK--HKRMGVYACAEDLKVPQKTQFWELSWKRDLRLRRKKEC 586

Query: 263 LDY----AGGDVILYPCHGSKGNQYFEYDY 288
           LD         V L+ CH   GNQY+ YDY
Sbjct: 587 LDVQIWTVNAPVWLWDCHLQGGNQYWSYDY 616


>gi|403285674|ref|XP_003934138.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Saimiri boliviensis boliviensis]
          Length = 682

 Score =  117 bits (294), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 98/327 (29%), Positives = 141/327 (43%), Gaps = 61/327 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 319 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 372

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 373 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 430

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   P   ++A  L  + + + ++   Y       
Sbjct: 431 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 487

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDVT++K+LR +L CKSFKW++                      E+ 
Sbjct: 488 RPEYRHLS----AGDVTAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 543

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
           N  +G+C D+  K   +  P+ L  C +  G   W        +   +IR  +       
Sbjct: 544 NVGTGLCADT--KHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKF 601

Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
           C D       V LY CH  KGNQ ++Y
Sbjct: 602 CFDAISHTSPVTLYDCHSMKGNQLWKY 628


>gi|15207947|dbj|BAB62998.1| hypothetical protein [Macaca fascicularis]
          Length = 443

 Score =  117 bits (294), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 73/220 (33%), Positives = 104/220 (47%), Gaps = 31/220 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL+PLL  +A++   VV PLI  I D T E +  P           G FDWNLQ
Sbjct: 230 CEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLEYKPSP--------VVRGAFDWNLQ 281

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   E    +   +P+ +P M+GG+F+I + +F ++G YD   D WGGENLELS 
Sbjct: 282 FKWDNVFSYEMDGPEGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGGENLELSL 341

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +          IP   R  H +  +   T  +              +  Y     +W  E
Sbjct: 342 RIWMCGGQLFIIP-CSRVGHISKKQTRKTSAIISA----------TIHNYLRLVHVWLDE 390

Query: 179 NLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE 211
             E  F          +G++  R +LR+ LGCKSF+WYL+
Sbjct: 391 YKEQFFLRKPGLKYVTYGNIHERVQLRKRLGCKSFQWYLD 430


>gi|148237032|ref|NP_001084848.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Xenopus
           laevis]
 gi|47124654|gb|AAH70527.1| MGC78803 protein [Xenopus laevis]
          Length = 653

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 92/326 (28%), Positives = 149/326 (45%), Gaps = 60/326 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   W  PL+  +A++ +    PLI  I  +T+EL   P        F  G +DW++ 
Sbjct: 300 CEVGINWYAPLIAPIAKDRTTCTVPLIDVIEGNTYELI--PQAGGDEDGFARGAWDWSML 357

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    +  +E+++ K   EP  +P MAGGLF+I++ +F +LG YD G  IWGGEN E+S+
Sbjct: 358 WKRVPLTSKEKEQRKTKTEPYRSPAMAGGLFAIEREYFFELGLYDPGLQIWGGENFEISY 417

Query: 124 KFNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSI----------DKAFFEKLGTYDSGF 172
           K  W               + ++TP +  G ++ +                 L  Y    
Sbjct: 418 KI-WQC-----------GGKLLFTPCSRVGHIYRLHGWQGNPTPAHVGSSPTLKNYVRVV 465

Query: 173 DIWGGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE-----VSN------ 214
           ++W  E  +  +          +GD+++ K+ R +  CKSFKW++E     + N      
Sbjct: 466 EVWWDEYRDYFYASRPETKALAYGDISALKKFREDHNCKSFKWFMEEIAYDIPNYYPLPP 525

Query: 215 ---DW-------SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLD 264
              DW       SG CIDS          +G   CH+ GGNQ + +++  ++ + + CL 
Sbjct: 526 RNVDWGEIRGFESGYCIDSMGHTNGGLAELG--GCHRMGGNQLFRINEANQLMQYDQCLT 583

Query: 265 YA--GGDVILYPCHGSKGNQYFEYDY 288
               G  VIL  C+    N+Y E+ Y
Sbjct: 584 KGTDGSKVILTHCN---LNEYKEWQY 606


>gi|296193322|ref|XP_002744461.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Callithrix jacchus]
          Length = 667

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 98/327 (29%), Positives = 141/327 (43%), Gaps = 61/327 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 304 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 357

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 358 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 415

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   P   ++A  L  + + + ++   Y       
Sbjct: 416 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 472

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDVT++K+LR +L CKSFKW++                      E+ 
Sbjct: 473 RPEYRHLS----AGDVTAQKKLRSSLNCKSFKWFMMKIAWDLPKFYPPVEPPAAAWGEIR 528

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
           N  +G+C D+  K   +  P+ L  C +  G   W        +   +IR  +       
Sbjct: 529 NVGTGLCADT--KHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKF 586

Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
           C D       V LY CH  KGNQ ++Y
Sbjct: 587 CFDAISHTSPVTLYDCHSMKGNQLWKY 613


>gi|405950576|gb|EKC18555.1| Putative polypeptide N-acetylgalactosaminyltransferase 10
           [Crassostrea gigas]
          Length = 526

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 104/329 (31%), Positives = 140/329 (42%), Gaps = 69/329 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLL+ +A +   VV P I  I  + F  R       +  +   G FDW  +
Sbjct: 167 CEANINWLPPLLEPIAEDYKTVVCPFIDVIDFENFAYR-------AQDEGARGAFDW--E 217

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F +  +P  E    K+ AEP  +P MAGGLF+I   +F ++G YD G DIWGGE  ELSF
Sbjct: 218 FFYKRLPLLEEDL-KHPAEPFKSPVMAGGLFAISAKWFWEMGGYDPGLDIWGGEQYELSF 276

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGT-------YDSGFDIWG 176
           K  W                 V  P    G      A F   G        Y    ++W 
Sbjct: 277 KL-WQC-----------GGMMVDAPCSRIGHIYRKFAPFPNPGVGDFVGRNYRRVAEVWM 324

Query: 177 GENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL------------------- 210
            E  E  +K        D GDV+ +K +R  L CK FKW++                   
Sbjct: 325 DEYAEYLYKRRPHYRNIDPGDVSEQKAIRDKLHCKPFKWFMEEVAFDLPKFYPPVEPPPF 384

Query: 211 ---EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQ---GGNQFWMMSKHGEIR--RDEAC 262
              EV N  + MC+D+  K    ++   L PC K    GG Q +  + H +IR  +   C
Sbjct: 385 ASGEVRNKAANMCLDTRYK--GQNERFDLQPCLKDGKGGGEQQFEFTWHKDIRPGKRTVC 442

Query: 263 LDYA----GGDVILYPCHGSKGNQYFEYD 287
            D +       VIL+ CHG  GNQ F+Y+
Sbjct: 443 FDVSQSIKKAPVILFNCHGMGGNQRFKYN 471


>gi|198473174|ref|XP_001356196.2| GA20382 [Drosophila pseudoobscura pseudoobscura]
 gi|198139336|gb|EAL33256.2| GA20382 [Drosophila pseudoobscura pseudoobscura]
          Length = 617

 Score =  117 bits (293), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 95/336 (28%), Positives = 147/336 (43%), Gaps = 72/336 (21%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           EV ++WL+PLL ++   ++ +  P+I  I  DTFE  + P  L        GGF+W L F
Sbjct: 241 EVNRQWLEPLLRLIKAENASLAVPVIDLINADTFE--YTPSPLVR------GGFNWGLHF 292

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
            W  +PE   K  ++   P  +PTMAGGLF++++ +F+ +G YD   DIWGGEN+E+SF+
Sbjct: 293 RWENLPEGTLKVPEDFRGPFRSPTMAGGLFAVNRLYFQDIGEYDMAMDIWGGENIEISFR 352

Query: 125 FNWHA------IPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
             W        +P        RKR    A P    TM      +   + +K   +    +
Sbjct: 353 V-WQCGGAIKIVPCSRVGHIFRKRRPYTA-PDGANTMLKNSLRLAYVWMDKYKDFYLKHE 410

Query: 174 IWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------EVSNDWSG--- 218
                  +++   D+GD++ R +LR  L C+ F+WYL            E     SG   
Sbjct: 411 -------KVAKDYDYGDISDRLQLRERLQCRDFEWYLRNVYPELHIPGEEPKKSASGPVF 463

Query: 219 -----------------MCIDSACKPTDMHKPVG---------LYPCHKQGGNQFWMMSK 252
                            +     C      K  G         L PC ++  NQ W  ++
Sbjct: 464 QPWHSRKRNYIDFYMLRLAGTELCASVMAPKVKGFWKKGSSLQLQPC-RRTPNQLWYETE 522

Query: 253 HGEIRRDE-ACLDYAG-GDVILYPCHGSKGNQYFEY 286
             EI  D+  CL+ +G   VI+  CH   G+Q + +
Sbjct: 523 KSEIILDKLLCLEASGDSQVIINKCHEMLGDQQWRH 558


>gi|194865210|ref|XP_001971316.1| GG14889 [Drosophila erecta]
 gi|190653099|gb|EDV50342.1| GG14889 [Drosophila erecta]
          Length = 666

 Score =  117 bits (293), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 103/343 (30%), Positives = 139/343 (40%), Gaps = 91/343 (26%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E    WL PLL+ +A N    V P I  I    F  R       +  +   G FDW  +
Sbjct: 298 VEANYNWLPPLLEPIALNKRTAVCPFIDVIDHSNFNYR-------AQDEGARGAFDW--E 348

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F +  +P   +   K+ A+P  +P MAGGLF+I + FF +LG YD G DIWGGE  ELSF
Sbjct: 349 FFYKRLPLL-KDDLKHPADPFKSPIMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSF 407

Query: 124 KF-----------------------NWHAIPERERKRHKN--AAEPVWTPTMAGGLFSID 158
           K                        N    P R    H+N      VW       L+S  
Sbjct: 408 KIWMCGGEMYDAPCSRIGHIYRGPRNHQPSPRRGDYLHRNYKRVAEVWMDEYKNYLYSHG 467

Query: 159 KAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW- 216
              +E +                     D GD+T +K +R  L CKSFKW++E V+ D  
Sbjct: 468 DGVYESV---------------------DPGDLTEQKAIRTKLKCKSFKWFMEAVAFDLM 506

Query: 217 ---------------------SGMCIDSACKPTDMHKPVGLYPCHKQ----GGNQFWMMS 251
                                  +C+D+  K    H  +G+Y C         +QFW +S
Sbjct: 507 KTYPPVDPPAYAMGALQNVGNQNLCLDTMGKKK--HNRMGMYSCASDIKVPQRSQFWELS 564

Query: 252 --KHGEIRRDEACLDY----AGGDVILYPCHGSKGNQYFEYDY 288
             +   +RR + CLD     A   V L+ CH   GNQY+ YDY
Sbjct: 565 WKRDLRLRRKKECLDVQIWDANAPVWLWDCHSQGGNQYWYYDY 607


>gi|307186144|gb|EFN71869.1| N-acetylgalactosaminyltransferase 6 [Camponotus floridanus]
          Length = 602

 Score =  117 bits (293), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 106/329 (32%), Positives = 146/329 (44%), Gaps = 69/329 (20%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           E    WL PLL+ +A+N    V P I  I  +TFE R       +  +   G FDW L +
Sbjct: 238 EANVNWLPPLLEPIAQNYKTCVCPFIDVIAYETFEYR-------AQDEGARGAFDWELYY 290

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
               +   + KR    AEP  +P MAGGLF+I   FF +LG YD G DIWGGE  ELSFK
Sbjct: 291 KRLPLLPEDLKR---PAEPFKSPIMAGGLFAISAKFFWELGGYDPGLDIWGGEQYELSFK 347

Query: 125 FNWHAIPER-----ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLG-TYDSGFDIWGGE 178
             W    +       R  H     P + P    G F         LG  Y    ++W  E
Sbjct: 348 I-WQCGGQMYDAPCSRVGHIYRKFPPF-PNPGRGDF---------LGKNYKRVAEVWMDE 396

Query: 179 NLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE-VSNDW-------------- 216
             E  +K        D GD++ +K LR  L CK F W++E ++ D               
Sbjct: 397 YAEYIYKRRPHLRALDPGDLSEQKALRVKLHCKPFNWFIENIAFDLVEVYPPIEPDDFAY 456

Query: 217 --------SGMCIDSACKPTDMHKPVGLYPCHKQ----GGNQFWMMSKHGEIRRDEA--C 262
                   + +C+DS  +  D  + + +  C K      G Q + ++ H +IR  +   C
Sbjct: 457 GEIRNMGATELCLDSKKRKRD--ELIVVDTCVKDDPKVSGEQEFRLTWHKDIRPKDRTDC 514

Query: 263 LDYAGGD----VILYPCHGSKGNQYFEYD 287
           LD + G+    V LYPCHG +GNQ + YD
Sbjct: 515 LDVSRGEEKAPVSLYPCHGKQGNQLWRYD 543


>gi|74186700|dbj|BAE34806.1| unnamed protein product [Mus musculus]
          Length = 603

 Score =  117 bits (293), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 240 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 294 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 351

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   P   ++A  L  + + + ++   Y       
Sbjct: 352 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 408

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDV ++K+LR +L CKSFKW++                      E+ 
Sbjct: 409 RPEYRHLS----AGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 464

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIRRDEA------ 261
           N  +G+C D+  K   +  P+ L  C +  G   W   +        +IR  +       
Sbjct: 465 NVGTGLCTDT--KLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPGDPQHTKKF 522

Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
           C D       V LY CH  KGNQ ++Y
Sbjct: 523 CFDAVSHTSPVTLYDCHSMKGNQLWKY 549


>gi|148675838|gb|EDL07785.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 [Mus musculus]
          Length = 603

 Score =  117 bits (293), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 240 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 294 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 351

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   P   ++A  L  + + + ++   Y       
Sbjct: 352 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 408

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDV ++K+LR +L CKSFKW++                      E+ 
Sbjct: 409 RPEYRHLS----AGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 464

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIRRDEA------ 261
           N  +G+C D+  K   +  P+ L  C +  G   W   +        +IR  +       
Sbjct: 465 NVGTGLCTDT--KLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPGDPQHTKKF 522

Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
           C D       V LY CH  KGNQ ++Y
Sbjct: 523 CFDAVSHTSPVTLYDCHSMKGNQLWKY 549


>gi|268576230|ref|XP_002643095.1| C. briggsae CBR-GLY-11 protein [Caenorhabditis briggsae]
          Length = 619

 Score =  117 bits (293), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 75/221 (33%), Positives = 110/221 (49%), Gaps = 33/221 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL PLLD + +N   VV P+I  I  D   +++    + +      GG +W + 
Sbjct: 272 CEVNEDWLPPLLDQIKQNRRRVVCPIIDII--DAITMKYVESPVCT------GGVNWAMT 323

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W        +   N   P+ +PTMAGGLF+ID+ +F ++G+YD G D+WG EN+E+SF
Sbjct: 324 FKWDYPHRSYFEDPMNYLNPLKSPTMAGGLFAIDRDYFFEIGSYDEGMDVWGAENVEISF 383

Query: 124 KFNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSG----FDIWGGE 178
           +  W               E +  P +  G +F   + +  K  +          +W  E
Sbjct: 384 RI-WTC-----------GGELLIMPCSRVGHIFRRQRPYGIKTDSMGKNSVRLARVWLDE 431

Query: 179 NLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYLE 211
            LE  F+         D+GD+TSR  LR+NL CK FKWYLE
Sbjct: 432 YLENFFEARPTYRTFTDYGDLTSRINLRQNLQCKPFKWYLE 472


>gi|26329191|dbj|BAC28334.1| unnamed protein product [Mus musculus]
          Length = 528

 Score =  117 bits (293), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 165 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 218

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 219 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 276

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   P   ++A  L  + + + ++   Y       
Sbjct: 277 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 333

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDV ++K+LR +L CKSFKW++                      E+ 
Sbjct: 334 RPEYRHLS----AGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 389

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIRRDEA------ 261
           N  +G+C D+  K   +  P+ L  C +  G   W   +        +IR  +       
Sbjct: 390 NVGTGLCTDT--KLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPGDPQHTKKF 447

Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
           C D       V LY CH  KGNQ ++Y
Sbjct: 448 CFDAVSHTSPVTLYDCHSMKGNQLWKY 474


>gi|195033813|ref|XP_001988768.1| GH11345 [Drosophila grimshawi]
 gi|193904768|gb|EDW03635.1| GH11345 [Drosophila grimshawi]
          Length = 620

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 97/337 (28%), Positives = 147/337 (43%), Gaps = 73/337 (21%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           EV + W++PLL ++   ++ +  P+I  I  DTFE  + P  L        GGF+W L F
Sbjct: 241 EVNREWVEPLLRLVKAENATLAVPVIDLINADTFE--YTPSPLVR------GGFNWGLHF 292

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
            W  +PE   K  ++   P  +PTMAGGLF++++ +F+ +G YD   DIWGGEN+E+SF+
Sbjct: 293 RWENLPEGTLKVPEDFKGPFRSPTMAGGLFAVNRLYFQHIGEYDMAMDIWGGENIEISFR 352

Query: 125 FNWHA------IPERE-----RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFD 173
             W        +P        RKR    A P    TM      +   + +   TY   + 
Sbjct: 353 V-WQCGGAIKIVPCSRVGHIFRKRRPYTA-PDGANTMLKNSMRVAHVWMD---TYKEYY- 406

Query: 174 IWGGENLELSFKG-DFGDVTSRKELRRNLGCKSFKWYLE--------------------V 212
                 LE   KG DFGD++ R +LR  L C++F WYL+                    V
Sbjct: 407 ----LKLEKVPKGYDFGDISDRLQLRERLECQNFDWYLKHVYPELRVPGEESKKPVSAPV 462

Query: 213 SNDW---------------SGMCIDSACKPTDMH------KPVGLYPCHKQGGNQFWMMS 251
              W               SG  + +A     +        P+ L  C  +  NQ W  +
Sbjct: 463 FQPWHSRKRNYLDSFQMRLSGTQLCAAVVSPKVKGFWKKGSPLTLQLCRPRAPNQLWYET 522

Query: 252 KHGEIRRDE-ACLDYAGGD-VILYPCHGSKGNQYFEY 286
           +  EI  D+  CL+      V++  CH   G+Q + +
Sbjct: 523 EKSEIILDKLLCLEAVEDTMVVVNKCHEMLGDQQWRH 559


>gi|46877107|ref|NP_598950.2| polypeptide N-acetylgalactosaminyltransferase 10 [Mus musculus]
 gi|51315866|sp|Q6P9S7.1|GLT10_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
           AltName: Full=Polypeptide GalNAc transferase 10;
           Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 10;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 10
 gi|38148689|gb|AAH60617.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 [Mus musculus]
 gi|74196924|dbj|BAE35020.1| unnamed protein product [Mus musculus]
          Length = 603

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 240 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 294 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 351

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   P   ++A  L  + + + ++   Y       
Sbjct: 352 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 408

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDV ++K+LR +L CKSFKW++                      E+ 
Sbjct: 409 RPEYRHLS----AGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 464

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIRRDEA------ 261
           N  +G+C D+  K   +  P+ L  C +  G   W   +        +IR  +       
Sbjct: 465 NVGTGLCTDT--KLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPGDPQHTKKF 522

Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
           C D       V LY CH  KGNQ ++Y
Sbjct: 523 CFDAVSHTSPVTLYDCHSMKGNQLWKY 549


>gi|403276501|ref|XP_003929936.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5
           [Saimiri boliviensis boliviensis]
          Length = 455

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 73/220 (33%), Positives = 107/220 (48%), Gaps = 31/220 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL+PLL  +A++   VV P+I  I D T  L++ P  +        G FDWNLQ
Sbjct: 242 CEVNRVWLEPLLHAIAKDPKMVVCPVIDVIDDRT--LKYKPSPVVR------GAFDWNLQ 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   E    +   +P+ +P MAGG+F+I + +F ++G YD   D WGGENLELS 
Sbjct: 294 FKWDNVFSYEMDGPEGPTKPIRSPAMAGGIFAIRRHYFNEIGQYDKDMDFWGGENLELSL 353

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +          IP   R  H +  +P     +   +             Y     +W  E
Sbjct: 354 RIWMCGGQLFIIP-CSRVGHISKKQPGKGSELINAVAR----------NYLRLVHVWLDE 402

Query: 179 NLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE 211
             E  F          +G+++ R ELR+ LGC+SF+WYL+
Sbjct: 403 YKEQFFLRKPGLKYMTYGNISERVELRKRLGCQSFQWYLD 442


>gi|327279823|ref|XP_003224655.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Anolis carolinensis]
          Length = 941

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 92/305 (30%), Positives = 127/305 (41%), Gaps = 83/305 (27%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
           WL+PLL+ +  N   V  P+I  I D             +   F  G F+W + F W  I
Sbjct: 598 WLEPLLERIHLNRKKVPCPVIEVISDKDMSY-------MTVDNFQRGIFNWPMNFGWKPI 650

Query: 70  PERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWH 128
           P    +++K    + +  P MAGGLFSIDK +F +LGTYD G D+WGGEN+E+SFK  W 
Sbjct: 651 PPDVIEKNKIKETDVIRCPVMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKV-WM 709

Query: 129 AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFF---EKLGTYDSGF------------D 173
              E E          +   +  G +F  D  +    ++L T +               D
Sbjct: 710 CGGEIE----------IIPCSRVGHIFRSDNPYSFPKDRLTTVERNLARVAEVWLDDYKD 759

Query: 174 IWGGENLELSFKG-DFGDVTSRKELRRNLGCKSFKWYLE----------------VSNDW 216
           ++ G    L  K  D GD+T +KELR+ L CKSFKWYLE                + N  
Sbjct: 760 LFYGHGYHLVQKNLDVGDLTQQKELRKRLQCKSFKWYLENVYPDIEAPLVKASGLIINIA 819

Query: 217 SGMCI--------------------------------DSACKPTDMHKPVGLYPCHKQGG 244
              CI                                DS   PTD    +GL+PC K+  
Sbjct: 820 LAKCITVNQSSLAFETCDVNNKDQKFNYTWMRLIQHGDSCVAPTDAKGTLGLHPCDKRNK 879

Query: 245 NQFWM 249
           +  W+
Sbjct: 880 SLKWL 884


>gi|344249957|gb|EGW06061.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Cricetulus
           griseus]
          Length = 494

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 95/331 (28%), Positives = 137/331 (41%), Gaps = 63/331 (19%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 125 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 178

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 179 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 236

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K          IP          + P   P         D      L       ++W  E
Sbjct: 237 KVWMCGGRMEDIPCSRVGHIYRKSVPYKVPAGPA-----DPCNCLSLQNLKRVAEVWMDE 291

Query: 179 NLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL--------------------- 210
             E  ++          GDV ++K LR +L CKSFKW++                     
Sbjct: 292 YAEYIYQRRPEYRHLSAGDVVAQKRLRGSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAW 351

Query: 211 -EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIR------ 257
            E+ N  +G+C D+    + +  P+ L  C +  G   W        +   +IR      
Sbjct: 352 GEIRNVGTGLCTDTKHGTSGL--PLRLETCIRGRGEAAWNSMQVFTFTWKEDIRPGDPQH 409

Query: 258 RDEACLDYAGGD--VILYPCHGSKGNQYFEY 286
             + C D    +  V LY CH  KGNQ ++Y
Sbjct: 410 TKKLCFDAVSHNSPVTLYDCHSMKGNQLWKY 440


>gi|351708673|gb|EHB11592.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Heterocephalus glaber]
          Length = 570

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 101/354 (28%), Positives = 148/354 (41%), Gaps = 83/354 (23%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WLQP+L  +  + + VVSP+I  I  D F        L +S     GGFDW+L 
Sbjct: 180 CEVNIEWLQPMLQRVKEDHTRVVSPIIDVISLDNF------AYLAASADLR-GGFDWSLH 232

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN---LE 120
           F W  IP  ++    +  +P+ TP +AGG+F IDKA+F  LG YD+  DIWGGEN   + 
Sbjct: 233 FKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKAWFNHLGKYDAQMDIWGGENFGPVA 292

Query: 121 LSFK------------FNWHAIPERE---RKRHKNAAEPVWTPT-----MAGGLFSIDKA 160
           L+ K             ++  +P  +   ++     A+P+         M GG   I   
Sbjct: 293 LALKQPAQLEGVGDNFISYWCLPVAKPIIQREGSPMAQPIRAELSFRVWMCGGSLEIVPC 352

Query: 161 -----FFEKLGTYD--------------SGFDIWGGENLELSFKG-------DFGDVTSR 194
                 F K   Y+                 ++W  E  +  ++         FG V +R
Sbjct: 353 SRVGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIGKAFGSVATR 412

Query: 195 KELRRNLGCKSFKWYLE---------VSNDWSGM------CIDSACKPTDMHKPVGLYPC 239
            E R+ + CKSF+WYLE         V     G+      C++S  + T     +G+  C
Sbjct: 413 IEQRKKMDCKSFRWYLENVYPELTVPVKEVLPGIIKQGVNCLESQGQDTAGDFLLGMGIC 472

Query: 240 HKQGGN----QFWMMSKHGEIRRDEACLDYA-------GGDVILYPCHGSKGNQ 282
                N    Q W+ S H  I++   CL          G  VIL  C+  +G Q
Sbjct: 473 RGSAKNPPPPQAWLFSDH-LIQQQGKCLAATSTSTASPGSPVILQVCNSREGKQ 525


>gi|198434303|ref|XP_002132126.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
           17 [Ciona intestinalis]
          Length = 870

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 99/324 (30%), Positives = 149/324 (45%), Gaps = 61/324 (18%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           EV   WL PLL+ +A +   +  P+I  I  D F     PG          G FDW L +
Sbjct: 508 EVTNNWLPPLLEPIALDRKVITCPMIDIINKDDFHYLTQPGDAMR------GAFDWELYY 561

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
               IP    K+ K+ ++P   P MAGGLF+ID+ +F+++G YD G +IWGGE  ELSFK
Sbjct: 562 KRIPIPPE--KQLKDPSDPFEDPVMAGGLFAIDRLYFKEIGEYDDGLEIWGGEQYELSFK 619

Query: 125 FNWHA---IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
             W     I +    R  +        ++  G  +I+K F           ++W  E  E
Sbjct: 620 -AWMCGGKILDAPCSRVGHIYREFMPYSLPPGT-NINKNF-------KRVAEVWMDEYAE 670

Query: 182 LSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE----------------------V 212
             +K          GD++ +K LR  L C+SF W+++                      +
Sbjct: 671 YFYKKRPHVRGIHPGDLSKQKALRELLECRSFDWFMKEVAPDIIKHYPPVMPEPAAWGML 730

Query: 213 SNDWSGMCIDSACKPTDMHKPVGLYPCHKQG-GNQFWMMSKHGEIR----RDEA---CLD 264
           SN+ S  C+D   K      P+ L PC ++G  +Q ++++   +IR     D+A   CLD
Sbjct: 731 SNEGSKRCLDGLYKKEG--APLSLMPCREEGTADQSFILTWKEDIRPGTSMDKARKFCLD 788

Query: 265 YAG--GDVILYPCHGSKGNQYFEY 286
             G    V+L+ CHG  GNQ ++Y
Sbjct: 789 GQGLNSPVVLWQCHGQYGNQLWKY 812


>gi|47847466|dbj|BAD21405.1| mFLJ00205 protein [Mus musculus]
          Length = 634

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 271 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 324

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 325 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 382

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   P   ++A  L  + + + ++   Y       
Sbjct: 383 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 439

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDV ++K+LR +L CKSFKW++                      E+ 
Sbjct: 440 RPEYRHLS----AGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 495

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIRRDEA------ 261
           N  +G+C D+  K   +  P+ L  C +  G   W   +        +IR  +       
Sbjct: 496 NVGTGLCTDT--KLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPGDPQHTKKF 553

Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
           C D       V LY CH  KGNQ ++Y
Sbjct: 554 CFDAVSHTSPVTLYDCHSMKGNQLWKY 580


>gi|18543347|ref|NP_570098.1| polypeptide N-acetylgalactosaminyltransferase 10 [Rattus
           norvegicus]
 gi|51315730|sp|Q925R7.1|GLT10_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
           AltName: Full=Polypeptide GalNAc transferase 10;
           Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 10;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 10
 gi|14150450|gb|AAK54498.1|AF241241_1 UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase T9 [Rattus
           norvegicus]
 gi|149052685|gb|EDM04502.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 [Rattus norvegicus]
          Length = 603

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 240 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 294 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 351

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   P   ++A  L  + + + ++   Y       
Sbjct: 352 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 408

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDV ++K+LR +L CKSFKW++                      E+ 
Sbjct: 409 RPEYRHLS----AGDVVAQKKLRGSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 464

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIRRDEA------ 261
           N  +G+C D+  K   +  P+ L  C +  G   W   +        +IR  +       
Sbjct: 465 NVGTGLCTDT--KHGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPGDPQHTKKF 522

Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
           C D       V LY CH  KGNQ ++Y
Sbjct: 523 CFDAVSHTSPVTLYDCHSMKGNQLWKY 549


>gi|345799489|ref|XP_546283.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Canis
           lupus familiaris]
          Length = 603

 Score =  117 bits (292), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 95/324 (29%), Positives = 135/324 (41%), Gaps = 55/324 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 240 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 294 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 351

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K          IP            P   P       ++ +     +  Y         E
Sbjct: 352 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEHIYQRRPE 411

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVSNDW 216
              LS     GDV ++K+LR  L CKSFKW++                      E+ N  
Sbjct: 412 YRHLS----AGDVAAQKKLRSALNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIHNVG 467

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------CLD 264
           +G+C+D+  K   +  P+ L  C +  G   W        +   +IR  +       C D
Sbjct: 468 TGLCVDT--KHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKFCFD 525

Query: 265 YAGGD--VILYPCHGSKGNQYFEY 286
                  V LY CH  KGNQ ++Y
Sbjct: 526 AISNTSPVTLYDCHSMKGNQLWKY 549


>gi|312377569|gb|EFR24376.1| hypothetical protein AND_11091 [Anopheles darlingi]
          Length = 1150

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 100/326 (30%), Positives = 136/326 (41%), Gaps = 72/326 (22%)

Query: 10   WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQFNWHAI 69
            WL PLL+ +A N    V P I  I DDTFEL       T   +   G FDWN+ +    +
Sbjct: 788  WLPPLLEPIAHNPRTCVCPFIDVIMDDTFEL-------TPQDQGARGAFDWNMLYK--RL 838

Query: 70   PERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA 129
            P R   + K+  +P  +P MAGGLF+I   FF +LG YD   +IWG E  ELSFK  W  
Sbjct: 839  PLRPEDQ-KDPTQPFESPVMAGGLFAISSMFFWELGGYDEMLEIWGAEQYELSFKI-WQC 896

Query: 130  IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYD-------SGFDIWGGENLE- 181
                           +  P    G      + F  + +YD          ++W  E  + 
Sbjct: 897  -----------GGRMIDAPCSRVGHIYRSYSPFPNVKSYDYVAKNHKRVAEVWMDEYKKY 945

Query: 182  ------LSFKGDFGDVTSRKELRRNLGCKSFKWYLEVSN----DW--------------- 216
                  + F  D GD+T  KELRR L CK F+W++E       DW               
Sbjct: 946  VYRKDPMRFSIDAGDLTKMKELRRRLNCKPFRWFIENVAPDLIDWYPPIEPEPFAFGVIQ 1005

Query: 217  ----SGMCIDSACKPTDMHKPVGLYPCHKQGGN-----QFWMMSKHGEIRRD--EACLDY 265
                 G+C+       D  K   L  C K   N     Q +  +   E++      CLD 
Sbjct: 1006 SQANKGLCV-GVVNVVD-QKGTALVACAKDKVNPERAEQHFQFTWRREVKSMLWAQCLDV 1063

Query: 266  A----GGDVILYPCHGSKGNQYFEYD 287
            A    G ++ L+ CH  +GNQ F+YD
Sbjct: 1064 ANHSVGVELQLFSCHTQQGNQLFQYD 1089



 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 97/336 (28%), Positives = 137/336 (40%), Gaps = 97/336 (28%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFELRFPP--GRLTSSYKFFIGGFDWNLQFNWH 67
           WL PLL+ +A N    V PLI  I D TF +      GR         G FDW   +   
Sbjct: 385 WLPPLLEPIAENPKTCVCPLIDVIDDQTFNIHPQDDGGR---------GLFDWRFHYKRL 435

Query: 68  AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF-- 125
           A+ E +R    +   P  +P MAGGLF+I   FF +LG YD   DIWG E  ELSFK   
Sbjct: 436 ALKESDRV---SPTAPFPSPVMAGGLFAIGTNFFWELGGYDEELDIWGAEQYELSFKIWQ 492

Query: 126 ------------------NWHAIPERER-----KRHKNAAEPVWTPTMAGGLFSIDKAFF 162
                             ++   P   +     + HK  AE +W       ++  D   +
Sbjct: 493 CGGRMLDAPCSRFSHIYRSYSPFPNSRKYDFITRNHKRVAE-IWMDEYKQYIYDRDPERY 551

Query: 163 EKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-EVSNDW----- 216
                                 + D GD+T  K LR  L CK F+W+L EV+ +      
Sbjct: 552 A---------------------RSDAGDLTKMKALREKLQCKPFEWFLKEVAPEILQLYP 590

Query: 217 --------SG---------MCIDSACKPTDMHKPVGLYPC-----HKQGGNQFWMMSKHG 254
                   SG         +CID+  +P     P+G++PC     H +  NQ++++S H 
Sbjct: 591 PVEPEPFASGAIQSIAEPTLCIDTMQRPRG--NPIGMHPCDSDLIHPKNMNQYFVLSWHR 648

Query: 255 EIRR--DEACLDYAG----GDVILYPCHGSKGNQYF 284
           +I++  DE C D         V +Y CH  K  Q+ 
Sbjct: 649 DIQQKSDEQCFDVPESAPRSPVTIYTCHNIKYLQHL 684


>gi|395840002|ref|XP_003792859.1| PREDICTED: N-acetylgalactosaminyltransferase 7 isoform 1 [Otolemur
           garnettii]
          Length = 657

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 89/319 (27%), Positives = 142/319 (44%), Gaps = 37/319 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   W  PL+  ++++ +    PLI  I  +T+E+    G     Y    G +DW++ 
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYAR--GAWDWSML 361

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    +  RE+   K   EP  +P MAGGLF+I++ FF +LG YD G  IWGGEN E+S+
Sbjct: 362 WKRVPLTLREKSLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISY 421

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
           K  W    +                   G    +       L  Y    ++W  E  +  
Sbjct: 422 KI-WQCGGKLLFVPCSRVGHIYRLEGWQGNPPPVSVGSSPTLKNYVRVVEVWWDEYKDYF 480

Query: 184 FKG-------DFGDVTSRKELRRNLGCKSFKWYLE--------------VSNDW------ 216
           +          +GD++  K+ R +  CKSFKW++E               + DW      
Sbjct: 481 YASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDIPSHYPLPPKNIDWGEIRGF 540

Query: 217 -SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  CIDS  K       V L PCH+ GGNQ + +++  ++ + + CL     G  +++ 
Sbjct: 541 ETAYCIDSMGKTNGGF--VELGPCHRMGGNQLFRINEANQLMQYDQCLTKGPDGSKIMIT 598

Query: 274 PC--HGSKGNQYFEYDYKY 290
            C  +G K  QYF+  Y++
Sbjct: 599 HCSLNGFKEWQYFKNLYRF 617


>gi|324520154|gb|ADY47570.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Ascaris suum]
          Length = 286

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 81/252 (32%), Positives = 115/252 (45%), Gaps = 48/252 (19%)

Query: 73  ERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHA--- 129
           ER+ H  +A P+  PT+AGGLF+ID+ FF  +G+YD G  +WGGENLE+SF+  W     
Sbjct: 2   ERRNHDRSA-PIQAPTIAGGLFAIDRQFFYDIGSYDEGMQVWGGENLEISFRV-WTCGGS 59

Query: 130 --IPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKG- 186
             I    R  H    +  +T    GG   +      +        ++W  E  E  +K  
Sbjct: 60  LEIHPCSRVGHVFRKQTPYT--FPGGTAKVIHHNAARTA------EVWMDEYKEFFYKMV 111

Query: 187 ------DFGDVTSRKELRRNLGCKSFKWYLE-----------------VSNDWSGMCIDS 223
                 D GD+  RK LR NL C+SF+WYLE                 + N  +  C+D+
Sbjct: 112 PAARSVDVGDLADRKALRENLQCRSFRWYLENIYPEAPIPRGFKSIGQIKNPSTTKCVDT 171

Query: 224 ACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYAGG-------DVILYPCH 276
             +     +  G+  CH  GGNQ W ++  GE+R DE CL            DV L  C 
Sbjct: 172 LGRSAG--EAAGVTVCHGIGGNQAWSLTSDGEVRSDETCLAADRAADKAKKIDVKLEKCS 229

Query: 277 GSKGNQYFEYDY 288
            +  N   ++DY
Sbjct: 230 TTSVNVNHQFDY 241


>gi|354481325|ref|XP_003502852.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Cricetulus griseus]
          Length = 715

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 96/327 (29%), Positives = 141/327 (43%), Gaps = 61/327 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 352 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 405

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 406 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 463

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP          + P   P   ++A  L  + + + ++   Y       
Sbjct: 464 KVWMCGGRMEDIPCSRVGHIYRKSVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 520

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDV ++K LR +L CKSFKW++                      E+ 
Sbjct: 521 RPEYRHLS----AGDVVAQKRLRGSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 576

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSK------HGEIR------RDEA 261
           N  +G+C D+    + +  P+ L  C +  G   W   +        +IR        + 
Sbjct: 577 NVGTGLCTDTKHGTSGL--PLRLETCIRGRGEAAWNSMQVFTFTWKEDIRPGDPQHTKKL 634

Query: 262 CLDYAGGD--VILYPCHGSKGNQYFEY 286
           C D    +  V LY CH  KGNQ ++Y
Sbjct: 635 CFDAVSHNSPVTLYDCHSMKGNQLWKY 661


>gi|449664489|ref|XP_002168298.2| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Hydra
           magnipapillata]
          Length = 599

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 85/306 (27%), Positives = 133/306 (43%), Gaps = 62/306 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PL+  +  + + + +P+I  I  D F +   P     S+    G F+W + 
Sbjct: 228 CEVGGNWLPPLIAPIQEDPTTLTAPIIDGINWDDFSIN--PVYQKGSHSR--GIFEWGML 283

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    +PE+E ++    +EP  +PT AGGLF+I +++F++LG YD G  IWGGEN ELSF
Sbjct: 284 YKETDLPEKEARKRLYHSEPYNSPTHAGGLFAIKRSWFKELGWYDPGLLIWGGENYELSF 343

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTM---------------AGGLFSIDKAFFEKLGTY 168
           K  W                 +W P                 +G +          L  Y
Sbjct: 344 KL-WQC-----------GGRSLWVPCSHVSHVYRGHSCSSCHSGDMGRKWSGIPLSLRNY 391

Query: 169 DSGFDIWGGENLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYL---------- 210
               ++W  +  +  F          D GDV+ +  L++ + CKSF W++          
Sbjct: 392 KRLIEVWFDDKYKEFFYTREPLARFIDTGDVSEQMALKKRMNCKSFTWFMEEIAYDVLKK 451

Query: 211 -----------EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRD 259
                      EV N  + +CID+  +       +GL  CHK GGNQ W ++  G++   
Sbjct: 452 YPEPPPNAHWGEVRNIATNLCIDTLNRSPPYR--IGLSGCHKSGGNQLWRLNTLGQLASG 509

Query: 260 EACLDY 265
           E C+ Y
Sbjct: 510 EWCVRY 515


>gi|170591827|ref|XP_001900671.1| glycosyl transferase, group 2 family protein [Brugia malayi]
 gi|158591823|gb|EDP30426.1| glycosyl transferase, group 2 family protein [Brugia malayi]
          Length = 597

 Score =  116 bits (291), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 103/342 (30%), Positives = 134/342 (39%), Gaps = 98/342 (28%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV +RWL+PLLD +  +   VV P+I  I  DT +    P           GG  W+L 
Sbjct: 245 CEVNERWLEPLLDRIVADRHTVVCPVIDIIDADTLKYIESP--------VCKGGMSWSLA 296

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W   P       K    PV +PTMAGGLF+IDK +F  LG YD G +IWG EN+E+S 
Sbjct: 297 FKWDYFPPLYFDEPKQYVRPVKSPTMAGGLFAIDKKYFNMLGQYDPGMEIWGAENVEISL 356

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKA-----FFEKLGTYDSGFD----- 173
           +                    +W   M GG   I         F +   Y  G D     
Sbjct: 357 R--------------------IW---MCGGRLEIVPCSRVGHIFRQRRPYGLGIDSMGRN 393

Query: 174 ------IWGGENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYLE------VSN 214
                 IW  E ++  +         + GD+   KELRR L CK F WYL+      + N
Sbjct: 394 AARTANIWLDEYIDQFYAAKPNLRGINIGDIREMKELRRKLHCKPFLWYLQNIYPELLPN 453

Query: 215 DWSGMCIDSACKPTDMHK-------------------------------PVGLYPCHKQG 243
           +   M ID   K +DM +                                V +  C K  
Sbjct: 454 NHPTM-ID--LKKSDMLRSRNIARYHIILYNTSLCLTAQSVNGRLVRGSSVVVEYCRKGD 510

Query: 244 GNQFWMMSKHGEIR---RDEACLDYAGGDVILYPCHGSKGNQ 282
            +Q W  +K GE+R       CLD   G  IL  CH    +Q
Sbjct: 511 RHQIWRWTKLGELRPMGSATLCLDSLKGPRIL-KCHLQGAHQ 551


>gi|195436945|ref|XP_002066406.1| GK18112 [Drosophila willistoni]
 gi|194162491|gb|EDW77392.1| GK18112 [Drosophila willistoni]
          Length = 588

 Score =  116 bits (291), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 92/331 (27%), Positives = 144/331 (43%), Gaps = 61/331 (18%)

Query: 5   EVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQF 64
           EV ++W++PLL ++   +S +  P+I  I  DTF   + P  L        GGF+W L F
Sbjct: 215 EVNRQWVEPLLRLIKAENSTLAVPVIDLINADTFG--YTPSPLVR------GGFNWGLHF 266

Query: 65  NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFK 124
            W  +PE   K+ ++   P  +PTMAGGLF++++ +F+ +G YD   DIWGGEN+E+SF+
Sbjct: 267 RWENLPEGTLKQPEDFRGPFRSPTMAGGLFAVNRLYFQHIGEYDMAMDIWGGENIEISFR 326

Query: 125 F-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
                 +   +P            P  +P  A  +    K        +   F  +  ++
Sbjct: 327 AWQCGGSIKIVPCSRVGHIFRKRRPYTSPDGANTML---KNSLRLAYVWMDRFKDYYIKH 383

Query: 180 LELSFKGDFGDVTSRKELRRNLGCKSFKWYLE---------------------VSNDWSG 218
            ++S   D+GD++ R +LR  L C  F WYL+                     +   W  
Sbjct: 384 EKVSKDFDYGDISERVKLREKLQCHDFDWYLKNIYPELPIPGEEPKKTAAAAPIYQPWHS 443

Query: 219 M---CIDS---------ACKPTDMHKPVG---------LYPCHKQGGNQFWMMSKHGEIR 257
                IDS          C      K  G         L PCH    NQ W  ++  EI 
Sbjct: 444 RKRNYIDSYQLRLSGTELCASVVAPKVKGFWKKGSGLQLQPCH-NSPNQIWYETEKSEII 502

Query: 258 RDE-ACLDYAG-GDVILYPCHGSKGNQYFEY 286
            D+  CL+ +G   V++  CH   G+Q + +
Sbjct: 503 LDKLLCLEASGDAQVVINKCHEMLGDQQWRH 533


>gi|395840004|ref|XP_003792860.1| PREDICTED: N-acetylgalactosaminyltransferase 7 isoform 2 [Otolemur
           garnettii]
          Length = 657

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 87/319 (27%), Positives = 143/319 (44%), Gaps = 37/319 (11%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   W  PL+  ++++ +    PLI  I  + + +   P +      F  G +DW+L 
Sbjct: 304 CEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIE--PQQGGDEDGFARGAWDWSLL 361

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    +  +E+ + K+  EP  +P MAGGLF+I++ FF +LG YD G  IWGGEN E+S+
Sbjct: 362 WKRIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISY 421

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 183
           K  W    +                   G    +       L  Y    ++W  E  +  
Sbjct: 422 KI-WQCGGKLLFVPCSRVGHIYRLEGWQGNPPPVSVGSSPTLKNYVRVVEVWWDEYKDYF 480

Query: 184 FKG-------DFGDVTSRKELRRNLGCKSFKWYLE--------------VSNDW------ 216
           +          +GD++  K+ R +  CKSFKW++E               + DW      
Sbjct: 481 YASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDIPSHYPLPPKNIDWGEIRGF 540

Query: 217 -SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILY 273
            +  CIDS  K       V L PCH+ GGNQ + +++  ++ + + CL     G  +++ 
Sbjct: 541 ETAYCIDSMGKTNGGF--VELGPCHRMGGNQLFRINEANQLMQYDQCLTKGPDGSKIMIT 598

Query: 274 PC--HGSKGNQYFEYDYKY 290
            C  +G K  QYF+  Y++
Sbjct: 599 HCSLNGFKEWQYFKNLYRF 617


>gi|348568063|ref|XP_003469818.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5-like
           [Cavia porcellus]
          Length = 499

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 70/217 (32%), Positives = 113/217 (52%), Gaps = 23/217 (10%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL+PLL  ++++S  VV+P+I  I  D   L++ P  L        G FDW LQ
Sbjct: 287 CEVNRVWLEPLLAAISKDSRTVVTPVIDII--DGISLQYLPSPLVR------GAFDWKLQ 338

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W ++   E     +   P+ +P MAGG+F++ + FF +LG YD   D+WGGENLELS 
Sbjct: 339 FKWDSVFSYETDSEGSPTNPIRSPAMAGGIFAMHRPFFYELGEYDKDMDLWGGENLELSL 398

Query: 124 KF-----NWHAIP-ERERKRHKNAAEP--VWTPTMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           +          IP  R     K  ++P    +  +A     +   + ++   Y   F + 
Sbjct: 399 RIWMCGGQLLIIPCSRVGHITKLYSKPDSALSKAVARNHLRLVHVWLDE---YKEQFFLR 455

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLEV 212
             +   ++    +G+++ R +LR+ LGC+SF+WYL+ 
Sbjct: 456 NPDLKSMT----YGNISERVQLRKQLGCRSFQWYLDT 488


>gi|308485607|ref|XP_003105002.1| CRE-GLY-11 protein [Caenorhabditis remanei]
 gi|308257323|gb|EFP01276.1| CRE-GLY-11 protein [Caenorhabditis remanei]
          Length = 624

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 75/221 (33%), Positives = 108/221 (48%), Gaps = 33/221 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL PLLD + +N   VV P+I  I  D   +++    + +      GG +W + 
Sbjct: 277 CEVNEEWLPPLLDQIKQNRRRVVCPIIDII--DAITMKYVESPVCT------GGVNWAMT 328

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W            N   P+ +PTMAGGLF+ID+ +F ++G+YD G D+WG EN+E+SF
Sbjct: 329 FKWDYPHRSYFDDPMNYVNPLKSPTMAGGLFAIDRDYFFEIGSYDEGMDVWGAENVEISF 388

Query: 124 KFNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSG----FDIWGGE 178
           +  W               E +  P +  G +F   + +  K  +          +W  E
Sbjct: 389 RI-WTC-----------GGELLIMPCSRVGHIFRRQRPYGIKTDSMGKNSVRVARVWLDE 436

Query: 179 NLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYLE 211
            LE  F          D+GD+TSR  LR+NL CK FKWYLE
Sbjct: 437 YLENFFVARPTYRTFTDYGDLTSRINLRQNLQCKPFKWYLE 477


>gi|341889625|gb|EGT45560.1| hypothetical protein CAEBREN_24622 [Caenorhabditis brenneri]
          Length = 625

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 74/221 (33%), Positives = 110/221 (49%), Gaps = 33/221 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL PLLD + +N   VV P+I  I  D   +++    + +      GG +W + 
Sbjct: 278 CEVNEEWLPPLLDQIKQNRRRVVCPIIDII--DAITMKYVESPVCT------GGVNWAMT 329

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W        +   N   P+ +PTMAGGLF+ID+ +F ++G+YD G D+WG EN+E+SF
Sbjct: 330 FKWDYPHRSYFEDPMNYVNPLKSPTMAGGLFAIDRDYFFEIGSYDEGMDVWGAENVEISF 389

Query: 124 KFNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAFFEKLGTYDSG----FDIWGGE 178
           +  W               E +  P +  G +F   + +  K  +          +W  E
Sbjct: 390 RI-WTC-----------GGELLIMPCSRVGHIFRRQRPYGIKTDSMGKNSVRLARVWLDE 437

Query: 179 NLELSFKG--------DFGDVTSRKELRRNLGCKSFKWYLE 211
            LE  F+         ++GD+TSR  LR+NL CK FKWYLE
Sbjct: 438 YLENFFEARPTYRTFTEYGDLTSRINLRQNLQCKPFKWYLE 478


>gi|194222233|ref|XP_001490001.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Equus
           caballus]
          Length = 539

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 94/312 (30%), Positives = 137/312 (43%), Gaps = 59/312 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL+PLL  +  +   VV P+I  I DDTFE         +      GGF+W L 
Sbjct: 211 CECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEY-------MAGSDMTYGGFNWKLN 263

Query: 64  FNWHAIPERERKRHK-NAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWG-GENLEL 121
           F W+ +P+RE  R K +   PV     +G + ++         ++     IW  G +LE+
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPV--SCFSGNMTALPTGLLYNSCSFSQ---IWQCGGSLEI 318

Query: 122 SFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLE 181
                   +    RK     A P    T  GG   +      +L       ++W  E  +
Sbjct: 319 ---VTCSHVGHVFRK-----ATPY---TFPGGTGHVINKNNRRLA------EVWMDEFKD 361

Query: 182 LSF-------KGDFGDVTSRKELRRNLGCKSFKWYL-----------------EVSNDWS 217
             +       K D+GDV+ RK LR NL CK F WYL                 E+ N  +
Sbjct: 362 FFYIISPGVVKVDYGDVSVRKSLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVET 421

Query: 218 GMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRRDEACLDYA--GGDVILYPC 275
             C+D+  +  +  + VG++ CH  GGNQ +  +   EIR D+ CLD +   G VI+  C
Sbjct: 422 NQCLDNMGRKEN--EKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIMLKC 479

Query: 276 HGSKGNQYFEYD 287
           H  +GNQ +EYD
Sbjct: 480 HHMRGNQLWEYD 491


>gi|260789712|ref|XP_002589889.1| hypothetical protein BRAFLDRAFT_81982 [Branchiostoma floridae]
 gi|229275074|gb|EEN45900.1| hypothetical protein BRAFLDRAFT_81982 [Branchiostoma floridae]
          Length = 534

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 101/335 (30%), Positives = 144/335 (42%), Gaps = 70/335 (20%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV   WL PLL+ ++ + + V  P I  I   TFE +   G          G FDW  Q
Sbjct: 168 CEVNVNWLPPLLEPISVSMTTVTIPTIDVIDHATFEYKEQQGGPMR------GVFDW--Q 219

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
            N+  IP  + R R      P  TP M GG+F+IDK FF  LG YDSG +IWGGE  ELS
Sbjct: 220 LNYKRIPVLDGRGRKVRPTLPFSTPVMPGGVFAIDKEFFHHLGGYDSGLEIWGGEQFELS 279

Query: 123 FKFNWH---AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGEN 179
           FK  W     + E    R  +     ++P      ++ D    + L  Y    ++W  + 
Sbjct: 280 FKI-WQCGGVLQEVPCSRVGHVFRK-FSP------YATDNDVLQILKNYMRVAEVWMDDY 331

Query: 180 LELSFKG-----------DFGDVTSRKELRRNLGCKSFKWYL-EVSNDW----------- 216
            +  +K            D GD++S+K LR+ LGC+ F W++ EV++D            
Sbjct: 332 KQYYYKRMLRGPKNVTNFDLGDLSSQKPLRQRLGCRDFGWFMREVASDLVKHYPLKDPDV 391

Query: 217 ----------SGMCIDSACKPTDMHKPVGLYPCHKQGG---------NQFWMMSKHGEIR 257
                     +G+C+DS     +   PV L  C  + G         NQ +  +   EI 
Sbjct: 392 LQQGRIQSVGTGLCLDS--DGLNSEDPVVLRRCRDRQGAFVLTKTYPNQNFTYTGLKEIE 449

Query: 258 RDE--ACLDYAG----GDVILYPCHGSKGNQYFEY 286
             +   C D         ++   CHG  GNQ +EY
Sbjct: 450 TTDRHLCFDVDSLSREKTLVFLTCHGEGGNQMWEY 484


>gi|395504936|ref|XP_003756802.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Sarcophilus harrisii]
          Length = 651

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 99/327 (30%), Positives = 141/327 (43%), Gaps = 61/327 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +A N   +V P+I  I +D F      G  T +     G FDW + 
Sbjct: 285 CEANVNWLPPLLDRIASNRKTIVCPMIDVIDNDHF------GYKTQAGDAMRGAFDWEMY 338

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 339 YKRIPIPLELQK--SDPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 396

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPT---MAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   PT   +A  L  + + + ++   Y     I+
Sbjct: 397 KVWMCGGRMEDIPCSRVGHIYRKYIPYKIPTGVSLARNLKRVAEVWMDEYAEY-----IY 451

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             + L        GDVT++K+LR +L CKSFKW++                      E+ 
Sbjct: 452 --QRLPEYRHLSTGDVTAQKDLRNHLNCKSFKWFMTEIAWDLPRYYPPVEPAAAAWGEIR 509

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
           N  + +CI +  K      P+ L  C K      W        S   +IR  +       
Sbjct: 510 NVGTQLCIGT--KHGAPGSPLRLESCVKGRAEAAWSNVQVFTFSWREDIRPGDPQHTKKF 567

Query: 262 CLDYA--GGDVILYPCHGSKGNQYFEY 286
           C D       V LY CHG KGNQ ++Y
Sbjct: 568 CFDTISHSSPVTLYDCHGMKGNQLWKY 594


>gi|410949405|ref|XP_003981412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Felis
           catus]
          Length = 603

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 95/324 (29%), Positives = 135/324 (41%), Gaps = 55/324 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 240 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 294 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 351

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K          IP            P   P       ++ +     +  Y         E
Sbjct: 352 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEHIYQRRPE 411

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVSNDW 216
              LS     GDV ++K+LR +L CKSFKW++                      E+ N  
Sbjct: 412 YRHLS----AGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVG 467

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------CLD 264
           +G+C D+  K   +  P+ L  C +  G   W        +   +IR  +       C D
Sbjct: 468 TGLCADT--KHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKFCFD 525

Query: 265 YAGGD--VILYPCHGSKGNQYFEY 286
                  V LY CH  KGNQ ++Y
Sbjct: 526 AISNTSPVTLYDCHSMKGNQLWKY 549


>gi|291387688|ref|XP_002710374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Oryctolagus cuniculus]
          Length = 603

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 240 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 294 YKRIPIPPELQK--VDPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 351

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   P   ++A  L  + + + ++   Y       
Sbjct: 352 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 408

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDV ++K+LR +L CKSFKW++                      E+ 
Sbjct: 409 RPEYRHLS----AGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 464

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
           N  +G+C D+  K   +  P+ L  C +  G   W        +   +IR  +       
Sbjct: 465 NVGTGLCADT--KHWALGSPLRLESCVRDRGEAAWNSMQVFTFTWREDIRPGDPQHTKKF 522

Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
           C D       V LY CH  KGNQ ++Y
Sbjct: 523 CFDAISHTSPVTLYDCHSMKGNQLWKY 549


>gi|417515619|gb|JAA53628.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Sus
           scrofa]
          Length = 506

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 95/324 (29%), Positives = 135/324 (41%), Gaps = 55/324 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 143 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 196

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 197 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 254

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K          IP            P   P       ++ +     +  Y         E
Sbjct: 255 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEHIYQRRPE 314

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVSNDW 216
              LS     GDV ++K+LR +L CKSFKW++                      E+ N  
Sbjct: 315 YRHLS----AGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVG 370

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------CLD 264
           +G+C D+  K   +  P+ L  C +  G   W        +   +IR  +       C D
Sbjct: 371 TGLCADT--KHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKFCFD 428

Query: 265 YAG--GDVILYPCHGSKGNQYFEY 286
                  V LY CH  KGNQ ++Y
Sbjct: 429 AISHTSPVTLYDCHSMKGNQLWKY 452


>gi|395817210|ref|XP_003782067.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Otolemur garnettii]
          Length = 603

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 97/327 (29%), Positives = 139/327 (42%), Gaps = 61/327 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 240 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 294 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 351

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   P   ++A  L  + + + ++   Y       
Sbjct: 352 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 408

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDV ++K LR +L CKSFKW++                      E+ 
Sbjct: 409 RPEYRHLS----AGDVAAQKRLRTSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 464

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
           N  +G+C D+  K   +  P+ L  C +  G   W        +   +IR  +       
Sbjct: 465 NVGTGLCADT--KHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKF 522

Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
           C D       V LY CH  KGNQ ++Y
Sbjct: 523 CFDAISHTSPVTLYDCHSMKGNQLWKY 549


>gi|327262637|ref|XP_003216130.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
           [Anolis carolinensis]
          Length = 500

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 56/122 (45%), Positives = 74/122 (60%), Gaps = 7/122 (5%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WL PLL  +  + SHVVSP+I  I  DTF          ++     GGFDW+L 
Sbjct: 178 CEVNKDWLLPLLQRIKEDPSHVVSPVIDIINLDTFAY-------VAASSDLRGGFDWSLH 230

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +  +++ +  +  EP+ TP +AGGLF IDKA+F  LG YD+  DIWGGEN E+SF
Sbjct: 231 FKWEQLSPKQKAKRTDPTEPIKTPIIAGGLFVIDKAWFNHLGKYDAAMDIWGGENFEISF 290

Query: 124 KF 125
           + 
Sbjct: 291 RV 292



 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 65/220 (29%), Positives = 93/220 (42%), Gaps = 36/220 (16%)

Query: 96  IDKAFFEKLGTYDSGFDIWGGENLELSFKFNWHAIPERERKRHKNAAEPVWTPTMAGGLF 155
           ID    +      +  D+ GG   + S  F W  +  +++ +  +  EP+ TP +AGGLF
Sbjct: 204 IDIINLDTFAYVAASSDLRGG--FDWSLHFKWEQLSPKQKAKRTDPTEPIKTPIIAGGLF 261

Query: 156 SIDKAFFEKLGTYDSGFDIWGGENLELSFK-------------GDFGDVTSRK------E 196
            IDKA+F  LG YD+  DIWGGEN E+SF+                G V  +K      E
Sbjct: 262 VIDKAWFNHLGKYDAAMDIWGGENFEISFRVWMCGGSLEIIPCSRVGHVFRKKHPYVFPE 321

Query: 197 LRRNLGCKSFKWYLEVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEI 256
              N   K+ K   EV  D        A +P    +P G  P           + + G I
Sbjct: 322 GNANTYIKNTKRTAEVWMDEYKQYY-YAARPAAQGRPYGEIPEES--------LYQTGMI 372

Query: 257 RRDEACLDYAGGD------VILYPCHGSKGNQYFEYDYKY 290
           R+ + CL+    +      VIL PC  SKG      ++ Y
Sbjct: 373 RQRQRCLETQKSEGQDFPVVILNPCITSKGPASAAQEWTY 412


>gi|312381524|gb|EFR27256.1| hypothetical protein AND_06164 [Anopheles darlingi]
          Length = 377

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 73/214 (34%), Positives = 108/214 (50%), Gaps = 14/214 (6%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL+PLLD L  + + ++SP+I  I  +TF  R    RL        GGFDW+L 
Sbjct: 7   CEVNRGWLEPLLDRLQLDPTGLLSPVIDIIDAETFGYRANSARLR-------GGFDWSLH 59

Query: 64  FNWHAIPERE-RKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           F W  I E E   R  + ++P ++P ++GG+F + K+ FE+LG +D G DIWGGE+LE+S
Sbjct: 60  FRWLPIAEEELEHRRHDESQPFYSPAISGGIFIVAKSLFEQLGGFDPGMDIWGGESLEMS 119

Query: 123 FK-----FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
            K      +   +P            P   P     L  +       L   D   + +  
Sbjct: 120 LKAWLCGAHVEVVPCSRIGHVFRRKHPFSFPPDGSHLTYLRNTKRVALVWMDEFKNFFYD 179

Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE 211
             LE +   D G V +++ELR+ L C+ F WYL+
Sbjct: 180 VRLE-AIAIDAGSVRAQQELRQKLSCRRFSWYLQ 212


>gi|443727149|gb|ELU14019.1| hypothetical protein CAPTEDRAFT_197005 [Capitella teleta]
          Length = 613

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 101/330 (30%), Positives = 140/330 (42%), Gaps = 70/330 (21%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +A +   VV P I  +  +TF  R       +  +   G FDW  +
Sbjct: 248 CEANVNWLPPLLDPIAEDYRTVVCPFIDVVDYETFAYR-------AQDEGARGAFDW--E 298

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F +  +P       K+ A P  +P MAGGLF+I   +F +LG YD G DIWGGE  ELSF
Sbjct: 299 FFYKRLPLLPEDL-KHPARPFKSPVMAGGLFAISAKWFWELGGYDPGLDIWGGEQYELSF 357

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGT-------YDSGFDIWG 176
           K  W               + +  P    G      A F   G        Y    ++W 
Sbjct: 358 KL-WQC-----------GGQMLDAPCSRVGHIYRKFAPFPNPGVGDFVGRNYRRVAEVWM 405

Query: 177 GENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL------------------- 210
            E  E  +K          G++T +  +R+ L CK FKW++                   
Sbjct: 406 DEYAEFLYKRRPQYRSIQPGNITEQLAIRKKLNCKPFKWFMEEIAFDLPKKYPPIEPPAV 465

Query: 211 ---EVSNDWSGMCIDSACKPTDMHKPVGLYPCHKQ----GGNQFWMMSKHGEIR--RDEA 261
              E+ N  + +C+D+  K     +  GL  C K     GG Q   ++ H +IR  +   
Sbjct: 466 AEGEMRNVGANLCVDTRFK--GQGETFGLEKCAKDEPGIGGEQRLQITWHKDIRPGKRSF 523

Query: 262 CLDYAG----GDVILYPCHGSKGNQYFEYD 287
           C D +       VILY CHG KGNQ+F+YD
Sbjct: 524 CFDVSTSVEKAPVILYNCHGMKGNQWFKYD 553


>gi|348575151|ref|XP_003473353.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Cavia porcellus]
          Length = 602

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 97/327 (29%), Positives = 139/327 (42%), Gaps = 61/327 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 239 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 292

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 293 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 350

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   P   ++A  L  + + + +    Y       
Sbjct: 351 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDDYAEY---IYQR 407

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDV ++K+LR +L CKSFKW++                      E+ 
Sbjct: 408 RPEYRHLS----AGDVVAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 463

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
           N  +G+C D+  K   +  P+ L  C +  G   W        +   +IR  +       
Sbjct: 464 NVGTGLCADT--KHGALGAPLRLESCIRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKF 521

Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
           C D       V LY CH  KGNQ ++Y
Sbjct: 522 CFDAISHTSPVTLYDCHSMKGNQLWKY 548


>gi|350594474|ref|XP_003134177.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Sus
           scrofa]
          Length = 624

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 95/324 (29%), Positives = 135/324 (41%), Gaps = 55/324 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 261 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 314

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 315 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 372

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           K          IP            P   P       ++ +     +  Y         E
Sbjct: 373 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEHIYQRRPE 432

Query: 179 NLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVSNDW 216
              LS     GDV ++K+LR +L CKSFKW++                      E+ N  
Sbjct: 433 YRHLS----AGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVG 488

Query: 217 SGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------CLD 264
           +G+C D+  K   +  P+ L  C +  G   W        +   +IR  +       C D
Sbjct: 489 TGLCADT--KHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKFCFD 546

Query: 265 YAG--GDVILYPCHGSKGNQYFEY 286
                  V LY CH  KGNQ ++Y
Sbjct: 547 AISHTSPVTLYDCHSMKGNQLWKY 570


>gi|355691777|gb|EHH26962.1| hypothetical protein EGK_17053, partial [Macaca mulatta]
 gi|355750353|gb|EHH54691.1| hypothetical protein EGM_15579, partial [Macaca fascicularis]
          Length = 551

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 188 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 241

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 242 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 299

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   P   ++A  L  + + + ++   Y       
Sbjct: 300 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 356

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDV ++K+LR +L CKSFKW++                      E+ 
Sbjct: 357 RPEYRHLS----AGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 412

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
           N  +G+C D+  K   +  P+ L  C +  G   W        +   +IR  +       
Sbjct: 413 NVGTGLCADT--KHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKF 470

Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
           C D       V LY CH  KGNQ ++Y
Sbjct: 471 CFDAISHTSPVTLYDCHSMKGNQLWKY 497


>gi|402873191|ref|XP_003900469.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Papio
           anubis]
          Length = 637

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 274 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 327

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 328 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 385

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   P   ++A  L  + + + ++   Y       
Sbjct: 386 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 442

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDV ++K+LR +L CKSFKW++                      E+ 
Sbjct: 443 RPEYRHLS----AGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 498

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
           N  +G+C D+  K   +  P+ L  C +  G   W        +   +IR  +       
Sbjct: 499 NVGTGLCADT--KHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKF 556

Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
           C D       V LY CH  KGNQ ++Y
Sbjct: 557 CFDAISHTSPVTLYDCHSMKGNQLWKY 583


>gi|449679600|ref|XP_004209371.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
           partial [Hydra magnipapillata]
          Length = 565

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 91/320 (28%), Positives = 140/320 (43%), Gaps = 57/320 (17%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PL+  + +N   V  P +  I  D+F  R           +  G F+W   
Sbjct: 200 CEANVGWLPPLVSEIEKNYRCVTCPTVDFIDHDSFYYR-------GVDPYIRGTFNWRFD 252

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    I E ++   K+  E V +P MAGGLF+I K F+E+LG YD G  +WGGE  E+SF
Sbjct: 253 YKERGITEHQKAARKSVTEGVRSPVMAGGLFAISKKFWEELGKYDPGMYVWGGEQYEISF 312

Query: 124 KFNWHAIPERERKRHKNAAEPVWTP-TMAGGLFSIDKAF-----FEKLGTYDSGFDIWGG 177
           K  W               E +  P +  G ++  +  +     F  L  +    ++W  
Sbjct: 313 KL-WMC-----------GGEMLNMPCSRVGHVYRRNVPYTYNKPFASLINFKRVAEVWMD 360

Query: 178 ENLELSFKG-------DFGDVTSRKELRRNLGCKSFKWYL-------------------E 211
           E  E  ++G       + G+++ R ++R    CKSFKWYL                   E
Sbjct: 361 EFKEFLYRGNPMVRSQNAGNISERIKVRERNKCKSFKWYLLNVANDTVRTRYEPDRASGE 420

Query: 212 VSNDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIRR-DEACLD--YAGG 268
           + N  + +C+D+     +  + + L  C ++  NQ +  +   E+ +  E CLD  YA  
Sbjct: 421 IENTHTKLCLDTYG--ANAGRKIKLSKCGQRNSNQIFRWTYIYELHQYPEECLDARYADM 478

Query: 269 D-VILYPCHGSKGNQYFEYD 287
           D V +  CH   GNQ F YD
Sbjct: 479 DNVYIEKCHEMGGNQKFLYD 498


>gi|109079467|ref|XP_001111603.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           isoform 5 [Macaca mulatta]
          Length = 603

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 97/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 240 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 293

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 294 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 351

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   P   ++A  L  + + + ++   Y       
Sbjct: 352 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEY---IYQR 408

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDV ++K+LR +L CKSFKW++                      E+ 
Sbjct: 409 RPEYRHLS----AGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 464

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
           N  +G+C D+  K   +  P+ L  C +  G   W        +   +IR  +       
Sbjct: 465 NVGTGLCADT--KHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKF 522

Query: 262 CLDYAG--GDVILYPCHGSKGNQYFEY 286
           C D       V LY CH  KGNQ ++Y
Sbjct: 523 CFDAISHTSPVTLYDCHSMKGNQLWKY 549


>gi|311275140|ref|XP_003134592.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5-like
           [Sus scrofa]
          Length = 446

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 73/223 (32%), Positives = 105/223 (47%), Gaps = 37/223 (16%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV K WL+PLLD + ++   VV P++  I  D   L + P  +        G F+W+LQ
Sbjct: 234 CEVNKIWLEPLLDAIVKDPKMVVCPIMDVI--DYVTLEYKPSPVVR------GVFNWHLQ 285

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   E         P+ +P M GGLF+I + +F ++G YD G ++WGGENLELS 
Sbjct: 286 FEWDRVFSYEMDGPDGPTRPIRSPAMVGGLFAIHRHYFNEIGQYDKGMNLWGGENLELSL 345

Query: 124 KFNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGF--------DIW 175
           +  W               +    P    G   I+K +F   G               +W
Sbjct: 346 RI-WMC-----------GGQLFLLPCSRVG--HINKPYFTNQGEIKKAMAYNNLRIVHVW 391

Query: 176 GGENLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE 211
             E  E  F  +       +G+V+ R ELR+ LGCKSF+WYL+
Sbjct: 392 LDEYKEQFFLQNPRLKSLAYGNVSERVELRKRLGCKSFQWYLD 434


>gi|18314429|gb|AAH22021.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 5 [Homo sapiens]
 gi|51105933|gb|EAL24517.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 15 [Homo sapiens]
 gi|119574364|gb|EAW53979.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 5, isoform CRA_c
           [Homo sapiens]
 gi|123979772|gb|ABM81715.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 5 [synthetic
           construct]
 gi|123994539|gb|ABM84871.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 5 [synthetic
           construct]
          Length = 443

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 73/220 (33%), Positives = 104/220 (47%), Gaps = 31/220 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL+PLL  +A++   VV PLI  I D T E +  P           G FDWNLQ
Sbjct: 230 CEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLEYKPSP--------LVRGTFDWNLQ 281

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   E    + + +P+ +P M+GG+F+I + +F ++G YD   D WG ENLELS 
Sbjct: 282 FKWDNVFSYEMDGPEGSTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGRENLELSL 341

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +          IP   R  H +  +     T+   +             Y     +W  E
Sbjct: 342 RIWMCGGQLFIIP-CSRVGHISKKQTGKPSTIISAM----------THNYLRLVHVWLDE 390

Query: 179 NLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE 211
             E  F          +G++  R ELR+ LGCKSF+WYL+
Sbjct: 391 YKEQFFLRKPGLKYVTYGNIRERVELRKRLGCKSFQWYLD 430


>gi|345321967|ref|XP_001514624.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           protein 2 [Ornithorhynchus anatinus]
          Length = 484

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 84/299 (28%), Positives = 134/299 (44%), Gaps = 37/299 (12%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE  + WL+PLL  +A N + VV+P++  I   TF+          S     G FDW L 
Sbjct: 114 CECHRGWLEPLLSRIASNRNRVVTPILDVIDWKTFQY-------FHSEDLQQGVFDWKLD 166

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F+W  +PE++RK  ++   P+ +P + GG+ ++D+ +F+  G YDS   +WGGENLELS 
Sbjct: 167 FHWELLPEQKRKVRQSPISPIRSPVVPGGVMAMDRHYFQNTGAYDSLMTLWGGENLELSI 226

Query: 124 KF-----NWHAIP-ERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGG 177
           +      +   +P  R    ++N A     P     L +  +     LG++   F     
Sbjct: 227 RVWLCGGSVEVLPCSRVGHVYRNQASDT-LPNQEAILRNKIRIAETWLGSFKEIFYQHSP 285

Query: 178 ENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL------------------EVSNDWSGM 219
           E   L  K +  D + R +L+R LGC++F W+L                  ++ +   G 
Sbjct: 286 EAFSLR-KVEKPDCSERLQLQRRLGCRTFHWFLSNIYPELYPSERRPGFSGKLFSTRVGF 344

Query: 220 CIDSACKPTDMHKPVGLYPCHKQGGNQFWMMSKHGEIR---RDEACLDYAGGDVILYPC 275
           C+D   K       + L PC     +Q    +   EIR   + + C D     +IL  C
Sbjct: 345 CVDGGSKGKIPGSSITLLPC-SDSQHQHLEYTSRKEIRSGTKLQLCFDVREEQLILQNC 402


>gi|417411867|gb|JAA52354.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Desmodus rotundus]
          Length = 599

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 96/327 (29%), Positives = 140/327 (42%), Gaps = 61/327 (18%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CE    WL PLLD +ARN   +V P+I  I  D F         T +     G FDW + 
Sbjct: 236 CEANVNWLPPLLDRIARNRKTIVCPMIDVIDHDDFRYE------TQAGDAMRGAFDWEMY 289

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           +    IP   +K   + ++P  +P MAGGLF++D+ +F +LG YD G +IWGGE  E+SF
Sbjct: 290 YKRIPIPPELQK--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISF 347

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTP---TMAGGLFSIDKAFFEKLGTYDSGFDIW 175
           K          IP            P   P   ++A  L  + + + ++   +       
Sbjct: 348 KVWMCGGRMEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEFAEH---IYQR 404

Query: 176 GGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL----------------------EVS 213
             E   LS     GDV ++K+LR +L CKSFKW++                      E+ 
Sbjct: 405 RPEYRHLS----AGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIR 460

Query: 214 NDWSGMCIDSACKPTDMHKPVGLYPCHKQGGNQFW------MMSKHGEIRRDEA------ 261
           N  +G+C D+  K   +  P+ L  C +  G   W        +   +IR  +       
Sbjct: 461 NVGTGLCADT--KHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPGDPQHTKKF 518

Query: 262 CLDYA--GGDVILYPCHGSKGNQYFEY 286
           C D       V LY CH  KGNQ ++Y
Sbjct: 519 CFDAVSHSSPVTLYDCHSMKGNQLWKY 545


>gi|194749276|ref|XP_001957065.1| GF24250 [Drosophila ananassae]
 gi|190624347|gb|EDV39871.1| GF24250 [Drosophila ananassae]
          Length = 662

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 100/344 (29%), Positives = 141/344 (40%), Gaps = 93/344 (27%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
            E    WL PLL+ +A N    V P I  I    F+ R       +  +   G FDW   
Sbjct: 294 VEANYNWLPPLLEPIALNKRTAVCPFIDVIDHSNFQYR-------AQDEGARGAFDWEFY 346

Query: 64  FN-WHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 122
           +     +PE      K+ A+P  +P MAGGLF+I   FF +LG YD G DIWGGE  ELS
Sbjct: 347 YKRLRLLPED----LKHPADPFKSPVMAGGLFAISAEFFWELGGYDEGLDIWGGEQYELS 402

Query: 123 FKF-----------------------NWHAIPERERKRHKN--AAEPVWTPTMAGGLFSI 157
           FK                        N +  P +    H+N      VW       L+S 
Sbjct: 403 FKIWMCGGQMYDAPCSRIGHIYRGPRNHNPSPRKGDYLHRNYKRVAEVWMDEYKNYLYSH 462

Query: 158 DKAFFEKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYLE-VSNDW 216
               +E++                     D GD+T++K +R  L CKSF+W++E V+ D 
Sbjct: 463 GDGIYERV---------------------DAGDLTAQKAIRTKLKCKSFRWFMEEVAFDL 501

Query: 217 ----------------------SGMCIDSACKPTDMHKPVGLYPCHKQ----GGNQFWMM 250
                                   +C+D+  +    H  +G++ C         +QFW +
Sbjct: 502 MKNYPPVDPPNYAMGAIQSVGNPQLCLDTMGRKK--HNRMGMFACADDLKVPQKSQFWEL 559

Query: 251 SKHGEIR--RDEACLDY----AGGDVILYPCHGSKGNQYFEYDY 288
           S   ++R  R + CLD     A   V L+ CHG  GNQY+ YDY
Sbjct: 560 SWKRDLRQRRKKECLDVQIWEANAPVWLWDCHGQGGNQYWYYDY 603


>gi|158289989|ref|XP_311577.4| AGAP010367-PA [Anopheles gambiae str. PEST]
 gi|157018424|gb|EAA07231.4| AGAP010367-PA [Anopheles gambiae str. PEST]
          Length = 587

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 99/340 (29%), Positives = 139/340 (40%), Gaps = 97/340 (28%)

Query: 10  WLQPLLDVLARNSSHVVSPLIANICDDTFEL--RFPPGRLTSSYKFFIGGFDWNLQFNWH 67
           WL PLL+ +A N    V PLI  I D TF++  +   GR         G FDW   +   
Sbjct: 224 WLPPLLEPIAENPKTCVCPLIDVIDDQTFDVHPQDEGGR---------GLFDWTFHYKRV 274

Query: 68  AIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKF-- 125
            I   +R    +  EP  +P MAGGLF+I   FF +LG YD   DIWG E  E+SFK   
Sbjct: 275 VIKNEDRI---SPTEPFPSPVMAGGLFAIGADFFWELGGYDEELDIWGAEQYEISFKIWQ 331

Query: 126 ------------------NWHAIPERER-----KRHKNAAEPVWTPTMAGGLFSIDKAFF 162
                              +   P   +     + HK  AE +W       ++  D    
Sbjct: 332 CGGRMLDAPCSRFGHIYRTYSPFPNSRKYDFITRNHKRVAE-IWMDEYKQYIYDRDP--- 387

Query: 163 EKLGTYDSGFDIWGGENLELSFKGDFGDVTSRKELRRNLGCKSFKWYL-EVSNDW----- 216
                             E   K D GD++  K +R  L CK FKW+L EV+ +      
Sbjct: 388 ------------------ERYAKTDAGDMSKMKTIREKLMCKPFKWFLQEVAPEIIELYP 429

Query: 217 -----------------SGMCIDSACKPTDMHKPVGLYPCHKQ-----GGNQFWMMSKHG 254
                            S +CID+  +     +P+GLYPC          NQ+++ S H 
Sbjct: 430 PVEPEPYASGSIQSVADSSLCIDTMQR--GRGEPIGLYPCSNSLIEPTNHNQYFVHSWHR 487

Query: 255 EIRRD--EACLDY----AGGDVILYPCHGSKGNQYFEYDY 288
           +I+    E C D      G  V ++ CH  +GNQ+F+YD+
Sbjct: 488 DIQHKYGEGCFDVPQSKPGSPVTIFTCHMHQGNQFFQYDH 527


>gi|281485547|ref|NP_660335.2| putative polypeptide N-acetylgalactosaminyltransferase-like protein
           5 [Homo sapiens]
 gi|322510123|sp|Q7Z4T8.3|GLTL5_HUMAN RecName: Full=Putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5;
           AltName: Full=Polypeptide GalNAc transferase 15;
           Short=GalNAc-T15; Short=pp-GaNTase 15; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 15;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 15
          Length = 443

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 73/220 (33%), Positives = 104/220 (47%), Gaps = 31/220 (14%)

Query: 4   CEVQKRWLQPLLDVLARNSSHVVSPLIANICDDTFELRFPPGRLTSSYKFFIGGFDWNLQ 63
           CEV + WL+PLL  +A++   VV PLI  I D T E +  P           G FDWNLQ
Sbjct: 230 CEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLEYKPSP--------LVRGTFDWNLQ 281

Query: 64  FNWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSF 123
           F W  +   E    + + +P+ +P M+GG+F+I + +F ++G YD   D WG ENLELS 
Sbjct: 282 FKWDNVFSYEMDGPEGSTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGRENLELSL 341

Query: 124 KF-----NWHAIPERERKRHKNAAEPVWTPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGE 178
           +          IP   R  H +  +     T+   +             Y     +W  E
Sbjct: 342 RIWMCGGQLFIIP-CSRVGHISKKQTGKPSTIISAM----------THNYLRLVHVWLDE 390

Query: 179 NLELSFKGD-------FGDVTSRKELRRNLGCKSFKWYLE 211
             E  F          +G++  R ELR+ LGCKSF+WYL+
Sbjct: 391 YKEQFFLRKPGLKYVTYGNIRERVELRKRLGCKSFQWYLD 430


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.140    0.474 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,339,044,611
Number of Sequences: 23463169
Number of extensions: 241110088
Number of successful extensions: 415141
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1891
Number of HSP's successfully gapped in prelim test: 158
Number of HSP's that attempted gapping in prelim test: 404195
Number of HSP's gapped (non-prelim): 5918
length of query: 290
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 149
effective length of database: 9,050,888,538
effective search space: 1348582392162
effective search space used: 1348582392162
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.9 bits)