BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy11642
         (372 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|157106440|ref|XP_001649323.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108879843|gb|EAT44068.1| AAEL004538-PA [Aedes aegypti]
          Length = 596

 Score =  580 bits (1494), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 272/363 (74%), Positives = 309/363 (85%)

Query: 10  LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
           LGN EP     ++GPGEGGKAY LPE  +     +  EYGMN+  S+ IS DRTI D R+
Sbjct: 75  LGNFEPKEVDRRDGPGEGGKAYILPEDQQNRASDAEMEYGMNIVVSDTISLDRTIRDTRL 134

Query: 70  EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
           EECK+WDYP +LP  SVI+VFHNEGFS LMRTVHS++ R+P   L EIILVDDFS K DL
Sbjct: 135 EECKHWDYPHNLPTTSVIIVFHNEGFSVLMRTVHSVLNRSPKHVLHEIILVDDFSDKEDL 194

Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
            +KLE+YI+RF+GKV+LIRN EREGLIRTRSRGAKE+ GEVIV+LDAHCEV  NWLPPLL
Sbjct: 195 KEKLENYIERFDGKVKLIRNVEREGLIRTRSRGAKEATGEVIVYLDAHCEVNTNWLPPLL 254

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
           APIY DR +MTVPVIDGID++T+E+R VY   HHYRGIFEWGMLYKENE+P RE K+RK+
Sbjct: 255 APIYRDRTVMTVPVIDGIDHKTFEYRPVYADGHHYRGIFEWGMLYKENEVPRREQKRRKH 314

Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
           +SEPYKSPTHAGGLFA++R FFLE+G YDPGLLVWGGENFELSFKIW CGGSIEWVPCSR
Sbjct: 315 DSEPYKSPTHAGGLFAINREFFLEIGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPCSR 374

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHVYR FMPYNFGKLA++ KGPLIT NYKRVIETWFDE++K YFYTREPLA FLDMGDI
Sbjct: 375 VGHVYRGFMPYNFGKLANKKKGPLITINYKRVIETWFDEQYKEYFYTREPLARFLDMGDI 434

Query: 370 SEQ 372
           SEQ
Sbjct: 435 SEQ 437


>gi|193683588|ref|XP_001951150.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Acyrthosiphon
           pisum]
          Length = 588

 Score =  576 bits (1485), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 272/371 (73%), Positives = 310/371 (83%), Gaps = 1/371 (0%)

Query: 2   PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
           P+++ D   GN E      K GPGE GKA+H+P         SL EYGMNM  S+ IS +
Sbjct: 58  PIYR-DQIFGNFEYSTSTNKPGPGEKGKAHHVPSDRENEALQSLSEYGMNMACSDDISLN 116

Query: 62  RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
           R+IPD R EECKYW YP  LP+ SVI+VFHNEG+SSL+RTVHSI+ RTP Q+LEEI+LVD
Sbjct: 117 RSIPDHREEECKYWTYPEQLPRTSVIIVFHNEGWSSLLRTVHSILNRTPPQFLEEILLVD 176

Query: 122 DFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 181
           DFSSK +L +KLE YI++FNGKVRLIRN+EREGLIRTRS+GA  +RGEVI+FLDAHCEVG
Sbjct: 177 DFSSKENLKKKLEYYIEKFNGKVRLIRNSEREGLIRTRSKGASNARGEVILFLDAHCEVG 236

Query: 182 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 241
            NWLPPL+API  DRKIMTVPVIDGID+ TWE+R VYE DH +RGIFEWGMLYKE E+P 
Sbjct: 237 YNWLPPLIAPIARDRKIMTVPVIDGIDHNTWEYRPVYEKDHLFRGIFEWGMLYKEIEIPA 296

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           +E +KR Y SEPYKSPTHAGGLFA+DR +FLELG YDPGLLVWGGENFELSFKIW CGGS
Sbjct: 297 QEERKRIYKSEPYKSPTHAGGLFAIDRNYFLELGAYDPGLLVWGGENFELSFKIWQCGGS 356

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           IEWVPCSR+GHVYR FMPYNFG+L  +VKGPLITYNYKRVIETWFD KHK +FYTREPLA
Sbjct: 357 IEWVPCSRVGHVYRGFMPYNFGELGKKVKGPLITYNYKRVIETWFDNKHKEFFYTREPLA 416

Query: 362 MFLDMGDISEQ 372
            +LDMGDIS+Q
Sbjct: 417 RYLDMGDISKQ 427


>gi|91081797|ref|XP_973938.1| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
           castaneum]
 gi|270006291|gb|EFA02739.1| hypothetical protein TcasGA2_TC008465 [Tribolium castaneum]
          Length = 583

 Score =  568 bits (1465), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 267/372 (71%), Positives = 310/372 (83%), Gaps = 2/372 (0%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
           RP   +D  LGN EP      EGPGEGGK +HL +  +   D S  EYGMN+  S+ IS 
Sbjct: 55  RPKLVSD--LGNFEPRDSQEHEGPGEGGKPHHLRQDQQNDADQSESEYGMNVACSDEISL 112

Query: 61  DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
           DRTI D R+ ECK+W+YP +LP  SVI+VFHNEG+S L+RTVHS+I R+P + L+E++LV
Sbjct: 113 DRTILDTRLSECKHWNYPENLPSTSVIIVFHNEGWSVLLRTVHSVINRSPPKILKEVLLV 172

Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           DDFS K +L  +LE YI+RFNGKVRLIRN +REGLIRTRSRGAKE+ GEVIVFLDAHCEV
Sbjct: 173 DDFSDKENLKTRLETYIERFNGKVRLIRNAQREGLIRTRSRGAKEATGEVIVFLDAHCEV 232

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
             NWLPPLLAPIY DR +MTVPVIDGID++T+E+R VY  D H+RGIFEWGMLYKENE+P
Sbjct: 233 NTNWLPPLLAPIYRDRSVMTVPVIDGIDHKTFEYRPVYGEDRHFRGIFEWGMLYKENEVP 292

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
           ++E   RK+NSEPYKSPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 293 QKELNTRKHNSEPYKSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 352

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
           SIEWVPCSR+GHVYRSFMPYNFGKLA + KGPLIT NYKRVIETWFD+K+K +FYTREP+
Sbjct: 353 SIEWVPCSRVGHVYRSFMPYNFGKLAQKKKGPLITINYKRVIETWFDDKYKEFFYTREPM 412

Query: 361 AMFLDMGDISEQ 372
           A FLDMGDISEQ
Sbjct: 413 ARFLDMGDISEQ 424


>gi|158289457|ref|XP_311182.4| AGAP000656-PA [Anopheles gambiae str. PEST]
 gi|157018524|gb|EAA06901.4| AGAP000656-PA [Anopheles gambiae str. PEST]
          Length = 598

 Score =  568 bits (1464), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 267/363 (73%), Positives = 305/363 (84%)

Query: 10  LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
           LGN EP  +P  +GPGEGGKAY LPE  +     +  EYGMN+  S+ IS DRTI D R+
Sbjct: 77  LGNFEPADKPMVDGPGEGGKAYVLPEDQQNRATDAEMEYGMNIVVSDAISLDRTIKDTRL 136

Query: 70  EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
           EECK+WDYP  LP+ SV++VFHNEGFS LMRTVHS++ R+P   L EIILVDD+S K DL
Sbjct: 137 EECKHWDYPYHLPRTSVVIVFHNEGFSVLMRTVHSVLNRSPKHLLHEIILVDDYSDKEDL 196

Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
             KLE YI+RF+G VRLIRN+EREGLIRTRSRGAKE+ GEVIV+LDAHCEV  NWLPPLL
Sbjct: 197 KGKLERYIERFDGMVRLIRNSEREGLIRTRSRGAKEATGEVIVYLDAHCEVNTNWLPPLL 256

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
           API+ DR +MTVP+IDGID++T+E+R VY   HHYRGIFEWGMLYKENE+P RE K+RK+
Sbjct: 257 APIHRDRTVMTVPIIDGIDHKTFEYRPVYADGHHYRGIFEWGMLYKENEVPRREQKRRKH 316

Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
           +SEPY+SPTHAGGLFA++R FFLELG YD GLLVWGGENFELSFKIW CGGSIEWVPCSR
Sbjct: 317 DSEPYRSPTHAGGLFAINRKFFLELGAYDSGLLVWGGENFELSFKIWQCGGSIEWVPCSR 376

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHVYR FMPYNFGKLA++ KGPLIT NYKRVIETWFD  +K YFYTREPLA FLDMGDI
Sbjct: 377 VGHVYRGFMPYNFGKLANKKKGPLITINYKRVIETWFDGPYKEYFYTREPLARFLDMGDI 436

Query: 370 SEQ 372
           SEQ
Sbjct: 437 SEQ 439


>gi|312383497|gb|EFR28562.1| hypothetical protein AND_03374 [Anopheles darlingi]
          Length = 874

 Score =  564 bits (1454), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 264/363 (72%), Positives = 301/363 (82%)

Query: 10  LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
           LGN EP     +EGPGEGG+AY LPE  +     +  EYGMN+  S+ IS DRTI D R+
Sbjct: 75  LGNFEPHEPTVREGPGEGGRAYVLPEDQQNQATDAEMEYGMNIVVSDAISLDRTIRDTRL 134

Query: 70  EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
           EECK+WDYP  LPK SVI+VFHNEGFS LMRTVHS++ R+P   L EIILVDD+S K DL
Sbjct: 135 EECKHWDYPYHLPKTSVIIVFHNEGFSVLMRTVHSVLNRSPKHLLHEIILVDDYSDKEDL 194

Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
             KLE YI+RF   V+LIRN+EREGLIRTRSRGA E+ GEVIV+LDAHCEV  NWLPPLL
Sbjct: 195 RGKLERYIERFGSLVKLIRNSEREGLIRTRSRGAHEATGEVIVYLDAHCEVNTNWLPPLL 254

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
           API+ DR +MTVP+IDGID++T+E+R VY   HHYRGIFEWGMLYKENE+P RE K+RK+
Sbjct: 255 APIHRDRTVMTVPIIDGIDHKTFEYRPVYADGHHYRGIFEWGMLYKENEVPRREQKRRKH 314

Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
           +SEPY+SPTHAGGLFA++R FFL+LG YD GLLVWGGENFELSFKIW CGGSIEWVPCSR
Sbjct: 315 DSEPYRSPTHAGGLFAINRKFFLDLGAYDSGLLVWGGENFELSFKIWQCGGSIEWVPCSR 374

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFDE +K YFYTREPLA +LDMGDI
Sbjct: 375 VGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDEPYKEYFYTREPLAQYLDMGDI 434

Query: 370 SEQ 372
           SEQ
Sbjct: 435 SEQ 437


>gi|195039904|ref|XP_001990971.1| GH12336 [Drosophila grimshawi]
 gi|193900729|gb|EDV99595.1| GH12336 [Drosophila grimshawi]
          Length = 591

 Score =  561 bits (1446), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 265/372 (71%), Positives = 305/372 (81%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
           R V K    LGN EP     + GPGE G AY LP   +   DAS  EYGMN+  S+ IS 
Sbjct: 61  REVPKLVEGLGNFEPKDLKPRTGPGENGDAYTLPPEKKNVADASEMEYGMNIACSDDISM 120

Query: 61  DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
            R++ + R+EECK+WDYP DLP  SVI+VFHNEGFS LMRTVHS+I R+P   L EIILV
Sbjct: 121 HRSVRETRLEECKHWDYPYDLPPTSVIIVFHNEGFSVLMRTVHSVIDRSPKHMLHEIILV 180

Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           DDFS K +L  KL+DY+Q+FNG V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDFSDKENLRSKLDDYVQQFNGLVKIIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
            LNWLPPLLAPIY DR +MTVP+IDGID++T+E+R VY  D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NLNWLPPLLAPIYRDRTVMTVPIIDGIDHKTFEYRPVYGSDNHFRGIFEWGMLYKENEVP 300

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
            RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 301 RREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 360

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
           SIEWVPCSR+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+ HK +FYTREPL
Sbjct: 361 SIEWVPCSRVGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEFFYTREPL 420

Query: 361 AMFLDMGDISEQ 372
           A +LDMGDISEQ
Sbjct: 421 ARYLDMGDISEQ 432


>gi|195447414|ref|XP_002071203.1| GK25256 [Drosophila willistoni]
 gi|194167288|gb|EDW82189.1| GK25256 [Drosophila willistoni]
          Length = 587

 Score =  557 bits (1436), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 264/372 (70%), Positives = 305/372 (81%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
           R V K    LGN EP     + GPGE G A+ L    + A DAS  EYGMN+  S+ IS 
Sbjct: 57  REVPKLIEGLGNFEPKDLKPRSGPGENGDAHVLNANKKNAADASEMEYGMNIACSDDISM 116

Query: 61  DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
            R++ D R+EECK+WDYP DLP  SVI+VFHNEGFS LMRTVHS+I R+P   L EIILV
Sbjct: 117 HRSVRDTRLEECKHWDYPYDLPPTSVIIVFHNEGFSVLMRTVHSVIDRSPKHMLHEIILV 176

Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           DDFS K +L  KL++YI +F+G V++IRN EREGLIRTRSRGAKE+ GEVIVFLDAHCEV
Sbjct: 177 DDFSDKENLKAKLDEYILQFDGLVKIIRNKEREGLIRTRSRGAKEATGEVIVFLDAHCEV 236

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
            LNWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY  D+H+RGIFEWGMLYKENE+P
Sbjct: 237 NLNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 296

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
            RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 297 RREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 356

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
           SIEWVPCSR+GHVYR FMPYNFGKLA++ KGPLIT NYKRVIETWFDE HK YFYTREPL
Sbjct: 357 SIEWVPCSRVGHVYRGFMPYNFGKLANKKKGPLITINYKRVIETWFDETHKEYFYTREPL 416

Query: 361 AMFLDMGDISEQ 372
           A +LDMGDI+EQ
Sbjct: 417 ARYLDMGDITEQ 428


>gi|195400935|ref|XP_002059071.1| GJ15190 [Drosophila virilis]
 gi|194141723|gb|EDW58140.1| GJ15190 [Drosophila virilis]
          Length = 591

 Score =  555 bits (1429), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 262/372 (70%), Positives = 302/372 (81%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
           R V K    LGN EP     + GPGE G A+ L    +   DAS  EYGMN+  S+ IS 
Sbjct: 61  REVPKLVEGLGNFEPKDLKPRNGPGENGDAHTLSPDKKNVADASEMEYGMNIACSDEISM 120

Query: 61  DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
            R++ D R+EECK+WDYP DLP  SVI+VFHNEGFS LMRTVHS+I R+P   L EIILV
Sbjct: 121 HRSVRDTRLEECKHWDYPYDLPPTSVIIVFHNEGFSVLMRTVHSVIDRSPKHMLHEIILV 180

Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           DDFS K +L  KL+DY+ +F G VR+IRNTEREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDFSDKENLRTKLDDYVLQFKGLVRIIRNTEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
            LNWLPPLLAPIY DR +MTVP+IDGID++++E+R VY  D H+RGIFEWGMLYKENE+P
Sbjct: 241 NLNWLPPLLAPIYRDRTVMTVPIIDGIDHKSFEYRPVYGSDTHFRGIFEWGMLYKENEVP 300

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
            RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 301 RREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 360

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
           SIEWVPCSR+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+ HK +FYTREPL
Sbjct: 361 SIEWVPCSRVGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEFFYTREPL 420

Query: 361 AMFLDMGDISEQ 372
           A +LDMGDI+EQ
Sbjct: 421 ARYLDMGDITEQ 432


>gi|195481361|ref|XP_002101619.1| GE15519 [Drosophila yakuba]
 gi|194189143|gb|EDX02727.1| GE15519 [Drosophila yakuba]
          Length = 591

 Score =  553 bits (1426), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 259/372 (69%), Positives = 303/372 (81%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
           R V K    LGN EP     + GPGE G+A+ L    +   DAS  EYGMN+  S+ IS 
Sbjct: 61  REVPKLVDGLGNFEPKDVKPRSGPGENGEAHSLSPEKKHMSDASEMEYGMNIACSDEISM 120

Query: 61  DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
            R++ D R+EEC++WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P   L EIILV
Sbjct: 121 HRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPTHMLHEIILV 180

Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           DDFS K +L  +L++Y+Q+F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDFSDKENLRSQLDEYVQQFKGLVKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
             NWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY  D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
            RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 301 RREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 360

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
           SIEWVPCSR+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+ HK YFYTREPL
Sbjct: 361 SIEWVPCSRVGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEYFYTREPL 420

Query: 361 AMFLDMGDISEQ 372
           A +LDMGDISEQ
Sbjct: 421 ARYLDMGDISEQ 432


>gi|194766810|ref|XP_001965517.1| GF22410 [Drosophila ananassae]
 gi|190619508|gb|EDV35032.1| GF22410 [Drosophila ananassae]
          Length = 591

 Score =  552 bits (1423), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 256/363 (70%), Positives = 301/363 (82%)

Query: 10  LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
           LGN EP     + GPGE G+A++L +  +   DAS  EYGMN+  S+ IS  RT+ D R+
Sbjct: 70  LGNFEPKDLKPRTGPGENGEAHNLSKDKKNKADASEMEYGMNIACSDEISMHRTVKDTRL 129

Query: 70  EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
           EEC++WDYP DLPK SVI+VFHNEGFS LMRTVHS+I R+P+  L EIILVDDFS K +L
Sbjct: 130 EECRHWDYPYDLPKTSVIIVFHNEGFSVLMRTVHSVIDRSPSHILHEIILVDDFSDKENL 189

Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
             +L+ Y+++F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV LNWL PLL
Sbjct: 190 GNQLDKYVEQFKGLVKVIRNKEREGLIRTRSRGATEATGEVIVFLDAHCEVNLNWLAPLL 249

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
           APIY DR +MTVP+IDGID++ +E+R VY  + H+RGIFEWGMLYKENE+P RE ++R +
Sbjct: 250 APIYRDRTVMTVPIIDGIDHKNFEYRPVYGTETHFRGIFEWGMLYKENEVPRREQRRRSH 309

Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
           NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGGSIEWVPCSR
Sbjct: 310 NSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPCSR 369

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+ HK YFYTREPLA +LDMGDI
Sbjct: 370 VGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEYFYTREPLARYLDMGDI 429

Query: 370 SEQ 372
           SEQ
Sbjct: 430 SEQ 432


>gi|125980684|ref|XP_001354365.1| GA19561 [Drosophila pseudoobscura pseudoobscura]
 gi|54642673|gb|EAL31418.1| GA19561 [Drosophila pseudoobscura pseudoobscura]
          Length = 591

 Score =  551 bits (1420), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 260/372 (69%), Positives = 302/372 (81%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
           R V K    LGN EP     + GPGE G+A+ L    +   D S  EYGMN+  SN IS 
Sbjct: 61  REVPKLVEGLGNFEPKDLKPRSGPGENGEAHTLSPDKKNVADDSEMEYGMNIACSNDISM 120

Query: 61  DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
            R++ D R+EECK+WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P   L EIILV
Sbjct: 121 HRSVRDTRLEECKHWDYPYDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPKHMLHEIILV 180

Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           DD+S K DL   L++Y ++FNG V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDYSDKEDLRSHLDEYSKQFNGLVKIIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
            LNWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY  D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NLNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
            RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 301 RREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 360

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
           SIEWVPCSR+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+ HK YFYTREPL
Sbjct: 361 SIEWVPCSRVGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEYFYTREPL 420

Query: 361 AMFLDMGDISEQ 372
           A +LDMGDI+EQ
Sbjct: 421 ARYLDMGDITEQ 432


>gi|194892500|ref|XP_001977673.1| GG18114 [Drosophila erecta]
 gi|190649322|gb|EDV46600.1| GG18114 [Drosophila erecta]
          Length = 591

 Score =  551 bits (1419), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 258/372 (69%), Positives = 302/372 (81%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
           R V K    LGN EP     + GPGE G+A+ L    +   DAS  EYGMN+  S+ IS 
Sbjct: 61  REVPKLVDGLGNFEPKDVKPRSGPGENGEAHSLSPDKKHMSDASEMEYGMNIACSDEISM 120

Query: 61  DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
            R++ D R+EEC++WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P   L EIILV
Sbjct: 121 HRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPTHMLHEIILV 180

Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           DDFS K +L  +L++Y+ +F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDFSDKENLRSQLDEYVLQFKGLVKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
             NWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY  D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
            RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 301 RREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 360

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
           SIEWVPCSR+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+ HK YFYTREPL
Sbjct: 361 SIEWVPCSRVGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEYFYTREPL 420

Query: 361 AMFLDMGDISEQ 372
           A +LDMGDISEQ
Sbjct: 421 ARYLDMGDISEQ 432


>gi|24643052|ref|NP_573301.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2, isoform A
           [Drosophila melanogaster]
 gi|24643054|ref|NP_728178.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2, isoform B
           [Drosophila melanogaster]
 gi|51316019|sp|Q8MV48.2|GALT7_DROME RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 7;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 7; Short=pp-GaNTase 7;
           AltName: Full=dGalNAc-T2
 gi|7293476|gb|AAF48851.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2, isoform A
           [Drosophila melanogaster]
 gi|22832507|gb|AAN09470.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2, isoform B
           [Drosophila melanogaster]
 gi|34043004|gb|AAQ56704.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           [Drosophila melanogaster]
 gi|54650858|gb|AAV37008.1| LD01328p [Drosophila melanogaster]
 gi|220950352|gb|ACL87719.1| GalNAc-T2-PA [synthetic construct]
          Length = 591

 Score =  551 bits (1419), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 258/372 (69%), Positives = 302/372 (81%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
           R V K    LGN EP     + GPGE G+A+ L    +   DAS  EYGMN+  S+ IS 
Sbjct: 61  REVPKLVDGLGNFEPKDVKPRSGPGENGEAHSLSPDKKHMSDASEMEYGMNIACSDEISM 120

Query: 61  DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
            R++ D R+EEC++WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P   L EIILV
Sbjct: 121 HRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPTHMLHEIILV 180

Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           DDFS K +L  +L++Y+ +F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDFSDKENLRSQLDEYVLQFKGLVKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
             NWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY  D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
            RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 301 RREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 360

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
           SIEWVPCSR+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+ HK YFYTREPL
Sbjct: 361 SIEWVPCSRVGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEYFYTREPL 420

Query: 361 AMFLDMGDISEQ 372
           A +LDMGDISEQ
Sbjct: 421 ARYLDMGDISEQ 432


>gi|195345467|ref|XP_002039290.1| GM22807 [Drosophila sechellia]
 gi|194134516|gb|EDW56032.1| GM22807 [Drosophila sechellia]
          Length = 591

 Score =  550 bits (1416), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 257/372 (69%), Positives = 302/372 (81%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
           R V K    LGN EP     + GPGE G+A+ L    +   DAS  EYGMN+  S+ IS 
Sbjct: 61  REVPKLVDGLGNFEPKDVKPRSGPGENGEAHSLSPDKKHMSDASEMEYGMNIACSDEISM 120

Query: 61  DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
            R++ D R+EEC++WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P   L EIILV
Sbjct: 121 HRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPTHMLHEIILV 180

Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           DDFS K +L  +L++Y+ +F G V++IRN +REGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDFSDKENLRSQLDEYVLQFKGLVKVIRNKQREGLIRTRSRGAMEATGEVIVFLDAHCEV 240

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
             NWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY  D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
            RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 301 RREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 360

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
           SIEWVPCSR+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+ HK YFYTREPL
Sbjct: 361 SIEWVPCSRVGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEYFYTREPL 420

Query: 361 AMFLDMGDISEQ 372
           A +LDMGDISEQ
Sbjct: 421 ARYLDMGDISEQ 432


>gi|321473823|gb|EFX84789.1| hypothetical protein DAPPUDRAFT_209135 [Daphnia pulex]
          Length = 521

 Score =  548 bits (1413), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 251/363 (69%), Positives = 304/363 (83%)

Query: 10  LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
           +GN EPP+E  + GPGEGGK + L    R     S+ E+GMNM  S+ IS  RTI D R 
Sbjct: 1   MGNFEPPIEAPRSGPGEGGKPHTLLPDQRNEASQSISEFGMNMVVSDEISLSRTISDTRT 60

Query: 70  EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
            EC++W YP DLPKASV++VFHNEG+S+L+RTV S+I R+P Q+LEE++LVDDFS KA L
Sbjct: 61  PECQHWSYPEDLPKASVVIVFHNEGWSTLLRTVQSVIDRSPPQFLEEVLLVDDFSEKAHL 120

Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
            +KLED+I+R++GKVRLIRN EREGLIRTR+RGA+E+RGEV++FLDAHCEVGLNWLPPLL
Sbjct: 121 KRKLEDFIERYDGKVRLIRNKEREGLIRTRTRGAEEARGEVVLFLDAHCEVGLNWLPPLL 180

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
            PIY DR  MTVP+IDGID++ +E+R VY+ + ++RG+FEWGMLYKENE+PEREA+ R Y
Sbjct: 181 YPIYLDRTTMTVPLIDGIDHENFEYRPVYQGETNFRGVFEWGMLYKENEVPEREAQSRTY 240

Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
           NSEPYK+PTHAGGLFA++RA+FLE+G YDPGLLVWGGENFELSFKIW CGG I WVPCSR
Sbjct: 241 NSEPYKAPTHAGGLFAINRAYFLEIGAYDPGLLVWGGENFELSFKIWQCGGKILWVPCSR 300

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHVYR FMPY FGKLA   KG LIT NYKRVIE WFD+K+K +FYTREP A FLDMG+I
Sbjct: 301 VGHVYRGFMPYTFGKLAANKKGSLITINYKRVIEVWFDDKYKEFFYTREPTARFLDMGNI 360

Query: 370 SEQ 372
           ++Q
Sbjct: 361 TQQ 363


>gi|21552985|gb|AAM62412.1|AF493067_1 UDP-N-acetylgalactosamine: polypeptide
           N-acetylgalactosaminyltransferase 2 [Drosophila
           melanogaster]
          Length = 591

 Score =  548 bits (1412), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 257/372 (69%), Positives = 301/372 (80%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
           R V K    LGN EP     + G GE G+A+ L    +   DAS  EYGMN+  S+ IS 
Sbjct: 61  REVPKLVDGLGNFEPKDVKPRSGSGENGEAHSLSPDKKHMSDASEMEYGMNIACSDEISM 120

Query: 61  DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
            R++ D R+EEC++WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P   L EIILV
Sbjct: 121 HRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPTHMLHEIILV 180

Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           DDFS K +L  +L++Y+ +F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDFSDKENLRSQLDEYVLQFKGLVKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
             NWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY  D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
            RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 301 RREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 360

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
           SIEWVPCSR+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+ HK YFYTREPL
Sbjct: 361 SIEWVPCSRVGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEYFYTREPL 420

Query: 361 AMFLDMGDISEQ 372
           A +LDMGDISEQ
Sbjct: 421 ARYLDMGDISEQ 432


>gi|383860243|ref|XP_003705600.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Megachile
           rotundata]
          Length = 581

 Score =  544 bits (1401), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 260/371 (70%), Positives = 301/371 (81%), Gaps = 2/371 (0%)

Query: 2   PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
           PV   D  LGN E    P + GPGEGGK + L +  +     S  EYGMNM  S+ IS D
Sbjct: 54  PVLVKD--LGNFELQHVPIRTGPGEGGKPHILRDDQQNDVQQSESEYGMNMVCSDEISLD 111

Query: 62  RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
           R++PD RM ECK+W+YP  LP+ SVI+VFHNEG+S LMRTVHS+I RTP Q+LEEI+LVD
Sbjct: 112 RSVPDTRMTECKHWNYPEVLPRTSVIIVFHNEGWSVLMRTVHSVINRTPPQFLEEILLVD 171

Query: 122 DFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 181
           D+S K +L  +LE YI+++ GKV+LIRN EREGLIRTRSRGA+E++GEVIVFLDAHCEV 
Sbjct: 172 DYSDKDNLKGELESYIEQWEGKVKLIRNYEREGLIRTRSRGAREAKGEVIVFLDAHCEVN 231

Query: 182 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 241
           +NWLPPLLAPI  DR +MTVP+IDGID++T+E+R VY+  H YRGIFEWGMLYKENELP 
Sbjct: 232 VNWLPPLLAPIAVDRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPA 291

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           RE K R YNS PYKSPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGS
Sbjct: 292 REQKTRPYNSMPYKSPTHAGGLFAINREYFLSLGGYDEGLLVWGGENFELSFKIWQCGGS 351

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           I WVPCS +GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFD+K+K +FYTREPLA
Sbjct: 352 ILWVPCSHVGHVYRGFMPYNFGKLAQKKKGPLITINYKRVIETWFDDKYKEFFYTREPLA 411

Query: 362 MFLDMGDISEQ 372
             LD GDISEQ
Sbjct: 412 QLLDHGDISEQ 422


>gi|156537099|ref|XP_001602659.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Nasonia
           vitripennis]
          Length = 583

 Score =  540 bits (1390), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 254/363 (69%), Positives = 302/363 (83%), Gaps = 1/363 (0%)

Query: 10  LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
           LGN EP ++ ++ GPGE GK + L +  +     S   YGMN+  S+ IS DR++PD R 
Sbjct: 63  LGNFEPEIQ-HRTGPGEEGKPHILRDDQQNDVQESETAYGMNIVCSDEISLDRSVPDTRP 121

Query: 70  EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
           +ECK+W+Y  +LPK SVI+VFHNEG+S LMRTVHS++ RTP QYLEEI+LVDDFS K +L
Sbjct: 122 DECKHWNYSKNLPKTSVIIVFHNEGWSVLMRTVHSVLNRTPPQYLEEILLVDDFSDKENL 181

Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
             +LE YI+++  KVRL+RN EREGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPPLL
Sbjct: 182 KGELESYIEQWGPKVRLLRNKEREGLIRTRSRGAREAKGEVIVFLDAHCEVNVNWLPPLL 241

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
           +PI  D K+MTVP+IDGID++T+E+R VY+  H YRGIFEWGMLYKENELP+REAK RK+
Sbjct: 242 SPIAEDNKVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPQREAKTRKH 301

Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
           NSEPY+SPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGSI WVPCS 
Sbjct: 302 NSEPYRSPTHAGGLFAINREYFLSLGGYDEGLLVWGGENFELSFKIWQCGGSILWVPCSH 361

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWFDEKHK +FYTREPLA  LD GDI
Sbjct: 362 VGHVYRGFMPYNFGKLAQKKKGPLITINYKRVIETWFDEKHKEFFYTREPLARLLDHGDI 421

Query: 370 SEQ 372
           +EQ
Sbjct: 422 TEQ 424


>gi|307212076|gb|EFN87959.1| N-acetylgalactosaminyltransferase 7 [Harpegnathos saltator]
          Length = 563

 Score =  539 bits (1389), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 252/363 (69%), Positives = 299/363 (82%)

Query: 10  LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
           LGN EP    +K GPGEGGK + L E  +     S  +YGMNM  S+ IS  R+IPD R 
Sbjct: 42  LGNFEPRDTSFKAGPGEGGKPHILREDQQNDVQQSESDYGMNMVCSDEISMSRSIPDTRP 101

Query: 70  EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
            ECK+W+YP +LP+ SVI+VFHNEG+S L+RT+HS+I RTP+++LEE++LVDDFS K +L
Sbjct: 102 AECKHWNYPEELPRTSVIIVFHNEGWSVLLRTIHSVINRTPSKFLEEVLLVDDFSDKENL 161

Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
              L+ YI+++ GKVRL+RN ER+GLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPPLL
Sbjct: 162 KDDLDSYIEQWGGKVRLLRNYERQGLIRTRSRGAREAKGEVIVFLDAHCEVNVNWLPPLL 221

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
           API  +R +MTVPVIDGID++T+E+R VY+  H YRGIFEWGMLYKENELP REAK R +
Sbjct: 222 APIAENRNVMTVPVIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPRREAKTRSH 281

Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
           +S PYKSPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGG+I WVPCS 
Sbjct: 282 DSMPYKSPTHAGGLFAINRQYFLSLGGYDEGLLVWGGENFELSFKIWQCGGTILWVPCSH 341

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHVYR FMPY FGKLA + KGPLIT NYKRVIETWFDEKHK +FYTREPLA FLD GDI
Sbjct: 342 VGHVYRGFMPYTFGKLAQKKKGPLITINYKRVIETWFDEKHKEFFYTREPLARFLDHGDI 401

Query: 370 SEQ 372
           SEQ
Sbjct: 402 SEQ 404


>gi|242016390|ref|XP_002428804.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
 gi|212513501|gb|EEB16066.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
          Length = 579

 Score =  538 bits (1385), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 256/364 (70%), Positives = 300/364 (82%), Gaps = 3/364 (0%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
           GN EP  E   +GPGEGG+ + L E  +     SL +YGMN+  S+ IS DR+IPD R+ 
Sbjct: 58  GNFEPREEEISDGPGEGGRPHKLREDQQNDASQSLADYGMNIACSDEISLDRSIPDTRLP 117

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
           ECK W YP DLPKASVI+VFHNEG+S+L+RTVHS+I RTP Q+LEE+++VDDFS K +L 
Sbjct: 118 ECKRWMYPEDLPKASVIIVFHNEGWSTLLRTVHSVINRTPPQFLEEVLMVDDFSDKENL- 176

Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
           ++L+DYI RFNGKVRLIRN+ER+GLIRTRSRGA E+RGEVIVFLDAHCEV  NWLPPLLA
Sbjct: 177 KELDDYILRFNGKVRLIRNSERQGLIRTRSRGAVEARGEVIVFLDAHCEVNKNWLPPLLA 236

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER--EAKKRK 248
           PIY DR  +TVPVIDGID+ T+E++ VY   HHYRGIFEWGMLYKE EL ++   A  RK
Sbjct: 237 PIYYDRTTLTVPVIDGIDHDTFEYKPVYVDGHHYRGIFEWGMLYKEIELTDQFANADNRK 296

Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
           YNSEPY+SPTHAGGLFA+DR +FL++G YD GLLVWGGENFELSFK+W CGG I WVPCS
Sbjct: 297 YNSEPYRSPTHAGGLFAIDRNYFLDIGAYDDGLLVWGGENFELSFKVWQCGGRILWVPCS 356

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
           R+GHVYRSFMPY FG LA   KGPLIT NYKRVIETWFDEK+K +FYTREPLA +L+MGD
Sbjct: 357 RVGHVYRSFMPYTFGSLAKNKKGPLITINYKRVIETWFDEKYKEFFYTREPLARYLNMGD 416

Query: 369 ISEQ 372
           IS+Q
Sbjct: 417 ISKQ 420


>gi|307169192|gb|EFN62008.1| N-acetylgalactosaminyltransferase 7 [Camponotus floridanus]
          Length = 580

 Score =  535 bits (1377), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 255/371 (68%), Positives = 302/371 (81%), Gaps = 2/371 (0%)

Query: 2   PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
           PVF  +G LGN EP   P + GPGE GK + L +        S  +YGMNM  S+ IS  
Sbjct: 53  PVF-VEG-LGNYEPRDVPVRSGPGENGKPHILRDDQLNDVQQSESDYGMNMVCSDEISLS 110

Query: 62  RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
           R+IPD R  ECK+W+YP +LP+ SVI+VFHNEG+S L+RT+HS+I RTP+++LEEI+LVD
Sbjct: 111 RSIPDTRPAECKHWNYPEELPRTSVIIVFHNEGWSVLLRTIHSVINRTPSKFLEEILLVD 170

Query: 122 DFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 181
           DFS K +L   L+ YI+++NGKVRL+RN ER+GLIRTRSRGA++++GEVIVFLDAHCEV 
Sbjct: 171 DFSDKENLKGDLDSYIEQWNGKVRLLRNYERQGLIRTRSRGARDAKGEVIVFLDAHCEVN 230

Query: 182 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 241
           +NWLPPLLAPI  +R +MTVPVIDGID++T+E+R VY+  H YRGIFEWGMLYKENELP 
Sbjct: 231 VNWLPPLLAPIAENRNVMTVPVIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPR 290

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           REAK R Y+S PY+SPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGS
Sbjct: 291 REAKTRAYDSMPYRSPTHAGGLFAINRQYFLSLGGYDEGLLVWGGENFELSFKIWQCGGS 350

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           I WVPCS +GHVYR FMPY FGKLA + KGPLIT NYKRVIETWFDEKHK +FYTREPLA
Sbjct: 351 ILWVPCSHVGHVYRGFMPYTFGKLAQKKKGPLITINYKRVIETWFDEKHKEFFYTREPLA 410

Query: 362 MFLDMGDISEQ 372
             LD GDISEQ
Sbjct: 411 RLLDHGDISEQ 421


>gi|340718182|ref|XP_003397550.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Bombus
           terrestris]
          Length = 581

 Score =  534 bits (1376), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 254/364 (69%), Positives = 295/364 (81%)

Query: 9   KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           +LGN E      + GPGEGGK Y L +  +     S  +YGMNM  S+ IS DR+I D R
Sbjct: 59  ELGNFELKHVSIRSGPGEGGKPYILRDDQQNDVQQSEIDYGMNMVCSDEISLDRSILDTR 118

Query: 69  MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
           M ECK+W+YP  LP+ SVI+VFHNEG+S L+RTVHS+I RTP Q+LEEI+LVDDFS K +
Sbjct: 119 MPECKHWNYPEVLPRTSVIIVFHNEGWSVLLRTVHSVINRTPPQFLEEILLVDDFSDKDN 178

Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
           L   L+ YI+R+ GKV+LIRN +REGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPPL
Sbjct: 179 LKGDLDSYIERWEGKVKLIRNDKREGLIRTRSRGAREAKGEVIVFLDAHCEVNVNWLPPL 238

Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
           LAPI  DR +MTVP+IDGID++T+E+R VY+  H YRGIFEWGMLYKENELP RE K R 
Sbjct: 239 LAPIAVDRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPAREKKSRP 298

Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
           YNS PYKSPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGSI WVPCS
Sbjct: 299 YNSMPYKSPTHAGGLFAINREYFLSLGGYDDGLLVWGGENFELSFKIWQCGGSILWVPCS 358

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
            +GHVYR FMPY FGKLA + KGPLIT NYKRV+ETWFD+KHK +FYTREPLA  LD GD
Sbjct: 359 HVGHVYRGFMPYTFGKLAQKKKGPLITINYKRVVETWFDDKHKEFFYTREPLAQLLDHGD 418

Query: 369 ISEQ 372
           ISEQ
Sbjct: 419 ISEQ 422


>gi|443298648|gb|AGC81884.1| N-acetylgalactosaminyltransferase, partial [Bombyx mori]
          Length = 499

 Score =  533 bits (1372), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 244/326 (74%), Positives = 283/326 (86%)

Query: 47  EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
           +YGMN+  SN I+ +R+IPD R++ECKYW YP DL K SVI+VFHNEGFS LMRTVHS+I
Sbjct: 15  KYGMNIAASNDIAMNRSIPDTRLDECKYWHYPEDLAKTSVIIVFHNEGFSVLMRTVHSVI 74

Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
            RTPAQ+L E++LVDDFS K DL + L++YI+R+NGKVRL+RN +REGLIRTRSRGA+E+
Sbjct: 75  NRTPAQFLHEVVLVDDFSDKDDLKENLDNYIKRWNGKVRLVRNVQREGLIRTRSRGAQEA 134

Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
            G+VIVFLDAHCEV +NWLPPLLAPIY D + MTVPVIDGIDY T+E+R VY+   +YRG
Sbjct: 135 TGDVIVFLDAHCEVNVNWLPPLLAPIYRDYRTMTVPVIDGIDYNTFEYRPVYQHGTNYRG 194

Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
           IFEWGMLYKENE+P+REA   K+ SEPYKSPTHAGGLFA++R +FLE+G YDPGLLVWGG
Sbjct: 195 IFEWGMLYKENEVPDREAHLHKHKSEPYKSPTHAGGLFAINRRYFLEIGAYDPGLLVWGG 254

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
           ENFELSFKIW CGGSIEWVPCSR+GHVYR+FMPY FG LA   KG LIT NYKRVIETWF
Sbjct: 255 ENFELSFKIWQCGGSIEWVPCSRVGHVYRAFMPYTFGNLAKNRKGSLITINYKRVIETWF 314

Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
           DE+HK YFYTREP+A FLDMGDISEQ
Sbjct: 315 DEEHKEYFYTREPMARFLDMGDISEQ 340


>gi|380013105|ref|XP_003690610.1| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase
           7-like [Apis florea]
          Length = 581

 Score =  533 bits (1372), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 254/364 (69%), Positives = 296/364 (81%)

Query: 9   KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           +LGN EP     + GPGE GK + L +  +     S  +YGMNM  S+ IS DR IPD R
Sbjct: 59  ELGNFEPKRISMRNGPGEKGKPHILRDDQQNDVQQSEIDYGMNMVCSDEISLDRLIPDTR 118

Query: 69  MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
           M ECK+W+YP  LP+ SVI+VFHNEG+S LMRTVHS+I RTP Q+LEEI+LVDDFS K +
Sbjct: 119 MPECKHWNYPEMLPRTSVIIVFHNEGWSVLMRTVHSVINRTPPQFLEEILLVDDFSDKDN 178

Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
           L  +LE YI+R+  KV+LIRN +REGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPPL
Sbjct: 179 LKGELESYIERWGDKVKLIRNDKREGLIRTRSRGAREAKGEVIVFLDAHCEVNINWLPPL 238

Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
           LAPI +DR +MTVP+IDGID++T+E+R VY+  H YRGIFEWGMLYKENELP RE K R 
Sbjct: 239 LAPIAADRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPAREKKIRP 298

Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
           YNS PYKSPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGSI WVPCS
Sbjct: 299 YNSMPYKSPTHAGGLFAINREYFLSLGGYDDGLLVWGGENFELSFKIWQCGGSILWVPCS 358

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
            +GHVYR FMPY FGKLA + KGPLIT NYKRV+ETWFD+K+K +FYTREPLA  LD GD
Sbjct: 359 HVGHVYRGFMPYTFGKLAXKKKGPLITINYKRVVETWFDDKYKEFFYTREPLAQLLDHGD 418

Query: 369 ISEQ 372
           ISEQ
Sbjct: 419 ISEQ 422


>gi|427797631|gb|JAA64267.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Rhipicephalus pulchellus]
          Length = 641

 Score =  532 bits (1371), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 250/373 (67%), Positives = 303/373 (81%), Gaps = 3/373 (0%)

Query: 2   PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
           PVF+ D  LGN EP     ++GPGEGG AYH+PE  R +   S  +YGMN+  S+HIS +
Sbjct: 112 PVFRKDRTLGNFEPKSHETRKGPGEGGVAYHVPERDRNSAADSNMQYGMNVVASDHISPN 171

Query: 62  RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
           R++PD+R+EECKYWDYP DLP  SV++VFHNEG S LMRTVHS+I R+P Q+L+E++LVD
Sbjct: 172 RSVPDMRLEECKYWDYPEDLPTTSVVVVFHNEGLSVLMRTVHSVINRSPRQFLKEVVLVD 231

Query: 122 DFSSKADLDQKLEDYIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
           DFS K +L  +LE YI      G VRL+RN+EREGLIR+RS GA++S G+V++FLDAHCE
Sbjct: 232 DFSDKENLKGELETYIAHNFPRGLVRLLRNSEREGLIRSRSYGAEQSHGDVVLFLDAHCE 291

Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 239
           VG+NWLPPLLAPI ++R+ MTVPVIDGID  T+E+R VY    H+RGIFEWGMLYKE E+
Sbjct: 292 VGINWLPPLLAPIRANRRAMTVPVIDGIDKDTFEYRPVYHGRQHFRGIFEWGMLYKEIEI 351

Query: 240 PEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCG 299
           P+ E K+RKY+SEPYKSPTHAGGLFA++R +FLELGGYDPGLLVWGGENFELSFKIW CG
Sbjct: 352 PDEEIKRRKYHSEPYKSPTHAGGLFAINRKYFLELGGYDPGLLVWGGENFELSFKIWQCG 411

Query: 300 GSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
           G I WVPCSR+GHVYR FMPY+FGKLA + KGPLIT NYKRV+E W DE +K YFYTREP
Sbjct: 412 GMIYWVPCSRVGHVYRGFMPYSFGKLAQKRKGPLITVNYKRVVEVWMDE-YKEYFYTREP 470

Query: 360 LAMFLDMGDISEQ 372
           LA + D GD+ +Q
Sbjct: 471 LATYYDAGDLKQQ 483


>gi|427797629|gb|JAA64266.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Rhipicephalus pulchellus]
          Length = 641

 Score =  532 bits (1371), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 250/373 (67%), Positives = 303/373 (81%), Gaps = 3/373 (0%)

Query: 2   PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
           PVF+ D  LGN EP     ++GPGEGG AYH+PE  R +   S  +YGMN+  S+HIS +
Sbjct: 112 PVFRKDRTLGNFEPKSHETRKGPGEGGVAYHVPERDRNSAADSNMQYGMNVVASDHISPN 171

Query: 62  RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
           R++PD+R+EECKYWDYP DLP  SV++VFHNEG S LMRTVHS+I R+P Q+L+E++LVD
Sbjct: 172 RSVPDMRLEECKYWDYPEDLPTTSVVVVFHNEGLSVLMRTVHSVINRSPRQFLKEVVLVD 231

Query: 122 DFSSKADLDQKLEDYIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
           DFS K +L  +LE YI      G VRL+RN+EREGLIR+RS GA++S G+V++FLDAHCE
Sbjct: 232 DFSDKENLKGELETYIAHNFPRGLVRLLRNSEREGLIRSRSYGAEQSHGDVVLFLDAHCE 291

Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 239
           VG+NWLPPLLAPI ++R+ MTVPVIDGID  T+E+R VY    H+RGIFEWGMLYKE E+
Sbjct: 292 VGINWLPPLLAPIRANRRAMTVPVIDGIDKDTFEYRPVYHGRQHFRGIFEWGMLYKEIEI 351

Query: 240 PEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCG 299
           P+ E K+RKY+SEPYKSPTHAGGLFA++R +FLELGGYDPGLLVWGGENFELSFKIW CG
Sbjct: 352 PDEEIKRRKYHSEPYKSPTHAGGLFAINRKYFLELGGYDPGLLVWGGENFELSFKIWQCG 411

Query: 300 GSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
           G I WVPCSR+GHVYR FMPY+FGKLA + KGPLIT NYKRV+E W DE +K YFYTREP
Sbjct: 412 GMIYWVPCSRVGHVYRGFMPYSFGKLAQKRKGPLITVNYKRVVEVWMDE-YKEYFYTREP 470

Query: 360 LAMFLDMGDISEQ 372
           LA + D GD+ +Q
Sbjct: 471 LATYYDAGDLKQQ 483


>gi|328781461|ref|XP_395266.4| PREDICTED: n-acetylgalactosaminyltransferase 7 [Apis mellifera]
          Length = 581

 Score =  531 bits (1369), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 252/364 (69%), Positives = 296/364 (81%)

Query: 9   KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           +LGN EP     + GPGE GK + L +  +     S  +YGMN+  S+ IS DR IPD R
Sbjct: 59  ELGNFEPKHISMRNGPGEKGKPHILRDDQQNDVQQSEIDYGMNIVCSDEISLDRLIPDTR 118

Query: 69  MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
           M ECK+W+YP  LP+ SVI+VFHNEG+S LMRTVHS+I RTP Q+LEEI+LVDDFS K +
Sbjct: 119 MPECKHWNYPEILPRTSVIIVFHNEGWSVLMRTVHSVINRTPPQFLEEILLVDDFSDKDN 178

Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
           L  +LE YI+++  KV+LIRN +REGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPPL
Sbjct: 179 LKGELESYIEQWGDKVKLIRNDKREGLIRTRSRGAREAKGEVIVFLDAHCEVNINWLPPL 238

Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
           LAPI +DR +MTVP+IDGID++T+E+R VY+  H YRGIFEWGMLYKENELP RE K R 
Sbjct: 239 LAPIAADRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPAREKKSRS 298

Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
           YNS PYKSPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGSI WVPCS
Sbjct: 299 YNSMPYKSPTHAGGLFAINREYFLSLGGYDDGLLVWGGENFELSFKIWQCGGSILWVPCS 358

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
            +GHVYR FMPY FGKLA + KGPLIT NYKRV+ETWFD+K+K +FYTREPLA  LD GD
Sbjct: 359 HVGHVYRGFMPYTFGKLAQKKKGPLITINYKRVVETWFDDKYKEFFYTREPLAQLLDHGD 418

Query: 369 ISEQ 372
           ISEQ
Sbjct: 419 ISEQ 422


>gi|332023194|gb|EGI63450.1| N-acetylgalactosaminyltransferase 7 [Acromyrmex echinatior]
          Length = 614

 Score =  530 bits (1366), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 249/363 (68%), Positives = 294/363 (80%)

Query: 10  LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
           LGN E    P + GPGEGGK + L +        S  +YGMNM  S+ IS  R+IPD R+
Sbjct: 93  LGNYELRDVPVRSGPGEGGKPHILKDDQLNDVQQSESDYGMNMVCSDEISLSRSIPDTRL 152

Query: 70  EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
            +CK+W+YP +LP+ SVI+VFHNEG+S L+RT+ S+I RTP++ LEEI+LVDDFS K +L
Sbjct: 153 AQCKHWNYPEELPRTSVIIVFHNEGWSVLLRTIQSVIDRTPSKLLEEILLVDDFSDKENL 212

Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
              L+ YI+++ GKVRL+RN ER+GLIRTRSRGA+E+RGEVIVFLDAHCEV +NWLPPLL
Sbjct: 213 KSDLDSYIEQWGGKVRLLRNHERQGLIRTRSRGAREARGEVIVFLDAHCEVNVNWLPPLL 272

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
           API  +R +MTVPVIDGID++T+E+R VY+  H YRGIFEWGMLYKENELP REAK R +
Sbjct: 273 APIAENRNVMTVPVIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPRREAKTRAH 332

Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
           +S PY+SPTHAGGLFA+ R +FL LGGYD GLLVWGGENFELSFKIW CGGSI WVPCS 
Sbjct: 333 DSMPYRSPTHAGGLFAISRQYFLSLGGYDEGLLVWGGENFELSFKIWQCGGSILWVPCSH 392

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHVYR FMPY FGKLA + KGPLIT NYKRVIETWFDEKHK +FYTREPLA  LD GDI
Sbjct: 393 VGHVYRGFMPYTFGKLAQKKKGPLITINYKRVIETWFDEKHKEFFYTREPLARLLDHGDI 452

Query: 370 SEQ 372
           SEQ
Sbjct: 453 SEQ 455


>gi|350400167|ref|XP_003485756.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Bombus
           impatiens]
          Length = 582

 Score =  530 bits (1364), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 252/364 (69%), Positives = 295/364 (81%)

Query: 9   KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           +LGN E      + GPGEGGK Y L +  +     S  +YGMNM  S+ IS DR+I D R
Sbjct: 60  ELGNFELKHVSIRSGPGEGGKPYILRDDQQNDVQQSEIDYGMNMVCSDEISLDRSILDTR 119

Query: 69  MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
           M ECK+W+YP  LP+ SVI+VFHNEG+S L+RTVHS+I RTP Q+LEEI+LVDDFS K +
Sbjct: 120 MPECKHWNYPEVLPRTSVIIVFHNEGWSVLLRTVHSVINRTPPQFLEEILLVDDFSDKDN 179

Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
           L   L+ YI+R+ GKV+LIRN +REGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPPL
Sbjct: 180 LKGDLDSYIERWEGKVKLIRNDKREGLIRTRSRGAREAKGEVIVFLDAHCEVNVNWLPPL 239

Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
           LAPI  DR +MTVP+IDGID++T+E+R VY+  H YRGIFEWGMLYKENELP RE K R 
Sbjct: 240 LAPIAVDRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPAREKKSRP 299

Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
           YNS PYKSPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGG+I WVPCS
Sbjct: 300 YNSMPYKSPTHAGGLFAINREYFLSLGGYDDGLLVWGGENFELSFKIWQCGGNILWVPCS 359

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
            +GHVYR FMPY FGKLA + KGPLIT NYKRV+ETWFD+K+K +FYTREPLA  LD GD
Sbjct: 360 HVGHVYRGFMPYTFGKLAQKKKGPLITINYKRVVETWFDDKYKEFFYTREPLAQLLDHGD 419

Query: 369 ISEQ 372
           ISEQ
Sbjct: 420 ISEQ 423


>gi|322798640|gb|EFZ20244.1| hypothetical protein SINV_10970 [Solenopsis invicta]
          Length = 580

 Score =  530 bits (1364), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 254/371 (68%), Positives = 300/371 (80%), Gaps = 2/371 (0%)

Query: 2   PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
           PVF  +G LGN EP   P + GPGEGGK + L +        S  +YGMNM  S+ IS  
Sbjct: 53  PVF-VEG-LGNYEPRDIPVRSGPGEGGKPHILRDDQLNDVQQSESDYGMNMVCSDEISLS 110

Query: 62  RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
           R IPD R  ECK+W+YP +LP+ SVI+VFHNEG+S L+RT+ S+I RTP+++LEEI+LVD
Sbjct: 111 RAIPDTRPAECKHWNYPEELPRTSVIIVFHNEGWSVLLRTIQSVIDRTPSKFLEEILLVD 170

Query: 122 DFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 181
           DFS K +L   L+ YI+++ GKVRL+RN ER+GLIRTRSRGA+E++GEVIVFLDAHCEV 
Sbjct: 171 DFSDKENLKGDLDSYIEQWEGKVRLLRNYERQGLIRTRSRGAREAKGEVIVFLDAHCEVN 230

Query: 182 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 241
           +NWLPPLLAPI  +R +MTVPVIDGID++T+E+R VY+  H YRGIFEWGMLYKENELP 
Sbjct: 231 VNWLPPLLAPIAENRNVMTVPVIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPR 290

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           REAK R ++S PY+SPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGS
Sbjct: 291 REAKTRAHDSMPYRSPTHAGGLFAINRQYFLSLGGYDEGLLVWGGENFELSFKIWQCGGS 350

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           I WVPCS +GHVYR FMPY FGKLA + KGPLIT NYKRVIETWFDEKHK +FYTREPLA
Sbjct: 351 ILWVPCSHVGHVYRGFMPYTFGKLAQKKKGPLITINYKRVIETWFDEKHKEFFYTREPLA 410

Query: 362 MFLDMGDISEQ 372
             LD GDISEQ
Sbjct: 411 RLLDHGDISEQ 421


>gi|16198165|gb|AAL13889.1| LD36616p [Drosophila melanogaster]
          Length = 486

 Score =  526 bits (1355), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 240/326 (73%), Positives = 280/326 (85%)

Query: 47  EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
           EYGMN+  S+ IS  R++ D R+EEC++WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I
Sbjct: 2   EYGMNIACSDEISMHRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVI 61

Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
            R+P   L EIILVDDFS K +L  +L++Y+ +F G V++IRN EREGLIRTRSRGA E+
Sbjct: 62  DRSPTHMLHEIILVDDFSDKENLRSQLDEYVLQFKGLVKVIRNKEREGLIRTRSRGAMEA 121

Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
            GEVIVFLDAHCEV  NWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY  D+H+RG
Sbjct: 122 TGEVIVFLDAHCEVNTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRG 181

Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
           IFEWGMLYKENE+P RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGG
Sbjct: 182 IFEWGMLYKENEVPRREQRRRAHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGG 241

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
           ENFELSFKIW CGGSIEWVPCSR+GHVYR FMPYNFGKLA + KGPLIT NYKRVIETWF
Sbjct: 242 ENFELSFKIWQCGGSIEWVPCSRVGHVYRGFMPYNFGKLASKKKGPLITINYKRVIETWF 301

Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
           D+ HK YFYTREPLA +LDMGDISEQ
Sbjct: 302 DDTHKEYFYTREPLARYLDMGDISEQ 327


>gi|357602062|gb|EHJ63261.1| putative n-acetylgalactosaminyltransferase [Danaus plexippus]
          Length = 499

 Score =  521 bits (1342), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 239/329 (72%), Positives = 283/329 (86%)

Query: 44  SLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVH 103
           S  EYGMN+  SN I+ +R+IPD R++ECKYW YP +LP  SVI+VFHNEGFS LMRTVH
Sbjct: 12  SESEYGMNIAASNDIAMNRSIPDTRLDECKYWHYPEELPSTSVIIVFHNEGFSVLMRTVH 71

Query: 104 SIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGA 163
           ++I R+P   L+E+++VDDFS K DL + L++Y++R+ GKVR+IRN+ER+GLIRTRSRGA
Sbjct: 72  TVIDRSPPNILKEVVMVDDFSDKDDLKENLDNYVKRWKGKVRIIRNSERQGLIRTRSRGA 131

Query: 164 KESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHH 223
            E+ GEVIVFLDAHCEV +NWLPPLLAPIY D KIMTVPVIDGID++T+E+R VY    +
Sbjct: 132 MEATGEVIVFLDAHCEVNVNWLPPLLAPIYRDYKIMTVPVIDGIDHKTFEYRPVYSHGIN 191

Query: 224 YRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLV 283
           YRGIFEWGMLYKENE+P+REA   K+ SEPYKSPTHAGGLFA++R +FLE+G YDPGLLV
Sbjct: 192 YRGIFEWGMLYKENEVPDREASLHKHKSEPYKSPTHAGGLFAINRNYFLEIGAYDPGLLV 251

Query: 284 WGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIE 343
           WGGENFELSFKIW CGGSIEWVPCSR+GHVYR+FMPY+FG LA   KG LIT NYKRVIE
Sbjct: 252 WGGENFELSFKIWQCGGSIEWVPCSRVGHVYRAFMPYSFGNLAKNRKGSLITINYKRVIE 311

Query: 344 TWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           TWFDE+HK +FYTREP+A FLDMGDISEQ
Sbjct: 312 TWFDEEHKEFFYTREPMARFLDMGDISEQ 340


>gi|391336074|ref|XP_003742408.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Metaseiulus
           occidentalis]
          Length = 593

 Score =  498 bits (1281), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 245/374 (65%), Positives = 293/374 (78%), Gaps = 8/374 (2%)

Query: 2   PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASL-GEYGMNMETSNHISF 60
           PVF+ D  LGN E  + P K GPGEGG AY +  + R+     L  +YGMNM  SN IS 
Sbjct: 70  PVFR-DDVLGNFEMSM-PKKVGPGEGGAAYVI--SGRSVEQQKLKNQYGMNMVVSNEISP 125

Query: 61  DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
           +RTIPDLR++ECKYW YP DLP  SVI+VFHNEG S LMRTVHS+I R+P Q+L E++LV
Sbjct: 126 NRTIPDLRLDECKYWHYPEDLPGTSVIVVFHNEGLSVLMRTVHSVINRSPRQFLHEVVLV 185

Query: 121 DDFSSKADLDQKLEDYIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 178
           DDFS K +L ++LE+YI R    G VRL+RN  R+GLIR+RS GA+ + GEVI+FLDAHC
Sbjct: 186 DDFSDKLNLREELENYIARNFPKGLVRLVRNKSRQGLIRSRSYGAEVATGEVILFLDAHC 245

Query: 179 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE 238
           EVG NWLPPLLAPI ++ K MTVPVIDGID++ +E+R VY    H+RGIFEWGMLYKE E
Sbjct: 246 EVGANWLPPLLAPIKANPKTMTVPVIDGIDHENFEYRPVYHGKQHFRGIFEWGMLYKEIE 305

Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
           +PE E K+R  +SEPYKSPTHAGGLFAM+R +FLELGGYDPGLLVWGGENFELSFK+W C
Sbjct: 306 IPEEEVKRRTKHSEPYKSPTHAGGLFAMNREYFLELGGYDPGLLVWGGENFELSFKLWQC 365

Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
           GG I WVPCSR+GHVYR FMPY+FG L  + KGPLI  NYKRV+E WFDE +K YFYTRE
Sbjct: 366 GGQILWVPCSRVGHVYRGFMPYSFGDLGKKRKGPLIVINYKRVVEVWFDE-YKEYFYTRE 424

Query: 359 PLAMFLDMGDISEQ 372
           P+A   D G++++Q
Sbjct: 425 PMARDYDAGNLTQQ 438


>gi|241651003|ref|XP_002411252.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
 gi|215503882|gb|EEC13376.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
          Length = 478

 Score =  495 bits (1275), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 244/367 (66%), Positives = 292/367 (79%), Gaps = 5/367 (1%)

Query: 10  LGNLEPPLEPY--KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL 67
           LGN EP +     K  PGEGG  YH P   +     S  EYGMN+  S+HIS +RTIPD+
Sbjct: 1   LGNFEPAVADVVDKRKPGEGGFPYHTPPKLKNNVAHSNMEYGMNVVASDHISPNRTIPDM 60

Query: 68  RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           R++ECKYWDYP DLP  SV++VFHNEG S LMRTVHS+I R+P Q+L+E++LVDD+S K 
Sbjct: 61  RLQECKYWDYPTDLPTTSVVVVFHNEGLSVLMRTVHSVINRSPRQFLKEVVLVDDYSDKE 120

Query: 128 DLDQKLEDYIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
           +L  +LE YI R    G VRL+RN ER+GLIR+RS GA++S G+V++FLDAHCEVG+NWL
Sbjct: 121 NLKGELETYIARNFPVGLVRLLRNEERQGLIRSRSYGAEQSVGDVVLFLDAHCEVGINWL 180

Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
           PPLLAPI ++R  MTVPVIDGID  T+E+R VY    H+RGIFEWGMLYKE E+PE E K
Sbjct: 181 PPLLAPIRANRYTMTVPVIDGIDKDTFEYRPVYHGGQHFRGIFEWGMLYKEIEIPEEEIK 240

Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
           +RKY+SEPYKSPTHAGGLFA+DR +FL+LGGYDPGLLVWGGENFELSFKIW CGGSI WV
Sbjct: 241 RRKYHSEPYKSPTHAGGLFAIDRKYFLKLGGYDPGLLVWGGENFELSFKIWQCGGSIYWV 300

Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLD 365
           PCSR+GHVYR FMPY+FGKLA + KGP++T NYKRV+E W DE +K YFYTREP+A   D
Sbjct: 301 PCSRVGHVYRGFMPYSFGKLAHKRKGPIVTVNYKRVVEVWMDE-YKEYFYTREPMARHYD 359

Query: 366 MGDISEQ 372
            GD+S Q
Sbjct: 360 PGDLSGQ 366


>gi|195172039|ref|XP_002026809.1| GL27027 [Drosophila persimilis]
 gi|194111748|gb|EDW33791.1| GL27027 [Drosophila persimilis]
          Length = 567

 Score =  490 bits (1261), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 239/372 (64%), Positives = 280/372 (75%), Gaps = 24/372 (6%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
           R V K    LGN EP     + GPGE G+A+ L    +   D S  EYGMN+  SN IS 
Sbjct: 61  REVPKLIEGLGNFEPKDLKPRSGPGENGEAHSLSPDKKNVADDSEMEYGMNIACSNDISM 120

Query: 61  DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
            R++ D R+EECK+WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P   L EIILV
Sbjct: 121 HRSVRDTRLEECKHWDYPYDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPKHMLHEIILV 180

Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           DD+S K DL   L++Y ++FNG V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDYSDKEDLRSHLDEYSKQFNGLVKIIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
            LNWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY  D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NLNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
            RE ++R +NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGG
Sbjct: 301 RREQRRRTHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGG 360

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
           SI                        D+ KGPLIT NYKRVIETWFD+ HK YFYTREPL
Sbjct: 361 SI------------------------DKKKGPLITINYKRVIETWFDDTHKEYFYTREPL 396

Query: 361 AMFLDMGDISEQ 372
           A +LDMGDI+EQ
Sbjct: 397 ARYLDMGDITEQ 408


>gi|324505926|gb|ADY42538.1| N-acetylgalactosaminyltransferase 7 [Ascaris suum]
          Length = 640

 Score =  456 bits (1173), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 226/376 (60%), Positives = 283/376 (75%), Gaps = 16/376 (4%)

Query: 3   VFKADGKLGNLEPPLEPYKEGP-GEGGKAYHL----PEAYRAAGDASLGEYGMNMETSNH 57
           VFK  G+LGN EP  +  + G  GE G+  ++    PE  RA     + E+G N   S+ 
Sbjct: 115 VFKK-GELGNFEPKEKQSRPGKHGEMGEPVNVDLNQPEVQRA-----MNEFGFNTFVSDM 168

Query: 58  ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           IS +R++PD+RM+ECKYW YP DLP ASV++VFHNEG+S L+RTVHS+I R+P   L+EI
Sbjct: 169 ISLNRSVPDVRMDECKYWHYPEDLPTASVVIVFHNEGWSPLLRTVHSVILRSPPNLLKEI 228

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
           +LVDDFS K  L  +L+ YI++FNGKVRL+RN EREGLIRTRS GA+ + G+V++FLDAH
Sbjct: 229 VLVDDFSDKEHLKDRLDRYIEQFNGKVRLVRNNEREGLIRTRSIGAQHAVGDVVIFLDAH 288

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKE 236
           CEV +NWLPPLLAPI  +RK+MTVPVIDGID  TW +R VY   D H+RGIFEWG+LYKE
Sbjct: 289 CEVNINWLPPLLAPIRRNRKVMTVPVIDGIDMHTWSYRRVYGSADRHFRGIFEWGLLYKE 348

Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
            E+ + EA++RKYNSEP++SPTHAGGLFA+D+ +F ELG YDPGL +WGGE +ELSFKIW
Sbjct: 349 TEITKEEARRRKYNSEPFRSPTHAGGLFAIDKKWFEELGYYDPGLQIWGGEQYELSFKIW 408

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
            CGG I +VPCS +GHVYRS MPY FGKL+ +   P+I+ N  RVI+TW DE  K Y+Y 
Sbjct: 409 QCGGGILFVPCSHVGHVYRSHMPYGFGKLSGK---PVISTNMVRVIKTWMDEYEK-YYYI 464

Query: 357 REPLAMFLDMGDISEQ 372
           REP A     GDIS Q
Sbjct: 465 REPSAKHRSPGDISAQ 480


>gi|449664489|ref|XP_002168298.2| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Hydra
           magnipapillata]
          Length = 599

 Score =  444 bits (1142), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 216/376 (57%), Positives = 272/376 (72%), Gaps = 6/376 (1%)

Query: 2   PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
           PVF +D KLGN E   E  K GPGEGGK + L    +   +   G YG N   S+ IS D
Sbjct: 51  PVFLSDNKLGNFEK-YEDVKSGPGEGGKPHRLKPEQKEEEERLKGVYGFNQLVSDEISLD 109

Query: 62  RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
           R +PD+R EECK+W YP DLP +SVI +FHNEG+S+L+R+VHS+I RTPA  L EI+LVD
Sbjct: 110 RVVPDMREEECKHWSYPNDLPSSSVIFIFHNEGWSTLLRSVHSVINRTPAHLLHEIVLVD 169

Query: 122 DFSSKADLDQKLEDYIQR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
           D S    L ++L++ I++  +  KV+L+RN +REGLIR R+ GA  + GEV+VFLDAHCE
Sbjct: 170 DKSELEHLHERLDEEIKKPYYQSKVKLVRNKQREGLIRARNIGAIAATGEVLVFLDAHCE 229

Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 239
           VG NWLPPL+API  D   +T P+IDGI++  +    VY+   H RGIFEWGMLYKE +L
Sbjct: 230 VGGNWLPPLIAPIQEDPTTLTAPIIDGINWDDFSINPVYQKGSHSRGIFEWGMLYKETDL 289

Query: 240 PEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCG 299
           PE+EA+KR Y+SEPY SPTHAGGLFA+ R++F ELG YDPGLL+WGGEN+ELSFK+W CG
Sbjct: 290 PEKEARKRLYHSEPYNSPTHAGGLFAIKRSWFKELGWYDPGLLIWGGENYELSFKLWQCG 349

Query: 300 GSIEWVPCSRIGHVYR--SFMPYNFGKLADRVKG-PLITYNYKRVIETWFDEKHKAYFYT 356
           G   WVPCS + HVYR  S    + G +  +  G PL   NYKR+IE WFD+K+K +FYT
Sbjct: 350 GRSLWVPCSHVSHVYRGHSCSSCHSGDMGRKWSGIPLSLRNYKRLIEVWFDDKYKEFFYT 409

Query: 357 REPLAMFLDMGDISEQ 372
           REPLA F+D GD+SEQ
Sbjct: 410 REPLARFIDTGDVSEQ 425


>gi|312094065|ref|XP_003147897.1| hypothetical protein LOAG_12336 [Loa loa]
          Length = 560

 Score =  444 bits (1142), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 222/376 (59%), Positives = 281/376 (74%), Gaps = 16/376 (4%)

Query: 3   VFKADGKLGNLEPPLEPYKEGP-GEGGKAY----HLPEAYRAAGDASLGEYGMNMETSNH 57
           +FK D ++GN EP    ++ G  GE GK      +LPE  +A     + EYG N   S+ 
Sbjct: 37  IFKMD-EIGNFEPKEIQWQPGNYGEMGKPVFVDKNLPEVKKA-----MREYGFNTYVSDM 90

Query: 58  ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           IS +R+IPD+R++ECKYW YP DLP ASV++ FHNEG++ L+RTVHS++ R+P+Q ++EI
Sbjct: 91  ISLNRSIPDVRLDECKYWHYPEDLPSASVVIAFHNEGWTPLLRTVHSVLLRSPSQLIKEI 150

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
           ILVDDFS K  L  +LE Y+++F GKV+LIRN EREGLIRTRS GAKE+ G+V+VFLDAH
Sbjct: 151 ILVDDFSDKEHLKDRLERYLKQFRGKVKLIRNAEREGLIRTRSIGAKEAVGDVVVFLDAH 210

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP-DHHYRGIFEWGMLYKE 236
           CEV +NWLPPLLAPI  +RK+MTVPVIDGID   W +R VY   D HYRGIFEWG+LYKE
Sbjct: 211 CEVNINWLPPLLAPIRQNRKVMTVPVIDGIDKDDWSYRIVYSSVDKHYRGIFEWGLLYKE 270

Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
            E+P +E  +RK++SEP++SPTHAGGLFA+ + +F ELG YDPGL +WGGE +ELSFKIW
Sbjct: 271 TEIPAQELLRRKHSSEPFRSPTHAGGLFAISKKWFEELGYYDPGLQIWGGEQYELSFKIW 330

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
            CGG I ++PCS +GHVYRS MPY FGKL+ +   P+I+ N  RVI+TW DE  K Y+Y 
Sbjct: 331 QCGGGILFIPCSHVGHVYRSHMPYGFGKLSGK---PVISTNMLRVIKTWMDEYEK-YYYI 386

Query: 357 REPLAMFLDMGDISEQ 372
           REP A     GDIS Q
Sbjct: 387 REPSAKHRLPGDISSQ 402


>gi|393911317|gb|EFO16172.2| hypothetical protein LOAG_12336 [Loa loa]
          Length = 562

 Score =  444 bits (1141), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 222/376 (59%), Positives = 281/376 (74%), Gaps = 16/376 (4%)

Query: 3   VFKADGKLGNLEPPLEPYKEGP-GEGGKAY----HLPEAYRAAGDASLGEYGMNMETSNH 57
           +FK D ++GN EP    ++ G  GE GK      +LPE  +A     + EYG N   S+ 
Sbjct: 39  IFKMD-EIGNFEPKEIQWQPGNYGEMGKPVFVDKNLPEVKKA-----MREYGFNTYVSDM 92

Query: 58  ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           IS +R+IPD+R++ECKYW YP DLP ASV++ FHNEG++ L+RTVHS++ R+P+Q ++EI
Sbjct: 93  ISLNRSIPDVRLDECKYWHYPEDLPSASVVIAFHNEGWTPLLRTVHSVLLRSPSQLIKEI 152

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
           ILVDDFS K  L  +LE Y+++F GKV+LIRN EREGLIRTRS GAKE+ G+V+VFLDAH
Sbjct: 153 ILVDDFSDKEHLKDRLERYLKQFRGKVKLIRNAEREGLIRTRSIGAKEAVGDVVVFLDAH 212

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP-DHHYRGIFEWGMLYKE 236
           CEV +NWLPPLLAPI  +RK+MTVPVIDGID   W +R VY   D HYRGIFEWG+LYKE
Sbjct: 213 CEVNINWLPPLLAPIRQNRKVMTVPVIDGIDKDDWSYRIVYSSVDKHYRGIFEWGLLYKE 272

Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
            E+P +E  +RK++SEP++SPTHAGGLFA+ + +F ELG YDPGL +WGGE +ELSFKIW
Sbjct: 273 TEIPAQELLRRKHSSEPFRSPTHAGGLFAISKKWFEELGYYDPGLQIWGGEQYELSFKIW 332

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
            CGG I ++PCS +GHVYRS MPY FGKL+ +   P+I+ N  RVI+TW DE  K Y+Y 
Sbjct: 333 QCGGGILFIPCSHVGHVYRSHMPYGFGKLSGK---PVISTNMLRVIKTWMDEYEK-YYYI 388

Query: 357 REPLAMFLDMGDISEQ 372
           REP A     GDIS Q
Sbjct: 389 REPSAKHRLPGDISSQ 404


>gi|308506779|ref|XP_003115572.1| CRE-GLY-7 protein [Caenorhabditis remanei]
 gi|308256107|gb|EFP00060.1| CRE-GLY-7 protein [Caenorhabditis remanei]
          Length = 601

 Score =  438 bits (1126), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 213/370 (57%), Positives = 276/370 (74%), Gaps = 9/370 (2%)

Query: 7   DGKLGNLEP--PLEPYKEGPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRT 63
           DG+LGN EP  P  P  + PGE G+   +  E   AAG A+  E+G N   S+ IS +RT
Sbjct: 79  DGELGNYEPKTPEIPSNQ-PGEHGRPVPVTDEEGMAAGRAAEKEFGFNTYVSDLISMNRT 137

Query: 64  IPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 123
           IPD+R +ECK+WDYP +LP  SV++VFHNEG++ L+RTVHS++ R+P + +E I++VDD 
Sbjct: 138 IPDIRPKECKHWDYPENLPTVSVVIVFHNEGWTPLLRTVHSVLLRSPPELIESIVMVDDD 197

Query: 124 SSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLN 183
           S K  L +KL+ Y+ RFNGKV ++R  +REGLI  RS GAK S GEV++FLDAHCEV  N
Sbjct: 198 SDKPHLKEKLDKYVTRFNGKVIVVRTEQREGLINARSIGAKHSTGEVVLFLDAHCEVNTN 257

Query: 184 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELPER 242
           WLPPLLAPI  +RK+MTVPVIDGID  +WE+RSVY  P+ H+ GIFEWG+LYKE ++ ER
Sbjct: 258 WLPPLLAPIKQNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHSGIFEWGLLYKETQITER 317

Query: 243 EAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSI 302
           E+  RK+NS+P++SPTHAGGLFA++R +F ELG YD GL +WGGE +ELSFKIW CGG I
Sbjct: 318 ESAHRKHNSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSFKIWQCGGGI 377

Query: 303 EWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM 362
            +VPCS +GHVYRS MPY FGK + +   P+I+ N  RV++TW D+  K Y+ TREP A 
Sbjct: 378 VFVPCSHVGHVYRSHMPYGFGKFSGK---PVISINMMRVVKTWMDDYSK-YYLTREPQAA 433

Query: 363 FLDMGDISEQ 372
            ++ GDIS Q
Sbjct: 434 HVNPGDISAQ 443


>gi|170593939|ref|XP_001901721.1| glycosyl transferase, group 2 family protein [Brugia malayi]
 gi|158590665|gb|EDP29280.1| glycosyl transferase, group 2 family protein [Brugia malayi]
          Length = 645

 Score =  437 bits (1124), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 214/372 (57%), Positives = 278/372 (74%), Gaps = 8/372 (2%)

Query: 3   VFKADGKLGNLEPPLEPYKEGP-GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
           +FK D ++GN EP     + G  GE G+   + +      +A + EYG N   S+ IS +
Sbjct: 122 IFKLD-EIGNFEPKETQLQPGDYGEMGEPVLIDKTLTEVKEA-MREYGFNTYVSDMISLN 179

Query: 62  RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
           R+IPD+RM+ECKYW YP DLP AS+++ FHNEG++ L+RTVHS++ R+P   ++EII+VD
Sbjct: 180 RSIPDVRMDECKYWHYPEDLPTASIVIAFHNEGWTPLLRTVHSVLLRSPPHLIKEIIMVD 239

Query: 122 DFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 181
           DFS K  L  +L+ Y+++F+GKV+L+RN+EREGLIRTRS GAKE+ G+V++FLDAHCEV 
Sbjct: 240 DFSDKEHLKDRLDVYLKQFDGKVKLVRNSEREGLIRTRSIGAKEAVGDVVIFLDAHCEVN 299

Query: 182 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELP 240
           +NWLPPLLAPI  +RK+MTVPVIDGID   W +R VY   D HYRGIFEWG+LYKE EL 
Sbjct: 300 VNWLPPLLAPIRQNRKVMTVPVIDGIDKNDWSYRIVYGSADKHYRGIFEWGLLYKETELS 359

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
            +E  +RK+NSEP++SPTHAGGLFA+++ +F ELG YDPGL +WGGE +ELSFKIW CGG
Sbjct: 360 SQELLRRKHNSEPFRSPTHAGGLFAINKKWFEELGYYDPGLQIWGGEQYELSFKIWQCGG 419

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
            I +VPCS +GHVYRS MPY FGKL+ +   P+I+ N  RVI+TW DE  K Y+Y REP 
Sbjct: 420 GILFVPCSHVGHVYRSHMPYGFGKLSGK---PVISTNMLRVIKTWMDEYDK-YYYIREPS 475

Query: 361 AMFLDMGDISEQ 372
           A     G+IS Q
Sbjct: 476 ARHRLPGNISSQ 487


>gi|268555252|ref|XP_002635614.1| C. briggsae CBR-GLY-7 protein [Caenorhabditis briggsae]
          Length = 601

 Score =  436 bits (1120), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 211/369 (57%), Positives = 274/369 (74%), Gaps = 7/369 (1%)

Query: 7   DGKLGNLEPPL-EPYKEGPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTI 64
           DG+LGN EP   E     PGE G+   +  E   AAG A+  E+G N   S+ IS +RTI
Sbjct: 79  DGELGNYEPKTAEIPSNQPGEHGRPVPVTDEEGMAAGRAAEKEFGFNTYVSDMISMNRTI 138

Query: 65  PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
           PD+R +ECK+WDYP +LP  SV++VFHNEG++ L+RTVHS++ R+P + +E+I++VDD S
Sbjct: 139 PDIRPKECKHWDYPENLPTVSVVIVFHNEGWTPLLRTVHSVLLRSPPELIEQIVMVDDDS 198

Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
            K  L +KL+ Y+ RFNGKV ++R  +REGLI  RS GAK S GEV++FLDAHCEV  NW
Sbjct: 199 DKPHLKEKLDKYVTRFNGKVIVVRTEQREGLINARSIGAKHSTGEVVLFLDAHCEVNTNW 258

Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELPERE 243
           LPPLLAPI  +RK+MTVPVIDGID  +WE+RSVY  P+ H+ GIFEWG+LYKE ++ ERE
Sbjct: 259 LPPLLAPIKQNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHSGIFEWGLLYKETQITERE 318

Query: 244 AKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIE 303
           +  RK+ S+P++SPTHAGGLFA++R +F ELG YD GL +WGGE +ELSFKIW CGG I 
Sbjct: 319 SGHRKHTSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSFKIWQCGGGIV 378

Query: 304 WVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMF 363
           +VPCS +GHVYRS MPY FGK + +   P+I+ N  RV++TW D+  K Y+ TREP A  
Sbjct: 379 FVPCSHVGHVYRSHMPYGFGKFSGK---PVISINMMRVVKTWMDDYSK-YYLTREPQAAH 434

Query: 364 LDMGDISEQ 372
           ++ GDIS Q
Sbjct: 435 VNPGDISAQ 443


>gi|341881851|gb|EGT37786.1| hypothetical protein CAEBREN_30257 [Caenorhabditis brenneri]
 gi|341887866|gb|EGT43801.1| CBN-GLY-7 protein [Caenorhabditis brenneri]
          Length = 601

 Score =  436 bits (1120), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 210/369 (56%), Positives = 275/369 (74%), Gaps = 7/369 (1%)

Query: 7   DGKLGNLEPPL-EPYKEGPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTI 64
           +G+LGN EP + E     PGE G+   +  E   AAG A+  E+G N   S+ IS +RTI
Sbjct: 79  EGELGNYEPKIPEVPSNQPGEHGRPVPVTDEEGMAAGRAAEKEFGFNTYVSDMISMNRTI 138

Query: 65  PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
           PD+R +ECK+WDYP +LP  SV++VFHNEG++ L+RTVHS++ R+P + +E+I++VDD S
Sbjct: 139 PDIRPKECKHWDYPENLPTVSVVIVFHNEGWTPLLRTVHSVLLRSPPELIEQIVMVDDDS 198

Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
            K  L +KL+ Y+ RFNGKV ++R  +REGLI  RS GAK S GEV++FLDAHCEV  NW
Sbjct: 199 DKQHLKEKLDKYVTRFNGKVIVVRTEQREGLINARSIGAKHSTGEVVLFLDAHCEVNTNW 258

Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELPERE 243
           LPPLLAPI  +RK+MTVPVIDGID  +WE+RSVY  P+ H+ GIFEWG+LYKE ++ ERE
Sbjct: 259 LPPLLAPIKRNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHSGIFEWGLLYKETQITERE 318

Query: 244 AKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIE 303
              RK++S+P++SPTHAGGLFA++R +F ELG YD GL +WGGE +ELSFKIW CGG I 
Sbjct: 319 TAHRKHSSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSFKIWQCGGGIV 378

Query: 304 WVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMF 363
           +VPCS +GHVYRS MPY FGK + +   P+I+ N  RV++TW D+  K Y+ TREP A  
Sbjct: 379 FVPCSHVGHVYRSHMPYGFGKFSGK---PVISINMMRVVKTWMDDYEK-YYLTREPQAAH 434

Query: 364 LDMGDISEQ 372
           ++ GDIS Q
Sbjct: 435 VNPGDISAQ 443


>gi|17561826|ref|NP_503512.1| Protein GLY-7 [Caenorhabditis elegans]
 gi|51315810|sp|O61397.1|GALT7_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 7;
           AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 7; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 7; Short=pp-GaNTase 7
 gi|3047203|gb|AAC13677.1| GLY7 [Caenorhabditis elegans]
 gi|373219860|emb|CCD70652.1| Protein GLY-7 [Caenorhabditis elegans]
          Length = 601

 Score =  434 bits (1117), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 212/368 (57%), Positives = 274/368 (74%), Gaps = 9/368 (2%)

Query: 9   KLGNLEP--PLEPYKEGPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
           +LGN EP  P  P  + PGE GK   +  E   AAG A+  E+G N   S+ IS +RTIP
Sbjct: 81  ELGNYEPKEPEIPSNQ-PGEHGKPVPVTDEEGMAAGRAAEKEFGFNTYVSDMISMNRTIP 139

Query: 66  DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
           D+R EECK+WDYP  LP  SV++VFHNEG++ L+RTVHS++ R+P + +E++++VDD S 
Sbjct: 140 DIRPEECKHWDYPEKLPTVSVVVVFHNEGWTPLLRTVHSVLLRSPPELIEQVVMVDDDSD 199

Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
           K  L +KL+ Y+ RFNGKV ++R  +REGLI  RS GAK S GEV++FLDAHCEV  NWL
Sbjct: 200 KPHLKEKLDKYVTRFNGKVIVVRTEQREGLINARSIGAKHSTGEVVLFLDAHCEVNTNWL 259

Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELPEREA 244
           PPLLAPI  +RK+MTVPVIDGID  +WE+RSVY  P+ H+ GIFEWG+LYKE ++ ERE 
Sbjct: 260 PPLLAPIKRNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHSGIFEWGLLYKETQITERET 319

Query: 245 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
             RK+NS+P++SPTHAGGLFA++R +F ELG YD GL +WGGE +ELSFKIW CGG I +
Sbjct: 320 AHRKHNSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSFKIWQCGGGIVF 379

Query: 305 VPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
           VPCS +GHVYRS MPY+FGK + +   P+I+ N  RV++TW D+  K Y+ TREP A  +
Sbjct: 380 VPCSHVGHVYRSHMPYSFGKFSGK---PVISINMMRVVKTWMDDYSK-YYLTREPQATNV 435

Query: 365 DMGDISEQ 372
           + GDIS Q
Sbjct: 436 NPGDISAQ 443


>gi|195130803|ref|XP_002009840.1| GI15586 [Drosophila mojavensis]
 gi|193908290|gb|EDW07157.1| GI15586 [Drosophila mojavensis]
          Length = 595

 Score =  427 bits (1098), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 201/294 (68%), Positives = 238/294 (80%)

Query: 10  LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
           LGN EP     + GPGE G+ + L    +   DAS  EYGMN+  S+ IS  R++ D R+
Sbjct: 70  LGNFEPRDLKPRTGPGENGEGHILSPDKKNVADASEMEYGMNIACSDEISMHRSVRDTRL 129

Query: 70  EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
           EECK+WDYP DLP  SVI+VFHNEGFS LMRTVHS+I R+P   L EIILVDDFS K +L
Sbjct: 130 EECKHWDYPYDLPPTSVIIVFHNEGFSVLMRTVHSVIDRSPKHMLHEIILVDDFSDKENL 189

Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
             KL++Y+ +F G V++IRNTEREGLIRTRSRGA E+ GEVIVFLDAHCEV LNWLPPLL
Sbjct: 190 RSKLDEYVLQFKGLVKIIRNTEREGLIRTRSRGAMEATGEVIVFLDAHCEVNLNWLPPLL 249

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
           APIY DR +MTVP+IDGID++T+E+R VY  D+H+RGIFEWGMLYKENE+P RE ++R +
Sbjct: 250 APIYRDRTVMTVPIIDGIDHKTFEYRPVYGSDNHFRGIFEWGMLYKENEVPRREQRRRAH 309

Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIE 303
           NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGGSIE
Sbjct: 310 NSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIE 363


>gi|443700020|gb|ELT99205.1| hypothetical protein CAPTEDRAFT_172619 [Capitella teleta]
          Length = 336

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 184/304 (60%), Positives = 232/304 (76%), Gaps = 2/304 (0%)

Query: 38  RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSS 97
           + A D S+ E+G NM  S+ IS +RTIPD RMEECKYW YP  LP ASVILVFHNEG+S+
Sbjct: 4   KEAADRSIREFGFNMVASDKISMNRTIPDTRMEECKYWHYPKTLPSASVILVFHNEGWST 63

Query: 98  LMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIR 157
           L+RTVHS+I  +P + L EI++VDDFS K  L  +LEDY+++F+GKV+L RN ER GLI 
Sbjct: 64  LVRTVHSVIDMSPPELLHEIVMVDDFSDKEHLKTRLEDYLKQFHGKVKLYRNKERLGLIG 123

Query: 158 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 217
           TR+ GA+ + G+ IVFLDAHCE   NWLPPLLA I  DR I+ +PVIDGID+  + +  V
Sbjct: 124 TRTLGAQYATGDAIVFLDAHCECNRNWLPPLLARIAYDRTILAIPVIDGIDFDNFRYNPV 183

Query: 218 YEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGY 277
           Y     +RGIFEWG LYKE+++P +   +R++ SE YKSPTHAGGLFA+DR +F ELG Y
Sbjct: 184 YSGRELFRGIFEWGFLYKESKVPGKTLLERQHQSEAYKSPTHAGGLFAIDRKYFFELGAY 243

Query: 278 DPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYN 337
           DPGL +WGGENFELSFKIW CGGS+EWVPCS +GHVYR+ MPY FGK+  ++  P++  N
Sbjct: 244 DPGLQIWGGENFELSFKIWQCGGSVEWVPCSHVGHVYRNSMPYGFGKINPKI--PVVLLN 301

Query: 338 YKRV 341
           Y R+
Sbjct: 302 YMRL 305


>gi|390332219|ref|XP_781199.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like
           [Strongylocentrotus purpuratus]
          Length = 606

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 197/374 (52%), Positives = 258/374 (68%), Gaps = 10/374 (2%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           VF+ DG  G+ EP   P +EGPGEGG A     + +A  D  + EYG N   S+ IS DR
Sbjct: 81  VFR-DGVRGDYEPVNLPVREGPGEGGAAVRTQPSEKAKVDRLIQEYGFNQYVSDQISLDR 139

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
            I DLR ++CK+W YP  LP  SVI+VFHNEG+S+L+RTVHS+  R+P+Q L EIILVDD
Sbjct: 140 NIADLRSQQCKHWHYPETLPTTSVIIVFHNEGWSTLLRTVHSVFNRSPSQLLHEIILVDD 199

Query: 123 FSSKADLDQKLEDYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           FS+K  L ++LEDY+Q  RFNGK++L+RN+ REGLIRTR  GA+ S G+V+++LDAHCEV
Sbjct: 200 FSTKEHLKERLEDYVQEARFNGKLKLVRNSRREGLIRTRIIGARHSTGDVLLWLDAHCEV 259

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
           G+NWLPPLL PI  +R     P+ID ID   +        D   RG F+W + +K   +P
Sbjct: 260 GVNWLPPLLTPIAVNRTTAVCPIIDVIDNMDYRVYPQGTGDQD-RGGFDWSLYWKHLPVP 318

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
           + E  +R++ SEPY+SP  AGGLFAMDR +F ELG YD GL +WGGENFELSFKIWMCGG
Sbjct: 319 QFEKSRRQHASEPYRSPAMAGGLFAMDRKYFFELGAYDEGLEIWGGENFELSFKIWMCGG 378

Query: 301 SIEWVPCSRIGHVYRSF--MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
           S+ WVPCSR+GHVYR    +PY+    +  +   L   N +RV+E WFD+ +K YFY  +
Sbjct: 379 SLLWVPCSRVGHVYRILGKVPYSAPNGSMLI---LSERNLRRVVEVWFDD-YKEYFYRSK 434

Query: 359 PLAMFLDMGDISEQ 372
           P ++ +  G+I +Q
Sbjct: 435 PESLLVSTGNIEKQ 448


>gi|291241093|ref|XP_002740445.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine: polypeptide
           N-acetylgalactosaminyltransferase 7-like [Saccoglossus
           kowalevskii]
          Length = 594

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 198/378 (52%), Positives = 254/378 (67%), Gaps = 16/378 (4%)

Query: 3   VFKADGKLGNLE--PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
           VFK+   LGN E  PP +  + GPGE  KA       +   D S+ EYG N   S+ IS 
Sbjct: 67  VFKSR-VLGNYENLPPSQEGRTGPGEYAKAVKTTPDEQKQVDRSINEYGFNQYVSDKISL 125

Query: 61  DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
           DRTI DLR E+CKYW YP  LP   VI+VFHNEG+S+L+RTVHS+  RTP   L E++LV
Sbjct: 126 DRTIKDLREEQCKYWHYPESLPAVGVIIVFHNEGWSTLLRTVHSLFNRTPPTLLHEVVLV 185

Query: 121 DDFSSKADLDQKLEDYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 178
           DDFS+K  L ++LE+Y++  RF GK++L+RN +REGLIRTR+ GA  S  +V+V+LDAHC
Sbjct: 186 DDFSNKEHLRERLEEYVKEPRFLGKIKLVRNAKREGLIRTRTVGAIHSTADVLVWLDAHC 245

Query: 179 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE 238
           EVG+NWLPPLL+PI  +R  +TVP+ID ID   +  RS    +   RG F+W + +K   
Sbjct: 246 EVGINWLPPLLSPIAQNRTTVTVPIIDVIDNMDYTMRSQGSGELS-RGGFDWSLYWKHLP 304

Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
           + + E +KR  +SEPY+SP  AGGLFAM R +F ELG YDPGL VWGGENFELSFKIW C
Sbjct: 305 MSKEETRKRSLSSEPYRSPAMAGGLFAMARDYFFELGAYDPGLEVWGGENFELSFKIWQC 364

Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY----NYKRVIETWFDEKHKAYF 354
           GGS+ WVPCS +GHVYR       GK+  R     +T     NY+RV+E W D+ +K +F
Sbjct: 365 GGSMLWVPCSHVGHVYRI-----LGKVPYRAPNATMTQWSLRNYRRVVEVWMDD-YKEFF 418

Query: 355 YTREPLAMFLDMGDISEQ 372
           Y  +P +  L  GDIS+Q
Sbjct: 419 YRSKPESQLLHFGDISKQ 436


>gi|66472462|ref|NP_001018477.1| N-acetylgalactosaminyltransferase 7 [Danio rerio]
 gi|63100869|gb|AAH95642.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
           N-acetylgalactosaminyltransferase 7 [Danio rerio]
          Length = 652

 Score =  380 bits (976), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 195/369 (52%), Positives = 258/369 (69%), Gaps = 8/369 (2%)

Query: 8   GKLGNLEPPL-EPY--KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 64
           G LGN EP   EP+  + GPGEG K + L   Y+ A  AS+ E+G NM  S+ IS DRT+
Sbjct: 125 GTLGNFEPKEPEPHGVQGGPGEGSKPFVLGPEYKDAVQASIKEFGFNMVASDMISLDRTV 184

Query: 65  PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
            DLR EECKYW+Y  +L  +SVI+VFHNEG+S+LMRTVHS+IKRTP +YL EI+++DDFS
Sbjct: 185 GDLRHEECKYWNYDENLLTSSVIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIVMIDDFS 244

Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGA-KESRGEVIVFLDAHCEVGLN 183
           +KA L ++LE+YI+++NG V++ RN +REGLI+ RS GA K + G+V+++LDAHCEVG+N
Sbjct: 245 NKAHLKERLEEYIKQWNGLVKVFRNEKREGLIQARSIGARKATLGKVLIYLDAHCEVGVN 304

Query: 184 WLPPLLAPIYSDRKIMTVPVIDGIDYQ--TWEFRSVYEPDHHYRGIFEWGMLYKENELPE 241
           W  PL+API  DR + TVP+ID ID    T E +   + D   RG ++W +L+K   L  
Sbjct: 305 WYAPLVAPISKDRTVCTVPLIDYIDGNDYTIEPQQGGDEDGLARGAWDWSLLWKRVPLSS 364

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           RE  KRK+ +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG 
Sbjct: 365 REKAKRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGQ 424

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           + +VPCSR+GH+YR    +        V       NY RV+E W+D+ +K YFY   P  
Sbjct: 425 LLFVPCSRVGHIYR-LQGWQGNPPPAHVGSSPTLKNYVRVVEVWWDD-YKDYFYASRPET 482

Query: 362 MFLDMGDIS 370
           + L  GDIS
Sbjct: 483 LTLAYGDIS 491


>gi|410927898|ref|XP_003977377.1| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase
           7-like [Takifugu rubripes]
          Length = 664

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 197/376 (52%), Positives = 259/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV K  G LGNLEP  P  P    G GEG K + L   Y+ A  AS+ E+G NM  S+ I
Sbjct: 132 PVLKK-GILGNLEPKEPEPPGVPGGLGEGAKPFVLNAEYKDAIQASIKEFGFNMVASDMI 190

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR+I D+R +ECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 191 SLDRSISDIRHDECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRRYLAEIV 250

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKE-SRGEVIVFLDAH 177
           ++DDFS+K  L ++LE+YI+++NG V+L RN +REGLI+ RS GAK+ ++G+V+++LDAH
Sbjct: 251 MIDDFSNKVHLKERLEEYIKQWNGLVKLFRNEKREGLIQARSIGAKKATKGQVLIYLDAH 310

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHH--YRGIFEWGMLYK 235
           CEVG+NW  PL+API  DR + TVP+ID ID Q +        D +   RG ++W ML+K
Sbjct: 311 CEVGINWYAPLVAPISKDRTVCTVPLIDSIDGQKYTVDPQGGGDQNGFARGAWDWSMLWK 370

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L +RE + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 371 RVPLGDREKQLRKTETEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 430

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSRIGH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 431 WQCGGQLLFVPCSRIGHIYR-LHGWQGNPPPAHVGSSPTLKNYVRVVEVWWDE-YKDYFY 488

Query: 356 TREPLAMFLDMGDISE 371
              P  + L  GDISE
Sbjct: 489 ASRPETLTLAYGDISE 504


>gi|432847870|ref|XP_004066191.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Oryzias
           latipes]
          Length = 653

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 195/377 (51%), Positives = 259/377 (68%), Gaps = 11/377 (2%)

Query: 2   PVFKADGKLGNLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
           P+ +  G LGN EP  EP  +G    PGEG K   L   Y+ +  AS+ E+G NM  S+ 
Sbjct: 122 PILRK-GTLGNFEPK-EPEPQGILNGPGEGAKPLILGSEYKDSVQASIKEFGFNMVASDM 179

Query: 58  ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           IS DRTI DLR +ECKYW Y   L  +SV++VFHNEG+S+LMRTVHS+IKRTP QYL EI
Sbjct: 180 ISMDRTISDLRNDECKYWHYDDRLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRQYLAEI 239

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKE-SRGEVIVFLDA 176
           +++DDFS+K  L ++LE+YI+++NG V+L RN +REGLI+ RS GAK+ ++G+V+++LDA
Sbjct: 240 VMIDDFSNKVHLKERLEEYIKQWNGLVKLFRNDKREGLIQARSIGAKKATKGQVLIYLDA 299

Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQ--TWEFRSVYEPDHHYRGIFEWGMLY 234
           HCEVG+NW  PL+API  DR + TVP+ID I  +  T E +   + D   RG ++W ML+
Sbjct: 300 HCEVGINWYAPLIAPISKDRTVCTVPLIDSIHGERFTIEPQGGGDEDGFARGAWDWSMLW 359

Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
           K   L +RE K RK  +EPY+SP  AGGLFA++R +F ELG YDPGL +WGGENFE+S+K
Sbjct: 360 KRVPLGDREKKLRKTQTEPYRSPAMAGGLFAIERDYFFELGLYDPGLQIWGGENFEISYK 419

Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
           IW CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K +F
Sbjct: 420 IWQCGGQLLFVPCSRVGHIYR-LQGWQGNPPPAHVGSSPTLKNYVRVVEVWWDE-YKDFF 477

Query: 355 YTREPLAMFLDMGDISE 371
           Y   P  + L  GDISE
Sbjct: 478 YASRPETLTLAYGDISE 494


>gi|326918604|ref|XP_003205578.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Meleagris
           gallopavo]
          Length = 665

 Score =  376 bits (966), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 196/376 (52%), Positives = 260/376 (69%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K Y L   Y+ +  AS+ E+G NM  S+ I
Sbjct: 133 PVLRP-GVLGNFEPKEPEPHGVVGGPGEEAKPYVLGPDYKESVQASIKEFGFNMVASDMI 191

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 192 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 251

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+KA L ++L+DYI+++NG V++ RN  REGLI+ RS GA++++ G+V+V+LDAH
Sbjct: 252 LIDDFSNKAHLKERLDDYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAH 311

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEVG+NW  PL+API  DR   TVP+ID ID  T++   +   + D   RG ++W ML+K
Sbjct: 312 CEVGINWYAPLIAPISKDRTTCTVPLIDVIDGNTFKIVPQGGGDEDGFARGAWDWSMLWK 371

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L +RE +KR+  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 372 RVPLSKREKEKRETKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 431

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 432 WQCGGKLLFVPCSRVGHIYR-LQGWQGNPPPVYVGSSPTLKNYVRVVEVWWDE-YKDYFY 489

Query: 356 TREPLAMFLDMGDISE 371
              P    L  GDISE
Sbjct: 490 ASRPETKALPYGDISE 505


>gi|198419403|ref|XP_002128971.1| PREDICTED: similar to UDP-N-acetyl-alpha-D-galactosamine:
           polypeptide N-acetylgalactosaminyltransferase 7 [Ciona
           intestinalis]
          Length = 631

 Score =  374 bits (961), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 198/387 (51%), Positives = 259/387 (66%), Gaps = 19/387 (4%)

Query: 1   RPVFKAD---------GKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMN 51
           +PV K D          KLGN E  L   + GPGE G A H     + A  AS+ E+G N
Sbjct: 91  KPVVKEDFSNYPQLNWRKLGNYEESLA-RRNGPGEYGVAVHATNDEKEAVAASIKEFGFN 149

Query: 52  METSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPA 111
           M  S+ IS DR   DLR +EC++WDYP DLP  SVI+VFHNEG+S+L+RTVHS+I  TP 
Sbjct: 150 MVNSDKISLDRLPKDLRHDECRHWDYPSDLPDVSVIIVFHNEGWSTLVRTVHSVINLTPK 209

Query: 112 QYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR---G 168
           + L EI+++DD S+K  L QKL +YIQRFNG V+L RN  REGLIR RS GA++S    G
Sbjct: 210 KLLYEIVMIDDHSNKEHLGQKLTEYIQRFNGLVKLYRNERREGLIRARSIGAQKSTPADG 269

Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHH--YRG 226
            V+V+LDAHCEVG NWLPPL+ PI ++RK+ TVP+ID I+ Q + F S    D +   RG
Sbjct: 270 RVLVYLDAHCEVGYNWLPPLIMPIVNNRKVTTVPLIDVINGQDYTFTSQAGGDANGFARG 329

Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
            ++W ML+K   L + E  +RK+ ++PY+SP  AGGLFA++R +F ++G YDPGL +WGG
Sbjct: 330 AWDWSMLWKRVPLTKEEHNRRKHTTDPYRSPAMAGGLFAIERQYFFDIGLYDPGLEIWGG 389

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP-YNFGKLADRVKGPLITYNYKRVIETW 345
           ENFE+SFKIWMC G + +VPCSR+GHVYR  +P ++     + V       NY RV+ETW
Sbjct: 390 ENFEMSFKIWMCEGEVLFVPCSRVGHVYR--LPGWSGNPPPEYVPSNPSLRNYIRVVETW 447

Query: 346 FDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +DE +K YFY   P  + +  GDIS Q
Sbjct: 448 WDE-YKDYFYASRPETLNMPYGDISAQ 473


>gi|345307492|ref|XP_001507110.2| PREDICTED: N-acetylgalactosaminyltransferase 7-like
           [Ornithorhynchus anatinus]
          Length = 873

 Score =  374 bits (960), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 194/377 (51%), Positives = 257/377 (68%), Gaps = 11/377 (2%)

Query: 2   PVFKADGKLGNLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
           PV +  GKLGN EP  EP  +G    PGE  K Y L   Y+ +  AS+ E+G NM  S+ 
Sbjct: 98  PVLQP-GKLGNFEPK-EPEPQGVMGGPGEEAKPYVLGPEYKDSIQASIKEFGFNMVASDM 155

Query: 58  ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           IS DR+I DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI
Sbjct: 156 ISLDRSINDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRRYLAEI 215

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDA 176
           +L+DDFS+KA L  +L+DYI+++NG V++ RN  REGLI+ RS GA++++ G+V+++LDA
Sbjct: 216 VLIDDFSNKAHLKDRLDDYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDA 275

Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLY 234
           HCEV +NW  PL+API  DR + TVP+ID I   T+    +   + D + RG ++W ML+
Sbjct: 276 HCEVAVNWYAPLVAPISKDRTVCTVPLIDVISGNTFNIVPQGGGDEDGYARGAWDWSMLW 335

Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
           K   L +RE   RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+K
Sbjct: 336 KRVPLTQREKTLRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYK 395

Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
           IW CGG + +VPCSR+GH+YR    +        V       NY RV+E W+D+ +K YF
Sbjct: 396 IWQCGGKLLFVPCSRVGHIYR-LHGWQGNPPPVYVGSSPTLKNYVRVVEVWWDD-YKDYF 453

Query: 355 YTREPLAMFLDMGDISE 371
           Y   P    L  GDISE
Sbjct: 454 YASRPETKALPYGDISE 470


>gi|148237032|ref|NP_001084848.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Xenopus
           laevis]
 gi|47124654|gb|AAH70527.1| MGC78803 protein [Xenopus laevis]
          Length = 653

 Score =  374 bits (959), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 190/376 (50%), Positives = 261/376 (69%), Gaps = 11/376 (2%)

Query: 2   PVFKADGKLGNLEPPLEP----YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
           PV +  G LGN+EP  EP      +GPGEGGK + L   Y+ A  A++ E+G NM  S+ 
Sbjct: 121 PVLRP-GILGNMEPK-EPEPQGVVDGPGEGGKHFMLGPDYKDAIKATIKEFGFNMVASDM 178

Query: 58  ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           IS DRTI DLR EECK+W+Y  +L  +SVI+VFHNEG+S+LMRTVHS+IKRTP +YL EI
Sbjct: 179 ISLDRTINDLRHEECKFWNYDENLLTSSVIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEI 238

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDA 176
           +++DDFS+K  L ++L++YI+++NG V++ RN  REGLI+ RS GA++++ G+V+++LDA
Sbjct: 239 VMIDDFSNKEHLKERLDEYIKQWNGLVKVFRNERREGLIQARSIGAEKAKLGQVLIYLDA 298

Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLY 234
           HCEVG+NW  PL+API  DR   TVP+ID I+  T+E   ++  + D   RG ++W ML+
Sbjct: 299 HCEVGINWYAPLIAPIAKDRTTCTVPLIDVIEGNTYELIPQAGGDEDGFARGAWDWSMLW 358

Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
           K   L  +E ++RK  +EPY+SP  AGGLFA++R +F ELG YDPGL +WGGENFE+S+K
Sbjct: 359 KRVPLTSKEKEQRKTKTEPYRSPAMAGGLFAIEREYFFELGLYDPGLQIWGGENFEISYK 418

Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
           IW CGG + + PCSR+GH+YR    +        V       NY RV+E W+DE ++ YF
Sbjct: 419 IWQCGGKLLFTPCSRVGHIYR-LHGWQGNPTPAHVGSSPTLKNYVRVVEVWWDE-YRDYF 476

Query: 355 YTREPLAMFLDMGDIS 370
           Y   P    L  GDIS
Sbjct: 477 YASRPETKALAYGDIS 492


>gi|344235654|gb|EGV91757.1| N-acetylgalactosaminyltransferase 7 [Cricetulus griseus]
          Length = 607

 Score =  374 bits (959), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 196/376 (52%), Positives = 257/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K   L   Y+ A  AS+ E+G NM  S+ I
Sbjct: 75  PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 133

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 134 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 193

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFSSK  L +KL+DYI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 194 LIDDFSSKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 253

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 254 CEVAVNWYAPLVAPISKDRTICTVPIIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 313

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  RE + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 314 RVPLTSREKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 373

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 374 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 431

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 432 ASRPESKALPYGDISE 447


>gi|449500526|ref|XP_002187477.2| PREDICTED: N-acetylgalactosaminyltransferase 7 [Taeniopygia
           guttata]
          Length = 828

 Score =  374 bits (959), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 195/376 (51%), Positives = 258/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K Y L   Y+ +  AS+ E+G NM  S+ I
Sbjct: 296 PVLRP-GVLGNFEPKEPEPHGVVGGPGEEAKPYVLGPDYKESVQASIKEFGFNMVASDMI 354

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 355 SLDRSVNDLRQEECKYWHYDDNLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 414

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+KA L ++L+DYI+++NG V++ RN  REGLI+ RS GA++++ G+V+V+LDAH
Sbjct: 415 LIDDFSNKAHLQERLDDYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAH 474

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
           CEVG+NW  PL+API  DR   TVP+ID ID   +  E +   + D   RG ++W +L+K
Sbjct: 475 CEVGINWYAPLIAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 534

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E  KRK+ +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 535 RIPLSHKEKSKRKHKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 594

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 595 WQCGGKLLFVPCSRVGHIYR-LQGWQGNPPPVYVGSSPTLKNYVRVVEVWWDE-YKDYFY 652

Query: 356 TREPLAMFLDMGDISE 371
              P    L  GDISE
Sbjct: 653 ASRPETKALPYGDISE 668


>gi|344288243|ref|XP_003415860.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Loxodonta
           africana]
          Length = 657

 Score =  374 bits (959), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 195/376 (51%), Positives = 258/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPPL-EPYKE--GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPHGVVGGPGEKAKPLVLGPEFKPAVQASIKEFGFNMVASDMI 183

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L QKL+DYI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKQKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR + TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRAVCTVPIIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 363

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L ERE + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTEREKRMRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 423

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 482 ASRPESKALPYGDISE 497


>gi|390345015|ref|XP_787987.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like isoform 2
           [Strongylocentrotus purpuratus]
 gi|390345017|ref|XP_003726244.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like isoform 1
           [Strongylocentrotus purpuratus]
          Length = 670

 Score =  373 bits (958), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 188/377 (49%), Positives = 253/377 (67%), Gaps = 15/377 (3%)

Query: 2   PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
           PV K     GN EPP +PY+ GPGE G    L    +   D +  EYG NM  S+ IS D
Sbjct: 135 PVLKE--TTGNYEPPRQPYRTGPGEYGLGVLLDHNEKHLYDKAFEEYGFNMVVSDRISLD 192

Query: 62  RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
           R + DLR +ECK+W YP +LP  SV++VFH EG+S+L+RT+HS+   +P + L E++LVD
Sbjct: 193 RIVADLRDKECKHWHYPTNLPNTSVVIVFHQEGWSTLIRTIHSVFNTSPKELLAEVLLVD 252

Query: 122 DFSSKADLDQKLEDYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
           D+S K  L +KL+DYI+  RF+GK+R++RN +REGLIR+R+ GA+++ G+V+ FLDAHCE
Sbjct: 253 DYSDKVHLKKKLDDYIRDPRFSGKIRIVRNKKREGLIRSRTIGARKAIGQVLTFLDAHCE 312

Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 239
            G NWLPPLLA I  DR  +  P +D I   T+ + S  + D   RG F+W   YK   +
Sbjct: 313 CGPNWLPPLLAEIAVDRSTIVCPTVDAISSDTFAYTS--QGDGLCRGAFDWDFWYK--RI 368

Query: 240 PEREAKKR---KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
           P +    R   K  S+PY SP  AGGL A+DR++F ELGGYDPGL +WGGENFE+SFK+W
Sbjct: 369 PVKPYWHRLGLKQRSQPYPSPVMAGGLLALDRSYFFELGGYDPGLQIWGGENFEISFKVW 428

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG-PLITYNYKRVIETWFDEKHKAYFY 355
           MCGGS+++VPCSR+GHVYR  +PY++   +  V+G  ++  NY RV E W DE +K  FY
Sbjct: 429 MCGGSLKFVPCSRVGHVYRKQVPYSYP--SSGVEGVSVVDLNYMRVAEVWLDE-YKDSFY 485

Query: 356 TREPLAMFLDMGDISEQ 372
             +PL      G+ISEQ
Sbjct: 486 ATKPLLEGKPCGNISEQ 502


>gi|363733313|ref|XP_420521.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Gallus gallus]
          Length = 636

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 194/376 (51%), Positives = 259/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K Y L   Y+ +  AS+ E+G NM  S+ I
Sbjct: 104 PVLRP-GVLGNFEPKEPEPHGVVGGPGEEAKPYVLGPDYKESVQASIKEFGFNMVASDMI 162

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECK+W Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 163 SLDRSVNDLRQEECKHWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 222

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L ++L+DYI+++NG V++ RN  REGLI+ RS GA++++ G+V+V+LDAH
Sbjct: 223 LIDDFSNKVHLKERLDDYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAH 282

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEVG+NW  PL+API  DR   TVP+ID ID  T++   +   + D   RG ++W ML+K
Sbjct: 283 CEVGINWYAPLIAPISKDRTTCTVPLIDVIDGDTFKIVPQGGGDEDGFARGAWDWSMLWK 342

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L +RE +KR+  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 343 RVPLSKREKEKRETKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 402

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 403 WQCGGKLLFVPCSRVGHIYR-LQGWQGNPPPVYVGSSPTLKNYVRVVEVWWDE-YKDYFY 460

Query: 356 TREPLAMFLDMGDISE 371
              P    L  GDISE
Sbjct: 461 ASRPETKALPYGDISE 476


>gi|449270894|gb|EMC81540.1| N-acetylgalactosaminyltransferase 7, partial [Columba livia]
          Length = 613

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 194/376 (51%), Positives = 259/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K Y L   Y+ +  AS+ E+G NM  S+ I
Sbjct: 81  PVLRP-GVLGNFEPKEPEPHGVVGGPGEEAKPYVLGPDYKESIQASIKEFGFNMVASDMI 139

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 140 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 199

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+KA L ++L++YI+++NG V++ RN  REGLI+ RS GA++++ G+V+V+LDAH
Sbjct: 200 LIDDFSNKAHLKERLDEYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAH 259

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
           CEVG+NW  PL+API  DR   TVP+ID ID   +  E +   + D   RG ++W +L+K
Sbjct: 260 CEVGINWYAPLIAPIAKDRTTCTVPLIDYIDGSDYSIEPQQGGDEDGFARGAWDWSLLWK 319

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L ++E  KRK+ +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 320 RIPLSQKEKSKRKHKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 379

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 380 WQCGGKLLFVPCSRVGHIYR-LQGWQGNPPPVYVGSSPTLKNYVRVVEVWWDE-YKDYFY 437

Query: 356 TREPLAMFLDMGDISE 371
              P    L  GDISE
Sbjct: 438 ASRPETKALPYGDISE 453


>gi|12621080|ref|NP_075215.1| N-acetylgalactosaminyltransferase 7 [Rattus norvegicus]
 gi|51315737|sp|Q9R0C5.1|GALT7_RAT RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
           Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
           Short=pp-GaNTase 7; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 7; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 7
 gi|4092503|gb|AAC99426.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase T6 [Rattus
           norvegicus]
 gi|149032267|gb|EDL87173.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7, isoform CRA_a
           [Rattus norvegicus]
          Length = 657

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 194/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K   L   Y+ A  AS+ E+G NM  S+ I
Sbjct: 125 PVLRP-GVLGNFEPKEPEPHGVVGGPGENAKPLVLGPEYKQAAQASIKEFGFNMAASDMI 183

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL +YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLTEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPIIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  RE + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTPREKRLRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 423

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 482 ASRPESKALPYGDISE 497


>gi|327268630|ref|XP_003219099.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Anolis
           carolinensis]
          Length = 654

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 191/376 (50%), Positives = 257/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K Y L   Y+ +  AS+ E+G NM  S+ I
Sbjct: 122 PVLRP-GILGNFEPKEPEPHGVVNGPGEEAKPYVLGAEYKESVQASIKEFGFNMVASDMI 180

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR+I D+R EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 181 SLDRSINDIRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 240

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+KA L ++LE+YI+++NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 241 LIDDFSNKAHLKERLEEYIKQWNGLVKIFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 300

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR   TVP+ID ID  T+    +   + D + RG ++W ML+K
Sbjct: 301 CEVAVNWYAPLIAPISKDRTTCTVPLIDVIDGNTYNIVPQGGGDDDGYARGAWDWSMLWK 360

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L +RE + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 361 RVPLTKREKEMRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 420

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + + PCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 421 WQCGGKLLFTPCSRVGHIYR-LQGWQGNPPPAYVGSSPTLKNYVRVVEVWWDE-YKDYFY 478

Query: 356 TREPLAMFLDMGDISE 371
              P    L  GDI++
Sbjct: 479 ASRPETKALAYGDITD 494


>gi|291244621|ref|XP_002742193.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 7-like
           [Saccoglossus kowalevskii]
          Length = 634

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 189/374 (50%), Positives = 248/374 (66%), Gaps = 10/374 (2%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           VFK  G +GN EPP    + G GEG     L  A       +  E+G NM  S+ IS DR
Sbjct: 115 VFKP-GIVGNFEPPKSERRTGLGEGAIPVQLNPADENKYVKAKREFGFNMVISDQISLDR 173

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           T+ D+R  ECKYW YP DLP ASV+LVF NEG+S+LMRTVHS+   +P+  L EI++VDD
Sbjct: 174 TVKDIRDPECKYWHYPTDLPTASVVLVFINEGWSTLMRTVHSVFNTSPSHLLAEIVMVDD 233

Query: 123 FSSKADLDQKLEDYIQ--RFNGKVRLIRNTEREGLIRTRSRGA-KESRGEVIVFLDAHCE 179
           FS K  L  KLE+YI+  RF GK++L+RN +REGLIR R+ GA    RGEV+VFLDAHCE
Sbjct: 234 FSDKDHLKSKLEEYIKQDRFEGKIKLVRNAKREGLIRARTIGAINAERGEVVVFLDAHCE 293

Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 239
              NWLPPLL+ I  +RK +  P++D +D   + +    + D   RG+F W   YK   +
Sbjct: 294 CSPNWLPPLLSRIKQNRKAVVCPLVDAVDADNFGYAP--QADGMARGVFNWDFFYKRIPI 351

Query: 240 PEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCG 299
           P +EA +R+ NSEPY+SP  AGGLFA+ R+FF ++GGYD GL +WGGE +E+SFKIWMCG
Sbjct: 352 PPKEANRRERNSEPYRSPVMAGGLFALSRSFFFDIGGYDNGLDIWGGEQYEISFKIWMCG 411

Query: 300 GSIEWVPCSRIGHVY-RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
           G +E+VPCSR+GH+Y R  +PY++ +  D +   ++  NY RV E W DE +K YFY  +
Sbjct: 412 GILEFVPCSRVGHIYRRGGIPYSYPQSDDGIS--IVNKNYLRVAEVWMDE-YKEYFYRMK 468

Query: 359 PLAMFLDMGDISEQ 372
           P       GDI+EQ
Sbjct: 469 PELRGKPYGDITEQ 482


>gi|345790686|ref|XP_543898.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Canis lupus
           familiaris]
          Length = 721

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 194/376 (51%), Positives = 257/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 189 PVLRP-GILGNFEPKEPEPHGVVGGPGENAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 247

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 248 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 307

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL+DYI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 308 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 367

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 368 CEVAVNWYAPLVAPISKDRTICTVPIIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 427

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  RE + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 428 RVPLTPREKRMRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 487

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 488 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 545

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 546 ASRPESKALPYGDISE 561


>gi|338722468|ref|XP_001915592.2| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Equus
           caballus]
          Length = 621

 Score =  370 bits (950), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 194/376 (51%), Positives = 258/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 89  PVLRP-GILGNFEPKEPEPHGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 147

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 148 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 207

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL+DYI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 208 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 267

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+ +T+E   +   + D + RG ++W ML+K
Sbjct: 268 CEVAVNWYAPLIAPISKDRTICTVPIIDVINGKTYEIIPQGGGDEDGYARGAWDWSMLWK 327

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  RE + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 328 RVPLTPREKRMRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 387

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 388 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 445

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 446 ASRPESKALPYGDISE 461


>gi|417411949|gb|JAA52393.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Desmodus rotundus]
          Length = 615

 Score =  370 bits (950), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 193/376 (51%), Positives = 257/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP  P  P    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 83  PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 141

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL+DYI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 262 CEVAVNWYAPLVAPISKDRTICTVPIIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 321

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 322 RVPLTPQEKRMRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 381

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V+      NY RV+E W+DE +K YFY
Sbjct: 382 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVRSSPTLKNYVRVVEVWWDE-YKDYFY 439

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 440 ASRPESKALAYGDISE 455


>gi|119896052|ref|XP_602855.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Bos taurus]
          Length = 772

 Score =  370 bits (950), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 195/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP  P  P    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 240 PVLRP-GVLGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 298

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  AS+I+VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 299 SLDRSVNDLRQEECKYWHYDENLLTASIIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 358

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL+DYI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 359 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 418

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 419 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 478

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  RE + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 479 RVPLTLREKRLRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 538

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 539 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPVYVGSSPTLKNYVRVVEVWWDE-YKDYFY 596

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 597 ASRPESKALAYGDISE 612


>gi|440908503|gb|ELR58512.1| N-acetylgalactosaminyltransferase 7, partial [Bos grunniens mutus]
          Length = 615

 Score =  370 bits (950), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 195/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP  P  P    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 83  PVLRP-GVLGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 141

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  AS+I+VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTASIIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL+DYI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 262 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 321

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  RE + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 322 RVPLTPREKRLRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 381

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 382 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPVYVGSSPTLKNYVRVVEVWWDE-YKDYFY 439

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 440 ASRPESKALAYGDISE 455


>gi|354484375|ref|XP_003504364.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Cricetulus
           griseus]
          Length = 784

 Score =  370 bits (949), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 195/376 (51%), Positives = 255/376 (67%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K   L   Y+ A  AS+ E+G NM  S+ I
Sbjct: 252 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 310

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 311 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 370

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFSSK  L +KL+DYI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 371 LIDDFSSKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 430

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR   TVP+ID ID   +  E +   + D   RG ++W ML+K
Sbjct: 431 CEVAVNWYAPLVAPISKDRATCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSMLWK 490

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E  KRK+ +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 491 RIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 550

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 551 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 608

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 609 ASRPESKALPYGDISE 624


>gi|335301041|ref|XP_001926518.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Sus scrofa]
          Length = 712

 Score =  370 bits (949), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 193/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K   L    + A  AS+ E+G NM  S+ I
Sbjct: 180 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPVVLGPELKHAVQASIKEFGFNMVASDMI 238

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  AS+++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 239 SLDRSVNDLRQEECKYWHYDENLLTASIVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 298

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 299 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 358

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 359 CEVAVNWYAPLVAPISKDRTICTVPIIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 418

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  RE + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 419 RVPLTPREKRMRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 478

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 479 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPISVGSSPTLKNYVRVVEVWWDE-YKDYFY 536

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 537 ASRPESKALPYGDISE 552


>gi|296484976|tpg|DAA27091.1| TPA: N-acetylgalactosaminyltransferase 7-like [Bos taurus]
          Length = 781

 Score =  370 bits (949), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 195/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP  P  P    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 249 PVLRP-GVLGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 307

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  AS+I+VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 308 SLDRSVNDLRQEECKYWHYDENLLTASIIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 367

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL+DYI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 368 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 427

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 428 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 487

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  RE + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 488 RVPLTLREKRLRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 547

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 548 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPVYVGSSPTLKNYVRVVEVWWDE-YKDYFY 605

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 606 ASRPESKALAYGDISE 621


>gi|359067894|ref|XP_002689501.2| PREDICTED: N-acetylgalactosaminyltransferase 7 [Bos taurus]
          Length = 617

 Score =  369 bits (948), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 195/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP  P  P    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 85  PVLRP-GVLGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 143

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  AS+I+VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 144 SLDRSVNDLRQEECKYWHYDENLLTASIIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 203

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL+DYI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 204 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 263

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 264 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 323

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  RE + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 324 RVPLTLREKRLRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 383

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 384 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPVYVGSSPTLKNYVRVVEVWWDE-YKDYFY 441

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 442 ASRPESKALAYGDISE 457


>gi|269784707|ref|NP_653332.3| N-acetylgalactosaminyltransferase 7 isoform 1 [Mus musculus]
 gi|51315950|sp|Q80VA0.2|GALT7_MOUSE RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
           Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
           Short=pp-GaNTase 7; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 7; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 7
 gi|13650041|gb|AAK37549.1|AF349573_1 UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 [Mus musculus]
 gi|30851602|gb|AAH52461.1| Galnt7 protein [Mus musculus]
          Length = 657

 Score =  369 bits (947), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 193/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPPL-EPYKE--GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K   L   Y+ A  AS+ E+G NM  S+ I
Sbjct: 125 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 183

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I   T+E   +   + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  RE + RK  +EPY+SP  AGGLFA+++ FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTSREKRLRKTKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWGGENFEISYKI 423

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 482 ASRPESKALPYGDISE 497


>gi|395840002|ref|XP_003792859.1| PREDICTED: N-acetylgalactosaminyltransferase 7 isoform 1 [Otolemur
           garnettii]
          Length = 657

 Score =  369 bits (947), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 194/370 (52%), Positives = 252/370 (68%), Gaps = 8/370 (2%)

Query: 8   GKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 64
           G LGN EP   EP+    GPGE  K   L    + A  AS+ E+G NM  S+ IS DR+I
Sbjct: 130 GVLGNFEPKEPEPHGVVGGPGEKAKPVVLGPELKQAAQASIKEFGFNMVASDMISLDRSI 189

Query: 65  PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
            DLR EECKYW Y  +L  ASVI+VFHNEG+S+LMRTVHS+IKRTP +YL EI+L+DDFS
Sbjct: 190 NDLRQEECKYWHYDENLLTASVIVVFHNEGWSTLMRTVHSVIKRTPRKYLAEIVLIDDFS 249

Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLN 183
           +K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAHCEV +N
Sbjct: 250 NKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVN 309

Query: 184 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPE 241
           W  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K   L  
Sbjct: 310 WYAPLVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWKRVPLTL 369

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           RE   RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG 
Sbjct: 370 REKSLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGK 429

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY   P +
Sbjct: 430 LLFVPCSRVGHIYR-LEGWQGNPPPVSVGSSPTLKNYVRVVEVWWDE-YKDYFYASRPES 487

Query: 362 MFLDMGDISE 371
             L  GDISE
Sbjct: 488 KALPYGDISE 497


>gi|74139820|dbj|BAE31754.1| unnamed protein product [Mus musculus]
 gi|74191634|dbj|BAE30388.1| unnamed protein product [Mus musculus]
 gi|74198878|dbj|BAE30662.1| unnamed protein product [Mus musculus]
          Length = 546

 Score =  369 bits (946), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 193/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPPL-EPYKE--GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K   L   Y+ A  AS+ E+G NM  S+ I
Sbjct: 14  PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 72

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 73  SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 132

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 133 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 192

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I   T+E   +   + D + RG ++W ML+K
Sbjct: 193 CEVAVNWYAPLVAPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYARGAWDWSMLWK 252

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  RE + RK  +EPY+SP  AGGLFA+++ FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 253 RVPLTSREKRLRKTKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWGGENFEISYKI 312

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 313 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 370

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 371 ASRPESKALPYGDISE 386


>gi|301753757|ref|XP_002912714.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Ailuropoda
           melanoleuca]
 gi|281338294|gb|EFB13878.1| hypothetical protein PANDA_000463 [Ailuropoda melanoleuca]
          Length = 657

 Score =  369 bits (946), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 193/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 125 PVLRP-GVLGNFEPKEPEPHGVVGGPGENAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 183

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L  KL+DY++ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKGKLDDYLKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPIIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  RE + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTPREKRMRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 423

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 482 ASRPESKALPFGDISE 497


>gi|387019377|gb|AFJ51806.1| n-acetylgalactosaminyltransferase 7-like [Crotalus adamanteus]
          Length = 658

 Score =  369 bits (946), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 192/371 (51%), Positives = 250/371 (67%), Gaps = 10/371 (2%)

Query: 8   GKLGNLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRT 63
           G LGN EP  EP   G    PGE  K + L   Y+ +  AS+ E+G NM  S+ IS DR+
Sbjct: 131 GILGNFEPK-EPESHGVVGGPGEEAKPFVLGPEYKESIQASIKEFGFNMVASDMISLDRS 189

Query: 64  IPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 123
           I DLR EECKYW Y  +L  +SVI+VFHNEG+S+LMRTVHS+IKRTP +YL EI+L+DDF
Sbjct: 190 INDLRQEECKYWHYDENLLTSSVIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIVLIDDF 249

Query: 124 SSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGL 182
           S+K  L ++LEDYI+++NG V++ RN  REGLI+ RS GA++++ G+V+++LDAHCEV +
Sbjct: 250 SNKEHLKERLEDYIKQWNGLVKIFRNERREGLIQARSIGAQKAKLGKVLIYLDAHCEVAV 309

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELP 240
           NW  PL+API  DR   TVP+ID ID  T+    +   + D   RG ++W ML+K   L 
Sbjct: 310 NWYAPLIAPISKDRTACTVPLIDVIDGNTYNIVPQGGGDEDGFARGAWDWSMLWKRVPLT 369

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
           +RE   RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG
Sbjct: 370 KREKAMRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGG 429

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
            + +VPCSR+GH+YR    +        V       NY RV+E W+DE  K YFY   P 
Sbjct: 430 QLLFVPCSRVGHIYR-LQGWQGNPPPAYVGSSPTLKNYVRVVEVWWDE-FKDYFYASRPE 487

Query: 361 AMFLDMGDISE 371
              L  GDIS+
Sbjct: 488 TKALAYGDISD 498


>gi|148696676|gb|EDL28623.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
           N-acetylgalactosaminyltransferase 7, isoform CRA_a [Mus
           musculus]
          Length = 615

 Score =  369 bits (946), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 193/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K   L   Y+ A  AS+ E+G NM  S+ I
Sbjct: 83  PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 141

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I   T+E   +   + D + RG ++W ML+K
Sbjct: 262 CEVAVNWYAPLVAPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYARGAWDWSMLWK 321

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  RE + RK  +EPY+SP  AGGLFA+++ FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 322 RVPLTSREKRLRKTKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWGGENFEISYKI 381

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 382 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 439

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 440 ASRPESKALPYGDISE 455


>gi|390370478|ref|XP_793045.3| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like, partial [Strongylocentrotus purpuratus]
          Length = 658

 Score =  368 bits (945), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 177/315 (56%), Positives = 225/315 (71%), Gaps = 5/315 (1%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
           G+ EP   P +EGPGEGG A     + +A  D  + EYG N   S+ IS DR I DLR +
Sbjct: 333 GDYEPVNLPVREGPGEGGAAVRTQPSEKAKVDRLIQEYGFNQYVSDQISLDRNIADLRSQ 392

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
           +CK+W YP  LP  SVI+VFHNEG+S+L+RTVHS+  R+P+Q L EIILVDDFS+K  L 
Sbjct: 393 QCKHWHYPETLPTTSVIIVFHNEGWSTLLRTVHSVFNRSPSQLLHEIILVDDFSTKEHLK 452

Query: 131 QKLEDYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
           ++LEDY+Q  RFNGK++L+RN+ REGLIRTR  GA+ S G+V+++LDAHCEVG+NWLPPL
Sbjct: 453 ERLEDYVQEARFNGKLKLVRNSRREGLIRTRIIGARHSTGDVLLWLDAHCEVGVNWLPPL 512

Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
           L PI  +R     P+ID ID   +        D   RG F+W + +K   +P+ E  +R+
Sbjct: 513 LTPIAVNRTTAVCPIIDVIDNMDYRVYPQGTGDQD-RGGFDWSLYWKHLPVPQFEKSRRQ 571

Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
           + SEPY+SP  AGGLFAMDR +F ELG YD GL +WGGENFELSFKIWMCGGS+ WVPCS
Sbjct: 572 HASEPYRSPAMAGGLFAMDRKYFFELGAYDEGLEIWGGENFELSFKIWMCGGSLLWVPCS 631

Query: 309 RIGHVYRSF--MPYN 321
           R+GHVYR    +PY+
Sbjct: 632 RVGHVYRILGKVPYS 646


>gi|26329091|dbj|BAC28284.1| unnamed protein product [Mus musculus]
          Length = 657

 Score =  367 bits (941), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 192/376 (51%), Positives = 255/376 (67%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPPL-EPYKE--GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K   L   Y+ A  AS+ E+G NM  S+ I
Sbjct: 125 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 183

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+ PI  DR I TVP+ID I   T+E   +   + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVPPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  RE + RK  +EPY+SP  AGGLFA+++ FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTSREKRLRKTKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWGGENFEISYKI 423

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 482 ASRPESKALPYGDISE 497


>gi|332820787|ref|XP_003310650.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Pan troglodytes]
 gi|410227832|gb|JAA11135.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
           troglodytes]
 gi|410262380|gb|JAA19156.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
           troglodytes]
 gi|410297750|gb|JAA27475.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
           troglodytes]
 gi|410332293|gb|JAA35093.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
           troglodytes]
          Length = 657

 Score =  367 bits (941), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 192/376 (51%), Positives = 257/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP  P  P    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 183

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN +REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNEKREGLIQARSIGAQKAKLGQVLIYLDAH 303

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTTQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 423

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 482 ASRPESQALPYGDISE 497


>gi|296195170|ref|XP_002745262.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Callithrix jacchus]
          Length = 657

 Score =  366 bits (940), Expect = 9e-99,   Method: Compositional matrix adjust.
 Identities = 193/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP  P  P    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 183

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+V+LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAH 303

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 423

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 482 ASRPESQALPYGDISE 497


>gi|355778494|gb|EHH63530.1| hypothetical protein EGM_16517, partial [Macaca fascicularis]
          Length = 615

 Score =  366 bits (939), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 192/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP  P  P    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 83  PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 141

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 262 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 321

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 322 RVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 381

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 382 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 439

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 440 ASRPESQALPYGDISE 455


>gi|410956565|ref|XP_003984911.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Felis catus]
          Length = 772

 Score =  366 bits (939), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 192/376 (51%), Positives = 255/376 (67%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPP-LEPYKE--GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 240 PVLRP-GILGNFEPKEPEPHGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 298

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 299 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 358

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL+DYI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 359 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 418

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR   TVP+ID ID   +  E +   + D   RG ++W +L+K
Sbjct: 419 CEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 478

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E  KRK+ +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 479 RIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 538

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 539 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 596

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 597 ASRPESKALPYGDISE 612


>gi|395840004|ref|XP_003792860.1| PREDICTED: N-acetylgalactosaminyltransferase 7 isoform 2 [Otolemur
           garnettii]
          Length = 657

 Score =  366 bits (939), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 192/370 (51%), Positives = 251/370 (67%), Gaps = 8/370 (2%)

Query: 8   GKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 64
           G LGN EP   EP+    GPGE  K   L    + A  AS+ E+G NM  S+ IS DR+I
Sbjct: 130 GVLGNFEPKEPEPHGVVGGPGEKAKPVVLGPELKQAAQASIKEFGFNMVASDMISLDRSI 189

Query: 65  PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
            DLR EECKYW Y  +L  ASVI+VFHNEG+S+LMRTVHS+IKRTP +YL EI+L+DDFS
Sbjct: 190 NDLRQEECKYWHYDENLLTASVIVVFHNEGWSTLMRTVHSVIKRTPRKYLAEIVLIDDFS 249

Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLN 183
           +K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAHCEV +N
Sbjct: 250 NKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVN 309

Query: 184 WLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPE 241
           W  PL+API  DR   TVP+ID ID   +  E +   + D   RG ++W +L+K   L  
Sbjct: 310 WYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLSH 369

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           +E  KRK+ +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG 
Sbjct: 370 KEKAKRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGK 429

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY   P +
Sbjct: 430 LLFVPCSRVGHIYR-LEGWQGNPPPVSVGSSPTLKNYVRVVEVWWDE-YKDYFYASRPES 487

Query: 362 MFLDMGDISE 371
             L  GDISE
Sbjct: 488 KALPYGDISE 497


>gi|397505872|ref|XP_003823466.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Pan paniscus]
          Length = 657

 Score =  366 bits (939), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 192/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP  P  P    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 183

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 423

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 482 ASRPESQALPYGDISE 497


>gi|426222421|ref|XP_004005390.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Ovis aries]
          Length = 865

 Score =  366 bits (939), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 192/376 (51%), Positives = 254/376 (67%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP  P  P    GPGE  +   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 333 PVLRP-GVLGNFEPKEPEPPGVVGGPGEKAQPLVLGPEFKHAVQASIKEFGFNMVASDMI 391

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  AS+I+VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 392 SLDRSVNDLRQEECKYWHYDENLLTASIIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 451

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL+DYI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 452 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 511

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR   TVP+ID ID   +  E +   + D   RG ++W +L+K
Sbjct: 512 CEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 571

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E  KRK+ +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 572 RIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 631

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 632 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPVYVGSSPTLKNYVRVVEVWWDE-YKDYFY 689

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 690 ASRPESKALAYGDISE 705


>gi|197101721|ref|NP_001124628.1| N-acetylgalactosaminyltransferase 7 [Pongo abelii]
 gi|75042656|sp|Q5RFJ6.1|GALT7_PONAB RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
           Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
           Short=pp-GaNTase 7; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 7; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 7
 gi|55725190|emb|CAH89461.1| hypothetical protein [Pongo abelii]
          Length = 657

 Score =  366 bits (939), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 192/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP  P  P    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 183

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 423

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 482 ASRPESQALPYGDISE 497


>gi|157502212|ref|NP_059119.2| N-acetylgalactosaminyltransferase 7 [Homo sapiens]
 gi|51315961|sp|Q86SF2.1|GALT7_HUMAN RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
           Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
           Short=pp-GaNTase 7; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 7; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 7
 gi|28279289|gb|AAH46129.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Homo
           sapiens]
 gi|28704077|gb|AAH47468.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Homo
           sapiens]
 gi|119625166|gb|EAX04761.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Homo
           sapiens]
 gi|193786832|dbj|BAG52155.1| unnamed protein product [Homo sapiens]
 gi|325464563|gb|ADZ16052.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 (GalNAc-T7)
           [synthetic construct]
          Length = 657

 Score =  366 bits (939), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 192/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP  P  P    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 183

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 423

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 482 ASRPESQALPYGDISE 497


>gi|383412007|gb|AFH29217.1| N-acetylgalactosaminyltransferase 7 [Macaca mulatta]
          Length = 657

 Score =  365 bits (938), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 192/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP  P  P    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 183

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 423

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 482 ASRPESQALPYGDISE 497


>gi|402870854|ref|XP_003899414.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Papio anubis]
          Length = 657

 Score =  365 bits (937), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 192/376 (51%), Positives = 256/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP  P  P    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 183

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 423

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 482 ASRPESQALPYGDISE 497


>gi|269784709|ref|NP_001161453.1| N-acetylgalactosaminyltransferase 7 isoform 2 [Mus musculus]
 gi|26331462|dbj|BAC29461.1| unnamed protein product [Mus musculus]
          Length = 657

 Score =  365 bits (937), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 192/376 (51%), Positives = 255/376 (67%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K   L   Y+ A  AS+ E+G NM  S+ I
Sbjct: 125 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 183

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR   TVP+ID ID   +  E +   + D   RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRATCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSMLWK 363

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E  KRK+ +EPY+SP  AGGLFA+++ FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWGGENFEISYKI 423

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 482 ASRPESKALPYGDISE 497


>gi|148696677|gb|EDL28624.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
           N-acetylgalactosaminyltransferase 7, isoform CRA_b [Mus
           musculus]
          Length = 615

 Score =  365 bits (936), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 192/376 (51%), Positives = 255/376 (67%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPPL-EPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K   L   Y+ A  AS+ E+G NM  S+ I
Sbjct: 83  PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 141

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR   TVP+ID ID   +  E +   + D   RG ++W ML+K
Sbjct: 262 CEVAVNWYAPLVAPISKDRATCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSMLWK 321

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E  KRK+ +EPY+SP  AGGLFA+++ FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 322 RIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWGGENFEISYKI 381

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 382 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYFY 439

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 440 ASRPESKALPYGDISE 455


>gi|126331345|ref|XP_001372222.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Monodelphis
           domestica]
          Length = 585

 Score =  364 bits (934), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 189/376 (50%), Positives = 257/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G +GN EP   EP+    GPGE  K Y L   Y+ +  AS+ E+G NM  S+ I
Sbjct: 125 PVLRP-GIIGNFEPKEPEPHGVLGGPGEEAKPYVLGPDYKESIHASIKEFGFNMVASDMI 183

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR+I DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSINDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+KA L ++L++YI+++NG V++ RN  REGLI+ RS GA +++ G+V+++LDAH
Sbjct: 244 LIDDFSNKAHLKERLDEYIKQWNGLVKVFRNERREGLIQARSIGAHKAKLGQVLIYLDAH 303

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR + TVP+ID ID   ++   +   + D + RG ++W +L+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTVCTVPIIDIIDGNNFKIMPQGGGDEDGYARGAWDWSLLWK 363

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L +RE   RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 364 RVPLTQREKTMRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 423

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        +       NY RV+E W+D  +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYLGSSPTLKNYIRVVEVWWD-GYKDYFY 481

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 482 ASRPESKALPYGDISE 497


>gi|426346015|ref|XP_004040686.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Gorilla gorilla
           gorilla]
          Length = 650

 Score =  364 bits (934), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 191/376 (50%), Positives = 254/376 (67%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP  P  P    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 118 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 176

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 177 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 236

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 237 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 296

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR   TVP+ID ID   +  E +   + D   RG ++W +L+K
Sbjct: 297 CEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 356

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E  KRK+ +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 357 RIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 416

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 417 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 474

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 475 ASRPESQALPYGDISE 490


>gi|403295730|ref|XP_003938783.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Saimiri boliviensis
           boliviensis]
          Length = 659

 Score =  363 bits (932), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 192/376 (51%), Positives = 254/376 (67%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP  P  P    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 127 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 185

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 186 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 245

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+V+LDAH
Sbjct: 246 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAH 305

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR   TVP+ID ID   +  E +   + D   RG ++W +L+K
Sbjct: 306 CEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 365

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E  KRK+ +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 366 RIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 425

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 426 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 483

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 484 ASRPESQALPYGDISE 499


>gi|348538240|ref|XP_003456600.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Oreochromis
           niloticus]
          Length = 649

 Score =  363 bits (932), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 190/376 (50%), Positives = 254/376 (67%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPPLEPYKEGPGEG---GKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV K  G LGN EP        PG      K + L   Y+ +  AS+ E+G NM  S+ I
Sbjct: 117 PVLKK-GILGNFEPKEPEPPGVPGGPGEGAKPFVLGPEYKDSVQASIKEFGFNMVASDMI 175

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DRTI D+R EECKYW Y   L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 176 SLDRTINDIRHEECKYWHYDDRLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRRYLAEIV 235

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKE-SRGEVIVFLDAH 177
           L+DDFS+K  L ++LE+YI+++NG V+L RN +REGLI+ RS GAK+ ++G+V+V+LDAH
Sbjct: 236 LIDDFSNKVHLKERLEEYIKQWNGLVKLFRNEKREGLIQARSIGAKKATKGQVLVYLDAH 295

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
           CEVG+NW  PL+API  DR + TVP+ID ID   +  E +   + D   RG ++W +L+K
Sbjct: 296 CEVGINWYAPLIAPISKDRTVCTVPLIDYIDGNEYSMEPQQGGDEDGLARGAWDWSLLWK 355

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L +RE  KR + ++PY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 356 RVPLSQREKAKRTHTTQPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 415

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+D+ +K YFY
Sbjct: 416 WQCGGQLLFVPCSRVGHIYR-LQGWQGNPPPAHVGSSPTLKNYVRVVEVWWDD-YKDYFY 473

Query: 356 TREPLAMFLDMGDISE 371
              P  + L  GDIS+
Sbjct: 474 ASRPETLTLAYGDISD 489


>gi|395542397|ref|XP_003773119.1| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase 7
           [Sarcophilus harrisii]
          Length = 797

 Score =  363 bits (931), Expect = 9e-98,   Method: Compositional matrix adjust.
 Identities = 190/376 (50%), Positives = 256/376 (68%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEPPL-EPYKE--GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G +GN EP   EP+    GPGE  K Y L   Y+ +  AS+ E+G NM  S+ I
Sbjct: 265 PVLRP-GIIGNFEPKEPEPHGVLGGPGEEAKPYVLGPDYKESIHASIKEFGFNMVASDMI 323

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR+I DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 324 SLDRSINDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 383

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+KA L ++L++YI+++NG V++ RN  REGLI+ RS GA +++ G+V+++LDAH
Sbjct: 384 LIDDFSNKAHLKERLDEYIKQWNGLVKVFRNERREGLIQARSIGAHKAKLGQVLIYLDAH 443

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR   TVP+ID ID   +  E +   + D   RG ++W +L+K
Sbjct: 444 CEVAVNWYAPLIAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 503

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E  KRK+ +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 504 RIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKI 563

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        +       NY RV+E W+D  +K YFY
Sbjct: 564 WQCGGKLLFVPCSRVGHIYR-LSGWQGNPPPIYLGSSPTLKNYIRVVEVWWD-GYKDYFY 621

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 622 ASRPESKALPYGDISE 637


>gi|348566877|ref|XP_003469228.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Cavia
           porcellus]
          Length = 637

 Score =  363 bits (931), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 191/377 (50%), Positives = 255/377 (67%), Gaps = 11/377 (2%)

Query: 2   PVFKADGKLGNLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
           PV +  G LGN EP  EP  +G    PGE  K   L   ++ A  AS+ E+G NM  S+ 
Sbjct: 105 PVLRP-GILGNFEPK-EPEPQGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDM 162

Query: 58  ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           IS DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI
Sbjct: 163 ISLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEI 222

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDA 176
           +L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDA
Sbjct: 223 VLIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDA 282

Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLY 234
           HCEV +NW  PL+API  DR   TVP+ID ID   +  E +   + D   RG ++W +L+
Sbjct: 283 HCEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLW 342

Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
           K   L  +E  KRK+ +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+K
Sbjct: 343 KRIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYK 402

Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
           IW CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YF
Sbjct: 403 IWQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDYF 460

Query: 355 YTREPLAMFLDMGDISE 371
           Y   P +  L  GDISE
Sbjct: 461 YASRPESKALLYGDISE 477


>gi|441620192|ref|XP_003258074.2| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase 7
           [Nomascus leucogenys]
          Length = 636

 Score =  361 bits (927), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 190/376 (50%), Positives = 253/376 (67%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G L N EP  P  P    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 104 PVLRP-GILSNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 162

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 163 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 222

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 223 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 282

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR   TVP+ID ID   +  E +   + D   RG ++W +L+K
Sbjct: 283 CEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 342

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E  KRK+ +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 343 RIPLSHKEKAKRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 402

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 403 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 460

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 461 ASRPESQALPYGDISE 476


>gi|351701091|gb|EHB04010.1| N-acetylgalactosaminyltransferase 7, partial [Heterocephalus
           glaber]
          Length = 616

 Score =  361 bits (926), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 192/378 (50%), Positives = 257/378 (67%), Gaps = 12/378 (3%)

Query: 2   PVFKADGKLGNLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGDASL-GEYGMNMETSN 56
           PV +  G LGN EP  EP  +G    PGE  K   L   ++ A  AS+  E+G NM  S+
Sbjct: 83  PVLRP-GILGNFEPK-EPEPQGVVGGPGEEAKPLILGPEFKHAVQASIIKEFGFNMVASD 140

Query: 57  HISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 116
            IS DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL E
Sbjct: 141 MISLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAE 200

Query: 117 IILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLD 175
           I+L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LD
Sbjct: 201 IVLIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLD 260

Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGML 233
           AHCEV +NW  PL+API  DR I TVP+ID I+  T++   +   + D + RG ++W ML
Sbjct: 261 AHCEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYQIVPQGGGDEDGYARGAWDWSML 320

Query: 234 YKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSF 293
           +K   L  RE + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+
Sbjct: 321 WKRVPLTPREKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISY 380

Query: 294 KIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAY 353
           KIW CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K Y
Sbjct: 381 KIWQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPLYVGSSPTLKNYVRVVEVWWDE-YKDY 438

Query: 354 FYTREPLAMFLDMGDISE 371
           FY   P +  L  GDISE
Sbjct: 439 FYASRPESKALLYGDISE 456


>gi|6318186|emb|CAB60270.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 7 [Homo
           sapiens]
          Length = 657

 Score =  360 bits (925), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 190/376 (50%), Positives = 253/376 (67%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP  P  P    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 183

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR + DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRNVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E + RK  +EPY+SP  AGGL A++R FF ELG YDP L +WGGENFE+S+KI
Sbjct: 364 RVPLTPQEKRLRKTKTEPYRSPAMAGGLCAIEREFFFELGLYDPSLQIWGGENFEISYKI 423

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 424 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 481

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 482 ASRPESQALPYGDISE 497


>gi|355687724|gb|EHH26308.1| hypothetical protein EGK_16238, partial [Macaca mulatta]
          Length = 615

 Score =  360 bits (924), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 191/376 (50%), Positives = 254/376 (67%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP  P  P    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 83  PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 141

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 262 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 321

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 322 RVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 381

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W  GG   +VPCSR+GH+YR    +        V       NY RV+E W+DE +K YFY
Sbjct: 382 WQGGGKFLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDE-YKDYFY 439

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 440 ASRPESQALPYGDISE 455


>gi|260789880|ref|XP_002589972.1| hypothetical protein BRAFLDRAFT_114654 [Branchiostoma floridae]
 gi|229275159|gb|EEN45983.1| hypothetical protein BRAFLDRAFT_114654 [Branchiostoma floridae]
          Length = 522

 Score =  359 bits (921), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 179/365 (49%), Positives = 242/365 (66%), Gaps = 5/365 (1%)

Query: 10  LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
           +GN EP  E   + PGEG   Y L   Y+   D S+ E+G N+  S+ IS DRTI D+R 
Sbjct: 1   MGNWEPEPERISDAPGEGAIPYKLGPEYKDDIDKSIKEFGFNIVASDKISLDRTIKDIRD 60

Query: 70  EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
            ECKYW Y   LP  SVI+VF+NE +S +MRTVHS+IKRTP + L EI+LVDDFS+K   
Sbjct: 61  PECKYWHYDTKLPNMSVIIVFYNEAWSVVMRTVHSVIKRTPPELLAEIVLVDDFSTKEHW 120

Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKE-SRGEVIVFLDAHCEVGLNWLPPL 188
            Q+L+DYI +F G V+L+RN +REGLI+ RS GA+E ++G+++V+LD+HCEVG+NW P L
Sbjct: 121 KQRLDDYIVQFKGLVKLVRNKQREGLIQARSIGAREATKGKILVYLDSHCEVGINWAPAL 180

Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDH--HYRGIFEWGMLYKENELPEREAKK 246
           ++PI  +R   TVP+ID ID   +   +    D   H RG ++W +L+K+     RE  +
Sbjct: 181 ISPIAVNRTTCTVPLIDVIDGNNYNIYAQGGGDEYGHARGAWDWSLLWKKVPNTPRERAR 240

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
            KY++EPY+SP  AGGLFA+DR +F ELG YDPGL +WGGENFE+S+K+W CGG + + P
Sbjct: 241 HKYHTEPYRSPAMAGGLFAIDREYFFELGLYDPGLKIWGGENFEISYKVWQCGGEVLFTP 300

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDM 366
           CSR+GH+YR    +            ++  NY RV+E W+DE +K YFY   P       
Sbjct: 301 CSRVGHIYR-LKGWAGNPPPQHSGSSVVLQNYMRVVEVWWDE-YKEYFYASRPEIRNHPY 358

Query: 367 GDISE 371
           GDISE
Sbjct: 359 GDISE 363


>gi|313226887|emb|CBY22032.1| unnamed protein product [Oikopleura dioica]
          Length = 618

 Score =  356 bits (913), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 188/377 (49%), Positives = 247/377 (65%), Gaps = 7/377 (1%)

Query: 1   RPVFKADGKLGNLEPPLEPYKE---GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
           + +++  GKLGN EP     KE   G G+ GK  +  +    A   S+ E+G NM  S+ 
Sbjct: 82  KEIYRDSGKLGNYEPDQATIKEMETGTGDYGKQVNWGKDEEDAVKKSIKEFGFNMVMSDK 141

Query: 58  ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           IS DR   D+R  +CKY DYP  LP+ SV++VFHNEG+S+LMRTVHS+IK+TP + L E+
Sbjct: 142 ISLDRVPKDIRDPKCKYVDYPEKLPEVSVVIVFHNEGWSTLMRTVHSVIKQTPKELLGEV 201

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
           ++VDD S+K  L   L++Y++R+NG VR+ RN +REGLIR RS GA ES+ EV+VFLDAH
Sbjct: 202 VMVDDASTKEHLKDNLDEYVKRWNGLVRVHRNEQREGLIRARSIGAFESKKEVLVFLDAH 261

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR--GIFEWGMLYK 235
           CE   NWLPPLLAPI  + +I TVP+IDGID   + F S    D   R  G ++W  L+K
Sbjct: 262 CEAEFNWLPPLLAPIARNDRISTVPMIDGIDGNHYHFTSQGGGDRWGRATGAWDWSFLWK 321

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              LPE E KK     +P+ SP  AGGLFA++R +F ++  YDPGL +WGGENFELS+K+
Sbjct: 322 RIALPESEDKKLPSKIQPFPSPAMAGGLFAINRQYFKDIMYYDPGLEIWGGENFELSYKL 381

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           WMCGG + +VPCSR+GH+YR    +        VK      NY+RVIETW+D+  K +FY
Sbjct: 382 WMCGGGMLFVPCSRVGHIYR-LEGWEGNPPPKTVKSNPSMRNYRRVIETWWDDWSK-FFY 439

Query: 356 TREPLAMFLDMGDISEQ 372
              P A  LD GDI  Q
Sbjct: 440 VARPEAKTLDFGDIGPQ 456


>gi|109076193|ref|XP_001085532.1| PREDICTED: n-acetylgalactosaminyltransferase 7 [Macaca mulatta]
          Length = 630

 Score =  355 bits (911), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 189/376 (50%), Positives = 252/376 (67%), Gaps = 9/376 (2%)

Query: 2   PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP  P  P    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 98  PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 156

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 157 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 216

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 217 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 276

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV +NW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 277 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 336

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              L  +E + RK  +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KI
Sbjct: 337 RVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKI 396

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           W CGG + +VPCSR+GH+YR    +        V       NY   +E   DE +K YFY
Sbjct: 397 WQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPTLKNYLSSVEVCGDE-YKDYFY 454

Query: 356 TREPLAMFLDMGDISE 371
              P +  L  GDISE
Sbjct: 455 ASRPESQALPYGDISE 470


>gi|198437817|ref|XP_002130165.1| PREDICTED: similar to UDP-N-acetyl-alpha-D-galactosamine:
           polypeptide N-acetylgalactosaminyltransferase 7 [Ciona
           intestinalis]
          Length = 647

 Score =  354 bits (909), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 187/373 (50%), Positives = 242/373 (64%), Gaps = 8/373 (2%)

Query: 2   PVFKADGKLGNLE-PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
           P F  D  LGN E    +  + G GE G+A  L  +  +   + +GE+G N   S+ IS 
Sbjct: 90  PKFVND-DLGNYELKAPDQKRAGAGEYGEAVQLDSSLDSQVKSVIGEFGFNTVASDRISL 148

Query: 61  DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
           DR   DLR EECK+ DYP  LP  SVI+VFHNE +S LMRTVH++I  TP QYL EI+++
Sbjct: 149 DRAPKDLRHEECKHIDYPSHLPSVSVIIVFHNEAWSPLMRTVHNVINNTPRQYLHEIVMI 208

Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           DD S K  L  KL++Y+ +FNG V++ RN  REGLIR RS GAK+S GE++V+LDAHCE 
Sbjct: 209 DDGSHKDHLGSKLDEYVTKFNGIVKVYRNDRREGLIRARSIGAKKSSGEILVYLDAHCEA 268

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHH--YRGIFEWGMLYKENE 238
             NWLPPL+ PI +D +  TVP+ID ID   + F      D +   RG ++W   +K   
Sbjct: 269 EPNWLPPLITPILNDHRACTVPLIDVIDGNKYTFTEQAGGDENGLARGAWDWSFQWKRIP 328

Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
           + ++E  +R   SEPY+SP  AGGLFA+DR FF ELG YD GL +WGGENFELS+K+WMC
Sbjct: 329 ITKKEKARRNRMSEPYRSPAMAGGLFAIDRNFFFELGLYDDGLEIWGGENFELSYKVWMC 388

Query: 299 GGSIEWVPCSRIGHVYRSFMP-YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           GG + +VPCSR+GHVYR  +P +        V    +  NYKRVIETW+D+  K YFYTR
Sbjct: 389 GGQLLFVPCSRVGHVYR--LPGWRGNPPPAYVPKDAVFRNYKRVIETWWDDYSK-YFYTR 445

Query: 358 EPLAMFLDMGDIS 370
            P    +D GD+S
Sbjct: 446 RPEVKSIDTGDLS 458


>gi|32425405|gb|AAH35303.1| GALNT7 protein, partial [Homo sapiens]
          Length = 495

 Score =  353 bits (907), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 176/337 (52%), Positives = 238/337 (70%), Gaps = 5/337 (1%)

Query: 38  RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSS 97
           + A  AS+ E+G NM  S+ IS DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+
Sbjct: 1   KQAIQASIKEFGFNMVASDMISLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWST 60

Query: 98  LMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIR 157
           LMRTVHS+IKRTP +YL EI+L+DDFS+K  L +KL++YI+ +NG V++ RN  REGLI+
Sbjct: 61  LMRTVHSVIKRTPRKYLAEIVLIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQ 120

Query: 158 TRSRGAKESR-GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-- 214
            RS GA++++ G+V+++LDAHCEV +NW  PL+API  DR I TVP+ID I+  T+E   
Sbjct: 121 ARSIGAQKAKLGQVLIYLDAHCEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIP 180

Query: 215 RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLEL 274
           +   + D + RG ++W ML+K   L  +E + RK  +EPY+SP  AGGLFA++R FF EL
Sbjct: 181 QGGGDEDGYARGAWDWSMLWKRVPLTPQEKRLRKTKTEPYRSPAMAGGLFAIEREFFFEL 240

Query: 275 GGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLI 334
           G YDPGL +WGGENFE+S+KIW CGG + +VPCSR+GH+YR    +        V     
Sbjct: 241 GLYDPGLQIWGGENFEISYKIWQCGGKLLFVPCSRVGHIYR-LEGWQGNPPPIYVGSSPT 299

Query: 335 TYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
             NY RV+E W+DE +K YFY   P +  L  GDISE
Sbjct: 300 LKNYVRVVEVWWDE-YKDYFYASRPESQALPYGDISE 335


>gi|313220437|emb|CBY31290.1| unnamed protein product [Oikopleura dioica]
          Length = 618

 Score =  353 bits (907), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 187/377 (49%), Positives = 247/377 (65%), Gaps = 7/377 (1%)

Query: 1   RPVFKADGKLGNLEPPLEPYKE---GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
           + +++  GKLGN EP     KE   G G+ GK  +  +    A   S+ E+G NM  S+ 
Sbjct: 82  KEIYRDSGKLGNYEPDQATIKEMETGTGDYGKQVNWGKDEEDAVKKSIKEFGFNMVMSDT 141

Query: 58  ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           IS DR   D+R  +CKY DYP  LP+ SV++VFHNEG+S+LMRTVHS+IK+TP + L E+
Sbjct: 142 ISLDRVPKDIRDPKCKYVDYPEKLPEVSVVIVFHNEGWSTLMRTVHSVIKQTPKELLGEV 201

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
           ++VDD S+K  L   L++Y++R+NG VR+ RN +REGLIR RS GA ES+ EV+VFLDAH
Sbjct: 202 VMVDDASTKEHLKDNLDEYVKRWNGLVRVHRNEQREGLIRARSIGAFESKKEVLVFLDAH 261

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR--GIFEWGMLYK 235
           CE   NWLPPLLAPI  + +I TVP+IDGID   + F +    D   R  G ++W  L+K
Sbjct: 262 CEAEFNWLPPLLAPIARNDRISTVPMIDGIDGNHYHFTTQGGGDRWGRATGAWDWSFLWK 321

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
              LPE E KK     +P+ SP  AGGLFA++R +F ++  YDPGL +WGGENFELS+K+
Sbjct: 322 RIALPEPEDKKLPSKIQPFPSPAMAGGLFAINRQYFKDIMYYDPGLEIWGGENFELSYKL 381

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           WMCGG + +VPCSR+GH+YR    +        VK      NY+RVIETW+D+  K +FY
Sbjct: 382 WMCGGGMLFVPCSRVGHIYR-LEGWEGNPPPKTVKSNPSMRNYRRVIETWWDDWSK-FFY 439

Query: 356 TREPLAMFLDMGDISEQ 372
              P A  LD GDI  Q
Sbjct: 440 VARPEAKTLDFGDIGPQ 456


>gi|47575716|ref|NP_001001200.1| polypeptide N-acetylgalactosaminyltransferase 7 [Xenopus (Silurana)
           tropicalis]
 gi|45501097|gb|AAH67317.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
           N-acetylgalactosaminyltransferase 7 [Xenopus (Silurana)
           tropicalis]
          Length = 653

 Score =  352 bits (902), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 189/375 (50%), Positives = 252/375 (67%), Gaps = 11/375 (2%)

Query: 2   PVFKADGKLGNLEPPLEPYKEGPGEGG----KAYHLPEAYRAAGDASLGEYGMNMETSNH 57
           PV +  G LGNLEP  EP  +G   G     K + L   Y+ A  AS+ E+G NM  S+ 
Sbjct: 121 PVLRP-GILGNLEPK-EPEPQGVVGGPGEGGKPFELGPDYKDAVKASIKEFGFNMVASDM 178

Query: 58  ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           IS DRTI DLR EECKYW+Y  +L  +SV++VFHNEG+S+L+RT+HS+IKRTP QYL EI
Sbjct: 179 ISMDRTINDLRHEECKYWNYDENLLTSSVVIVFHNEGWSTLVRTIHSVIKRTPRQYLAEI 238

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDA 176
           +++DDFS+K  L  +L++Y++++NG V++ RN  REGLI+ RS GA++++ G+V+++LDA
Sbjct: 239 VMIDDFSNKEHLKGRLDEYLKQWNGLVKVFRNERREGLIQARSIGAEKAKLGQVLIYLDA 298

Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID--YQTWEFRSVYEPDHHYRGIFEWGMLY 234
           HCEVG+NW  PL+API  DR   TVP+ID ID    T E +   + D   RG ++W ML+
Sbjct: 299 HCEVGINWYAPLIAPIAKDRTACTVPLIDYIDGNLYTIEPQQGGDEDGFARGAWDWSMLW 358

Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
           K   L  RE  KRK+ +EPY SP  AGGLFA++R +F ELG YDPGL +WGGENFE+S+K
Sbjct: 359 KRIPLTVREKAKRKHKTEPYWSPAMAGGLFAIERDYFFELGLYDPGLQIWGGENFEISYK 418

Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
           IW CGG + + PCSR+GH+YR    +        V       NY RV+E W+DE +K YF
Sbjct: 419 IWQCGGKLLFTPCSRVGHIYR-LHGWQGNPTPVYVGASPTLKNYIRVVEVWWDE-YKDYF 476

Query: 355 YTREPLAMFLDMGDI 369
           Y   P    L  GDI
Sbjct: 477 YASRPETKALPYGDI 491


>gi|313242250|emb|CBY34413.1| unnamed protein product [Oikopleura dioica]
          Length = 644

 Score =  339 bits (870), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 176/370 (47%), Positives = 248/370 (67%), Gaps = 10/370 (2%)

Query: 9   KLGNLEPPLEPYKEGPGEGG----KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 64
           ++GN EP       GPGEGG    K     E  +   DA + EYG NM  S+ IS DR  
Sbjct: 81  EIGNYEPKDWKVPAGPGEGGVEPLKLDDSTEMQKKQKDA-INEYGFNMVASDAISLDRYP 139

Query: 65  PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
            DLR EECK++ YP  LP +SVI VFHNEG+S+L+R++HS+I  TP + LEE++L+DD S
Sbjct: 140 ADLRHEECKHYQYPESLPASSVIFVFHNEGWSTLVRSIHSVINYTPPELLEEVVLIDDGS 199

Query: 125 SKADLDQ-KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLN 183
           +K  +   +LE++I+++NG V+L RN  REGLIR RS GA+++ G V+++LDAHCEV  N
Sbjct: 200 NKEHITGGRLEEHIKQWNGLVKLYRNDRREGLIRARSIGARKAVGSVLIYLDAHCEVEPN 259

Query: 184 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPE 241
           W+ PL+ P+  D +I TVP++D ID  T+ F  ++  + ++  RG ++W +L+K   L +
Sbjct: 260 WIVPLVEPMVHDYRICTVPMVDAIDGATYVFTPQAGGDENNFARGAWDWDLLWKRIPLND 319

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           RE  ++++ + PY+SP  AGGLFA+ R FF ELG YD GL +WGGENFE+S+KIWMC G 
Sbjct: 320 RERARQEHMTSPYRSPAMAGGLFAISRKFFFELGLYDEGLDIWGGENFEISYKIWMCHGQ 379

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           + +VPCSR+GH+YR    +        VKG  +  NY RV+E W+DE  K YFY R+P A
Sbjct: 380 MLFVPCSRVGHIYR-MKGWRGNGTPSYVKGNFVDRNYVRVVEVWWDEYSK-YFYERKPNA 437

Query: 362 MFLDMGDISE 371
             +D GD++E
Sbjct: 438 KHVDPGDLTE 447


>gi|313230492|emb|CBY18708.1| unnamed protein product [Oikopleura dioica]
          Length = 644

 Score =  339 bits (870), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 176/370 (47%), Positives = 248/370 (67%), Gaps = 10/370 (2%)

Query: 9   KLGNLEPPLEPYKEGPGEGG----KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 64
           ++GN EP       GPGEGG    K     E  +   DA + EYG NM  S+ IS DR  
Sbjct: 81  EIGNYEPKDWKVPAGPGEGGVEPLKLDDSTEMQKKQKDA-INEYGFNMVASDAISLDRYP 139

Query: 65  PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
            DLR EECK++ YP  LP +SVI VFHNEG+S+L+R++HS+I  TP + LEE++L+DD S
Sbjct: 140 ADLRHEECKHYQYPESLPASSVIFVFHNEGWSTLVRSIHSVINYTPPELLEEVVLIDDGS 199

Query: 125 SKADLDQ-KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLN 183
           +K  +   +LE++I+++NG V+L RN  REGLIR RS GA+++ G V+++LDAHCEV  N
Sbjct: 200 NKEHITGGRLEEHIKQWNGLVKLYRNDRREGLIRARSIGARKAVGSVLIYLDAHCEVEPN 259

Query: 184 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPE 241
           W+ PL+ P+  D +I TVP++D ID  T+ F  ++  + ++  RG ++W +L+K   L +
Sbjct: 260 WIVPLVEPMVHDYRICTVPMVDAIDGATYVFTPQAGGDENNFARGAWDWDLLWKRIPLND 319

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           RE  ++++ + PY+SP  AGGLFA+ R FF ELG YD GL +WGGENFE+S+KIWMC G 
Sbjct: 320 RERARQEHMTSPYRSPAMAGGLFAISRKFFFELGLYDEGLDIWGGENFEISYKIWMCHGQ 379

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           + +VPCSR+GH+YR    +        VKG  +  NY RV+E W+DE  K YFY R+P A
Sbjct: 380 MLFVPCSRVGHIYR-MKGWRGNGTPSYVKGNFVDRNYVRVVEVWWDEYSK-YFYERKPNA 437

Query: 362 MFLDMGDISE 371
             +D GD++E
Sbjct: 438 KHVDPGDLTE 447


>gi|313230491|emb|CBY18707.1| unnamed protein product [Oikopleura dioica]
          Length = 510

 Score =  339 bits (869), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 176/370 (47%), Positives = 248/370 (67%), Gaps = 10/370 (2%)

Query: 9   KLGNLEPPLEPYKEGPGEGG----KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 64
           ++GN EP       GPGEGG    K     E  +   DA + EYG NM  S+ IS DR  
Sbjct: 81  EIGNYEPKDWKVPAGPGEGGVEPLKLDDSTEMQKKQKDA-INEYGFNMVASDAISLDRYP 139

Query: 65  PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
            DLR EECK++ YP  LP +SVI VFHNEG+S+L+R++HS+I  TP + LEE++L+DD S
Sbjct: 140 ADLRHEECKHYQYPESLPASSVIFVFHNEGWSTLVRSIHSVINYTPPELLEEVVLIDDGS 199

Query: 125 SKADLDQ-KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLN 183
           +K  +   +LE++I+++NG V+L RN  REGLIR RS GA+++ G V+++LDAHCEV  N
Sbjct: 200 NKEHITGGRLEEHIKQWNGLVKLYRNDRREGLIRARSIGARKAVGSVLIYLDAHCEVEPN 259

Query: 184 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPE 241
           W+ PL+ P+  D +I TVP++D ID  T+ F  ++  + ++  RG ++W +L+K   L +
Sbjct: 260 WIVPLVEPMVHDYRICTVPMVDAIDGATYVFTPQAGGDENNFARGAWDWDLLWKRIPLND 319

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           RE  ++++ + PY+SP  AGGLFA+ R FF ELG YD GL +WGGENFE+S+KIWMC G 
Sbjct: 320 RERARQEHMTSPYRSPAMAGGLFAISRKFFFELGLYDEGLDIWGGENFEISYKIWMCHGQ 379

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           + +VPCSR+GH+YR    +        VKG  +  NY RV+E W+DE  K YFY R+P A
Sbjct: 380 MLFVPCSRVGHIYR-MKGWRGNGTPSYVKGNFVDRNYVRVVEVWWDEYSK-YFYERKPNA 437

Query: 362 MFLDMGDISE 371
             +D GD++E
Sbjct: 438 KHVDPGDLTE 447


>gi|156353877|ref|XP_001623135.1| predicted protein [Nematostella vectensis]
 gi|156209801|gb|EDO31035.1| predicted protein [Nematostella vectensis]
          Length = 454

 Score =  338 bits (867), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 163/350 (46%), Positives = 229/350 (65%), Gaps = 11/350 (3%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           GPGE G+         +  DA+  E+G N   S+ IS +RTI D R + CK   YP++LP
Sbjct: 1   GPGENGEPVETKAEDESKKDAAYSEFGFNQFVSDQISLERTISDTRHQACKQRSYPINLP 60

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
           KASV++VFHNEG+S+LMRTVH+++ R+P   L+EI++VDDFS+K  L QKL+DY ++  G
Sbjct: 61  KASVVIVFHNEGWSTLMRTVHTVLLRSPPHMLQEIVMVDDFSNKDFLKQKLDDYTKKL-G 119

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
           K++++R  ER GLI+ R  GA  + GEV++FLDAHCE    WLPPLL  I  +R+    P
Sbjct: 120 KIKIVRTKERVGLIKARVIGANNAVGEVVIFLDAHCECNKGWLPPLLERIALNRRTAVCP 179

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
            ID ID++T++++ +   D + RG F W   YKE  +   E  KR+  ++  KSP  AGG
Sbjct: 180 TIDFIDHKTFQYKPM---DPYIRGTFNWRFDYKERAVRPEEMAKRRDPTQEVKSPVMAGG 236

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA++R FF ELG YDPG+ +WGGE +E+SFK+W CGG +E +PCSR+GHVYR  +PY +
Sbjct: 237 LFAINREFFSELGQYDPGMFIWGGEQYEISFKLWQCGGQLENIPCSRVGHVYRHHVPYTY 296

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                  K      N++RV E W DE +K + Y + P    +D GDIS++
Sbjct: 297 P------KHDATLVNFRRVAEVWMDE-YKDWLYDKRPEIKSVDYGDISDR 339


>gi|313227738|emb|CBY22887.1| unnamed protein product [Oikopleura dioica]
          Length = 1030

 Score =  335 bits (860), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 169/352 (48%), Positives = 232/352 (65%), Gaps = 8/352 (2%)

Query: 25  GEGG-KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           GEGG     L    +    A+LG +G NM  S+ ++ DR   DLRMEECK WDYP  LP 
Sbjct: 493 GEGGLSPIRLTSEDQTKVTAALGLWGFNMVASDKVNMDRVPADLRMEECKRWDYPDKLPA 552

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ-KLEDYIQRFNG 142
            SVILVFHNEGFS+L+RTVHSI+  +P + L E++++DD S++  +    ++ YI+R++G
Sbjct: 553 VSVILVFHNEGFSTLLRTVHSIVNYSPPEMLHEVVMLDDGSTREYITNGTIDRYIERWDG 612

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            V++  N +REGLIR R+ G K S G V+VFLDAHCEV  NWLPPL+ PI  + K+ ++P
Sbjct: 613 LVKIFHNEKREGLIRARTIGGKHSTGSVLVFLDAHCEVEPNWLPPLITPIAKNYKVSSLP 672

Query: 203 VIDGIDYQTWEFRSVYEPDHH--YRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
           +ID ID  T+ F      D +   RG ++W   +K   L +RE  +R   +EP++SP  A
Sbjct: 673 MIDAIDGNTYVFEPQQGGDENNLARGAWDWNFDWKRIPLNQREKARRATITEPFRSPAMA 732

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP- 319
           GGLFA+ R +F ELG YD  L +WGGENFELS+K+W CGG + +VPCSR+GH+YR  MP 
Sbjct: 733 GGLFAISRKWFTELGWYDDKLEIWGGENFELSYKLWQCGGELLFVPCSRVGHIYR--MPG 790

Query: 320 YNFGKLADRVKGP-LITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           +      D +KG   I  NY RVIETW+D+ +K Y+Y R P    +D+GD++
Sbjct: 791 WGGNGTPDELKGKNFIAVNYNRVIETWWDDNYKKYYYERRPENKNVDVGDLT 842


>gi|402586829|gb|EJW80766.1| glycosyltransferase [Wuchereria bancrofti]
          Length = 409

 Score =  335 bits (859), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 162/255 (63%), Positives = 199/255 (78%), Gaps = 5/255 (1%)

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 178
           +VDDFS K  L  +L+ Y+++F+GKV+L+RN EREGLIRTRS GAKE+ G+V++FLDAHC
Sbjct: 1   MVDDFSDKEHLKDRLDVYLKQFDGKVKLVRNAEREGLIRTRSIGAKEAVGDVVIFLDAHC 60

Query: 179 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKEN 237
           EV +NWLPPLLAPI  +RKIMTVPVIDGID   W +R VY   D HYRGIFEWG+LYKE 
Sbjct: 61  EVNVNWLPPLLAPIRQNRKIMTVPVIDGIDKNDWSYRIVYGSVDKHYRGIFEWGLLYKET 120

Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
           EL  +E  +RK+NSEP++SPTHAGGLFA+++ +F ELG YDPGL +WGGE +ELSFKIW 
Sbjct: 121 ELSSQELLRRKHNSEPFRSPTHAGGLFAINKKWFEELGYYDPGLQIWGGEQYELSFKIWQ 180

Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           CGG I +VPCS +GHVYRS MPY FGKL+ +   P+I+ N  RVI+TW DE  K Y+Y R
Sbjct: 181 CGGGILFVPCSHVGHVYRSHMPYGFGKLSGK---PVISTNMLRVIKTWMDEYDK-YYYIR 236

Query: 358 EPLAMFLDMGDISEQ 372
           EP A     G+IS Q
Sbjct: 237 EPSAKHRLPGNISSQ 251


>gi|260812139|ref|XP_002600778.1| hypothetical protein BRAFLDRAFT_127524 [Branchiostoma floridae]
 gi|229286068|gb|EEN56790.1| hypothetical protein BRAFLDRAFT_127524 [Branchiostoma floridae]
          Length = 561

 Score =  333 bits (855), Expect = 7e-89,   Method: Compositional matrix adjust.
 Identities = 169/352 (48%), Positives = 226/352 (64%), Gaps = 13/352 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + GPGE GK   L    R  G  +  E G N++ SN IS DR IPD+R   C    Y  D
Sbjct: 40  RTGPGEQGKPADLTAEER--GPHAYEECGFNIKASNKISLDRAIPDIRHPNCASKKYVRD 97

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  S+++ FHNEG+++L+RTVHS++ R+P Q + EIILVDDFS ++ L + LEDY+ + 
Sbjct: 98  LPDVSLVIPFHNEGWTTLLRTVHSVLNRSPEQLIHEIILVDDFSDRSHLGKDLEDYVAKL 157

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
           + KVR++R  +REGLIRTR  GA+ ++G+V++FLD+HCE  +NWLPPLL PI  ++K + 
Sbjct: 158 SPKVRVVRTKQREGLIRTRLLGAQVAKGQVLIFLDSHCEANVNWLPPLLEPIALNKKTIV 217

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P ID ID   + + +  +     RG F+W M YK   +P+    K    S+P++SP  A
Sbjct: 218 CPNIDVIDKDDFHYET--QAGDAMRGAFDWEMYYKRIPIPDE--IKNPDPSDPFESPVMA 273

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+DR +F ELGGYDPGL +WGGE +ELSFK+W CGG +   PCSR+GHVYR F+PY
Sbjct: 274 GGLFAVDREYFEELGGYDPGLDIWGGEQYELSFKVWQCGGRMVDAPCSRVGHVYRKFVPY 333

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                     G  +  N KRV E W DE +K + Y R P     DMGDIS Q
Sbjct: 334 KVP------AGVNLGKNLKRVAEVWMDE-YKEHLYKRRPHLRKTDMGDISGQ 378


>gi|261244898|ref|NP_778197.2| polypeptide N-acetylgalactosaminyltransferase-like 6 [Mus musculus]
 gi|311103009|gb|ADP69005.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 20 [Mus musculus]
          Length = 601

 Score =  332 bits (852), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 174/352 (49%), Positives = 228/352 (64%), Gaps = 14/352 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GK Y L E  R   D++  E G N+  SN+I+ +R++PD+R   CK+  Y   
Sbjct: 81  RSGKGEHGKPYPLTEEDR--DDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER 138

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLEDY+ RF
Sbjct: 139 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEDYMARF 198

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
           + KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K + 
Sbjct: 199 S-KVRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIV 257

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP  A
Sbjct: 258 CPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESPVMA 313

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR ++PY
Sbjct: 314 GGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKYVPY 373

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                     G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 374 KVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418


>gi|348566779|ref|XP_003469179.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           6-like [Cavia porcellus]
          Length = 601

 Score =  332 bits (850), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 173/355 (48%), Positives = 230/355 (64%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E    + D++  E G N+  SN+I+ +R++PD+R   CK+  Y
Sbjct: 78  EAMRSGKGEHGKPYPLTE--EDSDDSAYRENGFNIFVSNNIALERSLPDIRHTNCKHKMY 135

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L +KLE+Y+
Sbjct: 136 LETLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKEKLEEYV 195

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 196 ARFS-KVRILRTRKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418


>gi|313226886|emb|CBY22031.1| unnamed protein product [Oikopleura dioica]
          Length = 685

 Score =  332 bits (850), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 185/380 (48%), Positives = 243/380 (63%), Gaps = 12/380 (3%)

Query: 2   PVFKADGKLGNLE--PPL-EPYKEGPGEGGKAYH-LPEAYRAAGDASLGEYGMNMETSNH 57
           P +++DGK GN E  P + E   +GPGE G A H LPE      +  +  +G N+  S+ 
Sbjct: 144 PFYRSDGKPGNWEDRPHVDESGHDGPGEHGAAVHTLPEEEEQVKEI-IKTFGFNLVNSDK 202

Query: 58  ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           IS DR   DLR +EC   DYP  LP  SV++VFHNEG+  L+RT HS++ RTP + L EI
Sbjct: 203 ISMDRLPKDLRDKECINIDYPEKLPMVSVVVVFHNEGWGPLVRTFHSVVNRTPPELLGEI 262

Query: 118 ILVDDFS---SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
           +++DD S    K  L   LE+YI+R++GKV+L RN  REGLIR RS GA+ +  EV+VFL
Sbjct: 263 VIIDDGSVIKDKPHLGDPLEEYIKRWDGKVKLYRNARREGLIRARSIGAQHAIFEVLVFL 322

Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR--GIFEWGM 232
           DAHCE G NWLPPL+API  + +I TVP+ID ID Q + F      DH+ R  G +EW  
Sbjct: 323 DAHCEAGYNWLPPLIAPIARNDRISTVPLIDSIDGQRYTFSGQAGGDHNGRAQGGWEWNF 382

Query: 233 LYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELS 292
           L+K   LP++EA+K  + +E Y SP  AGGLFA++R  F  +G YDPGL +WGGE +E+S
Sbjct: 383 LWKRYPLPKKEAEKLSHGTEMYPSPAMAGGLFAINREHFNNVGMYDPGLEIWGGEQYEIS 442

Query: 293 FKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKA 352
           +K+WMCGG + +VPCSR+GHVYR    +      + V       NY+RVIE W+D+  K 
Sbjct: 443 YKLWMCGGGVYFVPCSRVGHVYR-LEGWGGNPPPEYVPSNPSFRNYRRVIEVWWDDWTK- 500

Query: 353 YFYTREPLAMFLDMGDISEQ 372
           YFY   P    L  GDISEQ
Sbjct: 501 YFYWNRPELQKLPYGDISEQ 520


>gi|344288241|ref|XP_003415859.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           [Loxodonta africana]
          Length = 601

 Score =  331 bits (849), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 174/355 (49%), Positives = 228/355 (64%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y
Sbjct: 78  EALRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLEDY+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEDYM 195

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418


>gi|354484373|ref|XP_003504363.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           6-like, partial [Cricetulus griseus]
          Length = 555

 Score =  331 bits (849), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 173/352 (49%), Positives = 229/352 (65%), Gaps = 14/352 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y   
Sbjct: 35  RSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER 92

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLEDY+ RF
Sbjct: 93  LPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIVEIILVDDFSDREHLKDKLEDYMARF 152

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
           + KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K + 
Sbjct: 153 S-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIV 211

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P+ID ID+  + + +  +     RG F+W M YK+  +P     +R   S+P++SP  A
Sbjct: 212 CPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKKIPIPPE--LQRADPSDPFESPVMA 267

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR ++PY
Sbjct: 268 GGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKYVPY 327

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                     G ++  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 328 KVP------SGTILARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 372


>gi|334331052|ref|XP_001372346.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
           partial [Monodelphis domestica]
          Length = 573

 Score =  330 bits (846), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 173/352 (49%), Positives = 229/352 (65%), Gaps = 14/352 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GK Y L E  R   D++  E G N+  SN+I+ +R++PD+R   CK+  Y   
Sbjct: 53  RSGKGEHGKPYPLTEEDR--DDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMYLEK 110

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+ RF
Sbjct: 111 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPDSLIAEIILVDDFSDREHLKDKLEEYMARF 170

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
           + KVR++R  +REGLIRTR  GA  ++GEV+ FLD+HCEV +NWLPPLL  I  +RK + 
Sbjct: 171 S-KVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDSHCEVNVNWLPPLLNQIALNRKTIV 229

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP  A
Sbjct: 230 CPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESPVMA 285

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR ++PY
Sbjct: 286 GGLFAVDRRWFWELGGYDPGLEIWGGEQYEISFKVWMCGGGMFDVPCSRVGHIYRKYVPY 345

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                     G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 346 KVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 390


>gi|395840006|ref|XP_003792861.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           isoform 1 [Otolemur garnettii]
          Length = 601

 Score =  330 bits (846), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 173/355 (48%), Positives = 229/355 (64%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E  R   D++  E G N+  SN+I+ +R++PD+R   CK+  Y
Sbjct: 78  EAMRSGKGEHGKPYPLTEEDR--DDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDRDHLKDKLEEYM 195

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ +VR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 196 ARFS-QVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LRRADPSDPFESP 310

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418


>gi|395840008|ref|XP_003792862.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           isoform 2 [Otolemur garnettii]
          Length = 600

 Score =  330 bits (846), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 173/355 (48%), Positives = 229/355 (64%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E  R   D++  E G N+  SN+I+ +R++PD+R   CK+  Y
Sbjct: 77  EAMRSGKGEHGKPYPLTEEDR--DDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 134

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+
Sbjct: 135 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDRDHLKDKLEEYM 194

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ +VR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 195 ARFS-QVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 253

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 254 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LRRADPSDPFESP 309

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 310 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 369

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 370 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 417


>gi|332217746|ref|XP_003258022.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           [Nomascus leucogenys]
          Length = 601

 Score =  330 bits (846), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 173/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y
Sbjct: 78  EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYM 195

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMLDVPCSRVGHIYRKY 370

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418


>gi|410956556|ref|XP_003984908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           [Felis catus]
          Length = 601

 Score =  330 bits (846), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 173/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y
Sbjct: 78  EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEEYM 195

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418


>gi|296195172|ref|XP_002745263.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           [Callithrix jacchus]
          Length = 601

 Score =  330 bits (845), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 173/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y
Sbjct: 78  EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYM 195

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418


>gi|194018457|ref|NP_001030017.2| polypeptide N-acetylgalactosaminyltransferase-like 6 [Homo sapiens]
 gi|296434516|sp|Q49A17.2|GLTL6_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase-like 6;
           AltName: Full=Polypeptide GalNAc transferase 17;
           Short=GalNAc-T17; Short=pp-GaNTase 17; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 17;
           AltName: Full=Putative polypeptide
           N-acetylgalactosaminyltransferase 17; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 17
 gi|311103007|gb|ADP69004.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 20 [Homo sapiens]
          Length = 601

 Score =  330 bits (845), Expect = 9e-88,   Method: Compositional matrix adjust.
 Identities = 173/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y
Sbjct: 78  EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYM 195

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418


>gi|403295707|ref|XP_003938772.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           [Saimiri boliviensis boliviensis]
          Length = 601

 Score =  330 bits (845), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 173/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y
Sbjct: 78  EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYM 195

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418


>gi|86475571|emb|CAF25036.1| pp-GalNAc-transferase 17 [Homo sapiens]
          Length = 584

 Score =  330 bits (845), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 173/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y
Sbjct: 61  EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 118

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+
Sbjct: 119 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYM 178

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 179 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 237

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 238 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 293

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 294 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 353

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 354 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 401


>gi|109076171|ref|XP_001084788.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 6 isoform 1
           [Macaca mulatta]
 gi|355687723|gb|EHH26307.1| hypothetical protein EGK_16237 [Macaca mulatta]
          Length = 601

 Score =  329 bits (844), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 173/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y
Sbjct: 78  EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYM 195

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418


>gi|109076173|ref|XP_001084905.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 6 isoform 2
           [Macaca mulatta]
          Length = 584

 Score =  329 bits (844), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 173/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y
Sbjct: 61  EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 118

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+
Sbjct: 119 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYM 178

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 179 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 237

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 238 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 293

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 294 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 353

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 354 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 401


>gi|300796651|ref|NP_001178227.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Bos taurus]
          Length = 601

 Score =  329 bits (844), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 172/355 (48%), Positives = 229/355 (64%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y
Sbjct: 78  EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L +KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKEKLEEYM 195

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GD+S Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDLSAQ 418


>gi|149698080|ref|XP_001498934.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 6 [Equus
           caballus]
          Length = 601

 Score =  329 bits (844), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 173/355 (48%), Positives = 227/355 (63%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y
Sbjct: 78  EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEEYM 195

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R   REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 196 ARFS-KVRIVRTKRREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418


>gi|426220611|ref|XP_004004508.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           [Ovis aries]
          Length = 601

 Score =  329 bits (843), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 172/355 (48%), Positives = 229/355 (64%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y
Sbjct: 78  EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L +KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKEKLEEYM 195

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GD+S Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDLSAQ 418


>gi|345790655|ref|XP_543189.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 6 [Canis lupus
           familiaris]
          Length = 601

 Score =  329 bits (843), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 173/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E  R   D++  E G N+  SN I+ +R++PD+R   CK+  Y
Sbjct: 78  EAMRSGKGEHGKPYPLTEEDR--DDSAYRENGFNIFVSNSIALERSLPDIRHANCKHKMY 135

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEEYM 195

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418


>gi|402870847|ref|XP_003899411.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           isoform 1 [Papio anubis]
          Length = 601

 Score =  329 bits (843), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 173/355 (48%), Positives = 227/355 (63%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E      D++  E G N+  SN I+ +R++PD+R   CK+  Y
Sbjct: 78  EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNSIALERSLPDIRHANCKHKMY 135

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYM 195

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418


>gi|402870849|ref|XP_003899412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           isoform 2 [Papio anubis]
          Length = 584

 Score =  329 bits (843), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 173/355 (48%), Positives = 227/355 (63%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E      D++  E G N+  SN I+ +R++PD+R   CK+  Y
Sbjct: 61  EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNSIALERSLPDIRHANCKHKMY 118

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+
Sbjct: 119 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYM 178

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 179 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 237

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 238 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 293

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 294 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 353

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 354 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 401


>gi|114596861|ref|XP_001155128.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 6 isoform 1 [Pan
           troglodytes]
          Length = 601

 Score =  328 bits (841), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 172/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y
Sbjct: 78  EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYM 195

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418


>gi|426346013|ref|XP_004040685.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
           partial [Gorilla gorilla gorilla]
          Length = 555

 Score =  328 bits (840), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 172/355 (48%), Positives = 228/355 (64%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y
Sbjct: 32  EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 89

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+
Sbjct: 90  LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYM 149

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 150 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 208

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 209 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 264

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 265 VMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 324

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 325 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 372


>gi|351699379|gb|EHB02298.1| Polypeptide N-acetylgalactosaminyltransferase-like 6, partial
           [Heterocephalus glaber]
          Length = 522

 Score =  327 bits (839), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 172/350 (49%), Positives = 226/350 (64%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y   LP
Sbjct: 1   GKGEHGKPYPLTE--EDGDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMYLERLP 58

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+ RF+ 
Sbjct: 59  NTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEEYMARFS- 117

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
           KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K +  P
Sbjct: 118 KVRILRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIVCP 177

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP  AGG
Sbjct: 178 MIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESPVMAGG 233

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR ++PY  
Sbjct: 234 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKYVPYKV 293

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 294 P------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 336


>gi|209364560|ref|NP_001129228.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Rattus
           norvegicus]
          Length = 601

 Score =  327 bits (839), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 173/352 (49%), Positives = 225/352 (63%), Gaps = 14/352 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y   
Sbjct: 81  RSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER 138

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLEDY+ RF
Sbjct: 139 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEDYMARF 198

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
              VR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K + 
Sbjct: 199 -PIVRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIV 257

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P+ID ID+  + + +  +     RG F+W M YK   +P     +R   SEP++SP  A
Sbjct: 258 CPMIDVIDHSHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSEPFESPVMA 313

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR ++PY
Sbjct: 314 GGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKYVPY 373

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                     G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 374 KVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418


>gi|224049734|ref|XP_002187605.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 17 isoform
           1 [Taeniopygia guttata]
 gi|449500484|ref|XP_004176221.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 17 isoform
           2 [Taeniopygia guttata]
          Length = 601

 Score =  327 bits (838), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 171/352 (48%), Positives = 227/352 (64%), Gaps = 14/352 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y   
Sbjct: 81  RSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHPNCKHKVYLEA 138

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L +KLE+Y+ RF
Sbjct: 139 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPDSLIAEIILVDDFSDREHLKEKLEEYMLRF 198

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K + 
Sbjct: 199 -AKVRIVRTKKREGLIRTRLLGASLARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIV 257

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP  A
Sbjct: 258 CPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESPVMA 313

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR ++PY
Sbjct: 314 GGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGGMYDVPCSRVGHIYRKYVPY 373

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                     G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 374 KVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418


>gi|118090108|ref|XP_420520.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 6 [Gallus gallus]
          Length = 601

 Score =  327 bits (838), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 171/352 (48%), Positives = 227/352 (64%), Gaps = 14/352 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y   
Sbjct: 81  RSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHPNCKHKVYLEK 138

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L +KLE+Y+ RF
Sbjct: 139 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPDSLIAEIILVDDFSDREHLKEKLEEYMVRF 198

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K + 
Sbjct: 199 -AKVRIVRTKKREGLIRTRLLGASLARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIV 257

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP  A
Sbjct: 258 CPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESPVMA 313

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR ++PY
Sbjct: 314 GGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGGMFDVPCSRVGHIYRKYVPY 373

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                     G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 374 KVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418


>gi|71297071|gb|AAH47551.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 6 [Homo sapiens]
          Length = 601

 Score =  326 bits (835), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 172/355 (48%), Positives = 227/355 (63%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  +   GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y
Sbjct: 78  EAMRSRKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYM 195

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIPLNHK 254

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418


>gi|324510655|gb|ADY44456.1| N-acetylgalactosaminyltransferase 9 [Ascaris suum]
          Length = 577

 Score =  325 bits (834), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 159/364 (43%), Positives = 232/364 (63%), Gaps = 5/364 (1%)

Query: 9   KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           K+ +  P     + GPGE G A HL    +  G+A + ++ MN+  S+ +S DR+IPD R
Sbjct: 59  KMRHKRPDYSKQRSGPGENGAAVHLSGKEKEKGEADMKKWFMNVVASDKLSMDRSIPDTR 118

Query: 69  MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
             EC+   Y  DLP ASV+++F +E ++ L+RTVHS++ R+P   L E+IL+DDFS + +
Sbjct: 119 HAECRSVHYDDDLPSASVVIIFTDEAWTPLLRTVHSVVNRSPLHLLHEVILLDDFSQREE 178

Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
           L  KL++YI+RF G V+LIR  ER GLIR +  GA E+ GEVIVFLD+HCE    WL PL
Sbjct: 179 LKGKLDEYIKRFGGIVKLIRKKERHGLIRAKLAGAHEATGEVIVFLDSHCEANEGWLEPL 238

Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
           LA I   R  +  P+ID I  +T ++    + + +  G F W + ++ + + + E  +RK
Sbjct: 239 LARIKEKRTAVLCPIIDYISAETMQYSG--DANVNAVGGFWWSLHFRWDSIGKAERDRRK 296

Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
              EP +SPT AGGL A +R +FLE+GGYDPG+ +WGGEN E+SF++WMCGGSIE++PCS
Sbjct: 297 SAIEPVRSPTMAGGLLAANREYFLEVGGYDPGMDIWGGENLEISFRVWMCGGSIEFIPCS 356

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
            +GH++R+  PYN       +   +   N KR+ E W D+ +K  +Y   P     D+GD
Sbjct: 357 HVGHIFRAGHPYNMTGPGGNLD--VHGTNSKRLAEVWMDD-YKRLYYLHRPDLKTKDVGD 413

Query: 369 ISEQ 372
           +SE+
Sbjct: 414 LSER 417


>gi|291385920|ref|XP_002709516.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           [Oryctolagus cuniculus]
          Length = 601

 Score =  325 bits (834), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 171/355 (48%), Positives = 227/355 (63%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y
Sbjct: 78  EAMRSGKGEHGKPYPLTE--EDHDDSAYKENGFNIFVSNNIALERSLPDIRHANCKHKMY 135

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDRDHLKDKLEEYM 195

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            RF+ KVR++R  +REGLIRTR  GA  + GEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMAGGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESP 310

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLF++DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 311 VMAGGLFSVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKY 370

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 371 VPYKVP------SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 418


>gi|47221376|emb|CAF97294.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 675

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 169/352 (48%), Positives = 224/352 (63%), Gaps = 14/352 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GK + + +A R   D +  E G N+  S+ IS +R+IPD+R   CK   Y   
Sbjct: 158 RSGNGEQGKPFPMTDADRV--DQAYRENGFNIYVSDRISLNRSIPDIRHPNCKQKLYAEK 215

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  SVI+ FHNEG+SSL+RTVHS++ R+P Q + EIILVDDFS +  L Q LE+Y+ R 
Sbjct: 216 LPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEIILVDDFSDREHLKQPLEEYMVRL 275

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             KVR++R  +REGLIRTR  GA  ++GEVI FLD+HCE  +NWLPPLL  I  +RK + 
Sbjct: 276 -PKVRILRTKKREGLIRTRLLGATAAKGEVITFLDSHCEANVNWLPPLLDRIAQNRKTIV 334

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P+ID ID+  + + +  +     RG F+W M YK   +P    K+    SEP++SP  A
Sbjct: 335 CPMIDVIDHDNFGYET--QAGDAMRGAFDWEMYYKRIPIPPELQKEDP--SEPFESPVMA 390

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 391 GGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGCMEDIPCSRVGHIYRKYVPY 450

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                     G  +  N KRV E W DE +  Y Y R P    L  GD + Q
Sbjct: 451 KVP------GGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDTAAQ 495


>gi|397506054|ref|XP_003823551.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
           partial [Pan paniscus]
          Length = 518

 Score =  325 bits (832), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 170/348 (48%), Positives = 225/348 (64%), Gaps = 14/348 (4%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
           GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y   LP  
Sbjct: 2   GEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMYLERLPNT 59

Query: 85  SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 144
           S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+ RF+ KV
Sbjct: 60  SIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYMARFS-KV 118

Query: 145 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 204
           R++R  +REGLIRTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K +  P+I
Sbjct: 119 RIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIVCPMI 178

Query: 205 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLF 264
           D ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP  AGGLF
Sbjct: 179 DVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESPVMAGGLF 234

Query: 265 AMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGK 324
           A++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR ++PY    
Sbjct: 235 AVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKYVPYKVP- 293

Query: 325 LADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                 G  +  N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 294 -----SGTSLARNLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 335


>gi|432901709|ref|XP_004076908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Oryzias latipes]
          Length = 677

 Score =  324 bits (831), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 169/352 (48%), Positives = 224/352 (63%), Gaps = 14/352 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GKA+ L +A R   D +  E G N+  S+ IS +R++PD+R   CK   Y   
Sbjct: 158 RSGNGEQGKAFPLTDADRV--DQAYRENGFNIFVSDRISLNRSVPDIRHPNCKQKLYAER 215

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  S+I+ FHNEG+SSL+RTVHS++ R+P Q + EIILVDDFS K  L   LE+Y+ R 
Sbjct: 216 LPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEIILVDDFSDKDHLKGALEEYMVRL 275

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             KVR++R  +REGLIRTR  GA  ++GEVI FLD+HCE  +NWLPPLL  I  +RK + 
Sbjct: 276 P-KVRILRTKKREGLIRTRLLGAAAAKGEVITFLDSHCEANINWLPPLLDRIALNRKTIV 334

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P+ID ID+  + + +  +     RG F+W M YK   +P    K     SEP++SP  A
Sbjct: 335 CPMIDVIDHDNFGYET--QAGDAMRGAFDWEMYYKRIPIPAELQKNDP--SEPFESPVMA 390

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 391 GGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPY 450

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                     G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 451 KV------PGGVSLARNLKRVAEVWMDE-YAEYVYQRRPEYRHLSAGDVAAQ 495


>gi|301607546|ref|XP_002933365.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           6-like isoform 1 [Xenopus (Silurana) tropicalis]
          Length = 600

 Score =  324 bits (830), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 171/355 (48%), Positives = 225/355 (63%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E      D    E G N+  SN I+  R++PD+R   CK+  Y
Sbjct: 77  EALRSGKGEHGKPYPLTE--EEQDDTVYRENGFNIFVSNKIALARSLPDIRHPNCKHKLY 134

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HS+I RTP   +EE+ILVDDFS +  L +KLE+Y+
Sbjct: 135 LERLPNTSIIIPFHNEGWTSLLRTIHSVINRTPDSLIEEMILVDDFSDREHLREKLEEYM 194

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
             +  KVR++R  +REGLIRTR  GA  ++GEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 195 AYY-PKVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 253

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   SEP++SP
Sbjct: 254 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRTDPSEPFESP 309

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +ELSFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 310 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYELSFKVWMCGGEMFDVPCSRVGHIYRKY 369

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE +  Y Y R P    L  GDIS Q
Sbjct: 370 VPYKVP------TGTSLARNLKRVAETWMDE-YAEYIYQRRPEYRHLSTGDISSQ 417


>gi|301607548|ref|XP_002933366.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           6-like isoform 2 [Xenopus (Silurana) tropicalis]
          Length = 601

 Score =  323 bits (829), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 171/355 (48%), Positives = 225/355 (63%), Gaps = 14/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK Y L E      D    E G N+  SN I+  R++PD+R   CK+  Y
Sbjct: 78  EALRSGKGEHGKPYPLTE--EEQDDTVYRENGFNIFVSNKIALARSLPDIRHPNCKHKLY 135

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+ FHNEG++SL+RT+HS+I RTP   +EE+ILVDDFS +  L +KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSVINRTPDSLIEEMILVDDFSDREHLREKLEEYM 195

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
             +  KVR++R  +REGLIRTR  GA  ++GEV+ FLD+HCEV +NWLPPLL  I  + K
Sbjct: 196 AYY-PKVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID+  + + +  +     RG F+W M YK   +P     +R   SEP++SP
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRTDPSEPFESP 310

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+DR +F ELGGYDPGL +WGGE +ELSFK+WMCGG +  VPCSR+GH+YR +
Sbjct: 311 VMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYELSFKVWMCGGEMFDVPCSRVGHIYRKY 370

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +PY          G  +  N KRV ETW DE +  Y Y R P    L  GDIS Q
Sbjct: 371 VPYKVP------TGTSLARNLKRVAETWMDE-YAEYIYQRRPEYRHLSTGDISSQ 418


>gi|432901498|ref|XP_004076865.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Oryzias latipes]
          Length = 607

 Score =  323 bits (829), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 170/354 (48%), Positives = 231/354 (65%), Gaps = 18/354 (5%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GK + + E  R   D +  E G N+  SN IS +R++PD+R E C+   Y   
Sbjct: 88  RAGNGEQGKPFPVTETDRV--DQAYRENGFNIYVSNRISLNRSLPDIRHENCRQKLYAEK 145

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  ++I+ FHNEG+SSL+RTVHS+I R+P + + EIILVDDFS K  L   LE+Y++RF
Sbjct: 146 LPNTTIIIPFHNEGWSSLLRTVHSVINRSPPRLVAEIILVDDFSDKEHLKVALEEYMKRF 205

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             KVR++R  +REGLIRTR  GA  ++GEVI FLD+HCE  +NWLPPLL  I  +RK + 
Sbjct: 206 P-KVRILRTKKREGLIRTRLLGAGAAKGEVITFLDSHCEANVNWLPPLLDRIVQNRKTIV 264

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
            P+ID ID+  + + +  +     RG F+W M YK   +P   A+ R  + +EP++SP  
Sbjct: 265 CPMIDVIDHDNFGYDT--QAGDAMRGAFDWEMYYKRIPIP---AEMRTDDPTEPFESPVM 319

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++P
Sbjct: 320 AGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVP 379

Query: 320 YNFGKLADRVKGPL-ITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           Y       +V G + +  N KRV E W DE +  Y Y R P    L  GD+S Q
Sbjct: 380 Y-------KVPGGISLAKNLKRVAEVWMDE-YAEYVYQRRPEYRHLSAGDMSAQ 425


>gi|327278031|ref|XP_003223766.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           6-like [Anolis carolinensis]
          Length = 602

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 168/352 (47%), Positives = 227/352 (64%), Gaps = 14/352 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GK Y L E      D++  E G N+  SN+I+ +R++PD+R   CK+  Y   
Sbjct: 81  RSGKGEQGKPYPLTE--EDNDDSAYRENGFNIFVSNNIALERSLPDIRHPNCKHKVYLEK 138

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  S+I+ FHNEG++SL+RT+HSII RTP   + EIILVDDFS +  L +KLE+Y+ RF
Sbjct: 139 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPNSLIAEIILVDDFSDREHLKEKLEEYMARF 198

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             KVR++R  +REGLIRTR  GA  ++GEV+ FLD+HCEV +NWLPPLL  I  + K + 
Sbjct: 199 -VKVRIVRTKKREGLIRTRLLGASIAKGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIV 257

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP  A
Sbjct: 258 CPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRTDPSDPFESPVMA 313

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA++R +F +LGGYDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR ++PY
Sbjct: 314 GGLFAVNRKWFWDLGGYDPGLEIWGGEQYEISFKVWMCGGGMFDVPCSRVGHIYRKYVPY 373

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                     G  +  N KRV ETW DE    Y Y R P    L  GD+S Q
Sbjct: 374 KVP------SGTSLARNLKRVAETWMDE-FAEYVYQRRPEYRHLSTGDLSAQ 418


>gi|402593617|gb|EJW87544.1| glycosyltransferase [Wuchereria bancrofti]
          Length = 520

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 166/361 (45%), Positives = 235/361 (65%), Gaps = 13/361 (3%)

Query: 15  PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
           P     + GPGEGG   +L    +  G+A + ++ MN+  S+ IS DR++PD R ++C+ 
Sbjct: 6   PDYSKKRIGPGEGGTGVYLTGKQKVQGEADMKKWFMNVVASDLISLDRSLPDRRHKQCRK 65

Query: 75  WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
             Y  DLP ASV+++F +E +S LMRTVHS+I RTP + L+EIILVDDFS + +L  KLE
Sbjct: 66  ISYSDDLPVASVVIIFTDEAWSPLMRTVHSVINRTPLKLLQEIILVDDFSQRDELKGKLE 125

Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           +YI+RF  KVRL+R  ER+GLIR +  GAKE+ G+V+VFLD+HCEVG  WL PLLA I  
Sbjct: 126 EYIKRFGDKVRLVRAPERQGLIRAKLLGAKEAVGDVLVFLDSHCEVGEGWLEPLLARIKD 185

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
            R  +  P+I+ I  +T  + +   P H   G F W + ++ + +P+  +      +EP 
Sbjct: 186 KRSAVLCPIINHISPETLTYSANDRPAH--VGGFWWSLHFRWDPMPKEYSDADP--TEPI 241

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           +SPT AGGL A+DR +F E+GGYDP + +WGGEN E+SF++WMCGGS+E++PCS +GH++
Sbjct: 242 RSPTMAGGLLAVDRLYFFEVGGYDPEMDIWGGENLEMSFRVWMCGGSVEFIPCSHVGHIF 301

Query: 315 RSFMPYNF---GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           R+  PYN    G   D V G     N KR+ E W D+  K Y+  R  L    D+GD+SE
Sbjct: 302 RAGHPYNMIGPGNNKD-VHGT----NSKRLAEVWMDDYKKFYYIHRLDLKE-KDVGDLSE 355

Query: 372 Q 372
           +
Sbjct: 356 R 356


>gi|410914862|ref|XP_003970906.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Takifugu rubripes]
          Length = 600

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 168/353 (47%), Positives = 230/353 (65%), Gaps = 16/353 (4%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GKA+ L ++ R   D +  E G N+  S+ IS +R++PD+R  +CK   Y   
Sbjct: 81  RTGNGEQGKAFPLTDSDRV--DQAYRENGFNIYISDRISLNRSLPDIRHADCKQKLYAEK 138

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  SVI+ FHNEG+SSL+RTVHS++ R+P Q + E+ILVDDFS K  L   LE+Y++R 
Sbjct: 139 LPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPQLIAELILVDDFSDKEHLKVPLEEYMKRM 198

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             KVR++R  +REGLIRTR  GA  ++GEVI FLD+HCE  +NWLPPLL  I  +RK + 
Sbjct: 199 -PKVRILRTKKREGLIRTRLLGASAAKGEVITFLDSHCEANVNWLPPLLDRIAQNRKSIV 257

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P+ID ID+  + + +  +     RG F+W M YK   +P     +R   S+P++SP  A
Sbjct: 258 CPMIDVIDHDNFGYDT--QAGDAMRGAFDWEMYYKRIPIPAE--MQRDDPSQPFESPVMA 313

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 314 GGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPY 373

Query: 321 NFGKLADRVKGPL-ITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                  +V G + +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 374 -------KVPGGISLAKNLKRVAEVWMDE-YAEYVYQRRPEYRHLSAGDMTPQ 418


>gi|348533009|ref|XP_003453998.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Oreochromis niloticus]
          Length = 600

 Score =  320 bits (821), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 167/353 (47%), Positives = 229/353 (64%), Gaps = 16/353 (4%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GK + L E  R   D +  E G N+  S+ IS +R++PD+R E C+   Y   
Sbjct: 81  RMGNGEQGKPFPLTENDRV--DQAYRENGFNIYVSDRISLNRSLPDIRHENCRQKLYAEK 138

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  S+I+ FHNEG+SSL+RTVHS++ R+P++ + E+ILVDDFS K  L   LE+Y++R 
Sbjct: 139 LPNTSIIIPFHNEGWSSLLRTVHSVLNRSPSRLITEVILVDDFSDKEHLKVALEEYMKRM 198

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             KVR++R  +REGLIRTR  GA  ++GEVI FLD+HCE  +NWLPPLL  I  +RK + 
Sbjct: 199 -PKVRILRTKKREGLIRTRLLGAAAAKGEVITFLDSHCEANVNWLPPLLDRIAQNRKAIV 257

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P+ID ID+  + + +  +     RG F+W M YK   +P     +R   SEP++SP  A
Sbjct: 258 CPMIDVIDHDNFGYDT--QAGDAMRGAFDWEMYYKRIPIPPE--MQRDDPSEPFESPVMA 313

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 314 GGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKLWMCGGRMEDIPCSRVGHIYRKYVPY 373

Query: 321 NFGKLADRVKGPL-ITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                  +V G + +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 374 -------KVPGGISLAKNLKRVAEVWMDE-YAEYVYQRRPEYRHLSAGDMTAQ 418


>gi|170582702|ref|XP_001896248.1| glycosyl transferase, group 2 family protein [Brugia malayi]
 gi|158596593|gb|EDP34915.1| glycosyl transferase, group 2 family protein [Brugia malayi]
          Length = 520

 Score =  320 bits (820), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 164/360 (45%), Positives = 231/360 (64%), Gaps = 11/360 (3%)

Query: 15  PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
           P     + GPGE G   +L    +  G+A + ++ MN+  S+ IS DR++PD R ++C  
Sbjct: 6   PDYSKKRIGPGEDGTGVYLTGKQKVQGEADMKKWFMNLVASDLISLDRSLPDHRHKQCHK 65

Query: 75  WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
             Y  DLP ASV+++F +E +S LMRTVHS+I RTP + L+EIILVDDFS + +L +KLE
Sbjct: 66  ISYSDDLPVASVVIIFTDEAWSPLMRTVHSVINRTPLKLLQEIILVDDFSQRDELKEKLE 125

Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           +YI+RF  KVRL+R  ER+GLIR +  GAKE+ G+V+VFLD+HCEVG  WL PLLA I  
Sbjct: 126 EYIKRFGNKVRLVRALERQGLIRAKLLGAKEAVGDVLVFLDSHCEVGEGWLEPLLARIKD 185

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
            R  +  P+I+ I  +T  + +   P +   G F W + +  + +P+         +EP 
Sbjct: 186 KRSAVLCPIINHISAETLTYSANDRPTN--VGGFSWSLHFLWDPMPKEYFDADP--TEPI 241

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           +SPT AGGL A+DR++F E+GGYDP + +WGGEN E+SF++WMCGGSIE++PCS +GH++
Sbjct: 242 RSPTMAGGLLAVDRSYFFEVGGYDPKMDIWGGENLEMSFRVWMCGGSIEFIPCSHVGHIF 301

Query: 315 RSFMPYNFGKLADR--VKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           R   PYN     D   V G     N KR+ E W D+  K Y+  R  L    D+GD+SE+
Sbjct: 302 RDGHPYNMIGPGDNKDVHGT----NSKRLAEVWMDDYKKFYYIHRLDLK-GKDVGDLSER 356


>gi|410914790|ref|XP_003970870.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
           partial [Takifugu rubripes]
          Length = 552

 Score =  320 bits (819), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 167/352 (47%), Positives = 224/352 (63%), Gaps = 14/352 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GKA+ + +A R   D +  E G N+  S+ IS +R++PD+R   CK   Y   
Sbjct: 33  RSGNGEQGKAFPMTDADRV--DQAYRENGFNIYVSDRISLNRSVPDIRHPNCKQKLYAEK 90

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  SVI+ FHNEG+SSL+RTVHS++ R+P Q + E+ILVDDFS K  L   L++Y+ R 
Sbjct: 91  LPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEVILVDDFSDKEHLKVPLDEYMVRL 150

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             KVR++R  +REGLIRTR  GA  ++GEVI FLD+HCE  +NWLPPLL  I  +RK + 
Sbjct: 151 -PKVRILRTKKREGLIRTRLLGAARAKGEVITFLDSHCEANVNWLPPLLDRIAQNRKTIV 209

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P+ID ID+  + + +  +     RG F+W M YK   +P    K+    SEP++SP  A
Sbjct: 210 CPMIDVIDHDNFGYET--QAGDAMRGAFDWEMYYKRIPIPLELQKEDP--SEPFESPVMA 265

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E  PCSR+GH+YR ++PY
Sbjct: 266 GGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDTPCSRVGHIYRKYVPY 325

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                     G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 326 KVP------GGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLAAGDMAVQ 370


>gi|296193322|ref|XP_002744461.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Callithrix jacchus]
          Length = 667

 Score =  319 bits (818), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 152 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 209

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 210 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 269

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 270 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 328

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 329 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 384

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 385 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 444

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 445 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVTAQ 487


>gi|402873191|ref|XP_003900469.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Papio
           anubis]
          Length = 637

 Score =  319 bits (818), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 122 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 179

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 180 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 239

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 240 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 298

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 299 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 354

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 355 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 414

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 415 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAAQ 457


>gi|348533011|ref|XP_003453999.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Oreochromis niloticus]
          Length = 587

 Score =  319 bits (818), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 165/352 (46%), Positives = 223/352 (63%), Gaps = 14/352 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GK + L +A R   D +  E G N+  S+ IS +R++PD+R   CK+  Y   
Sbjct: 68  RSGNGEQGKPFPLTDADRV--DQAYRENGFNIYVSDRISLNRSVPDIRHPNCKHKLYAEK 125

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  ++I+ FHNEG+SSL+RTVHS++ R+P   + EIILVDDFS K  L   LE+Y+ R 
Sbjct: 126 LPNTTIIIPFHNEGWSSLLRTVHSVLNRSPPHLIAEIILVDDFSDKEHLKVALEEYMVRL 185

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             KVR++R  +REGLIRTR  GA  ++GEV+ FLD+HCE  +NWLPPLL  I  +RK + 
Sbjct: 186 -PKVRILRTKKREGLIRTRLLGAAAAKGEVLTFLDSHCEANVNWLPPLLDRIAQNRKTIV 244

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P+ID ID+  + + +  +     RG F+W M YK   +P    K     SEP++SP  A
Sbjct: 245 CPMIDVIDHDNFGYET--QAGDAMRGAFDWEMYYKRIPIPTELQKDDP--SEPFESPVMA 300

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 301 GGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPY 360

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                     G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 361 KVP------GGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDMTVQ 405


>gi|196001847|ref|XP_002110791.1| hypothetical protein TRIADDRAFT_22565 [Trichoplax adhaerens]
 gi|190586742|gb|EDV26795.1| hypothetical protein TRIADDRAFT_22565 [Trichoplax adhaerens]
          Length = 556

 Score =  319 bits (818), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 164/371 (44%), Positives = 222/371 (59%), Gaps = 17/371 (4%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
           RPVF+          P       PGE G+   +P+ Y+   +        N   S+ IS 
Sbjct: 39  RPVFQP-------ALPQNHKPAAPGEYGRPVDVPKEYQQLSEELFQRNHFNQWVSDRISL 91

Query: 61  DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
            RT+PD R E CK   YP+DLP  SV++VF+NE +S+LMRTVHS++ R+P   L E+ILV
Sbjct: 92  QRTLPDPRPEMCKSMTYPVDLPSTSVVIVFYNEAWSTLMRTVHSVLDRSPPDLLHEVILV 151

Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           DD  S  +L Q LE+Y+++ + KVRL RN++REGLIR R RG +++   ++ FLDAHCEV
Sbjct: 152 DD--SSDELHQPLEEYVRQLD-KVRLHRNSQREGLIRARLRGLEQTSAPIVTFLDAHCEV 208

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
            + WL PLL  I+ DR  +  P ID ID   + ++  Y P    RG F W + +K +  P
Sbjct: 209 TIGWLEPLLNRIHQDRTTVVCPEIDSIDLNNFAYK--YGPSGVLRGTFNWDLSFKWSIAP 266

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
             E  +R   ++P +SPT AGGLFA+DR +FLELG YD GL +WG EN ELSFK+W CGG
Sbjct: 267 TSERLRRTSATDPMRSPTMAGGLFAIDREYFLELGTYDRGLEIWGAENMELSFKVWQCGG 326

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
            +E +PCS +GHV+R   PY+       +       NY+RV E W D+ +K +FY R P 
Sbjct: 327 KLEIIPCSHVGHVFREVQPYDTSVSLHSIANK----NYQRVAEVWMDD-YKKFFYQRHPY 381

Query: 361 AMFLDMGDISE 371
                 GDISE
Sbjct: 382 LTDQSFGDISE 392


>gi|403285674|ref|XP_003934138.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Saimiri boliviensis boliviensis]
          Length = 682

 Score =  319 bits (817), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 167 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKHYLETLP 224

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 225 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 284

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 285 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 343

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 344 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 399

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 400 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 459

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 460 ------PAGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVTAQ 502


>gi|395504936|ref|XP_003756802.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Sarcophilus harrisii]
          Length = 651

 Score =  319 bits (817), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 224/350 (64%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE GK Y + +A R   D +  E G N+  S+ I+ +R++PD+R   C    Y   LP
Sbjct: 133 GNGEQGKPYPITDAERV--DQAYRENGFNIFVSDKIALNRSLPDIRHPNCNSKLYLEKLP 190

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P Q + EI+LVDDFS +  L ++LEDY+ +F  
Sbjct: 191 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPQLVAEIVLVDDFSDREHLKKRLEDYMAQF-P 249

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I S+RK +  P
Sbjct: 250 NVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIASNRKTIVCP 309

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID   + +++  +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 310 MIDVIDNDHFGYKT--QAGDAMRGAFDWEMYYKRIPIPLELQKSDP--SDPFESPVMAGG 365

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 366 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYIPYKI 425

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 426 P------TGVSLARNLKRVAEVWMDE-YAEYIYQRLPEYRHLSTGDVTAQ 468


>gi|297477445|ref|XP_002689374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Bos
           taurus]
 gi|296485129|tpg|DAA27244.1| TPA: polypeptide N-acetylgalactosaminyltransferase 10-like [Bos
           taurus]
          Length = 620

 Score =  319 bits (817), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 165/350 (47%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE GK + L  A R   D +  E G N+  S+ IS +R++PD+R   CK   Y   LP
Sbjct: 105 GDGEQGKPFPLTYAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCKSKRYLETLP 162

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 163 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 222

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 223 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 281

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 282 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 337

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 338 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 397

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  + Y R P    L  GD++ Q
Sbjct: 398 P------AGVSLARNLKRVAEVWMDE-YAEHIYQRRPEYRHLSAGDVTAQ 440


>gi|345799489|ref|XP_546283.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Canis
           lupus familiaris]
          Length = 603

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 163/350 (46%), Positives = 222/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 88  GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P++ + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPSELIAEIVLVDDFSDREHLKKPLEDYMALFPS 205

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  + Y R P    L  GD++ Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEHIYQRRPEYRHLSAGDVAAQ 423


>gi|312087698|ref|XP_003145574.1| glycosyl transferase [Loa loa]
 gi|307759263|gb|EFO18497.1| glycosyl transferase [Loa loa]
          Length = 520

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 162/360 (45%), Positives = 233/360 (64%), Gaps = 11/360 (3%)

Query: 15  PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
           P     + GPGE G   +L    +  G+A + ++ MN+  S+ IS DR++PD R E+C+ 
Sbjct: 6   PDYSKKRTGPGEDGSGVYLTGKQKVRGEADMKKWFMNLVASDMISLDRSLPDHRHEQCRK 65

Query: 75  WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
            +YP +LP ASV+++F +E +S LMRTVHS+I RTP + L+EIILVDDFS + DL  +LE
Sbjct: 66  INYPDNLPVASVVIIFTDEAWSPLMRTVHSVINRTPFKLLQEIILVDDFSQRDDLKGRLE 125

Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           +YI+RF  KVRLIR  ER+GLIR +  GAKE+ G+V++FLD+HCEV   WL PLLA I  
Sbjct: 126 EYIKRFGNKVRLIRARERQGLIRAKLLGAKEAIGDVLIFLDSHCEVSEGWLEPLLARIKE 185

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
           +R ++  P+ID I  +T  +       +   G F W + ++ + LPE         ++P 
Sbjct: 186 NRSVVLCPIIDHISAETLAYSGSDRLAN--VGGFWWSLHFRWDPLPEEYYGIDP--TKPI 241

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           +SPT AGGLFA+DR +F E+GGYDP + +WGGEN E+SF++WMCGG IE++PCS +GH++
Sbjct: 242 RSPTMAGGLFAVDRLYFFEVGGYDPKMDIWGGENLEISFRVWMCGGGIEFIPCSHVGHIF 301

Query: 315 RSFMPYNFGKLADR--VKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           R+  PYN     +   V G     N KR+ E W D+  + Y+  R  L    ++GD+SE+
Sbjct: 302 RAGHPYNMTGPGNNEDVHGT----NSKRLAEVWMDDYKRFYYIHRSDLKE-KNVGDLSER 356


>gi|18543347|ref|NP_570098.1| polypeptide N-acetylgalactosaminyltransferase 10 [Rattus
           norvegicus]
 gi|51315730|sp|Q925R7.1|GLT10_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
           AltName: Full=Polypeptide GalNAc transferase 10;
           Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 10;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 10
 gi|14150450|gb|AAK54498.1|AF241241_1 UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase T9 [Rattus
           norvegicus]
 gi|149052685|gb|EDM04502.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 [Rattus norvegicus]
          Length = 603

 Score =  318 bits (815), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 165/350 (47%), Positives = 220/350 (62%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE GK Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 88  GNGEQGKPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLETLP 145

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD+  Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVVAQ 423


>gi|109079467|ref|XP_001111603.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           isoform 5 [Macaca mulatta]
          Length = 603

 Score =  318 bits (815), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 88  GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 381 ------PAGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAAQ 423


>gi|350594474|ref|XP_003134177.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Sus
           scrofa]
          Length = 624

 Score =  318 bits (815), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE GK Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 109 GNGEQGKPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLEMLP 166

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 167 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALF-P 225

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 226 NVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 285

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 286 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 341

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 342 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 401

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  + Y R P    L  GD++ Q
Sbjct: 402 P------AGVSLARNLKRVAEVWMDE-YAEHIYQRRPEYRHLSAGDVAAQ 444


>gi|449679600|ref|XP_004209371.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
           partial [Hydra magnipapillata]
          Length = 565

 Score =  318 bits (815), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 163/361 (45%), Positives = 230/361 (63%), Gaps = 15/361 (4%)

Query: 14  EPPLEPYKEGPGEGGKAYHL-PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           EP     +  PGE G A    PE Y    + +   YG N  TS+ ISF+R++PD R +EC
Sbjct: 37  EPTGISNQSSPGEQGIAVVTSPEDY-GKRNQAYTLYGFNQFTSDKISFNRSLPDPRPQEC 95

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   Y   LP  SV+++FHNEG+S+L+RTVHS++ R+P++ L EIIL DD+S K  L ++
Sbjct: 96  KITKYQSRLPTVSVVIIFHNEGWSTLLRTVHSVLNRSPSKLLHEIILCDDYSQKEHLKKQ 155

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LEDYI  +  K++L+R +EREGLIR R  GA  + G++I+FLD+HCE  + WLPPL++ I
Sbjct: 156 LEDYIIPY-PKIKLVRTSEREGLIRARVHGANHANGDIIIFLDSHCEANVGWLPPLVSEI 214

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 252
             + + +T P +D ID+ ++ +R V   D + RG F W   YKE  + E +   RK  +E
Sbjct: 215 EKNYRCVTCPTVDFIDHDSFYYRGV---DPYIRGTFNWRFDYKERGITEHQKAARKSVTE 271

Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
             +SP  AGGLFA+ + F+ ELG YDPG+ VWGGE +E+SFK+WMCGG +  +PCSR+GH
Sbjct: 272 GVRSPVMAGGLFAISKKFWEELGKYDPGMYVWGGEQYEISFKLWMCGGEMLNMPCSRVGH 331

Query: 313 VYRSFMPYNFGKLADRVKGPLITY-NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           VYR  +PY + K       P  +  N+KRV E W DE  K + Y   P+    + G+ISE
Sbjct: 332 VYRRNVPYTYNK-------PFASLINFKRVAEVWMDE-FKEFLYRGNPMVRSQNAGNISE 383

Query: 372 Q 372
           +
Sbjct: 384 R 384


>gi|47847466|dbj|BAD21405.1| mFLJ00205 protein [Mus musculus]
          Length = 634

 Score =  318 bits (815), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 165/350 (47%), Positives = 220/350 (62%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE GK Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 119 GYGEQGKPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLETLP 176

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 177 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 236

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 237 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 295

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 296 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 351

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 352 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 411

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD+  Q
Sbjct: 412 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVVAQ 454


>gi|194669011|ref|XP_001788574.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Bos
           taurus]
          Length = 652

 Score =  318 bits (814), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 165/350 (47%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE GK + L  A R   D +  E G N+  S+ IS +R++PD+R   CK   Y   LP
Sbjct: 137 GDGEQGKPFPLTYAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCKSKRYLETLP 194

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 195 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 254

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 255 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 313

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 314 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 369

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 370 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 429

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  + Y R P    L  GD++ Q
Sbjct: 430 P------AGVSLARNLKRVAEVWMDE-YAEHIYQRRPEYRHLSAGDVTAQ 472


>gi|149726707|ref|XP_001501206.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Equus
           caballus]
          Length = 561

 Score =  318 bits (814), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 163/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 46  GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 103

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 104 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 163

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 164 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 222

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 223 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 278

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 279 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 338

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  + Y R P    L  GD++ Q
Sbjct: 339 P------AGVSLARNLKRVAEVWMDE-YAEHIYQRRPEYRHLSAGDVAAQ 381


>gi|355691777|gb|EHH26962.1| hypothetical protein EGK_17053, partial [Macaca mulatta]
 gi|355750353|gb|EHH54691.1| hypothetical protein EGM_15579, partial [Macaca fascicularis]
          Length = 551

 Score =  318 bits (814), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 36  GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 93

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 94  NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 153

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 154 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 212

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 213 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 268

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 269 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 328

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 329 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAAQ 371


>gi|109079473|ref|XP_001111560.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           isoform 4 [Macaca mulatta]
          Length = 602

 Score =  318 bits (814), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 87  GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 144

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 145 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 204

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 205 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 263

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 264 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 319

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 320 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 379

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 380 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAAQ 422


>gi|410255362|gb|JAA15648.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
           troglodytes]
 gi|410303020|gb|JAA30110.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
           troglodytes]
 gi|410355291|gb|JAA44249.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
           troglodytes]
          Length = 603

 Score =  318 bits (814), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 88  GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAVQ 423


>gi|148675838|gb|EDL07785.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 [Mus musculus]
          Length = 603

 Score =  318 bits (814), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 165/350 (47%), Positives = 220/350 (62%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE GK Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 88  GYGEQGKPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLETLP 145

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD+  Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVVAQ 423


>gi|119389148|pdb|2D7I|A Chain A, Crsytal Structure Of Pp-Galnac-T10 With Udp, Galnac And
           Mn2+
 gi|119389151|pdb|2D7R|A Chain A, Crystal Structure Of Pp-galnac-t10 Complexed With
           Galnac-ser On Lectin Domain
          Length = 570

 Score =  318 bits (814), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 55  GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 112

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 113 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 172

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 173 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 231

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 232 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 287

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 288 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 347

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 348 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAVQ 390


>gi|410949405|ref|XP_003981412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Felis
           catus]
          Length = 603

 Score =  318 bits (814), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 163/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 88  GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 205

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  + Y R P    L  GD++ Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEHIYQRRPEYRHLSAGDVAAQ 423


>gi|380800197|gb|AFE71974.1| polypeptide N-acetylgalactosaminyltransferase 10, partial [Macaca
           mulatta]
          Length = 565

 Score =  318 bits (814), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 50  GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 107

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 108 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 167

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 168 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 226

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 227 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 282

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 283 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 342

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 343 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAVQ 385


>gi|38195091|ref|NP_938080.1| polypeptide N-acetylgalactosaminyltransferase 10 [Homo sapiens]
 gi|51315962|sp|Q86SR1.2|GLT10_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
           AltName: Full=Polypeptide GalNAc transferase 10;
           Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 10;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 10
 gi|25809274|emb|CAD44532.1| polypeptide N-acetylgalactosaminyltransferase 10 [Homo sapiens]
 gi|151556534|gb|AAI48616.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 (GalNAc-T10)
           [synthetic construct]
 gi|157169754|gb|AAI53182.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 (GalNAc-T10)
           [synthetic construct]
 gi|193785288|dbj|BAG54441.1| unnamed protein product [Homo sapiens]
 gi|261858046|dbj|BAI45545.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 [synthetic
           construct]
          Length = 603

 Score =  317 bits (813), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 88  GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAVQ 423


>gi|28268676|dbj|BAC56890.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 [Homo sapiens]
          Length = 603

 Score =  317 bits (813), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 88  GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAVQ 423


>gi|431918071|gb|ELK17299.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Pteropus alecto]
          Length = 582

 Score =  317 bits (813), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 163/350 (46%), Positives = 220/350 (62%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ I+ +R++PD+R   C    Y   LP
Sbjct: 67  GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKIALNRSLPDIRHPNCNNKRYLETLP 124

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P Q + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 125 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEIVLVDDFSDREHLKKPLEDYMAHFPS 184

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 185 -VRILRTKKREGLIRTRMLGASAASGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 243

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 244 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 299

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 300 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 359

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE    + Y R P    L  GD++ Q
Sbjct: 360 ------PAGVSLARNLKRVAEVWMDE-FAEHIYQRRPEYRHLSAGDVAAQ 402


>gi|410039926|ref|XP_518048.4| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Pan
           troglodytes]
          Length = 551

 Score =  317 bits (813), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 36  GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 93

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 94  NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 153

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 154 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 212

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 213 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 268

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 269 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 328

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 329 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAVQ 371


>gi|281345023|gb|EFB20607.1| hypothetical protein PANDA_005411 [Ailuropoda melanoleuca]
          Length = 551

 Score =  317 bits (813), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 163/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 36  GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNGKRYLETLP 93

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 94  NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 153

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 154 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIAQNRKTIVCP 212

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 213 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 268

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 269 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 328

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  + Y R P    L  GD++ Q
Sbjct: 329 P------AGVSLARNLKRVAEVWMDE-YAEHIYQRRPEYRHLSAGDVAAQ 371


>gi|46877107|ref|NP_598950.2| polypeptide N-acetylgalactosaminyltransferase 10 [Mus musculus]
 gi|51315866|sp|Q6P9S7.1|GLT10_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
           AltName: Full=Polypeptide GalNAc transferase 10;
           Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 10;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 10
 gi|38148689|gb|AAH60617.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 [Mus musculus]
 gi|74196924|dbj|BAE35020.1| unnamed protein product [Mus musculus]
          Length = 603

 Score =  317 bits (812), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 220/350 (62%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE GK Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 88  GYGEQGKPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLETLP 145

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+V+ FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASAATGDVVTFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD+  Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVVAQ 423


>gi|74186700|dbj|BAE34806.1| unnamed protein product [Mus musculus]
          Length = 603

 Score =  317 bits (812), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 165/350 (47%), Positives = 219/350 (62%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE  K Y + +A R   D +  E G NM  S+ IS +R++PD+R   C    Y   LP
Sbjct: 88  GYGEQAKPYPMTDAERV--DQAYRENGFNMYVSDKISLNRSLPDIRHPNCNSKLYLETLP 145

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD+  Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVVAQ 423


>gi|26329191|dbj|BAC28334.1| unnamed protein product [Mus musculus]
          Length = 528

 Score =  317 bits (812), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 220/350 (62%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE GK Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 13  GYGEQGKPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLETLP 70

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 71  NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 130

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+V+ FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 131 -VRILRTKKREGLIRTRMLGASAATGDVVTFLDSHCEANVNWLPPLLDRIARNRKTIVCP 189

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 190 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 245

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 246 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 305

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD+  Q
Sbjct: 306 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVVAQ 348


>gi|301763571|ref|XP_002917213.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
           partial [Ailuropoda melanoleuca]
          Length = 598

 Score =  317 bits (812), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 163/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 83  GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNGKRYLETLP 140

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 141 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 200

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 201 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIAQNRKTIVCP 259

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 260 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 315

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 316 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 375

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  + Y R P    L  GD++ Q
Sbjct: 376 ------PAGVSLARNLKRVAEVWMDE-YAEHIYQRRPEYRHLSAGDVAAQ 418


>gi|291387688|ref|XP_002710374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Oryctolagus cuniculus]
          Length = 603

 Score =  317 bits (812), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 221/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 88  GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKVDP--SDPFESPVMAGG 320

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAAQ 423


>gi|327277504|ref|XP_003223504.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Anolis carolinensis]
          Length = 612

 Score =  316 bits (810), Expect = 9e-84,   Method: Compositional matrix adjust.
 Identities = 163/352 (46%), Positives = 223/352 (63%), Gaps = 14/352 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GK Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   
Sbjct: 92  RTGNGEQGKPYPMTDAERV--DQAYRENGFNIFVSDKISLNRSLPDIRHPNCNSKLYLEK 149

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  SVI+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L ++LEDY+ +F
Sbjct: 150 LPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLRKRLEDYMAQF 209

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             KVR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  + K + 
Sbjct: 210 T-KVRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPLLDRIARNHKTIV 268

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P+ID ID+  + + +  +     RG F+W M YK   +P    K     S+P++SP  A
Sbjct: 269 CPMIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIPPELQKPDP--SDPFESPVMA 324

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 325 GGLFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPY 384

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                     G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 385 KVP------TGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVATQ 429


>gi|417411867|gb|JAA52354.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Desmodus rotundus]
          Length = 599

 Score =  316 bits (810), Expect = 9e-84,   Method: Compositional matrix adjust.
 Identities = 163/350 (46%), Positives = 220/350 (62%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 84  GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNRKRYLETLP 141

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 142 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMAHFPS 201

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 202 -VRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 260

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 261 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 316

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 317 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 376

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE    + Y R P    L  GD++ Q
Sbjct: 377 P------AGVSLARNLKRVAEVWMDE-FAEHIYQRRPEYRHLSAGDVAAQ 419


>gi|326928540|ref|XP_003210435.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Meleagris gallopavo]
          Length = 562

 Score =  316 bits (810), Expect = 9e-84,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 223/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE GK Y + +A R   D +  E G N+  S+ IS +R++PD+R   CK   Y   LP
Sbjct: 43  GNGEQGKPYPMTDAERV--DQAYRENGFNIFVSDKISLNRSLPDIRHPNCKNKLYLEKLP 100

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             SVI+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L ++LEDY+ +F  
Sbjct: 101 NTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKRLEDYMAQF-P 159

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 160 NVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 219

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+  + + +  +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 220 MIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIPPELQKLDP--SDPFESPVMAGG 275

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 276 LFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 335

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 336 P------TGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVTAQ 378


>gi|118097436|ref|XP_414578.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Gallus
           gallus]
          Length = 611

 Score =  316 bits (810), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 223/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE GK Y + +A R   D +  E G N+  S+ IS +R++PD+R   CK   Y   LP
Sbjct: 93  GNGEQGKPYPMTDAERV--DQAYRENGFNIFVSDKISLNRSLPDIRHPNCKNKLYLEKLP 150

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             SVI+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L ++LEDY+ +F  
Sbjct: 151 NTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKRLEDYMAQF-P 209

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 210 NVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 269

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+  + + +  +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 270 MIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIPPELQKLDP--SDPFESPVMAGG 325

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 326 LFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 385

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 386 P------TGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVTAQ 428


>gi|449267121|gb|EMC78087.1| Polypeptide N-acetylgalactosaminyltransferase 10, partial [Columba
           livia]
          Length = 560

 Score =  316 bits (809), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 223/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE GK Y + +A R   D +  E G N+  S+ IS +R++PD+R   CK   Y   LP
Sbjct: 31  GNGEQGKPYPMTDAERV--DQAYRENGFNIFVSDKISLNRSLPDIRHPNCKNKLYLEKLP 88

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             SVI+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L ++LEDY+ +F  
Sbjct: 89  NTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKRLEDYMAQF-P 147

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 148 NVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 207

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+  + + +  +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 208 MIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIPPELQKLDP--SDPFESPVMAGG 263

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 264 LFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 323

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 324 P------TGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAAQ 366


>gi|307186144|gb|EFN71869.1| N-acetylgalactosaminyltransferase 6 [Camponotus floridanus]
          Length = 602

 Score =  316 bits (809), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 169/356 (47%), Positives = 221/356 (62%), Gaps = 14/356 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE G+   L  +  A  +      G N   S+ IS +R++PD+R  +C+   Y
Sbjct: 77  EEKRTGMGEHGRPAFLSPSLDARKEKLYQVNGFNAALSDEISLNRSVPDIRHPDCRKKKY 136

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             +L   SVI+ FHNE FS+LMRT  S+I R+P   LEEIILVDD S+K +L +KL+DYI
Sbjct: 137 SKNLDPVSVIVSFHNEHFSTLMRTCWSVINRSPPSLLEEIILVDDASTKVELKKKLDDYI 196

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            ++  KV ++R  +R GLIR R  GAK +R +V+VFLD+H E  +NWLPPLL PI  + K
Sbjct: 197 AQYLPKVSIVRLAKRSGLIRGRLAGAKAARAKVLVFLDSHSEANVNWLPPLLEPIAQNYK 256

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
               P ID I Y+T+E+R+    D   RG F+W + YK   L   + K+    +EP+KSP
Sbjct: 257 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLKR---PAEPFKSP 310

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+   FF ELGGYDPGL +WGGE +ELSFKIW CGG +   PCSR+GH+YR F
Sbjct: 311 IMAGGLFAISAKFFWELGGYDPGLDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKF 370

Query: 318 MPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            P+ N G      +G  +  NYKRV E W DE +  Y Y R P    LD GD+SEQ
Sbjct: 371 PPFPNPG------RGDFLGKNYKRVAEVWMDE-YAEYIYKRRPHLRALDPGDLSEQ 419


>gi|444727227|gb|ELW67729.1| N-acetylgalactosaminyltransferase 7 [Tupaia chinensis]
          Length = 606

 Score =  316 bits (809), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 157/303 (51%), Positives = 213/303 (70%), Gaps = 5/303 (1%)

Query: 72  CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
           CKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+L+DDFS+K  L +
Sbjct: 5   CKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIVLIDDFSNKEHLKE 64

Query: 132 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPPLLA 190
           +L++YI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAHCEV +NW  PL+A
Sbjct: 65  RLDEYIKMWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAPLVA 124

Query: 191 PIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
           PI  DR   TVP+ID ID   +  E +   + D   RG ++W +L+K   L  +E  KRK
Sbjct: 125 PISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLNHKEKAKRK 184

Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
           + +EPY+SP  AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +VPCS
Sbjct: 185 HKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFVPCS 244

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
           R+GH+YR    +        V       NY RV+E W+DE +K YFY   P +  L  GD
Sbjct: 245 RVGHIYR-LEGWQGNPPPVYVGSSPTLKNYIRVVEVWWDE-YKDYFYASRPESKALPYGD 302

Query: 369 ISE 371
           ISE
Sbjct: 303 ISE 305


>gi|345307949|ref|XP_001508273.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Ornithorhynchus anatinus]
          Length = 593

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 163/350 (46%), Positives = 223/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE GK Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 88  GNGEQGKPYPMTDAERV--DQAYRENGFNIFVSDKISLNRSLPDIRHPNCNNKLYLEKLP 145

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             SVI+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L ++LEDY+ RF  
Sbjct: 146 NTSVIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKRLEDYMARF-P 204

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
           +VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 205 RVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+  + + +  +     RG F+W M YK   +P+   K     S+P++SP  AGG
Sbjct: 265 MIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIPQELQKPDP--SDPFESPVMAGG 320

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+D+ +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 321 LFAVDKKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD+  Q
Sbjct: 381 P------TGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVVAQ 423


>gi|395817210|ref|XP_003782067.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Otolemur garnettii]
          Length = 603

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 163/350 (46%), Positives = 220/350 (62%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 88  GNGEQGRPYPMSDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LE Y+  F  
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEAYMALFPS 205

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 320

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 321 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 380

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 381 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAAQ 423


>gi|348575151|ref|XP_003473353.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Cavia porcellus]
          Length = 602

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 163/350 (46%), Positives = 218/350 (62%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + E  R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 87  GNGEQGRPYPMTEGERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLEVLP 144

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 145 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 204

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R   REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 205 -VRILRTKRREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 263

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 264 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 319

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 320 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 379

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W D+ +  Y Y R P    L  GD+  Q
Sbjct: 380 P------AGVSLARNLKRVAEVWMDD-YAEYIYQRRPEYRHLSAGDVVAQ 422


>gi|224496010|ref|NP_001139074.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Danio rerio]
          Length = 600

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 167/350 (47%), Positives = 219/350 (62%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE GK Y L E      D+   E G N+  SN+I+ DR++PD+R   CK   Y  +LP
Sbjct: 82  GKGEHGKPYPLVED--ECDDSVYKENGFNIYVSNNIALDRSLPDIRHPNCKQKLYLENLP 139

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RT+HSI  RTP   + EIILVDD+S +  L   L +Y+ RF  
Sbjct: 140 NTSIIIPFHNEGWSSLLRTLHSISNRTPDHLIAEIILVDDYSDREHLKAHLAEYMSRF-P 198

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
           KVR++R  +REGLIRTR  GA  +RGEV+ FLD+HCE  +NWLPPLL  I  + K +  P
Sbjct: 199 KVRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEANINWLPPLLDQIAQNPKTIVCP 258

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+  + + +  +     RG F+W M YK   +P          S+PY+SP  AGG
Sbjct: 259 MIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPELQGPDP--SDPYQSPVMAGG 314

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA++R +F ELGGYD GL +WGGE FE+SFK+WMCGGS+  VPCSR+GH+YR ++PY  
Sbjct: 315 LFAVNRQWFWELGGYDTGLEIWGGEQFEISFKVWMCGGSMYDVPCSRVGHIYRKYVPYKV 374

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV ETW DE +  Y Y R P    L  GD++ Q
Sbjct: 375 P------SGTSLARNLKRVAETWMDE-YTEYIYQRRPEYRHLSTGDLTAQ 417


>gi|354481325|ref|XP_003502852.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Cricetulus griseus]
          Length = 715

 Score =  315 bits (807), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 219/350 (62%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 200 GNGEQGRPYPMTDAERE--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLETLP 257

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 258 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 317

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 318 -VRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 376

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 377 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 432

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR  +PY  
Sbjct: 433 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKSVPYKV 492

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L  GD+  Q
Sbjct: 493 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVVAQ 535


>gi|410897068|ref|XP_003962021.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Takifugu rubripes]
          Length = 556

 Score =  315 bits (806), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 161/371 (43%), Positives = 233/371 (62%), Gaps = 13/371 (3%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K G+L P L        EGPGE GKA  +P+  +            N+  S+ I+ +R
Sbjct: 36  DKKDGSLLPALRAVISRRHEGPGEMGKAVVIPKDEQEKMKELFKINQFNLMASDMIALNR 95

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R++ CK   YP D+P  S+++VFHNE +S+L+RTVHS+I R+P   L EI+LVDD
Sbjct: 96  SLPDVRLDGCKTKVYPDDVPNTSIVIVFHNEAWSTLLRTVHSVINRSPRHLLVEIVLVDD 155

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L +KLE+Y++     VR++R  +R GLIR R RGA  ++G+VI FLDAHCE  +
Sbjct: 156 ASERDFLKKKLENYVRTLEVPVRILRMEQRSGLIRARLRGAAATKGQVITFLDAHCECTV 215

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA I  DR  +  P+ID I  +T+E+ +    D  Y G F W + ++   +P+R
Sbjct: 216 GWLEPLLARIKEDRTAVVCPIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           E  +RK + + P ++PT AGGLF++D+ +F E+G YDPG+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGS 332

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +E V CS +GHV+R   PY+F        G +I  N +R+ E W D+  K +FY   P  
Sbjct: 333 LEIVTCSHVGHVFRKATPYSFPGGT----GQVINKNNRRLAEVWMDD-FKDFFYIISPGV 387

Query: 362 MFLDMGDISEQ 372
           M +D GD+S +
Sbjct: 388 MRVDYGDVSSR 398


>gi|427784527|gb|JAA57715.1| Putative polypeptide n-acetylgalactosaminyltransferase
           [Rhipicephalus pulchellus]
          Length = 612

 Score =  315 bits (806), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 162/352 (46%), Positives = 221/352 (62%), Gaps = 14/352 (3%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           +GPGE G A+ LP       D      G N   S+ I+ +R++PD+R   C+   Y   L
Sbjct: 100 KGPGEQGAAFFLPAGMEKKKDELYKVNGFNALASDFIALNRSLPDIRNPGCQKKRYVSKL 159

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI+ FHNE +++L+RT  S++ R+P + ++EIIL DD+S+K  L + LEDYI +  
Sbjct: 160 PTVSVIVPFHNEHWTTLLRTATSVLNRSPPELIKEIILADDYSNKEQLKKPLEDYIAKHW 219

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
            KVR++R T REGLIR R  GA+++ G+V++FLD+H E  +NWLPPLL PI  D + +  
Sbjct: 220 NKVRVVRATRREGLIRARLLGARQATGDVLIFLDSHTEANVNWLPPLLEPIAKDYRTVVC 279

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTHA 260
           P ID IDY+T+ +R+    D   RG F+W + YK    LPE  A      +EP+KSP  A
Sbjct: 280 PFIDVIDYETFAYRA---QDEGARGSFDWELYYKRLPLLPEDLANP----TEPFKSPVMA 332

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+ R +F ELGGYD GL VWGGE +ELSFKIW CGG++   PCSR+GH+YR F P+
Sbjct: 333 GGLFAISRRYFWELGGYDEGLDVWGGEQYELSFKIWQCGGTMVDAPCSRVGHIYRKFAPF 392

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
               + D      +  NY+RV E W DE +K Y Y R P    L+ GD++ Q
Sbjct: 393 PNPGIGD-----FVGRNYRRVAEVWMDE-YKEYLYMRRPHYRNLEPGDLTAQ 438


>gi|198434303|ref|XP_002132126.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
           17 [Ciona intestinalis]
          Length = 870

 Score =  314 bits (805), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 168/354 (47%), Positives = 221/354 (62%), Gaps = 22/354 (6%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           GPGE G A HL    R+   ++  E G N+  SN IS +R++PD+R + C    Y   LP
Sbjct: 355 GPGELGVAVHLSTEERSR--SAYSENGFNILVSNRISLNRSLPDIRHKNCASRKYLAQLP 412

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS----KADLDQKLEDYIQ 138
            AS+I+ FHNEG ++L+RT+HSII RTP   L EIILVDD S+    K+ LDQ+L  Y Q
Sbjct: 413 DASIIIPFHNEGRTTLLRTIHSIINRTPKILLREIILVDDCSTVDHLKSSLDQELSKYRQ 472

Query: 139 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
                V+L+R  +REGLIR R  G  +++G  IV LD+H EV  NWLPPLL PI  DRK+
Sbjct: 473 -----VKLVRLAKREGLIRARLAGVHQAKGNTIVILDSHVEVTNNWLPPLLEPIALDRKV 527

Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPT 258
           +T P+ID I+    +F  + +P    RG F+W + YK   +P    K+ K  S+P++ P 
Sbjct: 528 ITCPMIDIINKD--DFHYLTQPGDAMRGAFDWELYYKRIPIPPE--KQLKDPSDPFEDPV 583

Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
            AGGLFA+DR +F E+G YD GL +WGGE +ELSFK WMCGG I   PCSR+GH+YR FM
Sbjct: 584 MAGGLFAIDRLYFKEIGEYDDGLEIWGGEQYELSFKAWMCGGKILDAPCSRVGHIYREFM 643

Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           PY+         G  I  N+KRV E W DE +  YFY + P    +  GD+S+Q
Sbjct: 644 PYSLP------PGTNINKNFKRVAEVWMDE-YAEYFYKKRPHVRGIHPGDLSKQ 690


>gi|449474909|ref|XP_002194974.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Taeniopygia guttata]
          Length = 555

 Score =  313 bits (803), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 162/350 (46%), Positives = 222/350 (63%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE GK Y + +A R   D +  E G N+  S+ IS +R++PD+R   CK   Y   LP
Sbjct: 37  GNGEQGKPYPMTDAERV--DQAYRENGFNIFVSDKISLNRSLPDIRHPNCKNKLYLEKLP 94

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             SVI+ FHNEG+SSL+RTVHS++ R+P + + E++LVDDFS +  L ++LEDY+ +F  
Sbjct: 95  NTSVIIPFHNEGWSSLLRTVHSVLNRSPPELVAEVVLVDDFSDREHLKKRLEDYMAQF-P 153

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R   REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 154 SVRILRTKRREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 213

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+  + + +  +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 214 MIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIPPELQKPDP--SDPFESPVMAGG 269

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY  
Sbjct: 270 LFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKV 329

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  + Y R P    L  GD++ Q
Sbjct: 330 P------TGVSLARNLKRVAEVWMDE-YAEFIYQRRPEYRHLSAGDVAAQ 372


>gi|321476751|gb|EFX87711.1| hypothetical protein DAPPUDRAFT_306553 [Daphnia pulex]
          Length = 626

 Score =  313 bits (801), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 164/355 (46%), Positives = 222/355 (62%), Gaps = 8/355 (2%)

Query: 17  LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
           L   + G G GGKA  L  A     +  + +   N+  SN IS++RT+PD+R   CK   
Sbjct: 111 LNKIENGLGAGGKAVKLFGAELQEAEEIMKKEAFNLFISNRISYNRTLPDVRDSMCKGLT 170

Query: 77  YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
           Y   LP ASVI++F NE +S L+RT+ S+I R+P ++L+EI+L+DDFS + +L  KLE Y
Sbjct: 171 YDTILPSASVIIIFTNEAWSPLIRTIWSVINRSPRKFLKEILLIDDFSDRVELQGKLERY 230

Query: 137 IQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           I+ +    VRL+R  ER+GLIR R  GAKE+ GEVI+FLD+HCE  L WL PLL  I  D
Sbjct: 231 IETQLPSIVRLVRLKERQGLIRARLAGAKEATGEVIIFLDSHCEATLGWLEPLLQRIKED 290

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYK 255
           ++ + VP+ID ID +T E+     P+    G F W   +   ++P+RE K+R     P  
Sbjct: 291 KRAVLVPIIDVIDDKTLEYYH-GSPESFQIGSFTWSGHFTWMDIPKREIKRRGSRVGPTN 349

Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
           SPT AGGLFA+DR +F +LG YD G+ VWGGEN E+SF+IWMCGGS+E +PCSR+GH++R
Sbjct: 350 SPTMAGGLFAIDRQYFWDLGSYDEGMDVWGGENLEMSFRIWMCGGSLETIPCSRVGHIFR 409

Query: 316 SFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           SF PY F    D         N  RV+E W D+ +K  FY        +D+GD S
Sbjct: 410 SFHPYTFPGNKDTH-----GINTARVVEVWMDD-YKELFYMHRGDLKTIDIGDTS 458


>gi|391332245|ref|XP_003740546.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
           N-acetylgalactosaminyltransferase 10-like [Metaseiulus
           occidentalis]
          Length = 590

 Score =  312 bits (800), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 163/359 (45%), Positives = 228/359 (63%), Gaps = 16/359 (4%)

Query: 18  EPYKEGPGEGGKAYHLP-EAYRAAGDASLGEY-GMNMETSNHISFDRTIPDLRMEECKYW 75
           E   +GPGE G A  LP +A        L +  G N   S+ I+ +R++PD+R  EC+  
Sbjct: 70  EKLAQGPGEQGAAVELPKDAETEQRKEKLYKVNGFNAAVSDLIALNRSLPDIRHSECQNI 129

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK-ADLDQKLE 134
            Y   LP AS+++ FHNE  S L+RT+ S+++R+P   ++EIILVDDFSSK + +  +LE
Sbjct: 130 RYAARLPTASIVIPFHNEHLSVLLRTITSVLRRSPKSLIKEIILVDDFSSKKSXVSTELE 189

Query: 135 DYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIY 193
           +Y+   F  +V+L+R T+REGLIR R  GA+ + G+V++FLD+H E  +NWLPPLL PI 
Sbjct: 190 NYLSSHFGSQVKLLRATKREGLIRARLLGARAAEGDVLIFLDSHTEANVNWLPPLLDPIA 249

Query: 194 SDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEP 253
            +R+ +  P ID I Y+T+ +RS    D   RG F+W + YK   L   + K+    +EP
Sbjct: 250 RNRRTVVCPFIDVIHYETFAYRS---QDEGARGAFDWELYYKRLPLLSEDLKR---PTEP 303

Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
           ++SP  AGGLFA+DR++F ELGGYD GL VWGGE +ELSFKIW CGG +   PCSR+GH+
Sbjct: 304 FRSPVMAGGLFAIDRSYFWELGGYDEGLDVWGGEQYELSFKIWQCGGQMFDAPCSRVGHI 363

Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           YR F P+    + D      +  NY+RV E W DE +K + Y R P    L  GD+S+Q
Sbjct: 364 YRKFAPFPNPGIGD-----FVGRNYRRVAEVWMDE-YKEFLYNRRPHYRTLGYGDVSKQ 416


>gi|345308178|ref|XP_003428667.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           2 [Ornithorhynchus anatinus]
          Length = 558

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 160/361 (44%), Positives = 225/361 (62%), Gaps = 9/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           L PP++   EGPGE GK   +P+  +            N+  S  I+F+R++PD+R+E C
Sbjct: 46  LRPPIQKPHEGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASERIAFNRSLPDVRLEGC 105

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S +  L + 
Sbjct: 106 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 165

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G VI FLDAHCE  + WL PLLA I
Sbjct: 166 LESYVRKLRVPVHVIRMEQRSGLIRARLKGAAASKGRVITFLDAHCECTVGWLEPLLARI 225

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 226 KFDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 282

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 283 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 342

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 343 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 397

Query: 372 Q 372
           +
Sbjct: 398 R 398


>gi|332251762|ref|XP_003275018.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           2 [Nomascus leucogenys]
          Length = 557

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 161/361 (44%), Positives = 227/361 (62%), Gaps = 9/361 (2%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
           G++  P+   +EGPGE GKA  +P+  +            N+  S+ I+ +R++PD+R+E
Sbjct: 45  GDILKPITKNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLE 104

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
            CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD S +  L 
Sbjct: 105 GCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLK 164

Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
             LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L WL PLLA
Sbjct: 165 LTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLA 224

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
            I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK +
Sbjct: 225 RIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281

Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
            + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS 
Sbjct: 282 RTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSH 341

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  + +D GD+
Sbjct: 342 VGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDV 396

Query: 370 S 370
           S
Sbjct: 397 S 397


>gi|390361781|ref|XP_790897.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
           partial [Strongylocentrotus purpuratus]
          Length = 521

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 165/350 (47%), Positives = 216/350 (61%), Gaps = 10/350 (2%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           PGE G A  L    +          G N   S+ IS DR +PD+R   CK   Y   LP 
Sbjct: 1   PGERGVAVKLTPEMKKTEKKDTSANGFNERVSDMISMDRALPDIRNPRCKEITYLAKLPN 60

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
            SVI+ FHNE  S+L RTVHSI  R+P + + EIILVDDFS +A L   L+DY+  F  K
Sbjct: 61  VSVIIPFHNEALSTLKRTVHSIFNRSPPELIHEIILVDDFSDRAYLKGPLDDYMSAFP-K 119

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
           V++IR  +REGLIRTR  GA  + G+V++FLD+HCE   NWLPPLL  I  +R+ +  P+
Sbjct: 120 VKIIRLEKREGLIRTRLLGAGPATGDVVLFLDSHCEANYNWLPPLLERIALNRRRIVCPM 179

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 263
           ID I  + + + S  +     RG F+W + YK   + E E K+R + S+P+++P  AGGL
Sbjct: 180 IDVISNEDFHYES--QAGDVMRGAFDWELYYKRIPISEAENKRRSHESDPFRTPIMAGGL 237

Query: 264 FAMDRAFFL-ELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           FA+DR +F+ ELGGYD GL +WGGE ++LSFK+WMCGG +E +PCSR+GH+YR FM Y  
Sbjct: 238 FAVDRKYFMEELGGYDEGLEIWGGEQYDLSFKVWMCGGEMEEIPCSRVGHIYRKFMSYTV 297

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
              A      +I  N  RV+E W DE  K YFY R P     D GDIS+Q
Sbjct: 298 PGGAG-----VINKNLLRVVEVWMDEWGK-YFYERRPYLKGQDYGDISKQ 341


>gi|432908535|ref|XP_004077909.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Oryzias latipes]
          Length = 557

 Score =  311 bits (798), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 160/366 (43%), Positives = 228/366 (62%), Gaps = 13/366 (3%)

Query: 8   GKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL 67
           G+  +L  P    ++GPGEGGK   +P+  +            N+  S  I+ +R++PD+
Sbjct: 44  GRADSLSRP----RDGPGEGGKPVVIPKENQEKMKEMFKINQFNLMASEMIALNRSLPDV 99

Query: 68  RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           R+E CK   YP DLP+ SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S + 
Sbjct: 100 RLEGCKNKLYPDDLPRTSVVIVFHNEAWSTLLRTVHSVIDRSPRSLLEEIVLVDDASERD 159

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L ++LE Y++R    VR++R  +R GLIR R +GA  S G+VI FLDAHCE  L WL P
Sbjct: 160 FLKRQLEQYVRRLEVPVRVVRMEQRSGLIRARLKGASISTGQVITFLDAHCECTLGWLEP 219

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 247
           LL  I  D++ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +R
Sbjct: 220 LLTRIKQDKRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRR 276

Query: 248 KYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           K + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V 
Sbjct: 277 KGDRTIPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVT 336

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDM 366
           CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D 
Sbjct: 337 CSHVGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDY 391

Query: 367 GDISEQ 372
           GDIS +
Sbjct: 392 GDISTR 397


>gi|449278148|gb|EMC86104.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Columba livia]
          Length = 553

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 158/363 (43%), Positives = 228/363 (62%), Gaps = 9/363 (2%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
           G++  P++   EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E
Sbjct: 45  GDVPEPIQKPHEGPGEMGKPVVIPKEEQEKMKEMFKINQFNLMASEIIALNRSLPDVRLE 104

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
            CK   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S +  L 
Sbjct: 105 GCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLK 164

Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
           + LE+Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA
Sbjct: 165 RPLENYVKKLKVPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPLLA 224

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
            I +DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK +
Sbjct: 225 RIKADRRTVVCPIIDVISDDTFEYMA--GSDKTYGG-FNWKLNFRWYPVPQREMDRRKGD 281

Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
            + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS 
Sbjct: 282 RTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSH 341

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDI
Sbjct: 342 VGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDI 396

Query: 370 SEQ 372
           S +
Sbjct: 397 SSR 399


>gi|118404262|ref|NP_001072444.1| polypeptide N-acetylgalactosaminyltransferase 10 [Xenopus
           (Silurana) tropicalis]
 gi|113197915|gb|AAI21701.1| GalNAc transferase 10 [Xenopus (Silurana) tropicalis]
          Length = 603

 Score =  311 bits (797), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 161/350 (46%), Positives = 218/350 (62%), Gaps = 14/350 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE GK + + +A     D +  E G N+  S+ IS +R++PD+R   CK   Y   LP
Sbjct: 85  GNGEQGKPFPMTDADHV--DQAYRENGFNIFVSDKISLNRSLPDIRNSNCKNKFYFSKLP 142

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             SVI+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDD+S KA L  +LE Y+  F  
Sbjct: 143 NTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDYSDKAHLKSRLEKYMANF-P 201

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
           KV+++R  +REGLIRTR  GA  + GEV+ FLD+HCE  +NWLPPLL P+  + K +  P
Sbjct: 202 KVKIVRTKKREGLIRTRMLGATVASGEVLTFLDSHCEANVNWLPPLLDPLVQNYKTVVCP 261

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID     F  V +     RG F+W M YK   +P    K     S+P+ SP  AGG
Sbjct: 262 MIDVIDSDN--FGYVTQAGDAMRGAFDWEMFYKRIPIPPELQKGDP--SDPFDSPVMAGG 317

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA++R +F +LGGYDPGL +WGGE +E+SFK+WMCGG +   PCSR+GH+YR ++PY  
Sbjct: 318 LFAINREWFWQLGGYDPGLEIWGGEQYEISFKVWMCGGRMVDSPCSRVGHIYRKYVPYKV 377

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G  +  N KRV E W DE +  Y Y R P    L +GD++ Q
Sbjct: 378 P------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPDYRHLSVGDVAAQ 420


>gi|405975554|gb|EKC40113.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Crassostrea gigas]
          Length = 624

 Score =  311 bits (796), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 160/353 (45%), Positives = 224/353 (63%), Gaps = 11/353 (3%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYP--LD 80
           GPGE GK   +P   +A           N+  S+ IS +R++PD RM+ CK   YP   D
Sbjct: 114 GPGEMGKPVVIPLDRQAESKEKFKINQFNLVASDMISLNRSLPDYRMDACKRKSYPPNSD 173

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  SV++VFHNE +S+L+RTVHSII R+P + L EI+LVDD S + +L +KLEDYI R 
Sbjct: 174 LPDTSVVIVFHNEAWSTLLRTVHSIINRSPRELLNEILLVDDASEREELGKKLEDYIARL 233

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
               R+IR+ ER GLIR R +GAK++RG+VI FLDAHCE    WL PLL  I+ DR  + 
Sbjct: 234 PVSTRVIRSEERTGLIRARLKGAKQARGKVITFLDAHCECTEGWLEPLLYEIHKDRTAVV 293

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
            P+ID I   ++E+  +   D  + G F W + ++   +P+RE  +R  + S P K+PT 
Sbjct: 294 CPIIDVIGDDSFEY--ITGSDMTWGG-FNWKLNFRWYPVPQRELDRRGGDRSNPTKTPTM 350

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLF++DR +F E+G YD G+ +WGGEN E+SF++WMCGG +  V CSR+GHV+R   P
Sbjct: 351 AGGLFSIDRDYFYEVGSYDEGMDIWGGENLEMSFRVWMCGGKVYIVTCSRVGHVFRKTSP 410

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           Y++     R+    I +N +R++E W DE +K +FY   P       GD+SE+
Sbjct: 411 YSWPGGVARI----INHNTQRIVEVWMDE-YKDFFYKINPGVRSTSYGDVSER 458


>gi|326670471|ref|XP_002663357.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Danio rerio]
          Length = 556

 Score =  311 bits (796), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 159/364 (43%), Positives = 226/364 (62%), Gaps = 9/364 (2%)

Query: 10  LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
           L  L   +    E PGE GKA  +P+  +            N+  S+ I+ +R++PD+R+
Sbjct: 43  LPALRAVMSRAHEAPGEMGKAVVIPKEEQDKMKELFKINQFNLMASDMIALNRSLPDVRL 102

Query: 70  EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
           + CK   YP DLP  S+++VFHNE +S+L+RTVHS I R+P Q L EI+LVDD S +  L
Sbjct: 103 DGCKTKTYPDDLPNTSIVIVFHNEAWSTLLRTVHSAINRSPRQLLYEILLVDDASERDFL 162

Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
            +KLEDY+      VR++R  +R GLIR R RGA  +RG+VI FLDAHCE    WL PL+
Sbjct: 163 KEKLEDYVATLEVPVRILRMEQRTGLIRARLRGAAATRGQVITFLDAHCECTTGWLEPLM 222

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
           A I  DR+ +  P+ID I  +T+E+ +    D  Y G F W + ++   +P+RE  +RK 
Sbjct: 223 ARIKEDRRAVVCPIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKG 279

Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
           + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS
Sbjct: 280 DRTLPVRTPTMAGGLFSIDRTYFEEIGTYDSGMDIWGGENLEMSFRIWQCGGSLEIVTCS 339

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
            +GHV+R   PY+F        G +I  N +R+ E W DE  K +FY   P  + +D GD
Sbjct: 340 HVGHVFRKATPYSFPGGT----GQVINKNNRRLAEVWMDE-FKDFFYIISPGVVRVDYGD 394

Query: 369 ISEQ 372
           +S +
Sbjct: 395 VSSR 398


>gi|13878612|sp|Q29121.1|GALT1_PIG RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
           AltName: Full=Polypeptide GalNAc transferase 1;
           Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 1;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 1; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 1
           soluble form
 gi|1339955|dbj|BAA12800.1| N-acetylgalactosaminyl transferase [Sus sp.]
          Length = 559

 Score =  310 bits (795), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQDKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKTFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|395510712|ref|XP_003759616.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
           [Sarcophilus harrisii]
          Length = 559

 Score =  310 bits (794), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +RT+PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVAIPKEDQEKMKEMFKINQFNLMASEMIALNRTLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVRKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KVDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRHYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDIST 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|322787059|gb|EFZ13283.1| hypothetical protein SINV_13249 [Solenopsis invicta]
          Length = 540

 Score =  310 bits (794), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 167/356 (46%), Positives = 220/356 (61%), Gaps = 14/356 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE G+   L  +     +      G N   S+ IS +R++PD+R  +CK   Y
Sbjct: 17  EERRTGMGEHGRPAFLSPSLDVRKEKLYQVNGFNAALSDEISVNRSVPDIRHSDCKKKQY 76

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             +L   SVI+ FHNE FS+L+RT  S++ R+P   LEEIILVDD S+K +L +KL+DY+
Sbjct: 77  LKNLDPVSVIVSFHNEHFSTLLRTCWSVVNRSPPSLLEEIILVDDASTKIELKKKLDDYV 136

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            +   KV ++R  +R GLIR R  GAK++R +V+VFLD+H E  +NWLPPLL PI  D K
Sbjct: 137 AQHLPKVLIVRLPKRSGLIRGRLAGAKKARAKVLVFLDSHSEANVNWLPPLLEPIARDYK 196

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
               P ID I Y+T+E+R+    D   RG F+W + YK   L   + K+    +EP+KSP
Sbjct: 197 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLKR---PAEPFKSP 250

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+   FF ELGGYDPGL +WGGE +ELSFKIW CGG +   PCSR+GH+YR F
Sbjct: 251 IMAGGLFAISTKFFWELGGYDPGLDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKF 310

Query: 318 MPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            P+ N G      +G  +  NYKRV E W DE +  Y Y R P    LD GD+SEQ
Sbjct: 311 PPFPNPG------RGDFLGKNYKRVAEVWMDE-YAEYIYKRRPHLRTLDPGDLSEQ 359


>gi|301766699|ref|XP_002918770.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 2 [Ailuropoda melanoleuca]
          Length = 557

 Score =  310 bits (794), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 162/361 (44%), Positives = 226/361 (62%), Gaps = 9/361 (2%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
           G +  PL+   EGPGE GKA  +P+  +            N+  S+ I+ +R++PD+R+E
Sbjct: 45  GFIHIPLQDPHEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLE 104

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
            CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD S +  L 
Sbjct: 105 GCKTKIYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERDFLK 164

Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
             LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L WL PLLA
Sbjct: 165 LTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLA 224

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
            I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK +
Sbjct: 225 RIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281

Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
            + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS 
Sbjct: 282 RTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSH 341

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  + +D GD+
Sbjct: 342 VGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDV 396

Query: 370 S 370
           S
Sbjct: 397 S 397


>gi|348526962|ref|XP_003450988.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Oreochromis niloticus]
          Length = 557

 Score =  310 bits (793), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 156/353 (44%), Positives = 223/353 (63%), Gaps = 9/353 (2%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           ++GPGEGGK   +P+  +            N+  S  I+ +R++PD+R+E CK   YP +
Sbjct: 53  RDGPGEGGKPVVIPKEQQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKNKLYPDN 112

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP+ SV++VFHNE +++L+RTVHS+I R+P   LEEI+LVDD S +  L Q+LE Y+++ 
Sbjct: 113 LPRTSVVIVFHNEAWTTLLRTVHSVIDRSPHTLLEEIVLVDDASERDFLKQQLERYVRKL 172

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
              VR++R  +R GLIR R +GA  S G+VI FLDAHCE    WL PLLA I  DRK + 
Sbjct: 173 EVPVRVVRMEQRSGLIRARLKGASISTGQVITFLDAHCECTTGWLEPLLARIKQDRKTVV 232

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
            P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + + P ++PT 
Sbjct: 233 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTM 289

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +GHV+R   P
Sbjct: 290 AGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKATP 349

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           Y F        G +I  N +R+ E W DE  K +FY   P    +D GDI+ +
Sbjct: 350 YTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDITSR 397


>gi|149412842|ref|XP_001510290.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           1 [Ornithorhynchus anatinus]
          Length = 559

 Score =  310 bits (793), Expect = 9e-82,   Method: Compositional matrix adjust.
 Identities = 159/363 (43%), Positives = 226/363 (62%), Gaps = 9/363 (2%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
           G++  P++   EGPGE GK   +P+  +            N+  S  I+F+R++PD+R+E
Sbjct: 45  GDVPEPIQKPHEGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASERIAFNRSLPDVRLE 104

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
            CK   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S +  L 
Sbjct: 105 GCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLK 164

Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
           + LE Y+++    V +IR  +R GLIR R +GA  S+G VI FLDAHCE  + WL PLLA
Sbjct: 165 RPLESYVRKLRVPVHVIRMEQRSGLIRARLKGAAASKGRVITFLDAHCECTVGWLEPLLA 224

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
            I  DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK +
Sbjct: 225 RIKFDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281

Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
            + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS 
Sbjct: 282 RTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSH 341

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDI
Sbjct: 342 VGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDI 396

Query: 370 SEQ 372
           S +
Sbjct: 397 SSR 399


>gi|426253597|ref|XP_004020479.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Ovis
           aries]
          Length = 559

 Score =  310 bits (793), Expect = 9e-82,   Method: Compositional matrix adjust.
 Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|29135331|ref|NP_803485.1| polypeptide N-acetylgalactosaminyltransferase 1 precursor [Bos
           taurus]
 gi|1171989|sp|Q07537.1|GALT1_BOVIN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
           AltName: Full=Polypeptide GalNAc transferase 1;
           Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 1;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 1; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 1
           soluble form
 gi|289412|gb|AAA30532.1| UDP-GalNAc:polypeptide, N-acetylgalactosaminyltransferase [Bos
           taurus]
 gi|296473855|tpg|DAA15970.1| TPA: polypeptide N-acetylgalactosaminyltransferase 1 [Bos taurus]
          Length = 559

 Score =  310 bits (793), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|327281385|ref|XP_003225429.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 2 [Anolis carolinensis]
          Length = 557

 Score =  310 bits (793), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 160/363 (44%), Positives = 227/363 (62%), Gaps = 9/363 (2%)

Query: 9   KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           + G    PL+  +EGPGE GKA  +P+  +            N+  S+ I+ +R++PD+R
Sbjct: 43  QAGQTMIPLQRNQEGPGEMGKAVIIPKDDQEKMKELFKINQFNLMASDMIALNRSLPDVR 102

Query: 69  MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
           +E CK   YP +LP  SV++VFHNE +S+L+RT++S+I R P   L EIILVDD S +  
Sbjct: 103 LEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTIYSVINRAPHYLLAEIILVDDASERDF 162

Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
           L   LE+Y++     V+++R  +R GLIR R RGA  S+G+VI FLDAHCE  L WL PL
Sbjct: 163 LKVPLENYVKTLQVPVKIMRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 222

Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
           LA I  DRKI+  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK
Sbjct: 223 LARIKEDRKIVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279

Query: 249 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 307
            + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 339

Query: 308 SRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG 367
           S +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  + +D G
Sbjct: 340 SHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYG 394

Query: 368 DIS 370
           D++
Sbjct: 395 DVT 397


>gi|149720888|ref|XP_001496819.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Equus caballus]
          Length = 559

 Score =  310 bits (793), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|348519902|ref|XP_003447468.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
           [Oreochromis niloticus]
          Length = 556

 Score =  310 bits (793), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 155/352 (44%), Positives = 225/352 (63%), Gaps = 9/352 (2%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           EGPGE GKA ++P+  +            N+  S+ I+ +R++PD+R++ CK   Y  DL
Sbjct: 55  EGPGEMGKAVNIPKDDQEKMKELFKINQFNLMASDMIALNRSLPDVRLDGCKTKVYSDDL 114

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  S+++VFHNE +S+L+RTVHS+I R+P   L EIILVDD S +  L +KLE+Y++   
Sbjct: 115 PNTSIVIVFHNEAWSTLLRTVHSVINRSPKHLLVEIILVDDASERDFLKKKLENYVRTLE 174

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
             VR++R  +R GLIR R RGA  + G+VI FLDAHCE  + WL PLLA I  DR  +  
Sbjct: 175 VPVRILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTVGWLEPLLARIKEDRTAVVC 234

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
           P+ID I  +T+E+ +    D  Y G F W + ++   +P+RE  +RK + + P ++PT A
Sbjct: 235 PIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMA 291

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLF++D+ +F E+G YDPG+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R   PY
Sbjct: 292 GGLFSIDKTYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPY 351

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +F        G +I  N +R+ E W D+  K +FY   P  M ++ GD+S +
Sbjct: 352 SFPGGT----GQVINKNNRRLAEVWMDD-FKDFFYIISPGVMRVEYGDVSSR 398


>gi|73961264|ref|XP_537284.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           1 [Canis lupus familiaris]
 gi|301764431|ref|XP_002917637.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Ailuropoda melanoleuca]
 gi|281348455|gb|EFB24039.1| hypothetical protein PANDA_005970 [Ailuropoda melanoleuca]
          Length = 559

 Score =  309 bits (792), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|340713833|ref|XP_003395440.1| PREDICTED: n-acetylgalactosaminyltransferase 6-like [Bombus
           terrestris]
          Length = 610

 Score =  309 bits (792), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 169/353 (47%), Positives = 218/353 (61%), Gaps = 14/353 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GK   L  +  A  +      G N   S+ IS +R++PD+R  +CK   Y  +
Sbjct: 88  RTGIGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISMNRSVPDIRHPDCKKKKYLKN 147

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           L   SVI+ FHNE FS+LMRT  S+I R+PA  L+EIILVDD S+KA+L + LEDYI   
Sbjct: 148 LDSVSVIVSFHNEHFSTLMRTCWSVINRSPAFLLKEIILVDDASTKAELKKPLEDYITER 207

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             KV+L+R  ER GLI+ R  GAK ++ +V+VFLD+H E  +NWLPPLL PI  D K   
Sbjct: 208 FTKVKLVRLEERSGLIKGRLAGAKIAKAKVLVFLDSHSEANINWLPPLLEPIAQDYKTCV 267

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P ID I Y+T+E+R+    D   RG F+W + YK   L   + +     +EP+KSP  A
Sbjct: 268 CPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN---PTEPFKSPVMA 321

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+   FF ELGGYDP L +WGGE +ELSFKIW CGG +   PCSR+GH+YR F P+
Sbjct: 322 GGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKFPPF 381

Query: 321 -NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            N G      KG  +  NYKRV E W DE +  Y YTR P    L+ G++ EQ
Sbjct: 382 PNPG------KGDFLGKNYKRVAEVWMDE-YAEYIYTRRPHLRSLNPGNLKEQ 427


>gi|350586068|ref|XP_003482105.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Sus scrofa]
          Length = 559

 Score =  309 bits (792), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQDKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|410977586|ref|XP_003995186.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Felis
           catus]
          Length = 559

 Score =  309 bits (792), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|441596034|ref|XP_003276624.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Nomascus leucogenys]
 gi|119582046|gb|EAW61642.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 (GalNAc-T10),
           isoform CRA_d [Homo sapiens]
 gi|119582047|gb|EAW61643.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 (GalNAc-T10),
           isoform CRA_d [Homo sapiens]
          Length = 506

 Score =  309 bits (792), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 157/331 (47%), Positives = 211/331 (63%), Gaps = 12/331 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D +  E G N+  S+ IS +R++PD+R   C    Y   LP  S+I+ FHNEG+SSL+RT
Sbjct: 8   DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRT 67

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSR 161
           VHS++ R+P + + EI+LVDDFS +  L + LEDY+  F   VR++R  +REGLIRTR  
Sbjct: 68  VHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS-VRILRTKKREGLIRTRML 126

Query: 162 GAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPD 221
           GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P+ID ID+   +FR   +  
Sbjct: 127 GASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCPMIDVIDHD--DFRYETQAG 184

Query: 222 HHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGL 281
              RG F+W M YK   +P    K     S+P++SP  AGGLFA+DR +F ELGGYDPGL
Sbjct: 185 DAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGGLFAVDRKWFWELGGYDPGL 242

Query: 282 LVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRV 341
            +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY          G  +  N KRV
Sbjct: 243 EIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKVP------AGVSLARNLKRV 296

Query: 342 IETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            E W DE +  Y Y R P    L  GD++ Q
Sbjct: 297 AEVWMDE-YAEYIYQRRPEYRHLSAGDVAVQ 326


>gi|395846604|ref|XP_003795993.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           2 [Otolemur garnettii]
          Length = 558

 Score =  309 bits (792), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 163/359 (45%), Positives = 226/359 (62%), Gaps = 10/359 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           L PP   YK GPGE GKA  +P+  +            N+  S+ I+ +R++PD+R+E C
Sbjct: 49  LIPPQRDYK-GPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGC 107

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD S +  L   
Sbjct: 108 KTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 167

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE+Y++  +  V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L WL PLLA I
Sbjct: 168 LENYVKNLDVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 227

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 228 KEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 284

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +G
Sbjct: 285 LPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVG 344

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P  + +D GD+S
Sbjct: 345 HVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDVS 398


>gi|304259|gb|AAA68489.1| UDP-GalNAc:polypeptide, N-acetylgalactosaminyltransferase, partial
           [Bos taurus]
          Length = 519

 Score =  309 bits (792), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 8   LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 66

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S +  L + 
Sbjct: 67  KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 126

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 127 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 186

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 187 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 243

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 244 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 303

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 304 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 358

Query: 372 Q 372
           +
Sbjct: 359 R 359


>gi|296222514|ref|XP_002757211.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           1 [Callithrix jacchus]
 gi|403265072|ref|XP_003924779.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Saimiri
           boliviensis boliviensis]
          Length = 559

 Score =  309 bits (791), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 160/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   +EEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|355689583|gb|AER98881.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 [Mustela putorius
           furo]
          Length = 461

 Score =  309 bits (791), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 159/363 (43%), Positives = 226/363 (62%), Gaps = 9/363 (2%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
           G++  P++   EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E
Sbjct: 45  GDVLEPIQKPHEGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLE 104

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
            CK   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S +  L 
Sbjct: 105 GCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLK 164

Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
           + LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA
Sbjct: 165 RPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLA 224

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
            I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK +
Sbjct: 225 RIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281

Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
            + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS 
Sbjct: 282 RTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSH 341

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDI
Sbjct: 342 VGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDI 396

Query: 370 SEQ 372
           S +
Sbjct: 397 SSR 399


>gi|444723970|gb|ELW64593.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Tupaia chinensis]
          Length = 591

 Score =  309 bits (791), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 160/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 80  LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 138

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   +EEI+LVDD S +  L + 
Sbjct: 139 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 198

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 199 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 258

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 259 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 315

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 316 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 375

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 376 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 430

Query: 372 Q 372
           +
Sbjct: 431 R 431


>gi|268370157|ref|NP_001161259.1| polypeptide GalNAc transferase 6-like [Nasonia vitripennis]
          Length = 615

 Score =  309 bits (791), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 166/356 (46%), Positives = 218/356 (61%), Gaps = 14/356 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GKA  L  +     D      G N   S+ IS +R+IPD+R  +CK   Y
Sbjct: 83  EEKRTGTGEQGKAATLSPSMEDLKDRLYKVNGFNAALSDLISLNRSIPDIRHPDCKNKRY 142

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             DL   SV++ FHNE FS+LMRT  S+I R+P   L EIILVDD S+K +L  KL++Y+
Sbjct: 143 LKDLDPVSVVVSFHNEHFSTLMRTCWSVINRSPPSLLHEIILVDDASTKVELKDKLDEYV 202

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
           ++   KV+++R   R GLIR R  GA+++  +++VFLD+H E  +NWLPPLL PI  D K
Sbjct: 203 KKNLPKVKIVRLPRRSGLIRGRLAGARKATAKILVFLDSHSEANVNWLPPLLEPIAKDYK 262

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
               P ID I Y+T+E+R+    D   RG F+W + YK   L   + K     SEP+KSP
Sbjct: 263 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLKN---PSEPFKSP 316

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+   FF ELGGYDPGL +WGGE +ELSFKIW CGG +   PCSR+GH+YR F
Sbjct: 317 VMAGGLFAISAKFFWELGGYDPGLDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKF 376

Query: 318 MPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            P+ N G      +G  +  NYKRV E W DE +  + Y R P    +D GD++EQ
Sbjct: 377 PPFPNPG------RGDFLGKNYKRVAEVWMDE-YADFIYRRRPHLRAMDPGDLTEQ 425


>gi|417515619|gb|JAA53628.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Sus
           scrofa]
          Length = 506

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 156/331 (47%), Positives = 211/331 (63%), Gaps = 12/331 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D +  E G N+  S+ IS +R++PD+R   C    Y   LP  S+I+ FHNEG+SSL+RT
Sbjct: 8   DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLEMLPNTSIIIPFHNEGWSSLLRT 67

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSR 161
           VHS++ R+P + + EI+LVDDFS +  L + LEDY+  F   VR++R  +REGLIRTR  
Sbjct: 68  VHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALF-PNVRILRTKKREGLIRTRML 126

Query: 162 GAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPD 221
           GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P+ID ID+   +FR   +  
Sbjct: 127 GASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCPMIDVIDHD--DFRYETQAG 184

Query: 222 HHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGL 281
              RG F+W M YK   +P    K     S+P++SP  AGGLFA+DR +F ELGGYDPGL
Sbjct: 185 DAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGGLFAVDRKWFWELGGYDPGL 242

Query: 282 LVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRV 341
            +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY          G  +  N KRV
Sbjct: 243 EIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPYKVP------AGVSLARNLKRV 296

Query: 342 IETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            E W DE +  + Y R P    L  GD++ Q
Sbjct: 297 AEVWMDE-YAEHIYQRRPEYRHLSAGDVAAQ 326


>gi|431896245|gb|ELK05661.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Pteropus alecto]
          Length = 559

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 160/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHLLEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDI+ 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDIAS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|291391573|ref|XP_002712184.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Oryctolagus cuniculus]
          Length = 557

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 161/361 (44%), Positives = 226/361 (62%), Gaps = 9/361 (2%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
           G L   ++  +EGPGE GKA  +P+  +            N+  S+ I+ +R++PD+R+E
Sbjct: 45  GELLELIKENQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLE 104

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
            CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD S +  L 
Sbjct: 105 GCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLK 164

Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
             LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L WL PLLA
Sbjct: 165 LTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLA 224

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
            I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK +
Sbjct: 225 RIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281

Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
            + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS 
Sbjct: 282 RTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSH 341

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  + +D GD+
Sbjct: 342 VGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDV 396

Query: 370 S 370
           S
Sbjct: 397 S 397


>gi|395846602|ref|XP_003795992.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           1 [Otolemur garnettii]
          Length = 556

 Score =  308 bits (789), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 164/369 (44%), Positives = 229/369 (62%), Gaps = 13/369 (3%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K  +L P L       +EGPGE GKA  +P+  +            N+  S+ I+ +R
Sbjct: 36  DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R+E CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD
Sbjct: 96  SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L   LE+Y++  +  V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L
Sbjct: 156 ASERDFLKLTLENYVKNLDVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           E  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +E V CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387

Query: 362 MFLDMGDIS 370
           + +D GD+S
Sbjct: 388 VKVDYGDVS 396


>gi|224045872|ref|XP_002187347.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
           [Taeniopygia guttata]
          Length = 559

 Score =  308 bits (789), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 157/363 (43%), Positives = 226/363 (62%), Gaps = 9/363 (2%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
           G++  P++   EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E
Sbjct: 45  GDVPEPIQKPHEGPGEMGKPVVIPKEEQEKMKEMFKINQFNLMASEMIALNRSLPDVRLE 104

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
            CK   Y  +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S +  L 
Sbjct: 105 GCKTKVYADNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLK 164

Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
           + LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA
Sbjct: 165 RPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPLLA 224

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
            I +DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK +
Sbjct: 225 RIKADRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281

Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
            + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS 
Sbjct: 282 RTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSH 341

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDI
Sbjct: 342 VGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDI 396

Query: 370 SEQ 372
           S +
Sbjct: 397 SSR 399


>gi|344269062|ref|XP_003406374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Loxodonta africana]
          Length = 559

 Score =  308 bits (789), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 160/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP  LP+ SV++VFHNE +S+L+RTVHS++ R+P   LEEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDALPRTSVVIVFHNEAWSTLLRTVHSVLNRSPRHMLEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|395749824|ref|XP_002828218.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           1 [Pongo abelii]
          Length = 612

 Score =  308 bits (788), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 159/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   +EEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|1136285|gb|AAC50327.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
           sapiens]
          Length = 559

 Score =  308 bits (788), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 159/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   +EEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|57530428|ref|NP_001006381.1| polypeptide N-acetylgalactosaminyltransferase 1 [Gallus gallus]
 gi|326917238|ref|XP_003204908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Meleagris gallopavo]
 gi|53133506|emb|CAG32082.1| hypothetical protein RCJMB04_17f16 [Gallus gallus]
          Length = 559

 Score =  308 bits (788), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 157/363 (43%), Positives = 226/363 (62%), Gaps = 9/363 (2%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
           G++  P++   EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E
Sbjct: 45  GDVPEPIQKPHEGPGEMGKPVVIPKEEQEKMKEMFKINQFNLMASEMIALNRSLPDVRLE 104

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
            CK   Y  +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S +  L 
Sbjct: 105 GCKTKVYADNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLK 164

Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
           + LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA
Sbjct: 165 RPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPLLA 224

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
            I +DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK +
Sbjct: 225 RIKADRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281

Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
            + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS 
Sbjct: 282 RTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSH 341

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDI
Sbjct: 342 VGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDI 396

Query: 370 SEQ 372
           S +
Sbjct: 397 SSR 399


>gi|390464496|ref|XP_003733230.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           2 [Callithrix jacchus]
          Length = 561

 Score =  308 bits (788), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 160/359 (44%), Positives = 224/359 (62%), Gaps = 9/359 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           L   +   +EGPGE GKA  +P+  +            N+  S+ I+ +R++PD+R+E C
Sbjct: 46  LRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGC 105

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD S +  L   
Sbjct: 106 KTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 165

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L WL PLLA I
Sbjct: 166 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 282

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +G
Sbjct: 283 LPVRTPTMAGGLFSIDRTYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVG 342

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P  + +D GD+S
Sbjct: 343 HVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDVS 396


>gi|13124891|ref|NP_065207.2| polypeptide N-acetylgalactosaminyltransferase 1 [Homo sapiens]
 gi|386780838|ref|NP_001247531.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
 gi|332225596|ref|XP_003261968.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           1 [Nomascus leucogenys]
 gi|332849764|ref|XP_001135802.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           1 [Pan troglodytes]
 gi|397520346|ref|XP_003830280.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Pan
           paniscus]
 gi|426385782|ref|XP_004059381.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Gorilla
           gorilla gorilla]
 gi|1709558|sp|Q10472.1|GALT1_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
           AltName: Full=Polypeptide GalNAc transferase 1;
           Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 1;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 1; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 1
           soluble form
 gi|971459|emb|CAA59380.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase [Homo
           sapiens]
 gi|119621764|gb|EAX01359.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 (GalNAc-T1), isoform
           CRA_a [Homo sapiens]
 gi|119621765|gb|EAX01360.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 (GalNAc-T1), isoform
           CRA_a [Homo sapiens]
 gi|261861328|dbj|BAI47186.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 [synthetic
           construct]
 gi|355701910|gb|EHH29263.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
 gi|355754989|gb|EHH58856.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Macaca
           fascicularis]
 gi|380784241|gb|AFE63996.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
 gi|383411871|gb|AFH29149.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
 gi|384942418|gb|AFI34814.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
 gi|410258728|gb|JAA17331.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
           troglodytes]
 gi|410292416|gb|JAA24808.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
           troglodytes]
 gi|410338657|gb|JAA38275.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
           troglodytes]
          Length = 559

 Score =  308 bits (788), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 159/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   +EEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|348576706|ref|XP_003474127.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Cavia porcellus]
          Length = 559

 Score =  308 bits (788), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 159/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   +EEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|158259585|dbj|BAF85751.1| unnamed protein product [Homo sapiens]
          Length = 559

 Score =  308 bits (788), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 159/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   +EEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|403258987|ref|XP_003922020.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
           [Saimiri boliviensis boliviensis]
          Length = 556

 Score =  308 bits (788), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 161/362 (44%), Positives = 225/362 (62%), Gaps = 9/362 (2%)

Query: 10  LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
           L  L   +   +EGPGE GKA  +P+  +            N+  S+ I+ +R++PD+R+
Sbjct: 43  LPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRL 102

Query: 70  EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
           E CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD S +  L
Sbjct: 103 EGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASEREFL 162

Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
              LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L WL PLL
Sbjct: 163 KLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLL 222

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
           A I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK 
Sbjct: 223 ARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKG 279

Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
           + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS
Sbjct: 280 DRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCS 339

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
            +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  + +D GD
Sbjct: 340 HVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGD 394

Query: 369 IS 370
           +S
Sbjct: 395 VS 396


>gi|1582794|prf||2119305A UDP-GalNAc/polypeptide N-acetylgalactosaminyltransferase
          Length = 559

 Score =  308 bits (788), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 159/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   +EEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLDFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|27530993|dbj|BAC54545.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 [Homo sapiens]
 gi|193785960|dbj|BAG54747.1| unnamed protein product [Homo sapiens]
          Length = 556

 Score =  308 bits (788), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K  +L P L       +EGPGE GKA  +P+  +            N+  S+ I+ +R
Sbjct: 36  DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R+E CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD
Sbjct: 96  SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L   LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           E  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +E V CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387

Query: 362 MFLDMGDIS 370
           + +D GD+S
Sbjct: 388 VKVDYGDVS 396


>gi|301766697|ref|XP_002918769.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 1 [Ailuropoda melanoleuca]
          Length = 556

 Score =  308 bits (788), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K  +L P L       +EGPGE GKA  +P+  +            N+  S+ I+ +R
Sbjct: 36  DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R+E CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD
Sbjct: 96  SLPDVRLEGCKTKIYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDD 155

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L   LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           E  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +E V CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387

Query: 362 MFLDMGDIS 370
           + +D GD+S
Sbjct: 388 VKVDYGDVS 396


>gi|296204781|ref|XP_002749478.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           1 [Callithrix jacchus]
          Length = 556

 Score =  308 bits (788), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 161/362 (44%), Positives = 225/362 (62%), Gaps = 9/362 (2%)

Query: 10  LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
           L  L   +   +EGPGE GKA  +P+  +            N+  S+ I+ +R++PD+R+
Sbjct: 43  LPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRL 102

Query: 70  EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
           E CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD S +  L
Sbjct: 103 EGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFL 162

Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
              LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L WL PLL
Sbjct: 163 KLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLL 222

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
           A I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK 
Sbjct: 223 ARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKG 279

Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
           + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS
Sbjct: 280 DRTLPVRTPTMAGGLFSIDRTYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCS 339

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
            +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  + +D GD
Sbjct: 340 HVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGD 394

Query: 369 IS 370
           +S
Sbjct: 395 VS 396


>gi|332251760|ref|XP_003275017.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           1 [Nomascus leucogenys]
          Length = 556

 Score =  308 bits (788), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K  +L P L       +EGPGE GKA  +P+  +            N+  S+ I+ +R
Sbjct: 36  DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R+E CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD
Sbjct: 96  SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L   LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           E  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +E V CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387

Query: 362 MFLDMGDIS 370
           + +D GD+S
Sbjct: 388 VKVDYGDVS 396


>gi|387017208|gb|AFJ50722.1| Polypeptide N-acetylgalactosaminyltransferase 13-like [Crotalus
           adamanteus]
          Length = 556

 Score =  307 bits (787), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 160/362 (44%), Positives = 226/362 (62%), Gaps = 9/362 (2%)

Query: 10  LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
           L  L   +   +EGPGE GKA  +P+  +            N+  S+ I+F+R++PD+R+
Sbjct: 43  LPALRAVMSRSQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDMIAFNRSLPDVRL 102

Query: 70  EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
           E CK   YP +LP  SV++VFHNE +S+L+RT++S++ R+P   L EIILVDD S +  L
Sbjct: 103 EGCKTKVYPDELPTTSVVIVFHNEAWSTLLRTIYSVMNRSPHYLLSEIILVDDASERDFL 162

Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
              LE+Y++     V++IR  +R GLIR R RGA  S+G+VI FLDAHCE    WL PLL
Sbjct: 163 KLPLENYVRNLQVPVKIIRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTTGWLEPLL 222

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
           A I  DRKI+  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK 
Sbjct: 223 ARIKEDRKIVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKG 279

Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
           + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS
Sbjct: 280 DRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCS 339

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
            +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  + +D GD
Sbjct: 340 HVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGD 394

Query: 369 IS 370
           +S
Sbjct: 395 VS 396


>gi|116003987|ref|NP_001070354.1| polypeptide N-acetylgalactosaminyltransferase 13 [Bos taurus]
 gi|115304963|gb|AAI23663.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Bos
           taurus]
 gi|296490573|tpg|DAA32686.1| TPA: polypeptide N-acetylgalactosaminyltransferase 13 [Bos taurus]
          Length = 556

 Score =  307 bits (787), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K  +L P L       +EGPGE GKA  +P+  +            N+  S+ I+ +R
Sbjct: 36  DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R+E CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD
Sbjct: 96  SLPDVRLEGCKTRVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L   LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           E  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +E V CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387

Query: 362 MFLDMGDIS 370
           + +D GD+S
Sbjct: 388 VKVDYGDVS 396


>gi|145309313|ref|NP_443149.2| polypeptide N-acetylgalactosaminyltransferase 13 [Homo sapiens]
 gi|114581261|ref|XP_515839.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           2 [Pan troglodytes]
 gi|297668636|ref|XP_002812536.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           1 [Pongo abelii]
 gi|297668638|ref|XP_002812537.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           2 [Pongo abelii]
 gi|397525640|ref|XP_003832767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Pan
           paniscus]
 gi|116242497|sp|Q8IUC8.2|GLT13_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
           AltName: Full=Polypeptide GalNAc transferase 13;
           Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 13;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 13
 gi|51490969|emb|CAD44533.2| polypeptide N-acetylgalactosaminyltransferase 13 [Homo sapiens]
 gi|71680339|gb|AAI01032.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
           sapiens]
 gi|71681791|gb|AAI01034.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
           sapiens]
 gi|115528820|gb|AAI01035.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
           sapiens]
 gi|119631869|gb|EAX11464.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
           isoform CRA_a [Homo sapiens]
 gi|119631870|gb|EAX11465.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
           isoform CRA_a [Homo sapiens]
 gi|380783281|gb|AFE63516.1| polypeptide N-acetylgalactosaminyltransferase 13 [Macaca mulatta]
          Length = 556

 Score =  307 bits (787), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K  +L P L       +EGPGE GKA  +P+  +            N+  S+ I+ +R
Sbjct: 36  DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R+E CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD
Sbjct: 96  SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L   LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           E  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +E V CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387

Query: 362 MFLDMGDIS 370
           + +D GD+S
Sbjct: 388 VKVDYGDVS 396


>gi|332030162|gb|EGI69956.1| N-acetylgalactosaminyltransferase 6 [Acromyrmex echinatior]
          Length = 603

 Score =  307 bits (787), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 165/356 (46%), Positives = 220/356 (61%), Gaps = 14/356 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK   L  +     +      G N   S+ IS +R++PD+R  +C+   Y
Sbjct: 78  EEKRTGIGEHGKPAFLSPSLDVLKEKLYQVNGFNAAVSDEISMNRSVPDIRHPDCRKKKY 137

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             +L   SVI+ FHNE FS+L+RT  S++ R+P   LEEIILVDD S+K +L +KL+DY+
Sbjct: 138 LKNLDPISVIVSFHNEHFSTLLRTCWSVVNRSPPSLLEEIILVDDASTKIELKKKLDDYV 197

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            +   KV ++R ++R GLIR R  GAK++R +V+VFLD+H E  +NWLPPLL PI  + K
Sbjct: 198 AQHLPKVSIVRLSKRSGLIRGRLAGAKKARAKVLVFLDSHSEANVNWLPPLLEPIAQNYK 257

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
               P ID I Y+T+E+ +    D   RG F+W + YK   L   + K+    +EP+KSP
Sbjct: 258 TCVCPFIDVIAYETFEYIA---QDEGSRGAFDWELYYKRLPLLPEDLKR---PTEPFKSP 311

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+   FF ELGGYDPGL +WGGE +ELSFKIW CGG +   PCSR+GHVYR F
Sbjct: 312 IMAGGLFAISAKFFWELGGYDPGLDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHVYRKF 371

Query: 318 MPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            P+ N G      +G  +  N+KRV E W DE +  Y Y R P    LD GD+SEQ
Sbjct: 372 PPFPNPG------RGDFLGKNFKRVAEVWMDE-YAEYLYKRRPHLRTLDPGDLSEQ 420


>gi|26337335|dbj|BAC32353.1| unnamed protein product [Mus musculus]
          Length = 556

 Score =  307 bits (787), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K  +L P L       +EGPGE GKA  +P+  +            N+  S+ I+ +R
Sbjct: 36  DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R+E CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD
Sbjct: 96  SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L   LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L
Sbjct: 156 ASERDFLKLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           E  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +E V CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387

Query: 362 MFLDMGDIS 370
           + +D GD+S
Sbjct: 388 VKVDYGDVS 396


>gi|76677928|ref|NP_766618.2| polypeptide N-acetylgalactosaminyltransferase 13 [Mus musculus]
 gi|51315989|sp|Q8CF93.1|GLT13_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
           AltName: Full=Polypeptide GalNAc transferase 13;
           Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 13;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 13
 gi|27531011|dbj|BAC54546.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 [Mus musculus]
 gi|124297181|gb|AAI31652.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 [Mus musculus]
 gi|124297498|gb|AAI31653.1| Galnt13 protein [Mus musculus]
 gi|148694972|gb|EDL26919.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
           musculus]
 gi|148694973|gb|EDL26920.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
           musculus]
 gi|148694975|gb|EDL26922.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
           musculus]
          Length = 556

 Score =  307 bits (787), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K  +L P L       +EGPGE GKA  +P+  +            N+  S+ I+ +R
Sbjct: 36  DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R+E CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD
Sbjct: 96  SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L   LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L
Sbjct: 156 ASERDFLKLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           E  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +E V CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387

Query: 362 MFLDMGDIS 370
           + +D GD+S
Sbjct: 388 VKVDYGDVS 396


>gi|40018588|ref|NP_954537.1| polypeptide N-acetylgalactosaminyltransferase 13 [Rattus
           norvegicus]
 gi|51315705|sp|Q6UE39.1|GLT13_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
           AltName: Full=Polypeptide GalNAc transferase 13;
           Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 13;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 13
 gi|34577141|gb|AAQ75749.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 [Rattus norvegicus]
 gi|149047803|gb|EDM00419.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_a
           [Rattus norvegicus]
 gi|149047804|gb|EDM00420.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_a
           [Rattus norvegicus]
 gi|149047805|gb|EDM00421.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_a
           [Rattus norvegicus]
          Length = 556

 Score =  307 bits (787), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K  +L P L       +EGPGE GKA  +P+  +            N+  S+ I+ +R
Sbjct: 36  DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R+E CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD
Sbjct: 96  SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L   LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L
Sbjct: 156 ASERDFLKLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           E  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +E V CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387

Query: 362 MFLDMGDIS 370
           + +D GD+S
Sbjct: 388 VKVDYGDVS 396


>gi|426221079|ref|XP_004004739.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Ovis
           aries]
          Length = 556

 Score =  307 bits (787), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K  +L P L       +EGPGE GKA  +P+  +            N+  S+ I+ +R
Sbjct: 36  DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R+E CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD
Sbjct: 96  SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L   LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           E  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +E V CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387

Query: 362 MFLDMGDIS 370
           + +D GD+S
Sbjct: 388 VKVDYGDVS 396


>gi|115528959|gb|AAI01033.1| GALNT13 protein [Homo sapiens]
 gi|355564904|gb|EHH21393.1| hypothetical protein EGK_04446 [Macaca mulatta]
          Length = 561

 Score =  307 bits (787), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K  +L P L       +EGPGE GKA  +P+  +            N+  S+ I+ +R
Sbjct: 36  DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R+E CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD
Sbjct: 96  SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L   LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           E  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +E V CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387

Query: 362 MFLDMGDIS 370
           + +D GD+S
Sbjct: 388 VKVDYGDVS 396


>gi|281347645|gb|EFB23229.1| hypothetical protein PANDA_007284 [Ailuropoda melanoleuca]
          Length = 516

 Score =  307 bits (787), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 159/351 (45%), Positives = 222/351 (63%), Gaps = 9/351 (2%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           +EGPGE GKA  +P+  +            N+  S+ I+ +R++PD+R+E CK   YP +
Sbjct: 9   QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKIYPDE 68

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD S +  L   LE+Y++  
Sbjct: 69  LPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERDFLKLTLENYVKNL 128

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
              V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L WL PLLA I  DRK + 
Sbjct: 129 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 188

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
            P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + + P ++PT 
Sbjct: 189 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTM 245

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R   P
Sbjct: 246 AGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATP 305

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           Y F        G +I  N +R+ E W DE  K +FY   P  + +D GD+S
Sbjct: 306 YTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDVS 351


>gi|402902957|ref|XP_003914352.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Papio
           anubis]
          Length = 559

 Score =  307 bits (786), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 159/361 (44%), Positives = 226/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   +EEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LERYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|350409603|ref|XP_003488790.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like [Bombus
           impatiens]
          Length = 610

 Score =  307 bits (786), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 167/353 (47%), Positives = 217/353 (61%), Gaps = 14/353 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GK   L  +  A  +      G N   S+ IS +R++PD+R  +CK   Y  +
Sbjct: 88  RTGIGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISMNRSVPDIRHPDCKKKKYLRN 147

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           L   SVI+ FHNE FS+LMRT  S+I R+PA  L+EIILVDD S+K +L + LEDYI   
Sbjct: 148 LDSVSVIVSFHNEHFSTLMRTCWSVINRSPAFLLKEIILVDDASTKVELKKPLEDYITEH 207

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             KV+++R  ER GLI+ R  GAK ++ +V+VFLD+H E  +NWLPPLL PI  D K   
Sbjct: 208 LTKVKIVRLEERSGLIKGRLAGAKIAKAKVLVFLDSHSEANVNWLPPLLEPIAQDYKTCV 267

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P ID I Y+T+E+R+    D   RG F+W + YK   L   + +     +EP+KSP  A
Sbjct: 268 CPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN---PTEPFKSPVMA 321

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+   FF ELGGYDP L +WGGE +ELSFKIW CGG +   PCSR+GH+YR F P+
Sbjct: 322 GGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKFPPF 381

Query: 321 -NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            N G      KG  +  NYKRV E W DE +  Y YTR P    L+ G++ EQ
Sbjct: 382 PNPG------KGDFLGKNYKRVAEVWMDE-YAEYIYTRRPHLRSLNPGNLKEQ 427


>gi|15620895|dbj|BAB67811.1| KIAA1918 protein [Homo sapiens]
          Length = 516

 Score =  307 bits (786), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 159/351 (45%), Positives = 222/351 (63%), Gaps = 9/351 (2%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           +EGPGE GKA  +P+  +            N+  S+ I+ +R++PD+R+E CK   YP +
Sbjct: 14  QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE 73

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD S +  L   LE+Y++  
Sbjct: 74  LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNL 133

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
              V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L WL PLLA I  DRK + 
Sbjct: 134 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 193

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
            P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + + P ++PT 
Sbjct: 194 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTM 250

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R   P
Sbjct: 251 AGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATP 310

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           Y F        G +I  N +R+ E W DE  K +FY   P  + +D GD+S
Sbjct: 311 YTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDVS 356


>gi|148694974|gb|EDL26921.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_b [Mus
           musculus]
          Length = 594

 Score =  307 bits (786), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K  +L P L       +EGPGE GKA  +P+  +            N+  S+ I+ +R
Sbjct: 38  DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 97

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R+E CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD
Sbjct: 98  SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 157

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L   LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L
Sbjct: 158 ASERDFLKLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 217

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+R
Sbjct: 218 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 274

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           E  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 275 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 334

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +E V CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  
Sbjct: 335 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 389

Query: 362 MFLDMGDIS 370
           + +D GD+S
Sbjct: 390 VKVDYGDVS 398


>gi|126320794|ref|XP_001362869.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
           [Monodelphis domestica]
          Length = 559

 Score =  307 bits (786), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 160/361 (44%), Positives = 225/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LE   +P+ EGPGE GK   +P+  +            N+  S  I+ +RT+PD+R+E C
Sbjct: 48  LETVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRTLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVRKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KVDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRHYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDIST 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|26332527|dbj|BAC29981.1| unnamed protein product [Mus musculus]
          Length = 592

 Score =  307 bits (786), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 164/369 (44%), Positives = 228/369 (61%), Gaps = 13/369 (3%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K  +L P L       +EGPGE GKA  +P+  +            N+  S+ I+ +R
Sbjct: 36  DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R+E CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD
Sbjct: 96  SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L   LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L
Sbjct: 156 ASERDFLKLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           E  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +E V CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387

Query: 362 MFLDMGDIS 370
           + +D GD+S
Sbjct: 388 VKVDYGDVS 396


>gi|417402739|gb|JAA48205.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
           rotundus]
          Length = 559

 Score =  306 bits (785), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 158/361 (43%), Positives = 225/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+  R+P   LEEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVTDRSPRHMLEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGASVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KQDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GD++ 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDVAS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|313230315|emb|CBY08019.1| unnamed protein product [Oikopleura dioica]
          Length = 589

 Score =  306 bits (784), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 167/354 (47%), Positives = 224/354 (63%), Gaps = 15/354 (4%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK   L    +    ++  + G N+  S+ IS DR++ D+R   CK   Y
Sbjct: 92  EAARTGLGEQGKPVTLFGHEKL--HSAYKDNGFNILVSDRISLDRSLHDIRHASCKSKKY 149

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             DLP  SVI+ FHNEG S+L+RT+HS+  R+P   L+EI+LVDD SS+  L ++LE  +
Sbjct: 150 YSDLPDVSVIIPFHNEGLSTLLRTIHSLHNRSPESLLKEIVLVDDASSRP-LYKELESSL 208

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            +F  KV+LIRN  R+GLIR+R RG   ++G V+V LD+H EV  NWLPPLL PI  DRK
Sbjct: 209 AKF-PKVKLIRNPTRQGLIRSRVRGVHLAKGGVVVILDSHVEVSTNWLPPLLHPISLDRK 267

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID + +++  V +P    RG F+W + YK   +P    K+ K  SEP++SP
Sbjct: 268 TVVCPMIDIIDNENFQY--VTQPGDAMRGAFDWELYYKRIPIPNE--KRPKDPSEPFESP 323

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA++R +F E+G YD GL +WGGE +ELSFK+WMCGG I   PCSRIGH+YR F
Sbjct: 324 VMAGGLFAIERNYFYEIGLYDEGLEIWGGEQYELSFKVWMCGGRILDSPCSRIGHIYRKF 383

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           +PY          GP   YNYKRV E W DE +  +FY R P    +D GD+S+
Sbjct: 384 VPYTIPNNG----GP--NYNYKRVAEVWMDE-YAEFFYRRRPYVRKIDAGDLSK 430


>gi|431894826|gb|ELK04619.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Pteropus alecto]
          Length = 519

 Score =  306 bits (784), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 159/351 (45%), Positives = 221/351 (62%), Gaps = 9/351 (2%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           +EGPGE GKA  +P+  +            N+  S+ I+ +R++PD+R+E CK   YP  
Sbjct: 17  QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDQ 76

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD S +  L   LE+Y++  
Sbjct: 77  LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNL 136

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
              V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L WL PLLA I  DRK + 
Sbjct: 137 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 196

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
            P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + + P ++PT 
Sbjct: 197 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTM 253

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R   P
Sbjct: 254 AGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATP 313

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           Y F        G +I  N +R+ E W DE  K +FY   P  + +D GD+S
Sbjct: 314 YTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDVS 359


>gi|13242273|ref|NP_077349.1| polypeptide N-acetylgalactosaminyltransferase 1 [Rattus norvegicus]
 gi|1709559|sp|Q10473.1|GALT1_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
           AltName: Full=Polypeptide GalNAc transferase 1;
           Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 1;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 1; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 1
           soluble form
 gi|1141792|gb|AAC52511.1| polypeptide GalNAc transferase [Rattus norvegicus]
 gi|149017082|gb|EDL76133.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 [Rattus norvegicus]
 gi|1587757|prf||2207253A UDP-GalNAc polypeptide N-acetylgalactosaminyltransferase
          Length = 559

 Score =  305 bits (782), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 159/361 (44%), Positives = 225/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LE   +P+ EGPGE GK   +P+  +            N+  S  I+F+R++PD+R+E C
Sbjct: 48  LELVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIAFNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP  LP  SV++VFHNE +S+L+RTVHS+I R+P   +EEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDSLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|449676829|ref|XP_002167311.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Hydra magnipapillata]
          Length = 603

 Score =  305 bits (782), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 151/334 (45%), Positives = 214/334 (64%), Gaps = 4/334 (1%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           PGE G    + E  +        ++  N   S+ IS  R++ D R ++CK   YP+DLP 
Sbjct: 107 PGELGTGVTVEENEKEKEKLGYEKHAFNQLVSDKISIHRSLKDYRNDQCKVKKYPVDLPP 166

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
            SVI+ FHNE +S+L+RTVHS+I RTP QYL+EIILVDD S+  DL Q+L+DYI      
Sbjct: 167 TSVIICFHNEAWSTLLRTVHSVINRTPPQYLKEIILVDDASTSDDLKQRLDDYIPNLK-I 225

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
           V ++R  +R+GLIR R  GAK+++G ++ FLDAHCE  L W  PLLA I  DR+ + +PV
Sbjct: 226 VSIVRLRDRQGLIRARLEGAKKAKGPILTFLDAHCECTLGWAEPLLAKIKEDRQNVVMPV 285

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 263
           ID I    + + +V EP    RG+F+W + +    +P  E ++RK+ S+  K+P  AGGL
Sbjct: 286 IDEISETNFNYNAVPEP--FQRGVFKWRLEFTWRPIPSYEEQRRKHESDGIKTPVMAGGL 343

Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
           F+++R +F E+G YD G+ +WGGEN E+SF+IWMCGGSIE +PCSR+GHV+R   PY+F 
Sbjct: 344 FSINRDYFYEMGSYDTGMDIWGGENIEISFRIWMCGGSIEMLPCSRVGHVFRPRFPYSFP 403

Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
                  G +++ N  RV + W DE  K ++  R
Sbjct: 404 NRRGG-DGDVVSRNLMRVADVWMDEYAKHFYNIR 436


>gi|33440465|gb|AAH56215.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 [Mus musculus]
          Length = 559

 Score =  305 bits (782), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 159/361 (44%), Positives = 224/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LE   +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LELVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKTNQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   +EEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  SRG+VI FLDAHCE    WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVITFLDAHCECTAGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|327281383|ref|XP_003225428.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 1 [Anolis carolinensis]
          Length = 556

 Score =  305 bits (782), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 159/362 (43%), Positives = 225/362 (62%), Gaps = 9/362 (2%)

Query: 10  LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
           L  L   +   +EGPGE GKA  +P+  +            N+  S+ I+ +R++PD+R+
Sbjct: 43  LPALRAVMSRSQEGPGEMGKAVIIPKDDQEKMKELFKINQFNLMASDMIALNRSLPDVRL 102

Query: 70  EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
           E CK   YP +LP  SV++VFHNE +S+L+RT++S+I R P   L EIILVDD S +  L
Sbjct: 103 EGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTIYSVINRAPHYLLAEIILVDDASERDFL 162

Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
              LE+Y++     V+++R  +R GLIR R RGA  S+G+VI FLDAHCE  L WL PLL
Sbjct: 163 KVPLENYVKTLQVPVKIMRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLL 222

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
           A I  DRKI+  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK 
Sbjct: 223 ARIKEDRKIVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKG 279

Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
           + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS
Sbjct: 280 DRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCS 339

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
            +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  + +D GD
Sbjct: 340 HVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGD 394

Query: 369 IS 370
           ++
Sbjct: 395 VT 396


>gi|156397428|ref|XP_001637893.1| predicted protein [Nematostella vectensis]
 gi|156225009|gb|EDO45830.1| predicted protein [Nematostella vectensis]
          Length = 398

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 157/326 (48%), Positives = 212/326 (65%), Gaps = 14/326 (4%)

Query: 48  YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
           Y  N   S+ ++ DR+IPD R + C    YP  LP ASVI++FHNE +S+L+RTVHS++ 
Sbjct: 13  YQFNELASSKVALDRSIPDNRPQSCLSLSYPTKLPTASVIIIFHNEAWSTLLRTVHSVLA 72

Query: 108 RTPAQYLEEIILVDDFS---SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           R+P   L EI+LVDD S   +   L  KLE YI +F  KV+LIR  +REGLIR R  GAK
Sbjct: 73  RSPPYLLREIVLVDDHSRLDTYGHLGSKLESYISQFT-KVQLIRAPKREGLIRARLIGAK 131

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
           +++GEV+VFLD+HCE  L WL PLLA I  +R I+  P I+ ID +T  F   +E   + 
Sbjct: 132 QAKGEVLVFLDSHCEANLGWLEPLLARIGENRSIVVTPDIEVIDLRT--FGYTHEHGANN 189

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RGIF W + +K   +PE E ++RK +S+P +SPT AGGLFA+D+++F E+G YD  +  W
Sbjct: 190 RGIFNWELTFKWRGIPEYERRRRKSDSDPIRSPTMAGGLFAIDKSYFYEIGSYDTEMSFW 249

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
           GGEN E+SF+IWMCGGS+E +PCS++GHV+R   PY  G+ A       I  N  R+ E 
Sbjct: 250 GGENVEISFRIWMCGGSLEIIPCSKVGHVFRESQPYKIGEGA-------IDRNNMRLAEV 302

Query: 345 WFDEKHKAYFYTREPLAMFLDMGDIS 370
           W D+ +K  FY   P     D GD+S
Sbjct: 303 WMDD-YKKIFYAMRPQLKGKDYGDVS 327


>gi|74004307|ref|XP_855648.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           3 [Canis lupus familiaris]
          Length = 556

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 163/369 (44%), Positives = 227/369 (61%), Gaps = 13/369 (3%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K  +L P L       +EGPGE GKA  +P+  +            N+  S+ I+ +R
Sbjct: 36  DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R+E CK   Y  +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD
Sbjct: 96  SLPDVRLEGCKTKVYADELPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDD 155

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L   LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           E  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +E V CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387

Query: 362 MFLDMGDIS 370
           + +D GD+S
Sbjct: 388 VKVDYGDVS 396


>gi|351714454|gb|EHB17373.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Heterocephalus
           glaber]
          Length = 559

 Score =  305 bits (781), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 158/361 (43%), Positives = 225/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVIIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   +EEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMVEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DR+ +  P+I  I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KQDRRTVVCPIICVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|126326410|ref|XP_001373038.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
           [Monodelphis domestica]
          Length = 556

 Score =  304 bits (779), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 159/362 (43%), Positives = 223/362 (61%), Gaps = 9/362 (2%)

Query: 10  LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 69
           L  L   +   +EGPGE GKA  +P+  +            N+  S+ I+ +R++PD+R+
Sbjct: 43  LPALRAVISRNQEGPGEMGKAVRIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRL 102

Query: 70  EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
           E CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L EIILVDD S +  L
Sbjct: 103 EGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEIILVDDASERDFL 162

Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
              LE+Y++     V++IR  +R GLIR R RGA  S+G+VI FLDAHCE  L WL PLL
Sbjct: 163 KMALENYVKNLEVPVKIIRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLL 222

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
           A I   RK +  P+ID I    +E+ +    D  Y G F W + ++   +P+RE  +RK 
Sbjct: 223 ARIKESRKTVVCPIIDLISDDNFEYTA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKG 279

Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
           + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS
Sbjct: 280 DRTLPVRTPTMAGGLFSIDRNYFEEIGAYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCS 339

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
            +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  + +D GD
Sbjct: 340 HVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGD 394

Query: 369 IS 370
           +S
Sbjct: 395 VS 396


>gi|237874259|ref|NP_038842.3| polypeptide N-acetylgalactosaminyltransferase 1 [Mus musculus]
 gi|237874270|ref|NP_001153876.1| polypeptide N-acetylgalactosaminyltransferase 1 [Mus musculus]
 gi|13878613|sp|O08912.1|GALT1_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
           AltName: Full=Polypeptide GalNAc transferase 1;
           Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 1;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 1; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 1
           soluble form
 gi|2149049|gb|AAB58477.1| polypeptide GalNAc transferase-T1 [Mus musculus]
 gi|60552620|gb|AAH90962.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 [Mus musculus]
          Length = 559

 Score =  304 bits (779), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 159/361 (44%), Positives = 224/361 (62%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LE   +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LELVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   +EEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  SRG+VI FLDAHCE    WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVITFLDAHCECTAGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS 
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISS 398

Query: 372 Q 372
           +
Sbjct: 399 R 399


>gi|432932493|ref|XP_004081766.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 1 [Oryzias latipes]
          Length = 557

 Score =  304 bits (779), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 155/363 (42%), Positives = 225/363 (61%), Gaps = 9/363 (2%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
           G +   +    EGPGE GKA ++ +  +            N+  S+ I+ +R++PD+R++
Sbjct: 45  GQVVTVISRSHEGPGEMGKAVNIAKDDQEKMKELFKINQFNLMASDMIALNRSLPDVRLD 104

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
            CK   Y  DLP  S+++VFHNE +S+L+RTVHS+I R+P   L EI+LVDD S +  L 
Sbjct: 105 GCKTKVYADDLPTTSIVIVFHNEAWSTLLRTVHSVISRSPRHLLVEIVLVDDASERDFLK 164

Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
           +KLE Y++     V+++R  +R GLIR R RGA  + G+VI FLDAHCE    WL PLLA
Sbjct: 165 KKLEGYVRTLEVPVKILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTEGWLEPLLA 224

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
            I  DR  +  P+ID I  +T+E+ +    D  Y G F W + ++   +P+RE  +RK +
Sbjct: 225 RIKEDRTAVVCPIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281

Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
            + P ++PT AGGLF++D+ +F E+G YDPG+ +WGGEN E+SF+IW CGGS+E V CS 
Sbjct: 282 RTLPVRTPTMAGGLFSIDKMYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTCSH 341

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHV+R   PY+F        G +I  N +R+ E W DE  K +FY   P  M +D GD+
Sbjct: 342 VGHVFRKATPYSFPGGT----GQVINKNNRRLAEVWMDE-FKDFFYIISPGVMRVDYGDV 396

Query: 370 SEQ 372
           S +
Sbjct: 397 SSR 399


>gi|327275061|ref|XP_003222292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Anolis carolinensis]
          Length = 559

 Score =  304 bits (779), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 157/363 (43%), Positives = 226/363 (62%), Gaps = 9/363 (2%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
           G++   ++   EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E
Sbjct: 45  GDVPELVQKPHEGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLE 104

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
            CK   Y  +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEIILVDD S +  L 
Sbjct: 105 GCKTKVYSDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHILEEIILVDDASERDFLK 164

Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
           + LE+Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA
Sbjct: 165 RLLENYVKKLQIPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPLLA 224

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
            I +DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK +
Sbjct: 225 RIKADRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281

Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
            + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS 
Sbjct: 282 RTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSH 341

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDI
Sbjct: 342 VGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDI 396

Query: 370 SEQ 372
           S +
Sbjct: 397 SSR 399


>gi|443727149|gb|ELU14019.1| hypothetical protein CAPTEDRAFT_197005 [Capitella teleta]
          Length = 613

 Score =  304 bits (778), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 158/358 (44%), Positives = 219/358 (61%), Gaps = 15/358 (4%)

Query: 17  LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
           +E  + GPGE G A  L        DA     G N   S+ IS  R++ D+R  +C+   
Sbjct: 86  IEKQRTGPGEQGAAVILSSDEEKKKDALYKVNGFNGFASDKISLQRSLKDIRHPQCRTQK 145

Query: 77  YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
           Y   LP  SV++ FHNE +S+L+RT  S++ R+P + + EIILVDDFSSK    + L+D+
Sbjct: 146 YWNKLPTVSVVVPFHNEHWSTLLRTAESVLVRSPPELIHEIILVDDFSSKEHCGKPLDDH 205

Query: 137 I-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           +   + GKV++I   +REGLIRTR  GA+E+ G+V++FLD+HCE  +NWLPPLL PI  D
Sbjct: 206 LATHYGGKVKVIHQPKREGLIRTRLAGAREATGDVLIFLDSHCEANVNWLPPLLDPIAED 265

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKKRKYNSEPY 254
            + +  P ID +DY+T+ +R+    D   RG F+W   YK   L PE      K+ + P+
Sbjct: 266 YRTVVCPFIDVVDYETFAYRA---QDEGARGAFDWEFFYKRLPLLPE----DLKHPARPF 318

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           KSP  AGGLFA+   +F ELGGYDPGL +WGGE +ELSFK+W CGG +   PCSR+GH+Y
Sbjct: 319 KSPVMAGGLFAISAKWFWELGGYDPGLDIWGGEQYELSFKLWQCGGQMLDAPCSRVGHIY 378

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           R F P+    + D      +  NY+RV E W DE +  + Y R P    +  G+I+EQ
Sbjct: 379 RKFAPFPNPGVGD-----FVGRNYRRVAEVWMDE-YAEFLYKRRPQYRSIQPGNITEQ 430


>gi|432932495|ref|XP_004081767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 2 [Oryzias latipes]
          Length = 556

 Score =  304 bits (778), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 154/352 (43%), Positives = 222/352 (63%), Gaps = 9/352 (2%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           EGPGE GKA ++ +  +            N+  S+ I+ +R++PD+R++ CK   Y  DL
Sbjct: 55  EGPGEMGKAVNIAKDDQEKMKELFKINQFNLMASDMIALNRSLPDVRLDGCKTKVYADDL 114

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  S+++VFHNE +S+L+RTVHS+I R+P   L EI+LVDD S +  L +KLE Y++   
Sbjct: 115 PTTSIVIVFHNEAWSTLLRTVHSVISRSPRHLLVEIVLVDDASERDFLKKKLEGYVRTLE 174

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
             V+++R  +R GLIR R RGA  + G+VI FLDAHCE    WL PLLA I  DR  +  
Sbjct: 175 VPVKILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTEGWLEPLLARIKEDRTAVVC 234

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
           P+ID I  +T+E+ +    D  Y G F W + ++   +P+RE  +RK + + P ++PT A
Sbjct: 235 PIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMA 291

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLF++D+ +F E+G YDPG+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R   PY
Sbjct: 292 GGLFSIDKMYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPY 351

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +F        G +I  N +R+ E W DE  K +FY   P  M +D GD+S +
Sbjct: 352 SFPGGT----GQVINKNNRRLAEVWMDE-FKDFFYIISPGVMRVDYGDVSSR 398


>gi|432932497|ref|XP_004081768.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 3 [Oryzias latipes]
          Length = 558

 Score =  303 bits (777), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 154/352 (43%), Positives = 222/352 (63%), Gaps = 9/352 (2%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           EGPGE GKA ++ +  +            N+  S+ I+ +R++PD+R++ CK   Y  DL
Sbjct: 57  EGPGEMGKAVNIAKDDQEKMKELFKINQFNLMASDMIALNRSLPDVRLDGCKTKVYADDL 116

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  S+++VFHNE +S+L+RTVHS+I R+P   L EI+LVDD S +  L +KLE Y++   
Sbjct: 117 PTTSIVIVFHNEAWSTLLRTVHSVISRSPRHLLVEIVLVDDASERDFLKKKLEGYVRTLE 176

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
             V+++R  +R GLIR R RGA  + G+VI FLDAHCE    WL PLLA I  DR  +  
Sbjct: 177 VPVKILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTEGWLEPLLARIKEDRTAVVC 236

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
           P+ID I  +T+E+ +    D  Y G F W + ++   +P+RE  +RK + + P ++PT A
Sbjct: 237 PIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMA 293

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLF++D+ +F E+G YDPG+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R   PY
Sbjct: 294 GGLFSIDKMYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPY 353

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +F        G +I  N +R+ E W DE  K +FY   P  M +D GD+S +
Sbjct: 354 SFPGGT----GQVINKNNRRLAEVWMDE-FKDFFYIISPGVMRVDYGDVSSR 400


>gi|198415713|ref|XP_002128877.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
           1 [Ciona intestinalis]
          Length = 573

 Score =  303 bits (777), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 158/351 (45%), Positives = 217/351 (61%), Gaps = 9/351 (2%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           GPGE GKA  +P+               N+  S  I+ +R++PD+RME CK   YP  LP
Sbjct: 70  GPGEMGKAVIIPKDKEKEKQEKFKINQFNLMASEMIALNRSLPDVRMEGCKSKKYPEKLP 129

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+++VFHNE +S+L+RTVHSII R+P+  LEEIILVDD S +  L   LE Y+++   
Sbjct: 130 TTSIVIVFHNEAWSTLLRTVHSIINRSPSHLLEEIILVDDASERDFLGAPLERYVRKLRT 189

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +R GLIR R RGA  S G+VI FLDAHCE    WL PLL+ I  DR  +  P
Sbjct: 190 LVRVVRMEKRTGLIRARLRGASVSTGQVITFLDAHCECTEGWLEPLLSEIAKDRTTVVCP 249

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAG 261
           +ID I  +T+EF  +   D  Y G F W + ++   +P+RE  +RK + + P +SPT AG
Sbjct: 250 IIDVISDETFEF--MVGSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRSPTMAG 306

Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
           GLF++D+++F ELG YD G+ +WGGEN E+SF+IW CGG++  V CS +GHV+R   PY 
Sbjct: 307 GLFSIDKSYFEELGTYDAGMDIWGGENLEISFRIWQCGGTLLIVTCSHVGHVFRKATPYT 366

Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           F        G +I  N +R+ E W D   K +FY   P  +  + GDISE+
Sbjct: 367 FPGGT----GQIINKNNRRLAEVWMDS-FKNFFYIITPGVLKQEYGDISER 412


>gi|226482458|emb|CAX73828.1| polypeptide GalNAc transferase 6 [Schistosoma japonicum]
          Length = 603

 Score =  303 bits (777), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 165/358 (46%), Positives = 221/358 (61%), Gaps = 13/358 (3%)

Query: 17  LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
           LE  + GPGE G    L    +     ++ E G ++  S  I  DR+I D+R   CK   
Sbjct: 70  LENSRVGPGENGMPVKLSTHEKKIAAKTINENGFSVYVSTKIKTDRSIKDIRHPNCKGKL 129

Query: 77  YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
           Y   LP ASVI+ F  E + +L+RTV S++ R P+  ++E+ILVDD SS+  L  +L+ +
Sbjct: 130 YSNKLPTASVIIPFFEEHWETLLRTVASVLNRAPSALIKEVILVDDGSSREYLKDRLDSH 189

Query: 137 IQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           I     +GKVR+I   ER+GLIR ++ GAKE+ GEV++FLD+HCE G+NWLPPLL PI +
Sbjct: 190 IISAYPDGKVRVIHLKERQGLIRAKTAGAKEATGEVLIFLDSHCEAGINWLPPLLDPIAA 249

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
           + + +  P ID ID   +E+R+    D   RG F+W + YK   LP R  +   +  EP+
Sbjct: 250 NYRTVVCPFIDVIDADNFEYRA---QDEGARGAFDWELYYKR--LP-RLPEDNHHPEEPF 303

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
            SP  AGGLFA+   +F ELGGYDPGL++WGGE +ELSFKIWMCGG +   PCSRIGH+Y
Sbjct: 304 DSPVMAGGLFAISAKWFWELGGYDPGLVIWGGEQYELSFKIWMCGGRMIDTPCSRIGHIY 363

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           R +   NF K      G  +  NYKRV E W DE +K Y Y R P    LD GD++EQ
Sbjct: 364 RKYST-NFPKSQ---LGDFVGRNYKRVAEVWMDE-YKEYLYKRRPSYRHLDPGDLTEQ 416


>gi|405950576|gb|EKC18555.1| Putative polypeptide N-acetylgalactosaminyltransferase 10
           [Crassostrea gigas]
          Length = 526

 Score =  303 bits (776), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 159/350 (45%), Positives = 217/350 (62%), Gaps = 14/350 (4%)

Query: 24  PGEGGKAYHL-PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           PGE G+A  L P+  +  GD      G N   S+ IS  R++ D+R  +CK   Y   L 
Sbjct: 13  PGEQGQALILSPDEEKKKGDL-YKVNGFNAYASDKISLHRSLKDIRHSDCKKKKYLNHLM 71

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
            ASVI+ FHNE +S+L+RT  S++ R+P   + E+ILVDD+SSK    Q L+DY++    
Sbjct: 72  NASVIVPFHNEHWSTLLRTAWSVLNRSPKHLIHEVILVDDYSSKEHCKQPLDDYVKEHFT 131

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            V+++R  +REGLIRTR  GA+ + G+V++FLD+HCE  +NWLPPLL PI  D K +  P
Sbjct: 132 NVKVVRAKKREGLIRTRLLGARAATGQVLIFLDSHCEANINWLPPLLEPIAEDYKTVVCP 191

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
            ID ID++ + +R+    D   RG F+W   YK   L E +    K+ +EP+KSP  AGG
Sbjct: 192 FIDVIDFENFAYRA---QDEGARGAFDWEFFYKRLPLLEEDL---KHPAEPFKSPVMAGG 245

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+   +F E+GGYDPGL +WGGE +ELSFK+W CGG +   PCSRIGH+YR F P+  
Sbjct: 246 LFAISAKWFWEMGGYDPGLDIWGGEQYELSFKLWQCGGMMVDAPCSRIGHIYRKFAPFPN 305

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
             + D      +  NY+RV E W DE +  Y Y R P    +D GD+SEQ
Sbjct: 306 PGVGD-----FVGRNYRRVAEVWMDE-YAEYLYKRRPHYRNIDPGDVSEQ 349


>gi|149639572|ref|XP_001511824.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
           [Ornithorhynchus anatinus]
          Length = 556

 Score =  303 bits (776), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 161/369 (43%), Positives = 225/369 (60%), Gaps = 13/369 (3%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K  +L P L       +EGPGE GKA  + +  +            N+  S+ I+ +R
Sbjct: 36  DKKERSLLPALRAVISRSQEGPGEMGKAVLISKDDQEKMKELFKINQFNLMASDLIALNR 95

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R+E CK   YP +LP   V++VFHNE +S+L+RTV S+I R+P   L E+ILVDD
Sbjct: 96  SLPDVRLEGCKTKIYPDELPNTRVVIVFHNEAWSTLLRTVFSVINRSPRSLLSEVILVDD 155

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L   LE+Y++  +  V++IR  +R GLIR R RGA  SRG+VI FLDAHCE   
Sbjct: 156 ASERDFLKTSLENYVKNLDVPVKIIRMEQRSGLIRARLRGAAASRGQVITFLDAHCECTF 215

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           E  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +E V CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387

Query: 362 MFLDMGDIS 370
           + +D GD+S
Sbjct: 388 VKVDYGDVS 396


>gi|432098984|gb|ELK28470.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Myotis davidii]
          Length = 501

 Score =  302 bits (774), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 156/348 (44%), Positives = 219/348 (62%), Gaps = 10/348 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E C
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S +  L + 
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 227 KQDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +G
Sbjct: 284 LPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVG 343

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
           HV+R   PY F        G +I  N +R+ E W DE  K +FY   P
Sbjct: 344 HVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISP 386


>gi|71896287|ref|NP_001025547.1| polypeptide N-acetylgalactosaminyltransferase 1 [Xenopus (Silurana)
           tropicalis]
 gi|60649677|gb|AAH90583.1| galnt1 protein [Xenopus (Silurana) tropicalis]
          Length = 452

 Score =  302 bits (774), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 154/352 (43%), Positives = 219/352 (62%), Gaps = 9/352 (2%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           EGPGE GK   +P+  +            N+  S  I+ +R++PD+R+E CK   YP  L
Sbjct: 56  EGPGEMGKPVVIPKEEQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDSL 115

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SV++VFHNE +++L+RTVHS+I R+P   L+EIILVDD S +  L + LE Y+++  
Sbjct: 116 PTTSVVIVFHNEAWTTLLRTVHSVINRSPRHLLQEIILVDDASEREFLKRPLETYVKKLT 175

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
             V ++R  +R GLIR R RGA  S+G+VI FLDAHCE  + WL PLLA I  DR+ +  
Sbjct: 176 VPVHVLRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTVGWLEPLLARIKHDRRTVVC 235

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
           P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +R+ + + P ++PT A
Sbjct: 236 PIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRRGDRTLPVRTPTMA 292

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +GHV+R   PY
Sbjct: 293 GGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKATPY 352

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            F        G +I  N +R+ E W DE  K +FY   P    +D GDIS +
Sbjct: 353 TFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISTR 399


>gi|260789712|ref|XP_002589889.1| hypothetical protein BRAFLDRAFT_81982 [Branchiostoma floridae]
 gi|229275074|gb|EEN45900.1| hypothetical protein BRAFLDRAFT_81982 [Branchiostoma floridae]
          Length = 534

 Score =  302 bits (774), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 165/356 (46%), Positives = 220/356 (61%), Gaps = 18/356 (5%)

Query: 23  GPGEGGKAY-HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           GPGE G+ Y +  E  +      LG  G N   S+ IS +R +PD R + CK   YP  L
Sbjct: 14  GPGEYGRPYVYTEEDNKRKSFGYLGN-GFNAHVSDKISVERALPDTRDQPCKDRLYPSRL 72

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI+ FHNE +S+L+RTVH +I RTP   L E+ILVDDFSSK +  + L +Y+  F 
Sbjct: 73  PNVSVIIPFHNEHWSTLLRTVHGVIGRTPPHLLGEVILVDDFSSKENCGRPLNEYMATFP 132

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
            +VR++R  +REGLIR R RG + +RG V+VF+DAHCEV +NWLPPLL PI      +T+
Sbjct: 133 -QVRILRMKQREGLIRARLRGVEVARGNVLVFMDAHCEVNVNWLPPLLEPISVSMTTVTI 191

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
           P ID ID+ T+E++   +     RG+F+W + YK   +P  + + RK   + P+ +P   
Sbjct: 192 PTIDVIDHATFEYKE--QQGGPMRGVFDWQLNYKR--IPVLDGRGRKVRPTLPFSTPVMP 247

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GG+FA+D+ FF  LGGYD GL +WGGE FELSFKIW CGG ++ VPCSR+GHV+R F PY
Sbjct: 248 GGVFAIDKEFFHHLGGYDSGLEIWGGEQFELSFKIWQCGGVLQEVPCSRVGHVFRKFSPY 307

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR----EPLAMFLDMGDISEQ 372
                A       I  NY RV E W D+ +K Y+Y R           D+GD+S Q
Sbjct: 308 -----ATDNDVLQILKNYMRVAEVWMDD-YKQYYYKRMLRGPKNVTNFDLGDLSSQ 357


>gi|383863685|ref|XP_003707310.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like [Megachile
           rotundata]
          Length = 610

 Score =  302 bits (774), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 164/354 (46%), Positives = 220/354 (62%), Gaps = 16/354 (4%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GK   L  +  +  +      G N   S+ IS +R++PD+R  +CK   Y  +
Sbjct: 88  RSGTGEHGKPAFLSPSLDSLKEKLYQVNGFNAALSDEISMNRSVPDIRHPDCKKKKYLKN 147

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           L   SVI+ FHNE FS+LMRT  S+I R+PA  LEEIILVDD S+K +L ++L+DY+ + 
Sbjct: 148 LDAVSVIVSFHNEHFSTLMRTCWSVINRSPASLLEEIILVDDASTKVELKKELDDYVAQR 207

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             KV++IR  +R GLI+ R  GAK ++ +V+VFLD+H E  +NWLPPLL PI  + +   
Sbjct: 208 LPKVKIIRLPQRSGLIKGRLAGAKVAKAKVLVFLDSHSEANVNWLPPLLEPIAQNYRTCV 267

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTH 259
            P ID I Y+T+E+R+    D   RG F+W + YK    LPE      K+ + P+KSP  
Sbjct: 268 CPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPE----DLKHPTLPFKSPVM 320

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLFA+   FF ELGGYDP L +WGGE +ELSFKIW CGG +   PCSR+GH+YR F P
Sbjct: 321 AGGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGEMYDAPCSRVGHIYRKFPP 380

Query: 320 Y-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           + N G      KG  +  NYKRV E W DE +  Y Y R P    LD G++++Q
Sbjct: 381 FPNPG------KGDFLGKNYKRVAEVWMDE-YAEYIYRRRPHLRSLDPGNLTKQ 427


>gi|196001853|ref|XP_002110794.1| hypothetical protein TRIADDRAFT_23130 [Trichoplax adhaerens]
 gi|190586745|gb|EDV26798.1| hypothetical protein TRIADDRAFT_23130 [Trichoplax adhaerens]
          Length = 536

 Score =  302 bits (774), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 157/361 (43%), Positives = 221/361 (61%), Gaps = 13/361 (3%)

Query: 16  PLEPYKEGP---GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           P  P+   P   GE G++  +P+  +A  D     +G N   S+H+S  RT+PDLR   C
Sbjct: 22  PTLPHNFNPNAIGENGESVIVPDKAKAESDKLFKNHGFNQWASDHMSLHRTLPDLRPSLC 81

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   +P DLP+ SV++VFHNE  S+L+RTVHS++ R+    + +IILVDDFSS    D  
Sbjct: 82  KSQVFPKDLPQTSVVIVFHNEALSTLLRTVHSVLDRSAPDLIHQIILVDDFSSIKGHD-P 140

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           L+ YI     KV L+RN +REGLIR+R  G   +   ++ FLDAHCEV + WL PLL  +
Sbjct: 141 LKKYIADLK-KVILVRNPKREGLIRSRIIGYSRATAPIVTFLDAHCEVTIGWLEPLLDRV 199

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK-YNS 251
           + +R ++  P ID ID +T+++R+    D   RG+F W M ++    P +E K+R  YN 
Sbjct: 200 HQNRSVVVCPEIDVIDDKTFQYRAGSSGD--IRGVFNWDMKFRWRLTPSQEQKRRNNYNV 257

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
              +SPT AGGLFA+DR +F E+G YD  + +WGGEN ELSF+IW CGG +E +PCS +G
Sbjct: 258 LFARSPTMAGGLFAIDRQYFQEIGLYDSQMDIWGGENLELSFRIWQCGGQLEIMPCSHVG 317

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+R+ +PY F K A    G  I  N  R  E W D  +K + Y R+P    +  G+I+E
Sbjct: 318 HVFRNVIPYKFPKDA----GLTINKNSVRTAEVWMD-GYKEFVYQRQPYMRNIHFGNITE 372

Query: 372 Q 372
           +
Sbjct: 373 R 373


>gi|260789758|ref|XP_002589912.1| hypothetical protein BRAFLDRAFT_156854 [Branchiostoma floridae]
 gi|229275097|gb|EEN45923.1| hypothetical protein BRAFLDRAFT_156854 [Branchiostoma floridae]
          Length = 292

 Score =  302 bits (773), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 143/284 (50%), Positives = 195/284 (68%), Gaps = 10/284 (3%)

Query: 47  EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
           E G N++ SN IS DR IPD+R   C    Y  DLP  S+++ FHNEG+++L+RTVHS++
Sbjct: 13  ECGFNIKASNKISLDRAIPDIRHPNCASKKYVRDLPDVSLVIPFHNEGWTTLLRTVHSVL 72

Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
            R+P Q + EIILVDDFS ++ L + LEDY+ + + KVR++R  +REGLIRTR  GA+ +
Sbjct: 73  NRSPEQLIHEIILVDDFSDRSHLGKDLEDYVAKLSPKVRVVRTKQREGLIRTRLLGAQVA 132

Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
           +G+V++FLD+HCE  +NWLPPLL PI  ++K +  P ID ID   + + +  +     RG
Sbjct: 133 KGQVLIFLDSHCEANVNWLPPLLEPIALNKKTIVCPNIDVIDKDDFHYET--QAGDAMRG 190

Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
            F+W M YK   +P+    K    S+P++SP  AGGLFA+DR +F ELGGYDPGL +WGG
Sbjct: 191 AFDWEMYYKRIPIPDE--IKNPDPSDPFESPVMAGGLFAVDREYFEELGGYDPGLDIWGG 248

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY------NFGK 324
           E +ELSFK+W CGG +   PCSR+GHVYR F+PY      N GK
Sbjct: 249 EQYELSFKVWQCGGRMVDAPCSRVGHVYRKFVPYKVPAGVNLGK 292


>gi|442756891|gb|JAA70604.1| Putative polypeptide n-acetylgalactosaminyltransferase [Ixodes
           ricinus]
          Length = 582

 Score =  302 bits (773), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 152/350 (43%), Positives = 219/350 (62%), Gaps = 9/350 (2%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           PGE G+   + +   A           N+  S+ I+ +R++PD+R+E+CK   YP  LP 
Sbjct: 81  PGENGRGVEIGKDEEALKKEKFKLNQFNLLASDRIALNRSLPDVRLEKCKDKVYPEKLPT 140

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
            SV +VFHNE +S+L+RTVHS+I+ +P   LEEIILVDD S +  L ++LEDY+ + +  
Sbjct: 141 TSVDIVFHNEAWSTLLRTVHSVIRTSPRALLEEIILVDDASEREHLGKQLEDYVVKLDTP 200

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
           V+++R  +R GLIR R  GA   +G+VI FLDAHCE   NWL PLLA I  DR  +  PV
Sbjct: 201 VKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTQNWLEPLLARIAEDRTRVVCPV 260

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
           ID I  +T+E+ S  +      G F W + ++   +P+RE  +R  + + P ++PT AGG
Sbjct: 261 IDVISDETFEYISASDLTW---GGFNWKLNFRGYRVPQRELDRRGGDRTLPVRTPTMAGG 317

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFA+D+ +F+ELG YD G+ +WGGEN ELSF+IWMCGG +E VPCS +GHV+R   PY F
Sbjct: 318 LFAIDKDYFVELGKYDEGMDIWGGENLELSFRIWMCGGELEIVPCSHVGHVFRKSTPYTF 377

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                ++    + +N  R+ E W DE  K +++   P A  +D GD+S +
Sbjct: 378 PGGTSKI----VNHNNARLAEVWLDE-WKEFYFAINPAAKNVDKGDLSHR 422


>gi|226482456|emb|CAX73827.1| polypeptide GalNAc transferase 6 [Schistosoma japonicum]
          Length = 603

 Score =  302 bits (773), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 164/358 (45%), Positives = 221/358 (61%), Gaps = 13/358 (3%)

Query: 17  LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
           LE  + GPGE G    L    +     ++ E G ++  S  I  DR+I D+R   CK   
Sbjct: 70  LENSRVGPGENGMPVKLSTHEKKIAAKTINENGFSVYVSTKIKTDRSIKDIRHPNCKGKL 129

Query: 77  YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
           Y   LP ASVI+ F  E + +L+RTV S++ R P+  ++E+ILVDD SS+  L  +L+ +
Sbjct: 130 YSNKLPTASVIIPFFEEHWETLLRTVASVLNRAPSALIKEVILVDDGSSREYLKDRLDSH 189

Query: 137 IQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           I     +GKVR+I   ER+GLIR ++ GAKE+ GEV++FLD+HCE G+NWLPPLL PI +
Sbjct: 190 IISAYPDGKVRVIHLKERQGLIRAKTAGAKEATGEVLIFLDSHCEAGINWLPPLLDPIAA 249

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
           + + +  P ID ID   +E+R+    D   RG F+W + YK   LP R  +   +  +P+
Sbjct: 250 NYRTVVCPFIDVIDADNFEYRA---QDEGARGAFDWELYYKR--LP-RLPEDSHHPEKPF 303

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
            SP  AGGLFA+   +F ELGGYDPGL++WGGE +ELSFKIWMCGG +   PCSRIGH+Y
Sbjct: 304 DSPVMAGGLFAISAKWFWELGGYDPGLVIWGGEQYELSFKIWMCGGRMIDTPCSRIGHIY 363

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           R +   NF K      G  +  NYKRV E W DE +K Y Y R P    LD GD++EQ
Sbjct: 364 RKYST-NFPKSQ---LGDFVGRNYKRVAEVWMDE-YKEYLYKRRPSYRHLDPGDLTEQ 416


>gi|440911421|gb|ELR61095.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Bos grunniens
           mutus]
          Length = 564

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 161/366 (43%), Positives = 225/366 (61%), Gaps = 15/366 (4%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL----- 67
           LEP  +P+ EGPGE GK   +P+  +            N+  S  I+ +R++PD+     
Sbjct: 48  LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVSLPDV 106

Query: 68  RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           R+E CK   YP +LP  SV++VFHNE +S+L+RTVHSII  +P   LEEI+LVDD S + 
Sbjct: 107 RLEGCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSIINHSPRHMLEEIVLVDDASERD 166

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L + LE Y+++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL P
Sbjct: 167 FLKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEP 226

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 247
           LLA I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +R
Sbjct: 227 LLARIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRR 283

Query: 248 KYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           K + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V 
Sbjct: 284 KGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVT 343

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDM 366
           CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D 
Sbjct: 344 CSHVGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDY 398

Query: 367 GDISEQ 372
           GDIS +
Sbjct: 399 GDISSR 404


>gi|148223895|ref|NP_001086128.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13)
           [Xenopus laevis]
 gi|49258003|gb|AAH74234.1| MGC83963 protein [Xenopus laevis]
          Length = 556

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 153/352 (43%), Positives = 222/352 (63%), Gaps = 9/352 (2%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           EGPGE GKA  +P+  +            N+  S+ I+ +R++PD+R+E CK   YP +L
Sbjct: 55  EGPGELGKAVIIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDIRLEGCKTKVYPDEL 114

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  S+++VFHNE +S+L+RTVHS+I R+P + + EIILVDD S +  L   LE+Y++   
Sbjct: 115 PNTSIVIVFHNEAWSTLLRTVHSVINRSPHRLISEIILVDDASERDFLKTPLENYVKHLE 174

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
             V+++R  +R GLIR R  GA  ++G++I FLDAHCE    WL PLLA I  DRK +  
Sbjct: 175 VAVKILRMEQRSGLIRARLSGANVAKGKIITFLDAHCECTFGWLEPLLARIKEDRKTVVC 234

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
           P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + + P ++PT A
Sbjct: 235 PIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMA 291

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLF++D+ +F ELG YD G+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R   PY
Sbjct: 292 GGLFSIDKKYFEELGTYDSGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPY 351

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            F        G +I  N +R+ E W D+  K +FY   P  + +D GD+SE+
Sbjct: 352 TFPGGT----GHVINKNNRRLAEVWMDD-FKDFFYIISPGVVKVDYGDVSER 398


>gi|291220820|ref|XP_002730422.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Saccoglossus kowalevskii]
          Length = 1082

 Score =  301 bits (771), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 154/351 (43%), Positives = 215/351 (61%), Gaps = 10/351 (2%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           GPGE G+   L    +   D +   +G N+  S+ IS +R+I D++   C    Y  DLP
Sbjct: 583 GPGENGQPVLLYGEQKKEADETFDVHGFNVVVSDMISLERSITDVKHSLCDTVRYNKDLP 642

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFN 141
            ASVI+ FHNE +S+L+RT++S+I R+  + L+EIILVDD+S + +L   L++YIQ  FN
Sbjct: 643 TASVIISFHNEAWSTLLRTIYSVINRSKIKLLQEIILVDDYSDRDELKVALDEYIQSNFN 702

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
            KV+++  TEREGLIR R  GA ++ G+++VFLD+HCEV  NWL PL+  IY D   +  
Sbjct: 703 NKVKILHTTEREGLIRARLIGASKATGKILVFLDSHCEVNYNWLEPLIERIYRDSSTIAC 762

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
           PVID ID  ++     Y      RG   WG+ +K   +P  E  +R    EP KSP  AG
Sbjct: 763 PVIDIIDPDSF----AYSASPLVRGGVNWGLQFKWKNVPPVELLRRNSEIEPIKSPIMAG 818

Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
           GLFA+DR +F  +G YD  + +WGGE+ ELSF+IW CGG++E VPCSR+GH++R   PY 
Sbjct: 819 GLFAVDRNYFEHIGSYDKDMQIWGGEHLELSFRIWQCGGTLEIVPCSRVGHIFRKSHPYT 878

Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                + V     T+N  RV E W D+ +K +FY   P A     GD+SE+
Sbjct: 879 IPGGMENV----FTHNSIRVAEVWMDD-YKRFFYATRPDAQGKTYGDLSER 924


>gi|326674972|ref|XP_687472.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           2 [Danio rerio]
          Length = 557

 Score =  301 bits (770), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 153/352 (43%), Positives = 220/352 (62%), Gaps = 9/352 (2%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           +GPGE GK   + +  +            N+  S  I+ +R++PD+R+E CK   YP DL
Sbjct: 54  DGPGEMGKPVVIAKDQQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDDL 113

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P+ SV++VFHNE +++L+RTVHS+I R+P   LEEI+LVDD S +  L ++LE Y+++  
Sbjct: 114 PRTSVVIVFHNEAWTTLLRTVHSVIDRSPRHLLEEIVLVDDASERDFLKRQLEHYVRKLE 173

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
             VR++R  +R GLIR R +GA  S G+VI FLDAHCE    WL PLL+ I  D+K +  
Sbjct: 174 VPVRVVRMEQRSGLIRARLKGASISTGQVITFLDAHCECTTGWLEPLLSRIKLDKKTVVC 233

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
           P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + + P ++PT A
Sbjct: 234 PIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMA 290

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +GHV+R   PY
Sbjct: 291 GGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKATPY 350

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            F        G +I  N +R+ E W DE  K +FY   P    +D GDIS +
Sbjct: 351 TFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISTR 397


>gi|348585735|ref|XP_003478626.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Cavia porcellus]
          Length = 568

 Score =  301 bits (770), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 160/358 (44%), Positives = 221/358 (61%), Gaps = 13/358 (3%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K  +L P L       +EGPGE GKA  +P+  +            N+  S+ I+ +R
Sbjct: 36  DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R+E CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD
Sbjct: 96  SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L   LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           E  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGS 332

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
           +E V CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISP 385


>gi|126341064|ref|XP_001364304.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
           [Monodelphis domestica]
          Length = 609

 Score =  300 bits (769), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 164/378 (43%), Positives = 223/378 (58%), Gaps = 18/378 (4%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAY-------RAAGDASLGEYGMNMETS 55
           + +   K+  +EP LE   E  G+       PE         +   D    ++  N+  S
Sbjct: 66  LLEPQSKVNKIEPILENNGEDAGKEEDTELSPEMGMIFNERDQELRDLGYQKHAFNLLIS 125

Query: 56  NHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
           N + + R +PD R  ECK   YP DLP AS+++ F+NE FS+L+RTVHS+I RTPA  L 
Sbjct: 126 NRLGYHRDVPDTRNAECKEKSYPSDLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLH 185

Query: 116 EIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
           EIILVDD S   DL  +L+ Y+Q++  GK++++RN +REGLIR R  GA  + GEV+VFL
Sbjct: 186 EIILVDDNSEFDDLKGELDKYVQKYLPGKIQVVRNEKREGLIRGRMIGAAHATGEVLVFL 245

Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 234
           D+HCEV   WL PLL PI  DR+ +  PVID I   T     +Y      RG F WG+ +
Sbjct: 246 DSHCEVNKMWLQPLLVPIQEDRRTVVCPVIDIISADTL----MYSSSPIVRGGFNWGLHF 301

Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
           K + +P  E +  +    P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+
Sbjct: 302 KWDLVPFSELEGPEGAIAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFR 361

Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
           IWMCGG +  +PCSR+GH++R   PY   +  D      +TYN  R+   W DE  + YF
Sbjct: 362 IWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTYNSLRLAHVWLDEYKEQYF 416

Query: 355 YTREPLAMFLDMGDISEQ 372
             R  L +    G+ISE+
Sbjct: 417 SLRPELKL-KSYGNISER 433


>gi|380024969|ref|XP_003696257.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like isoform 2 [Apis
           florea]
          Length = 598

 Score =  300 bits (768), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 163/356 (45%), Positives = 214/356 (60%), Gaps = 14/356 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK   L  +  A  +      G N   S+ IS +R++PD+R   CK   Y
Sbjct: 73  EARRIGKGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISVNRSVPDIRHPGCKDKKY 132

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             +L   SVI+ FHNE FS+LMRT  S++ R+PA  L+EIILVDD S+K  L + L+DY+
Sbjct: 133 LRNLDSVSVIVSFHNEHFSTLMRTCWSVVNRSPASLLQEIILVDDASTKVGLKKTLDDYV 192

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
                KV+++R  +R GLI+ R  GAK ++ +V+VFLD+H E  +NWLPPLL PI  D K
Sbjct: 193 ATHLPKVKIVRLKQRSGLIKGRLAGAKIAKAKVLVFLDSHSEANINWLPPLLEPIAQDYK 252

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
               P ID I Y+T+E+R+    D   RG F+W + YK   L   + +     +EP+KSP
Sbjct: 253 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN---PTEPFKSP 306

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+   FF ELGGYDP L +WGGE +ELSFKIW CGG +   PCSR+GH+YR F
Sbjct: 307 IMAGGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKF 366

Query: 318 MPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            P+ N G      KG  +  NYKRV E W DE +  Y Y R P    LD G++  Q
Sbjct: 367 PPFPNPG------KGDFLGKNYKRVAEVWMDE-YAEYIYRRRPHLRSLDPGNLKSQ 415


>gi|47226346|emb|CAG09314.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 632

 Score =  300 bits (768), Expect = 8e-79,   Method: Compositional matrix adjust.
 Identities = 162/394 (41%), Positives = 233/394 (59%), Gaps = 36/394 (9%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K G+L P L        EGPGE GKA  +P+  +            N+  S+ I+ +R
Sbjct: 36  DRKDGSLLPALRAVISRRHEGPGEMGKAVVIPKDEQEKMKELFKINQFNLMASDMIALNR 95

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R++ CK   YP D+P  SV++VFHNE +S+L+RTVHS+I R+P   L EI+LVDD
Sbjct: 96  SLPDVRLDGCKTKVYPDDVPNTSVVIVFHNEAWSTLLRTVHSVINRSPRHLLVEIVLVDD 155

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L +KLE+Y++     VR++R  +R GLIR R RGA  ++G+VI FLDAHCE  +
Sbjct: 156 ASERDFLKKKLENYVRTLEVPVRILRMEQRSGLIRARLRGAAATKGQVITFLDAHCECTV 215

Query: 183 NWLPPLLAPIYSD-----------------------RKIMTVPVIDGIDYQTWEFRSVYE 219
            WL PLLA I  D                       R  +  P+ID I  +T+E+ +   
Sbjct: 216 GWLEPLLARIKEDRWDCNTALCVCVFERPSFRCFLFRTAVVCPIIDVISDETFEYMA--G 273

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYD 278
            D  Y G F W + ++   +P+RE  +RK + + P ++PT AGGLF++D+ +F E+G YD
Sbjct: 274 SDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEEIGSYD 332

Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNY 338
           PG+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R   PY+F        G +I  N 
Sbjct: 333 PGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPYSFPGGT----GQVINKNN 388

Query: 339 KRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +R+ E W D+  K +FY   P  M +D GD+S +
Sbjct: 389 RRLAEVWMDD-FKDFFYIISPGVMRVDYGDVSSR 421


>gi|380024967|ref|XP_003696256.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like isoform 1 [Apis
           florea]
          Length = 611

 Score =  300 bits (768), Expect = 8e-79,   Method: Compositional matrix adjust.
 Identities = 163/356 (45%), Positives = 214/356 (60%), Gaps = 14/356 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK   L  +  A  +      G N   S+ IS +R++PD+R   CK   Y
Sbjct: 86  EARRIGKGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISVNRSVPDIRHPGCKDKKY 145

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             +L   SVI+ FHNE FS+LMRT  S++ R+PA  L+EIILVDD S+K  L + L+DY+
Sbjct: 146 LRNLDSVSVIVSFHNEHFSTLMRTCWSVVNRSPASLLQEIILVDDASTKVGLKKTLDDYV 205

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
                KV+++R  +R GLI+ R  GAK ++ +V+VFLD+H E  +NWLPPLL PI  D K
Sbjct: 206 ATHLPKVKIVRLKQRSGLIKGRLAGAKIAKAKVLVFLDSHSEANINWLPPLLEPIAQDYK 265

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
               P ID I Y+T+E+R+    D   RG F+W + YK   L   + +     +EP+KSP
Sbjct: 266 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN---PTEPFKSP 319

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+   FF ELGGYDP L +WGGE +ELSFKIW CGG +   PCSR+GH+YR F
Sbjct: 320 IMAGGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKF 379

Query: 318 MPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            P+ N G      KG  +  NYKRV E W DE +  Y Y R P    LD G++  Q
Sbjct: 380 PPFPNPG------KGDFLGKNYKRVAEVWMDE-YAEYIYRRRPHLRSLDPGNLKSQ 428


>gi|380024971|ref|XP_003696258.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like isoform 3 [Apis
           florea]
          Length = 590

 Score =  300 bits (767), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 163/356 (45%), Positives = 214/356 (60%), Gaps = 14/356 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK   L  +  A  +      G N   S+ IS +R++PD+R   CK   Y
Sbjct: 65  EARRIGKGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISVNRSVPDIRHPGCKDKKY 124

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             +L   SVI+ FHNE FS+LMRT  S++ R+PA  L+EIILVDD S+K  L + L+DY+
Sbjct: 125 LRNLDSVSVIVSFHNEHFSTLMRTCWSVVNRSPASLLQEIILVDDASTKVGLKKTLDDYV 184

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
                KV+++R  +R GLI+ R  GAK ++ +V+VFLD+H E  +NWLPPLL PI  D K
Sbjct: 185 ATHLPKVKIVRLKQRSGLIKGRLAGAKIAKAKVLVFLDSHSEANINWLPPLLEPIAQDYK 244

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
               P ID I Y+T+E+R+    D   RG F+W + YK   L   + +     +EP+KSP
Sbjct: 245 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN---PTEPFKSP 298

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+   FF ELGGYDP L +WGGE +ELSFKIW CGG +   PCSR+GH+YR F
Sbjct: 299 IMAGGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKF 358

Query: 318 MPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            P+ N G      KG  +  NYKRV E W DE +  Y Y R P    LD G++  Q
Sbjct: 359 PPFPNPG------KGDFLGKNYKRVAEVWMDE-YAEYIYRRRPHLRSLDPGNLKSQ 407


>gi|350644736|emb|CCD60531.1| n-acetylgalactosaminyltransferase,putative [Schistosoma mansoni]
          Length = 508

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 162/358 (45%), Positives = 221/358 (61%), Gaps = 13/358 (3%)

Query: 17  LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
           LE  + GPGE G  + L    +   + ++ E G ++  S  I  DR+I D+R   CK   
Sbjct: 70  LESLRVGPGENGMPFELSYHDKELSNKTINENGFSVYVSGKIKIDRSIKDIRHPRCKGKL 129

Query: 77  YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
           Y  +LP  SVI+ F  E + +L+RTV S++ R P+  ++E+ILVDD SS+  L  +L+ +
Sbjct: 130 YSSNLPTVSVIIPFFEEHWETLLRTVSSVLNRAPSGLIKEVILVDDGSSRKYLKDRLDSH 189

Query: 137 IQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           +      G VR+I    R GLIR ++ GA+E+ GEV++FLD+HCE G+NWLPPLL PI +
Sbjct: 190 LATAYPGGIVRVIHLEHRGGLIRAKTAGAREATGEVLIFLDSHCEAGINWLPPLLDPIAA 249

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
           + K +  P ID ID  T+E+R+    D   RG F+W + YK   LP R  + R +  EP+
Sbjct: 250 NYKTVVCPFIDVIDADTFEYRA---QDEGARGAFDWELYYKR--LP-RLPEDRYHPEEPF 303

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
            SP  AGGLFA+   +F ELGGYDPGL++WGGE +ELSFKIWMCGG +   PCSRIGH+Y
Sbjct: 304 DSPVMAGGLFAISAKWFWELGGYDPGLVIWGGEQYELSFKIWMCGGRMVDAPCSRIGHIY 363

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           R +   NF K      G  +  NYKRV E W DE +K Y Y R P    LD GD+++Q
Sbjct: 364 RKYST-NFPKAE---FGDFVGRNYKRVAEVWMDE-YKEYLYKRRPRYRDLDAGDLTKQ 416


>gi|328781649|ref|XP_003250010.1| PREDICTED: n-acetylgalactosaminyltransferase 6-like isoform 2 [Apis
           mellifera]
          Length = 598

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 162/356 (45%), Positives = 214/356 (60%), Gaps = 14/356 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK   L  +  A  +      G N   S+ IS +R++PD+R   CK   Y
Sbjct: 73  EAKRIGKGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISVNRSVPDIRHPGCKDKKY 132

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             +L   SVI+ FHNE FS+L+RT  S++ R+PA  L+EIILVDD S+K  L + L+DY+
Sbjct: 133 LRNLDSVSVIVSFHNEHFSTLIRTCWSVVNRSPASLLQEIILVDDASTKVGLKKTLDDYV 192

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
                KV+++R  +R GLI+ R  GAK ++ +V+VFLD+H E  +NWLPPLL PI  D K
Sbjct: 193 ATHLPKVKIVRLKQRSGLIKGRLAGAKVAKAKVLVFLDSHSEANINWLPPLLEPIAQDYK 252

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
               P ID I Y+T+E+R+    D   RG F+W + YK   L   + +     +EP+KSP
Sbjct: 253 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN---PTEPFKSP 306

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+   FF ELGGYDP L +WGGE +ELSFKIW CGG +   PCSR+GH+YR F
Sbjct: 307 VMAGGLFAISSKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKF 366

Query: 318 MPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            P+ N G      KG  +  NYKRV E W DE +  Y Y R P    LD G++  Q
Sbjct: 367 PPFPNPG------KGDFLGKNYKRVAEVWMDE-YAEYIYRRRPHLRSLDPGNLKSQ 415


>gi|328781647|ref|XP_003250009.1| PREDICTED: n-acetylgalactosaminyltransferase 6-like isoform 1 [Apis
           mellifera]
          Length = 611

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 162/356 (45%), Positives = 214/356 (60%), Gaps = 14/356 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK   L  +  A  +      G N   S+ IS +R++PD+R   CK   Y
Sbjct: 86  EAKRIGKGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISVNRSVPDIRHPGCKDKKY 145

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             +L   SVI+ FHNE FS+L+RT  S++ R+PA  L+EIILVDD S+K  L + L+DY+
Sbjct: 146 LRNLDSVSVIVSFHNEHFSTLIRTCWSVVNRSPASLLQEIILVDDASTKVGLKKTLDDYV 205

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
                KV+++R  +R GLI+ R  GAK ++ +V+VFLD+H E  +NWLPPLL PI  D K
Sbjct: 206 ATHLPKVKIVRLKQRSGLIKGRLAGAKVAKAKVLVFLDSHSEANINWLPPLLEPIAQDYK 265

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
               P ID I Y+T+E+R+    D   RG F+W + YK   L   + +     +EP+KSP
Sbjct: 266 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN---PTEPFKSP 319

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+   FF ELGGYDP L +WGGE +ELSFKIW CGG +   PCSR+GH+YR F
Sbjct: 320 VMAGGLFAISSKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRKF 379

Query: 318 MPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            P+ N G      KG  +  NYKRV E W DE +  Y Y R P    LD G++  Q
Sbjct: 380 PPFPNPG------KGDFLGKNYKRVAEVWMDE-YAEYIYRRRPHLRSLDPGNLKSQ 428


>gi|321455342|gb|EFX66478.1| hypothetical protein DAPPUDRAFT_302681 [Daphnia pulex]
          Length = 613

 Score =  299 bits (765), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 161/355 (45%), Positives = 219/355 (61%), Gaps = 13/355 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + GPGE G A++L        D+     G N   S+ I+ +RT+ D+R  +CK  +Y
Sbjct: 89  ESKQTGPGEQGLAFYLSPEDEKIKDSLYKVNGFNALVSDRINLNRTLKDIRHPDCKAQNY 148

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             DLP AS+++ FHNE FS L+RT +S + R PA  LE +ILVDD S+K    + L+DY+
Sbjct: 149 LEDLPTASIVVPFHNEHFSVLLRTAYSALNRAPANLLE-VILVDDASTKEHSKKPLDDYV 207

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            +   +VR+I   ER GLIR R  GA+ ++G+VI+FLD+H E  +NWLPPLL PI  D +
Sbjct: 208 TQHMPRVRVIHLAERSGLIRARMAGARRAKGDVIIFLDSHSEANVNWLPPLLDPIAEDYR 267

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P ID ID++T+ +R+    D   RG F+W   YK   L   + K   + + P+KSP
Sbjct: 268 TVVCPFIDVIDFETFAYRA---QDEGARGAFDWEFFYKRLPLLPDDLK---HPARPFKSP 321

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+ + FF ELGGYD GL +WGGE +ELSFKIW CGG +   PCSRIGH+YR +
Sbjct: 322 VMAGGLFAISKKFFFELGGYDEGLEIWGGEQYELSFKIWQCGGQMFDAPCSRIGHIYRKY 381

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            P+      +  KG  +  NYKRV E W DE +K Y Y R P    L++GD+S Q
Sbjct: 382 APF-----PNSAKGDFVGRNYKRVAEVWMDE-YKEYLYKRRPQYRNLEVGDLSSQ 430


>gi|443703000|gb|ELU00789.1| hypothetical protein CAPTEDRAFT_190622 [Capitella teleta]
          Length = 507

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 149/352 (42%), Positives = 216/352 (61%), Gaps = 15/352 (4%)

Query: 28  GKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVI 87
           G+   L    +   D    +   N+  S+ I+ +R++ D R  +C    YP  +P ASV+
Sbjct: 2   GRRVELSAEKQEEADKLFKKEAFNIVASDMIALNRSVSDNRDPQCSRVSYPKVMPNASVV 61

Query: 88  LVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF--NGKVR 145
           ++FHNE +S L+RTVHS++ R+P +YL E+IL+DDFS +A L +KL+ YI+    +G V+
Sbjct: 62  IIFHNEAWSPLLRTVHSVVNRSPPEYLHEVILLDDFSDRAGLGEKLDGYIKDTWPDGIVK 121

Query: 146 LIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVID 205
           ++R  ER+GLIR R  GAK + GEV+VFLD+HCE  + WL PL+A I   R  +  P+ID
Sbjct: 122 VVRAPERQGLIRARVLGAKAATGEVLVFLDSHCECNVQWLEPLVARIKESRSALLCPMID 181

Query: 206 GIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFA 265
            ID +   +  +        G F W + +    LP+RE K+RK + E  +SPT AGGLFA
Sbjct: 182 VIDAKAMSYNGIGAGS---VGGFWWSLHFSWRPLPQRERKRRKSSVETIRSPTMAGGLFA 238

Query: 266 MDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKL 325
            DR +F E+GGYDPG+ VWGGEN E+SF++WMCGG++E+VPCSR+GH++RS  PY F   
Sbjct: 239 ADRKYFFEIGGYDPGMDVWGGENLEISFRVWMCGGTLEFVPCSRVGHIFRSSHPYTFPGN 298

Query: 326 ADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMF-----LDMGDISEQ 372
            D         N KR+ E W D   + +++ R  L +       D GD S++
Sbjct: 299 KD-----THGLNSKRLAEVWMDGYKRLFYHHRRDLLVINPQFNADAGDFSDR 345


>gi|147900163|ref|NP_001083410.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Xenopus
           laevis]
 gi|38014522|gb|AAH60419.1| MGC68664 protein [Xenopus laevis]
          Length = 559

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 154/357 (43%), Positives = 223/357 (62%), Gaps = 10/357 (2%)

Query: 17  LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
           L+P +EGPGE GK   + +  +            N+  S  I+ +R++PD+R+E CK   
Sbjct: 52  LKP-QEGPGEMGKPVVILKEEQERMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKV 110

Query: 77  YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
           YP +LP  SV++VFHNE +++L+RTVHS+I R+P   L EI+LVDD S +  L + LE Y
Sbjct: 111 YPDNLPTTSVVIVFHNEAWTTLLRTVHSVINRSPRHLLREIVLVDDASERDFLKRALETY 170

Query: 137 IQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 196
           +++ +  V +IR  +R GLIR R RGA  S+G+VI FLDAHCE  + WL PLLA I  DR
Sbjct: 171 VKKLSVPVHVIRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTVGWLEPLLARINHDR 230

Query: 197 KIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYK 255
           + +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +R+ + + P +
Sbjct: 231 RTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRRGDRTLPVR 287

Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
           +PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +GHV+R
Sbjct: 288 TPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFR 347

Query: 316 SFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
              PY F        G +I  N +R+ E W DE  K +FY   P    +D GDI+ +
Sbjct: 348 KATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDIATR 399


>gi|443720685|gb|ELU10336.1| hypothetical protein CAPTEDRAFT_176696 [Capitella teleta]
          Length = 587

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 152/325 (46%), Positives = 212/325 (65%), Gaps = 10/325 (3%)

Query: 48  YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
           Y  N   S+ +SF R IPD+R + C+  +YP +LP ASV++ F+NE +S L+RTVHSII 
Sbjct: 87  YAFNELISDRLSFHRPIPDVRHQLCQSEEYPAELPSASVVICFYNEAWSVLLRTVHSIID 146

Query: 108 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 167
           RTP+  L EIILVDDFS    L ++L+ Y+     + +L+RNT REGLIR R  G++ + 
Sbjct: 147 RTPSALLHEIILVDDFSDLDHLAEQLDAYVSEHLPQTKLVRNTRREGLIRARVIGSEHAT 206

Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
           GEV+VFLD+HCEV + W+ PLL+ I+ + K + VP+ID ID  T  FR  YE     RG 
Sbjct: 207 GEVLVFLDSHCEVNVEWIQPLLSHIHGNHKRVAVPIIDIIDQDT--FR--YESSPLVRGG 262

Query: 228 FEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
           F WG+ Y+ +++PE   +K++   +P K+PT AGGLFAM+R +F +LG YD G+ VWGGE
Sbjct: 263 FNWGLFYRWDQIPESLLRKQEDYVKPIKTPTMAGGLFAMNRKYFNDLGRYDTGMDVWGGE 322

Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
           N E+SF++W CGGS+  +PCSR+GH++R   PY        V    IT N  RV   W D
Sbjct: 323 NLEISFRVWQCGGSMHILPCSRVGHIFRKRRPY-----GSPVGVDTITKNSLRVAHVWMD 377

Query: 348 EKHKAYFYTREPLAMFLDMGDISEQ 372
           E  K +F  R+  A   + GD+S++
Sbjct: 378 EYIKYFFQVRKT-ADHAEYGDVSDR 401


>gi|427796213|gb|JAA63558.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Rhipicephalus pulchellus]
          Length = 621

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 153/348 (43%), Positives = 219/348 (62%), Gaps = 9/348 (2%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           PGE G+   +     A           N+  S+ I+ +R++PD+R+E+CK   YP  LP 
Sbjct: 120 PGERGRGVEIGPEEEALKKEKFKLNQFNLLASDRIALNRSLPDVRLEKCKDKVYPEKLPT 179

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
            SV++VFHNE +S+L+RTVHS+I+ +P   LEEIILVDD S +  L +KLEDY+ +    
Sbjct: 180 TSVVIVFHNEAWSTLLRTVHSVIRTSPRALLEEIILVDDASEREHLGKKLEDYVVKLEVP 239

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
           V+++R  +R GLIR R  GA   +G+VI FLDAHCE   +WL PLLA I  DR  +  PV
Sbjct: 240 VKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTQHWLEPLLARIAEDRTRVVCPV 299

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
           ID I  +T+E+ S    D  + G F W + ++   +P+RE ++R  + + P ++PT AGG
Sbjct: 300 IDVISDETFEYISA--SDMTWGG-FNWKLNFRWYRVPQREVERRGGDRTLPIRTPTMAGG 356

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LF++D+ +F ELG YD G+ +WGGEN ELSF+IWMCGG +E VPCS +GHV+R   PY+F
Sbjct: 357 LFSIDKDYFNELGKYDEGMDIWGGENLELSFRIWMCGGELEIVPCSHVGHVFRKSTPYSF 416

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
                R+    + +N  R+ E W DE  K +++   P A  +D GD+S
Sbjct: 417 PGGTSRI----VNHNNARLAEVWLDE-WKDFYFAINPAAKNVDKGDLS 459


>gi|256081587|ref|XP_002577050.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
          Length = 469

 Score =  298 bits (764), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 162/358 (45%), Positives = 221/358 (61%), Gaps = 13/358 (3%)

Query: 17  LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
           LE  + GPGE G  + L    +   + ++ E G ++  S  I  DR+I D+R   CK   
Sbjct: 31  LESLRVGPGENGMPFELSYHDKELSNKTVNENGFSVYVSGKIKIDRSIKDIRHPRCKGKL 90

Query: 77  YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
           Y  +LP  SVI+ F  E + +L+RTV S++ R P+  ++E+ILVDD SS+  L  +L+ +
Sbjct: 91  YSSNLPTVSVIIPFFEEHWETLLRTVSSVLNRAPSGLIKEVILVDDGSSRKYLKDRLDSH 150

Query: 137 IQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           +      G VR+I    R GLIR ++ GA+E+ GEV++FLD+HCE G+NWLPPLL PI +
Sbjct: 151 LATAYPGGIVRVIHLEHRGGLIRAKTAGAREATGEVLIFLDSHCEAGINWLPPLLDPIAA 210

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
           + K +  P ID ID  T+E+R+    D   RG F+W + YK   LP R  + R +  EP+
Sbjct: 211 NYKTVVCPFIDVIDADTFEYRA---QDEGARGAFDWELYYKR--LP-RLPEDRYHPEEPF 264

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
            SP  AGGLFA+   +F ELGGYDPGL++WGGE +ELSFKIWMCGG +   PCSRIGH+Y
Sbjct: 265 DSPVMAGGLFAISAKWFWELGGYDPGLVIWGGEQYELSFKIWMCGGRMVDAPCSRIGHIY 324

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           R +   NF K      G  +  NYKRV E W DE +K Y Y R P    LD GD+++Q
Sbjct: 325 RKYST-NFPKAE---FGDFVGRNYKRVAEVWMDE-YKEYLYKRRPRYRDLDAGDLTKQ 377


>gi|449683613|ref|XP_002154358.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Hydra magnipapillata]
          Length = 641

 Score =  298 bits (764), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 157/376 (41%), Positives = 227/376 (60%), Gaps = 19/376 (5%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHL-PEAYRAAGDASLGEYGMNMETSNHIS 59
           RPV+    K       + P K   GEGG+A +L  EA +   +     +  N   S+ IS
Sbjct: 110 RPVYDISAKKN-----INPMK---GEGGEASYLDTEAEKQYAEKIFANHSFNSVLSDKIS 161

Query: 60  FDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
            DRT+ D+R + C  K+  YP  LP ASVI+ FHNE +S L+RTVHS++ RTP   L +I
Sbjct: 162 LDRTMRDVRGDLCIEKHKTYPRKLPTASVIICFHNEAYSVLLRTVHSVLNRTPPDLLTDI 221

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
           ILVDD S   +L + L+D++ + + K+++IRN +R GLIR+R  GA  SRG+V++FLD+H
Sbjct: 222 ILVDDKSEYENLKRPLDDHVAQLSKKIKIIRNAKRSGLIRSRINGADLSRGDVLIFLDSH 281

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKEN 237
           CE    W  PLLA I      + VP+I+ I+  T ++ +   PD   RG F W + YK  
Sbjct: 282 CETTPGWAEPLLARIAEKSSNVVVPIIEVINADTLQYAAAANPDQ--RGGFSWDLFYKWK 339

Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
            +P  E   RK   +  ++PT AGGLFA+DR +F ++G YD  + +WGGEN E+SF+IWM
Sbjct: 340 PIPLDEQHLRKSPIDVIRTPTMAGGLFAIDRKYFYDMGTYDEEMDIWGGENLEMSFRIWM 399

Query: 298 CGGSIEWVPCSRIGHVYRSFM-PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
           CGG I+ +PCSR+GH++R F  PY F    ++     ++ N  R+ E W DE +K  +Y 
Sbjct: 400 CGGRIDIIPCSRVGHIFRKFTSPYKFPDGVEKT----LSKNLNRLAEVWLDE-YKELYYQ 454

Query: 357 REPLAMFLDMGDISEQ 372
           + P +   D GDIS++
Sbjct: 455 KRPQSKGKDYGDISQR 470


>gi|148230993|ref|NP_001087490.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
           [Xenopus laevis]
 gi|51261644|gb|AAH80006.1| MGC81846 protein [Xenopus laevis]
          Length = 603

 Score =  298 bits (762), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 164/374 (43%), Positives = 220/374 (58%), Gaps = 14/374 (3%)

Query: 1   RPVFKADGKLGN-LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHIS 59
           +P+    G  GN LE   E   +   E G  ++  E  +   D    ++  N+  SN + 
Sbjct: 66  QPIASHQGLNGNQLETKAEANADLSPELGMIFN--EQDQDVRDVGYQKHAFNLLISNRLG 123

Query: 60  FDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
           + R +PD R  +C    YP DLP AS+++ F+NE FS+L+RTVHS++ RTPAQ L EIIL
Sbjct: 124 YHRDVPDTRDSKCSKKTYPADLPHASIVICFYNEAFSALLRTVHSVLDRTPAQLLHEIIL 183

Query: 120 VDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 178
           VDD S   DL + L++Y+Q   + KV+L+RN +REGLIR R  GA  + G+V+VFLD+HC
Sbjct: 184 VDDNSELDDLKKDLDNYMQENLSEKVKLVRNKQREGLIRGRMVGASRATGDVLVFLDSHC 243

Query: 179 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE 238
           EV   WL PLLAPI  + K +  PVID I   T     +Y      RG F WG+ +K + 
Sbjct: 244 EVNEMWLQPLLAPIRENPKTVVCPVIDIISSDTL----IYSSSPVVRGGFNWGLHFKWDP 299

Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
           +P  E    +  + P++SPT AGGLF MDR +F  LG YD G+ +WGGEN E+SF+IWMC
Sbjct: 300 VPLSELGGPEGYTAPFRSPTMAGGLFVMDREYFNTLGHYDSGMDIWGGENLEISFRIWMC 359

Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
           GGS+  VPCSR+GH++R   PY      D      + YN  R+   W DE    YF  R 
Sbjct: 360 GGSLLIVPCSRVGHIFRKRRPYGSPGGHDT-----MAYNSLRLAHVWMDEYKDQYFALR- 413

Query: 359 PLAMFLDMGDISEQ 372
           P     D GDISE+
Sbjct: 414 PELRNKDYGDISER 427


>gi|358331987|dbj|GAA50722.1| putative polypeptide N-acetylgalactosaminyltransferase 10
           [Clonorchis sinensis]
          Length = 738

 Score =  297 bits (761), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 160/357 (44%), Positives = 223/357 (62%), Gaps = 13/357 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + GPGE G A  L    +   +  L + G N   S+ I+ DR++ D+R  +CK   Y
Sbjct: 207 EANRVGPGEQGAAVRLFGEQKVESEKFLNQNGFNTYISDMIAIDRSVADIRHPKCKAMLY 266

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+++ F  E +++L+RT  S +KR+P   ++E+ILVDD S++  L   L+ Y+
Sbjct: 267 LAKLPSVSLVIPFFQENWNALLRTFVSSLKRSPPGLIKEVILVDDGSTREYLKGPLDRYL 326

Query: 138 QRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           ++   +G VR+IR+ +REGLI  R RGA+ + GEV+VFLD+HCE   NWLPPL+ PI  D
Sbjct: 327 EQHYPDGLVRVIRSPKREGLITARIRGARAATGEVLVFLDSHCEANPNWLPPLVDPIARD 386

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYK 255
            K++T P ID I   T+E+R+    D   RG F+W + YK   LP +  +   +   P+ 
Sbjct: 387 YKVVTCPFIDVISADTFEYRA---QDEGARGAFDWELFYKR--LP-KLPQDLPHPERPFD 440

Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
           SP  AGGLFA+   +F ELGGYDPGL++WGGE +ELSFKIWMCGG +  +PCSRIGH+YR
Sbjct: 441 SPVMAGGLFAISAKWFWELGGYDPGLVIWGGEQYELSFKIWMCGGRMIDIPCSRIGHIYR 500

Query: 316 SFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +  P +F        G  +  NYKRV ETW DE +K Y Y+R P    +D GD+SEQ
Sbjct: 501 TH-PTDFPSAG---LGDFLGKNYKRVAETWMDE-YKEYIYSRRPHYRHIDAGDLSEQ 552


>gi|357624971|gb|EHJ75544.1| hypothetical protein KGM_17358 [Danaus plexippus]
          Length = 626

 Score =  297 bits (761), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 155/359 (43%), Positives = 220/359 (61%), Gaps = 9/359 (2%)

Query: 15  PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
           P ++P +E PGE GKA ++P            E   N+  S+ IS +R++ D+R E+CK 
Sbjct: 113 PFVKPQEETPGEMGKAVNIPIEQEKVMLEKFQENQFNLLASDMISLNRSLTDVRFEKCKA 172

Query: 75  WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
             YP  LP  SV++VFHNE +++L+RT+ S I R+P   L+EIILVDD S K  L +KLE
Sbjct: 173 KRYPTLLPTTSVVIVFHNEAWTTLLRTIWSTINRSPRPLLKEIILVDDASEKEHLGKKLE 232

Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           +YI+      RL R   R GLIR R  GAK  +G+VI FLDAHCE    WL PLL+ I  
Sbjct: 233 EYIKTLPVSTRLFRTESRSGLIRARLLGAKHVKGDVITFLDAHCECTEGWLEPLLSRIVE 292

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEP 253
           DR  +  P+ID I   T+E+  +   D  + G F W + ++   +PERE ++R  + + P
Sbjct: 293 DRSTVVCPIIDVISDTTFEY--IQASDMTWGG-FNWKLNFRWYRVPEREMQRRGGDRTAP 349

Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
            ++PT AGGLFA+DR +F ++G YD G+ +WGGEN E+SF++W CGG +E VPCS +GHV
Sbjct: 350 LRTPTMAGGLFAIDREYFYKIGSYDEGMDIWGGENLEMSFRVWQCGGVLEIVPCSHVGHV 409

Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +R   PY+F      V    +  N  RV E W DE  + ++Y   P A+ + +GD+SE+
Sbjct: 410 FRDKSPYSFPGGVQAV----VLKNAARVAEVWMDEWGE-FYYAMNPGALNVPVGDVSER 463


>gi|291243600|ref|XP_002741689.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Saccoglossus kowalevskii]
          Length = 524

 Score =  297 bits (761), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 161/367 (43%), Positives = 226/367 (61%), Gaps = 15/367 (4%)

Query: 9   KLGNLEPPLEPYK-EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL 67
           ++ NL+    P   +GPGE G +        A           N   S+ IS +R IPD+
Sbjct: 3   RVQNLDVTTAPRNPKGPGEYGVSVITRPEDEAKVKTGWKHASFNEFVSDMISVERAIPDV 62

Query: 68  RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           R EEC+   Y   LP  S+I+ F  E +S+L+R+VHS+I R+P Q ++EIILVDDFSS+ 
Sbjct: 63  RPEECQDKLYSDSLPSTSIIICFTEESWSTLVRSVHSVINRSPPQLIKEIILVDDFSSRE 122

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L   L+ Y++RF  +V+++R   REGLIR R RG + ++GEV+ FLD+H E G+ WL P
Sbjct: 123 YLKAPLDKYMKRF-PQVKILRLENREGLIRGRLRGTEIAQGEVLTFLDSHIECGVGWLEP 181

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 247
           +L  I  DR+ +  P+IDGID   +     Y   +  RG F W M +K   +P+ E K+R
Sbjct: 182 MLQRIKEDRRNVVAPMIDGIDATKFS----YAASNLIRGGFSWEMQFKWKPIPDYEMKRR 237

Query: 248 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 307
           K  + P +SPT AGGLFA+D+++FLE+G YDPGL +WG EN ELSFKIWMCGG++E +PC
Sbjct: 238 KDETWPIRSPTMAGGLFAIDKSYFLEIGTYDPGLEIWGAENLELSFKIWMCGGNLEMIPC 297

Query: 308 SRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLD 365
           S +GHV+R+  PY F       +G + T+  N  RV E W DE +K  FY  +P     D
Sbjct: 298 SHVGHVFRASQPYKFP------EGNIKTFMRNNMRVAEVWMDE-YKDIFYALKPQLKGED 350

Query: 366 MGDISEQ 372
            GD++E+
Sbjct: 351 YGDVTER 357


>gi|321456141|gb|EFX67256.1| hypothetical protein DAPPUDRAFT_218737 [Daphnia pulex]
          Length = 639

 Score =  297 bits (761), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 153/360 (42%), Positives = 228/360 (63%), Gaps = 12/360 (3%)

Query: 16  PLEPYKEG-PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
           P+ P + G PGE GK  HLP    +           N+  S+ IS +R++PD+R+E C+ 
Sbjct: 123 PVVPEQAGQPGEMGKPVHLPADQESLMREKFRLNQFNLLASDSISLNRSLPDVRLEGCRD 182

Query: 75  WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
             YP  LP  S+++VFHNE +S+L+RTV SII R+P + L EIILVDD S +  L ++LE
Sbjct: 183 KSYPGLLPTTSIVIVFHNEAWSTLLRTVWSIITRSPRELLAEIILVDDASERDYLGKELE 242

Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           D++  F   V ++R  +R GLIR R  GAK+ +G+VI FLDAHCE    WL PLLA +  
Sbjct: 243 DHVANFPVPVHVLRTHKRSGLIRARLIGAKQVKGQVITFLDAHCECTEGWLEPLLARVAE 302

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEP 253
           +RKI+  P+ID I  +++E+  V   D  + G F W + ++   +P+RE  +R  + ++P
Sbjct: 303 NRKIVVCPIIDVISDESFEY--VTASDMTWGG-FNWKLNFRWYRVPQREMDRRNGDRTQP 359

Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
            ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF++W CGG +E +PCS +GHV
Sbjct: 360 LRTPTMAGGLFSIDKDYFEEIGTYDEGMDIWGGENLEMSFRVWQCGGELEIIPCSHVGHV 419

Query: 314 YRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +R   PY+F G +A      ++  N  RV E W D + K +FY   P A  +++GD+S +
Sbjct: 420 FRDKSPYSFPGGVA-----KIVNKNAARVAEVWMD-RWKDFFYEMNPGARSVEVGDVSSR 473


>gi|355689586|gb|AER98882.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 [Mustela putorius
           furo]
          Length = 320

 Score =  297 bits (760), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 147/298 (49%), Positives = 200/298 (67%), Gaps = 7/298 (2%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 20  GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNGKRYLETLP 77

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + LEDY+  F  
Sbjct: 78  NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 137

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLPPLL  I  +RK +  P
Sbjct: 138 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 196

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           +ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  AGG
Sbjct: 197 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMAGG 252

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           LFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 253 LFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPY 310


>gi|51315700|sp|Q6P6V1.1|GLT11_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
           AltName: Full=Polypeptide GalNAc transferase 11;
           Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 11;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 11
 gi|38303875|gb|AAH62004.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
           [Rattus norvegicus]
          Length = 608

 Score =  297 bits (760), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 164/376 (43%), Positives = 221/376 (58%), Gaps = 17/376 (4%)

Query: 2   PVFKA----DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
           P FKA    D    N+E P +   +   E G  ++  E  +   D    ++  NM  SN 
Sbjct: 69  PQFKANRMDDLMNNNIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLISNR 126

Query: 58  ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           + + R +PD R  EC+   YP DLP ASV++ F+NE FS+L+RTVHS++ RTPA  L EI
Sbjct: 127 LGYHRDVPDTRNAECRGKSYPTDLPTASVVICFYNEAFSALLRTVHSVVDRTPAHLLHEI 186

Query: 118 ILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 176
           ILVDD S   DL  +L++YIQR+   KV++IRN +REGLIR R  GA  + GEV+VFLD+
Sbjct: 187 ILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDS 246

Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 236
           HCEV + WL PLLA I  D   +  PVID I   T      Y      RG F WG+ +K 
Sbjct: 247 HCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKW 302

Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
           + +P  +       + P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IW
Sbjct: 303 DLVPVSDLGGADSATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIW 362

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
           MCGG +  +PCSR+GH++R   PY   +  D      +T+N  R+   W DE  + YF  
Sbjct: 363 MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYFSL 417

Query: 357 REPLAMFLDMGDISEQ 372
           R  L      G+ISE+
Sbjct: 418 RPDLKT-KSFGNISER 432


>gi|404434384|ref|NP_001258248.1| polypeptide N-acetylgalactosaminyltransferase 11 [Rattus
           norvegicus]
 gi|404501473|ref|NP_955425.2| polypeptide N-acetylgalactosaminyltransferase 11 [Rattus
           norvegicus]
 gi|149031397|gb|EDL86387.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
           isoform CRA_b [Rattus norvegicus]
          Length = 609

 Score =  297 bits (760), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 164/376 (43%), Positives = 221/376 (58%), Gaps = 17/376 (4%)

Query: 2   PVFKA----DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
           P FKA    D    N+E P +   +   E G  ++  E  +   D    ++  NM  SN 
Sbjct: 70  PQFKANRMDDLMNNNIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLISNR 127

Query: 58  ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           + + R +PD R  EC+   YP DLP ASV++ F+NE FS+L+RTVHS++ RTPA  L EI
Sbjct: 128 LGYHRDVPDTRNAECRGKSYPTDLPTASVVICFYNEAFSALLRTVHSVVDRTPAHLLHEI 187

Query: 118 ILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 176
           ILVDD S   DL  +L++YIQR+   KV++IRN +REGLIR R  GA  + GEV+VFLD+
Sbjct: 188 ILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDS 247

Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 236
           HCEV + WL PLLA I  D   +  PVID I   T      Y      RG F WG+ +K 
Sbjct: 248 HCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKW 303

Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
           + +P  +       + P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IW
Sbjct: 304 DLVPVSDLGGADSATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIW 363

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
           MCGG +  +PCSR+GH++R   PY   +  D      +T+N  R+   W DE  + YF  
Sbjct: 364 MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYFSL 418

Query: 357 REPLAMFLDMGDISEQ 372
           R  L      G+ISE+
Sbjct: 419 RPDLKT-KSFGNISER 433


>gi|344265184|ref|XP_003404666.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 10-like [Loxodonta
           africana]
          Length = 602

 Score =  297 bits (760), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 159/352 (45%), Positives = 215/352 (61%), Gaps = 19/352 (5%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+ Y + +A R   D +  E G N+  S+ IS +R++PD+R   C    Y   LP
Sbjct: 88  GHGEQGRPYPMTDAERV--DQAYRENGFNIYISDKISLNRSLPDIRHPNCNSKRYLEMLP 145

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +  L + L     R +G
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLHKPL----XRLHG 201

Query: 143 KVRLIRNT--EREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
               +R +  E EGLIRTR  GA  +  +VI FLD+HCE  +NWLPPLL  I  +RK + 
Sbjct: 202 PFPSVRISVPETEGLIRTRMLGASAAIXDVITFLDSHCEANVNWLPPLLDRIARNRKTIV 261

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P+ID ID+   +FR   +     RG F+W M YK   +P    K     S+P++SP  A
Sbjct: 262 CPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP--SDPFESPVMA 317

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+GH+YR ++PY
Sbjct: 318 GGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRKYVPY 377

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                     G  +  N KRV E W DE +  Y Y R P    L  GD++ Q
Sbjct: 378 KVP------AGVSLARNLKRVAEVWMDE-YAEYIYQRRPEYRHLSAGDVAAQ 422


>gi|149031398|gb|EDL86388.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
           isoform CRA_c [Rattus norvegicus]
          Length = 560

 Score =  297 bits (760), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 159/362 (43%), Positives = 216/362 (59%), Gaps = 13/362 (3%)

Query: 12  NLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
           N+E P +   +   E G  ++  E  +   D    ++  NM  SN + + R +PD R  E
Sbjct: 35  NIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLISNRLGYHRDVPDTRNAE 92

Query: 72  CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
           C+   YP DLP ASV++ F+NE FS+L+RTVHS++ RTPA  L EIILVDD S   DL  
Sbjct: 93  CRGKSYPTDLPTASVVICFYNEAFSALLRTVHSVVDRTPAHLLHEIILVDDSSDFDDLKG 152

Query: 132 KLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
           +L++YIQR+   KV++IRN +REGLIR R  GA  + GEV+VFLD+HCEV + WL PLLA
Sbjct: 153 ELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 212

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
            I  D   +  PVID I   T      Y      RG F WG+ +K + +P  +       
Sbjct: 213 IILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPVSDLGGADSA 268

Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
           + P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IWMCGG +  +PCSR+
Sbjct: 269 TAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRV 328

Query: 311 GHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           GH++R   PY   +  D      +T+N  R+   W DE  + YF  R  L      G+IS
Sbjct: 329 GHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYFSLRPDLKT-KSFGNIS 382

Query: 371 EQ 372
           E+
Sbjct: 383 ER 384


>gi|91088223|ref|XP_973543.1| PREDICTED: similar to polypeptide GalNAc transferase 5 CG31651-PA
           [Tribolium castaneum]
 gi|270011823|gb|EFA08271.1| hypothetical protein TcasGA2_TC005902 [Tribolium castaneum]
          Length = 602

 Score =  296 bits (759), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 151/359 (42%), Positives = 220/359 (61%), Gaps = 9/359 (2%)

Query: 15  PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
           P + P    PGE GKA H+P                N+  S+ IS +R++ D+R+E CK 
Sbjct: 86  PTVLPAHGLPGEMGKAVHIPPEQEGLMKEKFKLNQFNLLASDMISLNRSLADVRLEGCKD 145

Query: 75  WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
             YP  LP  S+++VFHNE +S+L+RTV S+I R+P   L+EIILVDD S +  L +KLE
Sbjct: 146 KKYPKLLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRPLLKEIILVDDASEREHLGRKLE 205

Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           +Y+Q     V ++R  +R GLIR R  GAK  +G+VI FLDAHCE    WL PLLA I  
Sbjct: 206 EYVQTLPVPVIVLRTHKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIVQ 265

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEP 253
           DRK +  P+ID I  +T+E+  +   D  + G F W + ++   +P+RE ++R  + + P
Sbjct: 266 DRKTVVCPIIDVISDETFEY--ITASDMTWGG-FNWKLNFRWYRVPQREMERRNNDRTAP 322

Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
            ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG +E +PCS +GHV
Sbjct: 323 LRTPTMAGGLFSIDKEYFYELGSYDEGMDIWGGENLEMSFRVWQCGGKLEIIPCSHVGHV 382

Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +R   PY F     ++    + +N  RV E W DE  + ++Y   P A  + +GD+S +
Sbjct: 383 FRDKSPYTFPGGVSKI----VLHNAARVAEVWMDE-WRDFYYAMNPGARSVPVGDVSAR 436


>gi|125977364|ref|XP_001352715.1| GA15243 [Drosophila pseudoobscura pseudoobscura]
 gi|54641464|gb|EAL30214.1| GA15243 [Drosophila pseudoobscura pseudoobscura]
          Length = 676

 Score =  296 bits (759), Expect = 9e-78,   Method: Compositional matrix adjust.
 Identities = 165/353 (46%), Positives = 221/353 (62%), Gaps = 14/353 (3%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLG-EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           G GEGGKA  L +      +  +  E G N   S+ IS +R++PD+R + C+  DY  +L
Sbjct: 150 GIGEGGKAAKLEDEATLEQERRMSLENGFNALLSDSISVNRSLPDIRHKLCRQKDYLANL 209

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRF 140
           P  SVI++F+NE  S LMR+VHS+I R+P + L+EIILVDDFS +  L  +LE YI + F
Sbjct: 210 PTVSVIIIFYNEYLSVLMRSVHSLINRSPKELLKEIILVDDFSDRDYLHAELELYIKEHF 269

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
           +  VR++R   R GLI  RS GA+ +  EV++FLD+H E   NWLPPLL PI  +++   
Sbjct: 270 SKIVRVVRLPNRTGLIGARSAGARNATAEVLLFLDSHVEANYNWLPPLLEPIAKNKRTAV 329

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P ID ID+ T+ +R+    D   RG F+W   YK   L + + K   Y ++P+KSP  A
Sbjct: 330 CPFIDVIDHATFNYRA---QDEGARGAFDWEFYYKRLPLLDEDLK---YPADPFKSPVMA 383

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG +   PCSRIGH+YR   P 
Sbjct: 384 GGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRIGHIYRG--PR 441

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
           N   +    KG  +  NYKRV E W DE +K Y Y   + +   +D GD++EQ
Sbjct: 442 NH--VPSPRKGDYLHRNYKRVAEVWMDE-YKNYLYDHADGIYDRIDAGDLTEQ 491


>gi|195167889|ref|XP_002024765.1| GL22638 [Drosophila persimilis]
 gi|194108170|gb|EDW30213.1| GL22638 [Drosophila persimilis]
          Length = 676

 Score =  296 bits (758), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 165/353 (46%), Positives = 221/353 (62%), Gaps = 14/353 (3%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLG-EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           G GEGGKA  L +      +  +  E G N   S+ IS +R++PD+R + C+  DY  +L
Sbjct: 150 GIGEGGKAAKLEDEATLEQERRMSLENGFNALLSDSISVNRSLPDIRHKLCRQKDYLANL 209

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRF 140
           P  SVI++F+NE  S LMR+VHS+I R+P + L+EIILVDDFS +  L  +LE YI + F
Sbjct: 210 PTVSVIIIFYNEYLSVLMRSVHSLINRSPKELLKEIILVDDFSDRDYLHAELELYIKEHF 269

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
           +  VR++R   R GLI  RS GA+ +  EV++FLD+H E   NWLPPLL PI  +++   
Sbjct: 270 SKIVRVVRLPNRTGLIGARSAGARNATAEVLLFLDSHVEANYNWLPPLLEPIAKNKRTAV 329

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P ID ID+ T+ +R+    D   RG F+W   YK   L + + K   Y ++P+KSP  A
Sbjct: 330 CPFIDVIDHATFNYRA---QDEGARGAFDWEFYYKRLPLLDEDLK---YPADPFKSPVMA 383

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG +   PCSRIGH+YR   P 
Sbjct: 384 GGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRIGHIYRG--PR 441

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
           N   +    KG  +  NYKRV E W DE +K Y Y   + +   +D GD++EQ
Sbjct: 442 NH--VPSPRKGDYLHRNYKRVAEVWMDE-YKNYLYDHADGIYDRIDAGDLTEQ 491


>gi|354486376|ref|XP_003505357.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Cricetulus griseus]
          Length = 497

 Score =  296 bits (758), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 154/344 (44%), Positives = 216/344 (62%), Gaps = 9/344 (2%)

Query: 28  GKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVI 87
           GKA  +P+  +            N+  S+ I+ +R++PD+R+E CK   YP +LP  SV+
Sbjct: 2   GKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDELPNTSVV 61

Query: 88  LVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLI 147
           +VFHNE +S+L+RTV+S+I R+P   L E+ILVDD S +  L   LE+Y++     V++I
Sbjct: 62  IVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKTLEVPVKII 121

Query: 148 RNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGI 207
           R  ER GLIR R RGA  S+G+VI FLDAHCE  L WL PLLA I  DRK +  P+ID I
Sbjct: 122 RMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVVCPIIDVI 181

Query: 208 DYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAM 266
              T+E+ +    D  Y G F W + ++   +P+RE  +RK + + P ++PT AGGLF++
Sbjct: 182 SDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSI 238

Query: 267 DRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLA 326
           DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R   PY F    
Sbjct: 239 DRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGT 298

Query: 327 DRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
               G +I  N +R+ E W DE  K +FY   P  + +D GD+S
Sbjct: 299 ----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDVS 337


>gi|432097047|gb|ELK27545.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Myotis davidii]
          Length = 558

 Score =  296 bits (757), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 153/332 (46%), Positives = 208/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  N+  SN +   R +PD R   CK   YP DLP ASV++ F+NE  S+L+RT
Sbjct: 96  DLGYQKHAFNLLISNRLGHHRDVPDTRNAACKDKIYPTDLPVASVVICFYNEALSALLRT 155

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRS 160
           VHS++ RTPA+ L EIILVDD S   DL  +L++++Q+   GK++LIRNT+REGLIR R 
Sbjct: 156 VHSVLDRTPARLLHEIILVDDSSDFDDLKGELDEFVQKHLPGKIKLIRNTKREGLIRGRM 215

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  DR+ +  PVID I   T      Y  
Sbjct: 216 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRRTVVCPVIDIISADTL----AYSS 271

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E +  +  + P KSPT AGGLFAM+R++F ELG YD G
Sbjct: 272 SPVVRGGFNWGLHFKWDLVPLSELEGPEGATAPIKSPTMAGGLFAMNRSYFSELGQYDSG 331

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 332 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 386

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R P       G++SE+
Sbjct: 387 LAHVWLDEYKEQYFSLR-PDLRTRSYGNVSER 417


>gi|339249613|ref|XP_003373794.1| polypeptide N-acetylgalactosaminyltransferase 10 [Trichinella
           spiralis]
 gi|316970007|gb|EFV54023.1| polypeptide N-acetylgalactosaminyltransferase 10 [Trichinella
           spiralis]
          Length = 587

 Score =  295 bits (756), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 157/355 (44%), Positives = 219/355 (61%), Gaps = 15/355 (4%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASL--GEYGMNMETSNHISFDRTIPDLRMEECKYWDYP 78
           ++GPGE G+A++LP          +     G N   S++++ +R+I DLR ++C    Y 
Sbjct: 75  RQGPGEQGEAFYLPNVSSVDHKKGILYKSNGFNALVSDYLALNRSIKDLRPKQCIGRSYL 134

Query: 79  LDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ 138
             L K SV++ F+NE +++L+RTVHS++ R+P + L+E+IL DDFS K  L Q LE Y++
Sbjct: 135 AKLEKVSVVIPFYNEHWTTLLRTVHSVVNRSPVELLQEVILADDFSDKPFLKQPLEAYVR 194

Query: 139 -RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
             + G VR++R  +REGLIR R  G+K +   V+VFLD+H E G NWLPPLL P+  + +
Sbjct: 195 DTWPGLVRIVRARKREGLIRARLLGSKAAISSVLVFLDSHSECGYNWLPPLLEPVALNYR 254

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +T P +D ID+ T+ +R     D   RG F+W + YK   L   +A    Y   P+ SP
Sbjct: 255 TVTCPFVDVIDHSTFLYRL---QDQGARGSFDWELYYKRLPLLPEDAA---YPDRPFNSP 308

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGG FA+   +F ELGGYD GL +WGGE +ELSFKIW CGG++  VPCS +GH+YR F
Sbjct: 309 VMAGGYFAISTKWFWELGGYDEGLDIWGGEQYELSFKIWQCGGTLIDVPCSHVGHIYREF 368

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            P+     A+   G  +  NYKRV E W DE +K Y Y R P    LD GDIS+Q
Sbjct: 369 SPF-----ANPGAGDFVGRNYKRVAEVWMDE-YKEYVYMRRPHYRKLDPGDISKQ 417


>gi|242005043|ref|XP_002423384.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
 gi|212506428|gb|EEB10646.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
          Length = 573

 Score =  295 bits (755), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 159/356 (44%), Positives = 219/356 (61%), Gaps = 15/356 (4%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E ++ G GE GK   LP+  +   +A     G N   S+ I  + ++PD+R   CK   Y
Sbjct: 60  ESHRIGVGEQGKPAFLPDKEKVQKEALYAVNGFNALLSDKIYLN-SLPDIRHPGCKEKKY 118

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             +L   SV++ FHNE +S+L+RTV+S++ R+P+  L+EIILVDD+SSK  L +KL+ Y+
Sbjct: 119 RKNLNTVSVVVPFHNEHWSTLLRTVYSVLNRSPSHLLKEIILVDDYSSKPFLKKKLDIYV 178

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
            R   KV++IR  ER GLIR R  GAK+++ +V++FLD+H E  +NWLPPLL PI  + K
Sbjct: 179 DRHLPKVKIIRLPERMGLIRARLAGAKKAKAQVLLFLDSHTEANVNWLPPLLEPIAENYK 238

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKS 256
               P ID I + T+E+R+    D   RG F+W   YK    LPE      K+ +EP++S
Sbjct: 239 TCVCPFIDVIAHDTFEYRA---QDEGRRGAFDWEFFYKRLPLLPE----DLKHPTEPFQS 291

Query: 257 PTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRS 316
           P  AGGLFA+   FF ELGGYD GL +WGGE +ELSFKIW CGG +   PCSR+GH+YR 
Sbjct: 292 PVMAGGLFAISAKFFWELGGYDEGLAIWGGEQYELSFKIWQCGGKMVDAPCSRVGHIYRK 351

Query: 317 FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           F P+    + D      +  NY+RV E W DE +  Y Y R P    +D GD++ Q
Sbjct: 352 FAPFPNPGIGD-----FVGKNYRRVAEVWMDE-YAEYLYKRRPHYRNIDPGDLTVQ 401


>gi|328723396|ref|XP_001946856.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 1 [Acyrthosiphon pisum]
          Length = 615

 Score =  295 bits (754), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 149/359 (41%), Positives = 218/359 (60%), Gaps = 9/359 (2%)

Query: 15  PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
           PP+   +   GEGG+   +     A       E   N+  S+ IS +R++ D+R  ECK 
Sbjct: 103 PPVREKRGKHGEGGRGVTMKPEQEALMKQKFKENQFNIIASDMISLNRSLQDIRQGECKS 162

Query: 75  WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
             YP  +P  S+++VFHNE +S+L+RTV S+I R+P   L+EI+LVDD S +  L +KLE
Sbjct: 163 KQYPTLMPTTSIVIVFHNEAWSTLLRTVWSVINRSPRSLLKEILLVDDASERDFLGKKLE 222

Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           DY+     + +++R  +R GLIR R  GAK   G+VI FLDAHCE    WL PLLA I  
Sbjct: 223 DYVATLPVETKVLRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECADGWLEPLLARIVL 282

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEP 253
           +RK +  PVID I   T+E+  V   D  + G F W + ++   +P+RE  +R  + + P
Sbjct: 283 NRKTVVCPVIDVISDDTFEY--VTASDMTWGG-FNWKLNFRWYRVPQREMTRRNQDRTAP 339

Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
            ++PT AGGLF++D+ +F +LG YD G+ +WGGEN E+SF+IWMCGG++E  PCS +GHV
Sbjct: 340 LRTPTMAGGLFSIDKDYFYQLGSYDEGMDIWGGENLEMSFRIWMCGGTLEISPCSHVGHV 399

Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +R   PY F      +    + +N  R+ E W DE  K ++Y   P A  +++GD+SE+
Sbjct: 400 FRKSTPYTFPGGTSHI----VNHNNARLAEVWMDE-WKHFYYAINPGASNVEVGDVSER 453


>gi|221042448|dbj|BAH12901.1| unnamed protein product [Homo sapiens]
          Length = 527

 Score =  295 bits (754), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 161/351 (45%), Positives = 217/351 (61%), Gaps = 13/351 (3%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G G+GG  ++  E  +   D    ++  NM  S+ + + R +PD R   CK   YP DLP
Sbjct: 13  GCGQGGMIFN--ERDQELRDLGYQKHAFNMLISDRLGYHRDVPDTRNAACKEKFYPPDLP 70

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-N 141
            ASV++ F+NE FS+L+RTVHS+I RTPA  L EIILVDD S   DL  +L++Y+Q++  
Sbjct: 71  AASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLP 130

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
           GK+++IRNT+REGLIR R  GA  + GEV+VFLD+HCEV + WL PLLA I  DR  +  
Sbjct: 131 GKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVC 190

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
           PVID I   T      Y      RG F WG+ +K + +P  E  + +  + P KSPT AG
Sbjct: 191 PVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGRAEGATAPIKSPTMAG 246

Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
           GLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY 
Sbjct: 247 GLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 306

Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
             +  D      +T+N  R+   W DE  + YF  R  L      G+ISE+
Sbjct: 307 SPEGQD-----TMTHNSLRLAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 351


>gi|395838351|ref|XP_003792079.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
           [Otolemur garnettii]
          Length = 608

 Score =  294 bits (753), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 163/376 (43%), Positives = 229/376 (60%), Gaps = 17/376 (4%)

Query: 2   PVFKA----DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
           P F+A    D K G++E P++ + +   E G  ++  E  +   D    ++  N+  SN 
Sbjct: 69  PQFRANRIDDMKDGHVEDPVKDHLKFSSELGMIFN--ERDQELRDLGYQKHAFNVLISNR 126

Query: 58  ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           + + R +PD R   CK   YP DLP ASV++ F+NE FS+L+RTVHS+I RTP   L E+
Sbjct: 127 LGYHRDVPDTRNAACKEQSYPTDLPVASVVICFYNEAFSALLRTVHSVIDRTPVHLLHEV 186

Query: 118 ILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 176
           ILVDD S   DL  +L++Y+Q++  GK+++IRNT+REGLIR R  GA ++ GEV+VFLD+
Sbjct: 187 ILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAQATGEVLVFLDS 246

Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 236
           HCEV + WL PLLA I  D++ +  PVID I   T      Y      RG F WG+ +K 
Sbjct: 247 HCEVNVMWLQPLLAAIREDQQTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKW 302

Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
           + +P  E    +  + P KSPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IW
Sbjct: 303 DLVPLSELGGEEGATAPIKSPTMAGGLFAMNRQYFHDLGQYDSGMDIWGGENLEISFRIW 362

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
           MCGG +  +PCSR+GH++R   PY   +  D      +T+N  R+   W DE  + YF  
Sbjct: 363 MCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYFSL 417

Query: 357 REPLAMFLDMGDISEQ 372
           R  L      G+ISE+
Sbjct: 418 RPDLKT-KSYGNISER 432


>gi|328723394|ref|XP_003247832.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 2 [Acyrthosiphon pisum]
          Length = 615

 Score =  294 bits (753), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 147/359 (40%), Positives = 220/359 (61%), Gaps = 9/359 (2%)

Query: 15  PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
           PP+   +   GEGG+   +     A       E   N+  S+ IS +R++ D+R  ECK 
Sbjct: 103 PPVREKRGKHGEGGRGVTMKPEQEALMKQKFKENQFNIIASDMISLNRSLQDIRQGECKS 162

Query: 75  WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
             YP  +P  S+++VFHNE +S+L+RTV S+I R+P   L+EI+LVDD S +  L +KLE
Sbjct: 163 KQYPTLMPTTSIVIVFHNEAWSTLLRTVWSVINRSPRSLLKEILLVDDASERDFLGKKLE 222

Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           DY+     + +++R  +R GLIR R  GAK   G+VI FLDAHCE    WL PLLA I  
Sbjct: 223 DYVATLPVETKVLRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECADGWLEPLLARIVL 282

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEP 253
           +RK +  PVID I   T+E+  V   D  + G F W + ++   +P+RE  +R  + + P
Sbjct: 283 NRKTVVCPVIDVISDDTFEY--VTASDMTWGG-FNWKLNFRWYRVPQREMTRRNQDRTAP 339

Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
            ++PT AGGLF++D+ +F +LG YD G+ +WGGEN E+SF++W CGG++E +PCS +GHV
Sbjct: 340 LRTPTMAGGLFSIDKDYFYQLGSYDEGMDIWGGENLEMSFRVWQCGGTLEIIPCSHVGHV 399

Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +R   PY+F     ++    + +N  RV E W DE  + ++Y   P A  +++GD+SE+
Sbjct: 400 FRDKSPYSFPGGVSKI----VLHNAARVAEVWMDE-WRDFYYAMNPGASNVEVGDVSER 453


>gi|3047207|gb|AAC13679.1| GLY9 [Caenorhabditis elegans]
          Length = 579

 Score =  294 bits (752), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 151/356 (42%), Positives = 224/356 (62%), Gaps = 13/356 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK--YWDYP 78
           +EGPGE GK   L       G A + ++ MN+  S+ IS DR +PD R++ CK   +DY 
Sbjct: 72  REGPGEKGKPVVLTGKDAELGQADMKKWFMNVHASDKISLDRDVPDPRIQACKDIKYDYA 131

Query: 79  LDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ 138
             LPK SVI++F +E ++ L+RTVHS+I R+P + L+E+IL+DD S + +L + L+++I+
Sbjct: 132 A-LPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQEVILLDDNSKRQELQEPLDEHIK 190

Query: 139 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
           RF GKVRLIR  +R GLIR +  GA+E+ G++IVFLD+HCE    WL P++  I  +R  
Sbjct: 191 RFGGKVRLIRKHDRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEPIVQRISDERTA 250

Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPT 258
           +  P+ID I   T  +   +       G F W + +    L E E K+R   ++  +SPT
Sbjct: 251 IVCPMIDSISDNTLAYHGDWSLS---TGGFSWALHFTWEGLSEEEQKRRTKPTDYIRSPT 307

Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
            AGGL A +R +F E+GGYD  + +WGGEN E+SF+ WMCGGSIE++PCS +GH++R+  
Sbjct: 308 MAGGLLAANREYFFEVGGYDEEMDIWGGENLEISFRAWMCGGSIEFIPCSHVGHIFRAGH 367

Query: 319 PYNF-GKLADR-VKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           PYN  G+  ++ V G     N KR+ E W D+  + Y+  RE L    D+GD++ +
Sbjct: 368 PYNMTGRNNNKDVHGT----NSKRLAEVWMDDYKRLYYMHREDLRT-KDVGDLTAR 418


>gi|297682043|ref|XP_002818744.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11,
           partial [Pongo abelii]
          Length = 587

 Score =  294 bits (752), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 157/332 (47%), Positives = 208/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R   CK   YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPDLPAASVVVCFYNEAFSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS+I RTPA  L EIILVDD S   DL  +L++Y+Q++  GK+++IRNT+REGLIR R 
Sbjct: 171 VHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  DR  +  PVID I   T      Y  
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E +  +  + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELRGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRSDLKT-KSYGNISER 432


>gi|156397426|ref|XP_001637892.1| predicted protein [Nematostella vectensis]
 gi|156225008|gb|EDO45829.1| predicted protein [Nematostella vectensis]
          Length = 513

 Score =  294 bits (752), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 150/352 (42%), Positives = 218/352 (61%), Gaps = 12/352 (3%)

Query: 25  GEGGK-AYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK--YWDYPLDL 81
           G GGK A+   E  +   +     +  N   S+ IS DRT+ D+R E CK  +  YP  L
Sbjct: 8   GGGGKPAFLESEENKKLAEKYFANHSFNWLLSDKISLDRTLDDVRSERCKAKHNTYPAKL 67

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI+ FH E  S L+RTVHS+I RTP + L E+I+VDDFS  A L + L+D++ +F 
Sbjct: 68  PTTSVIICFHKERLSVLLRTVHSVINRTPPELLAEVIVVDDFSQDAKLGKPLDDHVAQFT 127

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
            KV+++R  +REGL+R R +GA  ++G+V+ FLD+HCE    W  PLLA I +DR+ +  
Sbjct: 128 -KVKVLRMKKREGLVRARLQGANTAKGDVLTFLDSHCEATPGWAEPLLARIAADRRNVVC 186

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
           P I+ I+  T+ ++     D   RG F W + +K   +P  E K R  +S+P ++PT AG
Sbjct: 187 PAIEVINADTFAYQGSTNADQ--RGGFSWDLFFKWKGIPPEEQKLRNDDSDPIRTPTMAG 244

Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM-PY 320
           GLF++ R +F ++G YD  + +WGGEN ELSF++WMCGG +E V CSR+GHV+R +  PY
Sbjct: 245 GLFSIHRQYFFDIGSYDEEMDIWGGENLELSFRVWMCGGRLEIVTCSRVGHVFRKYTSPY 304

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            F    +R     +T N+ R+ E W DE +K  +Y ++P A   D GDIS++
Sbjct: 305 KFPDGVERT----LTKNFNRLAEVWMDE-YKDLYYNKKPQAKNSDYGDISKR 351


>gi|341878756|gb|EGT34691.1| CBN-GLY-9 protein [Caenorhabditis brenneri]
          Length = 579

 Score =  294 bits (752), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 151/356 (42%), Positives = 225/356 (63%), Gaps = 13/356 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK--YWDYP 78
           +EGPGE GK   L       G A + ++ MN+  S+ IS DR +PD R++ CK   +DY 
Sbjct: 72  REGPGEKGKPVVLTGKDAELGQADMKKWFMNVHASDKISLDRDVPDPRIQACKDIKYDYS 131

Query: 79  LDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ 138
             LPK SVI++F +E ++ L+RTVHS+I R+P + L+E+IL+DD S + +L + L+++I+
Sbjct: 132 -SLPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQEVILLDDNSKRQELQEPLDEHIK 190

Query: 139 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
           RF GKV+LIR   R GLIR +  GA+E+ G++IVFLD+HCE    WL P++  I  +R  
Sbjct: 191 RFGGKVKLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEPIVQRISDERTA 250

Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPT 258
           +  P+ID I   T  +   +       G F W + +    +PE E K+RK  ++  +SPT
Sbjct: 251 IVCPMIDSISDSTLAYHGDWSLS---VGGFSWALHFTWEGIPEDEQKRRKKPTDYIRSPT 307

Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
            AGGL A +R +F E+GGYD  + +WGGEN E+SF+ WMCGGSIE++PCS +GH++R+  
Sbjct: 308 MAGGLLAANREYFFEVGGYDEEMDIWGGENLEISFRNWMCGGSIEFIPCSHVGHIFRAGH 367

Query: 319 PYNF-GKLADR-VKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           PYN  G+  ++ V G     N KR+ E W D+  + Y+  RE L    D+GD++ +
Sbjct: 368 PYNMTGRNNNKDVHGT----NSKRLAEVWMDDYKRLYYMHREDLRT-KDVGDLTSR 418


>gi|157135226|ref|XP_001663438.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108870268|gb|EAT34493.1| AAEL013274-PA [Aedes aegypti]
          Length = 592

 Score =  294 bits (752), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 151/351 (43%), Positives = 215/351 (61%), Gaps = 11/351 (3%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           PGE GK   +P + +        E   N+  S+ I  +R++ D+R  +CK   YP  LP 
Sbjct: 82  PGELGKPVKIPSSQQELMKEKFKENQFNLLASDMIWLNRSLTDVRHHDCKKKHYPTKLPT 141

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
            S+++VFHNE +S+L+RT+ S+I R+P   L+EIILVDD S +  L Q+LEDY+Q     
Sbjct: 142 TSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASERDHLGQQLEDYVQTLPVH 201

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
             ++R  +R GLIR R  GAK  +G+VI FLDAHCE    WL PLLA I  DRK +  P+
Sbjct: 202 TYVLRTGKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIVLDRKTVVCPI 261

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
           ID I  +T+E+  V   D  + G F W + ++   +P RE ++R ++ + P ++PT AGG
Sbjct: 262 IDVISDETFEY--VTASDQTWGG-FNWKLNFRWYRVPAREMQRRNHDRTAPLRTPTMAGG 318

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG +E  PCS +GHV+R   PY F
Sbjct: 319 LFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIAPCSHVGHVFRDKSPYTF 378

Query: 323 -GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            G +A+     ++  N  RV E W DE  K ++Y   P A     GD+SE+
Sbjct: 379 PGGVAN-----IVLKNAARVAEVWLDE-WKEFYYQMSPGARKASAGDVSER 423


>gi|390333619|ref|XP_785951.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Strongylocentrotus purpuratus]
          Length = 756

 Score =  294 bits (752), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 152/354 (42%), Positives = 213/354 (60%), Gaps = 14/354 (3%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           E PG  GK   +P   ++  D        N+  S+ I  +R++PD+R ++C Y  Y   L
Sbjct: 248 ELPGANGKPVQIPSELQSEADDLFIINSFNLMASDMIGINRSLPDVRPKQCLYKQYSSAL 307

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI+VFHNE +S+L+RTVHS+I RTP QYL EIILVDD S  A L  +L+ Y+ +  
Sbjct: 308 PNTSVIIVFHNEAWSALLRTVHSVINRTPRQYLSEIILVDDASIHAHLGHQLDSYVAKLP 367

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
             V + R   R GLIR R RGA  ++G+V+ FLD+HCE    WL PLLA I  DR  +  
Sbjct: 368 VPVHVERMGVRSGLIRARMRGALVAQGQVLTFLDSHCEASHGWLEPLLARIAEDRSNVVT 427

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYR--GIFEWGMLYKENELPEREAKKRKYN-SEPYKSPT 258
           PVID I+ Q       YE D+     G+F+W + ++   +  R+    K++ + P  SPT
Sbjct: 428 PVIDVINAQNL----AYEADNQTPAIGVFDWSLTFRWQSIQRRDLPLLKHDPTHPIPSPT 483

Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
            AGGLFA+DR++F+E G YD G  +WG EN E+SFK WMCGG IE +PCS +GH++R   
Sbjct: 484 MAGGLFAIDRSYFIETGMYDSGFEIWGAENLEISFKTWMCGGRIEILPCSHVGHIFRKHA 543

Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           PY+   L D      I+YN KR+ E W D  +K +FY   P A+ ++ G+ +++
Sbjct: 544 PYS-NTLTD-----FISYNNKRLAEVWLD-GYKEFFYFMSPSALKVNAGNYTDR 590


>gi|112418488|gb|AAI21876.1| galnt13 protein [Xenopus (Silurana) tropicalis]
          Length = 483

 Score =  293 bits (751), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 145/323 (44%), Positives = 212/323 (65%), Gaps = 9/323 (2%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N+  S+ I+ +R++PD+R+E CK   YP +LP  S+++VFHNE +S+L+RTVHS+I R+P
Sbjct: 11  NLMASDLIALNRSLPDVRLEGCKTKVYPDELPNTSIVIVFHNEAWSTLLRTVHSVINRSP 70

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
            + + EIILVDD S +  L   LE+Y++     V+++R  +R GLIR R RGA  ++G++
Sbjct: 71  HRLISEIILVDDSSERDFLKSPLENYVKHLEVPVKILRMEQRSGLIRARLRGANVAKGQI 130

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           I FLDAHCE  + WL PLLA I  DRK +  P+ID I   T+E+ +    D  Y G F W
Sbjct: 131 ITFLDAHCECTIGWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNW 187

Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            + ++   +P+RE  +RK + + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN 
Sbjct: 188 KLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEELGTYDSGMDIWGGENL 247

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF+IW CGGS+E V CS +GHV+R   PY F        G +I  N +R+ E W D+ 
Sbjct: 248 EMSFRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDD- 302

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
            K +FY   P  + +D GD+SE+
Sbjct: 303 FKDFFYIISPGVVKVDYGDVSER 325


>gi|270006170|gb|EFA02618.1| hypothetical protein TcasGA2_TC008338 [Tribolium castaneum]
          Length = 613

 Score =  293 bits (751), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 159/354 (44%), Positives = 211/354 (59%), Gaps = 16/354 (4%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GK   L  A     +      G N   S+ I+ DR +PD+R   CK   Y  D
Sbjct: 93  RRGTGEQGKPAFLTAAESDNYEKLYKVNGFNAALSDQIAIDRAVPDIRHPGCKSKKYLKD 152

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  SV++ FHNE +++L+RT  S++ R+P   L+E+ILVDD S+K    + L+DY+   
Sbjct: 153 LPTVSVVVPFHNEHWTTLLRTAASVVNRSPPHLLKEVILVDDCSTKEFSKKPLDDYLAAN 212

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             KVR I   ER GLIR R  GA+ +  +V++FLD+H E  +NWLPPLL PI  D K   
Sbjct: 213 LTKVRAIHLPERSGLIRARLAGARVATADVLIFLDSHTEANVNWLPPLLEPIAQDYKTCV 272

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTH 259
            P ID I Y+T+E+R+    D   RG F+W   YK    LPE      ++ +EP+KSP  
Sbjct: 273 CPFIDVIQYETFEYRA---QDEGARGAFDWEFFYKRLPLLPE----DLEHPTEPFKSPVM 325

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLFA+ R FF ELGGYD GL +WGGE +ELSFKIW CGG +   PCSR+GH+YR + P
Sbjct: 326 AGGLFAISRKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGLMVDAPCSRVGHIYRKYAP 385

Query: 320 Y-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           + N G      KG  +  NY+RV E W DE +  Y Y R P    +D GD+++Q
Sbjct: 386 FPNPG------KGDFVGRNYRRVAEVWMDE-YAEYLYKRRPHYRDIDPGDLTKQ 432


>gi|268370155|ref|NP_001161257.1| polypeptide GalNAc transferase 6-like [Tribolium castaneum]
          Length = 591

 Score =  293 bits (751), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 159/354 (44%), Positives = 211/354 (59%), Gaps = 16/354 (4%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE GK   L  A     +      G N   S+ I+ DR +PD+R   CK   Y  D
Sbjct: 71  RRGTGEQGKPAFLTAAESDNYEKLYKVNGFNAALSDQIAIDRAVPDIRHPGCKSKKYLKD 130

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  SV++ FHNE +++L+RT  S++ R+P   L+E+ILVDD S+K    + L+DY+   
Sbjct: 131 LPTVSVVVPFHNEHWTTLLRTAASVVNRSPPHLLKEVILVDDCSTKEFSKKPLDDYLAAN 190

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             KVR I   ER GLIR R  GA+ +  +V++FLD+H E  +NWLPPLL PI  D K   
Sbjct: 191 LTKVRAIHLPERSGLIRARLAGARVATADVLIFLDSHTEANVNWLPPLLEPIAQDYKTCV 250

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTH 259
            P ID I Y+T+E+R+    D   RG F+W   YK    LPE      ++ +EP+KSP  
Sbjct: 251 CPFIDVIQYETFEYRA---QDEGARGAFDWEFFYKRLPLLPE----DLEHPTEPFKSPVM 303

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLFA+ R FF ELGGYD GL +WGGE +ELSFKIW CGG +   PCSR+GH+YR + P
Sbjct: 304 AGGLFAISRKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGLMVDAPCSRVGHIYRKYAP 363

Query: 320 Y-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           + N G      KG  +  NY+RV E W DE +  Y Y R P    +D GD+++Q
Sbjct: 364 FPNPG------KGDFVGRNYRRVAEVWMDE-YAEYLYKRRPHYRDIDPGDLTKQ 410


>gi|21450297|ref|NP_659157.1| polypeptide N-acetylgalactosaminyltransferase 11 [Mus musculus]
 gi|51316059|sp|Q921L8.1|GLT11_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
           AltName: Full=Polypeptide GalNAc transferase 11;
           Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 11;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 11
 gi|15030306|gb|AAH11428.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 [Mus musculus]
 gi|18204499|gb|AAH21504.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 [Mus musculus]
 gi|21529335|emb|CAC79626.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Mus
           musculus]
 gi|21707973|gb|AAH34185.1| Galnt11 protein [Mus musculus]
 gi|23274082|gb|AAH36143.1| Galnt11 protein [Mus musculus]
 gi|23274085|gb|AAH36145.1| Galnt11 protein [Mus musculus]
 gi|33321872|gb|AAQ06668.1| UDP-GalNAc:polypeptide N-Acetylgalactosaminyltransferase T11 [Mus
           musculus]
 gi|74149639|dbj|BAE36442.1| unnamed protein product [Mus musculus]
 gi|148671131|gb|EDL03078.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11, isoform CRA_b [Mus
           musculus]
          Length = 608

 Score =  293 bits (751), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 164/376 (43%), Positives = 223/376 (59%), Gaps = 17/376 (4%)

Query: 2   PVFKAD--GKLGN--LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
           P FKA+   +L N  +E P +   +   E G  ++  E  +   D    ++  NM  SN 
Sbjct: 69  PQFKANRIDRLMNNHIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLISNR 126

Query: 58  ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           + + R +PD R  EC+   YP DLP AS+++ F+NE FS+L+RTVHS++ RTPA  L EI
Sbjct: 127 LGYHRDVPDTRNAECRRKSYPTDLPTASIVICFYNEAFSALLRTVHSVVDRTPAHLLHEI 186

Query: 118 ILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 176
           ILVDD S   DL  +L++YIQR+   KV++IRN +REGLIR R  GA  + GEV+VFLD+
Sbjct: 187 ILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDS 246

Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 236
           HCEV + WL PLLA I  D   +  PVID I   T      Y      RG F WG+ +K 
Sbjct: 247 HCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKW 302

Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
           + +P  E       + P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IW
Sbjct: 303 DLVPVSELGGPDGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIW 362

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
           MCGG +  +PCSR+GH++R   PY   +  D      +T+N  R+   W DE  + YF  
Sbjct: 363 MCGGKLFILPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYFSL 417

Query: 357 REPLAMFLDMGDISEQ 372
           R  L      G+ISE+
Sbjct: 418 RPDLKN-KSFGNISER 432


>gi|109068965|ref|XP_001105286.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           6 [Macaca mulatta]
 gi|355561195|gb|EHH17881.1| hypothetical protein EGK_14364 [Macaca mulatta]
          Length = 608

 Score =  293 bits (751), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 166/378 (43%), Positives = 225/378 (59%), Gaps = 21/378 (5%)

Query: 2   PVFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETS 55
           P FKA+     ++  ++ + E P EG   +         E  +   D    ++  NM  S
Sbjct: 69  PQFKAN----KIDDVIDSHVEDPEEGHLKFSSELGMIFNERDQELRDLGYQKHAFNMLIS 124

Query: 56  NHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
           N + + R +PD R   CK   YP DLP ASV++ F+NE FS+L+RTVHS+I RTPA  L 
Sbjct: 125 NRLGYRRNVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLH 184

Query: 116 EIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
           EIILVDD S   DL  +L++Y+Q++  GK+++IRNT+REGLIR R  GA  + GEV+VFL
Sbjct: 185 EIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFL 244

Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 234
           D+HCEV + WL PLLA I  DR  +  PVID I   T      Y      RG F WG+ +
Sbjct: 245 DSHCEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHF 300

Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
           K + +P  E  + +  + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+
Sbjct: 301 KWDLVPLSELGEAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFR 360

Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
           IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R+   W DE  + YF
Sbjct: 361 IWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYF 415

Query: 355 YTREPLAMFLDMGDISEQ 372
             R  L      G+ISE+
Sbjct: 416 SLRPDLKT-KSYGNISER 432


>gi|26352932|dbj|BAC40096.1| unnamed protein product [Mus musculus]
          Length = 608

 Score =  293 bits (751), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 164/376 (43%), Positives = 223/376 (59%), Gaps = 17/376 (4%)

Query: 2   PVFKAD--GKLGN--LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
           P FKA+   +L N  +E P +   +   E G  ++  E  +   D    ++  NM  SN 
Sbjct: 69  PQFKANRIDRLMNNHIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLISNR 126

Query: 58  ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           + + R +PD R  EC+   YP DLP AS+++ F+NE FS+L+RTVHS++ RTPA  L EI
Sbjct: 127 LGYHRDVPDTRNAECRRKSYPTDLPTASIVICFYNEAFSALLRTVHSVVDRTPAHLLHEI 186

Query: 118 ILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 176
           ILVDD S   DL  +L++YIQR+   KV++IRN +REGLIR R  GA  + GEV+VFLD+
Sbjct: 187 ILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDS 246

Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 236
           HCEV + WL PLLA I  D   +  PVID I   T      Y      RG F WG+ +K 
Sbjct: 247 HCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKW 302

Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
           + +P  E       + P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IW
Sbjct: 303 DLVPVSELGGPDGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIW 362

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
           MCGG +  +PCSR+GH++R   PY   +  D      +T+N  R+   W DE  + YF  
Sbjct: 363 MCGGKLFILPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYFSL 417

Query: 357 REPLAMFLDMGDISEQ 372
           R  L      G+ISE+
Sbjct: 418 RPDLKN-KSFGNISER 432


>gi|301759363|ref|XP_002915525.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Ailuropoda melanoleuca]
 gi|281339844|gb|EFB15428.1| hypothetical protein PANDA_003531 [Ailuropoda melanoleuca]
          Length = 608

 Score =  293 bits (751), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 162/377 (42%), Positives = 226/377 (59%), Gaps = 17/377 (4%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETSN 56
           V ++  K+  ++  ++ + E P +G   +         E  +   D    ++  NM  SN
Sbjct: 66  VLESQFKVNRIDDMIDSHVEDPEKGNMKFSSELGMIFNERDQELRDLGYQKHAFNMLISN 125

Query: 57  HISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 116
            + + R +PD R   CK   YP+DLP ASV++ F+NE  S+L+RTVHS++ RTPAQ L E
Sbjct: 126 RLGYHRDVPDTRNAACKDKSYPVDLPVASVVICFYNEALSALLRTVHSVLDRTPAQLLHE 185

Query: 117 IILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
           IILVDD S   DL  +LE+Y+Q++  GK+++IRNT+REGLIR R  GA  + GEV+VFLD
Sbjct: 186 IILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLD 245

Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 235
           +HCEV + WL PLLA I  D++ +  PVID I   T      Y      RG F WG+ +K
Sbjct: 246 SHCEVNVMWLQPLLAAIQQDQRTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFK 301

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
            + +P  E    +  + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+I
Sbjct: 302 WDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRI 361

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           WMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R+   W DE  + YF 
Sbjct: 362 WMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYFS 416

Query: 356 TREPLAMFLDMGDISEQ 372
            R  L      G+ISE+
Sbjct: 417 LRPDLRT-KSYGNISER 432


>gi|195377912|ref|XP_002047731.1| GJ13596 [Drosophila virilis]
 gi|194154889|gb|EDW70073.1| GJ13596 [Drosophila virilis]
          Length = 675

 Score =  293 bits (750), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 162/352 (46%), Positives = 216/352 (61%), Gaps = 14/352 (3%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+A  L E+ R        E G N   S+ IS +R++PD+R +EC+   Y   LP
Sbjct: 152 GIGEHGEAAKLDESLRDKEQVLSLENGFNALLSDSISVNRSLPDIRHKECRKKQYLSKLP 211

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             SVI++F+NE  S LMR+VHS+I R+P + L+EIILVDDFS +A L + LEDY+     
Sbjct: 212 NVSVIIIFYNEYLSVLMRSVHSLINRSPPELLKEIILVDDFSDRAPLFKPLEDYVAEHFS 271

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VR++R  +R GLI  RS GA+ +  +V++FLD+H E   NWLPPLL PI  +++    P
Sbjct: 272 MVRIVRLPQRTGLIGARSAGARNATADVLIFLDSHVEANYNWLPPLLDPIAQNKRAAVCP 331

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTHAG 261
            ID ID+  + +R+    D   RG F+W   YK    LPE      K+ S+P+KSP  AG
Sbjct: 332 FIDVIDHSNFNYRA---QDEGARGAFDWDFFYKRLPLLPE----DLKHPSDPFKSPVMAG 384

Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
           GLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG +   PCSR+GH+YR   P  
Sbjct: 385 GLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRVGHIYRG--PRQ 442

Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
             K  +   G  +  NYKRV E W DE +K Y Y   + +   +D GD++ Q
Sbjct: 443 GVK--NPRSGDYLHKNYKRVAEVWMDE-YKNYLYNHGDGIYDNVDPGDLTAQ 491


>gi|355748155|gb|EHH52652.1| hypothetical protein EGM_13122 [Macaca fascicularis]
          Length = 608

 Score =  293 bits (750), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 166/378 (43%), Positives = 225/378 (59%), Gaps = 21/378 (5%)

Query: 2   PVFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETS 55
           P FKA+     ++  ++ + E P EG   +         E  +   D    ++  NM  S
Sbjct: 69  PQFKAN----KIDDVIDSHVEDPEEGHLKFSSELGMIFNERDQELRDLGYQKHAFNMLIS 124

Query: 56  NHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
           N + + R +PD R   CK   YP DLP ASV++ F+NE FS+L+RTVHS+I RTPA  L 
Sbjct: 125 NRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLH 184

Query: 116 EIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
           EIILVDD S   DL  +L++Y+Q++  GK+++IRNT+REGLIR R  GA  + GEV+VFL
Sbjct: 185 EIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFL 244

Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 234
           D+HCEV + WL PLLA I  DR  +  PVID I   T      Y      RG F WG+ +
Sbjct: 245 DSHCEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHF 300

Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
           K + +P  E  + +  + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+
Sbjct: 301 KWDLVPLSELGEAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFR 360

Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
           IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R+   W DE  + YF
Sbjct: 361 IWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYF 415

Query: 355 YTREPLAMFLDMGDISEQ 372
             R  L      G+ISE+
Sbjct: 416 SLRPDLKT-KSYGNISER 432


>gi|62859717|ref|NP_001017277.1| polypeptide N-acetylgalactosaminyltransferase 13 [Xenopus
           (Silurana) tropicalis]
 gi|89267464|emb|CAJ81616.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13)
           [Xenopus (Silurana) tropicalis]
          Length = 498

 Score =  293 bits (750), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 145/323 (44%), Positives = 212/323 (65%), Gaps = 9/323 (2%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N+  S+ I+ +R++PD+R+E CK   YP +LP  S+++VFHNE +S+L+RTVHS+I R+P
Sbjct: 11  NLMASDLIALNRSLPDVRLEGCKTKVYPDELPNTSIVIVFHNEAWSTLLRTVHSVINRSP 70

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
            + + EIILVDD S +  L   LE+Y++     V+++R  +R GLIR R RGA  ++G++
Sbjct: 71  HRLISEIILVDDSSERDFLKSPLENYVKHLEVPVKILRMEQRSGLIRARLRGANVAKGQI 130

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           I FLDAHCE  + WL PLLA I  DRK +  P+ID I   T+E+ +    D  Y G F W
Sbjct: 131 ITFLDAHCECTIGWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNW 187

Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            + ++   +P+RE  +RK + + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN 
Sbjct: 188 KLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEELGTYDSGMDIWGGENL 247

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF+IW CGGS+E V CS +GHV+R   PY F        G +I  N +R+ E W D+ 
Sbjct: 248 EMSFRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDD- 302

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
            K +FY   P  + +D GD+SE+
Sbjct: 303 FKDFFYIISPGVVKVDYGDVSER 325


>gi|148671130|gb|EDL03077.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11, isoform CRA_a [Mus
           musculus]
          Length = 529

 Score =  293 bits (750), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 153/332 (46%), Positives = 204/332 (61%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R  EC+   YP DLP AS+++ F+NE FS+L+RT
Sbjct: 32  DLGYQKHAFNMLISNRLGYHRDVPDTRNAECRRKSYPTDLPTASIVICFYNEAFSALLRT 91

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS++ RTPA  L EIILVDD S   DL  +L++YIQR+   KV++IRN +REGLIR R 
Sbjct: 92  VHSVVDRTPAHLLHEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRM 151

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  D   +  PVID I   T      Y  
Sbjct: 152 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSS 207

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E       + P +SPT AGGLFAM+R +F +LG YD G
Sbjct: 208 SPVVRGGFNWGLHFKWDLVPVSELGGPDGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSG 267

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 268 MDIWGGENLEISFRIWMCGGKLFILPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 322

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 323 LAHVWLDEYKEQYFSLRPDLKN-KSFGNISER 353


>gi|328712307|ref|XP_001942933.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           10-like [Acyrthosiphon pisum]
          Length = 592

 Score =  293 bits (750), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 154/358 (43%), Positives = 217/358 (60%), Gaps = 15/358 (4%)

Query: 17  LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
           +E  + G GE G +  L    R   D      G N   S+ IS +R+IPD+R + C++  
Sbjct: 77  IEKQRTGIGEQGVSASLSSHNRHKYDELYKVNGFNALLSDSISVNRSIPDIRHKLCRFKK 136

Query: 77  YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
           Y   LP  SV++ FHNE FS+L+RTV+S++ R+P   L+EIILVDD S+K  L + L+++
Sbjct: 137 YNSKLPTVSVVIPFHNEHFSTLLRTVYSVLNRSPKILLKEIILVDDSSTKTSLKRPLDNF 196

Query: 137 I-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           +       V++I   +R+GLIR R  GA+++  E+++FLD+H E   NWLPPLL PI  D
Sbjct: 197 LSNNLADTVQIIHLKKRQGLIRARLAGARKATSEILIFLDSHTEANANWLPPLLEPITED 256

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKKRKYNSEPY 254
            +    P ID I ++T+E+R+    D   RG F+W   YK   L PE       Y ++P+
Sbjct: 257 YRTCVCPFIDVIAFETFEYRA---QDEGARGAFDWEFFYKRLPLLPEDLL----YPTKPF 309

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           +SP  AGGLFA+   +F ELGGYDPGL +WGGE +ELSFKIW CGG+I   PCSR+GH+Y
Sbjct: 310 RSPVMAGGLFAISAKWFWELGGYDPGLDIWGGEQYELSFKIWQCGGTILDAPCSRVGHIY 369

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           R F P+    + D      +  NY+RV E W DE +  Y Y R P    ++ GDI++Q
Sbjct: 370 RKFAPFPNPGIGD-----FVGKNYRRVAEVWMDE-YAEYLYLRRPHYRNINTGDITKQ 421


>gi|354478256|ref|XP_003501331.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
           [Cricetulus griseus]
 gi|344235668|gb|EGV91771.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Cricetulus
           griseus]
          Length = 608

 Score =  293 bits (750), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 153/332 (46%), Positives = 204/332 (61%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R  +C+   YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAKCRGKSYPADLPTASVVICFYNEAFSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS++ RTPA  L EIILVDD S   DL  +L++YIQR+   KV++IRN +REGLIR R 
Sbjct: 171 VHSVVDRTPAHLLHEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNRKREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  D   +  PVID I   T      Y  
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSS 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E       + P +SPT AGGLFAM+R +F +LG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPVSELGGADGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-KSFGNISER 432


>gi|268572569|ref|XP_002641355.1| C. briggsae CBR-GLY-9 protein [Caenorhabditis briggsae]
          Length = 579

 Score =  293 bits (749), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 153/362 (42%), Positives = 225/362 (62%), Gaps = 13/362 (3%)

Query: 15  PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK- 73
           P     +EGPGE GK   L       G A + ++ MN+  S+ IS DR +PD R++ CK 
Sbjct: 66  PDYSQPREGPGEKGKPVVLSGKEAELGHADMKKWFMNVHASDKISLDRDVPDPRIQACKD 125

Query: 74  -YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
             +DY   LPK SVI++F +E ++ L+RTVHS+I R+P + L+EIIL+DD S + +L + 
Sbjct: 126 IKYDYAT-LPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQEIILLDDNSKRQELQEP 184

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           L+++I+RF GKVRLIR   R GLIR +  GA+E+ G++IVFLD+HCE    WL P++  I
Sbjct: 185 LDEHIKRFGGKVRLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEPIVQRI 244

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 252
             +R  +  P+ID I   T  +   +       G F W + +    LP+ E K+R   ++
Sbjct: 245 SDERTAIVCPMIDSISDSTLAYHGDWSLS---VGGFSWALHFTWEGLPDEELKRRTKVTD 301

Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
             +SPT AGGL A +R +F E+GGYD  + +WGGEN E+SF+ WMCGGSIE++PCS +GH
Sbjct: 302 YIRSPTMAGGLLAANREYFFEVGGYDEEMDIWGGENLEISFRNWMCGGSIEFIPCSHVGH 361

Query: 313 VYRSFMPYNF-GKLADR-VKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           ++R+  PYN  G+  ++ V G     N KR+ E W D+  + Y+  RE L    D+GD++
Sbjct: 362 IFRAGHPYNMTGRNNNKDVHGT----NSKRLAEVWMDDYKRLYYMHREDLRT-KDVGDLT 416

Query: 371 EQ 372
            +
Sbjct: 417 AR 418


>gi|380786043|gb|AFE64897.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
 gi|383411811|gb|AFH29119.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
 gi|384942402|gb|AFI34806.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
          Length = 608

 Score =  293 bits (749), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 166/378 (43%), Positives = 225/378 (59%), Gaps = 21/378 (5%)

Query: 2   PVFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETS 55
           P FKA+     ++  ++ + E P EG   +         E  +   D    ++  NM  S
Sbjct: 69  PQFKAN----KIDDVIDSHVEDPEEGHLKFSSELGMIFNERDQELRDLGYQKHAFNMLIS 124

Query: 56  NHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
           N + + R +PD R   CK   YP DLP ASV++ F+NE FS+L+RTVHS+I RTPA  L 
Sbjct: 125 NRLGYRRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLH 184

Query: 116 EIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
           EIILVDD S   DL  +L++Y+Q++  GK+++IRNT+REGLIR R  GA  + GEV+VFL
Sbjct: 185 EIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFL 244

Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 234
           D+HCEV + WL PLLA I  DR  +  PVID I   T      Y      RG F WG+ +
Sbjct: 245 DSHCEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHF 300

Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
           K + +P  E  + +  + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+
Sbjct: 301 KWDLVPLSELGEAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFR 360

Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
           IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R+   W DE  + YF
Sbjct: 361 IWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYF 415

Query: 355 YTREPLAMFLDMGDISEQ 372
             R  L      G+ISE+
Sbjct: 416 SLRPDLKT-KSYGNISER 432


>gi|327281387|ref|XP_003225430.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 3 [Anolis carolinensis]
          Length = 498

 Score =  293 bits (749), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 148/321 (46%), Positives = 209/321 (65%), Gaps = 9/321 (2%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N+  S+ I+ +R++PD+R+E CK   YP +LP  SV++VFHNE +S+L+RT++S+I R P
Sbjct: 11  NLMASDMIALNRSLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTIYSVINRAP 70

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              L EIILVDD S +  L   LE+Y++     V+++R  +R GLIR R RGA  S+G+V
Sbjct: 71  HYLLAEIILVDDASERDFLKVPLENYVKTLQVPVKIMRMEQRSGLIRARLRGAAASKGQV 130

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           I FLDAHCE  L WL PLLA I  DRKI+  P+ID I   T+E+ +    D  Y G F W
Sbjct: 131 ITFLDAHCECTLGWLEPLLARIKEDRKIVVCPIIDVISDDTFEYMA--GSDMTYGG-FNW 187

Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            + ++   +P+RE  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN 
Sbjct: 188 KLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENL 247

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF+IW CGGS+E V CS +GHV+R   PY F        G +I  N +R+ E W DE 
Sbjct: 248 EMSFRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE- 302

Query: 350 HKAYFYTREPLAMFLDMGDIS 370
            K +FY   P  + +D GD++
Sbjct: 303 FKDFFYIISPGVVKVDYGDVT 323


>gi|56554527|pdb|1XHB|A Chain A, The Crystal Structure Of Udp-Galnac: Polypeptide Alpha-N-
           Acetylgalactosaminyltransferase-T1
          Length = 472

 Score =  293 bits (749), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 147/319 (46%), Positives = 206/319 (64%), Gaps = 9/319 (2%)

Query: 55  SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 114
           S  I+ +R++PD+R+E CK   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   +
Sbjct: 2   SEMIALNRSLPDVRLEGCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMI 61

Query: 115 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
           EEI+LVDD S +  L + LE Y+++    V +IR  +R GLIR R +GA  SRG+VI FL
Sbjct: 62  EEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVITFL 121

Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 234
           DAHCE    WL PLLA I  DR+ +  P+ID I   T+E+ +    D  Y G F W + +
Sbjct: 122 DAHCECTAGWLEPLLARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNF 178

Query: 235 KENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSF 293
           +   +P+RE  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF
Sbjct: 179 RWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISF 238

Query: 294 KIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAY 353
           +IW CGG++E V CS +GHV+R   PY F        G +I  N +R+ E W DE  K +
Sbjct: 239 RIWQCGGTLEIVTCSHVGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNF 293

Query: 354 FYTREPLAMFLDMGDISEQ 372
           FY   P    +D GDIS +
Sbjct: 294 FYIISPGVTKVDYGDISSR 312


>gi|71994065|ref|NP_001022876.1| Protein GLY-9, isoform a [Caenorhabditis elegans]
 gi|51316113|sp|Q9U2C4.1|GALT9_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 9;
           AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 9; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 9; Short=pp-GaNTase 9
 gi|6018409|emb|CAB57897.1| Protein GLY-9, isoform a [Caenorhabditis elegans]
          Length = 579

 Score =  293 bits (749), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 151/356 (42%), Positives = 223/356 (62%), Gaps = 13/356 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK--YWDYP 78
           +EGPGE GK   L       G A + ++ MN+  S+ IS DR +PD R++ CK   +DY 
Sbjct: 72  REGPGEKGKPVVLTGKDAELGQADMKKWFMNVHASDKISLDRDVPDPRIQACKDIKYDYA 131

Query: 79  LDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ 138
             LPK SVI++F +E ++ L+RTVHS+I R+P + L+E+IL+DD S + +L + L+++I+
Sbjct: 132 A-LPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQEVILLDDNSKRQELQEPLDEHIK 190

Query: 139 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
           RF GKVRLIR   R GLIR +  GA+E+ G++IVFLD+HCE    WL P++  I  +R  
Sbjct: 191 RFGGKVRLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEPIVQRISDERTA 250

Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPT 258
           +  P+ID I   T  +   +       G F W + +    L E E K+R   ++  +SPT
Sbjct: 251 IVCPMIDSISDNTLAYHGDWSLS---TGGFSWALHFTWEGLSEEEQKRRTKPTDYIRSPT 307

Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
            AGGL A +R +F E+GGYD  + +WGGEN E+SF+ WMCGGSIE++PCS +GH++R+  
Sbjct: 308 MAGGLLAANREYFFEVGGYDEEMDIWGGENLEISFRAWMCGGSIEFIPCSHVGHIFRAGH 367

Query: 319 PYNF-GKLADR-VKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           PYN  G+  ++ V G     N KR+ E W D+  + Y+  RE L    D+GD++ +
Sbjct: 368 PYNMTGRNNNKDVHGT----NSKRLAEVWMDDYKRLYYMHREDLRT-KDVGDLTAR 418


>gi|156364641|ref|XP_001626455.1| predicted protein [Nematostella vectensis]
 gi|156213331|gb|EDO34355.1| predicted protein [Nematostella vectensis]
          Length = 512

 Score =  292 bits (748), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 148/325 (45%), Positives = 208/325 (64%), Gaps = 11/325 (3%)

Query: 48  YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
           +G N+  SN +S  RTI D R E C+   YP +LP AS+++ F+NE ++ L+RT+HS++ 
Sbjct: 21  HGFNLLISNRLSLHRTIKDTRHELCRGKTYPKNLPVASIVICFYNEAWTILLRTIHSVLD 80

Query: 108 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 167
           RTP Q+L EIILVDDFS+  +L  KL+ Y+     K+R++RN +REGLIR R  GA+ + 
Sbjct: 81  RTPHQFLHEIILVDDFSNMLELKSKLDRYLSTMP-KIRIVRNNKREGLIRGRIIGAEAAT 139

Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
           G+V+VFLD+HCEV +NWL PLL  I+ D+K +  PVID I   T+E+ S        RG 
Sbjct: 140 GQVLVFLDSHCEVNINWLQPLLQHIHDDQKAVACPVIDVISSDTFEYSS----SPMVRGG 195

Query: 228 FEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
           F WG+ +    +P     K +   +P +SPT AGGLFA+DR +F +LG YD G+ +WG E
Sbjct: 196 FNWGLHFTWEPIPPSLLVKPEDYVKPIRSPTMAGGLFAVDREYFTQLGKYDSGMDIWGAE 255

Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
           N E+SF+IWMCGGS++ +PCSR+GH++R F PY         KG  ++ N  R+ E W D
Sbjct: 256 NLEISFRIWMCGGSLDILPCSRVGHLFRRFRPY-----GSDSKGDTMSRNSMRLAEVWLD 310

Query: 348 EKHKAYFYTREPLAMFLDMGDISEQ 372
             +K YFY           GDIS++
Sbjct: 311 -GYKKYFYQIRHDLEGKKFGDISQR 334


>gi|156392174|ref|XP_001635924.1| predicted protein [Nematostella vectensis]
 gi|156223022|gb|EDO43861.1| predicted protein [Nematostella vectensis]
          Length = 415

 Score =  292 bits (748), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 142/337 (42%), Positives = 218/337 (64%), Gaps = 12/337 (3%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
           G+ G+A  +P+  +   +     +  N+  S+ +S  R +PD R + CK   YPL LPK+
Sbjct: 1   GDMGEAVSVPKRLKEKEEEGYELHSFNLVASDMMSLYRRLPDYRNDACKAKKYPLHLPKS 60

Query: 85  SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 144
           S+I+ FHNE +S+L+RTVHS+I RTP + LEEI+L+DD S++ +L +KLE+Y+ +    V
Sbjct: 61  SIIICFHNEAWSTLLRTVHSVINRTPPRLLEEILLIDDASNRDELKEKLEEYVAKLK-VV 119

Query: 145 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 204
           R+IR ++R+GLIR R +GA  ++G ++ FLDAHCE    WL PL A I  +   + +PVI
Sbjct: 120 RIIRLSKRQGLIRARLKGAAAAKGSILTFLDAHCECSKGWLEPLAAKIAENSSNVVMPVI 179

Query: 205 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLF 264
           D I   T+ + +V EP H  RG+F W + +    +P+ E ++RK  ++  ++P  AGGLF
Sbjct: 180 DEISDTTFYYHAVPEPFH--RGVFRWRLEFGWKPVPQYEMERRKDEADGIRTPVMAGGLF 237

Query: 265 AMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF-- 322
           ++D+ +F ++G YD G+ +WGGEN E+SF+IWMCGG+IE +PCSR+GHV+R   PY+F  
Sbjct: 238 SIDKNYFEKIGTYDTGMDIWGGENLEISFRIWMCGGAIEMLPCSRVGHVFRPRFPYSFPA 297

Query: 323 --GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
             G   D     +++ N  RV + W DE  K ++  R
Sbjct: 298 RPGHNTD-----VVSNNLMRVADVWMDEYKKHFYNIR 329


>gi|402865473|ref|XP_003896947.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Papio
           anubis]
          Length = 608

 Score =  292 bits (748), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 166/378 (43%), Positives = 224/378 (59%), Gaps = 21/378 (5%)

Query: 2   PVFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETS 55
           P FKA+     ++  ++ + E P EG   +         E  +   D    ++  NM  S
Sbjct: 69  PQFKAN----KIDDVIDSHVEDPEEGHLKFSSELGMIFNERDQELRDLGYQKHAFNMLIS 124

Query: 56  NHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
           N + + R +PD R   CK   YP DLP ASV++ F+NE FS+L+RTVHS+I RTPA  L 
Sbjct: 125 NRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLH 184

Query: 116 EIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
           EIILVDD S   DL  +L++Y+Q++  GK+++IRNT+REGLIR R  GA  + GEV+VFL
Sbjct: 185 EIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFL 244

Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 234
           D+HCEV + WL PLLA I  DR  +  PVID I   T      Y      RG F WG+ +
Sbjct: 245 DSHCEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHF 300

Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
           K + +P  E    +  + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+
Sbjct: 301 KWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFR 360

Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
           IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R+   W DE  + YF
Sbjct: 361 IWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYF 415

Query: 355 YTREPLAMFLDMGDISEQ 372
             R  L      G+ISE+
Sbjct: 416 SLRPDLKT-KSYGNISER 432


>gi|426358553|ref|XP_004046573.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           1 [Gorilla gorilla gorilla]
 gi|426358555|ref|XP_004046574.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           2 [Gorilla gorilla gorilla]
          Length = 608

 Score =  292 bits (748), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 157/332 (47%), Positives = 207/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R   CK   YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS+I RTPA  L EIILVDD S   DL  +L++Y+Q++  GK+++IRNT+REGLIR R 
Sbjct: 171 VHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  DR  +  PVID I   T      Y  
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E    +  + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 432


>gi|158300139|ref|XP_320141.4| AGAP012414-PA [Anopheles gambiae str. PEST]
 gi|157013013|gb|EAA00190.4| AGAP012414-PA [Anopheles gambiae str. PEST]
          Length = 596

 Score =  292 bits (748), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 155/356 (43%), Positives = 216/356 (60%), Gaps = 13/356 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GKA  L ++     D    + G N   S+ IS +R++PD+R   C+   Y
Sbjct: 82  EAKRSGIGEHGKAGQLDKSEHEMKDKLFKKNGFNAVLSDKISLNRSLPDIRHRGCRKKQY 141

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             +LP  SV++ F+NE +S+L+RT  S++ R+P + + EIILVDD S+K  L Q+L++Y+
Sbjct: 142 LSELPTVSVVVPFYNEHWSTLLRTASSVLLRSPPELIAEIILVDDCSTKEFLKQQLDEYV 201

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
                KV+++R  ER GLI  R  GAK +  +V++FLD+H E  +NWLPPLL PI  D +
Sbjct: 202 TENMPKVKVVRLPERSGLITARLAGAKIATADVLIFLDSHTEANVNWLPPLLEPIAEDYR 261

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
               P ID ID+ T+E+R+    D   RG F+W   YK   L  R+ +     +EP++SP
Sbjct: 262 TCVCPFIDVIDWDTFEYRA---QDEGARGAFDWKFFYKRLPLLPRDLQN---PTEPFESP 315

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+   FF E+GGYD GL +WGGE +ELSFKIW CGG +   PCSR+GH+YR +
Sbjct: 316 VMAGGLFAISAKFFWEIGGYDEGLDIWGGEQYELSFKIWQCGGKMYDAPCSRVGHIYRGY 375

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMGDISEQ 372
            P+   +  D      +T NYKRV E W DE +K Y Y R+       D+GDIS Q
Sbjct: 376 APFGNPRKKD-----FLTRNYKRVAEVWMDE-YKEYLYMRDRKKYENTDVGDISRQ 425


>gi|426358557|ref|XP_004046575.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           3 [Gorilla gorilla gorilla]
          Length = 527

 Score =  292 bits (748), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 157/332 (47%), Positives = 207/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R   CK   YP DLP ASV++ F+NE FS+L+RT
Sbjct: 30  DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRT 89

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS+I RTPA  L EIILVDD S   DL  +L++Y+Q++  GK+++IRNT+REGLIR R 
Sbjct: 90  VHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 149

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  DR  +  PVID I   T      Y  
Sbjct: 150 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 205

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E    +  + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 206 SPVVRGGFNWGLHFKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSG 265

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 266 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 320

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 321 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 351


>gi|402592820|gb|EJW86747.1| hypothetical protein WUBG_02341 [Wuchereria bancrofti]
          Length = 584

 Score =  292 bits (748), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 155/368 (42%), Positives = 229/368 (62%), Gaps = 12/368 (3%)

Query: 9   KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           +LG L   L   + GPGE G A  +  + +        E   ++  S+ IS +R +PD R
Sbjct: 70  ELGILLKSLNFERNGPGEMGSAVIIDPSQQEERTRKFKENQFDVMASDLISINRALPDYR 129

Query: 69  MEECKYWDYPLD---LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
             +C+      D   LP  S+I+VFHNE +S+L+RT+HS+I R+P   ++E+IL+DD S+
Sbjct: 130 SSKCREAARKYDVTSLPMVSIIIVFHNEAWSTLLRTIHSVINRSPLHLIKEVILIDDLSN 189

Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
           +  L + L+ YI+RF+    LI   ER GLIR R +GAK ++G+V++FLDAH EV   WL
Sbjct: 190 RTYLRKPLDTYIKRFSLPFHLIHLPERSGLIRARLQGAKVAKGKVLLFLDAHVEVTEGWL 249

Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
            PLL  + +DRK +  P+ID I  + +E+  +   D  + G F W + ++   +P RE +
Sbjct: 250 EPLLDRVSTDRKRVVAPIIDVISDENFEY--ITASDVTWGG-FNWHLNFRWYPVPMREME 306

Query: 246 KRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
           +R ++ S P ++PT AGGLFA+DR FF ++G YD G+ VWGGEN E+SF++WMCGGS+E 
Sbjct: 307 RRNHDRSVPLQTPTIAGGLFAIDRQFFYDIGSYDEGMEVWGGENLEISFRVWMCGGSLEI 366

Query: 305 VPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
            PCSR+GHV+R   PY+F     RV    I +N  R  E W DE +K  FY+  P A  +
Sbjct: 367 HPCSRVGHVFRKHTPYSFPGGTARV----IHHNTARTAEVWMDE-YKDIFYSMVPAARNV 421

Query: 365 DMGDISEQ 372
           D+GD++E+
Sbjct: 422 DVGDLTER 429


>gi|341889853|gb|EGT45788.1| hypothetical protein CAEBREN_10062 [Caenorhabditis brenneri]
          Length = 597

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 164/384 (42%), Positives = 229/384 (59%), Gaps = 36/384 (9%)

Query: 11  GNLEPP---LEP----YKEG----PGEGGKAYHLPEAYRAAGDASLGEYGM-----NMET 54
           GNL  P   ++P    YK+G     GE GKA  + ++   +   ++ + GM     N   
Sbjct: 96  GNLAKPKFMVDPNDPIYKKGDTSQAGELGKAVVVDKSKLTSEQKAIYDKGMLNNAFNQYA 155

Query: 55  SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 114
           S+ IS  RT+P     ECK   Y  +LP+ SVI+ FHNE +S L+RTVHS+++RTP   L
Sbjct: 156 SDMISVHRTLPTNIDAECKTEKYNENLPRTSVIVCFHNEAWSVLLRTVHSVLERTPDHLL 215

Query: 115 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
           EEI+LVDDFS      + LE+Y+ +F GKV+++R  +REGLIR R RGA  + GEV+ +L
Sbjct: 216 EEIVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAIATGEVLTYL 275

Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIF 228
           D+HCE    W+ PLL  I  D   +  PVID ID  T+E+       HH +      G F
Sbjct: 276 DSHCECMEGWIEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGF 328

Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
           +WG+ +  + +PER+ K R    +P +SPT AGGLF++D+ +F +LG YDPG  +WGGEN
Sbjct: 329 DWGLQFNWHSIPERDRKNRTRAIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGEN 388

Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
            ELSFKIWMCGG++E VPCS +GHV+R   PY +     R    ++  N  R+ E W D+
Sbjct: 389 LELSFKIWMCGGTLEIVPCSHVGHVFRKRSPYKW-----RTGVNVLKRNSIRLAEVWLDD 443

Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
            +K Y+Y R       D GD+S +
Sbjct: 444 -YKTYYYERIN-NQLGDFGDVSAR 465


>gi|170572320|ref|XP_001892064.1| glycosyl transferase, group 2 family protein [Brugia malayi]
 gi|158602953|gb|EDP39125.1| glycosyl transferase, group 2 family protein [Brugia malayi]
          Length = 576

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 171/378 (45%), Positives = 227/378 (60%), Gaps = 23/378 (6%)

Query: 5   KADGKLGNLEPPLEPYKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGMNMETS 55
           K +  L N + P+  YK G    PGEGGKA       L    R   D    +   N   S
Sbjct: 6   KPNKALFNPDSPI--YKSGDENQPGEGGKAVVIDRNKLSLDERKIYDDGFTKNAFNQYIS 63

Query: 56  NHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
           + IS  R++P    EECK   Y  DLP  SVI+ FHNE +S L+RTVHS+++RTP   L 
Sbjct: 64  DMISIHRSLPSYIDEECKNEKYTSDLPNTSVIICFHNEAWSVLLRTVHSVLERTPENLLA 123

Query: 116 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
           E+ILVDDFS  A L   LE Y+++F+ KVR++R  +REGLIR R RGA  S+G VI +LD
Sbjct: 124 ELILVDDFSDMAHLKADLEIYMRQFS-KVRILRLEKREGLIRARIRGAAISKGSVITYLD 182

Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-GIFEWGMLY 234
           +HCE    W+ PLL  I  + K +  PVID ID  T+E+   Y   +    G F+W + +
Sbjct: 183 SHCECLEGWVEPLLDRIKRNPKTVVCPVIDVIDDNTFEYH--YSKAYFTNVGGFDWSLQF 240

Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
             + +PE++ K R+ + +P KSPT AGGLF++DR FF ELG YDPGL +WGGEN ELSFK
Sbjct: 241 NWHAIPEKDRKGRR-DIDPVKSPTMAGGLFSIDRTFFEELGSYDPGLDIWGGENLELSFK 299

Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
           IWMCGG +E VPCS +GH++R   PY +     R    ++  N  R+ E W DE +K Y+
Sbjct: 300 IWMCGGILEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKRNSVRLAEVWMDE-YKKYY 353

Query: 355 YTREPLAMFLDMGDISEQ 372
           Y R    +  D GD+S +
Sbjct: 354 YERINNNLG-DFGDVSSR 370


>gi|71993517|ref|NP_001022852.1| Protein GLY-5, isoform c [Caenorhabditis elegans]
 gi|14530627|emb|CAC42369.1| Protein GLY-5, isoform c [Caenorhabditis elegans]
          Length = 624

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 165/384 (42%), Positives = 225/384 (58%), Gaps = 36/384 (9%)

Query: 11  GNLEPP---LEP----YKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGMNMET 54
           GNL  P   ++P    YK+G     GE GKA       L    +A  D  +     N   
Sbjct: 88  GNLAKPKFMVDPNDPIYKKGDAAQAGELGKAVVVDKTKLSTEEKAKYDKGMLNNAFNQYA 147

Query: 55  SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 114
           S+ IS  RT+P     ECK   Y  +LP+ SVI+ FHNE +S L+RTVHS+++RTP   L
Sbjct: 148 SDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLL 207

Query: 115 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
           EE++LVDDFS      + LE+Y+ +F GKV+++R  +REGLIR R RGA  + GEV+ +L
Sbjct: 208 EEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYL 267

Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIF 228
           D+HCE    W+ PLL  I  D   +  PVID ID  T+E+       HH +      G F
Sbjct: 268 DSHCECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGF 320

Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
           +WG+ +  + +PER+ K R    +P +SPT AGGLF++D+ +F +LG YDPG  +WGGEN
Sbjct: 321 DWGLQFNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGEN 380

Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
            ELSFKIWMCGG++E VPCS +GHV+R   PY +     R    ++  N  R+ E W D+
Sbjct: 381 LELSFKIWMCGGTLEIVPCSHVGHVFRKRSPYKW-----RTGVNVLKRNSIRLAEVWLDD 435

Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
            +K Y+Y R       D GDIS +
Sbjct: 436 -YKTYYYERIN-NQLGDFGDISSR 457


>gi|410905319|ref|XP_003966139.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Takifugu rubripes]
          Length = 557

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 155/360 (43%), Positives = 216/360 (60%), Gaps = 9/360 (2%)

Query: 14  EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
           E  L   ++GPGEGGK   +P+  +            N+  S  I+ +R++PD+R+E CK
Sbjct: 46  EDTLTRPRDGPGEGGKPVVIPKENQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCK 105

Query: 74  YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKL 133
              YP +LP+ SV++VFHNE +S+L+RTVHS+I R+P   LEEIILVDD S +  L + L
Sbjct: 106 NKLYPDNLPRTSVVIVFHNEAWSTLLRTVHSVIDRSPHTLLEEIILVDDASERDFLKRPL 165

Query: 134 EDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIY 193
           E Y++R    VR++R  +R GLIR R +GA  S G+VI FLDAHCE    WL PLLA I 
Sbjct: 166 EQYVRRLEVPVRVVRMDQRSGLIRARLKGASLSTGQVITFLDAHCECTTGWLEPLLARIK 225

Query: 194 SDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SE 252
            DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + + 
Sbjct: 226 KDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTL 282

Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
           P +    AGG     R +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +GH
Sbjct: 283 PVRWVRCAGGXXXXXRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 342

Query: 313 VYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           V+R   PY F        G +I  N +R+ E W DE  K +FY   P    +D GDI+ +
Sbjct: 343 VFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDIATR 397


>gi|71993511|ref|NP_001022850.1| Protein GLY-5, isoform a [Caenorhabditis elegans]
 gi|51316068|sp|Q95ZJ1.2|GALT5_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
           Short=pp-GaNTase 5; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 5; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 5
 gi|5824785|emb|CAB54435.1| Protein GLY-5, isoform a [Caenorhabditis elegans]
          Length = 626

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 165/384 (42%), Positives = 225/384 (58%), Gaps = 36/384 (9%)

Query: 11  GNLEPP---LEP----YKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGMNMET 54
           GNL  P   ++P    YK+G     GE GKA       L    +A  D  +     N   
Sbjct: 88  GNLAKPKFMVDPNDPIYKKGDAAQAGELGKAVVVDKTKLSTEEKAKYDKGMLNNAFNQYA 147

Query: 55  SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 114
           S+ IS  RT+P     ECK   Y  +LP+ SVI+ FHNE +S L+RTVHS+++RTP   L
Sbjct: 148 SDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLL 207

Query: 115 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
           EE++LVDDFS      + LE+Y+ +F GKV+++R  +REGLIR R RGA  + GEV+ +L
Sbjct: 208 EEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYL 267

Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIF 228
           D+HCE    W+ PLL  I  D   +  PVID ID  T+E+       HH +      G F
Sbjct: 268 DSHCECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGF 320

Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
           +WG+ +  + +PER+ K R    +P +SPT AGGLF++D+ +F +LG YDPG  +WGGEN
Sbjct: 321 DWGLQFNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGEN 380

Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
            ELSFKIWMCGG++E VPCS +GHV+R   PY +     R    ++  N  R+ E W D+
Sbjct: 381 LELSFKIWMCGGTLEIVPCSHVGHVFRKRSPYKW-----RTGVNVLKRNSIRLAEVWLDD 435

Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
            +K Y+Y R       D GDIS +
Sbjct: 436 -YKTYYYERIN-NQLGDFGDISSR 457


>gi|332870119|ref|XP_003318977.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Pan
           troglodytes]
          Length = 527

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 156/332 (46%), Positives = 207/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R   CK   YP DLP AS+++ F+NE FS+L+RT
Sbjct: 30  DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPDLPAASIVICFYNEAFSALLRT 89

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS+I RTPA  L EIILVDD S   DL  +L++Y+Q++  GK+++IRNT+REGLIR R 
Sbjct: 90  VHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 149

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  DR  +  PVID I   T      Y  
Sbjct: 150 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 205

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E    +  + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 206 SPVVRGGFNWGLHFKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSG 265

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 266 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 320

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 321 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 351


>gi|3047195|gb|AAC13673.1| GLY5c [Caenorhabditis elegans]
          Length = 624

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 165/384 (42%), Positives = 225/384 (58%), Gaps = 36/384 (9%)

Query: 11  GNLEPP---LEP----YKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGMNMET 54
           GNL  P   ++P    YK+G     GE GKA       L    +A  D  +     N   
Sbjct: 88  GNLAKPKFMVDPNDPIYKKGDAAQAGELGKAVVVDKTKLSTEEKAKYDKGMLNNAFNQYA 147

Query: 55  SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 114
           S+ IS  RT+P     ECK   Y  +LP+ SVI+ FHNE +S L+RTVHS+++RTP   L
Sbjct: 148 SDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLL 207

Query: 115 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
           EE++LVDDFS      + LE+Y+ +F GKV+++R  +REGLIR R RGA  + GEV+ +L
Sbjct: 208 EEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYL 267

Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIF 228
           D+HCE    W+ PLL  I  D   +  PVID ID  T+E+       HH +      G F
Sbjct: 268 DSHCECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGF 320

Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
           +WG+ +  + +PER+ K R    +P +SPT AGGLF++D+ +F +LG YDPG  +WGGEN
Sbjct: 321 DWGLQFNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGEN 380

Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
            ELSFKIWMCGG++E VPCS +GHV+R   PY +     R    ++  N  R+ E W D+
Sbjct: 381 LELSFKIWMCGGTLEIVPCSHVGHVFRKRSPYKW-----RTGVNVLKRNSIRLAEVWLDD 435

Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
            +K Y+Y R       D GDIS +
Sbjct: 436 -YKTYYYERIN-NQLGDFGDISSR 457


>gi|291243604|ref|XP_002741691.1| PREDICTED: Polypeptide N-acetylgalactosaminyltransferase 1-like
           [Saccoglossus kowalevskii]
          Length = 565

 Score =  291 bits (746), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 155/353 (43%), Positives = 217/353 (61%), Gaps = 11/353 (3%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYP--LD 80
            PGE GK   +                 N+  SN IS +R++PD+RM+ CK   YP    
Sbjct: 61  APGEMGKGVVIAPEEEELKKEMFKINQFNLLASNKISVNRSLPDVRMDGCKKKTYPPHNT 120

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LPK S+++VFHNE +S+L+R VHSII R+P   LEEIILVDD S +  L ++LEDY+++ 
Sbjct: 121 LPKTSIVIVFHNEAWSTLIRNVHSIINRSPRMLLEEIILVDDASERDFLGKELEDYVKKL 180

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             +VR+ R  +R GLIR R RGA  S GEVI FLDAHCE    WL PL+A I  DR  + 
Sbjct: 181 PVRVRVERMDKRSGLIRARLRGAGVSTGEVITFLDAHCECTQGWLEPLMARIAEDRSRVV 240

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
            P+ID I  +T+EF +    D  Y G F W + ++   +P+RE  +RK + + P  +PT 
Sbjct: 241 CPIIDVISDETFEFHA--GSDMTYGG-FNWKLNFRWYSVPKREMDRRKGDRTIPLNTPTM 297

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLFA+ + +F E+G YD G+ +WGGEN E+SF+IWMCGG++E V CS +GHV+R   P
Sbjct: 298 AGGLFAIHKDYFEEIGTYDAGMDIWGGENLEMSFRIWMCGGTLEIVTCSHVGHVFRKTTP 357

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           Y+F        G +I  N +R+ E W D+ +K +FY   P +   + GD++ +
Sbjct: 358 YSFPGGT----GAIINKNNRRLAEVWMDD-YKTFFYKISPGSKKSEYGDVTNR 405


>gi|324507488|gb|ADY43175.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Ascaris suum]
          Length = 632

 Score =  291 bits (746), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 164/377 (43%), Positives = 223/377 (59%), Gaps = 34/377 (9%)

Query: 12  NLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGD-----ASLGEYGMNMETSNHISFDR 62
           N + P+  YK+G     GEGGK   + +   +A +     A       N   S+ IS  R
Sbjct: 105 NADSPI--YKKGDKNQAGEGGKPVKINQEQLSAQEREKYAAGFRNNAFNQYVSDMISIHR 162

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++P    EECK   Y  DLP  SVI+ FHNE +S L+RTVHS+I+RTP   L E+ILVDD
Sbjct: 163 SLPSTIDEECKTEKYLDDLPSTSVIICFHNEAWSVLLRTVHSVIERTPEHLLTEVILVDD 222

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
           FS    L + LE+Y+     KVR++R  +REGLIR R +GA  S+G V+ FLD+HCE   
Sbjct: 223 FSDMDHLKKPLEEYMSALK-KVRIVRMDKREGLIRARLKGAAVSKGAVVTFLDSHCECME 281

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYK 235
            W+ PLL  I  +   +  PVID ID +T+E+        HY        G F+W + + 
Sbjct: 282 GWIEPLLDRIKRNSSTVVCPVIDVIDDETFEY--------HYSKAYFTNVGGFDWSLQFN 333

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
            + +PER+ K RK + +P +SPT AGGLF++DRA+F +LG YDPG  +WGGEN ELSFKI
Sbjct: 334 WHAIPERDRKNRKRHIDPVRSPTMAGGLFSIDRAYFEKLGTYDPGFDIWGGENLELSFKI 393

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           WMCGG++E VPCS +GHV+R   PY +     R    ++  N  R+ E W DE +K Y+Y
Sbjct: 394 WMCGGTLEIVPCSHVGHVFRKRSPYKW-----RTGVNVLKKNSVRLAEVWLDE-YKVYYY 447

Query: 356 TREPLAMFLDMGDISEQ 372
            R       D GD+S++
Sbjct: 448 ERIN-NQTGDYGDVSDR 463


>gi|193784963|dbj|BAG54116.1| unnamed protein product [Homo sapiens]
          Length = 608

 Score =  291 bits (746), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 156/332 (46%), Positives = 208/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  S+ + + R +PD R   CK   YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISDRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS+I RTPA  L EIILVDD S   DL  +L++Y+Q++  GK+++IRNT+REGLIR R 
Sbjct: 171 VHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  DR  +  PVID I   T      Y  
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E  + +  + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELGRAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 432


>gi|153792095|ref|NP_071370.2| polypeptide N-acetylgalactosaminyltransferase 11 [Homo sapiens]
 gi|51316030|sp|Q8NCW6.2|GLT11_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
           AltName: Full=Polypeptide GalNAc transferase 11;
           Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 11;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 11
 gi|5630076|gb|AAD45821.1|AC006017_1 N-acetylgalactosaminyltransferase; similar to Q10473 (PID:g1709559)
           [Homo sapiens]
 gi|51105934|gb|EAL24518.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Homo
           sapiens]
 gi|119574361|gb|EAW53976.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
           isoform CRA_b [Homo sapiens]
 gi|189442406|gb|AAI67834.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
           [synthetic construct]
 gi|345500003|emb|CAC79625.3| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
           sapiens]
          Length = 608

 Score =  291 bits (746), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 156/332 (46%), Positives = 208/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  S+ + + R +PD R   CK   YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISDRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS+I RTPA  L EIILVDD S   DL  +L++Y+Q++  GK+++IRNT+REGLIR R 
Sbjct: 171 VHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  DR  +  PVID I   T      Y  
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E  + +  + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELGRAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 432


>gi|10437774|dbj|BAB15105.1| unnamed protein product [Homo sapiens]
          Length = 608

 Score =  291 bits (746), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 156/332 (46%), Positives = 208/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  S+ + + R +PD R   CK   YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISDRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS+I RTPA  L EIILVDD S   DL  +L++Y+Q++  GK+++IRNT+REGLIR R 
Sbjct: 171 VHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  DR  +  PVID I   T      Y  
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E  + +  + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELGRAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 432


>gi|3047193|gb|AAC13672.1| GLY5b [Caenorhabditis elegans]
          Length = 626

 Score =  291 bits (746), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 165/384 (42%), Positives = 225/384 (58%), Gaps = 36/384 (9%)

Query: 11  GNLEPP---LEP----YKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGMNMET 54
           GNL  P   ++P    YK+G     GE GKA       L    +A  D  +     N   
Sbjct: 88  GNLAKPKFMVDPNDPIYKKGDAAQAGELGKAVVVDKTKLSTEEKAKYDKGMLNNAFNQYA 147

Query: 55  SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 114
           S+ IS  RT+P     ECK   Y  +LP+ SVI+ FHNE +S L+RTVHS+++RTP   L
Sbjct: 148 SDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLL 207

Query: 115 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
           EE++LVDDFS      + LE+Y+ +F GKV+++R  +REGLIR R RGA  + GEV+ +L
Sbjct: 208 EEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYL 267

Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIF 228
           D+HCE    W+ PLL  I  D   +  PVID ID  T+E+       HH +      G F
Sbjct: 268 DSHCECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGF 320

Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
           +WG+ +  + +PER+ K R    +P +SPT AGGLF++D+ +F +LG YDPG  +WGGEN
Sbjct: 321 DWGLQFNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGEN 380

Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
            ELSFKIWMCGG++E VPCS +GHV+R   PY +     R    ++  N  R+ E W D+
Sbjct: 381 LELSFKIWMCGGTLEIVPCSHVGHVFRKRSPYKW-----RTGVNVLKRNSIRLAEVWLDD 435

Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
            +K Y+Y R       D GDIS +
Sbjct: 436 -YKTYYYERIN-NQLGDFGDISSR 457


>gi|114616856|ref|XP_001143140.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           3 [Pan troglodytes]
 gi|114616860|ref|XP_001143304.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           4 [Pan troglodytes]
 gi|410221964|gb|JAA08201.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
           troglodytes]
 gi|410256658|gb|JAA16296.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
           troglodytes]
 gi|410301646|gb|JAA29423.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
           troglodytes]
 gi|410301648|gb|JAA29424.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
           troglodytes]
 gi|410348810|gb|JAA41009.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
           troglodytes]
          Length = 608

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 156/332 (46%), Positives = 207/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R   CK   YP DLP AS+++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPDLPAASIVICFYNEAFSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS+I RTPA  L EIILVDD S   DL  +L++Y+Q++  GK+++IRNT+REGLIR R 
Sbjct: 171 VHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  DR  +  PVID I   T      Y  
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E    +  + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 432


>gi|397469939|ref|XP_003806595.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           1 [Pan paniscus]
 gi|397469941|ref|XP_003806596.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           2 [Pan paniscus]
          Length = 608

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 165/378 (43%), Positives = 224/378 (59%), Gaps = 21/378 (5%)

Query: 2   PVFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETS 55
           P FKA+     ++  ++ + E P EG   +         E  +   D    ++  NM  S
Sbjct: 69  PQFKAN----KIDDVIDSHVEDPEEGHLKFSSELGMIFNERDQELRDLGYQKHAFNMLIS 124

Query: 56  NHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
           N + + R +PD R   CK   YP DLP AS+++ F+NE FS+L+RTVHS+I RTPA  L 
Sbjct: 125 NRLGYHRDVPDTRNAACKEKFYPPDLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLH 184

Query: 116 EIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
           EIILVDD S   DL  +L++Y+Q++  GK+++IRNT+REGLIR R  GA  + GEV+VFL
Sbjct: 185 EIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFL 244

Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 234
           D+HCEV + WL PLLA I  DR  +  PVID I   T      Y      RG F WG+ +
Sbjct: 245 DSHCEVNVMWLQPLLATIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHF 300

Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
           K + +P  E    +  + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+
Sbjct: 301 KWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFR 360

Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
           IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R+   W DE  + YF
Sbjct: 361 IWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYF 415

Query: 355 YTREPLAMFLDMGDISEQ 372
             R  L      G+ISE+
Sbjct: 416 SLRPDLKT-KSYGNISER 432


>gi|116284114|gb|AAH38440.1| GALNT1 protein [Homo sapiens]
          Length = 499

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 146/323 (45%), Positives = 208/323 (64%), Gaps = 9/323 (2%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N+  S  I+ +R++PD+R+E CK   YP +LP  SV++VFHNE +S+L+RTVHS+I R+P
Sbjct: 25  NLMASEMIALNRSLPDVRLEGCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSP 84

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              +EEI+LVDD S +  L + LE Y+++    V +IR  +R GLIR R +GA  S+G+V
Sbjct: 85  RHMIEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQV 144

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           I FLDAHCE  + WL PLLA I  DR+ +  P+ID I   T+E+ +    D  Y G F W
Sbjct: 145 ITFLDAHCECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNW 201

Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            + ++   +P+RE  +RK + + P ++PT AGGLF++D  +F E+G YD G+ +WGGEN 
Sbjct: 202 KLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDIDYFQEIGTYDAGMDIWGGENL 261

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF+IW CGG++E V CS +GHV+R   PY F        G +I  N +R+ E W DE 
Sbjct: 262 EISFRIWQCGGTLEIVTCSHVGHVFRKATPYTFPGGT----GQIINKNNRRLAEVWMDE- 316

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
            K +FY   P    +D GDIS +
Sbjct: 317 FKNFFYIISPGVTKVDYGDISSR 339


>gi|71993513|ref|NP_001022851.1| Protein GLY-5, isoform b [Caenorhabditis elegans]
 gi|14530626|emb|CAC42368.1| Protein GLY-5, isoform b [Caenorhabditis elegans]
          Length = 623

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 165/384 (42%), Positives = 225/384 (58%), Gaps = 36/384 (9%)

Query: 11  GNLEPP---LEP----YKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGMNMET 54
           GNL  P   ++P    YK+G     GE GKA       L    +A  D  +     N   
Sbjct: 88  GNLAKPKFMVDPNDPIYKKGDAAQAGELGKAVVVDKTKLSTEEKAKYDKGMLNNAFNQYA 147

Query: 55  SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 114
           S+ IS  RT+P     ECK   Y  +LP+ SVI+ FHNE +S L+RTVHS+++RTP   L
Sbjct: 148 SDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLL 207

Query: 115 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
           EE++LVDDFS      + LE+Y+ +F GKV+++R  +REGLIR R RGA  + GEV+ +L
Sbjct: 208 EEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYL 267

Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIF 228
           D+HCE    W+ PLL  I  D   +  PVID ID  T+E+       HH +      G F
Sbjct: 268 DSHCECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGF 320

Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
           +WG+ +  + +PER+ K R    +P +SPT AGGLF++D+ +F +LG YDPG  +WGGEN
Sbjct: 321 DWGLQFNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGEN 380

Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
            ELSFKIWMCGG++E VPCS +GHV+R   PY +     R    ++  N  R+ E W D+
Sbjct: 381 LELSFKIWMCGGTLEIVPCSHVGHVFRKRSPYKW-----RTGVNVLKRNSIRLAEVWLDD 435

Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
            +K Y+Y R       D GDIS +
Sbjct: 436 -YKTYYYERIN-NQLGDFGDISSR 457


>gi|449270901|gb|EMC81545.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Columba livia]
          Length = 608

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 159/370 (42%), Positives = 220/370 (59%), Gaps = 14/370 (3%)

Query: 5   KADGKLGN-LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRT 63
           K    LGN ++ P++   E   E G  ++  E  +   D    ++  NM  SN + + R 
Sbjct: 75  KIGNALGNHVQDPVKGEVEFSPEMGMIFN--EEDQEVRDLGYQKHAFNMLISNRLGYHRE 132

Query: 64  IPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 123
           +PD R  +C+   YP DLP ASVI+ F+NE  S+L+RTVHS++ RTPA  L EIILVDD 
Sbjct: 133 VPDTRDVKCREKSYPSDLPSASVIICFYNEALSALLRTVHSVLDRTPAHLLHEIILVDDN 192

Query: 124 SSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
           S  ADL + L++Y++ +     +L+RN +REGLIR R  GA  + G+V+VFLD+HCEV  
Sbjct: 193 SELADLKKDLDEYVKTQLPKTTKLVRNEKREGLIRGRMIGASHATGQVLVFLDSHCEVNE 252

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLL PI  DR+ +  PVID I   T      Y      RG F WG+ +K + +P  
Sbjct: 253 MWLQPLLTPIREDRRTVVCPVIDIISADTL----TYSSSPVVRGGFNWGLHFKWDLVPLS 308

Query: 243 EAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSI 302
           E +  +  + P KSPT AGGLFAMDR +F ELG YD G+ +WGGEN E+SF+IWMCGG +
Sbjct: 309 ELEGPEGATAPIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISFRIWMCGGRL 368

Query: 303 EWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM 362
             +PCSR+GH++R   PY      D      + +N  R+   W DE  + YF  R  L M
Sbjct: 369 LIIPCSRVGHIFRKRRPYGSPGGQD-----TMAHNSLRLAHVWMDEYKEQYFALRPELRM 423

Query: 363 FLDMGDISEQ 372
             + G+I+++
Sbjct: 424 -RNYGNITDR 432


>gi|313227425|emb|CBY22572.1| unnamed protein product [Oikopleura dioica]
          Length = 588

 Score =  291 bits (745), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 153/354 (43%), Positives = 215/354 (60%), Gaps = 11/354 (3%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL-- 79
           +GPGE G    +P+           E   N+  SN IS +RT+ D+RM  CK  DY    
Sbjct: 82  KGPGEMGAPVKIPKDKEKESKKMFQENQFNLMASNMISLNRTLKDVRMSGCKKHDYANLG 141

Query: 80  DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
            LPK S+I VFHNE +S+L+R++HS+I R+P + LEEIILVDD S K  L ++L+DY++ 
Sbjct: 142 ALPKTSIIFVFHNEAWSTLLRSIHSVINRSPREMLEEIILVDDKSEKDFLGKQLDDYVKN 201

Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
               V +IR   REGLIR R  GAK ++GEV+ FLDAH E    WL PLL  I  DR  +
Sbjct: 202 LPVPVHIIRQQHREGLIRARLEGAKIAKGEVLTFLDAHIEASPGWLEPLLYEIKKDRTNV 261

Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPT 258
             P+ID I   T+EF  +   D  Y G F W + ++   +P+RE  +R  + S P ++PT
Sbjct: 262 ICPIIDVISDDTFEF--LTGSDLTYGG-FNWKLNFRWYPVPQREVDRRGGDRSLPMQTPT 318

Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
            AGGLF++D+++F E+G YD G+ +WGGEN E+SF+IWMCGG++    CS +GHV+R   
Sbjct: 319 MAGGLFSIDKSYFYEIGSYDSGMDIWGGENLEMSFRIWMCGGTVLIATCSHVGHVFRKAT 378

Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           PY F     ++    I  N +R+ E W D+ +K +FY   P  M    GD+S++
Sbjct: 379 PYTFPGGTSQI----INKNNRRLAEVWMDD-YKKFFYIVNPTVMKHKYGDVSDR 427


>gi|3047191|gb|AAC13671.1| GLY5a [Caenorhabditis elegans]
          Length = 623

 Score =  291 bits (745), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 165/384 (42%), Positives = 225/384 (58%), Gaps = 36/384 (9%)

Query: 11  GNLEPP---LEP----YKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGMNMET 54
           GNL  P   ++P    YK+G     GE GKA       L    +A  D  +     N   
Sbjct: 88  GNLAKPKFMVDPNDPIYKKGDAAQAGELGKAVVVDKTKLSTEEKAKYDKGMLNNAFNQYA 147

Query: 55  SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 114
           S+ IS  RT+P     ECK   Y  +LP+ SVI+ FHNE +S L+RTVHS+++RTP   L
Sbjct: 148 SDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLL 207

Query: 115 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
           EE++LVDDFS      + LE+Y+ +F GKV+++R  +REGLIR R RGA  + GEV+ +L
Sbjct: 208 EEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYL 267

Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIF 228
           D+HCE    W+ PLL  I  D   +  PVID ID  T+E+       HH +      G F
Sbjct: 268 DSHCECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGF 320

Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
           +WG+ +  + +PER+ K R    +P +SPT AGGLF++D+ +F +LG YDPG  +WGGEN
Sbjct: 321 DWGLQFNWHSIPERDRKNRTRPIDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGEN 380

Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
            ELSFKIWMCGG++E VPCS +GHV+R   PY +     R    ++  N  R+ E W D+
Sbjct: 381 LELSFKIWMCGGTLEIVPCSHVGHVFRKRSPYKW-----RTGVNVLKRNSIRLAEVWLDD 435

Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
            +K Y+Y R       D GDIS +
Sbjct: 436 -YKTYYYERIN-NQLGDFGDISSR 457


>gi|291230378|ref|XP_002735140.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Saccoglossus kowalevskii]
          Length = 621

 Score =  291 bits (745), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 153/355 (43%), Positives = 216/355 (60%), Gaps = 8/355 (2%)

Query: 19  PYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYP 78
           P + GPGE GKA  + ++     +        N+  SN IS DR++ D R + C    Y 
Sbjct: 96  PLRVGPGEMGKAVTVAKSEEEEMEKMFKVNYFNLMISNRISNDRSLADYRPQGCFAKKYS 155

Query: 79  LDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ 138
            +LPK SVILV+HNE +S LMRTVHS+I R+P   LEEI+L+DD S++  L + L+DYI 
Sbjct: 156 RNLPKTSVILVYHNEAWSVLMRTVHSVINRSPRHLLEEILLIDDASTREYLGRPLDDYIT 215

Query: 139 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
           +    VR+    ER GLI  R +GA+ ++  V+ FLD+HCE    WL PLL  I ++R  
Sbjct: 216 KLPVPVRVHHAKERRGLIGARLKGAELAKAPVLTFLDSHCECSKGWLEPLLDRIAANRST 275

Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSP 257
           +  PVI+ ID +++ F +  E  H   G F+W +++    +P+ E  +   + SEP +SP
Sbjct: 276 VVCPVINQIDDRSFAFVNATEVSH--IGGFDWNIIFNWYNIPQSEKDRIGGDKSEPVRSP 333

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
           T AGGLF++D+++F ELG YDP    WGGEN ELS KIWMCGG +E+VPCS +GHV+R  
Sbjct: 334 TMAGGLFSIDKSYFEELGSYDPEFEFWGGENIELSLKIWMCGGILEFVPCSHVGHVFRKH 393

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            P+ +      V G     N +R+ E W DE +K  FY  +P  M +D GDIS++
Sbjct: 394 NPHKYKNTTYNVVG----RNNRRLAEVWLDE-YKYLFYANQPETMKIDPGDISQR 443


>gi|332243650|ref|XP_003270991.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           2 [Nomascus leucogenys]
          Length = 527

 Score =  291 bits (744), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 158/350 (45%), Positives = 212/350 (60%), Gaps = 11/350 (3%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           PG G +     E  +   D    ++  NM  SN + + R +PD R   C+   YP DLP 
Sbjct: 12  PGCGQRGMIFNERDQELRDLGYQKHAFNMLISNRLGYHRDVPDTRNAACQEKFYPPDLPS 71

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NG 142
           ASV++ F+NE FS+L+RT HS+I RTPA  L EIILVDD S   DL  +L++Y+Q++  G
Sbjct: 72  ASVVICFYNEAFSALLRTAHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPG 131

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
           K+++IRNT+REGLIR R  GA  + GEV+VFLD+HCEV + WL PLLA I  D+  +  P
Sbjct: 132 KIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDQHTVVCP 191

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           VID I   T      Y      RG F WG+ +K + +P  E    +  + P KSPT AGG
Sbjct: 192 VIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGGAEGATAPIKSPTMAGG 247

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY  
Sbjct: 248 LFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGS 307

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +  D      +T+N  R+   W DE  + YF  R  L      G+ISE+
Sbjct: 308 PEGQD-----TMTHNSLRLAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 351


>gi|55742075|ref|NP_001006904.1| polypeptide N-acetylgalactosaminyltransferase 11 [Xenopus
           (Silurana) tropicalis]
 gi|49522064|gb|AAH75106.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
           [Xenopus (Silurana) tropicalis]
          Length = 563

 Score =  291 bits (744), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 152/329 (46%), Positives = 201/329 (61%), Gaps = 11/329 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  N+  SN + + R +PD R  +C    YP DLP AS+++ F+NE FS+L+RT
Sbjct: 66  DVGYQKHAFNLLISNRLGYHRDVPDTRDSKCAKKTYPPDLPMASIVICFYNEAFSALLRT 125

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRS 160
           VHS++ RTPAQ L EIILVDD S   DL + L+ Y+Q   + KV+L+RN +REGLIR R 
Sbjct: 126 VHSVLDRTPAQLLHEIILVDDNSELDDLKKDLDGYMQENLSKKVKLVRNKQREGLIRGRM 185

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + G+V+VFLD+HCEV   WL PLLAPI  + + +  PVID I   T     +Y  
Sbjct: 186 VGASHATGDVLVFLDSHCEVNEMWLQPLLAPIKENPRTVVCPVIDIISADTL----IYSS 241

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E    +  S P++SPT AGGLFAMDR +F  LG YD G
Sbjct: 242 SPVVRGGFNWGLHFKWDPVPLAELGGPEGFSAPFRSPTMAGGLFAMDREYFNMLGQYDSG 301

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGGS+  VPCSR+GH++R   PY      D      + +N  R
Sbjct: 302 MDIWGGENLEISFRIWMCGGSLLIVPCSRVGHIFRKRRPYGSPGGHD-----TMAHNSLR 356

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +   W DE    YF  R P     D GDI
Sbjct: 357 LAHVWMDEYKDQYFALR-PELRNRDFGDI 384


>gi|444724231|gb|ELW64842.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Tupaia chinensis]
          Length = 654

 Score =  290 bits (743), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 154/332 (46%), Positives = 207/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  N+  SN + + R +PD R   CK   YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNVLISNRLGYHRDVPDTRSAACKGKSYPADLPVASVVICFYNEAFSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS+I RTPA+ L E+ILVDD S   DL  +L++Y+Q++  GK+++IRN +REGLIR R 
Sbjct: 171 VHSVIDRTPARLLHEVILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNKKREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  DR+ +  PVID I   T      Y  
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRRTVVCPVIDIISADTL----AYSS 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E       + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPAVRGGFNWGLHFKWDLVPLSELAGAGGATAPIKSPTMAGGLFAMNRQYFSELGQYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGQLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-RSYGNISER 432


>gi|432950788|ref|XP_004084611.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 11-like [Oryzias
           latipes]
          Length = 574

 Score =  290 bits (742), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 151/339 (44%), Positives = 203/339 (59%), Gaps = 11/339 (3%)

Query: 35  EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEG 94
           EA +   DA    +  N+  SN +   R +PD R ++C+   YP  LP ASV++ F NE 
Sbjct: 71  EADQEVRDAGYHRHAFNVLISNRLGSHRELPDTRDKQCRKRSYPQALPSASVVICFFNEA 130

Query: 95  FSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTERE 153
            S+L+RTVHS++ RTPA  L EIILVDD S   +L + L+  + +   GKVRL+RN +RE
Sbjct: 131 LSALLRTVHSVLDRTPAYLLHEIILVDDQSELEELKEGLDRCVREELQGKVRLVRNRKRE 190

Query: 154 GLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWE 213
           GLIR R  GA  + G+V+VFLD+HCEV  +WL PLLAPI  DR+ +  P+ID I   T  
Sbjct: 191 GLIRGRMIGAAHATGDVLVFLDSHCEVNQDWLQPLLAPIQKDRRTVVCPIIDIISADTL- 249

Query: 214 FRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLE 273
               Y      RG F WG+ +K + +P  E    +  + P +SPT AGGLFAM+R +F E
Sbjct: 250 ---TYSSSPIVRGGFNWGLHFKWDPVPPSEISGPEGAAGPIRSPTMAGGLFAMNREYFNE 306

Query: 274 LGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPL 333
           LG YDPG+ +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY      D      
Sbjct: 307 LGRYDPGMDIWGGENLEISFRIWMCGGQLLIIPCSRVGHIFRKRRPYGSPGGQD-----T 361

Query: 334 ITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           + +N  R+   W DE  + Y   R P       GDISE+
Sbjct: 362 MAHNSLRLAHVWMDEYKEQYLSLR-PELRNRSYGDISER 399


>gi|268576200|ref|XP_002643080.1| C. briggsae CBR-GLY-5 protein [Caenorhabditis briggsae]
          Length = 630

 Score =  290 bits (742), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 164/382 (42%), Positives = 225/382 (58%), Gaps = 36/382 (9%)

Query: 11  GNLEPP---LEP----YKEG----PGEGGKAYHLPEAYRAAGDASLGEYGM-----NMET 54
           GNL  P   ++P    YK+G     GE GKA  + +         + + GM     N   
Sbjct: 92  GNLAKPKFMVDPNDPIYKKGDASQAGELGKAVIVDKTKLTPEQKGIYDKGMLNNAFNQYA 151

Query: 55  SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 114
           S+ IS  RT+P     ECK   Y  +LP+ SVI+ FHNE +S L+RTVHS+++RTP   L
Sbjct: 152 SDMISVHRTLPTNIDAECKTEKYNENLPRTSVIVCFHNEAWSVLLRTVHSVLERTPEHLL 211

Query: 115 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
           EEI+LVDDFS      + LE+Y+ +F GKV+++R  +REGLIR R RGA  + GEV+ +L
Sbjct: 212 EEIVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAIATGEVLTYL 271

Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIF 228
           D+HCE    W+ PLL  I  D   +  PVID ID  T+E+       HH +      G F
Sbjct: 272 DSHCECMEGWIEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGF 324

Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
           +WG+ +  + +PER+ K R    +P +SPT AGGLF++D+ +F +LG YDPG  +WGGEN
Sbjct: 325 DWGLQFNWHSIPERDRKNRTRAIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGEN 384

Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
            ELSFKIWMCGG++E VPCS +GHV+R   PY +     R    ++  N  R+ E W D+
Sbjct: 385 LELSFKIWMCGGTLEIVPCSHVGHVFRKRSPYKW-----RTGVNVLKRNSIRLAEVWLDD 439

Query: 349 KHKAYFYTREPLAMFLDMGDIS 370
            +K Y+Y R       D GD+S
Sbjct: 440 -YKTYYYERIN-NQLGDFGDVS 459


>gi|291397404|ref|XP_002715111.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
           [Oryctolagus cuniculus]
          Length = 608

 Score =  290 bits (742), Expect = 9e-76,   Method: Compositional matrix adjust.
 Identities = 155/332 (46%), Positives = 207/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  N+  SN + + R +PD R   CK   YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNLLISNRLGYHRDVPDTRNAACKDKSYPADLPVASVVICFYNEAFSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS++ RTPA  L EIILVDD S   DL  +L++Y+Q++  GK+++IRNT+REGLIR R 
Sbjct: 171 VHSVLDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  DR  +  PVID I   T      Y  
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E    +  + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSEQGGAEGATAPIKSPTMAGGLFAMNRLYFNELGQYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 432


>gi|327274386|ref|XP_003221958.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Anolis carolinensis]
          Length = 608

 Score =  290 bits (741), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 151/332 (45%), Positives = 203/332 (61%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R  +CK   YPLDLP AS+I+ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRDAKCKGKKYPLDLPSASIIICFYNEAFSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRS 160
           VHS++ RTP+  L EIILVDD S   DL + L+ Y+++     V+L+RN +REGLIR R 
Sbjct: 171 VHSVLDRTPSHLLHEIILVDDNSELVDLKEDLDVYLRKNLPNNVKLVRNGKREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + G+V+VFLD+HCEV   WL PLL PI   RK +  PVID I   T      Y  
Sbjct: 231 IGASHATGKVLVFLDSHCEVNELWLQPLLTPIRESRKTVVCPVIDIISADTL----TYSS 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E +  +  + P KSPT AGGLFAMDR +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELEGPEGATAPIKSPTMAGGLFAMDREYFNELGQYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY      D      + +N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLLIIPCSRVGHIFRKRRPYGSPGGQD-----TMAHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE    YF  R  L M  + G+I+++
Sbjct: 402 LAHVWMDEYKDQYFALRPELRM-RNYGNITDR 432


>gi|391343213|ref|XP_003745907.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Metaseiulus occidentalis]
          Length = 583

 Score =  290 bits (741), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 149/348 (42%), Positives = 212/348 (60%), Gaps = 9/348 (2%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           PGE G+   +PE   A  +        N+  S  I+ +R++PD+R+ EC+   YP  LP 
Sbjct: 78  PGENGEGVEIPEKETALKNEKFKINQFNLLASERIALNRSLPDVRLAECRKKTYPDRLPT 137

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
            S+++VFHNE +++L+RTVHSII+ +P + + EIILVDD S    L QKLEDY+ +    
Sbjct: 138 TSIVIVFHNEAWTTLLRTVHSIIQMSPRELIAEIILVDDASEFDHLGQKLEDYVAKLPVP 197

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
           V ++R  +R GLIR R  GA+   G+VI FLDAHCE    WL PLLA I  D   +  PV
Sbjct: 198 VHVLRTGKRSGLIRARLIGAETVTGQVITFLDAHCECTEGWLEPLLARIAEDNTRVVCPV 257

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
           ID I  +   F  V   D  + G F W + ++   +P+RE  +R  + + P ++PT AGG
Sbjct: 258 IDVISDEN--FAYVPASDQTWGG-FNWKLNFRWYRVPQRENDRRGGDRTLPVRTPTMAGG 314

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LFAMD+A+F +LG YD G+ +WGGEN E+SF+IWMCGG++E V CS +GHV+R   PY F
Sbjct: 315 LFAMDKAYFEKLGKYDEGMDIWGGENLEMSFRIWMCGGTLEIVTCSHVGHVFRKSTPYTF 374

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
                   G ++ +N  R+ + W DE  K +++   P+A  +D GD S
Sbjct: 375 PGGT----GKIVNHNNARLADVWLDE-WKDFYFAINPVAKKVDRGDTS 417


>gi|256083753|ref|XP_002578103.1| peptidase [Schistosoma mansoni]
          Length = 1860

 Score =  290 bits (741), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 156/354 (44%), Positives = 220/354 (62%), Gaps = 13/354 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           ++GPGE G    L  + +A    +L   G N+  S  I  DR++ D+R   CK   Y   
Sbjct: 2   RQGPGENGLPVRLSNSQKALSKKTLNFNGFNIFVSEKIKTDRSVKDIRYPNCKGALYSKQ 61

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  S+I+  + E + +L+RTV S++ R+P + ++E+ILVDD SS+  L ++L++Y+ R 
Sbjct: 62  LPLVSIIIPVYEEHWETLIRTVVSVLNRSPLELIKEVILVDDGSSRRYLKERLDNYLSRT 121

Query: 141 --NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
              G V +I   EREGLIR R  GAK + G+V++FLD+HCE  +NWLPPLL PI  + + 
Sbjct: 122 YPGGLVWVIHLKEREGLIRARLSGAKLATGDVLIFLDSHCETNVNWLPPLLDPISKNYRT 181

Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPT 258
           +T P ID ID  T+E+R+    D   RG F+W   YK   LP R +    +   P++SP 
Sbjct: 182 VTCPFIDVIDADTFEYRA---QDDGARGAFDWSFYYKR--LP-RLSTDSLHPETPFESPV 235

Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
            AGGLFA+ R +F ELGGYDP L +WGGE +ELSFKIWMCGG +  VPCSR+GH++R + 
Sbjct: 236 MAGGLFAISRKWFWELGGYDPLLHIWGGEQYELSFKIWMCGGRLIDVPCSRVGHIFREY- 294

Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           P NF +   ++K   +  N+KRV E W DE +K Y Y   P    +D GD+S+Q
Sbjct: 295 PTNFPQ--PKIKN-FLRRNFKRVAEVWMDE-YKEYIYRSLPECRKVDPGDLSQQ 344


>gi|170592315|ref|XP_001900914.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Brugia malayi]
 gi|158591609|gb|EDP30214.1| Polypeptide N-acetylgalactosaminyltransferase 3, putative [Brugia
           malayi]
          Length = 584

 Score =  290 bits (741), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 154/368 (41%), Positives = 228/368 (61%), Gaps = 12/368 (3%)

Query: 9   KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           +LG L   L   + GPGE G A  +  + +        E   ++  S+ IS +R +PD R
Sbjct: 70  ELGILLKSLNFERNGPGEMGSAVIIDPSQQEERARKFKENQFDVMASDLISINRALPDYR 129

Query: 69  MEECKYWDYPLD---LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
             +C+      D   LP  S+I+VFHNE +S+L+RT+HS+I R+P   ++E+IL+DD S+
Sbjct: 130 SSKCREAARKYDVTSLPMVSIIIVFHNEAWSTLLRTLHSVINRSPLHLIKEVILIDDLSN 189

Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
           +  L + L+ YI+RF+    LI   ER GLIR R +GAK ++G+V++FLDAH EV   WL
Sbjct: 190 RTYLRKPLDTYIKRFSLPFHLIHLPERSGLIRARLQGAKVAKGKVLLFLDAHVEVTEGWL 249

Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
            PLL  + +DRK +  P+ID I  + +E+  +   D  + G F W + ++   +P RE +
Sbjct: 250 EPLLDRVSTDRKRVVAPIIDVISDENFEY--ITASDVTWGG-FNWHLNFRWYPVPMREME 306

Query: 246 KRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
           +R ++ S P ++PT AGGLFA+DR FF ++G YD G+ +WGGEN E+SF++WMCGGS+E 
Sbjct: 307 RRNHDRSVPLQTPTIAGGLFAIDRQFFYDIGSYDEGMEIWGGENLEISFRVWMCGGSLEI 366

Query: 305 VPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
            PCSR+GHV+R   PY+F     RV    I +N  R  E W DE +K  FY   P A  +
Sbjct: 367 HPCSRVGHVFRKHTPYSFPGGTARV----IHHNAARTAEVWMDE-YKDIFYGMVPAAKNV 421

Query: 365 DMGDISEQ 372
           D+GD++E+
Sbjct: 422 DVGDLTER 429


>gi|156373014|ref|XP_001629329.1| predicted protein [Nematostella vectensis]
 gi|156216327|gb|EDO37266.1| predicted protein [Nematostella vectensis]
          Length = 499

 Score =  289 bits (740), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 151/353 (42%), Positives = 213/353 (60%), Gaps = 10/353 (2%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK--YWDYPLD 80
           G G+ G+A  LP  ++     +   +  N+  S+ IS DR + D+R  +CK  +  YP  
Sbjct: 1   GLGDLGEAATLPTRFKEHAAHAFDNHSFNVMLSDRISLDRRLKDVRGPKCKRKHKLYPRA 60

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  SVI+ FHNE  S L+RTVHS++  +P + + +IILVDD+S   DL Q L D+I   
Sbjct: 61  LPTTSVIICFHNEALSVLLRTVHSVLNESPPRLIADIILVDDYSEYDDLKQPLIDHISML 120

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
           N KV+LIR   R+GL+  R RGA+E+RGEV+ FLD+HCE    WL PLL  I  DR+ + 
Sbjct: 121 N-KVKLIRMPSRQGLVPARLRGAEEARGEVLTFLDSHCEATPGWLEPLLVRIAEDRRNVV 179

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            PVI+ I+    +FR       H RG F W + +    +PE E K+RK  ++  +SPT A
Sbjct: 180 CPVIEVINAD--DFRYQASDVIHERGGFTWDLFFTWKAIPEAEKKRRKDETDYIRSPTMA 237

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM-P 319
           GGLFA+ + +F +LG YD  + +WGGEN E+SF+IWMCGG +E VPCSR+GHV+R +  P
Sbjct: 238 GGLFAIHKKYFYDLGSYDSKMEIWGGENLEMSFRIWMCGGQLEIVPCSRVGHVFRKYTSP 297

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           Y F K         +  N+ R+ E W DE    Y+  +      +D+GDIS++
Sbjct: 298 YKFPKGTTTT----LARNFNRLAEVWMDEYKDHYYRKKTEEERNVDIGDISDR 346


>gi|348513278|ref|XP_003444169.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
           [Oreochromis niloticus]
          Length = 584

 Score =  289 bits (740), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 162/360 (45%), Positives = 216/360 (60%), Gaps = 20/360 (5%)

Query: 18  EPYKEGPGEGGKAYHL---PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC-- 72
           +P    PGE G+A HL   PE  +   D S+  Y +N+  S+ IS  R I D RM+EC  
Sbjct: 74  QPDNNAPGEWGRATHLNLSPEEKKQEQD-SVERYAINIYVSDKISLHRHIQDHRMKECRS 132

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K +DY   LP  SVI+ F+NE +S+L+RT+HS+++ TPA  L+EIILVDDFS +  L  K
Sbjct: 133 KKFDY-RHLPTTSVIIAFYNEAWSTLLRTIHSVLETTPAILLKEIILVDDFSDRGYLKSK 191

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           L DYI     +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE    W+ PLL  I
Sbjct: 192 LADYISDLQ-RVRLIRTNKREGLVRARLIGATYATGDVLTFLDCHCECVPGWIEPLLERI 250

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 252
             +   +  PVID ID+ T+EF    + D    G F+W + ++ + +PE E K+RK   +
Sbjct: 251 SENASTIVCPVIDTIDWNTFEF--YMQTDEPMIGGFDWRLTFQWHSVPEMERKRRKSRID 308

Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
           P +SPT AGGLFA+ +A+F  LG YD G+ VWGGEN ELSF++W CGGS+E  PCS +GH
Sbjct: 309 PIRSPTMAGGLFAVSKAYFEYLGTYDMGMDVWGGENLELSFRVWQCGGSLEIHPCSHVGH 368

Query: 313 VYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           V+    PY           P    N  R  E W D  +K +FY R P A     G+ISE+
Sbjct: 369 VFPKKAPY---------ARPNFLQNTVRAAEVWMD-SYKKHFYNRNPPARKEKYGNISER 418


>gi|118093951|ref|XP_422165.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Gallus
           gallus]
          Length = 556

 Score =  289 bits (740), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 159/371 (42%), Positives = 231/371 (62%), Gaps = 13/371 (3%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K  +L P L       +EGPGE GKA  +P+  +            N+  S+ I+ +R
Sbjct: 36  DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R++ CK   YP +LP  SV++VFHNE +S+L+RTVHS++ R+P + L EIILVDD
Sbjct: 96  SLPDVRLDGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVHSVVARSPRRLLAEIILVDD 155

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L   LE+Y+++    V+++R  +R GLIR R RGA  +RG+VI FLDAHCE   
Sbjct: 156 ASEREFLKASLENYVKKLEVPVKILRMEQRSGLIRARLRGAAAARGQVITFLDAHCECTR 215

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA I+ DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+R
Sbjct: 216 GWLEPLLARIWEDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           E  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF++W CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGSYDAGMDIWGGENLEMSFRVWQCGGS 332

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +E V CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387

Query: 362 MFLDMGDISEQ 372
           + +D GD+S +
Sbjct: 388 VKVDYGDVSAR 398


>gi|355689592|gb|AER98884.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 11 [Mustela putorius
           furo]
          Length = 609

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 163/380 (42%), Positives = 225/380 (59%), Gaps = 20/380 (5%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETSN 56
           V ++  K+  ++  ++ + E P +G   +         E  +   D    ++  NM  SN
Sbjct: 66  VLESQFKVNKIDDTVDNHVEDPEKGNMKFSSELGMIFNERDQELRDLGYQKHAFNMLISN 125

Query: 57  HISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 116
            + + R +PD R   CK   YP+DLP ASV++ F+NE  S+L+RTVHS++ RTPAQ L E
Sbjct: 126 RLGYHRDVPDTRNAACKDKSYPVDLPVASVVICFYNEALSALLRTVHSVLDRTPAQLLHE 185

Query: 117 IILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
           IILVDD S   DL  +LE+Y+Q++  GK+++IRN +REGLIR R  GA  S GEV+VFLD
Sbjct: 186 IILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNAKREGLIRGRMIGAAHSTGEVLVFLD 245

Query: 176 AHCEVG---LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGM 232
           +HCEV    L WL PLLA I  DR+ +  PVID I   T      Y      RG F WG+
Sbjct: 246 SHCEVNVMWLMWLQPLLAAIQQDRRTVVCPVIDIISADTL----AYSSSPVVRGGFNWGL 301

Query: 233 LYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELS 292
            +K + +P  E    +  + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+S
Sbjct: 302 HFKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEIS 361

Query: 293 FKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKA 352
           F+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R+   W D+  + 
Sbjct: 362 FRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDDYKEQ 416

Query: 353 YFYTREPLAMFLDMGDISEQ 372
           YF  R  L      G+ISE+
Sbjct: 417 YFSLRPDLRT-KSYGNISER 435


>gi|308481980|ref|XP_003103194.1| CRE-GLY-3 protein [Caenorhabditis remanei]
 gi|308260299|gb|EFP04252.1| CRE-GLY-3 protein [Caenorhabditis remanei]
          Length = 615

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 157/378 (41%), Positives = 226/378 (59%), Gaps = 16/378 (4%)

Query: 3   VFKADGKLGN-LEPPLEPYKEGPG---EGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           VF  D +  N L   +E    GPG   +GG    +PE  +   +    E   N+  S  I
Sbjct: 86  VFPVDKETANQLRKLMETQAFGPGYHGQGGTGVTVPEDKKDIKEKRFLENQFNVVASEMI 145

Query: 59  SFDRTIPDLRMEECKYWDYPLD---LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
           S +RT+PD R E C+     L    LP  S+I+VFHNE +++L+RT+HS+I R+P   LE
Sbjct: 146 SINRTLPDYRSEACRTTGNSLKTEGLPTTSIIIVFHNEAWTTLLRTLHSVINRSPRHLLE 205

Query: 116 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
           EIILVDD S +  L + L+ YI++F   V L+   +R GLIR R  G+  ++G++++FLD
Sbjct: 206 EIILVDDKSDRDYLVKPLDAYIKKFPVPVHLVHLEDRSGLIRARLTGSGMAKGKILLFLD 265

Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 235
           AH EV   WL PL+  +  DRK +  P+ID I   T+E+ +  E      G F W + ++
Sbjct: 266 AHVEVTDGWLEPLVTRVAEDRKRVVAPIIDVISDDTFEYVTASETTW---GGFNWHLNFR 322

Query: 236 ENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
              +P+RE  +R  + S P ++PT AGGLFA+D+ FF ++G YD G+ VWGGEN E+SF+
Sbjct: 323 WYAVPKRELNRRGADRSMPIQTPTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEISFR 382

Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
           +WMCGGS+E  PCSR+GHV+R   PY F     +V    I +N  R  E W DE +KA+F
Sbjct: 383 VWMCGGSLEIHPCSRVGHVFRKQTPYTFPGGTAKV----IHHNAARTAEVWMDE-YKAFF 437

Query: 355 YTREPLAMFLDMGDISEQ 372
           Y   P A  ++ GD++E+
Sbjct: 438 YKMVPAARNVEAGDVTER 455


>gi|291238116|ref|XP_002738977.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Saccoglossus kowalevskii]
          Length = 561

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 153/355 (43%), Positives = 222/355 (62%), Gaps = 13/355 (3%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYP--L 79
           +GPGE G+   +P                N+  SN IS +RT+PD+R++ CK   YP   
Sbjct: 52  KGPGEMGQPVIIPPEEEELKKEMFKINQFNLLASNKISVNRTLPDVRIDGCKKKIYPPSQ 111

Query: 80  DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
            LP  S+I+VFHNE +S+L+R +HSII R+P + LEEIILVDD S +  L ++L+DY++ 
Sbjct: 112 KLPTTSIIIVFHNEAWSTLIRNIHSIINRSPREILEEIILVDDASERDFLGKQLDDYVRG 171

Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
            + +VR++R  ER G++  R RGA  S GEV+ FLDAHCE    WL PL+A I  DR  +
Sbjct: 172 LSVRVRVVRMAERSGIVGARLRGAAISTGEVLTFLDAHCECTKGWLEPLIARIAEDRTRV 231

Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE-PYKSPT 258
             PVID I  +T+E+ SV E      G F W + ++   + +RE K+RK ++  P  +PT
Sbjct: 232 VSPVIDSISDETFEYNSVPELGC---GGFNWRLNFRWYPMSKREKKRRKGDATIPINTPT 288

Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
            AGGLF++ + +F  +G YD G+ +WGGEN E+SF+IWMCGG++E VPCS +GHV+R   
Sbjct: 289 MAGGLFSIHKEYFYRIGTYDEGMDIWGGENLEMSFRIWMCGGTLEIVPCSHVGHVFRGKS 348

Query: 319 PYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           PY F G +A      ++  N +R+ E W DE +K+++Y   P A   + GDI ++
Sbjct: 349 PYTFPGGVA-----TVVHNNNRRLAEVWMDE-YKSFYYKTVPNARNAEYGDIEDR 397


>gi|426228257|ref|XP_004008230.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Ovis
           aries]
          Length = 606

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 155/332 (46%), Positives = 209/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R   CK   YP+DLP ASV++ F+NE  S+L+RT
Sbjct: 109 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYPVDLPVASVVICFYNEALSALLRT 168

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS++ RTPA+ L EIILVDD S   DL  +L++YIQ++  GK+++IRN +REGLIR R 
Sbjct: 169 VHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLPGKIKVIRNPKREGLIRGRM 228

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  DR+ +  PVID I   T      Y  
Sbjct: 229 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRRAVVCPVIDIISADTL----AYSS 284

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E    +  + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 285 SPVVRGGFNWGLHFKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSG 344

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 345 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 399

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L    + G+ISE+
Sbjct: 400 LAHVWLDEYKEQYFSLRPDLRT-RNYGNISER 430


>gi|351714167|gb|EHB17086.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Heterocephalus
           glaber]
          Length = 330

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 151/337 (44%), Positives = 210/337 (62%), Gaps = 9/337 (2%)

Query: 28  GKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVI 87
           GKA  +P+  +            N+  S+ I+ +R++PD+R+E CK   YP +LP  SV+
Sbjct: 2   GKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDELPNTSVV 61

Query: 88  LVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLI 147
           +VFHNE +S+L+RTV+S+I R+P   L E+ILVDD S +  L   LE+Y++     V++I
Sbjct: 62  IVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKFTLENYVKNLEVPVKII 121

Query: 148 RNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGI 207
           R  ER GLIR R RGA  S+G+VI FLDAHCE  L WL PLLA I  DRK +  P+ID I
Sbjct: 122 RMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVVCPIIDVI 181

Query: 208 DYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAM 266
              T+E+ +    D  Y G F W + ++   +P+RE  +RK + + P ++PT AGGLF++
Sbjct: 182 SDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSI 238

Query: 267 DRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLA 326
           DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +GHV+R   PY F    
Sbjct: 239 DRNYFEEIGTYDAGMDIWGGENLEISFRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGT 298

Query: 327 DRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMF 363
               G +I  N +R+ E W DE  K +FY   P   F
Sbjct: 299 ----GHVINKNNRRLAEVWMDE-FKDFFYIISPGMQF 330


>gi|296210174|ref|XP_002751861.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
           [Callithrix jacchus]
          Length = 607

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 154/332 (46%), Positives = 207/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R   CK   YP DLP ASV++ F+NE FS+L+RT
Sbjct: 110 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRT 169

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS+I RTPA  L E+ILVDD S   DL  +L++Y+Q++  GK+++IRNT+REGLIR R 
Sbjct: 170 VHSVIDRTPAHLLHEVILVDDDSDFDDLKGELDEYVQKYLPGKIKIIRNTKREGLIRGRM 229

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  D+  +  PVID I   T      Y  
Sbjct: 230 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDQHTVVCPVIDIISADTL----AYSS 285

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ ++ + +P  E    +  + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 286 SPIVRGGFNWGLHFRWDLVPLSELGGAEGATTPIKSPTMAGGLFAMNRQYFHELGQYDSG 345

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 346 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 400

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 401 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 431


>gi|260823684|ref|XP_002606210.1| hypothetical protein BRAFLDRAFT_246892 [Branchiostoma floridae]
 gi|229291550|gb|EEN62220.1| hypothetical protein BRAFLDRAFT_246892 [Branchiostoma floridae]
          Length = 595

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 149/341 (43%), Positives = 210/341 (61%), Gaps = 11/341 (3%)

Query: 32  HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFH 91
           H PE  +   D     +  N+  S+ I F R IPD R ++C+   YP  LPK S+++ F 
Sbjct: 92  HSPED-QETRDMGYRRHAFNLLISDRIGFHRNIPDTRNDKCRGKSYPSGLPKTSIVICFF 150

Query: 92  NEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTE 151
           NE +S+L+RTVHS++ RTP + L+EIIL+DDFS ++ L ++LE+YI+     V+L R  +
Sbjct: 151 NEAWSTLLRTVHSVLDRTPRELLQEIILIDDFSDQSHLKEELEEYIRDHLPMVQLYRTDK 210

Query: 152 REGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQT 211
           REGLIR R +GA  + G+V++FLD+HCEV   WL PLLA I  DR  +  P+ID I+  T
Sbjct: 211 REGLIRARVKGATHASGDVLMFLDSHCEVSKQWLEPLLARIAEDRTRVVCPIIDIINSDT 270

Query: 212 WEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFF 271
           +E    Y      RG F WG+ +K +++P++  +     + P  SPT AGGLFA+DR +F
Sbjct: 271 FE----YTASPLVRGGFNWGLHFKWDQVPQQLLQGPDGAAAPINSPTMAGGLFAIDREYF 326

Query: 272 LELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG 331
            ELG YD G+ +WGGEN E+SF+IWMCGG++E +PCSR+GHV+R   PY      D    
Sbjct: 327 DELGRYDEGMDIWGGENLEISFRIWMCGGTLEIIPCSRVGHVFRKRRPYGSPNGED---- 382

Query: 332 PLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
             ++ N  R+   W DE    YF  R P       GDIS++
Sbjct: 383 -TMSKNSLRMAHVWMDEYKDQYFSLR-PEMKTRTYGDISDR 421


>gi|260787295|ref|XP_002588689.1| hypothetical protein BRAFLDRAFT_248153 [Branchiostoma floridae]
 gi|229273857|gb|EEN44700.1| hypothetical protein BRAFLDRAFT_248153 [Branchiostoma floridae]
          Length = 415

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 154/368 (41%), Positives = 224/368 (60%), Gaps = 14/368 (3%)

Query: 12  NLEPPLEPYK-EGPGEGGKAYH--LPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           N++   EP     PG  G+A    +P+ ++A  +A       N   S+ I ++R++PD R
Sbjct: 17  NIDATTEPRDPHAPGARGRAVEDAMPQ-HQADIEAGWKAASFNQFVSDLIPYERSLPDTR 75

Query: 69  MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
              C   +   DLP  S+I+ F  E +S+L+R+VHS+I R+P   +EEI+L+DD S ++ 
Sbjct: 76  PPRCAEQEVADDLPTTSIIMCFCEESWSTLLRSVHSVINRSPPHLVEEILLIDDASRRSH 135

Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
           L QKL+ Y+ +F  +VR++   ER GLIR R +GA+ + G V+ FLD+H E  + WL PL
Sbjct: 136 LKQKLDQYMSKF-PQVRVVHLKERAGLIRARLKGAELATGTVLTFLDSHIECNVGWLEPL 194

Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
           L  I  DR  +  P ID ++  T+ +    E   + RG F+W + ++   LP  EAK+R 
Sbjct: 195 LDRIREDRTRVVCPSIDRVNEATFAYEVANE---NVRGGFDWELFFQWVSLPAVEAKRRT 251

Query: 249 YN---SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
           +N    E  +SPT AGGLF++DR FF ELGGYDPG  +WGGEN ELSFKIWMCGGS+E +
Sbjct: 252 HNVFQHEVIRSPTMAGGLFSIDRGFFYELGGYDPGFQIWGGENLELSFKIWMCGGSLEIL 311

Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL- 364
           PCSR+GHV+R   PYN+      ++  ++ +N  R+ E W DE  K Y+     + + L 
Sbjct: 312 PCSRVGHVFRKSQPYNYSNATSIME--VVHHNNVRLAEVWLDEYKKIYYALHPGVEVELA 369

Query: 365 DMGDISEQ 372
            MGDISE+
Sbjct: 370 KMGDISER 377


>gi|351712481|gb|EHB15400.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Heterocephalus
           glaber]
          Length = 399

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 150/361 (41%), Positives = 219/361 (60%), Gaps = 10/361 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           LEP  +P+ EGPGE GK   +P+  +           +N+  S  I+ +R++P+ R+E C
Sbjct: 27  LEPVQKPH-EGPGEMGKPVDIPKEDQEKMKEMFKINQVNLMASEMIALNRSLPNDRLEGC 85

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           K   YP +LP  SV++VFHNE +S+L+RTVHS+I  +P   +EEI+LVDD + +  L + 
Sbjct: 86  KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINCSPRHMVEEIVLVDDANERDFLKRT 145

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE Y+++    V +IR   R GLIR R +G   S+G+VI+FLDAHCE  + WL PLL  I
Sbjct: 146 LESYVKKLKVPVHVIRMEHRSGLIRDRLKGDAVSKGQVIIFLDAHCECTVGWLEPLLTRI 205

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             DR+ +  P+ID I   T  F  +   D  Y G F W + ++   +P+RE  +RK + +
Sbjct: 206 KQDRRTVVCPIIDVISDDT--FECMAGSDMTYGG-FNWKLNFRWYLVPQREMDRRKGDRT 262

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGG F++DR +F E+G YD G+ +WG EN E+SF+IW CGG++E V CS +G
Sbjct: 263 LPVRTPTMAGGCFSIDRDYFQEIGTYDAGMDIWGRENLEISFRIWQCGGTLEIVTCSHVG 322

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV++   PY F        G +I  N +R+ E W DE  K +FY   P    +D GD+S 
Sbjct: 323 HVFQKATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDVSS 377

Query: 372 Q 372
           +
Sbjct: 378 R 378


>gi|410953274|ref|XP_003983297.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           1 [Felis catus]
          Length = 608

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 161/377 (42%), Positives = 224/377 (59%), Gaps = 17/377 (4%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETSN 56
           V ++  K+  ++  ++ + E P +G   +         E  +   D    ++  NM  SN
Sbjct: 66  VLESQFKVNRIDDMIDNHVEDPEKGNTKFSSELGMIFDERDQELRDLGYQKHAFNMLISN 125

Query: 57  HISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 116
            + + R +PD R   CK   YP DLP ASV++ F+NE  S+L+RTVHS++ RTPAQ L E
Sbjct: 126 RLGYRRDVPDTRNAACKDKSYPADLPVASVVICFYNEALSALLRTVHSVLDRTPAQLLHE 185

Query: 117 IILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
           IILVDD S   DL  +LE+Y+Q++  GK+++IRNT+REGLIR R  GA  + GEV+VFLD
Sbjct: 186 IILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLD 245

Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 235
           +HCEV + WL PLLA I  D + +  PVID I   T      Y      RG F WG+ +K
Sbjct: 246 SHCEVNVLWLQPLLAAIREDPRTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFK 301

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
            + +P  E    +  + P +SPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+I
Sbjct: 302 WDLVPLSELGGPEGATAPIRSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRI 361

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           WMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R+   W DE  + YF 
Sbjct: 362 WMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYFS 416

Query: 356 TREPLAMFLDMGDISEQ 372
            R  L      G+ISE+
Sbjct: 417 LRPDLRT-KSYGNISER 432


>gi|410953276|ref|XP_003983298.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           2 [Felis catus]
          Length = 527

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 155/332 (46%), Positives = 207/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R   CK   YP DLP ASV++ F+NE  S+L+RT
Sbjct: 30  DLGYQKHAFNMLISNRLGYRRDVPDTRNAACKDKSYPADLPVASVVICFYNEALSALLRT 89

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS++ RTPAQ L EIILVDD S   DL  +LE+Y+Q++  GK+++IRNT+REGLIR R 
Sbjct: 90  VHSVLDRTPAQLLHEIILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNTKREGLIRGRM 149

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  D + +  PVID I   T      Y  
Sbjct: 150 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDPRTVVCPVIDIISADTL----AYSS 205

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E    +  + P +SPT AGGLFAM+R +F ELG YD G
Sbjct: 206 SPVVRGGFNWGLHFKWDLVPLSELGGPEGATAPIRSPTMAGGLFAMNRHYFNELGQYDSG 265

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 266 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 320

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R P       G+ISE+
Sbjct: 321 LAHVWLDEYKEQYFSLR-PDLRTKSYGNISER 351


>gi|345492127|ref|XP_001602037.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Nasonia vitripennis]
          Length = 635

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 147/353 (41%), Positives = 214/353 (60%), Gaps = 9/353 (2%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           ++ PGE GKA H+P    A           N+  S+ IS +R++ D+R+  CK   +P  
Sbjct: 123 RDSPGEMGKAVHIPPEQDAIQQELFKLNQFNLMASDMISLNRSLKDVRLSGCKSKKFPKL 182

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  S+++VFHNE +S+L+RTV S+I R+P   L+EIILVDD S +  L QKLEDY++  
Sbjct: 183 LPDTSIVIVFHNEAWSTLLRTVWSVINRSPRALLKEIILVDDASEREHLKQKLEDYVETL 242

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
                + R  +R GLIR R  GAK  +G+VI FLDAHCE    WL PLLA I  D+K + 
Sbjct: 243 PVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIAHDKKTVV 302

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
            P+ID I   T+E+  +   D  + G F W + ++   + +RE  +R  + + P ++PT 
Sbjct: 303 CPIIDVISDDTFEY--ITASDMTWGG-FNWKLNFRWYRVAQREMDRRNGDRTAPLRTPTM 359

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG +E  PCS +GHV+R   P
Sbjct: 360 AGGLFSIDKDYFYELGAYDEGMDIWGGENLEMSFRVWQCGGILEISPCSHVGHVFRDKSP 419

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           Y F     ++    + +N  RV E W DE  + ++Y   P A  + +GD+SE+
Sbjct: 420 YTFPGGVSKI----VLHNAARVAEVWMDE-WRDFYYAMNPGARNVPVGDVSER 467


>gi|194210168|ref|XP_001915003.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Equus
           caballus]
          Length = 609

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 154/332 (46%), Positives = 208/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R   CK   YP DLP ASV++ F+NE  S+L+RT
Sbjct: 112 DLGYQKHAFNMLISNRLGYHREVPDTRNAACKDKSYPTDLPVASVVICFYNEALSALLRT 171

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS++ RTPA+ L E+ILVDD S   DL  +L++Y+Q++  GK+++IRNT+REGLIR R 
Sbjct: 172 VHSVLDRTPARLLHEVILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 231

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  DR+++  PVID I   T      Y  
Sbjct: 232 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAVIQEDRRMVVCPVIDIISADTL----AYSS 287

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E    +  + P KSPT AGGLFAM R +F ELG YD G
Sbjct: 288 SPVVRGGFNWGLHFKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMSRRYFSELGQYDSG 347

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 348 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 402

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 403 LAYVWLDEYKEQYFSLRPDLRT-KSYGNISER 433


>gi|390350617|ref|XP_784979.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Strongylocentrotus purpuratus]
          Length = 647

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 149/352 (42%), Positives = 212/352 (60%), Gaps = 14/352 (3%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
           GE GK        +   DA   +   N+  S+ I+F+R++PD+R ++CK   YP  LP  
Sbjct: 235 GEMGKPVIFEGDMKTHADALYHKNAFNLLASDMIAFNRSLPDVRPQQCKSLVYPEVLPTT 294

Query: 85  SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF---N 141
           SVI++FHNE FS+L+RTVHS+I R+P   L+EIILVDD S++  L  KL+DYI R    +
Sbjct: 295 SVIIIFHNEAFSALLRTVHSVINRSPRHLLKEIILVDDASTQEHLKVKLDDYISRHFHSS 354

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
            +VR+ R   R GLIR R  GA  + G+++ FLD+HCEV + WL PLLA I  DR+ +  
Sbjct: 355 ARVRIERLPTRSGLIRARIHGALNAIGDILTFLDSHCEVNVGWLEPLLAVIDKDRRNVVT 414

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
           P ID ID     ++   +      G F W M ++   +   + ++ K N + P +SPT A
Sbjct: 415 PTIDVIDDNDLAYKGSDQLPQ--VGSFGWTMAFRWTAIQTMDLEEAKRNPTLPIRSPTMA 472

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLF++D+ +F+ELG YDPG  +WG EN ELSFK WMCGGS+  + CS +GH++R F PY
Sbjct: 473 GGLFSIDKGYFMELGMYDPGFQIWGAENIELSFKTWMCGGSLYTMACSHVGHIFRKFAPY 532

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +         G     N KR+IE W  +  +A++Y   P  + +D GDI +Q
Sbjct: 533 SG-------MGSYFHRNNKRLIEVWLGDA-RAFYYKLHPDVLRIDAGDIQDQ 576


>gi|196001849|ref|XP_002110792.1| hypothetical protein TRIADDRAFT_22976 [Trichoplax adhaerens]
 gi|190586743|gb|EDV26796.1| hypothetical protein TRIADDRAFT_22976 [Trichoplax adhaerens]
          Length = 515

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 153/354 (43%), Positives = 210/354 (59%), Gaps = 10/354 (2%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           K+ PGE GKA  +P+ +             N   S+ IS  R +PD R + CK   YP D
Sbjct: 7   KDAPGENGKAVDIPKEFLIESKRLFERNKFNQWASDKISLHRILPDARPKLCKDKVYPGD 66

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  SV++VFHNE +S+L+RT+HS++ RT    L EIILVDD S   +L   L+ YI + 
Sbjct: 67  LPPTSVVIVFHNEAWSTLLRTIHSVLDRTAPDLLIEIILVDDKSVVKELHAPLDAYIAKL 126

Query: 141 NGKVRLIRNTEREGLIRTRSRGAK--ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
             KV++IRN +REGLIR+R  G     S+  V+ FLDAHCE    WL PLL  IY+DR  
Sbjct: 127 -AKVKIIRNKKREGLIRSRLNGKSFAASKAPVVTFLDAHCEANTGWLEPLLERIYNDRST 185

Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPT 258
           +  P ID I  + + ++  Y P    RGIF W + ++   +   E K+R+   +P ++PT
Sbjct: 186 VVCPEIDVISDENFAYQ--YGPSGLMRGIFNWDLHFRWRAVSTEEQKRRQSPIDPVRTPT 243

Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
            AGGLFA++R +F E+G YD  + +WGGEN E+SF+IW CGG++E VPCS +GHV+R   
Sbjct: 244 MAGGLFAINRDYFKEIGTYDEEMDIWGGENLEISFRIWQCGGTLEIVPCSHVGHVFRKSQ 303

Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           PY F K      G     N +RV E W D  +K +FY R+P       GDIS++
Sbjct: 304 PYGFPKGVVDTLGK----NSQRVAEVWMD-GYKEFFYQRQPHLRGHAYGDISKR 352


>gi|440895697|gb|ELR47827.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Bos grunniens
           mutus]
          Length = 606

 Score =  288 bits (737), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 155/332 (46%), Positives = 208/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R   CK   YP DLP ASV++ F+NE  S+L+RT
Sbjct: 109 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYPADLPVASVVICFYNEALSALLRT 168

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS++ RTPA+ L EIILVDD S   DL  +L++YIQ++  GK+++IRN +REGLIR R 
Sbjct: 169 VHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLPGKIKVIRNPKREGLIRGRM 228

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  DR+ +  PVID I   T      Y  
Sbjct: 229 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRQTVVCPVIDIISADTL----AYSS 284

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E    +  + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 285 SPVVRGGFNWGLHFKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSG 344

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 345 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 399

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L    + G+ISE+
Sbjct: 400 LAHVWLDEYKEQYFSLRPDLRT-RNYGNISER 430


>gi|195129477|ref|XP_002009182.1| GI11401 [Drosophila mojavensis]
 gi|193920791|gb|EDW19658.1| GI11401 [Drosophila mojavensis]
          Length = 673

 Score =  288 bits (737), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 162/353 (45%), Positives = 218/353 (61%), Gaps = 15/353 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLG-EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           G GE G A  L +  +   + +L  E G N   S+ IS +R++PD+R ++C+   Y   L
Sbjct: 146 GIGEQGVAAKLEDESQREYERALSLENGFNALLSDSISVNRSVPDIRHKDCRKKLYLSKL 205

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI++F+NE  S LMR+VHS+I R+P + L+EIILVDDFS +  L + LEDYI +  
Sbjct: 206 PTVSVIIIFYNEYMSVLMRSVHSLINRSPPELLKEIILVDDFSDRDYLFKPLEDYIAQHF 265

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
            KVR++R   R GLI  RS GA+ +  EV++FLD+H E   NWLPPLL PI  +++    
Sbjct: 266 TKVRVVRLPRRTGLIGARSAGARNATAEVLIFLDSHVEANYNWLPPLLEPIAQNKRTAVC 325

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTHA 260
           P ID ID+  + +R+    D   RG F+W   YK    LPE      K+ +EP+KSP  A
Sbjct: 326 PFIDVIDHSNFNYRA---QDEGARGAFDWDFFYKRLPLLPE----DLKHPAEPFKSPVMA 378

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+   FF ELGGYD GL +WGGE +ELSFKIWMCGG +   PCSRIGH+YR   P 
Sbjct: 379 GGLFAISAEFFWELGGYDEGLDIWGGEQYELSFKIWMCGGQMYDAPCSRIGHIYRG--PR 436

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT-REPLAMFLDMGDISEQ 372
           N   +++   G  +  NYKRV E W DE +K Y Y   + +   +D GD++ Q
Sbjct: 437 NH--VSNPRGGDYLHKNYKRVAEVWMDE-YKQYLYNGADGVYERIDAGDLTAQ 486


>gi|357625888|gb|EHJ76177.1| hypothetical protein KGM_07902 [Danaus plexippus]
          Length = 535

 Score =  288 bits (737), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 154/354 (43%), Positives = 211/354 (59%), Gaps = 16/354 (4%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE G   HLP              G N   S+ I  +R++PD+R   C+   Y   
Sbjct: 12  ERGIGEHGLPAHLPIKDSEIEKDLYAVNGFNGALSDKIPLNRSLPDIRHPGCQNRLYIES 71

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  SV++ FHNE +S+L+RT +S++ R+P   ++E+ LVDD S+K  L ++L+DY+ + 
Sbjct: 72  LPTVSVVVPFHNEHWSTLLRTAYSVLNRSPTFLIKEVFLVDDASTKDFLKEQLDDYVSKH 131

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             KV++IR   R GLI  R  GA+++  +V+VFLD+H E  +NWLPPLL PI  + K + 
Sbjct: 132 MPKVKIIRLKSRSGLIAARLAGAEKATADVLVFLDSHTEANVNWLPPLLEPIALNYKTVV 191

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKKRKYNSEPYKSPTH 259
            P ID + Y T+ +R+    D   RG F+W + YK    LP  EA       EP+ SP  
Sbjct: 192 CPFIDVVAYDTFAYRA---QDEGARGAFDWELFYKRLPVLPADEANM----PEPFPSPVM 244

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLFA+ R FF ELGGYDPGL +WGGE +ELSFK+W CGG +   PCSR+GH+YR F P
Sbjct: 245 AGGLFAISRVFFWELGGYDPGLDIWGGEQYELSFKLWQCGGKMLDAPCSRVGHIYRKFAP 304

Query: 320 Y-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           + N G       G  +  NY+RV E W DE +  Y Y R P  + +D GDIS+Q
Sbjct: 305 FPNPG------HGDFVGKNYRRVAEVWMDE-YAQYLYKRRPHYLKIDTGDISKQ 351


>gi|73979014|ref|XP_539924.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Canis
           lupus familiaris]
          Length = 608

 Score =  288 bits (737), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 159/377 (42%), Positives = 224/377 (59%), Gaps = 17/377 (4%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETSN 56
           V ++  K+  ++  ++ + E P +G   +         E  +   D    ++  NM  SN
Sbjct: 66  VLESQFKVNRIDDKIDNHVEDPEKGNIKFSSELGMIFNERDQELRDLGYQKHAFNMLISN 125

Query: 57  HISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 116
            + + R +PD R   C+   +P DLP ASV++ F+NE  S+L+RTVHS++ RTPAQ L E
Sbjct: 126 RLGYHRDVPDTRNAACRDKSFPADLPAASVVICFYNEALSALLRTVHSVLDRTPAQLLHE 185

Query: 117 IILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
           IILVDD S   DL  +LE+Y+Q++  GK+++IRN +REGLIR R  GA  + GEV+VFLD
Sbjct: 186 IILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNIKREGLIRGRMIGAAHATGEVLVFLD 245

Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 235
           +HCEV + WL PLLA I  D++ +  PVID I   T      Y      RG F WG+ +K
Sbjct: 246 SHCEVNVMWLQPLLAAIQEDQQTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFK 301

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
            + +P  E    +  + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+I
Sbjct: 302 WDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRI 361

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           WMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R+   W DE  + YF 
Sbjct: 362 WMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLRLAHVWLDEYKEQYFS 416

Query: 356 TREPLAMFLDMGDISEQ 372
            R  L      G+ISE+
Sbjct: 417 LRPDLRT-KSYGNISER 432


>gi|332243648|ref|XP_003270990.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           1 [Nomascus leucogenys]
          Length = 608

 Score =  288 bits (736), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 154/332 (46%), Positives = 206/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R   C+   YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACQEKFYPPDLPSASVVICFYNEAFSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
            HS+I RTPA  L EIILVDD S   DL  +L++Y+Q++  GK+++IRNT+REGLIR R 
Sbjct: 171 AHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  D+  +  PVID I   T      Y  
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDQHTVVCPVIDIISADTL----AYSS 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E    +  + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELGGAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 432


>gi|358412070|ref|XP_870404.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
           3 [Bos taurus]
 gi|359064998|ref|XP_002687097.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Bos
           taurus]
          Length = 606

 Score =  288 bits (736), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 154/332 (46%), Positives = 208/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R   CK   YP DLP AS+++ F+NE  S+L+RT
Sbjct: 109 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYPADLPVASIVICFYNEALSALLRT 168

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS++ RTPA+ L EIILVDD S   DL  +L++YIQ++  GK+++IRN +REGLIR R 
Sbjct: 169 VHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLPGKIKVIRNPKREGLIRGRM 228

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  DR+ +  PVID I   T      Y  
Sbjct: 229 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRRTVVCPVIDIISADTL----AYSS 284

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E    +  + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 285 SPVVRGGFNWGLHFKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSG 344

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 345 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 399

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L    + G+ISE+
Sbjct: 400 LAHVWLDEYKEQYFSLRPDLRT-RNYGNISER 430


>gi|196000745|ref|XP_002110240.1| hypothetical protein TRIADDRAFT_22839 [Trichoplax adhaerens]
 gi|190586191|gb|EDV26244.1| hypothetical protein TRIADDRAFT_22839 [Trichoplax adhaerens]
          Length = 481

 Score =  288 bits (736), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 155/364 (42%), Positives = 214/364 (58%), Gaps = 14/364 (3%)

Query: 13  LEPPLEPYKEGP-GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
             P L  Y+    GE G+A  +P  Y+   D        N   S+ IS  RT+PD R   
Sbjct: 5   FRPTLPHYRRNSYGENGQAVVVPAVYKEESDRLFSRNRFNQWASDRISLHRTLPDQRPAA 64

Query: 72  CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS---KAD 128
           C+   +P +LP AS+++VFHNE +S+L+RTVHS++ R+  + + EIILVDD S      +
Sbjct: 65  CRKQLFPTNLPPASLVIVFHNEAWSTLLRTVHSVLDRSDPRLMREIILVDDCSEIKGHEE 124

Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
           L   LE YIQ+    V+L+RN +R+GLIR R RG KE    VIVFLDAHCEV   WL PL
Sbjct: 125 LQAPLEKYIQKLK-IVKLVRNKKRQGLIRARLRGYKEVTSPVIVFLDAHCEVVDGWLEPL 183

Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
           LA I+ +R  +  P ID I ++ +     Y      RG+F W + ++   LP  E ++RK
Sbjct: 184 LARIHENRSNVVCPEIDVISFENFG----YSYASGIRGVFNWNLHFRWRTLPAVEQQRRK 239

Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
              +P +SPT AGGLFA+ + +F ++G YD  + +WGGEN E+SF+IW CGG++E +PCS
Sbjct: 240 SVIDPIRSPTMAGGLFAIHKKYFEDIGLYDDEMDIWGGENLEMSFRIWQCGGNLEIIPCS 299

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
            +GHV+R   PY F K A    G  +  N +RV E W D  +K  FY R P       GD
Sbjct: 300 HVGHVFRKSQPYTFPKGA----GETLNKNLQRVAEVWMD-NYKDIFYNRFPNLRQHSYGD 354

Query: 369 ISEQ 372
           IS++
Sbjct: 355 ISKR 358


>gi|268564602|ref|XP_002647197.1| C. briggsae CBR-GLY-10 protein [Caenorhabditis briggsae]
          Length = 623

 Score =  287 bits (735), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 156/355 (43%), Positives = 214/355 (60%), Gaps = 19/355 (5%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKYWDY 77
           +EGPGE GK   LP+      +A L  Y   G N   S+ IS +R+I D+R +ECK   Y
Sbjct: 95  REGPGEWGKPVKLPDDKETEKEA-LSLYKANGYNAYISDMISLNRSIKDIRHKECKKMTY 153

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  SVI  FH E  S+L+R+V+S+I R+P + L+EIILVDDFS K  L Q LED++
Sbjct: 154 SAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLEDFL 213

Query: 138 QR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           ++   +  V+++R  +REGLIR R  GA+E+ GE+++FLDAH E   NWLPPLL PI  D
Sbjct: 214 KKNKIDHIVKILRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLPPLLDPIADD 273

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYK 255
            + +  P +D ID +T+E R     D   RG F+W   YK   L +++   R+  ++P+ 
Sbjct: 274 YRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTKKD---RENPTKPFD 327

Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
           SP  AGG FA+   +F ELGGYD GL +WGGE +ELSFK+W C G +   PCSR+ H+YR
Sbjct: 328 SPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGKMVDAPCSRVAHIYR 387

Query: 316 S-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
             + P+    + D      ++ NYKRV E W DE +K   Y   P     D GD+
Sbjct: 388 CKYAPFKNAGMGD-----FVSRNYKRVAEVWMDE-YKETLYKHRPGIGNADAGDL 436


>gi|17553814|ref|NP_498722.1| Protein GLY-3 [Caenorhabditis elegans]
 gi|21264486|sp|P34678.2|GALT3_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
           AltName: Full=GalNAc-T1; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 3; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 3; Short=pp-GaNTase 3
 gi|3047187|gb|AAC13669.1| GLY3 [Caenorhabditis elegans]
 gi|351020565|emb|CCD62541.1| Protein GLY-3 [Caenorhabditis elegans]
          Length = 612

 Score =  287 bits (735), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 155/380 (40%), Positives = 228/380 (60%), Gaps = 16/380 (4%)

Query: 1   RPVFKADGKLGN-LEPPLEPYKEGPG---EGGKAYHLPEAYRAAGDASLGEYGMNMETSN 56
           + V+  D +  N L   +E    GPG   +GG    +PE  +   +    E   N+  S 
Sbjct: 82  KQVYPVDKETANQLRKLMETQAFGPGYHGQGGTGVTVPEDKKTIKEKRFLENQFNVVASE 141

Query: 57  HISFDRTIPDLRMEECKYWDYPLD---LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 113
            IS +RT+PD R + C+     L    +PK S+I+VFHNE +++L+RT+HS+I R+P   
Sbjct: 142 MISVNRTLPDYRSDACRTSGNNLKTAGMPKTSIIIVFHNEAWTTLLRTLHSVINRSPRHL 201

Query: 114 LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 173
           LEEIILVDD S +  L + L+ YI+ F   + L+    R GLIR R  G++ ++G++++F
Sbjct: 202 LEEIILVDDKSDRDYLVKPLDSYIKMFPIPIHLVHLENRSGLIRARLTGSEMAKGKILLF 261

Query: 174 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 233
           LDAH EV   WL PL++ +  DRK +  P+ID I   T+E+ +  E      G F W + 
Sbjct: 262 LDAHVEVTDGWLEPLVSRVAEDRKRVVAPIIDVISDDTFEYVTASETTW---GGFNWHLN 318

Query: 234 YKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELS 292
           ++   +P+RE  +R  + S P ++PT AGGLFA+D+ FF ++G YD G+ VWGGEN E+S
Sbjct: 319 FRWYAVPKRELNRRGSDRSMPIQTPTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEIS 378

Query: 293 FKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKA 352
           F++WMCGGS+E  PCSR+GHV+R   PY F     +V    I +N  R  E W DE +KA
Sbjct: 379 FRVWMCGGSLEIHPCSRVGHVFRKQTPYTFPGGTAKV----IHHNAARTAEVWMDE-YKA 433

Query: 353 YFYTREPLAMFLDMGDISEQ 372
           +FY   P A  ++ GD+SE+
Sbjct: 434 FFYKMVPAARNVEAGDVSER 453


>gi|241998138|ref|XP_002433712.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
 gi|215495471|gb|EEC05112.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
          Length = 653

 Score =  287 bits (735), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 144/313 (46%), Positives = 196/313 (62%), Gaps = 10/313 (3%)

Query: 47  EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
           ++  N+  SN + F R++PD R   C+  ++  +LP ASV++ F+NE +S+L+RTVH+++
Sbjct: 154 QHAFNLLISNRLGFYRSLPDTRNPLCRSEEHGAELPTASVVVCFYNEAWSTLLRTVHTVL 213

Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKE 165
            RTP   L E+ILVDD S++ DL  +L +Y+  +    VRLIR  +REGLIR R  GA+ 
Sbjct: 214 GRTPRHLLHEVILVDDNSTQVDLGPQLAEYVSSQLPSHVRLIRTRDREGLIRARMFGARN 273

Query: 166 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 225
           + GEV+VFLD+HCEV + WL PLL  I ++R  +T P+ID I+  T+E    Y      R
Sbjct: 274 ASGEVLVFLDSHCEVNVGWLEPLLERIRANRATVTCPIIDIINADTFE----YTASPIVR 329

Query: 226 GIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWG 285
           G F WG+ +K    P   A+K +    P  SPT AGGLFAMDR FF  LG YD G+ +WG
Sbjct: 330 GGFNWGLHFKWESPPAGLARKGRGAIAPIPSPTMAGGLFAMDRKFFHRLGEYDDGMDIWG 389

Query: 286 GENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETW 345
           GEN E+SF+IWMCGG +E +PCSR+GHV+R   PY      D      +T N  RV   W
Sbjct: 390 GENLEISFRIWMCGGQLEIIPCSRVGHVFRRRRPYGSPNGED-----TLTKNSLRVAHVW 444

Query: 346 FDEKHKAYFYTRE 358
            D+  K YF TR 
Sbjct: 445 MDDYKKYYFQTRS 457


>gi|410910794|ref|XP_003968875.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Takifugu rubripes]
          Length = 583

 Score =  287 bits (735), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 165/372 (44%), Positives = 220/372 (59%), Gaps = 24/372 (6%)

Query: 8   GKLGNLEPPL----EPYKEGPGEGGKAYHL---PEAYRAAGDASLGEYGMNMETSNHISF 60
           G  G L  PL     P    PGE G+A HL   P+  +   D S+  Y +N+  S+ IS 
Sbjct: 60  GPEGQLARPLYVKPPPDTNAPGELGRAAHLNLSPDEKKQEED-SIERYAINIFVSDKISL 118

Query: 61  DRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
            R I D RM+EC  K ++Y   LP  SVI+ F+NE +S+L+RT+HS+++ TPA  L+EII
Sbjct: 119 HRHIQDHRMKECRSKTFNY-RRLPTTSVIIAFYNEAWSTLLRTIHSVLETTPAILLKEII 177

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 178
           L+DDFS +A L  +L DYI     +VRLIR  +REGL+R R  GA  + GEV+ FLD HC
Sbjct: 178 LIDDFSDRAYLKSQLADYISNLE-RVRLIRTKKREGLVRARLIGATYATGEVLTFLDCHC 236

Query: 179 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE 238
           E    W+ PLL  I  +   +  PVID ID+ T+EF    + +    G F+W + ++ + 
Sbjct: 237 ECVPGWIEPLLERIGENSSTIVCPVIDTIDWNTFEF--YMQTEEPMIGGFDWRLTFQWHS 294

Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
           +PERE K+RK   +P +SPT AGGLFA+++ FF  LG YD G+ VWGGEN ELSF++W C
Sbjct: 295 VPERERKRRKSPVDPIRSPTMAGGLFAVNKNFFEYLGTYDMGMEVWGGENLELSFRVWQC 354

Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
           GGS+E  PCS +GHV+    PY           P    N  R  E W D  +K +FY R 
Sbjct: 355 GGSLEIHPCSHVGHVFPKKAPY---------ARPNFLQNTVRAAEVWMD-SYKQHFYNRN 404

Query: 359 PLAMFLDMGDIS 370
           P A     GDIS
Sbjct: 405 PPARKETYGDIS 416


>gi|324503401|gb|ADY41481.1| N-acetylgalactosaminyltransferase 6 [Ascaris suum]
          Length = 927

 Score =  287 bits (735), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 150/354 (42%), Positives = 219/354 (61%), Gaps = 11/354 (3%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC--KYWDYPLD 80
           G GE G+   L E      D + G    N+  S+ I+ +R++PD+R  +C  K +  P +
Sbjct: 98  GVGEDGRPVKLDELEDRLSDDTFGINQFNLIISDKIALNRSLPDVRKHQCRDKIYPAPSE 157

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  SVI+V+HNE FS+L+RTV S+I R+P + L+EIILVDDFSS++ L   L++++   
Sbjct: 158 LPTTSVIIVYHNEAFSTLLRTVVSVIDRSPKEVLKEIILVDDFSSRSFLKDDLDNFVVTL 217

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             ++++IR   R GLIR R  GA E+ GEV+ FLD+HCE    WL PLLA I  +RK + 
Sbjct: 218 GIRIKIIRAQRRVGLIRARLMGANEADGEVLTFLDSHCECTKGWLEPLLARIKENRKAVV 277

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
            PVID I+ +T+ ++   E    +RG F W + ++   +P    K R  + + P +SPT 
Sbjct: 278 CPVIDVINDRTFAYQKGIE---LFRGGFNWNLQFRWYAVPPDIVKGRANDPTMPIQSPTM 334

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLF++D+ +F ELG YDPG+ +WGGEN E+SF+IW CGG IE +PCS +GH++R   P
Sbjct: 335 AGGLFSIDKRYFEELGAYDPGMEIWGGENIEISFRIWQCGGRIEILPCSHVGHIFRKASP 394

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG-DISEQ 372
           ++F     +  G ++  N  RV E W DE  K  FY   P A+ +    D+SE+
Sbjct: 395 HDF---PGKSSGKILNSNLLRVAEVWMDE-WKYLFYKTAPQALQMRSSIDVSER 444


>gi|224044641|ref|XP_002188932.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
           [Taeniopygia guttata]
          Length = 608

 Score =  287 bits (735), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 155/362 (42%), Positives = 213/362 (58%), Gaps = 17/362 (4%)

Query: 5   KADGKLGN-----LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHIS 59
           + D K+GN     ++ P++   E   E G  ++  E  +   D    ++  NM  SN + 
Sbjct: 71  QKDNKIGNSFGNHIQDPVKGEIEFSPEMGMIFN--EEDQEVRDLGYQKHAFNMLISNRLG 128

Query: 60  FDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
           + R +PD R  +C+   YP DLP ASV++ F+NE  S+L+RTVHS++ RTPA  L EIIL
Sbjct: 129 YHREVPDTRDAKCREKSYPADLPSASVVICFYNEALSALLRTVHSVLDRTPAHLLHEIIL 188

Query: 120 VDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 178
           VDD S  ADL + L +Y++ +     +L+RN +REGLIR R  GA  + G+V+VFLD+HC
Sbjct: 189 VDDNSELADLKKDLSEYVKTQLPRTTKLVRNEKREGLIRGRMIGASHATGKVLVFLDSHC 248

Query: 179 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE 238
           EV   WL PLLAPI  D + +  PVID I   T      Y      RG F WG+ +K + 
Sbjct: 249 EVNEMWLQPLLAPIREDPRTVVCPVIDIISADTL----TYSSSPVVRGGFNWGLHFKWDL 304

Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
           +P  E +  +  + P KSPT AGGLFAMDR +F ELG YD G+ +WGGEN E+SF+IWMC
Sbjct: 305 VPLAELEGPEGATAPIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISFRIWMC 364

Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
           GG +  +PCSR+GH++R   PY      D      + +N  R+   W DE  + YF  R 
Sbjct: 365 GGRLLIIPCSRVGHIFRKRRPYGSPGGQD-----TMAHNSLRLAHVWMDEYKEQYFALRP 419

Query: 359 PL 360
            L
Sbjct: 420 EL 421


>gi|360043880|emb|CCD81426.1| putative n-acetylgalactosaminyltransferase [Schistosoma mansoni]
          Length = 526

 Score =  287 bits (735), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 156/354 (44%), Positives = 220/354 (62%), Gaps = 13/354 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           ++GPGE G    L  + +A    +L   G N+  S  I  DR++ D+R   CK   Y   
Sbjct: 2   RQGPGENGLPVRLSNSQKALSKKTLNFNGFNIFVSEKIKTDRSVKDIRYPNCKGALYSKQ 61

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  S+I+  + E + +L+RTV S++ R+P + ++E+ILVDD SS+  L ++L++Y+ R 
Sbjct: 62  LPLVSIIIPVYEEHWETLIRTVVSVLNRSPLELIKEVILVDDGSSRRYLKERLDNYLSRT 121

Query: 141 --NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
              G V +I   EREGLIR R  GAK + G+V++FLD+HCE  +NWLPPLL PI  + + 
Sbjct: 122 YPGGLVWVIHLKEREGLIRARLSGAKLATGDVLIFLDSHCETNVNWLPPLLDPISKNYRT 181

Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPT 258
           +T P ID ID  T+E+R+    D   RG F+W   YK   LP R +    +   P++SP 
Sbjct: 182 VTCPFIDVIDADTFEYRA---QDDGARGAFDWSFYYKR--LP-RLSTDSLHPETPFESPV 235

Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
            AGGLFA+ R +F ELGGYDP L +WGGE +ELSFKIWMCGG +  VPCSR+GH++R + 
Sbjct: 236 MAGGLFAISRKWFWELGGYDPLLHIWGGEQYELSFKIWMCGGRLIDVPCSRVGHIFREY- 294

Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           P NF +   ++K   +  N+KRV E W DE +K Y Y   P    +D GD+S+Q
Sbjct: 295 PTNFPQ--PKIKN-FLRRNFKRVAEVWMDE-YKEYIYRSLPECRKVDPGDLSQQ 344


>gi|326923136|ref|XP_003207797.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Meleagris gallopavo]
          Length = 556

 Score =  287 bits (735), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 158/371 (42%), Positives = 230/371 (61%), Gaps = 13/371 (3%)

Query: 7   DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           D K  +L P L       +EGPGE GKA  +P+  +            N+  S+ I+ +R
Sbjct: 36  DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           ++PD+R++ CK   YP +LP  SV++VFHNE +S+L+RTVHS++ R+P + L EIILVDD
Sbjct: 96  SLPDVRLDGCKTKVYPEELPNTSVVIVFHNEAWSTLLRTVHSVLARSPRRLLAEIILVDD 155

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L   LE+Y+++    V+++R  +R GLIR R RGA  +RG+V+ FLDAHCE   
Sbjct: 156 ASEREFLKASLENYVKKLEVPVKILRMEQRSGLIRARLRGAAAARGQVVTFLDAHCECTR 215

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA I  DR+ +  P+ID I   T+E+ +    D  Y G F W + ++   +P+R
Sbjct: 216 GWLEPLLARIREDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           E  +RK + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF++W CGGS
Sbjct: 273 EMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGSYDAGMDIWGGENLEMSFRVWQCGGS 332

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +E V CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  
Sbjct: 333 LEIVTCSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGV 387

Query: 362 MFLDMGDISEQ 372
           + +D GD+S +
Sbjct: 388 VKVDYGDVSAR 398


>gi|312372346|gb|EFR20327.1| hypothetical protein AND_20267 [Anopheles darlingi]
          Length = 616

 Score =  287 bits (735), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 154/356 (43%), Positives = 215/356 (60%), Gaps = 13/356 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GKA  L E      D    + G N   S+ IS +R++PD+R   C+   Y
Sbjct: 86  EAKRTGIGEQGKAGRLSEKEAEMKDKLFKKNGFNAVLSDLISLNRSLPDIRHRGCRKKKY 145

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             +LP  SV++ F+NE +S+L+RT  S++ R+P++ + E+ILVDD S+K  L  +LE Y+
Sbjct: 146 LSELPTVSVVVPFYNEHWSTLLRTASSVLLRSPSELIAEVILVDDCSTKDFLKGQLELYV 205

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
                KV+++R  ER GLI  R  GAK +  +V++FLD+H E  +NWLPPLL PI +D +
Sbjct: 206 GENMPKVKIVRLPERSGLIAARLAGAKVATADVLIFLDSHTEANVNWLPPLLDPIAADYR 265

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
               P ID ID+ T+E+R+    D   RG F+W   YK   L  ++       +EP++SP
Sbjct: 266 TCVCPFIDVIDWDTFEYRA---QDEGARGAFDWKFFYKRLPLLPKDLAN---PTEPFESP 319

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+   FF E+GGYD GL +WGGE +ELSFKIW CGG +   PCSR+GH+YR +
Sbjct: 320 VMAGGLFAISAKFFWEIGGYDEGLDIWGGEQYELSFKIWQCGGKMYDAPCSRVGHIYRGY 379

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMGDISEQ 372
            P+   +  D      +T NYKRV E W DE +K Y Y R+       D+GDIS+Q
Sbjct: 380 APFGNPRKKD-----FLTRNYKRVAEVWMDE-YKEYLYMRDRKKYDNTDVGDISKQ 429


>gi|311275138|ref|XP_003134591.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Sus
           scrofa]
          Length = 608

 Score =  287 bits (734), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 155/332 (46%), Positives = 207/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R   CK   Y  DLP ASVI+ F+NE  S+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYRTDLPVASVIICFYNEALSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS++ RTPA+ L EIILVDD S   DL  +L++YIQ++  GK+++IRNT+REGLIR R 
Sbjct: 171 VHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLTGKIKVIRNTKREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  DR  +  PVID I   T      Y  
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSA 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ ++ + +P  E +  +  + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFRWDLVPLSELEGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLRT-RSYGNISER 432


>gi|261260064|sp|A8Y236.2|GLT10_CAEBR RecName: Full=Putative polypeptide
           N-acetylgalactosaminyltransferase 10; Short=pp-GaNTase
           10; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 10; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 10
          Length = 629

 Score =  287 bits (734), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 156/355 (43%), Positives = 214/355 (60%), Gaps = 19/355 (5%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKYWDY 77
           +EGPGE GK   LP+      +A L  Y   G N   S+ IS +R+I D+R +ECK   Y
Sbjct: 101 REGPGEWGKPVKLPDDKETEKEA-LSLYKANGYNAYISDMISLNRSIKDIRHKECKKMTY 159

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  SVI  FH E  S+L+R+V+S+I R+P + L+EIILVDDFS K  L Q LED++
Sbjct: 160 SAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLEDFL 219

Query: 138 QR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           ++   +  V+++R  +REGLIR R  GA+E+ GE+++FLDAH E   NWLPPLL PI  D
Sbjct: 220 KKNKIDHIVKILRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLPPLLDPIADD 279

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYK 255
            + +  P +D ID +T+E R     D   RG F+W   YK   L +++   R+  ++P+ 
Sbjct: 280 YRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTKKD---RENPTKPFD 333

Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
           SP  AGG FA+   +F ELGGYD GL +WGGE +ELSFK+W C G +   PCSR+ H+YR
Sbjct: 334 SPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGKMVDAPCSRVAHIYR 393

Query: 316 S-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
             + P+    + D      ++ NYKRV E W DE +K   Y   P     D GD+
Sbjct: 394 CKYAPFKNAGMGD-----FVSRNYKRVAEVWMDE-YKETLYKHRPGIGNADAGDL 442


>gi|170043866|ref|XP_001849590.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
 gi|167867153|gb|EDS30536.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
          Length = 600

 Score =  287 bits (734), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 148/351 (42%), Positives = 213/351 (60%), Gaps = 11/351 (3%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           PGE GK   +P + +        E   N+  S+ I  +R++ D+R  +CK   Y   LP 
Sbjct: 87  PGELGKPVKIPSSQQELMKEKFKENQFNLLASDMIWLNRSLTDVRHHDCKKKHYSAKLPT 146

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
            S+++VFHNE +S+L+RT+ S+I R+P   L+EIILVDD S +  L ++LEDY+      
Sbjct: 147 TSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASERDHLGKQLEDYVSTLPVS 206

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
             ++R  +R GLIR R  GAK  +G+VI FLDAHCE    WL PLLA I  DRK +  P+
Sbjct: 207 TFVLRTGKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIVLDRKTVVCPI 266

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
           ID I  +T+E+  V   D  + G F W + ++   +P RE ++R ++ + P ++PT AGG
Sbjct: 267 IDVISDETFEY--VTASDQTWGG-FNWKLNFRWYRVPSREMQRRNHDRTAPLRTPTMAGG 323

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG +E  PCS +GHV+R   PY F
Sbjct: 324 LFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIAPCSHVGHVFRDKSPYTF 383

Query: 323 -GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            G +A+     ++  N  RV E W DE  K ++Y   P A     GD+SE+
Sbjct: 384 PGGVAN-----IVLKNAARVAEVWLDE-WKEFYYQMSPGARKASAGDVSER 428


>gi|189217666|ref|NP_001121278.1| uncharacterized protein LOC100158361 [Xenopus laevis]
 gi|115528277|gb|AAI24896.1| LOC100158361 protein [Xenopus laevis]
          Length = 600

 Score =  287 bits (734), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 151/332 (45%), Positives = 201/332 (60%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  N+  SN + + R +PD R  +C    YP DLP AS+++ F+NE  S+L+RT
Sbjct: 103 DVGYQKHAFNLLISNRLGYHRDLPDTRDSKCSKKTYPADLPLASIVICFYNEASSALLRT 162

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRS 160
           VHS++ RTPAQ L EIILVDD S   DL + L+ Y+Q   + KV+L+RN  REGLIR R 
Sbjct: 163 VHSVLDRTPAQLLHEIILVDDNSELDDLKKDLDYYMQENLSKKVKLVRNKRREGLIRGRM 222

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + G+V+VFLD+HCEV   WL PLLAPI  + K +  PVID I   T     +Y  
Sbjct: 223 VGASHATGDVLVFLDSHCEVNEMWLQPLLAPIRENPKTVVCPVIDIISADTL----IYSQ 278

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E    +  + P++SPT AGGLFAMDR +F  LG YD G
Sbjct: 279 SPVVRGGFNWGLHFKWDPVPLSELGGPEGFTAPFRSPTMAGGLFAMDREYFNTLGQYDSG 338

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGGS+  VPCSR+GH++R   PY      D      + +N  R
Sbjct: 339 MDIWGGENLEISFRIWMCGGSLLIVPCSRVGHIFRKRRPYGSPGGHD-----TMAHNSLR 393

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE    YF  R P     D GDI ++
Sbjct: 394 LAHVWMDEYKDQYFALR-PELRNRDFGDIRDR 424


>gi|116007284|ref|NP_001036338.1| polypeptide GalNAc transferase 5, isoform B [Drosophila
           melanogaster]
 gi|113194958|gb|ABI31292.1| polypeptide GalNAc transferase 5, isoform B [Drosophila
           melanogaster]
          Length = 630

 Score =  286 bits (733), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 145/362 (40%), Positives = 220/362 (60%), Gaps = 11/362 (3%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           L P ++  K  PGE GK   +P   +        E   N+  S+ IS +R++ D+R E C
Sbjct: 118 LAPSVQEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 177

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           +   Y   LP  S+++VFHNE +++L+RTV S+I R+P   L+EIILVDD S +  L ++
Sbjct: 178 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE+Y+ +   K  ++R  +R GLIR R  GA+   GEVI FLDAHCE    WL PLLA I
Sbjct: 238 LEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 297

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             +R+ +  P+ID I  +T+E+  +   D  + G F W + ++   +P RE  +R  + +
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDRT 354

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF++WMCGG +E  PCSR+G
Sbjct: 355 APLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRVWMCGGVLEIAPCSRVG 414

Query: 312 HVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           HV+R   PY F G   +     ++ +N  R++E W D+  K ++Y+  P A     GD+S
Sbjct: 415 HVFRKSTPYTFPGGTTE-----IVNHNNARLVEVWLDD-WKEFYYSFYPGARKASAGDVS 468

Query: 371 EQ 372
           ++
Sbjct: 469 DR 470


>gi|195454523|ref|XP_002074278.1| GK18434 [Drosophila willistoni]
 gi|194170363|gb|EDW85264.1| GK18434 [Drosophila willistoni]
          Length = 646

 Score =  286 bits (733), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 159/360 (44%), Positives = 218/360 (60%), Gaps = 24/360 (6%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE G+  H+    +   D      G N   S+ IS +R++PD+R EECK   Y   
Sbjct: 121 RTGMGEHGEPSHIDAQEKELEDKIYRMNGFNGLLSDRISINRSVPDVRREECKSRKYLAK 180

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-R 139
           LP+ASVI +F+NE F++L+R+++S+I RTP + L++I+LVDD S    L Q+L+DY+   
Sbjct: 181 LPQASVIFIFYNEHFNTLLRSIYSVINRTPPELLKQIVLVDDGSDWEVLKQQLDDYVSLH 240

Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
           F   V ++RN ER GLI  R  GAK + GEV+VF D+H EV  NWLPPLL PI  D KI 
Sbjct: 241 FPQLVHVVRNPERRGLIGARIAGAKVATGEVLVFFDSHIEVNYNWLPPLLEPIAIDSKIS 300

Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKKRKYNSEPYKSPT 258
           T P++D I++ T+ +   ++     RG F+W   YK+   LPE    K    S PY++P 
Sbjct: 301 TCPIVDSIEHSTFAYSGGHQ--EGSRGGFDWRFYYKQLPVLPEDSLDK----SLPYRNPV 354

Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
             GGLFA++  FF +LGGYD  L +WGGE +ELSFKIWMCGG +  VPCSR+ H++R  M
Sbjct: 355 MMGGLFAINTKFFWDLGGYDDELDIWGGEQYELSFKIWMCGGMLLDVPCSRVAHIFRGPM 414

Query: 319 -----PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMGDISEQ 372
                P N+           +  N+KRV E W D K+K Y YTR+P     +D GD+S Q
Sbjct: 415 DARPNPRNYN---------FVARNHKRVAEVWMD-KYKEYVYTRDPETYEKIDAGDLSRQ 464


>gi|196001819|ref|XP_002110777.1| hypothetical protein TRIADDRAFT_22201 [Trichoplax adhaerens]
 gi|190586728|gb|EDV26781.1| hypothetical protein TRIADDRAFT_22201 [Trichoplax adhaerens]
          Length = 518

 Score =  286 bits (733), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 148/350 (42%), Positives = 209/350 (59%), Gaps = 10/350 (2%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL-DLP 82
           PGE G+   +P  Y+            N   S+ IS  R++PD R+ EC    YP+  LP
Sbjct: 14  PGENGRGVIVPPEYQEESRKLFQRNRFNQWASDRISLHRSLPDARILECSSLKYPIHKLP 73

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
           + SVI+VFHNE +S+L+RTVHS++ R+P + L EIILVDD S   +L   LE Y+ + + 
Sbjct: 74  QTSVIIVFHNEAWSTLLRTVHSVLDRSPPELLREIILVDDSSDHEELHSTLEKYVAKLS- 132

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
           KV+++RN  REGLIR+R  G   +    + FLDAHCE  + WL PLL  I  +R I+  P
Sbjct: 133 KVKIVRNKAREGLIRSRLNGFAHATSPTVTFLDAHCEANVGWLEPLLYRIMQNRTIVVCP 192

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
            ID I  +T+E+   Y   +  RG F W + ++   +PE E K+R   ++  +SPT AGG
Sbjct: 193 EIDVISDETFEY--TYSSGN-VRGSFNWNLNFRWKAVPEYENKRRAARTDGIRSPTMAGG 249

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LF +   +F ++G YD  + +WGGEN ELSF+IW CGG +E +PCS +GHV+R   PY+F
Sbjct: 250 LFTIHSQYFKDIGLYDKQMEIWGGENLELSFRIWQCGGQLEIIPCSHVGHVFRKSQPYSF 309

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            K      G  ++ N +RV E W D  +K YFY R+P       GDIS++
Sbjct: 310 PKGT----GETLSKNLQRVAEVWMD-GYKRYFYKRQPHLKGHPFGDISKR 354


>gi|21707970|gb|AAH34184.1| Galnt11 protein [Mus musculus]
          Length = 411

 Score =  286 bits (733), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 156/352 (44%), Positives = 212/352 (60%), Gaps = 16/352 (4%)

Query: 2   PVFKAD--GKLGN--LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 57
           P FKA+   +L N  +E P +   +   E G  ++  E  +   D    ++  NM  SN 
Sbjct: 69  PQFKANRIDRLMNNHIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLISNR 126

Query: 58  ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           + + R +PD R  EC+   YP DLP AS+++ F+NE FS+L+RTVHS++ RTPA  L EI
Sbjct: 127 LGYHRDVPDTRNAECRRKSYPTDLPTASIVICFYNEAFSALLRTVHSVVDRTPAHLLHEI 186

Query: 118 ILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 176
           ILVDD S   DL  +L++YIQR+   KV++IRN +REGLIR R  GA  + GEV+VFLD+
Sbjct: 187 ILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDS 246

Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 236
           HCEV + WL PLLA I  D   +  PVID I   T      Y      RG F WG+ +K 
Sbjct: 247 HCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKW 302

Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
           + +P  E       + P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IW
Sbjct: 303 DLVPVSELGGPDGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIW 362

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
           MCGG +  +PCSR+GH++R   PY   +  D      +T+N  R+   W DE
Sbjct: 363 MCGGKLFILPCSRVGHIFRKRRPYGSPEGQDT-----MTHNSLRLAHVWLDE 409


>gi|393908333|gb|EFO20718.2| glycosyl transferase [Loa loa]
          Length = 622

 Score =  286 bits (733), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 167/373 (44%), Positives = 224/373 (60%), Gaps = 23/373 (6%)

Query: 10  LGNLEPPLEPYKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGMNMETSNHISF 60
           L N + P+  YK G    PGEGGKA       L  + +   D    +   N   S+ IS 
Sbjct: 97  LFNRDSPI--YKSGDEHQPGEGGKAVIIDRNKLAFSEKRIYDDGFNKNAFNQYVSDMISI 154

Query: 61  DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
            R++P    EECK   Y  DLP  SVI+ FHNE +S L+RTVHS+++RTP   L EIILV
Sbjct: 155 HRSLPSYIDEECKTEKYANDLPNTSVIICFHNEAWSVLLRTVHSVLERTPENLLAEIILV 214

Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           DDFS  A L   LE Y+++F  KVR++R  +REGLIR R +GA  S+G VI +LD+HCE 
Sbjct: 215 DDFSDMAHLKASLEIYMRQF-PKVRILRLEKREGLIRARIKGAAISKGSVITYLDSHCEC 273

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-GIFEWGMLYKENEL 239
              W+ PLL  I  + K +  PVID ID  T+E+   Y   +    G F+W + +  + +
Sbjct: 274 LEGWMEPLLDRIKKNPKTVVCPVIDVIDDNTFEYH--YSKAYFTNVGGFDWSLQFNWHAI 331

Query: 240 PEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCG 299
           PE++ K R+ + +P KSPT AGGLF++DR FF +LG YDPGL +WGGEN ELSFK WMCG
Sbjct: 332 PEKDRKGRR-DIDPVKSPTMAGGLFSIDRTFFEKLGSYDPGLDIWGGENLELSFKTWMCG 390

Query: 300 GSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
           G +E VPCS +GH++R   PY +    + +K      N  R+ E W DE +K Y+Y R  
Sbjct: 391 GILEIVPCSHVGHIFRKRSPYKWLSGVNVLK-----RNSVRLAEVWMDE-YKKYYYERIN 444

Query: 360 LAMFLDMGDISEQ 372
             +  D GD+S +
Sbjct: 445 NNLG-DFGDVSSR 456


>gi|195386582|ref|XP_002051983.1| GJ24116 [Drosophila virilis]
 gi|194148440|gb|EDW64138.1| GJ24116 [Drosophila virilis]
          Length = 632

 Score =  286 bits (733), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 144/360 (40%), Positives = 218/360 (60%), Gaps = 11/360 (3%)

Query: 15  PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
           P +   +  PGE GK   +P   +        E   N+  S+ IS +R++ D+R E C++
Sbjct: 122 PTVRESRGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHENCRH 181

Query: 75  WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
             YP  LP  S+++VFHNE +++L+RTV S+I R+P   L+EIILVDD S +  L ++LE
Sbjct: 182 KHYPSKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASERDFLGKQLE 241

Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           DY+ +   +  ++R  +R GLIR R  GA+   GEVI FLDAHCE    WL PLLA I  
Sbjct: 242 DYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVTGEVITFLDAHCECTEGWLEPLLARIVQ 301

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEP 253
           +R+ +  P+ID I  +T+E+  +   D  + G F W + ++   +P+RE  +R  + + P
Sbjct: 302 NRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPQREMARRNNDRTAP 358

Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
            ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +GHV
Sbjct: 359 LRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHV 418

Query: 314 YRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +R   PY F G +A      ++ +N  RV E W DE  + ++Y     A     GD+S++
Sbjct: 419 FRDKSPYTFPGGVA-----KIVLHNAARVAEVWLDE-WRDFYYAMSTGARKASAGDVSDR 472


>gi|348539520|ref|XP_003457237.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Oreochromis niloticus]
          Length = 619

 Score =  286 bits (733), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 150/339 (44%), Positives = 200/339 (58%), Gaps = 11/339 (3%)

Query: 35  EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEG 94
           EA +   D+    +  N+  SN + F R +P+ R  +C+   YP+ LP ASV++ F NE 
Sbjct: 80  EADQEVRDSGYHRHAFNVLISNRLGFHRQLPETRDAQCREKSYPVALPSASVVICFFNEA 139

Query: 95  FSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTERE 153
            S+L+RTVHS++ RTPA  L EIILVDD S   +L  +L+ Y++    GKV+L+RN  RE
Sbjct: 140 LSALLRTVHSVLDRTPAYLLHEIILVDDHSELEELKDELDRYVRAELQGKVQLVRNQRRE 199

Query: 154 GLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWE 213
           GLIR R  GA  + GEV+VFLD+HCEV   WL PLLAPI  D + +  PVID I   T  
Sbjct: 200 GLIRGRMIGASHATGEVLVFLDSHCEVNQAWLQPLLAPIQKDHRTVVCPVIDIISADTL- 258

Query: 214 FRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLE 273
               Y P    RG F WG+ +K + +P  E    +  S P +SPT AGGLFAM+R +F E
Sbjct: 259 ---AYSPSPIVRGGFNWGLHFKWDPVPPSELSGPEGASGPIRSPTMAGGLFAMNRKYFNE 315

Query: 274 LGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPL 333
           LG YD G+ +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY      D      
Sbjct: 316 LGQYDAGMDIWGGENLEISFRIWMCGGQLFIIPCSRVGHIFRKRRPYGSPGGHD-----T 370

Query: 334 ITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           + +N  R+   W D   + Y   R P       GDI E+
Sbjct: 371 MAHNSLRLAHVWMDGYKEQYLSLR-PELRNRSYGDIGER 408


>gi|158293352|ref|XP_314708.4| AGAP008613-PA [Anopheles gambiae str. PEST]
 gi|157016664|gb|EAA10180.4| AGAP008613-PA [Anopheles gambiae str. PEST]
          Length = 596

 Score =  286 bits (733), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 148/351 (42%), Positives = 214/351 (60%), Gaps = 11/351 (3%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           PGE GK   +P   +        E   N+  S+ I  +R++ D+R  +CK   YP  LP 
Sbjct: 85  PGEMGKPVKIPANQQELMKEKFKENQFNLLASDMIWLNRSLTDVRHHDCKKKHYPAKLPT 144

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
            S+++VFHNE +S+L+RT+ S+I R+P   L+EIILVDD S +  L ++LE+Y++     
Sbjct: 145 TSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASEREHLGRQLEEYVRTLPVP 204

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
             ++R  +R GLIR R  GAK  +G+VI FLDAHCE    WL PLLA I  DRK +  P+
Sbjct: 205 TFVLRTGKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIVLDRKTVVCPI 264

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
           ID I  +T+E+  V   D  + G F W + ++   +P RE ++R ++ + P ++PT AGG
Sbjct: 265 IDVISDETFEY--VTASDQTWGG-FNWKLNFRWYRVPAREMQRRNHDRTAPLRTPTMAGG 321

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG +E  PCS +GHV+R   PY F
Sbjct: 322 LFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEISPCSHVGHVFRDKSPYTF 381

Query: 323 -GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            G +A+     ++  N  RV E W DE  K ++Y   P A     GD+SE+
Sbjct: 382 PGGVAN-----IVLKNAARVAEVWLDE-WKEFYYQMSPGARKASAGDVSER 426


>gi|296488074|tpg|DAA30187.1| TPA: polypeptide N-acetylgalactosaminyltransferase 11-like [Bos
           taurus]
          Length = 605

 Score =  286 bits (732), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 155/332 (46%), Positives = 209/332 (62%), Gaps = 12/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R   CK   YP DLP AS+++ F+NE  S+L+RT
Sbjct: 109 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYPADLPVASIVICFYNEALSALLRT 168

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS++ RTPA+ L EIILVDD S   DL  +L++YIQ++  GK+++IRN +REGLIR R 
Sbjct: 169 VHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLPGKIKVIRNPKREGLIRGRM 228

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  DR+ +  PVID I   T      Y  
Sbjct: 229 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRRTVVCPVIDIISADTL----AYSS 284

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E    +  + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 285 SPVVRGGFNWGLHFKWDLVPLSELGGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSG 344

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 345 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 399

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE +K YF  R  L    + G+ISE+
Sbjct: 400 LAHVWLDE-YKQYFSLRPDLRT-RNYGNISER 429


>gi|443704818|gb|ELU01679.1| hypothetical protein CAPTEDRAFT_140956 [Capitella teleta]
          Length = 550

 Score =  286 bits (732), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 152/368 (41%), Positives = 223/368 (60%), Gaps = 14/368 (3%)

Query: 10  LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGM-----NMETSNHISFDRTI 64
           LG +E P E   + PGE G A+ + E   ++ +    ++G      N   S+ IS  RT+
Sbjct: 23  LGKVESP-EHNADDPGEMGVAFQVDEKKLSSAEKEEYDFGFKRNAFNQYASDRISVHRTL 81

Query: 65  PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
           PD R  EC+   +   +PKASVI++FHNE +S L+RTV+SI++R+P ++LEE+ILVDD+S
Sbjct: 82  PDYRDVECRAILHSSKMPKASVIVIFHNEAWSVLLRTVYSILERSPPRFLEEVILVDDYS 141

Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
            +  L  +L++++     KVRL+R+ +REGLIR R  GA+ ++G+V+VFLD+HCE    W
Sbjct: 142 DQEHLHDQLDEFVAT-QQKVRLVRSEKREGLIRARLIGAEAAKGQVLVFLDSHCECTPGW 200

Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREA 244
           L P+L  I  D   +  P+ID ID +T  +           G F+W M +  + LP  E 
Sbjct: 201 LEPMLDRIGQDWSHVVTPIIDVIDDKTLMYNFNPLSRGFSVGGFDWAMGFTWHALPNHEK 260

Query: 245 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
           ++RK  S+P +SPT AGGLFA+DR +F  +G YDPG+ +WGGEN E+SF+IWMCGG++E 
Sbjct: 261 ERRKKISDPARSPTMAGGLFAIDREYFYHIGSYDPGMEIWGGENLEMSFRIWMCGGTLET 320

Query: 305 VPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
           +PCS +GH++R   P +  K      G  +  N  R  E W DE    Y Y         
Sbjct: 321 LPCSHVGHIFRKRNPNHSAK-----HGNFVQRNSVRTAEVWMDE--YKYLYYDRIGNHIG 373

Query: 365 DMGDISEQ 372
           D GD+S++
Sbjct: 374 DFGDVSDR 381


>gi|195429102|ref|XP_002062603.1| GK16570 [Drosophila willistoni]
 gi|194158688|gb|EDW73589.1| GK16570 [Drosophila willistoni]
          Length = 679

 Score =  286 bits (732), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 160/352 (45%), Positives = 212/352 (60%), Gaps = 13/352 (3%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLG-EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           G GE GK   L +  +   +  +  E G N   S+ IS +R+I D+R + C+  +Y   L
Sbjct: 151 GIGEQGKPAKLDDENQRELERKMSLENGFNALLSDSISVNRSIADIRHKSCRKKEYLAKL 210

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI++F+NE  S LMR+VHS+I R+P + L+EIILVDDFS +A L   LE+YI    
Sbjct: 211 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELLKEIILVDDFSDRAYLYVPLENYIAEHF 270

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
             VR++R T+R GLI  RS GA+ + GEV++FLD+H E   NWLPPLL PI  + +    
Sbjct: 271 KNVRVVRLTKRTGLIGARSEGARNATGEVLIFLDSHVEANYNWLPPLLEPIAINERTAVC 330

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTHA 260
           P ID ID+  + +R+    D   RG F+W   YK    LPE      K+ SEP+KSP  A
Sbjct: 331 PFIDVIDHSNFNYRA---QDEGARGAFDWEFFYKRLPLLPE----DLKHPSEPFKSPVMA 383

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+   FF ELGGYD GL +WGGE +ELSFKIWMCGG +   PCSR+GH+YR   P 
Sbjct: 384 GGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRVGHIYRG--PR 441

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           N   +     G  +  NYKRV E W DE  K  +   + +   +D GD++ Q
Sbjct: 442 NH--VPSPRTGDYLHKNYKRVAEVWMDEYKKYLYDHGDGIYDRVDAGDLTAQ 491


>gi|341897758|gb|EGT53693.1| CBN-GLY-10 protein [Caenorhabditis brenneri]
          Length = 620

 Score =  286 bits (731), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 157/358 (43%), Positives = 212/358 (59%), Gaps = 19/358 (5%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKY 74
           E  +EGPGE GK   LP+      +A L  Y   G N   S+ IS +R+I D+R  +CK 
Sbjct: 89  EKAREGPGEWGKPVKLPDDKETEKEA-LSLYKANGYNAYISDMISLNRSIKDIRHRDCKK 147

Query: 75  WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
             Y   LP  SVI  FH E  S+L+R+V+S+I R+P + L+EIILVDDFS K  L Q LE
Sbjct: 148 MTYSAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLE 207

Query: 135 DYIQR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           D++++   +  V+++R  +REGLIR R  GA+E+ GE+++FLDAH E   NWLPPLL PI
Sbjct: 208 DFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLPPLLDPI 267

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 252
             D + +  P +D ID +T+E R     D   RG F+W   YK   L +   K R+  + 
Sbjct: 268 AEDYRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTK---KDRENPTT 321

Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
           P+ SP  AGG FA+   +F ELGGYD GL +WGGE +ELSFK+W C G +   PCSR+ H
Sbjct: 322 PFNSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGRMVDAPCSRVAH 381

Query: 313 VYRS-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +YR  + P+    + D      ++ NYKRV E W DE +K   Y   P     D GD+
Sbjct: 382 IYRCKYAPFKNAGMGD-----FVSRNYKRVAEVWMDE-YKETLYKHRPGVGNADAGDL 433


>gi|291235412|ref|XP_002737638.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Saccoglossus kowalevskii]
          Length = 497

 Score =  286 bits (731), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 147/371 (39%), Positives = 224/371 (60%), Gaps = 17/371 (4%)

Query: 9   KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           K+ ++  P+   ++GPGE GK   +   ++   D        N+  S+ I+ +R++PD+R
Sbjct: 7   KIQDMPKPVN--RDGPGEQGKPVIIEPEFKKERDEKWKINEFNLMASDKIALNRSLPDVR 64

Query: 69  MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
              C    YP  LP  SVI+VFHNE +S+L+RT HSII R+P + L E+ILVDD S++  
Sbjct: 65  PRGCNDKKYPGKLPTTSVIVVFHNEAWSTLLRTTHSIINRSPRELLMEVILVDDCSTQEH 124

Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
           L + L+DY+ +    V + R   R GLIR+R RG   ++G+V+ +LD+HCE    WL PL
Sbjct: 125 LKKPLDDYVAKLPVPVHVERMEVRSGLIRSRLRGGSVAKGDVLTYLDSHCECTEGWLEPL 184

Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR- 247
           ++ I  DRK    P+ID ID +++ +    E +    G F W + ++   +PE E  +R 
Sbjct: 185 VSRIGDDRKTRVQPIIDIIDDRSFAYIGASESNS---GGFTWQLQHQWVRIPEYEQNRRV 241

Query: 248 ------KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
                 +  +  +++PT AGGLF++++ +F ++G YD G+ VWGGEN E+SF+IWMCGG 
Sbjct: 242 SEYDNIRQVTLFHRTPTMAGGLFSINKTYFEKMGAYDTGMDVWGGENIEMSFRIWMCGGK 301

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           IE +PCSRIGHVYR ++PY+F   +D    P I  N  RV E W D  +K +FY  +   
Sbjct: 302 IEIIPCSRIGHVYRRYIPYSFPNGSD----PTIYRNAMRVAEVWMDH-YKKFFYATQTKL 356

Query: 362 MFLDMGDISEQ 372
             +D GD+S++
Sbjct: 357 HMVDYGDVSDR 367


>gi|390341984|ref|XP_003725567.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Strongylocentrotus purpuratus]
          Length = 654

 Score =  285 bits (730), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 153/372 (41%), Positives = 220/372 (59%), Gaps = 12/372 (3%)

Query: 3   VFKADGKLGNLEPPLEPYKEGP-GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
           V+K  GK    +  L+   +G  GE        +  R+  D    ++  N   S  I F 
Sbjct: 109 VYKKQGKPMRAKQRLKADAQGDWGEDELGMVRTDEERSIRDGGYRQHAFNELISQRIGFH 168

Query: 62  RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
           R + D R   CKY  Y  +LP  S+++ F+NE +S+L+RTV+S++ RTP + + E+ILVD
Sbjct: 169 RNVTDTRNPLCKYQVYSEELPTVSIVICFYNEAWSTLLRTVYSVLDRTPRRLIHELILVD 228

Query: 122 DFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           DFS    L ++L+ Y+ + FNG V +I N +REGLIR R+ GA+ + G+V++FLD+HCEV
Sbjct: 229 DFSELTHLKKELDQYMSKNFNGLVHVIHNGQREGLIRARTIGARYATGDVLMFLDSHCEV 288

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
              WL PLL  I +D   +  P+ID I++ T+     Y      +G F WGM +K + + 
Sbjct: 289 NEQWLEPLLERIKADSHTVVCPIIDIINHDTF----AYTASPLVKGGFNWGMHFKWDTIR 344

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
            R+   ++   +P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IW CGG
Sbjct: 345 SRQLVGKEDYVKPIESPTMAGGLFAMNREYFHKLGDYDEGMDIWGGENLEISFRIWQCGG 404

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
            +E VPCSR+GHV+R   PY      D       T N  RV E W DE +K +FY  +P 
Sbjct: 405 KLEIVPCSRVGHVFRKRRPYGSPNRQDTT-----TKNAVRVAEVWMDE-YKEHFYQVQPK 458

Query: 361 AMFLDMGDISEQ 372
           A  +D GDIS +
Sbjct: 459 AKNIDYGDISSR 470


>gi|115533032|ref|NP_001041036.1| Protein GLY-10, isoform a [Caenorhabditis elegans]
 gi|182676440|sp|O45947.3|GLT10_CAEEL RecName: Full=Putative polypeptide
           N-acetylgalactosaminyltransferase 10; Short=pp-GaNTase
           10; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 10; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 10
 gi|3880991|emb|CAA16378.1| Protein GLY-10, isoform a [Caenorhabditis elegans]
          Length = 684

 Score =  285 bits (730), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 156/358 (43%), Positives = 215/358 (60%), Gaps = 19/358 (5%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKY 74
           E  +EGPGE GK   LPE      +A L  Y   G N   S+ IS +R+I D+R +ECK 
Sbjct: 153 EKRREGPGEWGKPVKLPEDKEVEKEA-LSLYKANGYNAYISDMISLNRSIKDIRHKECKN 211

Query: 75  WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
             Y   LP  SVI  FH E  S+L+R+V+S+I R+P + L+EIILVDDFS K  L Q LE
Sbjct: 212 MMYSAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLE 271

Query: 135 DYIQR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           D++++   +  V+++R  +REGLIR R  GA+++ GE+++FLDAH E   NWLPPLL PI
Sbjct: 272 DFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQDATGEILIFLDAHSEANYNWLPPLLDPI 331

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 252
             D + +  P +D ID +T+E R     D   RG F+W   YK   L +++   R+  ++
Sbjct: 332 AEDYRTVVCPFVDVIDCETYEVRP---QDEGARGSFDWAFNYKRLPLTKKD---RESPTK 385

Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
           P+ SP  AGG FA+   +F ELGGYD GL +WGGE +ELSFK+W C G +   PCSR+ H
Sbjct: 386 PFNSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGRMVDAPCSRVAH 445

Query: 313 VYRS-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +YR  + P+    + D      ++ NYKRV E W D+ +K   Y   P     D GD+
Sbjct: 446 IYRCKYAPFKNAGMGD-----FVSRNYKRVAEVWMDD-YKETLYKHRPGVGNADAGDL 497


>gi|115533034|ref|NP_001041037.1| Protein GLY-10, isoform b [Caenorhabditis elegans]
 gi|87251651|emb|CAJ76949.1| Protein GLY-10, isoform b [Caenorhabditis elegans]
          Length = 622

 Score =  285 bits (730), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 156/358 (43%), Positives = 215/358 (60%), Gaps = 19/358 (5%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKY 74
           E  +EGPGE GK   LPE      +A L  Y   G N   S+ IS +R+I D+R +ECK 
Sbjct: 91  EKRREGPGEWGKPVKLPEDKEVEKEA-LSLYKANGYNAYISDMISLNRSIKDIRHKECKN 149

Query: 75  WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
             Y   LP  SVI  FH E  S+L+R+V+S+I R+P + L+EIILVDDFS K  L Q LE
Sbjct: 150 MMYSAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLE 209

Query: 135 DYIQR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           D++++   +  V+++R  +REGLIR R  GA+++ GE+++FLDAH E   NWLPPLL PI
Sbjct: 210 DFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQDATGEILIFLDAHSEANYNWLPPLLDPI 269

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 252
             D + +  P +D ID +T+E R     D   RG F+W   YK   L +++   R+  ++
Sbjct: 270 AEDYRTVVCPFVDVIDCETYEVRP---QDEGARGSFDWAFNYKRLPLTKKD---RESPTK 323

Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
           P+ SP  AGG FA+   +F ELGGYD GL +WGGE +ELSFK+W C G +   PCSR+ H
Sbjct: 324 PFNSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGRMVDAPCSRVAH 383

Query: 313 VYRS-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +YR  + P+    + D      ++ NYKRV E W D+ +K   Y   P     D GD+
Sbjct: 384 IYRCKYAPFKNAGMGD-----FVSRNYKRVAEVWMDD-YKETLYKHRPGVGNADAGDL 435


>gi|195114266|ref|XP_002001688.1| GI16986 [Drosophila mojavensis]
 gi|193912263|gb|EDW11130.1| GI16986 [Drosophila mojavensis]
          Length = 633

 Score =  285 bits (730), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 144/360 (40%), Positives = 217/360 (60%), Gaps = 11/360 (3%)

Query: 15  PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
           P +   +  PGE GK   +P   +        E   N+  S+ IS +R++ D+R E C++
Sbjct: 123 PTVRESRGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHENCRH 182

Query: 75  WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
             YP  LP  S+++VFHNE +++L+RTV S+I R+P   L+EIILVDD S +  L ++LE
Sbjct: 183 KHYPSKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASERDFLGKQLE 242

Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           DY+ +   +  ++R  +R GLIR R  GA+   GEVI FLDAHCE    WL PLLA I  
Sbjct: 243 DYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVTGEVITFLDAHCECTEGWLEPLLARIVQ 302

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEP 253
           +R+ +  P+ID I   T+E+  +   D  + G F W + ++   +P+RE  +R  + + P
Sbjct: 303 NRRTVVCPIIDVISDDTFEY--ITASDSTWGG-FNWKLNFRWYRVPQREMARRNNDRTAP 359

Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
            ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +GHV
Sbjct: 360 LRTPTMAGGLFSIDKEYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHV 419

Query: 314 YRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +R   PY F G +A      ++ +N  RV E W DE  + ++Y     A     GD+S++
Sbjct: 420 FRDKSPYTFPGGVA-----KIVLHNAARVAEVWLDE-WRDFYYAMSTGARKASAGDVSDR 473


>gi|390336582|ref|XP_001187912.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Strongylocentrotus purpuratus]
          Length = 490

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 147/323 (45%), Positives = 202/323 (62%), Gaps = 10/323 (3%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N+  S+ I+ +R++PD+R   C    YP  LP  SVILV+HNE  S+L+R VHSII R+P
Sbjct: 25  NLMASDRIALNRSLPDVRPRGCANKVYPKKLPTTSVILVYHNEARSTLLRNVHSIINRSP 84

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              L EIILVDD S +  L + LEDYI +    V +++   R GLIR R  GA  ++G+V
Sbjct: 85  HDLLAEIILVDDASDQEHLGKSLEDYIAKLPVSVYVVKMKGRSGLIRARMAGAAVAKGQV 144

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCEV   WL P+LA I  DR     PVID I   T++++   +P     G F W
Sbjct: 145 LTFLDSHCEVTEGWLEPMLARIAEDRTTSVCPVIDVISDDTFQYQHGNDPQ---MGGFGW 201

Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            + +K   +P+RE  +RK + +EP +  T AGGLFA+D+++F ELG YDPG  +WGGEN 
Sbjct: 202 SLFFKWFPVPKREQIRRKGDPTEPVRVSTMAGGLFAIDKSYFEELGQYDPGFNIWGGENL 261

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           ELSFK+WMCGG +E++PCS +GHV+R   PY+F    + V       N KR+ E W DE 
Sbjct: 262 ELSFKLWMCGGKLEFIPCSHVGHVFRKKSPYHFPPGTNYV-----NKNNKRLAEVWLDE- 315

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P     D GDIS++
Sbjct: 316 YKNFYYRISPSVAKTDPGDISDR 338


>gi|351695439|gb|EHA98357.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Heterocephalus
           glaber]
          Length = 608

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 154/332 (46%), Positives = 204/332 (61%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R   CK   YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAVCKEKSYPTDLPVASVVICFYNEAFSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS++ RTPA  L EIILVDD S   DL  +L++YIQ++   K++LIRN  REGLIR R 
Sbjct: 171 VHSVLDRTPAYLLHEIILVDDDSDFDDLKGELDEYIQKYLPAKIKLIRNPRREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA ++ D   +  PVID I   T  + S    
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAVVHGDPHTVVCPVIDIISADTLAYSS---- 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E       + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELGGADSATAPIKSPTMAGGLFAMNRQYFNELGQYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDT-----MTHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-KSYGNISER 432


>gi|387017710|gb|AFJ50973.1| Polypeptide N-acetylgalactosaminyltransferase 11-like [Crotalus
           adamanteus]
          Length = 608

 Score =  285 bits (729), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 147/332 (44%), Positives = 201/332 (60%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  N+  SN + + R +PD R  +CK   YP DLP AS+I+ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNVLISNRLGYHRDVPDTRDRKCKEKIYPHDLPSASIIICFYNEAFSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRS 160
           +HS++ RTP+  L EIILVDD S  ADL + L+ Y+ +    KV+L+RN  REGLIR R 
Sbjct: 171 IHSVLDRTPSHLLHEIILVDDRSELADLKEDLDIYLTKDLPNKVKLVRNENREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + G+V+VFLD+HCEV   WL PLL PI   R+ +  PVID I   T      Y  
Sbjct: 231 VGASHATGKVLVFLDSHCEVNEMWLQPLLTPIQESRRTVVCPVIDIISADTL----TYSS 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E +  +  + P KSPT AGGLFAMDR +F  LG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLLEMEGPEQATAPIKSPTMAGGLFAMDREYFNALGQYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY      D      + +N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLVIIPCSRVGHIFRKRRPYGSPGGQD-----TMAHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R P     + G+I+++
Sbjct: 402 LAHVWMDEYKEQYFALR-PELRTRNYGNITDR 432


>gi|344249957|gb|EGW06061.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Cricetulus
           griseus]
          Length = 494

 Score =  285 bits (729), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 147/306 (48%), Positives = 192/306 (62%), Gaps = 6/306 (1%)

Query: 67  LRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
           L+   C    Y   LP  S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS +
Sbjct: 15  LQNLNCNSKLYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDR 74

Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
             L + LEDY+  F   VR++R  +REGLIRTR  GA  + G+VI FLD+HCE  +NWLP
Sbjct: 75  EHLKKPLEDYMALF-PSVRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLP 133

Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           PLL  I  +RK +  P+ID ID+   +FR   +     RG F+W M YK   +P    K 
Sbjct: 134 PLLDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKA 191

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
               S+P++SP  AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +P
Sbjct: 192 DP--SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIP 249

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDM 366
           CSR+GH+YR  +PY            L   N KRV E W DE +  Y Y R P    L  
Sbjct: 250 CSRVGHIYRKSVPYKVPAGPADPCNCLSLQNLKRVAEVWMDE-YAEYIYQRRPEYRHLSA 308

Query: 367 GDISEQ 372
           GD+  Q
Sbjct: 309 GDVVAQ 314


>gi|312075557|ref|XP_003140470.1| Gly-3 protein [Loa loa]
 gi|307764367|gb|EFO23601.1| Gly-3 protein [Loa loa]
          Length = 584

 Score =  285 bits (729), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 151/357 (42%), Positives = 222/357 (62%), Gaps = 14/357 (3%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + GPGE G A  +  + +        E   ++  S+ IS +R +PD R  +C+      D
Sbjct: 82  RNGPGEMGSAVIIDPSQQEERKKKFNENQFDVMASDLISINRALPDYRSSKCREAARKYD 141

Query: 81  ---LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
              LP  S+I+VFHNE +S+L+RT+HS+I R+P   ++E+IL+DD S++  L   L+ YI
Sbjct: 142 ITSLPTVSIIIVFHNEAWSTLLRTIHSVINRSPLHLIKEVILIDDLSNRTYLRSPLDLYI 201

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
           +RF+    LI   ER GLIR R +GAK ++G+V++FLDAH EV   WL PLL  +  DRK
Sbjct: 202 KRFSLPFHLIHLPERSGLIRARLQGAKIAKGKVLLFLDAHVEVTEGWLEPLLDRVSVDRK 261

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKS 256
            +  P+ID I  + +E+  +   D  + G F W + ++   +P RE ++R ++ S P ++
Sbjct: 262 RVVAPIIDVISDENFEY--ITASDITWGG-FNWHLNFRWYPVPMREMERRNHDRSVPLQT 318

Query: 257 PTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRS 316
           PT AGGLFA+DR FF ++G YD G+ VWGGEN E+SF++WMCGGS+E  PCSR+GHV+R 
Sbjct: 319 PTIAGGLFAIDRQFFYDIGSYDEGMEVWGGENLEISFRVWMCGGSLEIHPCSRVGHVFRK 378

Query: 317 FMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
             PY+F G  A+     +I  N  R  E W DE +K  FY   P A  +D+GD++E+
Sbjct: 379 HTPYSFPGGTAN-----VIHRNAARTAEVWMDE-YKDIFYKMVPAAKNVDIGDLTER 429


>gi|225007540|ref|NP_001070030.2| polypeptide N-acetylgalactosaminyltransferase 11 [Danio rerio]
          Length = 590

 Score =  285 bits (729), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 150/332 (45%), Positives = 200/332 (60%), Gaps = 14/332 (4%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  N+  SN + + R +PD R ++C+   Y + LP AS+++ F NE FS+L+RT
Sbjct: 97  DMGYHKHAFNVLISNRLGYHRDVPDTRTDKCRDRAYSVSLPTASIVICFFNEAFSALLRT 156

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRS 160
           VHS++ RTP   L EIILVDD S   DL + L+ Y+Q+    KV+++RN +REGLIR R 
Sbjct: 157 VHSVLDRTPNYLLHEIILVDDHSELDDLKEDLDSYVQQHLQKKVKVVRNEKREGLIRGRM 216

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV   WL PLL PI  +RK +  PVID I   T     VY P
Sbjct: 217 IGASHATGEVLVFLDSHCEVNEAWLQPLLTPIKENRKTVVCPVIDIISADTL----VYTP 272

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E           +SPT AGGLFAMDR +F ELG YD G
Sbjct: 273 SPIVRGGFNWGLHFKWDPVPMSELNS---PDGAIRSPTMAGGLFAMDRNYFYELGQYDRG 329

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  VPCSR+GH++R   PY      D      + +N  R
Sbjct: 330 MDIWGGENLEISFRIWMCGGQLLIVPCSRVGHIFRKRRPYGSPGGQD-----TMAHNSLR 384

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W D+  + YF  R P     D GDISE+
Sbjct: 385 LAHVWMDDYKEQYFALR-PELRNRDYGDISER 415


>gi|115313271|gb|AAI24298.1| Zgc:153274 [Danio rerio]
          Length = 590

 Score =  285 bits (728), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 150/332 (45%), Positives = 200/332 (60%), Gaps = 14/332 (4%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  N+  SN + + R +PD R ++C+   Y + LP AS+++ F NE FS+L+RT
Sbjct: 97  DMGYHKHAFNVLISNRLGYHRDVPDTRTDKCRDRAYSVSLPTASIVICFFNEAFSALLRT 156

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRS 160
           VHS++ RTP   L EIILVDD S   DL + L+ Y+Q+    KV+++RN +REGLIR R 
Sbjct: 157 VHSVLDRTPNYLLHEIILVDDHSELDDLKEDLDSYVQQHLQKKVKVVRNEKREGLIRGRM 216

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV   WL PLL PI  +RK +  PVID I   T     VY P
Sbjct: 217 IGASHATGEVLVFLDSHCEVNEAWLQPLLTPIKENRKTVVCPVIDIISADTL----VYTP 272

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E           +SPT AGGLFAMDR +F ELG YD G
Sbjct: 273 SPIVRGGFNWGLHFKWDPVPMSELNS---PDGAIRSPTMAGGLFAMDRNYFYELGQYDRG 329

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  VPCSR+GH++R   PY      D      + +N  R
Sbjct: 330 MDIWGGENLEISFRIWMCGGQLLIVPCSRVGHIFRKRRPYGSPGGQD-----TMAHNSLR 384

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W D+  + YF  R P     D GDISE+
Sbjct: 385 LAHVWMDDYKEQYFALR-PELRNRDYGDISER 415


>gi|307204529|gb|EFN83209.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Harpegnathos
           saltator]
          Length = 605

 Score =  285 bits (728), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 145/353 (41%), Positives = 212/353 (60%), Gaps = 9/353 (2%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           K  PGE G A H+P    A           N+  S+ IS +R++ D+R++ CK   Y   
Sbjct: 100 KGSPGEMGAAVHIPPENEAKQQELFKLNQFNLMASDMISLNRSLKDIRLDGCKNKKYNKY 159

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  S+++VFHNE +++L+RTV S+I R+P   L+E+ILVDD S +  L Q LEDYI   
Sbjct: 160 LPDTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEVILVDDASERDHLKQDLEDYIATL 219

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
                + R  +R GLIR R  GAK  +G+VI FLDAHCE    WL PLL+ I +DR  + 
Sbjct: 220 PVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSRIANDRHTVV 279

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
            P+ID I   T+E+  +   D  + G F W + ++   + +RE  +R  + + P ++PT 
Sbjct: 280 CPIIDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRNSDRTAPLRTPTM 336

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E  PCS +GHV+R   P
Sbjct: 337 AGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHVFRDKSP 396

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           Y F     ++    + +N  RV E W DE  + ++Y   P A  +D+GD+SE+
Sbjct: 397 YTFPGGVSKI----VLHNAARVAEVWMDE-WRDFYYAMNPGARNVDVGDVSER 444


>gi|339244173|ref|XP_003378012.1| polypeptide N-acetylgalactosaminyltransferase 3 [Trichinella
           spiralis]
 gi|316973116|gb|EFV56743.1| polypeptide N-acetylgalactosaminyltransferase 3 [Trichinella
           spiralis]
          Length = 670

 Score =  285 bits (728), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 151/368 (41%), Positives = 216/368 (58%), Gaps = 11/368 (2%)

Query: 7   DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPD 66
           D  L  L   ++    G GE G    +  + ++   A   E   N+  S  IS +RT+PD
Sbjct: 56  DSALQTLLAAMKSKSPGAGEMGSPVIIQSSLQSEVKARFKENQFNVVASERISLNRTLPD 115

Query: 67  LRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
            R   C+   Y     K SV++VFHNE +S+LMRTV S+I R+   YLEEIILVDD S K
Sbjct: 116 YRSSACRSIKYEKISLKTSVVIVFHNEAWSTLMRTVQSVINRSSVDYLEEIILVDDASEK 175

Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
            +L   +E +++       LIR  +R GLI  R RGA+ ++G+V+ FLDAH EV   WL 
Sbjct: 176 DELIALVESFLKTIPVAHTLIRLPQRSGLIVGRVRGAEIAKGDVLTFLDAHVEVTDGWLE 235

Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           PLL+ I  DR  +  PVID I   T+++ +  E      G F W M ++  +   RE K+
Sbjct: 236 PLLSRISEDRTRVVAPVIDVISDDTFQYVTAAESTW---GGFSWTMNFRWYQASAREQKR 292

Query: 247 R-KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
           R K  + P ++PT AGGLF++DR +F ++G YD G+ +WGGEN E+SF++WMCGG++E  
Sbjct: 293 RGKNKTTPIRTPTIAGGLFSIDRKYFFDIGAYDEGMRIWGGENLEISFRVWMCGGTLEIN 352

Query: 306 PCSRIGHVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
           PCS +GHV+R   PY F G  ++ + G     N +R  E W DE +K ++Y   P AMF 
Sbjct: 353 PCSHVGHVFRKQTPYTFEGGTSNVIYG-----NARRTAEVWMDE-YKEFYYKMTPSAMFA 406

Query: 365 DMGDISEQ 372
            +G+IS++
Sbjct: 407 PLGNISDR 414


>gi|301759365|ref|XP_002915552.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5-like
           [Ailuropoda melanoleuca]
          Length = 448

 Score =  285 bits (728), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 146/330 (44%), Positives = 206/330 (62%), Gaps = 11/330 (3%)

Query: 43  ASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTV 102
           A   +YG N   S  +  +R +PD R + C    YP DLP ASV++ FHNE F++L RT+
Sbjct: 100 AGFLKYGFNAILSKSLGSERDVPDTRNKMCLQKHYPADLPTASVVICFHNEEFNALFRTM 159

Query: 103 HSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRG 162
            S+I  TP   LEEIILVDD SS  DL +KL+  ++ F GK++LIRN +REGLIR+R  G
Sbjct: 160 SSVINLTPHHILEEIILVDDLSSVDDLKEKLDHRLEIFRGKIKLIRNKKREGLIRSRLIG 219

Query: 163 AKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDH 222
           A  + G+V+VFLD+HCEV   WL PLLAPI  D K++  P+ID ID++T E+R    P  
Sbjct: 220 ASRASGDVLVFLDSHCEVNHVWLQPLLAPIAKDPKMVVCPLIDPIDHKTLEYR----PSP 275

Query: 223 HYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLL 282
             RG F W + +K + +   E    +  ++P +SP  AGG+FA++R +F E+G YD  + 
Sbjct: 276 VVRGAFTWHLEFKWDNVLSYEIDGPEGPTKPIRSPAMAGGVFAINRHYFNEIGKYDRDME 335

Query: 283 VWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVI 342
           +WG EN ELS +IWMCGG +  +PCSR+GH+ +   P   G +        +TYN  R++
Sbjct: 336 LWGAENLELSLRIWMCGGQLFILPCSRVGHISKHRFPNQPGLMK------AVTYNNLRLV 389

Query: 343 ETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
             W DE +K  F+ R+P    +  G+ISE+
Sbjct: 390 HVWLDE-YKEQFFLRQPGLKSVAYGNISER 418


>gi|281339845|gb|EFB15429.1| hypothetical protein PANDA_003532 [Ailuropoda melanoleuca]
          Length = 447

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 146/330 (44%), Positives = 206/330 (62%), Gaps = 11/330 (3%)

Query: 43  ASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTV 102
           A   +YG N   S  +  +R +PD R + C    YP DLP ASV++ FHNE F++L RT+
Sbjct: 100 AGFLKYGFNAILSKSLGSERDVPDTRNKMCLQKHYPADLPTASVVICFHNEEFNALFRTM 159

Query: 103 HSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRG 162
            S+I  TP   LEEIILVDD SS  DL +KL+  ++ F GK++LIRN +REGLIR+R  G
Sbjct: 160 SSVINLTPHHILEEIILVDDLSSVDDLKEKLDHRLEIFRGKIKLIRNKKREGLIRSRLIG 219

Query: 163 AKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDH 222
           A  + G+V+VFLD+HCEV   WL PLLAPI  D K++  P+ID ID++T E+R    P  
Sbjct: 220 ASRASGDVLVFLDSHCEVNHVWLQPLLAPIAKDPKMVVCPLIDPIDHKTLEYR----PSP 275

Query: 223 HYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLL 282
             RG F W + +K + +   E    +  ++P +SP  AGG+FA++R +F E+G YD  + 
Sbjct: 276 VVRGAFTWHLEFKWDNVLSYEIDGPEGPTKPIRSPAMAGGVFAINRHYFNEIGKYDRDME 335

Query: 283 VWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVI 342
           +WG EN ELS +IWMCGG +  +PCSR+GH+ +   P   G +        +TYN  R++
Sbjct: 336 LWGAENLELSLRIWMCGGQLFILPCSRVGHISKHRFPNQPGLMK------AVTYNNLRLV 389

Query: 343 ETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
             W DE +K  F+ R+P    +  G+ISE+
Sbjct: 390 HVWLDE-YKEQFFLRQPGLKSVAYGNISER 418


>gi|308457549|ref|XP_003091148.1| CRE-GLY-10 protein [Caenorhabditis remanei]
 gi|308258137|gb|EFP02090.1| CRE-GLY-10 protein [Caenorhabditis remanei]
          Length = 620

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 155/358 (43%), Positives = 214/358 (59%), Gaps = 19/358 (5%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKY 74
           E  +EGPGE GK   +P+      +A L  Y   G N   S+ IS +R+I D+R ++CK 
Sbjct: 89  EKAREGPGEWGKPVKVPDDKETEKEA-LSLYKANGYNAYVSDMISLNRSIKDIRHKDCKK 147

Query: 75  WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
             Y   LP  SVI  FH E  S+L+R+V+S+I R+P + L+EIILVDDFS K  L Q LE
Sbjct: 148 MMYSAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLE 207

Query: 135 DYIQR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           D++++   +  V+++R  +REGLIR R  GA+E+ GE+++FLDAH E   NWLPPLL PI
Sbjct: 208 DFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLPPLLDPI 267

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 252
             D + +  P +D ID +T+E R     D   RG F+W   YK   L +++   R+  + 
Sbjct: 268 AEDYRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTKKD---RENPTT 321

Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
           P+ SP  AGG FA+   +F ELGGYD GL +WGGE +ELSFK+W C G +   PCSR+ H
Sbjct: 322 PFNSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGRMVDAPCSRVAH 381

Query: 313 VYRS-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +YR  + P+    + D      ++ NYKRV E W DE +K   Y   P     D GD+
Sbjct: 382 IYRCKYAPFKNAGMGD-----FVSRNYKRVAEVWMDE-YKETLYKHRPGVGSADAGDL 433


>gi|354478320|ref|XP_003501363.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5
           [Cricetulus griseus]
          Length = 435

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 209/339 (61%), Gaps = 17/339 (5%)

Query: 34  PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNE 93
           PE Y+        +YG+N+  S  +   R +PD R + C    YP +LP AS+I+ FHNE
Sbjct: 76  PEFYKG-----FAQYGLNVVISRRLGIQREVPDSRDKICHQKHYPFNLPTASIIICFHNE 130

Query: 94  GFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTERE 153
            F++L+RTV S+I  TP+ +LEEIILVDD S   DL +KL+ +++ F GK++LIRN +RE
Sbjct: 131 EFNTLLRTVSSVINLTPSHFLEEIILVDDMSDTDDLKEKLDYHLELFRGKIKLIRNKKRE 190

Query: 154 GLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWE 213
           GLIR+R  GA  + G+++VFLD+HCEV   WL PLL  I  D K++  P+ID ID  T +
Sbjct: 191 GLIRSRMIGASRASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPMIDVIDDTTLD 250

Query: 214 FRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLE 273
               Y      RG F+W ++++ + +   E    +  S+P +SP  AGG+FA+DR +F E
Sbjct: 251 ----YTAAPLVRGAFDWDLMFRWDNVFSYEMDGPEGTSKPIRSPAMAGGIFAIDRHYFTE 306

Query: 274 LGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPL 333
           LG YD  + +WGGEN ELS +IWMCGG +  +PCSR+GH+ +     NF   A +     
Sbjct: 307 LGQYDKDMDLWGGENVELSLRIWMCGGQLFILPCSRVGHIAKI---QNFNNAALKA---- 359

Query: 334 ITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +++N  RV   W DE HK  F+ R P   +   G+ISE+
Sbjct: 360 LSWNLLRVAHVWLDE-HKDNFFLRRPYLKYEPYGNISER 397


>gi|308452095|ref|XP_003088913.1| hypothetical protein CRE_04439 [Caenorhabditis remanei]
 gi|308244364|gb|EFO88316.1| hypothetical protein CRE_04439 [Caenorhabditis remanei]
          Length = 620

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 155/358 (43%), Positives = 214/358 (59%), Gaps = 19/358 (5%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKY 74
           E  +EGPGE GK   +P+      +A L  Y   G N   S+ IS +R+I D+R ++CK 
Sbjct: 89  EKAREGPGEWGKPVKVPDDKETEKEA-LSLYKANGYNAYVSDMISLNRSIKDIRHKDCKK 147

Query: 75  WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
             Y   LP  SVI  FH E  S+L+R+V+S+I R+P + L+EIILVDDFS K  L Q LE
Sbjct: 148 MMYSAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLE 207

Query: 135 DYIQR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           D++++   +  V+++R  +REGLIR R  GA+E+ GE+++FLDAH E   NWLPPLL PI
Sbjct: 208 DFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLPPLLDPI 267

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 252
             D + +  P +D ID +T+E R     D   RG F+W   YK   L +++   R+  + 
Sbjct: 268 AEDYRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTKKD---RENPTT 321

Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
           P+ SP  AGG FA+   +F ELGGYD GL +WGGE +ELSFK+W C G +   PCSR+ H
Sbjct: 322 PFNSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGRMVDAPCSRVAH 381

Query: 313 VYRS-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +YR  + P+    + D      ++ NYKRV E W DE +K   Y   P     D GD+
Sbjct: 382 IYRCKYAPFKNAGMGD-----FVSRNYKRVAEVWMDE-YKETLYKHRPGVGSADAGDL 433


>gi|417403257|gb|JAA48441.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
           rotundus]
          Length = 608

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 152/332 (45%), Positives = 204/332 (61%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  N+  SN + + R +PD R   CK   YP DLP ASV++ F+NE  S+L+RT
Sbjct: 111 DLGYQKHAFNLLISNRLGYHRDVPDTRSAACKDETYPEDLPVASVVICFYNEALSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRS 160
           VHS++ RTPAQ L E+ILVDD S   DL  +L++++Q +  GK+++IRNT+REGLIR R 
Sbjct: 171 VHSVLDRTPAQLLREVILVDDDSDFDDLKGQLDEFVQTQLPGKIKVIRNTKREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV   WL PLLA I  DR+ +  PVID I   T      Y  
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNTMWLQPLLATIQEDRRTVVCPVIDIISADTL----AYSS 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E       + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLIPPSELGGPGGATAPIKSPTMAGGLFAMNRDYFDELGRYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      + +N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGRD-----TMAHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLRT-RSYGNISER 432


>gi|118085566|ref|XP_418541.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Gallus
           gallus]
          Length = 608

 Score =  284 bits (727), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 149/332 (44%), Positives = 201/332 (60%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R  +C+   YP DLP ASVI+ F+NE  S+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHREVPDTRDAKCREKSYPSDLPFASVIICFYNEALSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRS 160
           VHS++ RTPA  L EIILVDD S   DL + L +Y++ R     +L+RN +REGLIR R 
Sbjct: 171 VHSVLDRTPAHLLHEIILVDDNSELDDLKKDLVEYVKTRLPKTTKLVRNEKREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + G+V+VFLD+HCEV   WL PLL PI  DR+ +  PVID I   T      Y  
Sbjct: 231 IGASHATGKVLVFLDSHCEVNEMWLQPLLTPIKEDRRTVVCPVIDIISADTL----TYSS 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E +  +  + P KSPT AGGLFAMDR +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELEGPEGATAPIKSPTMAGGLFAMDREYFNELGQYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY      D      + +N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGRLLIIPCSRVGHIFRKRRPYGSPGGQD-----TMAHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R P     + G+I+++
Sbjct: 402 LAHVWMDEYKEQYFALR-PELRTRNYGNITDR 432


>gi|431895736|gb|ELK05155.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Pteropus alecto]
          Length = 608

 Score =  284 bits (726), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 152/332 (45%), Positives = 206/332 (62%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  S+ + + R +PD R   CK   YP DLP ASV++ F+NE  S+L+RT
Sbjct: 111 DLGYQKHAFNMLISDRLGYHRDVPDTRNAACKDKTYPADLPVASVVICFYNEALSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS++ RTPAQ L E+ILVDD S   DL  +L+ ++Q++  GK+++IRN +REGLIR R 
Sbjct: 171 VHSVLDRTPAQLLHEVILVDDDSDFDDLKGELDAFVQKYLPGKIKVIRNRKREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV + WL PLLA I  DR+ +  PVID I   T      Y  
Sbjct: 231 IGASHATGEVLVFLDSHCEVNVMWLQPLLAAIQEDRRTVVCPVIDIISADTL----AYSS 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E    +  + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLPEPGGPEGATAPIKSPTMAGGLFAMNRDYFSELGQYDRG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLRT-RSYGNISER 432


>gi|195035019|ref|XP_001989024.1| GH11491 [Drosophila grimshawi]
 gi|193905024|gb|EDW03891.1| GH11491 [Drosophila grimshawi]
          Length = 621

 Score =  284 bits (726), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 144/360 (40%), Positives = 217/360 (60%), Gaps = 11/360 (3%)

Query: 15  PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
           P +   K  PGE GK   +P   +        E   N+  S+ IS +R++ D+R E C++
Sbjct: 111 PTIRESKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHENCRH 170

Query: 75  WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
             Y   LP  S+++VFHNE +++L+RTV S+I R+P   L+EIILVDD S +  L ++LE
Sbjct: 171 KHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASERDFLGKQLE 230

Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           DY+ +   +  ++R  +R GLIR R  GA+   GEVI FLDAHCE    WL PLLA I  
Sbjct: 231 DYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVAGEVITFLDAHCECTEGWLEPLLARIVQ 290

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEP 253
           +R+ +  P+ID I  +T+E+  +   D  + G F W + ++   +P+RE  +R  + + P
Sbjct: 291 NRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPQREMARRNNDRTAP 347

Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
            ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +GHV
Sbjct: 348 LRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHV 407

Query: 314 YRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +R   PY F G +A      ++ +N  RV E W DE  + ++Y     A     GD+S++
Sbjct: 408 FRDKSPYTFPGGVA-----KIVLHNAARVAEVWLDE-WRDFYYAMSTGARKASAGDVSDR 461


>gi|431895737|gb|ELK05156.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
           5 [Pteropus alecto]
          Length = 447

 Score =  284 bits (726), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 146/326 (44%), Positives = 202/326 (61%), Gaps = 11/326 (3%)

Query: 47  EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
           EYG N   S  +  +R +PD R + C+   YP+ LP AS+++ FHNE F++L RTV S+I
Sbjct: 104 EYGFNAVVSTSLGRERLVPDTRDKMCRRKHYPVSLPTASIVICFHNEEFNALFRTVSSVI 163

Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
             TP   LEEIILVDD S   DL +KL+ +++ F GK++LIRN +REGLIR R  GA  +
Sbjct: 164 NLTPHHVLEEIILVDDMSEFDDLKEKLDHHLEMFRGKIKLIRNQKREGLIRARLIGASRA 223

Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
            G+V+VFLD+HCEV   WL PLL  I  DRK++  PVID ID  T E+R    P    RG
Sbjct: 224 SGDVLVFLDSHCEVNRVWLEPLLYAISKDRKMVVCPVIDVIDSTTLEYR----PSPLVRG 279

Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
            F+W + +K + +   E    +  + P +SP  AGG+FA+ R +F E+G YD G+ +WGG
Sbjct: 280 AFDWYLQFKWDNVFSYELDGPEGLTRPIRSPAMAGGIFAIRRHYFNEIGQYDKGMDLWGG 339

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
           EN ELS +IWMCGG I  +PCSR+GH+ +    ++ G +        +TYN  R+   W 
Sbjct: 340 ENLELSLRIWMCGGQIFILPCSRVGHITKQQFSHSSGVIRA------MTYNSLRLAHVWL 393

Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
           DE +K   + R P   F+  G+ISE+
Sbjct: 394 DE-YKEQVFLRRPGLRFIPYGNISER 418


>gi|194856530|ref|XP_001968770.1| GG24317 [Drosophila erecta]
 gi|190660637|gb|EDV57829.1| GG24317 [Drosophila erecta]
          Length = 630

 Score =  284 bits (726), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 146/362 (40%), Positives = 218/362 (60%), Gaps = 11/362 (3%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           L P ++  K  PGE GK   +P   +        E   N+  S+ IS +R++ D+R E C
Sbjct: 118 LAPTVKEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 177

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           +   Y   LP  S+++VFHNE +++L+RTV S+I R+P   L+EIILVDD S +  L ++
Sbjct: 178 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE+Y+ +   K  ++R  +R GLIR R  GA+   GEVI FLDAHCE    WL PLLA I
Sbjct: 238 LEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 297

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             +R+ +  P+ID I  +T+E+  +   D  + G F W + ++   +P RE  +R  + +
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDRT 354

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF+IW CGG +E +PCS +G
Sbjct: 355 APLRTPTMAGGLFSIDKDYFYELGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVG 414

Query: 312 HVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           HV+R   PY F G +A      ++ +N  RV E W DE  + ++Y+    A     GD+S
Sbjct: 415 HVFRDKSPYTFPGGVA-----KIVLHNAARVAEVWLDE-WRDFYYSMSTGARKASAGDVS 468

Query: 371 EQ 372
           ++
Sbjct: 469 DR 470


>gi|149634819|ref|XP_001513114.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
           [Ornithorhynchus anatinus]
          Length = 608

 Score =  284 bits (726), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 150/332 (45%), Positives = 199/332 (59%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  N+  SN +   R +PD R  ECK   YP  LP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNVLISNRLGSHRDVPDTRDAECKEKSYPPHLPAASVVICFYNEAFSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRS 160
           +HS++ RTPA  L EIILVDD S   DL   L++YI+      +++IRN +REGLIR R 
Sbjct: 171 IHSVLDRTPAHLLHEIILVDDNSELDDLKSGLDEYIRLHLPRNIQVIRNEKREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA ++ GEV+VFLD+HCEV   WL PLL PI  DR+ +  PVID I   T  + S    
Sbjct: 231 IGAAQATGEVLVFLDSHCEVNAMWLQPLLVPIREDRRTVVCPVIDIIGADTLAYSS---- 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E       + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELGGPGRATAPIKSPTMAGGLFAMNREYFRELGQYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY      D      + +N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGQLFIIPCSRVGHIFRKRRPYGSPGGQD-----TMAHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R P       G+ISE+
Sbjct: 402 LAHVWMDEYKEQYFALR-PELRLRSYGNISER 432


>gi|24581865|ref|NP_608906.2| polypeptide GalNAc transferase 5, isoform A [Drosophila
           melanogaster]
 gi|195342664|ref|XP_002037920.1| GM18035 [Drosophila sechellia]
 gi|51315874|sp|Q6WV17.2|GALT5_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
           Short=pp-GaNTase 5; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 5; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 5
 gi|22945641|gb|AAF52218.2| polypeptide GalNAc transferase 5, isoform A [Drosophila
           melanogaster]
 gi|194132770|gb|EDW54338.1| GM18035 [Drosophila sechellia]
          Length = 630

 Score =  283 bits (725), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 145/362 (40%), Positives = 218/362 (60%), Gaps = 11/362 (3%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           L P ++  K  PGE GK   +P   +        E   N+  S+ IS +R++ D+R E C
Sbjct: 118 LAPSVQEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 177

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           +   Y   LP  S+++VFHNE +++L+RTV S+I R+P   L+EIILVDD S +  L ++
Sbjct: 178 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE+Y+ +   K  ++R  +R GLIR R  GA+   GEVI FLDAHCE    WL PLLA I
Sbjct: 238 LEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 297

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             +R+ +  P+ID I  +T+E+  +   D  + G F W + ++   +P RE  +R  + +
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDRT 354

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +G
Sbjct: 355 APLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVG 414

Query: 312 HVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           HV+R   PY F G +A      ++ +N  RV E W DE  + ++Y+    A     GD+S
Sbjct: 415 HVFRDKSPYTFPGGVA-----KIVLHNAARVAEVWLDE-WRDFYYSMSTGARKASAGDVS 468

Query: 371 EQ 372
           ++
Sbjct: 469 DR 470


>gi|268575444|ref|XP_002642701.1| C. briggsae CBR-GLY-3 protein [Caenorhabditis briggsae]
          Length = 611

 Score =  283 bits (724), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 146/353 (41%), Positives = 217/353 (61%), Gaps = 13/353 (3%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW----DYPLD 80
           G+GG    +PE  ++  +    E   N+  S  IS +RT+PD R E C+         + 
Sbjct: 110 GQGGTGVTVPEDQKSIKEKRFLENQFNVVASEMISVNRTLPDYRSEACRNAAGNEKTTVG 169

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  S+I+VFHNE +++L+RT+HS+I R+P   LEEII++DD S +  L + L+ YI++F
Sbjct: 170 LPTTSIIIVFHNEAWTTLLRTLHSVINRSPRHLLEEIIMIDDKSDRDYLVKPLDAYIKKF 229

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
              V L+   ER GLIR R  G+  ++G++++FLDAH EV   WL PL+  +  DRK + 
Sbjct: 230 PIPVHLVHLEERSGLIRARLTGSGMAKGKILLFLDAHVEVTDGWLEPLVHRVAEDRKRVV 289

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
            P+ID I   T+E+ +  E      G F W + ++   +P+RE  +R  + S P ++PT 
Sbjct: 290 APIIDVISDDTFEYVTASETTW---GGFNWHLNFRWYAVPKRELNRRGSDRSMPIQTPTI 346

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLFA+D+ FF ++G YD G+ VWGGEN E+SF++WMCGGS+E  PCSR+GHV+R   P
Sbjct: 347 AGGLFAIDKQFFYDIGSYDEGMQVWGGENLEISFRVWMCGGSLEIHPCSRVGHVFRKQTP 406

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           Y F     +V    I +N  R  E W DE +KA+FY   P A  ++ GD++++
Sbjct: 407 YTFPGGTAKV----IHHNAARTAEVWMDE-YKAFFYKMVPAAKNVEAGDVTDR 454


>gi|341900678|gb|EGT56613.1| CBN-GLY-3 protein [Caenorhabditis brenneri]
          Length = 613

 Score =  283 bits (724), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 145/352 (41%), Positives = 216/352 (61%), Gaps = 12/352 (3%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD---L 81
           G+GG    +PE  ++  +    E   N+  S  IS +RT+PD R E C+     +    +
Sbjct: 111 GQGGTGVTVPEEKKSIKEKRFLENQFNVVASEMISVNRTLPDYRSEACRTAGNSIKTTGM 170

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  S+I+VFHNE +++L+RT+HS+I R+P   LEEII++DD S +  L + L+ YI+   
Sbjct: 171 PTTSIIIVFHNEAWTTLLRTLHSVINRSPRHLLEEIIMIDDKSDRDYLVKPLDAYIKALP 230

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
             V L+   ER GLIR R  G+  ++G++++FLDAH EV   WL PL++ +  DRK +  
Sbjct: 231 VPVHLVHLEERSGLIRARLTGSGMAKGKILLFLDAHVEVTEGWLEPLISRVAEDRKRVVA 290

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
           P+ID I   T+E+ +  E      G F W + ++   +P+RE  +R  + S P ++PT A
Sbjct: 291 PIIDVISDDTFEYVTASETTW---GGFNWHLNFRWYSVPKRELNRRGSDRSMPIQTPTIA 347

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+D+ FF ++G YD G+ VWGGEN E+SF++WMCGGS+E  PCSR+GHV+R   PY
Sbjct: 348 GGLFAIDKQFFYDIGSYDEGMQVWGGENLEISFRVWMCGGSLEIHPCSRVGHVFRKQTPY 407

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            F     +V    I +N  R  E W DE +KA+FY   P A  ++ GD++E+
Sbjct: 408 TFPGGTAKV----IHHNAARTAEVWMDE-YKAFFYKMVPAARNVEAGDVTER 454


>gi|195147490|ref|XP_002014712.1| GL18803 [Drosophila persimilis]
 gi|194106665|gb|EDW28708.1| GL18803 [Drosophila persimilis]
          Length = 630

 Score =  283 bits (724), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 145/362 (40%), Positives = 217/362 (59%), Gaps = 11/362 (3%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           L P +E  K  PGE GK   +P   +        E   N+  S+ IS +R++ D+R E C
Sbjct: 118 LAPTVEEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 177

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           +   Y   LP  S+++VFHNE +++L+RTV S+I R+P   L+EIILVDD S +  L ++
Sbjct: 178 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LEDY+ +   +  ++R  +R GLIR R  GA+   G+VI FLDAHCE    WL PLLA I
Sbjct: 238 LEDYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVSGDVITFLDAHCECTEGWLEPLLARI 297

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             +R+ +  P+ID I  +T+E+  +   D  + G F W + ++   +P RE  +R  + +
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMSRRNNDRT 354

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +G
Sbjct: 355 APLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVG 414

Query: 312 HVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           HV+R   PY F G +A      ++ +N  RV E W DE  + ++Y     A     GD+S
Sbjct: 415 HVFRDKSPYTFPGGVA-----KIVLHNAARVAEVWLDE-WRDFYYAMSTGARKASAGDVS 468

Query: 371 EQ 372
           ++
Sbjct: 469 DR 470


>gi|34042969|gb|AAQ56702.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           [Drosophila melanogaster]
          Length = 617

 Score =  283 bits (724), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 145/362 (40%), Positives = 218/362 (60%), Gaps = 11/362 (3%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           L P ++  K  PGE GK   +P   +        E   N+  S+ IS +R++ D+R E C
Sbjct: 105 LAPSVQEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 164

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           +   Y   LP  S+++VFHNE +++L+RTV S+I R+P   L+EIILVDD S +  L ++
Sbjct: 165 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 224

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE+Y+ +   K  ++R  +R GLIR R  GA+   GEVI FLDAHCE    WL PLLA I
Sbjct: 225 LEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 284

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             +R+ +  P+ID I  +T+E+  +   D  + G F W + ++   +P RE  +R  + +
Sbjct: 285 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDRT 341

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +G
Sbjct: 342 APLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVG 401

Query: 312 HVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           HV+R   PY F G +A      ++ +N  RV E W DE  + ++Y+    A     GD+S
Sbjct: 402 HVFRDKSPYTFPGGVA-----KIVLHNAARVAEVWLDE-WRDFYYSMSTGARKASAGDVS 455

Query: 371 EQ 372
           ++
Sbjct: 456 DR 457


>gi|196001851|ref|XP_002110793.1| hypothetical protein TRIADDRAFT_11844 [Trichoplax adhaerens]
 gi|190586744|gb|EDV26797.1| hypothetical protein TRIADDRAFT_11844, partial [Trichoplax
           adhaerens]
          Length = 490

 Score =  283 bits (724), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 151/351 (43%), Positives = 207/351 (58%), Gaps = 14/351 (3%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
           G+ G A  +P   + A +        N   S+ IS  RT+PD R   CK   +PL LP  
Sbjct: 1   GQNGTAVIVPAESKNASEQLFNRNHFNQWISDRISLHRTLPDPRHPMCKDQIFPLHLPTT 60

Query: 85  SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS---KADLDQKLEDYIQRFN 141
           SV++VFHNE +S+L+RTVHSI+ R+P   L EIIL DD+S     A+L   LE Y  +  
Sbjct: 61  SVVVVFHNEAWSTLLRTVHSILSRSPPDLLHEIILQDDYSDPIGHAELFMPLELYTSKLE 120

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
            KV++ RN + EGLIR+R  G   +   V+ FLDAHCEV   WL PLL  IY +   +  
Sbjct: 121 -KVKIFRNEKHEGLIRSRLNGFSHATAPVVTFLDAHCEVTTGWLEPLLERIYLNETTVVC 179

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
           P ID ID +T++++  + P    RG+F W + ++   +P  E K+RK   +P  SPT AG
Sbjct: 180 PEIDVIDDRTFQYQ--FGPPALMRGVFNWQLYFRWALIPPEEHKRRKSPIDPVWSPTMAG 237

Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
           GLFA+ + FF  LG YD    VWGGEN E+SFK W+CGG +E VPCSR+GHV+R   PY 
Sbjct: 238 GLFAISKKFFKRLGTYDDQFDVWGGENMEISFKAWLCGGKLEIVPCSRVGHVFRHNQPYK 297

Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           FG       G  ++ N +RV E W D+ +K +FY  +P     + G+I+E+
Sbjct: 298 FG-------GNFLSRNSQRVAEVWLDD-YKEFFYQVQPHLRKEEFGNIAER 340


>gi|157107416|ref|XP_001649767.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108884053|gb|EAT48278.1| AAEL000654-PA [Aedes aegypti]
          Length = 607

 Score =  283 bits (724), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 152/356 (42%), Positives = 214/356 (60%), Gaps = 13/356 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE G A HL +      D    + G N   S+ IS +R++PD+R + C+   Y
Sbjct: 80  EEKRTGIGEHGIAGHLEKKDEDMKDKLFKKNGFNAVLSDLISLNRSLPDIRHKGCRKKKY 139

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             +LP  SV++ F+NE +S+L+RT  S++ R+P + + EIILVDD S+K  L  +L+ Y+
Sbjct: 140 LSELPTVSVVVPFYNEHWSTLLRTASSVLLRSPPELITEIILVDDCSTKEFLKDQLDRYV 199

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
           +    KV++I   ER GLI  R  GAK +  +V++FLD+H E  +NWLPPLL PI  D K
Sbjct: 200 EENMPKVKVIHLPERSGLITARLAGAKVATADVLIFLDSHTEANINWLPPLLEPIAEDYK 259

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
               P ID ID+  +E+R+    D   RG F+W   YK   L +++ +     +EP++SP
Sbjct: 260 TCVCPFIDVIDWDNFEYRA---QDEGARGAFDWKFFYKRLPLLQKDLENP---TEPFESP 313

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+   FF ELGGYD GL +WGGE +ELSFKIW CGG +   PCSR+GH+YR +
Sbjct: 314 VMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRMYDAPCSRVGHIYRGY 373

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMGDISEQ 372
            P+   +  D      ++ NYKRV E W DE +K Y Y R+       D GD+S+Q
Sbjct: 374 APFGNPRKKD-----FLSRNYKRVAEVWMDE-YKEYLYMRDRKKYDNTDAGDLSKQ 423


>gi|170056949|ref|XP_001864263.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
 gi|167876550|gb|EDS39933.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
          Length = 608

 Score =  283 bits (724), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 157/361 (43%), Positives = 222/361 (61%), Gaps = 22/361 (6%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLG-----EYGMNMETSNHISFDRTIPDLRMEEC 72
           E  + GPGE GK    P   R  GD  +      E G +   S+ I+ +R+IPD+R  +C
Sbjct: 82  ETERHGPGEHGK----PVKLRDPGDIKMNDKLYKENGYSAVVSDLIALNRSIPDIRHPQC 137

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           +   Y  +LP  SVI++F+NE +S+L+RTV+S++ R+P+  L+EIILV+D S+K  L + 
Sbjct: 138 RKKRYLQELPTVSVIIIFYNEHWSALLRTVYSVLNRSPSHLLKEIILVNDHSTKPFLWKP 197

Query: 133 LEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 191
           L+++++   + KV+LI   ER GLI  R  GAK + G+V++ LD+H EV +NWLPPLL P
Sbjct: 198 LQEFVESELSPKVKLIHLPERSGLIIARLAGAKAASGDVLIVLDSHTEVNVNWLPPLLEP 257

Query: 192 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 251
           I  D +    P+ID I + T+E+RS    D   RG F+W   YK   LP R        +
Sbjct: 258 IAQDYRTCVCPLIDVIVHDTFEYRS---QDEGKRGAFDWKFYYKR--LPLRPGDLDD-PT 311

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
           EP++SP  AGGLFA+   FF ELGGYD GL +WGGE +ELSFKIW CGG +   PCSR+G
Sbjct: 312 EPFESPIMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRMVDAPCSRVG 371

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HVYR + P+   +  +      +T N+KRV E W DE +K + Y R P     + GD+++
Sbjct: 372 HVYRGYSPFPNPRGVN-----FVTRNFKRVAEVWMDE-YKQFLYERNPQFDKTNPGDLTK 425

Query: 372 Q 372
           Q
Sbjct: 426 Q 426


>gi|125985507|ref|XP_001356517.1| GA16368 [Drosophila pseudoobscura pseudoobscura]
 gi|54644841|gb|EAL33581.1| GA16368 [Drosophila pseudoobscura pseudoobscura]
          Length = 630

 Score =  283 bits (724), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 145/362 (40%), Positives = 217/362 (59%), Gaps = 11/362 (3%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           L P +E  K  PGE GK   +P   +        E   N+  S+ IS +R++ D+R E C
Sbjct: 118 LAPTVEEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 177

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           +   Y   LP  S+++VFHNE +++L+RTV S+I R+P   L+EIILVDD S +  L ++
Sbjct: 178 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LEDY+ +   +  ++R  +R GLIR R  GA+   G+VI FLDAHCE    WL PLLA I
Sbjct: 238 LEDYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVSGDVITFLDAHCECTEGWLEPLLARI 297

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             +R+ +  P+ID I  +T+E+  +   D  + G F W + ++   +P RE  +R  + +
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMSRRNNDRT 354

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +G
Sbjct: 355 APLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVG 414

Query: 312 HVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           HV+R   PY F G +A      ++ +N  RV E W DE  + ++Y     A     GD+S
Sbjct: 415 HVFRDKSPYTFPGGVA-----KIVLHNAARVAEVWLDE-WRDFYYAMSTGARKASAGDVS 468

Query: 371 EQ 372
           ++
Sbjct: 469 DR 470


>gi|170039457|ref|XP_001847550.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
 gi|167863027|gb|EDS26410.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
          Length = 619

 Score =  283 bits (723), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 158/361 (43%), Positives = 221/361 (61%), Gaps = 22/361 (6%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLG-----EYGMNMETSNHISFDRTIPDLRMEEC 72
           E  + GPGE GK    P   R  GD  L      E G +   S+ I+ +R+IPD+R  +C
Sbjct: 93  ETERHGPGEHGK----PLKLRDPGDIKLNDKLYKENGYSAVVSDLIALNRSIPDIRHPQC 148

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           +   Y  +LP  SVI++F+NE +S+L+RTV+S++ R+P   L+EIILV+D S+K  L + 
Sbjct: 149 RKKRYLQELPTVSVIIIFYNEHWSALLRTVYSVLNRSPPHLLKEIILVNDHSTKPFLWKP 208

Query: 133 LEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 191
           L+++++   + KV+LI   ER GLI  R  GAK + G+V++ LD+H EV +NWLPPLL P
Sbjct: 209 LQEFVESELSPKVKLIHLPERSGLIIARLAGAKAASGDVLIVLDSHTEVNVNWLPPLLEP 268

Query: 192 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 251
           I  D +    P+ID I + T+E+RS    D   RG F+W   YK   LP R        +
Sbjct: 269 IAQDYRTCVCPLIDVIVHDTFEYRS---QDEGKRGAFDWKFYYKR--LPLRPGDLDD-PT 322

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
           EP++SP  AGGLFA+   FF ELGGYD GL +WGGE +ELSFKIW CGG +   PCSR+G
Sbjct: 323 EPFESPIMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRMVDAPCSRVG 382

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HVYR + P+   +  +      +T N+KRV E W DE +K + Y R P     + GD+++
Sbjct: 383 HVYRGYSPFPNPRGVN-----FVTRNFKRVAEVWMDE-YKQFLYERNPQFDKTNPGDLTK 436

Query: 372 Q 372
           Q
Sbjct: 437 Q 437


>gi|326436254|gb|EGD81824.1| hypothetical protein PTSG_02538 [Salpingoeca sp. ATCC 50818]
          Length = 604

 Score =  283 bits (723), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 150/339 (44%), Positives = 202/339 (59%), Gaps = 9/339 (2%)

Query: 35  EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNE 93
           E  +   D        N   S+ IS  R I D R   CK   YPLD LP  +VI+ FHNE
Sbjct: 106 EEVKQEQDEGWKRNNFNQYISDRISLHRPIKDTRHAMCKDRTYPLDKLPDTTVIIPFHNE 165

Query: 94  GFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTERE 153
             ++L+RTV SI+ R+P   + EI+L+DD S+   L   L++ +     K R++R +ER 
Sbjct: 166 ARTTLLRTVWSILDRSPPSLINEILLIDDASTMEHLKAPLDEELATI-PKTRVLRLSERS 224

Query: 154 GLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWE 213
           GLIR +  GA++++G+V+ FLD+HCE  + WL PLL  IY DR  +  PVID ID +T+ 
Sbjct: 225 GLIRAKVFGAEQAKGKVVTFLDSHCECNVGWLEPLLERIYLDRTTVVTPVIDNIDKKTFA 284

Query: 214 FRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLE 273
           +     P    RGIF W + +   +LP  E KKRK    P  SPT AGGLF+MDR +F E
Sbjct: 285 YTG--SPTVITRGIFTWSLTFSWLDLPWFEQKKRKDPIAPLPSPTMAGGLFSMDREYFFE 342

Query: 274 LGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPL 333
           +G YD G+ VWGGEN E+SF+IW CGG++E++PCSR+GHVYR F PY F   A +     
Sbjct: 343 IGSYDMGMDVWGGENLEISFRIWQCGGTLEFIPCSRVGHVYRDFHPYKFPSGAVQT---- 398

Query: 334 ITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           I  N  RV E W DE +K  +Y   P    +  GDIS++
Sbjct: 399 INKNLNRVAEVWMDE-YKELYYGVRPHHRAIGTGDISDR 436


>gi|308487864|ref|XP_003106127.1| CRE-GLY-6 protein [Caenorhabditis remanei]
 gi|308254701|gb|EFO98653.1| CRE-GLY-6 protein [Caenorhabditis remanei]
          Length = 693

 Score =  282 bits (722), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 151/365 (41%), Positives = 223/365 (61%), Gaps = 13/365 (3%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
            NL  P E + EG   G    HL    +   D++      N+  S+ IS  R++P++R  
Sbjct: 89  ANLYSPREEWGEG---GSGVTHLTPEQQKLADSTFAVNQFNLFVSDGISVRRSLPEIRKP 145

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
            C+   YP DLP  SVI+V+HNE +S+L+RTV S+I R+P   L EI+LVDDFS +  L 
Sbjct: 146 SCRNITYPEDLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKHLLREILLVDDFSDRDFLR 205

Query: 131 -QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
             KL++ ++     +++IR+ +R GLIR R  GA+E++G+V+ FLD+HCE    WL PLL
Sbjct: 206 YPKLDESLKPLPTDIKIIRSNQRVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLL 265

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
             I  +RK +  PVID I+  T++++   E    +RG F W + ++   +P   AK+   
Sbjct: 266 TRIKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPTEMAKQHLL 322

Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
           + + P +SPT AGGLF++DR +F ELG YDPG+ +WGGEN E+SF+IW CGG +E +PCS
Sbjct: 323 DPTGPIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCS 382

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL-DMG 367
            +GHV+R   P++F     +  G ++  N  RV E W DE  K YFY   P+A  + +  
Sbjct: 383 HVGHVFRKSSPHDF---PGKSSGKVLNANLLRVAEVWMDE-WKYYFYKIAPVAFRMRESI 438

Query: 368 DISEQ 372
           D+SE+
Sbjct: 439 DVSER 443


>gi|16648224|gb|AAL25377.1| GH23657p [Drosophila melanogaster]
          Length = 536

 Score =  282 bits (722), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 145/362 (40%), Positives = 218/362 (60%), Gaps = 11/362 (3%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           L P ++  K  PGE GK   +P   +        E   N+  S+ IS +R++ D+R E C
Sbjct: 24  LAPSVQEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 83

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           +   Y   LP  S+++VFHNE +++L+RTV S+I R+P   L+EIILVDD S +  L ++
Sbjct: 84  RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 143

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE+Y+ +   K  ++R  +R GLIR R  GA+   GEVI FLDAHCE    WL PLLA I
Sbjct: 144 LEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 203

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             +R+ +  P+ID I  +T+E+  +   D  + G F W + ++   +P RE  +R  + +
Sbjct: 204 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDRT 260

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +G
Sbjct: 261 APLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVG 320

Query: 312 HVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           HV+R   PY F G +A      ++ +N  RV E W DE  + ++Y+    A     GD+S
Sbjct: 321 HVFRDKSPYTFPGGVA-----KIVLHNAARVAEVWLDE-WRDFYYSMSTGARKASAGDVS 374

Query: 371 EQ 372
           ++
Sbjct: 375 DR 376


>gi|348568063|ref|XP_003469818.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5-like
           [Cavia porcellus]
          Length = 499

 Score =  282 bits (722), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 151/349 (43%), Positives = 203/349 (58%), Gaps = 16/349 (4%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           PG+    Y  PE +         EYG N+  S  +  DR +PD R + C++  YPL LP 
Sbjct: 138 PGDQNINYSDPELFNG-----YLEYGFNVIVSRSLGHDREVPDTRDKSCRHRHYPLHLPT 192

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
           ASVI+ FHNE F++L+RTV S++  TP   LEEIILVDD S   DL  KL  Y++ F  K
Sbjct: 193 ASVIICFHNEEFNALLRTVSSVVYLTPPYLLEEIILVDDMSKFDDLKSKLNYYLESFRDK 252

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
           V+L+RN +REGLIR R  GA  + GEV+VFLD+HCEV   WL PLLA I  D + +  PV
Sbjct: 253 VQLVRNKKREGLIRARMIGAWYASGEVLVFLDSHCEVNRVWLEPLLAAISKDSRTVVTPV 312

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 263
           ID ID  + +    Y P    RG F+W + +K + +   E       + P +SP  AGG+
Sbjct: 313 IDIIDGISLQ----YLPSPLVRGAFDWKLQFKWDSVFSYETDSEGSPTNPIRSPAMAGGI 368

Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
           FAM R FF ELG YD  + +WGGEN ELS +IWMCGG +  +PCSR+GH+ + +      
Sbjct: 369 FAMHRPFFYELGEYDKDMDLWGGENLELSLRIWMCGGQLLIIPCSRVGHITKLY------ 422

Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
              D      +  N+ R++  W DE +K  F+ R P    +  G+ISE+
Sbjct: 423 SKPDSALSKAVARNHLRLVHVWLDE-YKEQFFLRNPDLKSMTYGNISER 470


>gi|198422185|ref|XP_002121130.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
           4 [Ciona intestinalis]
          Length = 582

 Score =  282 bits (722), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 158/368 (42%), Positives = 216/368 (58%), Gaps = 24/368 (6%)

Query: 13  LEPPLEPYKEGPGEGGKAYHL----PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           L  P +    GPGEGG A  L    PE  +   D S+  Y +N   S  IS  R + D R
Sbjct: 67  LSRPADIDPRGPGEGGSAVRLLNLSPEVSKQQED-SIQTYAVNQFVSERISLHRRLQDPR 125

Query: 69  MEECKY---WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
            E CK    +DY   LP  SV++ F+NEG+S+L+RTV S++  +P   L EIILVDD+S 
Sbjct: 126 HEMCKSRRPFDY-RSLPTTSVVIAFYNEGWSTLIRTVFSVLHNSPDALLTEIILVDDYSD 184

Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
           K  L  KL D+++    +VRL+R T+REGL+R R  GA  ++GEV+ FLD HCE    WL
Sbjct: 185 KVYLKDKLADFLKAL-ARVRLVRTTKREGLVRARLLGASLAKGEVLTFLDCHCECVEGWL 243

Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-GIFEWGMLYKENELPEREA 244
            PLL  I  D  ++ VPVID ID+ T+E+   Y   H  + G F+W + ++ + +P+ E 
Sbjct: 244 EPLLERIMEDESVIVVPVIDTIDWNTFEY---YYGGHEPQIGGFDWRLTFQWHTIPDHER 300

Query: 245 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
           K+RK   +P +SPT AGGLFA+ + +F  +G YD G+ +WGGEN ELSF+ WMCGG +E 
Sbjct: 301 KRRKSPVDPIRSPTMAGGLFAVSKRYFTRIGTYDAGMEIWGGENLELSFRTWMCGGKLET 360

Query: 305 VPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
           +PCS +GHV+    PY           P    N  R  E W D+ +K +FY R P A   
Sbjct: 361 IPCSHVGHVFPKQSPY---------PRPKFLTNTLRAAEVWMDD-YKRHFYIRNPPASKE 410

Query: 365 DMGDISEQ 372
           + GDIS +
Sbjct: 411 NYGDISAR 418


>gi|449666442|ref|XP_002161887.2| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 6-like [Hydra
           magnipapillata]
          Length = 591

 Score =  282 bits (721), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 161/370 (43%), Positives = 213/370 (57%), Gaps = 27/370 (7%)

Query: 17  LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
           L P     G  G+A       +A  D S  +YG N   S+ IS +R+IPD R   C   D
Sbjct: 67  LNPEPGSAGMEGQAVSNSVNEKAIEDKSFDDYGFNELASSKISLERSIPDNRDSSCFNVD 126

Query: 77  YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK---ADLDQKL 133
           YP+ L   SVI++FHNE +S L+RTVH+++ R+P   L+EIILVDD S K     L +KL
Sbjct: 127 YPVKLSTTSVIVIFHNEAWSVLLRTVHTVLARSPPHMLKEIILVDDASVKEKYGHLGEKL 186

Query: 134 EDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIY 193
           E+Y+   + KV+LIR+  R GL + R  GA  + GEV+VFLD+HCE    WL PLLA + 
Sbjct: 187 ENYVNTLS-KVKLIRSPVRVGLTQARLIGADNAVGEVLVFLDSHCEASFGWLEPLLARLQ 245

Query: 194 SDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEP 253
            + K+  VP I+ I ++ +E+ S  E   + RGIF W +++    LP RE  +RKY S+P
Sbjct: 246 ENPKLAVVPDIEVISFKNFEYSS--EKGSYNRGIFSWELMFNWGPLPPREKMRRKYESDP 303

Query: 254 YKSPTHAGGLFAMDRAFFLELGGYD---------PGLLVWGGENFELSFKIWMCGGSIEW 304
            KSPT AGGLFAM+R +F E G YD           L  WGGEN E+SF++WMCG  IE 
Sbjct: 304 IKSPTMAGGLFAMNRKYFFESGAYDRQNILGRXXXXLTYWGGENVEMSFRLWMCGEGIEI 363

Query: 305 VPCSRIGHVYRSFMPYNFGKLADRVKGP--LITYNYKRVIETWFDEKHKAYFYTREPLAM 362
           +PCSR+GHV+R   PY         K P     +N  RV E W DE  K  FY+      
Sbjct: 364 IPCSRVGHVFRERAPY---------KSPDGSTDHNSIRVAEVWMDE-FKEIFYSFRANLK 413

Query: 363 FLDMGDISEQ 372
               GD+SE+
Sbjct: 414 PEQGGDVSER 423


>gi|443683126|gb|ELT87494.1| hypothetical protein CAPTEDRAFT_198873 [Capitella teleta]
          Length = 495

 Score =  282 bits (721), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 147/345 (42%), Positives = 216/345 (62%), Gaps = 10/345 (2%)

Query: 28  GKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD-LPKASV 86
           G+A  +PE+  A           N+  S  IS +RT+ D+RM+ CK   YP++ LP  SV
Sbjct: 2   GQAVIIPESQHAEMKEKFKVNQFNLMASELISVNRTLRDVRMDSCKSKTYPVESLPTTSV 61

Query: 87  ILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRL 146
           ++VFHNE +S+L+RTVHS+I R+P   L+EIILVDD S K  L ++L++Y+ + +  V +
Sbjct: 62  VIVFHNEAWSTLLRTVHSVINRSPPPLLKEIILVDDASEKDFLGRQLDEYLSKLSVHVYV 121

Query: 147 IRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDG 206
           +R  +R GLIR R +GA  + G+VI FLDAHCE    WL PLL  I+ +RK +  P+ID 
Sbjct: 122 LRMEKRTGLIRARLKGAARAEGKVITFLDAHCECTEGWLEPLLFEIHKNRKSVVCPIIDV 181

Query: 207 IDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFA 265
           I  +T+E+  +   D  + G F W + ++   +P+RE ++R  + S P +SPT AGGL A
Sbjct: 182 ISDETFEY--ITGSDMTWGG-FNWKLNFRWYPVPQREVERRGGDRSLPLRSPTMAGGLLA 238

Query: 266 MDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKL 325
           ++R +F E+G YD G+ +WGGEN E+SF+IWMCGG++  V CS +GHV+R   PY F   
Sbjct: 239 IERDYFYEIGSYDDGMDIWGGENLEMSFRIWMCGGTLLIVTCSHVGHVFRKATPYTFPGG 298

Query: 326 ADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
             R+    I +N  R+ E W DE  ++++Y   P     D GD+S
Sbjct: 299 TGRI----INHNNARLAEVWMDE-WRSFYYKINPGVKQTDYGDLS 338


>gi|312371733|gb|EFR19844.1| hypothetical protein AND_21714 [Anopheles darlingi]
          Length = 637

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 154/357 (43%), Positives = 214/357 (59%), Gaps = 14/357 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAY-RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
           E  + GPGE GK Y L     +A  D    E G +   S+ I+ +R++PD+R   C+   
Sbjct: 89  EANRVGPGEHGKPYRLTGVEEKALNDKLFKENGYSAVVSDMIALNRSVPDIRHISCRTKA 148

Query: 77  YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
           Y  +LP  SVI++F+NE +S+L+RTV+S++ R+PA  L+E+ILV+D S+K  L   L ++
Sbjct: 149 YLRELPTVSVIVIFYNEHWSALLRTVYSVLNRSPASLLKEVILVNDHSTKPFLWAPLREF 208

Query: 137 IQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           ++     KVRLI   ER GLI  R  GA+E+RG+V++ LD+H EV  NWLPPLL PI  D
Sbjct: 209 VESELAPKVRLIDLPERSGLILARMAGAREARGDVLIVLDSHTEVNNNWLPPLLEPIAED 268

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYK 255
            +    P ID I + T+++R+    D   RG F+W   YK   L   +       ++P+ 
Sbjct: 269 YRTCVCPFIDVIAHDTFQYRA---QDEGKRGAFDWKFYYKRLPLLPGDLDD---PTKPFN 322

Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
           SP  AGGLFA+   FF ELGGYD GL +WGGE +ELSFKIW CGG +   PCSR+GHVYR
Sbjct: 323 SPVMAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRLVDAPCSRVGHVYR 382

Query: 316 SFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            + P+   +  +      +  N+KRV E W DE  K + Y R PL    D GD++ Q
Sbjct: 383 GYAPFGNPRGVN-----FVVRNFKRVAEVWMDEYAK-FLYERNPLFEKTDPGDLTAQ 433


>gi|47228512|emb|CAG05332.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 595

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 151/352 (42%), Positives = 204/352 (57%), Gaps = 24/352 (6%)

Query: 35  EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEG 94
           EA +   D+    +  N+  S  + + R +PD R  +C+   YP DLP+ASV++ F NE 
Sbjct: 79  EADQQLRDSGYHRHAFNLLISTRLGYHRELPDTRDPQCRDRTYPGDLPRASVVICFFNEA 138

Query: 95  FSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTERE 153
            S+L+RTVHS++ RTP   L EIILVDD+S   +L   L+ Y+Q    GKVR++RN +RE
Sbjct: 139 LSALLRTVHSVLDRTPPFLLHEIILVDDYSELEELKGDLDRYVQAELRGKVRVLRNQKRE 198

Query: 154 GLIRTRSRGAKESRG-------------EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
           GLIR R  GA ++ G             EV+VFLD+HCEV   WL PLLAPI  DR+ + 
Sbjct: 199 GLIRGRMIGAAQASGVSPDPQILDLCSGEVLVFLDSHCEVNQMWLQPLLAPIRQDRRTVV 258

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            PVID I   T      Y P    RG F WG+ +K + +P  E K  +    P +SPT A
Sbjct: 259 CPVIDIISADTLS----YSPSPIVRGGFNWGLHFKWDPVPPAELKSPQGPVGPIRSPTMA 314

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA++R +F E+G YD G+ +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY
Sbjct: 315 GGLFAINRKYFNEIGQYDAGMDIWGGENLEISFRIWMCGGQLFIIPCSRVGHIFRKRRPY 374

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                 D      + +N  R+   W DE  + Y   R  L    D GDI E+
Sbjct: 375 GSPGGQD-----TMAHNSLRLAHVWMDEYKEQYLSMRPDLRQ-RDYGDIGER 420


>gi|147907290|ref|NP_001085038.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Xenopus
           laevis]
 gi|47506925|gb|AAH71009.1| MGC81150 protein [Xenopus laevis]
          Length = 582

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 161/375 (42%), Positives = 219/375 (58%), Gaps = 26/375 (6%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLP--EAYRAAGDASLGEYGMNMETSNHI 58
           +PV+K        +PP +P    PGE GKA  L      +   + S+ +Y +N+  S+ I
Sbjct: 65  QPVYK--------KPPPDP--NMPGEWGKAARLELGPTEKKMQEESIEKYALNIYLSDQI 114

Query: 59  SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           S  R I D RM ECK   +    LP  SVI+ F+NE  S+L+RT+HS+++ +PA  L EI
Sbjct: 115 SLHRHIMDNRMYECKSKTFSYRKLPTTSVIIAFYNEALSTLLRTIHSVLESSPAVLLREI 174

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
           ILVDDFS K  L  +LEDYI   + +VRLIR T+REGL+R R  GA  + G+V+ FLD H
Sbjct: 175 ILVDDFSDKVYLKSQLEDYIGGLD-RVRLIRTTKREGLVRARIIGATYAIGDVLTFLDCH 233

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKEN 237
           CE    WL PLL  I  +   +  PVID ID+ T+EF    +      G F+W + ++ +
Sbjct: 234 CECVTGWLEPLLERIGENETAVVCPVIDTIDWNTFEF--YMQTGEPMIGGFDWRLTFQWH 291

Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
            +PE+E ++RK   +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W 
Sbjct: 292 AVPEKERQRRKSRIDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMEVWGGENLELSFRVWQ 351

Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           CGG++E  PCS +GHV+    PY           P    N  R  E W D  +K  FY R
Sbjct: 352 CGGTLEIEPCSHVGHVFPKKAPY---------ARPNFLQNTARAAEVWMD-GYKELFYNR 401

Query: 358 EPLAMFLDMGDISEQ 372
            P A   + GDISE+
Sbjct: 402 NPPAQKENYGDISER 416


>gi|268574330|ref|XP_002642142.1| C. briggsae CBR-GLY-6 protein [Caenorhabditis briggsae]
          Length = 617

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 150/353 (42%), Positives = 216/353 (61%), Gaps = 12/353 (3%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
            NL  P + + EG   G    HL    +   D++      N+  S+ IS  R++P++R  
Sbjct: 89  ANLYSPHDDWGEG---GTGVSHLTPEQQKRADSTFAVNQFNLLVSDGISVRRSLPEIRKP 145

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
            C+   YP DLP  SVI+V+HNE +S+L+RTV S+I R+P   L+EIILVDDFS +  L 
Sbjct: 146 SCRNITYPEDLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKHLLKEIILVDDFSDREFLR 205

Query: 131 -QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
             KL++ I+     +++IR+ ER GLIR R  GA+E++G+V+ FLD+HCE    WL PLL
Sbjct: 206 YPKLDESIKPIPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLL 265

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
             I  +RK +  PVID I+  T++++   E    +RG F W + ++   +P   AK+   
Sbjct: 266 TRIKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPSSMAKQHLL 322

Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
           + + P +SPT AGGLF++DR +F ELG YDPG+ +WGGEN E+SF+IW CGG +E +PCS
Sbjct: 323 DPTGPIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCS 382

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
            +GHV+R   P++F     +  G ++  N  RV E W DE  K YFY   P A
Sbjct: 383 HVGHVFRKSSPHDF---PGKSSGKVLNANLLRVAEVWMDE-WKYYFYKIAPQA 431


>gi|410909548|ref|XP_003968252.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Takifugu rubripes]
          Length = 580

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 148/339 (43%), Positives = 201/339 (59%), Gaps = 11/339 (3%)

Query: 35  EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEG 94
           EA +   D+    +  N+  S  +   R +PD R  +C+   YP DLP ASV++ F NE 
Sbjct: 77  EADQQLRDSGYHRHAFNLLISTRLGPHRDLPDTRDPQCRDRIYPRDLPPASVVICFFNEA 136

Query: 95  FSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTERE 153
            S+L+RTVHS++ RT    L EIILVDD+S   +L   L+ Y+Q    GKV+++RN  RE
Sbjct: 137 LSALLRTVHSVLDRTAPFLLHEIILVDDYSELEELKGDLDRYVQAELQGKVKVLRNQRRE 196

Query: 154 GLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWE 213
           GLIR R  GA  + G+V+VFLD+HCEV   WL PLLA I+ DR+ +  PVID I   T  
Sbjct: 197 GLIRGRMIGAAHASGQVLVFLDSHCEVNQMWLEPLLASIHEDRRTVVCPVIDIISADTLS 256

Query: 214 FRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLE 273
               Y P    RG F WG+ +K + +P  E K  K   +P +SPT AGGLFA++R +F E
Sbjct: 257 ----YSPSPIVRGGFNWGLHFKWDPVPPSELKSPKGPVDPIRSPTMAGGLFAINRKYFNE 312

Query: 274 LGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPL 333
           +G YD G+ +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY      D      
Sbjct: 313 MGQYDAGMDIWGGENLEISFRIWMCGGQLLIIPCSRVGHIFRKRRPYGSPGGQD-----T 367

Query: 334 ITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           + +N  R+   W DE  + Y   R P     D GDIS++
Sbjct: 368 MAHNSLRLAHVWMDEYKEQYLSMR-PELRERDYGDISDR 405


>gi|410897066|ref|XP_003962020.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Takifugu rubripes]
          Length = 600

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 163/353 (46%), Positives = 215/353 (60%), Gaps = 14/353 (3%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
           G+ G+A H+  +  A    S  E   N+  SN I  DR IPD R E C       DLP  
Sbjct: 101 GQFGQAVHVSSSEDALVRKSWDEGFFNVYLSNQIPLDRAIPDTRPESCAQTLVHDDLPST 160

Query: 85  SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 144
           SVI  F +E +S+L+R+VHS++ R+P   LEEIILVDDFS+K  L   L+ Y+ +F  KV
Sbjct: 161 SVIFCFVDEVWSTLLRSVHSVLNRSPPHLLEEIILVDDFSTKEYLKAPLDKYMSQF-PKV 219

Query: 145 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 204
           R+IR  ER+GLIR R  GA  ++GEV+ FLD+H E  + WL PLL  IY DR+ +  PVI
Sbjct: 220 RIIRLRERQGLIRARLAGAAAAKGEVLTFLDSHVECNVGWLEPLLERIYMDRRKVPCPVI 279

Query: 205 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGL 263
           + I+ +   +  V   D+  RGIF W +++  + LPE   KK     S+P + P  AGGL
Sbjct: 280 EVINDKDMSYMLV---DNFQRGIFRWPLVFGWSPLPEAYIKKHNLTISDPIRCPVMAGGL 336

Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
           F++D+ +F ELG YD GL VWGGEN E+SFKIWMCGG IE +PCSR+GH++R   PY F 
Sbjct: 337 FSIDKKYFYELGAYDSGLDVWGGENMEISFKIWMCGGEIEIIPCSRVGHIFRGQNPYKFP 396

Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMF----LDMGDISEQ 372
           K  DR K   +  N  RV E W DE +K  FY      +     +D+GD+SEQ
Sbjct: 397 K--DRQKT--VERNLARVAEVWLDE-YKDLFYGHGYHHLLDKSVIDIGDLSEQ 444


>gi|390364218|ref|XP_793815.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like,
           partial [Strongylocentrotus purpuratus]
          Length = 531

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 145/304 (47%), Positives = 195/304 (64%), Gaps = 13/304 (4%)

Query: 72  CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
           CK   YP DLP  S+I+ FHNE +S+L+RT++SII R+P + ++EIIL+DD S+   L +
Sbjct: 85  CKNISYPHDLPSTSIIICFHNEAWSTLLRTLNSIIDRSPLRLIKEIILLDDASTMEHLQE 144

Query: 132 KLEDYIQRFNG-KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
            +EDYI + +  ++R++R  +R GLI+ R  G   S GE   FLD+H EV + WL PLLA
Sbjct: 145 PIEDYISQIHSVRIRMVRAEKRLGLIKARMMGVDASEGETFTFLDSHVEVMIGWLEPLLA 204

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
            + SDR I+ +PV+D I+  T+ +  V EP    RG F W   Y+   +P  +  KR   
Sbjct: 205 RLASDRTIVVMPVVDEINKDTFNYNVVPEPLQ--RGGFNWRFEYRWKPIPNYD--KRPSK 260

Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
             P KSP   GGL  MDR+FFLELGG+D G+ VWGGEN E S KIWMCGGSIE +PCSR+
Sbjct: 261 VAPIKSPAMPGGLLTMDRSFFLELGGFDLGMEVWGGENLETSLKIWMCGGSIEIIPCSRV 320

Query: 311 GHVYRSFMPYNFGKLADRVKGPL--ITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
           GHVYR   PY+F       + PL  + +N  RV+E W DE HK +FY R P+    D GD
Sbjct: 321 GHVYRDTSPYSFLG-----QNPLDIVEHNAMRVVEVWTDE-HKHHFYDRLPMLKNRDFGD 374

Query: 369 ISEQ 372
           +S++
Sbjct: 375 VSKR 378


>gi|189237799|ref|XP_001814012.1| PREDICTED: similar to N-acetylgalactosaminyltransferase [Tribolium
           castaneum]
 gi|270008127|gb|EFA04575.1| PNR-like protein [Tribolium castaneum]
          Length = 614

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 151/366 (41%), Positives = 213/366 (58%), Gaps = 13/366 (3%)

Query: 8   GKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL 67
            KL  + P L   K    + G   ++ +  +   D    ++  N+  S  +S+ R +PD 
Sbjct: 81  NKLQPVYPKLSTDKNELSQLGLVKNIDDQRKK--DEGYKKHAYNVLISERLSYHRDVPDT 138

Query: 68  RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           R E CK   Y  DLP A++I+ F+NE + +L+RTVHSII RTPA  L+EI+LVDDFS   
Sbjct: 139 RNELCKNISYSADLPTAAIIICFYNEHYYTLLRTVHSIIDRTPASVLKEILLVDDFSDLE 198

Query: 128 DLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
           +L + L  YI + F+ +V+LI+   REGLIR R  GA+ ++ +VI+FLD+H EV + W+ 
Sbjct: 199 NLHENLSTYITKNFDDRVKLIKTERREGLIRARLFGARRTKQDVIIFLDSHIEVNVGWIE 258

Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           PLL  I  +   + +PVID I+  T+     Y      RG F WG+ +K   LP+     
Sbjct: 259 PLLQRIKDNYTNVAMPVIDIINADTF----AYTASPLVRGGFNWGLHFKWENLPKGTLST 314

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           +    +P KSPT AGGLFAM R +F +LG YD G+ +WGGEN E+SF+IWMCGG +E +P
Sbjct: 315 KMDFIKPIKSPTMAGGLFAMSRKYFTDLGEYDAGMNIWGGENLEISFRIWMCGGRLELIP 374

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDM 366
           CSR+GHV+R   PY      D      + +N  RV   W D  +K YF    P A  +D 
Sbjct: 375 CSRVGHVFRQRRPYGAPDGQD-----TMLHNSLRVANVWMDS-YKEYFLNHRPDAKRIDF 428

Query: 367 GDISEQ 372
           GD+S +
Sbjct: 429 GDVSSR 434


>gi|312377569|gb|EFR24376.1| hypothetical protein AND_11091 [Anopheles darlingi]
          Length = 1150

 Score =  281 bits (719), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 155/345 (44%), Positives = 210/345 (60%), Gaps = 14/345 (4%)

Query: 17  LEPYKEGPGEGGKAYHLPEAYRAAGDAS--LGEYGMNMETSNHISFDRTIPDLRMEECKY 74
           L+  + GPGE GKA  L  A   +        + G N   S+ IS +R+I DLR   CK 
Sbjct: 216 LDRERVGPGEQGKAATLSPAESDSEQRKKLYLQNGFNALLSDKISINRSIADLRHPSCKL 275

Query: 75  WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
             Y   LP ASV++ F+ E +S+L+RTVHS++ R+P+  L+E+I+VDD S+K  L  +L+
Sbjct: 276 QQYFKHLPTASVVVPFYEEHWSTLLRTVHSVLNRSPSHLLKEVIIVDDGSTKEFLHGQLQ 335

Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           +Y+ +   KV+LIR  ER GL++ R  GAK + G+V+VFLD+H E G NWLPPLL PI  
Sbjct: 336 NYVNQNLPKVKLIRQGERTGLMKARLAGAKLASGDVLVFLDSHTEAGYNWLPPLLEPIAE 395

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
           + K    P+ID ID QT+   +++  D   RG+F+W   YK   L E +   R   + P+
Sbjct: 396 NPKTCVCPLIDVIDDQTF---NIHPQDDGGRGLFDWRFHYKRLALKESD---RVSPTAPF 449

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
            SP  AGGLFA+   FF ELGGYD  L +WG E +ELSFKIW CGG +   PCSR  H+Y
Sbjct: 450 PSPVMAGGLFAIGTNFFWELGGYDEELDIWGAEQYELSFKIWQCGGRMLDAPCSRFSHIY 509

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
           RS+ P+   +  D      IT N+KRV E W DE +K Y Y R+P
Sbjct: 510 RSYSPFPNSRKYD-----FITRNHKRVAEIWMDE-YKQYIYDRDP 548



 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 136/301 (45%), Positives = 188/301 (62%), Gaps = 13/301 (4%)

Query: 72  CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
           C    Y   LP+ SVI+ F++E +S+L+RTV+S+++R+P+  L EIILVDD S K  L +
Sbjct: 675 CHNIKYLQHLPRTSVIIPFYDEHWSTLLRTVYSVMRRSPSSLLLEIILVDDGSMKNFLKE 734

Query: 132 KLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
           +L+ Y+       V++I    R GLI  R  GAK ++G+V+VFLD+H E G+NWLPPLL 
Sbjct: 735 QLDHYVATHLKHLVKIIHLPTRSGLITARLAGAKIAKGDVLVFLDSHVEAGINWLPPLLE 794

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
           PI  + +    P ID I   T+E       D   RG F+W MLYK   LP R  + +K  
Sbjct: 795 PIAHNPRTCVCPFIDVIMDDTFELTP---QDQGARGAFDWNMLYKR--LPLR-PEDQKDP 848

Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
           ++P++SP  AGGLFA+   FF ELGGYD  L +WG E +ELSFKIW CGG +   PCSR+
Sbjct: 849 TQPFESPVMAGGLFAISSMFFWELGGYDEMLEIWGAEQYELSFKIWQCGGRMIDAPCSRV 908

Query: 311 GHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           GH+YRS+ P+   K  D V       N+KRV E W DE +K Y Y ++P+   +D GD++
Sbjct: 909 GHIYRSYSPFPNVKSYDYV-----AKNHKRVAEVWMDE-YKKYVYRKDPMRFSIDAGDLT 962

Query: 371 E 371
           +
Sbjct: 963 K 963


>gi|312083982|ref|XP_003144087.1| polypeptide N-acetylgalactosaminyltransferase 5 [Loa loa]
 gi|307760750|gb|EFO19984.1| polypeptide N-acetylgalactosaminyltransferase 5 [Loa loa]
          Length = 682

 Score =  281 bits (719), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 157/360 (43%), Positives = 215/360 (59%), Gaps = 20/360 (5%)

Query: 20  YKEG----PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
           YK+G    PGE G     P+  R   D        N   SN IS  R++P+   E C+  
Sbjct: 166 YKQGDPNQPGEFGTGKLSPKE-RKLFDEGFKRNSFNEYVSNMISIHRSLPNNTDELCQKA 224

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
            Y  DLP  SVI+ FHNE +S L+RTVHS+++RTP   L+EIILVDDFS    L + LED
Sbjct: 225 SYRNDLPDTSVIICFHNEAWSVLLRTVHSVLERTPDHLLKEIILVDDFSDFDHLKKPLED 284

Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           Y+ +F GKVR+IR   R GLIR R +GA  + G+V+ +LD+HCE    WL PLL  I  +
Sbjct: 285 YMSQF-GKVRIIRLENRMGLIRARLKGASVATGKVLTYLDSHCECMNRWLEPLLDRIAQN 343

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYR---GIFEWGMLYKENELPEREAKKRKYNSE 252
              +  PVID I+ +T +    Y    H R   G F WG+++  + LP+R+ +  K   +
Sbjct: 344 STNVVTPVIDTINLETLQ----YHLSSHRRLSVGGFNWGLVFNWHILPDRDYQAMKSRID 399

Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
           P  SPT AGGLF++DR +F +LGGYDPG  +WG EN E+SFKIWMCGG +E VPCS +GH
Sbjct: 400 PIPSPTMAGGLFSIDRGYFEKLGGYDPGFDIWGSENLEISFKIWMCGGRLEVVPCSHVGH 459

Query: 313 VYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           ++R   PY + K  +     ++  N  R+ E W D+ +K  +Y R    + +D GD+SE+
Sbjct: 460 IFRKKSPYKWRKGIN-----VLQRNNIRLAEVWLDD-YKEIYYNRINHKL-VDFGDVSER 512


>gi|170589103|ref|XP_001899313.1| glycosyl transferase, group 2 family protein [Brugia malayi]
 gi|158593526|gb|EDP32121.1| glycosyl transferase, group 2 family protein [Brugia malayi]
          Length = 636

 Score =  281 bits (719), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 154/359 (42%), Positives = 218/359 (60%), Gaps = 16/359 (4%)

Query: 18  EPYKEGPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
           +  ++G GEGGK   +    ++   D      G +   S+ I+ +R++ D+R   CK   
Sbjct: 111 DALRQGLGEGGKPVVVAISEFKKLRDDLYRINGYDAYISDLIALNRSVKDIRHSGCKNMV 170

Query: 77  YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
           Y   LP   V+  F+NE  S+L+R+V+S+I R+P   + EIILVDD S+KA L + LE++
Sbjct: 171 YLEKLPTVGVVFPFYNEHNSTLLRSVYSVINRSPKDIMREIILVDDGSTKAFLKEPLEEF 230

Query: 137 IQR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           +++   N  V++IR  +REGLIR R RGA+    +VIVFLDAH EV  NWLPPL+ PI  
Sbjct: 231 LKKAGLNHIVKVIRTEKREGLIRARQRGARHITADVIVFLDAHSEVNYNWLPPLVEPIAL 290

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
           D K++  P ID ID  T+E+R+    D   RG F+W   YK   L E     +K  + P+
Sbjct: 291 DYKMVVCPFIDVIDCNTYEYRA---QDEGGRGSFDWEFNYKRLPLTE---DNKKNPTRPF 344

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
            SP  AGG FA+ R +F ELGGYD GL +WGGE +ELSFK+W C G++   PCSR+GH+Y
Sbjct: 345 HSPVMAGGYFAISRKWFWELGGYDEGLEIWGGEQYELSFKVWQCHGTMVDAPCSRVGHIY 404

Query: 315 RS-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           R  ++P++   + D      I+ NY+RV E W DE  K + Y R P  + +D GD+SEQ
Sbjct: 405 RCKYIPFSNPGIGD-----FISRNYRRVAEVWMDEYAK-FLYKRRPPLLTVDFGDLSEQ 457


>gi|341896063|gb|EGT51998.1| CBN-GLY-6 protein [Caenorhabditis brenneri]
          Length = 617

 Score =  281 bits (719), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 149/365 (40%), Positives = 224/365 (61%), Gaps = 13/365 (3%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
            NL  P + + EG   G    HL    +   D++      N+  S+ IS  R++P++R  
Sbjct: 89  ANLYSPHDDWGEG---GAGVSHLTPEQQKLADSTFAVNQFNLLVSDGISVRRSLPEIRKP 145

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
            C+   +P +LP  SVI+V+HNE +S+L+RTV S+I R+P + L+EIILVDDFS +  L 
Sbjct: 146 SCRNITFPDNLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLK 205

Query: 131 -QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
             KL++ ++     ++++R+ ER GLIR R  GA+E++G+V+ FLD+HCE    WL PLL
Sbjct: 206 YPKLDESLKPLPTDIKIVRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLL 265

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
             I  +RK +  PVID I+  T++++   E    +RG F W + ++   +P   AK+   
Sbjct: 266 TRIKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPSSMAKEHLL 322

Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
           + + P +SPT AGGLF++DR +F ELG YDPG+ +WGGEN E+SF+IW CGG +E +PCS
Sbjct: 323 DPTGPIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCS 382

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG- 367
            +GHV+R   P++F     +  G ++  N  RV E W DE  K YFY   P+A  +    
Sbjct: 383 HVGHVFRKSSPHDF---PGKSSGKILNANLLRVAEVWMDE-WKYYFYKLAPVAYRMRQSI 438

Query: 368 DISEQ 372
           D+SE+
Sbjct: 439 DVSER 443


>gi|195124241|ref|XP_002006602.1| GI18492 [Drosophila mojavensis]
 gi|193911670|gb|EDW10537.1| GI18492 [Drosophila mojavensis]
          Length = 670

 Score =  281 bits (719), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 160/372 (43%), Positives = 215/372 (57%), Gaps = 28/372 (7%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPE----AYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           L+PP    ++ PGE GK   LP+      + A D    +   N   S+ IS  R++PD R
Sbjct: 155 LDPPAANLEDSPGELGKPVILPKDMSPEMKKAVDDGWTKNAFNQYVSDLISVRRSLPDPR 214

Query: 69  MEECKYWD-YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
              CK    Y  +LPK  VI+ FHNE +S L+RTVHS++ R+P + + EIILVDDFS   
Sbjct: 215 DAWCKDSALYLSNLPKTDVIICFHNEAWSVLIRTVHSVLDRSPPELIGEIILVDDFSDMP 274

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L ++LEDY   +  KV+++R  +REGLIR R  GA+ ++  VI +LD+HCE    WL P
Sbjct: 275 HLKKQLEDYFASY-PKVKIVRGPQREGLIRARLLGAEYAKSPVITYLDSHCECAEGWLEP 333

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
           LL  I  +   +  PVID ID  T EF        HYR       G F+W + +  + +P
Sbjct: 334 LLDRIARNSTTVVCPVIDVIDDTTLEF--------HYRDSSGVNVGGFDWNLQFSWHAVP 385

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
           ERE K+    SEP  SPT AGGLF++DR FF  LG YD G  +WGGEN ELSFK WMCGG
Sbjct: 386 EREKKRHNSTSEPVYSPTMAGGLFSIDRKFFERLGTYDSGFDIWGGENLELSFKTWMCGG 445

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
           ++E VPCS +GH++R   PY +     R    ++  N  R+ E W D+  K Y+Y R  +
Sbjct: 446 TLEIVPCSHVGHIFRKRSPYKW-----RTGVNVLKKNSVRLAEVWMDDYAK-YYYQRIGM 499

Query: 361 AMFLDMGDISEQ 372
               D GD+SE+
Sbjct: 500 DKG-DFGDVSER 510


>gi|432882423|ref|XP_004074023.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Oryzias latipes]
          Length = 584

 Score =  281 bits (719), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 161/375 (42%), Positives = 223/375 (59%), Gaps = 27/375 (7%)

Query: 7   DGKLGN---LEPPLEPYKEGPGEGGKAYHL---PEAYRAAGDASLGEYGMNMETSNHISF 60
           DG L     ++PP  P    PGE G+A  L   PE  +   + S+  Y +N+  S+ IS 
Sbjct: 62  DGPLARALYIKPP--PDSSAPGEWGRATRLNLSPEE-KKLEEESVESYAINIFVSDKISL 118

Query: 61  DRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
            R I D RMEEC  K +DY   LP  SVI+ F+NE +S+L+RT+HS+++ TPA  L+EII
Sbjct: 119 HRHIQDNRMEECRNKKFDY-RHLPTTSVIIAFYNEAWSTLLRTIHSVLETTPAILLKEII 177

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 178
           L+DD+S +  L  +L +YI     +VRLIR  +REGL+R R  GA  + G+V+ FLD HC
Sbjct: 178 LIDDYSDRGYLKSQLAEYISNLQ-RVRLIRTNKREGLVRARLIGATYATGDVLTFLDCHC 236

Query: 179 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKEN 237
           E    W+ PLL  I  +   +  PVID ID+ ++EF     EP     G F+W + ++ +
Sbjct: 237 ECVPGWIEPLLERIAENASTIVCPVIDTIDWNSFEFYMQTGEP---MIGGFDWRLTFQWH 293

Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
            +PE E K+RK  ++P++SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W 
Sbjct: 294 SVPESERKRRKSRTDPFRSPTMAGGLFAVSKVYFEYLGTYDMGMEVWGGENLELSFRVWQ 353

Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           CGGS+E  PCS +GHV+    PY           P    N  R  E W D  +K +FY R
Sbjct: 354 CGGSLEIHPCSHVGHVFPKKAPY---------ARPNFLQNTVRAAEVWMD-SYKHHFYNR 403

Query: 358 EPLAMFLDMGDISEQ 372
            P A   + GDI+E+
Sbjct: 404 NPPAKKENYGDITER 418


>gi|332025155|gb|EGI65335.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Acromyrmex
           echinatior]
          Length = 605

 Score =  281 bits (718), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 145/350 (41%), Positives = 210/350 (60%), Gaps = 9/350 (2%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           PGE G A  +P    A           N+  S+ IS +R++ D+R+E CK   Y   LP 
Sbjct: 103 PGEMGAAVAIPPENDAKQQELFKLNQFNLMASDMISLNRSLKDIRLEGCKNKKYLKYLPD 162

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
            S+++VFHNE +++L+RTV S+I R+P   L+EIILVDD S +  L Q LEDY+      
Sbjct: 163 TSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASEREHLKQDLEDYVITLPVP 222

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
             + R  +R GLIR R  GAK  +G+VI FLDAHCE    WL PLL+ I +DR  +  P+
Sbjct: 223 TYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSRIANDRHTVVCPI 282

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
           ID I   T+E+ S    D  + G F W + ++   + +RE  +R  + + P ++PT AGG
Sbjct: 283 IDVISDDTFEYISA--SDMTWGG-FNWKLNFRWYRVAQREMDRRNSDRTAPLRTPTMAGG 339

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E  PCS +GHV+R   PY F
Sbjct: 340 LFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHVFRDKSPYTF 399

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                ++    + +N  RV E W DE  + ++Y   P A  +D+GD+SE+
Sbjct: 400 PGGVSKI----VLHNAARVAEVWMDE-WRDFYYAMNPGARNVDVGDVSER 444


>gi|383865231|ref|XP_003708078.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Megachile rotundata]
          Length = 605

 Score =  281 bits (718), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 147/353 (41%), Positives = 211/353 (59%), Gaps = 9/353 (2%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           K  PGE G A H+     A           N+  S+ IS +R++ D+R+E CK   YP  
Sbjct: 100 KGKPGEMGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLRDIRLEGCKTKKYPKY 159

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  S+++VFHNE +S+L+RTV S+I R+P   L+EIILVDD S +  L Q LEDY++  
Sbjct: 160 LPDTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTL 219

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
                + R  +R GLIR R  GAK  +G+VI FLDAHCE    WL PLLA I  +R  + 
Sbjct: 220 PVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIAENRSTVV 279

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
            P+ID I   T+E+  +   D  + G F W + ++   + +RE  +R  + + P ++PT 
Sbjct: 280 CPIIDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDRTAPLRTPTM 336

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E  PCS +GHV+R   P
Sbjct: 337 AGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHVFRDKSP 396

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           Y F     +V    + +N  RV E W DE  + ++Y   P A  + +GD+SE+
Sbjct: 397 YTFPGGVSKV----VLHNAARVAEVWMDE-WRDFYYAMNPGARNVAVGDVSER 444


>gi|170056941|ref|XP_001864259.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
 gi|167876546|gb|EDS39929.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
          Length = 606

 Score =  281 bits (718), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 152/356 (42%), Positives = 210/356 (58%), Gaps = 13/356 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK  HL +      D    + G N   S+ IS +R++PD+R   CK   Y
Sbjct: 79  ERSRSGVGEHGKPGHLEKKDEEMQDKLFKKNGFNAVLSDLISLNRSLPDIRHPGCKKKKY 138

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             +LP  SV++ F+NE +S+L+RT  S++ R+P + + EIILVDD S+K  L  +L+ Y+
Sbjct: 139 LSELPTVSVVVPFYNEHWSTLLRTASSVLLRSPPELISEIILVDDCSTKEFLKDQLDRYV 198

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
                KV++I   ER GLI  R  GAK +  +V++FLD+H E  +NWLPPLL PI  D +
Sbjct: 199 AENMPKVKVIHLPERSGLITARLAGAKAATADVLIFLDSHTEANVNWLPPLLEPIAEDYR 258

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
               P ID + + T+E+R+    D   RG F+W   YK   L  ++       +EP++SP
Sbjct: 259 TCVCPFIDVVAWDTFEYRA---QDEGARGAFDWKFYYKRLPLLPKDLAN---PTEPFESP 312

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+   FF ELGGYD GL +WGGE +ELSFKIW CGG +   PCSR+GH+YR +
Sbjct: 313 IMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRGY 372

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMGDISEQ 372
            P+   +  D      +T NYKRV E W DE +K Y Y R+       D GD+S+Q
Sbjct: 373 APFGNPRKKD-----FLTRNYKRVAEVWMDE-YKEYLYVRDRKKYDNTDAGDLSKQ 422


>gi|170039452|ref|XP_001847548.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
 gi|167863025|gb|EDS26408.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
          Length = 606

 Score =  281 bits (718), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 152/356 (42%), Positives = 210/356 (58%), Gaps = 13/356 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE GK  HL +      D    + G N   S+ IS +R++PD+R   CK   Y
Sbjct: 79  ERSRSGVGEHGKPGHLEKKDEEMQDKLFKKNGFNAVLSDLISLNRSLPDIRHPGCKKKKY 138

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             +LP  SV++ F+NE +S+L+RT  S++ R+P + + EIILVDD S+K  L  +L+ Y+
Sbjct: 139 LSELPTVSVVVPFYNEHWSTLLRTASSVLLRSPPELISEIILVDDCSTKEFLKDQLDRYV 198

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
                KV++I   ER GLI  R  GAK +  +V++FLD+H E  +NWLPPLL PI  D +
Sbjct: 199 AENMPKVKVIHLPERSGLITARLAGAKAATADVLIFLDSHTEANVNWLPPLLEPIAEDYR 258

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
               P ID + + T+E+R+    D   RG F+W   YK   L  ++       +EP++SP
Sbjct: 259 TCVCPFIDVVAWDTFEYRA---QDEGARGAFDWKFYYKRLPLLPKDLAN---PTEPFESP 312

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
             AGGLFA+   FF ELGGYD GL +WGGE +ELSFKIW CGG +   PCSR+GH+YR +
Sbjct: 313 IMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHIYRGY 372

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMGDISEQ 372
            P+   +  D      +T NYKRV E W DE +K Y Y R+       D GD+S+Q
Sbjct: 373 APFGNPRKKD-----FLTRNYKRVAEVWMDE-YKEYLYVRDRKKYDNTDAGDLSKQ 422


>gi|118404432|ref|NP_001072705.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Xenopus
           (Silurana) tropicalis]
 gi|115313486|gb|AAI24052.1| polypeptide N-acetylgalactosaminyltransferase 4 [Xenopus (Silurana)
           tropicalis]
 gi|134026084|gb|AAI35912.1| polypeptide N-acetylgalactosaminyltransferase 4 [Xenopus (Silurana)
           tropicalis]
          Length = 582

 Score =  281 bits (718), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 161/375 (42%), Positives = 219/375 (58%), Gaps = 26/375 (6%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLP--EAYRAAGDASLGEYGMNMETSNHI 58
           +PV+K        +PP +P    PGE GKA  L      +   D S+ +Y +N+  S+ I
Sbjct: 65  QPVYK--------KPPPDP--NMPGEWGKAARLELGPTEKKMQDESIEKYALNIYLSDQI 114

Query: 59  SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           S  R I D RM ECK   +    LP  SV++ F+NE  S+L+RT+HS+++ +PA  L EI
Sbjct: 115 SLHRHIMDNRMYECKSKTFNYRKLPTTSVVIAFYNEALSTLLRTIHSVLETSPAVLLREI 174

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
           ILVDDFS K  L  +LEDYI   + +VRLIR T+REGL+R R  GA  + G+V+ FLD H
Sbjct: 175 ILVDDFSDKVYLKSQLEDYIGGLD-RVRLIRTTKREGLVRARIIGATYAIGDVLTFLDCH 233

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKEN 237
           CE    WL PLL  I  +   +  PVID ID+ T+EF    +      G F+W + ++ +
Sbjct: 234 CECISGWLEPLLQRIGENETAVVCPVIDTIDWNTFEF--YMQTGEPMIGGFDWRLTFQWH 291

Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
            +PE+E ++RK   +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W 
Sbjct: 292 AVPEKERQRRKSRIDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMEVWGGENLELSFRVWQ 351

Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           CGG++E  PCS +GHV+    PY           P    N  R  E W D  +K  FY R
Sbjct: 352 CGGTLEIEPCSHVGHVFPKKAPY---------ARPNFLQNTARAAEVWMD-GYKELFYNR 401

Query: 358 EPLAMFLDMGDISEQ 372
            P A   + GDISE+
Sbjct: 402 NPPARKENYGDISER 416


>gi|427789023|gb|JAA59963.1| Putative polypeptide n-acetylgalactosaminyltransferase
           [Rhipicephalus pulchellus]
          Length = 648

 Score =  280 bits (717), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 165/380 (43%), Positives = 222/380 (58%), Gaps = 28/380 (7%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYR---AAGDASLGEYGMNMETSNHIS 59
           V  A   +G L PP  P  +GPGE G+   L +  +   A           N   S+ IS
Sbjct: 120 VDHAPAPVGVLAPPQNP--DGPGEMGRPVVLKDLTKEQEAKVKQGWDRNAFNQYISDMIS 177

Query: 60  FDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
             R++PD+R  ECK   Y  DLP  SVI+ FHNE +S L+RTVHSII R+P + L EIIL
Sbjct: 178 LHRSLPDVRDSECKDERYLKDLPSTSVIVCFHNEAWSVLLRTVHSIIDRSPPKLLHEIIL 237

Query: 120 VDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
           VDD+S    L QKLEDY+  F  KV+++R  +REGLIR R  GA  +   V+ +LD+HCE
Sbjct: 238 VDDYSDMPHLKQKLEDYVAHFP-KVKIVRAQKREGLIRARLLGAAAATAPVLTYLDSHCE 296

Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGM 232
               WL PLL  I  +   +  PVID I   T+E+        HYR       G F+W +
Sbjct: 297 CTEGWLEPLLDRIARNSTTVVCPVIDVISDSTFEY--------HYRDSGGVNVGGFDWNL 348

Query: 233 LYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELS 292
            +  + +PERE ++RK++ +P  SPT AGGLF++D+AFF +LG YD G  +WGGEN ELS
Sbjct: 349 QFSWHAVPERERQRRKHSWDPVWSPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELS 408

Query: 293 FKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKA 352
           FK WMCGG++E VPCS +GH++R   PY +     R    ++  N  R+ E W DE +K 
Sbjct: 409 FKTWMCGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLRRNSVRLAEVWLDE-YKQ 462

Query: 353 YFYTREPLAMFLDMGDISEQ 372
           Y+Y R    +  D GD+S +
Sbjct: 463 YYYQRIGDDLG-DFGDVSAR 481


>gi|312068074|ref|XP_003137043.1| polypeptide N-acetylgalactosaminyltransferase [Loa loa]
          Length = 547

 Score =  280 bits (717), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 142/340 (41%), Positives = 209/340 (61%), Gaps = 10/340 (2%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLD 80
           G GE G+   L EA     D +      N+  S+ I+ +R++PD+R  +C+   Y    +
Sbjct: 73  GAGEDGRPVKLSEADERLSDDTFAINQFNLVVSDRIALNRSLPDIRKHQCRAKTYLPSSE 132

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  SVI+V+HNE FS+LMRTV S+I R+P + L+EIILVDDFS++  L  +L++++ + 
Sbjct: 133 LPTTSVIIVYHNEAFSTLMRTVMSVILRSPHENLKEIILVDDFSTRTFLKAELDNFVAQL 192

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
              +++IR  ER GLIR R  GA E++G+V+ FLD+HCE    W+ PLLA I  +RK + 
Sbjct: 193 GTHIKVIRANERVGLIRARLIGATEAKGDVLTFLDSHCECTKGWMEPLLARIKENRKAVV 252

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
            PVID I+ +T+ ++   E    +RG F W + ++   LP    K R  + ++P  SPT 
Sbjct: 253 CPVIDVINERTFAYQKGIEL---FRGGFNWNLQFRWYALPPEMIKSRSNDPTKPIISPTM 309

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLF++DR +F E+G YD  + +WGGEN E+S ++W CGG IE +PCS +GHV+R   P
Sbjct: 310 AGGLFSIDRKYFEEIGTYDHEMNIWGGENIEISLRVWQCGGRIEILPCSHVGHVFRRASP 369

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
           ++F        G ++  N  RV E W DE  K +FY   P
Sbjct: 370 HDF---PSHKSGTILNSNLLRVAEVWMDE-WKFHFYRTAP 405


>gi|194749276|ref|XP_001957065.1| GF24250 [Drosophila ananassae]
 gi|190624347|gb|EDV39871.1| GF24250 [Drosophila ananassae]
          Length = 662

 Score =  280 bits (717), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 156/353 (44%), Positives = 213/353 (60%), Gaps = 15/353 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLG-EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           G GE G+A  L +  +   +  +  E G N   S+ IS +R++ D+R ++C+  +Y   L
Sbjct: 138 GLGEQGQAASLDDESQIETEKRMSLENGFNALLSDSISVNRSLNDIRHKQCRKKEYLTQL 197

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI++F NE  S LMR+VHS+I R+P + L+EIILVDD+S +  L   LE YI    
Sbjct: 198 PTVSVIIIFWNEYLSVLMRSVHSLINRSPPELLKEIILVDDYSDREYLGHDLEAYIANHF 257

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
             VR++R   R GLI  RS GA+ +  EV++FLD+H E   NWLPPLL PI  +++    
Sbjct: 258 KIVRVVRLPRRTGLIGARSEGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 317

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHA 260
           P ID ID+  +++R+    D   RG F+W   YK    LPE      K+ ++P+KSP  A
Sbjct: 318 PFIDVIDHSNFQYRA---QDEGARGAFDWEFYYKRLRLLPE----DLKHPADPFKSPVMA 370

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+   FF ELGGYD GL +WGGE +ELSFKIWMCGG +   PCSRIGH+YR    +
Sbjct: 371 GGLFAISAEFFWELGGYDEGLDIWGGEQYELSFKIWMCGGQMYDAPCSRIGHIYRGPRNH 430

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
           N        KG  +  NYKRV E W DE +K Y Y+  + +   +D GD++ Q
Sbjct: 431 N----PSPRKGDYLHRNYKRVAEVWMDE-YKNYLYSHGDGIYERVDAGDLTAQ 478


>gi|291243602|ref|XP_002741690.1| PREDICTED: polypeptide GalNAc transferase 5-like [Saccoglossus
           kowalevskii]
          Length = 753

 Score =  280 bits (717), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 153/372 (41%), Positives = 225/372 (60%), Gaps = 14/372 (3%)

Query: 11  GNLEP-----PLEP--YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRT 63
           GN+ P     PLE   Y + PGEGG    L         A+  ++  N+  S+ I+ +RT
Sbjct: 221 GNILPQLGHRPLEQPWYPDSPGEGGMPVDLTPQEARLSKATFYQFEFNIIASDKIALNRT 280

Query: 64  IPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 123
           +PD R   C++ +YP  LPK SVI+VFHNE +++L+RTV S+I R+P Q LEEI+LVDD 
Sbjct: 281 LPDSRPVACEHREYPHILPKTSVIIVFHNEAWTTLLRTVISVIDRSPWQLLEEILLVDDA 340

Query: 124 SSKAD--LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 181
           S+     L  +L++Y+ +     R+IR  +R GLI+ R RG +E+RGEV+ FLD+HCE  
Sbjct: 341 STSEKYWLQSELDEYVAKLPVITRVIRTGKRVGLIQGRLRGVEEARGEVLTFLDSHCECN 400

Query: 182 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 241
           + WL PLL+ I +DR  +  P +D I  +T+ + +  +P+    G F W + +K   LP+
Sbjct: 401 IGWLEPLLSEIVNDRTTVVAPNLDVISDKTFGY-TFIKPEQTMIGGFGWLVDFKWYSLPK 459

Query: 242 REAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
           RE  +   + S P ++PT AGGLFA+D  +F  +G YDPG   WG EN ELSF++W CGG
Sbjct: 460 RERLRVNNDMSRPLRTPTIAGGLFAIDADYFHRIGLYDPGFDTWGAENLELSFRVWQCGG 519

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
           ++E VPCS +GHV+RS +PY +    ++  G  I  N  R+++ W D+  K +F    P 
Sbjct: 520 TLEIVPCSHVGHVFRSSIPYKYKD--NKNPGLTIAKNNMRLMDVWMDD-LKYFFLAILPH 576

Query: 361 AMFLDMGDISEQ 372
               + GD SE+
Sbjct: 577 YAEQEFGDTSER 588


>gi|350400046|ref|XP_003485719.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
           [Bombus impatiens]
          Length = 643

 Score =  280 bits (717), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 159/379 (41%), Positives = 224/379 (59%), Gaps = 23/379 (6%)

Query: 1   RPVFK-ADGKLGNLEP-PLEP---YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETS 55
           R  FK +D  L  L+P P++P     +G  E G   +  +  +   D     Y  N+  S
Sbjct: 90  RNAFKNSDKLLQQLQPVPVKPAVTLGQGLDELGMVKNFEDQRKR--DEGYKNYSFNILVS 147

Query: 56  NHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
           ++I   R +PD R + C+   Y   LP AS+++ F+NE + +L+R++HSII RTPA  L 
Sbjct: 148 DNIGLHRELPDTRHKLCEIQKYSSKLPNASIVICFYNEHYMTLLRSLHSIIDRTPASLLH 207

Query: 116 EIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
           EIILV+D+S    L +K+E YI   FNGKV+  +  +REGLIR R  GA+++ GE+++FL
Sbjct: 208 EIILVNDWSDSKALHEKIETYIANNFNGKVKFFKTEKREGLIRARMFGARKATGEILIFL 267

Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 234
           D+H EV   W+ PLL+ I   + I+ +PVID I+  T++    Y      RG F WG+ +
Sbjct: 268 DSHIEVNKRWIEPLLSQIAHSKTIIAMPVIDIINPDTFQ----YTGSPLVRGGFNWGLHF 323

Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
           K + +P       +   +P KSPT AGGLFAMDR +F +LG YD G+ +WGGEN E+SF+
Sbjct: 324 KWDNVPVGTFAHDEDFIKPIKSPTMAGGLFAMDRKYFTKLGEYDAGMDIWGGENLEISFR 383

Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAY 353
           IWMCGGSIE +PCSR+GHV+R   PY  F +    +K  L      RV   W DE +K Y
Sbjct: 384 IWMCGGSIELIPCSRVGHVFRRRRPYGTFDQHDTMLKNSL------RVAHVWLDE-YKDY 436

Query: 354 FYTREPLAMFLDMGDISEQ 372
           F         +D GDISE+
Sbjct: 437 FLKN---VQKVDYGDISER 452


>gi|390347269|ref|XP_781402.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Strongylocentrotus purpuratus]
          Length = 749

 Score =  280 bits (716), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 143/352 (40%), Positives = 211/352 (59%), Gaps = 12/352 (3%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           GPGE G         +A           N   S+ IS +R++PD+R   CK  +Y  DLP
Sbjct: 251 GPGEHGAGVRTKLEEQAKVKIGWDHAYFNEYVSDMISVERSVPDVRHNLCKTKEYSDDLP 310

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
           + SVI+ F  E +S+L+RTVHS++ R+P + + E++LVDDFS +  L + L++Y+++   
Sbjct: 311 RTSVIICFTEESWSTLLRTVHSVLNRSPPELIAEVLLVDDFSQRDYLKEPLDEYMKKL-P 369

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
           KV+++R  +REGLIR R  GA+ ++G V+ FLD+H E  + WL PLL  I+ D   +  P
Sbjct: 370 KVKVVRLPKREGLIRARLIGAEMAQGPVLTFLDSHVECNVGWLEPLLQRIHDDPTNVVCP 429

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
            ID ID  ++E+           G F W M +  N +PE EA++R   S P +SP  AGG
Sbjct: 430 AIDAIDATSFEYAG---SGATIIGAFNWEMKFTWNGIPEYEARRRDDESWPIRSPAMAGG 486

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LF++D+ FF  +G YDPG  +WG EN ELSFKIWMCGGS+E +PCSR+ H++R   PY F
Sbjct: 487 LFSIDKDFFYRIGTYDPGFDIWGAENLELSFKIWMCGGSLEIIPCSRVAHIFRKQQPYKF 546

Query: 323 GKLADRVKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                   G + T+  N  R++  W DE ++  FY+ +P  M  + GD+S++
Sbjct: 547 P------DGNVKTFMRNTMRLVAVWVDEPYRDIFYSLKPQLMGQEYGDVSDR 592


>gi|405951291|gb|EKC19216.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Crassostrea
           gigas]
          Length = 613

 Score =  280 bits (716), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 148/335 (44%), Positives = 198/335 (59%), Gaps = 10/335 (2%)

Query: 38  RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSS 97
           + A D     +  N   S+ I F R IPD R  +C+   +P      S+I+ F NE  S+
Sbjct: 102 QIARDEGYQNFAFNALVSDKIGFHRAIPDTRYPKCQDVTFPAINLDTSIIVCFFNEQPSA 161

Query: 98  LMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIR 157
           L+R VHSI  +TP + ++EIILVDD S+  DL  ++E+Y+ +    VRL+R  EREGLIR
Sbjct: 162 LLRLVHSINDQTPQELVKEIILVDDSSTLDDLSCQIENYVNQHFNNVRLVRTPEREGLIR 221

Query: 158 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 217
            R  GA  + G+V+VFLD+HCEV  +WL PLL  I  D   + VPVID I++ T E    
Sbjct: 222 ARVFGANLASGQVLVFLDSHCEVNTDWLEPLLLRISHDPTTVVVPVIDIINHDTME---- 277

Query: 218 YEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGY 277
           Y+     RG F WG+ +  + LP+ E       S+P  SPT AGGLFAM R +F  LG Y
Sbjct: 278 YQQSPLVRGGFNWGLHFSWDRLPDNEKNDPDLGSKPILSPTMAGGLFAMKRDYFHHLGEY 337

Query: 278 DPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYN 337
           D G+ +WGGEN E+SF+IWMCGG +E +PCSR+GH++R   PY   K  D         N
Sbjct: 338 DLGMDIWGGENLEISFRIWMCGGKLEIIPCSRVGHIFRKRRPYGNPKGRDT-----FLKN 392

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
             RV   W D K+K YF  + P A  +D GDIS++
Sbjct: 393 SLRVANVWMD-KYKEYFLKQRPQAQVVDYGDISDR 426


>gi|156544564|ref|XP_001602677.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
           [Nasonia vitripennis]
          Length = 637

 Score =  280 bits (715), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 158/372 (42%), Positives = 220/372 (59%), Gaps = 20/372 (5%)

Query: 6   ADGKLGNLEP-PLEP---YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
           +D  L  L P P++P     +G  E G   ++ E  +   +     +  N+  S+++S  
Sbjct: 94  SDKLLQQLMPVPVKPSVTVGQGLDELGLVKNMDEQKKR--EEGYKSFAFNVLVSDNLSLH 151

Query: 62  RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
           R IPD R + CK   Y   LP AS+++ F+NE +++L+R+++SI+ RTP   L EIIL++
Sbjct: 152 RDIPDTRHKLCKNQTYDQKLPNASIVICFYNEHYNTLLRSLYSILDRTPKHLLHEIILIN 211

Query: 122 DFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           DFS    L +++ DY+ Q F+ KV+  R   REGLIR R  GAK++ GEV+VFLD+H EV
Sbjct: 212 DFSDSKSLHEQVRDYVKQNFDNKVKYYRTERREGLIRARMFGAKKATGEVLVFLDSHIEV 271

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
              WL PLLA I   R I+ +PVID I+  T+++ S        RG F WG+ +K + LP
Sbjct: 272 NKMWLEPLLARISHSRTIVPMPVIDIINADTFQYSS----SPLVRGGFNWGLHFKWDSLP 327

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
                  +   +P KSPT AGGLFAMDR +F ELG YD G+ VWGGEN E+SF+IWMCGG
Sbjct: 328 IGTLSLEQDFVKPIKSPTMAGGLFAMDRKYFFELGEYDAGMDVWGGENLEISFRIWMCGG 387

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
           SIE +PCSR+GHV+R   PY      D      +  N  RV   W D+ +K YF      
Sbjct: 388 SIELIPCSRVGHVFRRRRPYGGNDQQD-----TMLKNSLRVAYVWMDQ-YKKYFLKN--- 438

Query: 361 AMFLDMGDISEQ 372
              +D GDI+E+
Sbjct: 439 VKKIDYGDITER 450


>gi|393911417|gb|EFO27036.2| polypeptide N-acetylgalactosaminyltransferase [Loa loa]
          Length = 597

 Score =  280 bits (715), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 142/340 (41%), Positives = 209/340 (61%), Gaps = 10/340 (2%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLD 80
           G GE G+   L EA     D +      N+  S+ I+ +R++PD+R  +C+   Y    +
Sbjct: 62  GAGEDGRPVKLSEADERLSDDTFAINQFNLVVSDRIALNRSLPDIRKHQCRAKTYLPSSE 121

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  SVI+V+HNE FS+LMRTV S+I R+P + L+EIILVDDFS++  L  +L++++ + 
Sbjct: 122 LPTTSVIIVYHNEAFSTLMRTVMSVILRSPHENLKEIILVDDFSTRTFLKAELDNFVAQL 181

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
              +++IR  ER GLIR R  GA E++G+V+ FLD+HCE    W+ PLLA I  +RK + 
Sbjct: 182 GTHIKVIRANERVGLIRARLIGATEAKGDVLTFLDSHCECTKGWMEPLLARIKENRKAVV 241

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
            PVID I+ +T+ ++   E    +RG F W + ++   LP    K R  + ++P  SPT 
Sbjct: 242 CPVIDVINERTFAYQKGIEL---FRGGFNWNLQFRWYALPPEMIKSRSNDPTKPIISPTM 298

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLF++DR +F E+G YD  + +WGGEN E+S ++W CGG IE +PCS +GHV+R   P
Sbjct: 299 AGGLFSIDRKYFEEIGTYDHEMNIWGGENIEISLRVWQCGGRIEILPCSHVGHVFRRASP 358

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
           ++F        G ++  N  RV E W DE  K +FY   P
Sbjct: 359 HDF---PSHKSGTILNSNLLRVAEVWMDE-WKFHFYRTAP 394


>gi|256071383|ref|XP_002572020.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
          Length = 697

 Score =  280 bits (715), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 161/379 (42%), Positives = 221/379 (58%), Gaps = 19/379 (5%)

Query: 4   FKADGKLGNLEPPLEP-----YKEGPGEGGKAY-----HLPEAYRAAGDASLGEYGMNME 53
             A  KLG L P   P     Y  GPGEGGKAY      L  A +   D    +   N  
Sbjct: 163 LSAIAKLG-LSPSTPPPRSDEYSTGPGEGGKAYTINREDLSPAEQIIFDKGWEDNAYNQY 221

Query: 54  TSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 113
            S+ IS  R +PD R   CK   Y  +LP AS+I+ FHNE +S L+R+VHS+I R+P   
Sbjct: 222 ASDRISVRRYLPDYREGTCKVNQYGSNLPSASIIICFHNEAWSVLLRSVHSVIDRSPPNL 281

Query: 114 LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 173
           L+EIILVDDFS +  L + LE+Y+   N  V+++R  +REGLIR R  GA+ S G+V+VF
Sbjct: 282 LQEIILVDDFSDRPHLKEALEEYMGMLN-IVKIVRTKQREGLIRARMIGAELSTGKVLVF 340

Query: 174 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 233
           LD+H E    WL PLL  I  +  I+ VPVI  I+ +T +  +  + D+   G F+W + 
Sbjct: 341 LDSHIECTTGWLEPLLDRIAYNSSIVVVPVISTINDKTLKM-NFLKADNVQVGGFDWSLT 399

Query: 234 YKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSF 293
           ++ +E  ER+  +      P +SPT AGGLFA+ R +F  LG YD G+ +WGGEN ELSF
Sbjct: 400 FRWHEQTERDRNRSGAPYSPVRSPTMAGGLFAISREYFSHLGKYDSGMEIWGGENLELSF 459

Query: 294 KIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAY 353
           K+WMCGG +E V CS +GH++R   PY +      VK PL   N  R+ + W D+ +K +
Sbjct: 460 KVWMCGGILETVVCSLVGHIFRGRSPYKWNV---NVKDPL-KRNLLRLADVWLDD-YKRF 514

Query: 354 FYTREPLAMFLDMGDISEQ 372
           +Y R      +D GD+SE+
Sbjct: 515 YYARIGFKT-IDFGDVSER 532


>gi|242011902|ref|XP_002426682.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
 gi|212510853|gb|EEB13944.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
          Length = 605

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 140/323 (43%), Positives = 207/323 (64%), Gaps = 9/323 (2%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N+  S  IS +R++PD+R + CK   Y   LP  SV++VFHNE +S+L+RTV S+I R+P
Sbjct: 127 NLLASERISLNRSLPDVRAKGCKTKKYFELLPTTSVVIVFHNEAWSTLLRTVWSVINRSP 186

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD S +  L +KLE+Y++     V ++R  +R GLIR R  GAK  +G+V
Sbjct: 187 KPLIKEIILVDDASVQPHLGKKLENYVKTLPVPVTVLRTPKRSGLIRARLLGAKHVKGQV 246

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           I FLDAHCE    WL PLLA I  DRK +  P+ID I  +T+E+  +   D  + G F W
Sbjct: 247 ITFLDAHCECTEGWLEPLLARITEDRKTVVCPIIDVISDETFEY--ITASDTTWGG-FNW 303

Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            + ++   +P+RE  +R  + + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN 
Sbjct: 304 RLNFRWYRVPKREMDRRNNDKTVPIRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENL 363

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGG++E VPCS +GHV+R   PY F     ++    + +N  RV E W DE 
Sbjct: 364 EMSFRVWQCGGTLEIVPCSHVGHVFRDKSPYTFPGGVSQI----VLHNANRVAEVWMDE- 418

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
            + ++Y   P A  +++GDI+ +
Sbjct: 419 WRDFYYAMNPGAKKIEVGDITSR 441


>gi|195020976|ref|XP_001985304.1| GH16989 [Drosophila grimshawi]
 gi|193898786|gb|EDV97652.1| GH16989 [Drosophila grimshawi]
          Length = 682

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 161/355 (45%), Positives = 215/355 (60%), Gaps = 17/355 (4%)

Query: 23  GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           G GE GK   L  E+ R        E G N   S+ IS +R++PD+R  +C+   Y   L
Sbjct: 155 GIGEQGKIAKLDDESVRENEQKVSIENGFNGLLSDSISVNRSLPDIRHIDCRKKLYLRKL 214

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF- 140
           P  SV+++F +E  S LMR+VHS+I R+P + L+EIILVDDFS +A L+++LEDYI    
Sbjct: 215 PTVSVVIIFFDEYLSVLMRSVHSLINRSPPELLKEIILVDDFSDRAYLNKELEDYIVNHF 274

Query: 141 -NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
             G VR++R  +R GLI  RS GA+ +  +V++FLD+H E   NWLPPLL PI  +++  
Sbjct: 275 AVGLVRVVRLPQRTGLIGARSAGARNATADVLIFLDSHVEANYNWLPPLLEPIAINKRAA 334

Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKKRKYNSEPYKSPT 258
             P ID ID+  + +R+    D   RG F+W   YK   L PE      K+ +EP+KSP 
Sbjct: 335 VCPFIDVIDHSNFNYRA---QDEGARGGFDWQFFYKRLPLLPE----DLKHPTEPFKSPV 387

Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
            AGGLFA+   FF ELGGYD GL +WGGE +ELSFKIWMCGG +   PCSR+GH+YR   
Sbjct: 388 MAGGLFAISAEFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRVGHIYRG-- 445

Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
           P     +    KG  +  NYKRV E W DE +K Y Y   E +   +D GD++ Q
Sbjct: 446 PRK--SIPSPRKGDYLHKNYKRVAEVWMDE-YKNYLYANGEGIYERVDAGDLTAQ 497


>gi|350645519|emb|CCD59759.1| n-acetylgalactosaminyltransferase, putative [Schistosoma mansoni]
          Length = 654

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 161/379 (42%), Positives = 221/379 (58%), Gaps = 19/379 (5%)

Query: 4   FKADGKLGNLEPPLEP-----YKEGPGEGGKAY-----HLPEAYRAAGDASLGEYGMNME 53
             A  KLG L P   P     Y  GPGEGGKAY      L  A +   D    +   N  
Sbjct: 163 LSAIAKLG-LSPSTPPPRSDEYSTGPGEGGKAYTINREDLSPAEQIIFDKGWEDNAYNQY 221

Query: 54  TSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 113
            S+ IS  R +PD R   CK   Y  +LP AS+I+ FHNE +S L+R+VHS+I R+P   
Sbjct: 222 ASDRISVRRYLPDYREGTCKVNQYGSNLPSASIIICFHNEAWSVLLRSVHSVIDRSPPNL 281

Query: 114 LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 173
           L+EIILVDDFS +  L + LE+Y+   N  V+++R  +REGLIR R  GA+ S G+V+VF
Sbjct: 282 LQEIILVDDFSDRPHLKEALEEYMGMLN-IVKIVRTKQREGLIRARMIGAELSTGKVLVF 340

Query: 174 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 233
           LD+H E    WL PLL  I  +  I+ VPVI  I+ +T +  +  + D+   G F+W + 
Sbjct: 341 LDSHIECTTGWLEPLLDRIAYNSSIVVVPVISTINDKTLKM-NFLKADNVQVGGFDWSLT 399

Query: 234 YKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSF 293
           ++ +E  ER+  +      P +SPT AGGLFA+ R +F  LG YD G+ +WGGEN ELSF
Sbjct: 400 FRWHEQTERDRNRSGAPYSPVRSPTMAGGLFAISREYFSHLGKYDSGMEIWGGENLELSF 459

Query: 294 KIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAY 353
           K+WMCGG +E V CS +GH++R   PY +      VK PL   N  R+ + W D+ +K +
Sbjct: 460 KVWMCGGILETVVCSLVGHIFRGRSPYKWNV---NVKDPL-KRNLLRLADVWLDD-YKRF 514

Query: 354 FYTREPLAMFLDMGDISEQ 372
           +Y R      +D GD+SE+
Sbjct: 515 YYARIGFKT-IDFGDVSER 532


>gi|395539756|ref|XP_003771832.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 11 [Sarcophilus
           harrisii]
          Length = 970

 Score =  279 bits (714), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 147/332 (44%), Positives = 200/332 (60%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  N+  SN + + R +PD R  ECK   YP  LP AS+++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNLLISNRLGYHRDVPDTRNAECKEKSYPTGLPAASIVICFYNEAFSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS+I RTPA  L EIILVDD S   DL  +L+DY+Q++  GK++++RN + EGLI  R 
Sbjct: 171 VHSVIDRTPAHLLHEIILVDDNSEFDDLKGELDDYVQKYLPGKIQVVRNEKGEGLIXGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA    GEV+VFLD+HCEV   WL PLL PI+ D + +  PVID I   T     +Y  
Sbjct: 231 IGAAHGTGEVLVFLDSHCEVNKMWLQPLLVPIHEDHRTVVCPVIDIISADTL----MYSS 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
                G F W + +K + +P  +    +    P KSP  AGGLFAM+R +F ELG YD G
Sbjct: 287 SPIVCGGFNWDLHFKWDLVPFSKLGGPEGAIAPIKSPAMAGGLFAMNRHYFNELGQYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDT-----MTNNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L +    G+ISE+
Sbjct: 402 MAHVWLDEYKEQYFSLRPELKL-KSYGNISER 432



 Score =  273 bits (699), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 142/298 (47%), Positives = 190/298 (63%), Gaps = 11/298 (3%)

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
            YP  LP AS+++ F+NE FS+L+RTVHS+I RTPA  L EIILVDD S   DL  +L+D
Sbjct: 507 SYPTGLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDNSEFDDLKGELDD 566

Query: 136 YIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           Y+Q++  GK++++RN +REGLIR R  GA  + GEV+VFLD+HCEV   WL PLL PI+ 
Sbjct: 567 YVQKYLPGKIQVVRNEKREGLIRGRMIGAAHATGEVLVFLDSHCEVNKMWLQPLLVPIHE 626

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
           D + +  PVID I   T     +Y      RG F WG+ +K + +P  E    +    P 
Sbjct: 627 DHRTVVCPVIDIISADTL----MYSSSPIVRGGFNWGLHFKWDLVPFSELGGPEGAIAPI 682

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG +  +PCSR+GH++
Sbjct: 683 KSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIF 742

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           R   PY   +  D      +T+N  R+   W DE  + YF  R  L +    G+ISE+
Sbjct: 743 RKRRPYGSPEGQDT-----MTHNSLRLAHVWLDEYKEQYFSLRPELKL-KSYGNISER 794


>gi|410968681|ref|XP_003990830.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Felis
           catus]
          Length = 546

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 144/304 (47%), Positives = 195/304 (64%), Gaps = 9/304 (2%)

Query: 68  RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           R + CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD S + 
Sbjct: 91  RFDRCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERD 150

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L   LE+Y++     V++IR  ER GLIR R RGA  SRG+VI FLDAHCE  L WL P
Sbjct: 151 FLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASRGQVITFLDAHCECTLGWLEP 210

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 247
           LLA I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +R
Sbjct: 211 LLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRR 267

Query: 248 KYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           K + + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V 
Sbjct: 268 KGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVT 327

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDM 366
           CS +GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  + +D 
Sbjct: 328 CSHVGHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDY 382

Query: 367 GDIS 370
           GD+S
Sbjct: 383 GDVS 386


>gi|344276552|ref|XP_003410072.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Loxodonta africana]
          Length = 527

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 154/332 (46%), Positives = 205/332 (61%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  NM  SN + + R +PD R   CK   YPLDLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKEKSYPLDLPAASVVICFYNEAFSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           VHS+  RTPA  L EIILVDD S   DL  +L++Y+Q++  GK ++IRN +REGLIR R 
Sbjct: 171 VHSVTDRTPAHLLHEIILVDDDSDLDDLKGELDEYVQKYLPGKTKVIRNKKREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA ++ GEV+VFLD+HCEV   WL PLLA +  D   +  PVID I   T     +Y  
Sbjct: 231 IGAAQATGEVLVFLDSHCEVNEMWLQPLLAAVREDPHTVVCPVIDIISADTL----LYSS 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E    +  + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPIVRGGFNWGLHFKWDLVPFDELGGPEGATAPIKSPTMAGGLFAMNRHYFSELGQYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE  + YF  R  L      G+ISE+
Sbjct: 402 LAHVWLDEYKEQYFSLRPDLKT-RSYGNISER 432


>gi|348568069|ref|XP_003469821.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Cavia porcellus]
          Length = 608

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 152/332 (45%), Positives = 200/332 (60%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  N+  SN + + R +PD R   CK   YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNVLISNRLGYHRDVPDTRNAACKEQSYPADLPVASVVICFYNEAFSALLRT 170

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRS 160
           VHS++ RTPA  L EIILVDD S   DL  +L++Y+Q+    K+++IRN +REGLIR R 
Sbjct: 171 VHSVLDRTPAYLLHEIILVDDDSDFDDLKGELDEYVQKSLPTKIKVIRNAKREGLIRGRM 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + GEV+VFLD+HCEV   WL PLLA I  D   +  PVID I   T      Y  
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNEMWLQPLLATIRGDPHTVVCPVIDIISADTL----AYSS 286

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  E       + P KSPT AGGLFAM+R +F ELG YD G
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSELGGEDGATAPIKSPTMAGGLFAMNRQYFNELGQYDSG 346

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGG +  +PCSR+GH++R   PY   +  D      +T+N  R
Sbjct: 347 MDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQD-----TMTHNSLR 401

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W DE    YF  R  L      G+ISE+
Sbjct: 402 LAHVWLDEYKDQYFSLRPDLKT-KSYGNISER 432


>gi|308485401|ref|XP_003104899.1| CRE-GLY-5 protein [Caenorhabditis remanei]
 gi|308257220|gb|EFP01173.1| CRE-GLY-5 protein [Caenorhabditis remanei]
          Length = 685

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 166/407 (40%), Positives = 225/407 (55%), Gaps = 59/407 (14%)

Query: 1   RPVFKADGKLGNLEPPLEP-YKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGM 50
           +PVF  D        P +P YK+G     GE GKA       L    +A  D  +     
Sbjct: 95  KPVFMVD--------PNDPIYKKGDANQAGELGKAVVVDKTKLTSEQKAIYDKGMLNNAF 146

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ IS  RT+P     ECK   Y  +LP+ SVI+ FHNE +S L+RTVHS+++RTP
Sbjct: 147 NQYASDMISVHRTLPTNIDAECKVEKYNENLPRTSVIVCFHNEAWSVLLRTVHSVLERTP 206

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              LEEI+LVDDFS      + LE+Y+ +F GKV+++R  +REGLIR R RGA  + GEV
Sbjct: 207 EHLLEEIVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAIATGEV 266

Query: 171 IVFLDAHCE-----------------VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWE 213
           + +LD+HCE                     W+ PLL  I  D   +  PVID ID  T+E
Sbjct: 267 LTYLDSHCECMEGKETENRVRTRNKKCKKRWIEPLLDRIKRDPTTVVCPVIDVIDDNTFE 326

Query: 214 FRSVYEPDHHYR------GIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMD 267
           +       HH +      G F+WG+ +  + +PER+ K R    +P +SPT AGGLF++D
Sbjct: 327 Y-------HHSKAYFTSVGGFDWGLQFNWHSIPERDRKNRTRAIDPVRSPTMAGGLFSID 379

Query: 268 RAFFLELGGYDPGLLVWGGENFELSFK----IWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
           + +F +LG YDPG  +WGGEN ELSFK    IWMCGG++E VPCS +GHV+R   PY + 
Sbjct: 380 KKYFEKLGTYDPGFDIWGGENLELSFKVRKCIWMCGGTLEIVPCSHVGHVFRKRSPYKW- 438

Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
               R    ++  N  R+ E W D+ +K Y+Y R       D GD+S
Sbjct: 439 ----RTGVNVLKRNSIRLAEVWLDD-YKTYYYERIN-NQLGDFGDVS 479


>gi|432934421|ref|XP_004081934.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Oryzias latipes]
          Length = 758

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 153/352 (43%), Positives = 211/352 (59%), Gaps = 12/352 (3%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
           G+ G+   LP +          E   N+  S+ I  DR IPD R E C       DLP  
Sbjct: 257 GQFGRGVILPSSEDEEVRKRWDEGHFNVYLSDRIPVDRAIPDTRPEVCSQAVVHDDLPST 316

Query: 85  SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 144
           SVI  F +E +S+L+R+VHS++ R+P   L+EIILVDDFS+K  L + L+ Y+ +F  KV
Sbjct: 317 SVIFCFVDEVWSTLLRSVHSVLNRSPPHLLKEIILVDDFSTKDYLKEPLDKYMSQF-PKV 375

Query: 145 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 204
           R++R  ER+GLIR R  GA  + GEV+ FLD+H E  + WL PLL  IY DR+ +  PVI
Sbjct: 376 RIVRLKERQGLIRARLAGAAVATGEVLTFLDSHVECNVGWLEPLLERIYLDRRKVPCPVI 435

Query: 205 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGL 263
           + I+ +   +  +   D+  RGIF+W +++  N L E   +K     S+P + P  AGGL
Sbjct: 436 EVINDKDMSYMLI---DNFQRGIFKWPLVFGWNALSEDYIRKHNITVSDPIRCPVMAGGL 492

Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
           F++D+ +F ELG YDPGL VWGGEN E+SFKIWMCGG IE +PCSR+GH++R   PY+F 
Sbjct: 493 FSIDKKYFYELGTYDPGLDVWGGENMEISFKIWMCGGEIEIIPCSRVGHIFRGQNPYSFP 552

Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYF---YTREPLAMFLDMGDISEQ 372
           K  DR K   +  N  RV E W DE    ++   Y         D+G+++EQ
Sbjct: 553 K--DRQKT--VERNLARVAEVWLDEYKDLFYGHGYQHLLDKSVTDIGNLTEQ 600


>gi|340727930|ref|XP_003402286.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
           [Bombus terrestris]
          Length = 643

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 156/373 (41%), Positives = 221/373 (59%), Gaps = 22/373 (5%)

Query: 6   ADGKLGNLEP-PLEP---YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 61
           +D  L  L+P P++P     +G  E G   +  +  +   D     Y  N+  S++I   
Sbjct: 96  SDKLLQQLQPVPVKPAVTLGQGLDELGMVKNFEDQRKR--DEGYKNYSFNILVSDNIGLH 153

Query: 62  RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
           R IPD R + C+   Y   LP AS+++ F+NE + +L+R++HSII RTPA  L EIILV+
Sbjct: 154 REIPDTRHKLCEIQKYSSKLPNASIVICFYNEHYMTLLRSLHSIIDRTPASLLHEIILVN 213

Query: 122 DFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           D+S    L +K++ YI   FNGKV+  +  +REGLIR R  GA+++ GEV++FLD+H EV
Sbjct: 214 DWSDSKALHEKIKTYIVNNFNGKVKFYKTEKREGLIRARMFGARKATGEVLIFLDSHIEV 273

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
              W+ PLL+ I   + I+ +P+ID I+  T++    Y      RG F WG+ +K + +P
Sbjct: 274 NKRWIEPLLSQIAQSKTIVAMPIIDIINPDTFQ----YTGSPLVRGGFNWGLHFKWDNVP 329

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
                  +   +P KSPT AGGLFAMDR +F +LG YD G+ +WGGEN E+SF+IWMCGG
Sbjct: 330 VGTFAHDEDFIKPIKSPTMAGGLFAMDRKYFTKLGEYDAGMDIWGGENLEISFRIWMCGG 389

Query: 301 SIEWVPCSRIGHVYRSFMPY-NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
           SIE +PCSR+GHV+R   PY  F +    +K  L      RV   W DE +K YF     
Sbjct: 390 SIELIPCSRVGHVFRRRRPYGTFDQHDTMLKNSL------RVAHVWLDE-YKDYFLKN-- 440

Query: 360 LAMFLDMGDISEQ 372
               +D GDISE+
Sbjct: 441 -VQKVDYGDISER 452


>gi|307186272|gb|EFN71935.1| Polypeptide N-acetylgalactosaminyltransferase 35A [Camponotus
           floridanus]
          Length = 667

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 155/368 (42%), Positives = 221/368 (60%), Gaps = 20/368 (5%)

Query: 10  LGNLEP-PLEP---YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
           L  L+P P++P     +G  E G   ++ +  +         Y  N+  S+++   R +P
Sbjct: 124 LKQLQPAPVKPAVTLDQGLDELGMVKNMEDQQKRT--IGYKNYAFNVLISDNLGVRRNVP 181

Query: 66  DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
           D R + CK   Y  +LP AS+I+ F+NE +++L+R++HSI++RTPA  L EIILV+DFS 
Sbjct: 182 DTRHKLCKTQKYSSNLPNASIIICFYNEHYTTLLRSLHSILERTPAALLHEIILVNDFSD 241

Query: 126 KADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
              L +K+  YI+  F  KVRL +  +REGLIR R  GA+++ G+V++FLD+H EV   W
Sbjct: 242 SDILHEKIHAYIKNNFGAKVRLFKTKKREGLIRARVFGARKATGDVLIFLDSHIEVNEIW 301

Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREA 244
           + PLL+ I   + I+ +PVID I+  T++    Y      RG F WG+ +K + LP    
Sbjct: 302 IEPLLSRIAYSKTIVPMPVIDIINADTFQ----YTGSPLVRGGFNWGLHFKWDNLPIGTL 357

Query: 245 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
           K      +P KSPT AGGLFA+DR +F+++G YD G+ VWGGEN E+SF+IWMCGGSIE 
Sbjct: 358 KHENDFVKPIKSPTMAGGLFAIDREYFIKIGEYDTGMDVWGGENLEISFRIWMCGGSIEL 417

Query: 305 VPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
           +PCSR+GHV+R   PY      D      +  N  RV   W DE +K YF      A  +
Sbjct: 418 IPCSRVGHVFRRRRPYGSDDPHDT-----MLKNSLRVAHVWMDE-YKDYFLKN---AKAI 468

Query: 365 DMGDISEQ 372
           D GDISE+
Sbjct: 469 DYGDISER 476


>gi|426224267|ref|XP_004006295.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Ovis
           aries]
          Length = 582

 Score =  278 bits (712), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 153/352 (43%), Positives = 208/352 (59%), Gaps = 18/352 (5%)

Query: 25  GEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD-L 81
           GE GKA    L E+     +  +  Y +N+  S+ IS  R I D RM ECK   +    L
Sbjct: 79  GEWGKASKLQLSESELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSKKFNYRRL 138

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EIILVDD S +  L  +LE Y+   +
Sbjct: 139 PTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLEAYVSNLD 198

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
            +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE    WL PLL  I+ D  ++  
Sbjct: 199 -RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLERIHKDETVVIC 257

Query: 202 PVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
           PVID ID+ T+EF     EP     G F+W + ++ + +P+ E  +RK   EP++SPT A
Sbjct: 258 PVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKSRIEPFRSPTMA 314

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG +E  PCS +GHV+    PY
Sbjct: 315 GGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPKRAPY 374

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                      P    N  R  E W DE +K +FY R P A     GDISE+
Sbjct: 375 ---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDISER 416


>gi|348519900|ref|XP_003447467.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Oreochromis niloticus]
          Length = 777

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 146/306 (47%), Positives = 199/306 (65%), Gaps = 10/306 (3%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N+  S+ I  DR IPD R + C+      DLP  SVI  F +E +S+L+R+VHS++ R+P
Sbjct: 304 NVYLSDKIPVDRAIPDTRPQMCEQSLVHDDLPSTSVIFCFVDEVWSTLLRSVHSVLNRSP 363

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              L+EIILVDDFS+K  L ++L+DY+ +F  KVR++R  ER+GLIR R  GA  ++GEV
Sbjct: 364 PHLLKEIILVDDFSTKDYLKKQLDDYMAQF-PKVRIVRLKERQGLIRARLAGAAVAKGEV 422

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+H E  + WL PLL  +Y DRK +  PVI+ I  +   +  V   D+  RGIF+W
Sbjct: 423 LTFLDSHIECNVGWLEPLLERVYLDRKKVPCPVIEVISDKDMSYMMV---DNFQRGIFKW 479

Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++  + +P  + KK     S+P + P  AGGLF++D+ +F ELG YDPGL VWGGEN 
Sbjct: 480 PLVFGWSAVPPEDIKKFNLTISDPIRCPVMAGGLFSIDKQYFFELGTYDPGLDVWGGENM 539

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SFKIWMCGG IE +PCSR+GH++R   PY F K  DR K   +  N  RV E W DE 
Sbjct: 540 EISFKIWMCGGEIEIIPCSRVGHIFRGQNPYKFPK--DRQK--TVERNLARVAEVWLDE- 594

Query: 350 HKAYFY 355
           +K  FY
Sbjct: 595 YKDLFY 600


>gi|242020636|ref|XP_002430758.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
 gi|212515955|gb|EEB18020.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
          Length = 623

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 142/336 (42%), Positives = 208/336 (61%), Gaps = 11/336 (3%)

Query: 38  RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSS 97
           R   D     +  N+  S+ I   R +PD R   CK   Y  +LP ASVI+ F+NE F++
Sbjct: 114 RRKRDEGYKNFAFNILVSDAIGIHRELPDTRHNLCKKKKYSKNLPTASVIICFYNEHFTT 173

Query: 98  LMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLI 156
           L+R+++S+++RTP+  L+EIILV+DFS  A L + + +Y+   F  KV+L ++ +R GLI
Sbjct: 174 LLRSIYSVLERTPSYLLKEIILVNDFSDLAGLHRNISNYVNTNFTDKVKLFKSKKRLGLI 233

Query: 157 RTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRS 216
           R R  G++++ G+V+VFLD+H EV +NWL PLL+ I   +K + VP+ID I+  T++   
Sbjct: 234 RARIFGSRKASGDVLVFLDSHIEVNVNWLQPLLSRIVDSKKNVVVPIIDIINADTFK--- 290

Query: 217 VYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGG 276
            Y      RG F WG+ +K   LP+   K  +   +P  SPT AGGLFA++RA+F ELG 
Sbjct: 291 -YSSSPLVRGGFNWGLHFKWENLPKSTLKSNEDFVKPILSPTMAGGLFAINRAYFKELGE 349

Query: 277 YDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY 336
           YD G+ +WGGEN E+SF+IWMCGG++E +PCSR+GHV+R   PY      D      +  
Sbjct: 350 YDNGMNIWGGENLEISFRIWMCGGNLELIPCSRVGHVFRKRRPYGSPNGEDT-----MMR 404

Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           N  RV   W D+ +K +FY + P       GDIS++
Sbjct: 405 NSLRVANVWMDD-YKEFFYKQHPEGKTFPFGDISDR 439


>gi|380030098|ref|XP_003698695.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Apis florea]
          Length = 605

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 145/350 (41%), Positives = 209/350 (59%), Gaps = 9/350 (2%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           PGE G A H+     A           N+  S+ IS +R++ D+R+E CK   Y   LP 
Sbjct: 103 PGEMGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYSKYLPD 162

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
            S+++VFHNE +S+L+RTV S+I R+P   L+EIILVDD S +  L Q LE Y++R    
Sbjct: 163 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEHYVKRLPVP 222

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
             + R  +R GLIR R  GAK  +G+VI FLDAHCE    WL PLL+ I  DR  +  P+
Sbjct: 223 TYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSRIAEDRTTVVCPI 282

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
           ID I   T+E+  +   D  + G F W + ++   + +RE  +R  + + P ++PT AGG
Sbjct: 283 IDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDRTAPLRTPTMAGG 339

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E  PCS +GHV+R   PY F
Sbjct: 340 LFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHVFRDKSPYTF 399

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                +V    + +N  RV E W DE  + ++Y   P A  + +GD+SE+
Sbjct: 400 PGGVSKV----VLHNAARVAEVWMDE-WRDFYYAMNPGARNVAVGDVSER 444


>gi|312377724|gb|EFR24483.1| hypothetical protein AND_10876 [Anopheles darlingi]
          Length = 594

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 139/325 (42%), Positives = 201/325 (61%), Gaps = 10/325 (3%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           PGE GK   +P + +        E   N+  S+ I  +R++ D+R  +CK   YP  LP 
Sbjct: 92  PGEMGKPVKIPSSQQELMKEKFKENQFNLLASDMIWLNRSLTDVRHHDCKKKHYPAKLPT 151

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
            S+++VFHNE +S+L+RT+ S+I R+P   L+EIILVDD S +  L ++LEDY++     
Sbjct: 152 TSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASEREHLGRQLEDYVKTLPVS 211

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
             ++R  +R GLIR R  GAK  +G+VI FLDAHCE    WL PLLA I  DRK +  P+
Sbjct: 212 TIVLRTVKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIVLDRKTVVCPI 271

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
           ID I  +T+E+  V   D  + G F W + ++   +P RE ++R ++ + P ++PT AGG
Sbjct: 272 IDVISDETFEY--VTASDQTWGG-FNWKLNFRWYRVPAREMQRRNHDRTAPLRTPTMAGG 328

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG +E  PCS +GHV+R   PY F
Sbjct: 329 LFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIAPCSHVGHVFRDKSPYTF 388

Query: 323 -GKLADRVKGPLITYNYKRVIETWF 346
            G +A+     ++  N  RV E W 
Sbjct: 389 PGGVAN-----IVLKNAARVAEVWM 408


>gi|326508656|dbj|BAJ95850.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 637

 Score =  278 bits (710), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 141/351 (40%), Positives = 215/351 (61%), Gaps = 10/351 (2%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           PGEGG++  +PE  +        E   N+  S+ ++ +R+I D R   C+  ++P DLP 
Sbjct: 133 PGEGGRSVSIPENLKQEAKKRFPENQFNIVASDLMALNRSINDQRSSRCRSHEFPSDLPT 192

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD-LDQKLEDYIQRFNG 142
            S+++VFHNEG S+L+RT+ SI+ R+P ++++EII+VDD S   + L   LE +++    
Sbjct: 193 TSIVIVFHNEGNSTLLRTLTSIVMRSPTEFIQEIIMVDDASVDREYLKDILETFVKELPV 252

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
           +V +IRNT+R GL+++R +GA+++ G+ + FLDAH E    WL  LL  +  DR  +  P
Sbjct: 253 RVEIIRNTQRLGLMKSRLKGAEKATGDTLTFLDAHIECSPGWLEYLLYEVKKDRTAVVCP 312

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAG 261
           +ID I+    +F  +   D  + G F W + ++   +P RE  +R Y+ S P  SPT AG
Sbjct: 313 IIDVINDD--DFAYLTGSDMTWGG-FNWRLNFRWYPVPNREEVRRNYDHSLPLLSPTMAG 369

Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
           GLF +DR +F E+G YDPG+ VWGGEN E+SF++W CGG +   PCS +GHV+R   PY 
Sbjct: 370 GLFTIDRKYFYEIGAYDPGMEVWGGENLEMSFRVWQCGGKVLIHPCSHVGHVFRKQTPYT 429

Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           F        G +I +N KR++E W D K+K + Y   P    +D GD+SE+
Sbjct: 430 FPGGT----GKVIFHNNKRLVEVWLD-KYKDFVYAIMPELKNVDAGDVSER 475


>gi|440896822|gb|ELR48646.1| Polypeptide N-acetylgalactosaminyltransferase 4, partial [Bos
           grunniens mutus]
          Length = 566

 Score =  277 bits (709), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 155/363 (42%), Positives = 212/363 (58%), Gaps = 20/363 (5%)

Query: 14  EPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
           +PP + +    GE GKA    L E+     +  +  Y +N+  S+ IS  R I D RM E
Sbjct: 54  KPPADSH--ALGEWGKASKLQLSESELKQQEELIERYAINIYLSDRISLHRHIEDKRMYE 111

Query: 72  CKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
           CK   +    LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EIILVDD S +  L 
Sbjct: 112 CKSKKFNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLK 171

Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
            +LE Y+   + +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE    WL PLL 
Sbjct: 172 TQLETYVSNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLE 230

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
            I  D  ++  PVID ID+ T+EF     EP     G F+W + ++ + +P+ E  +RK 
Sbjct: 231 RIRKDETVVICPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKS 287

Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
             EP++SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG +E  PCS 
Sbjct: 288 RIEPFRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSH 347

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHV+    PY           P    N  R  E W DE +K +FY R P A     GDI
Sbjct: 348 VGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDI 397

Query: 370 SEQ 372
           SE+
Sbjct: 398 SER 400


>gi|391345232|ref|XP_003746894.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Metaseiulus occidentalis]
          Length = 585

 Score =  277 bits (709), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 150/326 (46%), Positives = 200/326 (61%), Gaps = 12/326 (3%)

Query: 47  EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
           ++  N   S  I   R +PD R   CK   Y  DLP+ASVI+ F+NE +S+L+RTV+S++
Sbjct: 94  QHAFNTLVSERIGLRRRVPDTRDALCKQQKYSKDLPRASVIICFYNEAWSTLIRTVNSVL 153

Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
            R+P+  L+EIILVDD S  A+L + L  ++Q+ + KVR+IR  EREGLIR R  GA  S
Sbjct: 154 DRSPSALLQEIILVDDLSDIAEL-EPLAGFVQK-HEKVRVIRTREREGLIRARMIGAHNS 211

Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
            G+V+VFLD+H EV   WL PLL PI  ++  +T PVID I+  T+E    Y P    +G
Sbjct: 212 TGDVLVFLDSHVEVNERWLQPLLVPIQQNQTTVTCPVIDIINADTFE----YSPSPLVKG 267

Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
            F WGM ++ + LP+   K  K    P  SPT AGGLFA+ +  F  LG YD G+ VWGG
Sbjct: 268 GFNWGMHFRWDNLPKGYFKSEKERIAPLPSPTMAGGLFAIHKDEFRRLGEYDWGMDVWGG 327

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
           EN ELSF+IWMCGGS++ +PCSR+GHV+R   PY      D      +  N  RV   W 
Sbjct: 328 ENLELSFRIWMCGGSLKIMPCSRVGHVFRKRRPYGASNGED-----TLAKNSLRVANVWM 382

Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
           D+ +K Y+Y   P    +D GDIS +
Sbjct: 383 DD-YKKYYYRMRPDLKDIDFGDISAR 407


>gi|157074156|ref|NP_001096791.1| polypeptide N-acetylgalactosaminyltransferase 4 [Bos taurus]
 gi|154426082|gb|AAI51594.1| GALNT4 protein [Bos taurus]
 gi|296487968|tpg|DAA30081.1| TPA: polypeptide N-acetylgalactosaminyltransferase 4 [Bos taurus]
          Length = 578

 Score =  277 bits (709), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 155/363 (42%), Positives = 212/363 (58%), Gaps = 20/363 (5%)

Query: 14  EPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
           +PP + +    GE GKA    L E+     +  +  Y +N+  S+ IS  R I D RM E
Sbjct: 66  KPPADSH--ALGEWGKASKLQLSESELKQQEELIERYAINIYLSDRISLHRHIEDKRMYE 123

Query: 72  CKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
           CK   +    LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EIILVDD S +  L 
Sbjct: 124 CKSKKFNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLK 183

Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
            +LE Y+   + +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE    WL PLL 
Sbjct: 184 TQLETYVSNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLE 242

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
            I  D  ++  PVID ID+ T+EF     EP     G F+W + ++ + +P+ E  +RK 
Sbjct: 243 RIRKDETVVICPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKS 299

Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
             EP++SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG +E  PCS 
Sbjct: 300 RIEPFRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSH 359

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHV+    PY           P    N  R  E W DE +K +FY R P A     GDI
Sbjct: 360 VGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDI 409

Query: 370 SEQ 372
           SE+
Sbjct: 410 SER 412


>gi|115497708|ref|NP_001069909.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
           5 [Bos taurus]
 gi|83405338|gb|AAI11261.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 5 [Bos taurus]
 gi|440895696|gb|ELR47826.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
           5 [Bos grunniens mutus]
          Length = 448

 Score =  277 bits (709), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 146/342 (42%), Positives = 203/342 (59%), Gaps = 16/342 (4%)

Query: 31  YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVF 90
           Y +PE            YG N   S ++   R +PD R   C+   YP  LP AS+I+ F
Sbjct: 93  YSIPEVIHG-----YSTYGFNSIISKNLGHYRNVPDTRNVMCQKKMYPAKLPTASIIICF 147

Query: 91  HNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNT 150
           HNE F++L RT+ SI+  T    LEEIILVDD S   DL +KL+ +++ F GK++LIRN 
Sbjct: 148 HNEEFNALFRTLSSIMTLTQQYILEEIILVDDMSDFDDLKEKLDYHLEIFRGKIKLIRNK 207

Query: 151 EREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQ 210
           +REGLIR R  GA  + G+V+VFLD+HCEV   WL PLL  I  D K++  P+ID IDY 
Sbjct: 208 KREGLIRARMTGASHASGDVLVFLDSHCEVNKVWLEPLLNAIAKDPKMVVCPLIDVIDYM 267

Query: 211 TWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAF 270
           T E    Y+P    RG F W + +K + +   E +  +  + P +SP  AGG+FA++R +
Sbjct: 268 TLE----YQPSPIVRGAFNWRLEFKWDHVLSYEIEGPEGPTTPIRSPAMAGGIFAINRHY 323

Query: 271 FLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVK 330
           F E+G YD G+ +WGGEN ELS +IWMCGG +  +PCSR+GH+ R  +   F  +     
Sbjct: 324 FNEIGQYDKGMNLWGGENLELSLRIWMCGGQLYVIPCSRVGHINRQHVTNRFEIMK---- 379

Query: 331 GPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
             ++ YN  R++ TW DE +K  F+ R P       G+ISE+
Sbjct: 380 --VVEYNNLRLVHTWLDE-YKGQFFLRRPALKSAAYGNISER 418


>gi|296488205|tpg|DAA30318.1| TPA: polypeptide N-acetylgalactosaminyltransferase-like 5 [Bos
           taurus]
          Length = 447

 Score =  277 bits (709), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 146/342 (42%), Positives = 203/342 (59%), Gaps = 16/342 (4%)

Query: 31  YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVF 90
           Y +PE            YG N   S ++   R +PD R   C+   YP  LP AS+I+ F
Sbjct: 93  YSIPEVIHG-----YSTYGFNSIISKNLGHYRNVPDTRNVMCQKKMYPAKLPTASIIICF 147

Query: 91  HNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNT 150
           HNE F++L RT+ SI+  T    LEEIILVDD S   DL +KL+ +++ F GK++LIRN 
Sbjct: 148 HNEEFNALFRTLSSIMTLTQQYILEEIILVDDMSDFDDLKEKLDYHLEIFRGKIKLIRNK 207

Query: 151 EREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQ 210
           +REGLIR R  GA  + G+V+VFLD+HCEV   WL PLL  I  D K++  P+ID IDY 
Sbjct: 208 KREGLIRARMTGASHASGDVLVFLDSHCEVNKVWLEPLLNAIAKDPKMVVCPLIDVIDYM 267

Query: 211 TWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAF 270
           T E    Y+P    RG F W + +K + +   E +  +  + P +SP  AGG+FA++R +
Sbjct: 268 TLE----YQPSPIVRGAFNWRLEFKWDHVLSYEIEGPEGPTTPIRSPAMAGGIFAINRHY 323

Query: 271 FLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVK 330
           F E+G YD G+ +WGGEN ELS +IWMCGG +  +PCSR+GH+ R  +   F  +     
Sbjct: 324 FNEIGQYDKGMNLWGGENLELSLRIWMCGGQLYVIPCSRVGHINRQHVTNRFEIMK---- 379

Query: 331 GPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
             ++ YN  R++ TW DE +K  F+ R P       G+ISE+
Sbjct: 380 --VVEYNNLRLVHTWLDE-YKGQFFLRRPALKSAAYGNISER 418


>gi|426228255|ref|XP_004008229.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5 [Ovis
           aries]
          Length = 448

 Score =  277 bits (709), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 145/325 (44%), Positives = 197/325 (60%), Gaps = 11/325 (3%)

Query: 48  YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
           YG N   S ++   R++PD R   C+   YP  LP AS+I+ FHNE FS+L RT+ SI+ 
Sbjct: 105 YGFNHIISKNLGHYRSVPDTRNVMCRKKTYPARLPTASIIICFHNEEFSALFRTLSSIMA 164

Query: 108 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 167
            TP   LEEIILVDD S   DL +KL+ +++ F GK++LIRN +REGLIR R  GA  + 
Sbjct: 165 LTPQYILEEIILVDDTSDFDDLKEKLDYHLEIFRGKIKLIRNKKREGLIRARMTGASHAS 224

Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
           G+V+VFLD+HCEV   WL PLL  I  D K++  P+ID IDY T E    Y+P    RG 
Sbjct: 225 GDVLVFLDSHCEVNKVWLEPLLNAIAKDPKMVVCPLIDVIDYMTLE----YQPSPIVRGA 280

Query: 228 FEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
           F W + +K + +   E +  +  + P +SP  AGG+FA+ R +F E+G YD G+ +WGGE
Sbjct: 281 FNWHLEFKWDHVLSYEIEGPEGPTTPIRSPAMAGGIFAISRNYFNEIGQYDKGMNLWGGE 340

Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
           N ELS +IWMCGG +  +PCSR+GH+ R  M        D     ++ YN  R+   W D
Sbjct: 341 NLELSLRIWMCGGQLYVIPCSRVGHINRQHMT------NDSEIMKVVEYNSLRLAHIWLD 394

Query: 348 EKHKAYFYTREPLAMFLDMGDISEQ 372
           E +K  F+ R P       G+ISE+
Sbjct: 395 E-YKEEFFLRRPALKSAAYGNISER 418


>gi|383862333|ref|XP_003706638.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
           [Megachile rotundata]
          Length = 637

 Score =  277 bits (709), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 146/327 (44%), Positives = 209/327 (63%), Gaps = 17/327 (5%)

Query: 48  YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
           Y  N+  S++I  DR +PD R + C+   YP  LP AS+++ F+NE + +L+R++HSII+
Sbjct: 135 YAFNVLISDNIGLDRKLPDTRHKLCQMQQYPNKLPNASIVICFYNEHYMTLLRSIHSIIE 194

Query: 108 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKES 166
           RTP   L EIILV+D+S   +L +K++ +I   F+ KV+  +  +REGLIR R  GA+++
Sbjct: 195 RTPKHLLHEIILVNDWSDSKELHEKIKAFINNNFDRKVKFFKTEKREGLIRARMFGARKA 254

Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
            GEV++FLD+H EV   W+ PLL+ I   + I+ +PVID I+  T++    Y      RG
Sbjct: 255 TGEVLIFLDSHIEVNKMWIEPLLSRIAHSKTIVAMPVIDIINADTFQ----YTASPLVRG 310

Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
            F WG+ +K  +LP +      +  +P KSPT AGGLFAMDR +F+ELG YD G+ VWGG
Sbjct: 311 GFNWGLHFKWEQLPTKLVHDEDF-IKPIKSPTMAGGLFAMDREYFVELGEYDAGMDVWGG 369

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
           EN E+SF+IWMCGGSIE +PCSR+GHV+R   PY     AD  K   +  N  RV   W 
Sbjct: 370 ENLEISFRIWMCGGSIELIPCSRVGHVFRKRRPYG----ADD-KHDTMLKNSLRVAYVWL 424

Query: 347 DE-KHKAYFYTREPLAMFLDMGDISEQ 372
           DE KH   +Y ++     +D GDI+++
Sbjct: 425 DEYKH---YYLKD--VNKIDYGDITDR 446


>gi|340712006|ref|XP_003394556.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 1 [Bombus terrestris]
 gi|340712008|ref|XP_003394557.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 2 [Bombus terrestris]
          Length = 606

 Score =  277 bits (709), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 145/350 (41%), Positives = 208/350 (59%), Gaps = 9/350 (2%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           PGE G A H+     A           N+  S+ IS +R++ D+R+E CK   Y   LP 
Sbjct: 104 PGEVGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYNKYLPD 163

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
            S+++VFHNE +S+L+RTV S+I R+P   L+EIILVDD S +  L Q LEDY++     
Sbjct: 164 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVP 223

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
             + R  +R GLIR R  GAK   G+VI FLDAHCE    WL PLL+ I  DR  +  P+
Sbjct: 224 TYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSRIAEDRTTVVCPI 283

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
           ID I   T+E+  +   D  + G F W + ++   + +RE  +R  + + P ++PT AGG
Sbjct: 284 IDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDRTAPLRTPTMAGG 340

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E  PCS +GHV+R   PY F
Sbjct: 341 LFSIDKDYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHVFRDKSPYTF 400

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                +V    + +N  RV E W DE  + ++Y   P A  + +GD+SE+
Sbjct: 401 PGGVSKV----VLHNAARVAEVWMDE-WRDFYYAMNPGARSVAVGDVSER 445


>gi|195030214|ref|XP_001987963.1| GH10909 [Drosophila grimshawi]
 gi|193903963|gb|EDW02830.1| GH10909 [Drosophila grimshawi]
          Length = 668

 Score =  277 bits (709), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 160/361 (44%), Positives = 216/361 (59%), Gaps = 29/361 (8%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKYWDYPL 79
           G GE G    + +A  A  +    EY   G N   S+ IS +R++PD+R EECK   Y  
Sbjct: 145 GFGEHGLPVQIEDA--AEKELEQKEYRRNGFNGFISDRISVNRSVPDVRREECKTRKYLA 202

Query: 80  DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-Q 138
            LP+ SV+++F+NE F +L+RTV+SII RTP + L++I+LVDD S    L Q+L+DY+ Q
Sbjct: 203 KLPRVSVVIIFYNEHFQTLLRTVYSIINRTPTELLQQIVLVDDGSEWETLKQQLDDYVAQ 262

Query: 139 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
            +   V ++ + ER+GLI  R  GAK S GE IVF D+H EV  NWLPPLL PI  + KI
Sbjct: 263 HWPHLVDVVHSPERQGLIGARLAGAKVSMGEAIVFFDSHIEVNYNWLPPLLEPIAINNKI 322

Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKKRKYNSEPYKSP 257
            T P++D ID+  + +   Y+     RG F+W   YK+   LPE    K    S PY++P
Sbjct: 323 ATCPIVDIIDHNNFAYNGGYQEGS--RGGFDWRFFYKQLAVLPEDSVDK----SLPYRNP 376

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
              GGLFA+   FF +LGGYD GL +WGGE +ELSFKIWMCGG +  VPCSR+ H++R  
Sbjct: 377 VMMGGLFAIASEFFWDLGGYDDGLQIWGGEQYELSFKIWMCGGMLLDVPCSRVAHIFRGQ 436

Query: 318 M-----PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMGDISE 371
           M     P N+  LA          N+KRV E W DE +K + Y R+      +D GD++ 
Sbjct: 437 MDPRPNPLNYNFLA---------RNHKRVAEVWMDE-YKEHVYRRDRTTYDKIDAGDLTR 486

Query: 372 Q 372
           Q
Sbjct: 487 Q 487


>gi|170065987|ref|XP_001868085.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
 gi|167862691|gb|EDS26074.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
          Length = 639

 Score =  277 bits (709), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 150/332 (45%), Positives = 204/332 (61%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  N+  S+ I   R +PD R + C    Y   LP AS+I+ F+NE   +L+R+
Sbjct: 134 DVGYRKHAFNVLVSSKIGPFREVPDTRHKLCPEQSYDKVLPSASIIMCFYNEHLQTLLRS 193

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG-KVRLIRNTEREGLIRTRS 160
           V+S++ RTPA  L EIILVDD S   DL   LE  +++FN  K+RLIRN +REGL+R+R 
Sbjct: 194 VNSVLGRTPAYLLHEIILVDDCSDFDDLGDDLEVGLKKFNNSKIRLIRNRDREGLMRSRV 253

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA+ + G+V+VFLD+H EV ++W+ PLL  I  +R I+ +PVID I+  T+     Y  
Sbjct: 254 YGARNATGDVLVFLDSHIEVNVDWIEPLLQRIKVNRTILAMPVIDIINSDTF----AYTS 309

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + LP+    K      P++SPT AGGLFAMDR +F ELG YD G
Sbjct: 310 SPLVRGGFNWGLHFKWDNLPKGSLAKETDFVGPFQSPTMAGGLFAMDRKYFKELGEYDMG 369

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + VWGGEN E+SF+ W CGGSIE +PCSRIGHV+R   PY      D      +  N  R
Sbjct: 370 MDVWGGENLEISFRAWQCGGSIELLPCSRIGHVFRKRRPYGSPDGTD-----TMIRNSLR 424

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W D+  K YF+  +P A  LD GD+SE+
Sbjct: 425 LARVWMDDYIK-YFFENQPHANKLDAGDLSER 455


>gi|48143331|ref|XP_397422.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Apis mellifera]
          Length = 606

 Score =  277 bits (708), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 144/350 (41%), Positives = 209/350 (59%), Gaps = 9/350 (2%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           PGE G A H+     A           N+  S+ IS +R++ D+R++ CK   Y   LP 
Sbjct: 104 PGEMGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLKDIRLDGCKTKKYSKYLPD 163

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
            S+++VFHNE +S+L+RTV S+I R+P   L+EIILVDD S +  L Q LEDY++     
Sbjct: 164 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVP 223

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
             + R  +R GLIR R  GAK  +G+VI FLDAHCE    WL PLL+ I  DR  +  P+
Sbjct: 224 TYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSRIAEDRTTVVCPI 283

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
           ID I   T+E+  +   D  + G F W + ++   + +RE  +R  + + P ++PT AGG
Sbjct: 284 IDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDRTAPLRTPTMAGG 340

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E  PCS +GHV+R   PY F
Sbjct: 341 LFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHVFRDKSPYTF 400

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                +V    + +N  RV E W DE  + ++Y   P A  + +GD+SE+
Sbjct: 401 PGGVSKV----VLHNAARVAEVWMDE-WRDFYYAMNPGARNVAVGDVSER 445


>gi|170060398|ref|XP_001865784.1| N-acetyl galactosaminyl transferase 7 [Culex quinquefasciatus]
 gi|167878898|gb|EDS42281.1| N-acetyl galactosaminyl transferase 7 [Culex quinquefasciatus]
          Length = 356

 Score =  277 bits (708), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 125/147 (85%), Positives = 138/147 (93%)

Query: 226 GIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWG 285
           GIFEWGMLYKENE+P REAK+RK++SEPYKSPTHAGGLFA++R FFL++G YDPGLLVWG
Sbjct: 51  GIFEWGMLYKENEVPRREAKRRKHDSEPYKSPTHAGGLFAINREFFLKIGAYDPGLLVWG 110

Query: 286 GENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETW 345
           GENFELSFKIW CGGSIEWVPCSR+GHVYR FMPYNFGKLA++ KGPLIT NYKRVIETW
Sbjct: 111 GENFELSFKIWQCGGSIEWVPCSRVGHVYRGFMPYNFGKLANKKKGPLITINYKRVIETW 170

Query: 346 FDEKHKAYFYTREPLAMFLDMGDISEQ 372
           FDE++K YFYTREPLA FLDMGDISEQ
Sbjct: 171 FDEQYKEYFYTREPLARFLDMGDISEQ 197


>gi|71987795|ref|NP_001022646.1| Protein GLY-6, isoform c [Caenorhabditis elegans]
 gi|3047201|gb|AAC13676.1| GLY6c [Caenorhabditis elegans]
 gi|14530525|emb|CAC42318.1| Protein GLY-6, isoform c [Caenorhabditis elegans]
          Length = 562

 Score =  277 bits (708), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 148/365 (40%), Positives = 221/365 (60%), Gaps = 13/365 (3%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
            NL  P + + EG   G    HL    +   D++      N+  S+ IS  R++P++R  
Sbjct: 89  ANLYAPHDDWGEG---GAGVSHLTPEQQKLADSTFAVNQFNLLVSDGISVRRSLPEIRKP 145

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
            C+   YP +LP  SVI+V+HNE +S+L+RTV S+I R+P + L+EIILVDDFS +  L 
Sbjct: 146 SCRNMTYPDNLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLR 205

Query: 131 -QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
              L+  ++     +++IR+ ER GLIR R  GA+E++G+V+ FLD+HCE    WL PLL
Sbjct: 206 YPTLDTTLKPLPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLL 265

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
             I  +RK +  PVID I+  T++++   E    +RG F W + ++   +P   AK+   
Sbjct: 266 TRIKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPTAMAKQHLL 322

Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
           + + P +SPT AGGLF+++R +F ELG YDPG+ +WGGEN E+SF+IW CGG +E +PCS
Sbjct: 323 DPTGPIESPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCS 382

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG- 367
            +GHV+R   P++F     +  G ++  N  RV E W D+  K YFY   P A  +    
Sbjct: 383 HVGHVFRKSSPHDF---PGKSSGKVLNTNLLRVAEVWMDD-WKHYFYKIAPQAHRMRSSI 438

Query: 368 DISEQ 372
           D+SE+
Sbjct: 439 DVSER 443


>gi|350402571|ref|XP_003486531.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 1 [Bombus impatiens]
          Length = 606

 Score =  276 bits (707), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 145/350 (41%), Positives = 208/350 (59%), Gaps = 9/350 (2%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           PGE G A H+     A           N+  S+ IS +R++ D+R+E CK   Y   LP 
Sbjct: 104 PGEVGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYNKYLPD 163

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
            S+++VFHNE +S+L+RTV S+I R+P   L+EIILVDD S +  L Q LEDY++     
Sbjct: 164 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVP 223

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
             + R  +R GLIR R  GAK   G+VI FLDAHCE    WL PLL+ I  DR  +  P+
Sbjct: 224 TYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSRIAEDRTTVVCPI 283

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
           ID I   T+E+  +   D  + G F W + ++   + +RE  +R  + + P ++PT AGG
Sbjct: 284 IDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDRTAPLRTPTMAGG 340

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E  PCS +GHV+R   PY F
Sbjct: 341 LFSIDKDYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHVFRDKSPYTF 400

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                +V    + +N  RV E W DE  + ++Y   P A  + +GD+SE+
Sbjct: 401 PGGVSKV----VLHNAARVAEVWMDE-WRDFYYAMNPGARNVAVGDVSER 445


>gi|307189895|gb|EFN74139.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Camponotus
           floridanus]
          Length = 608

 Score =  276 bits (707), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 143/353 (40%), Positives = 211/353 (59%), Gaps = 9/353 (2%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           K  PGE G A H+     A           N+  S+ IS +R++ D+R+E CK   YP  
Sbjct: 103 KGSPGEMGAAVHIAPENEAKQQELFKLNQFNLMASDLISLNRSLKDIRLEGCKNKKYPKY 162

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  S+++VFHNE +++L+RTV S+I R+P   L+EIILVDD S +  L ++LE +I   
Sbjct: 163 LPDTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASEREHLKKELEKHITEL 222

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
                + R  +R GLIR R  GAK  +G+VI FLDAHCE    WL PLL+ I +DR  + 
Sbjct: 223 PVPTYVYRTEKRSGLIRARLLGAKYVKGQVITFLDAHCECTEGWLEPLLSRIANDRHTVV 282

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
            P+ID I   T+E+  +   D  + G F W + ++   + +RE  +R  + + P ++PT 
Sbjct: 283 CPIIDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRNGDRTAPLRTPTM 339

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E   CS +GHV+R   P
Sbjct: 340 AGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISSCSHVGHVFRDKSP 399

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           Y F     ++    + +N  RV E W DE  + ++Y   P A  +D+GD+SE+
Sbjct: 400 YTFPGGVSKI----VLHNAARVAEVWMDE-WRDFYYAMNPGARNVDVGDVSER 447


>gi|195030212|ref|XP_001987962.1| GH10908 [Drosophila grimshawi]
 gi|193903962|gb|EDW02829.1| GH10908 [Drosophila grimshawi]
          Length = 684

 Score =  276 bits (707), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 159/361 (44%), Positives = 216/361 (59%), Gaps = 29/361 (8%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKYWDYPL 79
           G GE G    + +A  A  +    EY   G N   S+ IS +R++PD+R EECK   Y  
Sbjct: 161 GFGEHGLPVQIEDA--AEKELEQKEYRRNGFNGFISDRISVNRSVPDVRREECKTRKYLA 218

Query: 80  DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-Q 138
            LP+ SV+++F+NE F +L+RTV+SII RTP + L++I+LVDD S    L Q+L+DY+ Q
Sbjct: 219 KLPRVSVVIIFYNEHFQTLLRTVYSIINRTPTELLQQIVLVDDGSEWETLKQQLDDYVAQ 278

Query: 139 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
            +   V ++ + ER+GLI  R  GAK S GE +VF D+H EV  NWLPPLL PI  + KI
Sbjct: 279 HWPHLVDVVHSPERQGLIGARLAGAKVSMGEAMVFFDSHIEVNYNWLPPLLEPIAINNKI 338

Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKKRKYNSEPYKSP 257
            T P++D ID+  + +   Y+     RG F+W   YK+   LPE    K    S PY++P
Sbjct: 339 ATCPIVDIIDHNNFAYNGGYQEGS--RGGFDWRFFYKQLAVLPEDSVDK----SLPYRNP 392

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
              GGLFA+   FF +LGGYD GL +WGGE +ELSFKIWMCGG +  VPCSR+ H++R  
Sbjct: 393 VMIGGLFAIASEFFWDLGGYDDGLQIWGGEQYELSFKIWMCGGMLLDVPCSRVAHIFRGQ 452

Query: 318 M-----PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMGDISE 371
           M     P N+  LA          N+KRV E W DE +K + Y R+      +D GD++ 
Sbjct: 453 MDPRPNPLNYNFLA---------RNHKRVAEVWMDE-YKEHVYRRDRTTYDNIDAGDLTR 502

Query: 372 Q 372
           Q
Sbjct: 503 Q 503


>gi|242001786|ref|XP_002435536.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
 gi|215498872|gb|EEC08366.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
          Length = 460

 Score =  276 bits (707), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 139/302 (46%), Positives = 195/302 (64%), Gaps = 9/302 (2%)

Query: 72  CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
           CK   YP  LP  SV++VFHNE +S+L+RTVHS+I+ +P   LEEIILVDD S +  L +
Sbjct: 7   CKDKVYPEKLPTTSVVIVFHNEAWSTLLRTVHSVIRTSPRALLEEIILVDDASEREHLGK 66

Query: 132 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 191
           +LEDY+ + +  V+++R  +R GLIR R  GA   +G+VI FLDAHCE   NWL PLLA 
Sbjct: 67  QLEDYVVKLDTPVKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTQNWLEPLLAR 126

Query: 192 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN- 250
           I  DR  +  PVID I  +T+E+ S  +      G F W + ++   +P+RE  +R  + 
Sbjct: 127 IAEDRTRVVCPVIDVISDETFEYISASDLTW---GGFNWKLNFRWYRVPQRELDRRGGDR 183

Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
           + P ++PT AGGLFA+D+ +F+ELG YD G+ +WGGEN ELSF+IWMCGG +E VPCS +
Sbjct: 184 TLPVRTPTMAGGLFAIDKDYFVELGKYDEGMDIWGGENLELSFRIWMCGGELEIVPCSHV 243

Query: 311 GHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           GHV+R   PY F     ++    + +N  R+ E W DE  K +++   P A  +D GD+S
Sbjct: 244 GHVFRKSTPYTFPGGTSKI----VNHNNARLAEVWLDE-WKEFYFAINPAAKNVDKGDLS 298

Query: 371 EQ 372
            +
Sbjct: 299 HR 300


>gi|391347961|ref|XP_003748222.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Metaseiulus occidentalis]
          Length = 658

 Score =  276 bits (707), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 149/351 (42%), Positives = 206/351 (58%), Gaps = 14/351 (3%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL-DLPK 83
           G+ G A  L    +   D    +   N+  S+ +  +R++PD R   C+   YP+ ++P 
Sbjct: 144 GKDGHAVILGRDEQLEADREFSKAAFNVYVSDRLPLNRSLPDTRHRHCRAITYPVAEMPT 203

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNG 142
           ASV+++F +E FS+L+RT+ S+I R+P   L EIILVDDFS   DL  +LE YI+  F  
Sbjct: 204 ASVVIIFTDEIFSTLLRTIVSVIDRSPRHLLREIILVDDFSQSEDLKDRLERYIEHHFRA 263

Query: 143 KV-RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
            V RLIR  ER GLIR R  GA+ +RG+V++FLD+HCE    WL PLL PI  DR+ +  
Sbjct: 264 DVVRLIRLPERSGLIRARLVGARAARGDVLIFLDSHCETTPGWLEPLLEPIRRDRRAVVC 323

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
           PVID IDY+T ++ +  E D    G F W   +  + +P    + R   +EP +SPT AG
Sbjct: 324 PVIDVIDYRTLQYVAA-EGDRFQIGGFNWRGEFTWHNIPSAWRRNRVSVAEPMRSPTMAG 382

Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
           GLFA++R +F E G YD  +  WGGEN E+SF+IW CGG I   PCS +GH++R + PY 
Sbjct: 383 GLFAINREYFWESGSYDEEMDGWGGENLEMSFRIWQCGGHIVIAPCSHVGHIFRDYQPYK 442

Query: 322 F--GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
              GK  + +       N KR +E W DE  K Y Y   P    + +GDIS
Sbjct: 443 IPGGKDTNAI-------NTKRAVEVWMDE-FKKYIYQARPELKKIRIGDIS 485


>gi|347971870|ref|XP_313714.5| AGAP004429-PA [Anopheles gambiae str. PEST]
 gi|333469065|gb|EAA09257.5| AGAP004429-PA [Anopheles gambiae str. PEST]
          Length = 663

 Score =  276 bits (707), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 148/334 (44%), Positives = 208/334 (62%), Gaps = 13/334 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  N+  SN +   R IPD R + C+   Y   LP ASV++ F+NE   +L+R+
Sbjct: 154 DIGYRKHAFNVLVSNKLGPFRPIPDTRHKLCQAQVYDKVLPVASVVMCFYNEHLETLVRS 213

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLD---QKLEDYIQRFNGKVRLIRNTEREGLIRT 158
           +H+++KRTPA  L+E+ILVDD S   DL    Q  ++  Q    KVRL+RNT+REGLIR+
Sbjct: 214 IHTVLKRTPAYLLKELILVDDCSDFEDLTVGGQLEKELAQLGTNKVRLLRNTDREGLIRS 273

Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
           R  GA+ + G+V++FLD+H EV ++W+ PLLA I  DR I+ +PVID I+  T+    VY
Sbjct: 274 RVYGARNATGQVLIFLDSHIEVNVDWIEPLLARIKHDRTILAMPVIDIINSDTF----VY 329

Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
                 RG F WG+ +K + LP+   ++      P+ SPT AGGLFA+DRA+F ELG YD
Sbjct: 330 TASPLVRGGFNWGLHFKWDNLPKGSLERDTDFVGPFNSPTMAGGLFAIDRAYFKELGEYD 389

Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNY 338
            G+ VWGGEN E+SF+ W CGGSIE +PCSRIGHV+R   PY      D      +  N 
Sbjct: 390 MGMDVWGGENLEISFRAWQCGGSIELLPCSRIGHVFRKRRPYGSPDGQD-----TMIRNS 444

Query: 339 KRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            R+   W D+ +  YFY ++P A  +  G++SE+
Sbjct: 445 LRLAHVWMDD-YIRYFYEQQPQAHHVPYGNVSER 477


>gi|71987784|ref|NP_001022644.1| Protein GLY-6, isoform a [Caenorhabditis elegans]
 gi|51315809|sp|O61394.1|GALT6_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 6;
           AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 6; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 6; Short=pp-GaNTase 6
 gi|3047197|gb|AAC13674.1| GLY6a [Caenorhabditis elegans]
 gi|3878104|emb|CAA19707.1| Protein GLY-6, isoform a [Caenorhabditis elegans]
          Length = 618

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 148/365 (40%), Positives = 221/365 (60%), Gaps = 13/365 (3%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
            NL  P + + EG   G    HL    +   D++      N+  S+ IS  R++P++R  
Sbjct: 89  ANLYAPHDDWGEG---GAGVSHLTPEQQKLADSTFAVNQFNLLVSDGISVRRSLPEIRKP 145

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
            C+   YP +LP  SVI+V+HNE +S+L+RTV S+I R+P + L+EIILVDDFS +  L 
Sbjct: 146 SCRNMTYPDNLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLR 205

Query: 131 -QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
              L+  ++     +++IR+ ER GLIR R  GA+E++G+V+ FLD+HCE    WL PLL
Sbjct: 206 YPTLDTTLKPLPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLL 265

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
             I  +RK +  PVID I+  T++++   E    +RG F W + ++   +P   AK+   
Sbjct: 266 TRIKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPTAMAKQHLL 322

Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
           + + P +SPT AGGLF+++R +F ELG YDPG+ +WGGEN E+SF+IW CGG +E +PCS
Sbjct: 323 DPTGPIESPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCS 382

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG- 367
            +GHV+R   P++F     +  G ++  N  RV E W D+  K YFY   P A  +    
Sbjct: 383 HVGHVFRKSSPHDF---PGKSSGKVLNTNLLRVAEVWMDD-WKHYFYKIAPQAHRMRSSI 438

Query: 368 DISEQ 372
           D+SE+
Sbjct: 439 DVSER 443


>gi|194761562|ref|XP_001962998.1| GF15722 [Drosophila ananassae]
 gi|190616695|gb|EDV32219.1| GF15722 [Drosophila ananassae]
          Length = 675

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 139/336 (41%), Positives = 205/336 (61%), Gaps = 10/336 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           L P ++  K  PGE GK   +P   +        E   N+  S+ IS +R++ D+R + C
Sbjct: 118 LAPSVQEAKGKPGEMGKPVKIPADMKDLMKDKFKENQFNLLASDMISLNRSLTDVRHDGC 177

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           +   YP  LP  S+++VFHNE +++L+RTV S+I R+P   L+EIILVDD S +  L ++
Sbjct: 178 RRKHYPSKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LEDY+ +   K  ++R  +R GLIR R  GA+   GEVI FLDAHCE    WL PLLA I
Sbjct: 238 LEDYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 297

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             +R+ +  P+ID I  +T+E+  +   D  + G F W + ++   +P RE  +R  + +
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDRT 354

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +G
Sbjct: 355 APLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVG 414

Query: 312 HVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWF 346
           HV+R   PY F G +A      ++ +N  RV E W 
Sbjct: 415 HVFRDKSPYTFPGGVA-----KIVLHNAARVAEVWM 445


>gi|345483668|ref|XP_001601037.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Nasonia vitripennis]
          Length = 587

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 146/335 (43%), Positives = 201/335 (60%), Gaps = 7/335 (2%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
           GE G+  +L    +  G+  L +  +N+  SN I   R +PD+R   CK   Y   LP A
Sbjct: 75  GEYGRPAYLSGEEKIKGNEVLKKKAVNIILSNKIPLQRKLPDVRDPLCKNVTYDSVLPSA 134

Query: 85  SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGK 143
           S+I++FHNE FS L+RTV+S+IK TP + L+EIILVDD  S  +L   LE YIQ R   K
Sbjct: 135 SIIIIFHNEAFSVLLRTVYSVIKETPPKLLKEIILVDD-KSNEELLGLLEYYIQTRLPKK 193

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
           V+L+R  ER+GL+R R +GAK + G+V++FLDAHCEV   WL PLL  I   +  +  P+
Sbjct: 194 VKLLRLDERQGLVRARLKGAKSATGDVLMFLDAHCEVTKQWLEPLLQRIKEKKNAVVTPI 253

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 263
           ID I  +T+E+    EP     G F W   +    + E + K +     P KSPT AGGL
Sbjct: 254 IDNISEETFEYSHSDEPSFFQVGGFTWSGHFTWINIQEADLKSKTSAISPVKSPTMAGGL 313

Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
           FA++R +F ++G YD  +  WGGEN E+SF+IW CGG +E +PCSR+GHV+R+F+PY F 
Sbjct: 314 FAINRKYFWDIGSYDDKMEGWGGENLEMSFRIWQCGGVLETIPCSRVGHVFRNFLPYKFP 373

Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
              D   G     N  R+   W D+  + Y+  RE
Sbjct: 374 MDKD-THG----INTARLANVWMDDYKRLYYLHRE 403


>gi|71987788|ref|NP_001022645.1| Protein GLY-6, isoform b [Caenorhabditis elegans]
 gi|3047199|gb|AAC13675.1| GLY6b [Caenorhabditis elegans]
 gi|14530524|emb|CAC42317.1| Protein GLY-6, isoform b [Caenorhabditis elegans]
          Length = 617

 Score =  276 bits (705), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 148/365 (40%), Positives = 221/365 (60%), Gaps = 13/365 (3%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
            NL  P + + EG   G    HL    +   D++      N+  S+ IS  R++P++R  
Sbjct: 89  ANLYAPHDDWGEG---GAGVSHLTPEQQKLADSTFAVNQFNLLVSDGISVRRSLPEIRKP 145

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
            C+   YP +LP  SVI+V+HNE +S+L+RTV S+I R+P + L+EIILVDDFS +  L 
Sbjct: 146 SCRNMTYPDNLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLR 205

Query: 131 -QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
              L+  ++     +++IR+ ER GLIR R  GA+E++G+V+ FLD+HCE    WL PLL
Sbjct: 206 YPTLDTTLKPLPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLL 265

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
             I  +RK +  PVID I+  T++++   E    +RG F W + ++   +P   AK+   
Sbjct: 266 TRIKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPTAMAKQHLL 322

Query: 250 N-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
           + + P +SPT AGGLF+++R +F ELG YDPG+ +WGGEN E+SF+IW CGG +E +PCS
Sbjct: 323 DPTGPIESPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCS 382

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG- 367
            +GHV+R   P++F     +  G ++  N  RV E W D+  K YFY   P A  +    
Sbjct: 383 HVGHVFRKSSPHDF---PGKSSGKVLNTNLLRVAEVWMDD-WKHYFYKIAPQAHRMRSSI 438

Query: 368 DISEQ 372
           D+SE+
Sbjct: 439 DVSER 443


>gi|332021082|gb|EGI61469.1| Polypeptide N-acetylgalactosaminyltransferase 35A [Acromyrmex
           echinatior]
          Length = 580

 Score =  276 bits (705), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 150/358 (41%), Positives = 215/358 (60%), Gaps = 16/358 (4%)

Query: 16  PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
           P    ++G  E G   +L +  +   D    +Y  N+  S+++   R IPD R + CK  
Sbjct: 47  PAVTLEQGLDELGMVKNLEDQRKR--DEGYKDYAFNILISDNLGVQRNIPDTRHKLCKMQ 104

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
            YP +LP AS+I+ F+NE +++L+R++HSI+++TP   L EIILV+D+S    L + ++ 
Sbjct: 105 KYPANLPNASIIICFYNEHYTTLLRSLHSILEKTPTVLLHEIILVNDYSDSDTLHENIKV 164

Query: 136 YIQR-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           YI+  FN +VRL +   REGLIR R  GA+++ G+V++FLD+H EV   W+ PLL+ I  
Sbjct: 165 YIRNNFNDRVRLFKTERREGLIRARVFGARKATGKVLIFLDSHIEVNEIWIEPLLSRIAY 224

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
            R I+ +PVID I+  T++    Y      RG F WG+ +K + LP           +P 
Sbjct: 225 SRNIIPMPVIDIINADTFQ----YTGSPLVRGGFNWGLHFKWDNLPIGTLNHDVDFVKPI 280

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           KSPT AGGLFA+DR +F ++G YD G+ +WGGEN E+SF+IWMCGGSIE +PCSR+GHV+
Sbjct: 281 KSPTMAGGLFAIDREYFTKMGEYDIGMDIWGGENLEISFRIWMCGGSIELIPCSRVGHVF 340

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           R   PY      D      +  N  RV   W DE +K YF      A  +D GDISE+
Sbjct: 341 RRRRPYGSDDPQD-----TMLKNSLRVAHVWMDE-YKDYFLKN---AKTIDYGDISER 389


>gi|350402574|ref|XP_003486532.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 2 [Bombus impatiens]
          Length = 606

 Score =  276 bits (705), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 144/350 (41%), Positives = 208/350 (59%), Gaps = 9/350 (2%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           PGE G A H+     A           N+  S+ IS +R++ D+R+E CK   Y   LP 
Sbjct: 104 PGEVGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYNKYLPD 163

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
            S+++VFHNE +S+L+RTV S+I R+P   L+EIILVDD S +  L Q LEDY++     
Sbjct: 164 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVP 223

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
             + R  +R GLIR R  GAK   G+VI FLDAHCE    WL PLL+ I  DR  +  P+
Sbjct: 224 TYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSRIAEDRTTVVCPI 283

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
           ID I   T+E+  +   D  + G F W + ++   + +RE  +R  + + P ++PT AGG
Sbjct: 284 IDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDRTAPLRTPTMAGG 340

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LF++D+ +F ELG YD G+ +WGGEN E+SF+IWMCGG++E   CS +GHV+R   PY F
Sbjct: 341 LFSIDKDYFYELGAYDEGMDIWGGENLEMSFRIWMCGGTLEIATCSHVGHVFRKSTPYTF 400

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                ++    + +N  R+ E W D+  K ++Y   P A  + +GD+SE+
Sbjct: 401 PGGTSKI----VNHNNARLAEVWLDQ-WKYFYYNINPGARNVAVGDVSER 445


>gi|393910975|gb|EJD76111.1| glycosyl transferase, variant [Loa loa]
          Length = 549

 Score =  276 bits (705), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 149/356 (41%), Positives = 214/356 (60%), Gaps = 16/356 (4%)

Query: 21  KEGPGEGGK-AYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL 79
           ++G GEGG+ A    E ++   D      G N   S+ I+ +R+I D+R   C+   Y  
Sbjct: 93  RQGLGEGGQPAVVAVEEFKKLRDGLYRSNGYNAYISDFIALNRSIKDIRHSGCRNMVYLE 152

Query: 80  DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
            LP   V+   HNE  S+L+R+++S+I R+P   ++E+ILVDD S+K  L Q LE+++++
Sbjct: 153 KLPTVGVVFPIHNEHNSTLLRSIYSVINRSPKDIMKEVILVDDGSTKPFLKQPLEEFLKK 212

Query: 140 --FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
              N  V+++R  +REGLIR R  GA+    +VIVFLDAH E   NWLPPL+ PI  D +
Sbjct: 213 AGLNHIVKVVRTQKREGLIRARQIGARHVTADVIVFLDAHSETNYNWLPPLVEPIALDYR 272

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID  T+E+R+    D   RG F+W   YK   L E     +K  + P+ +P
Sbjct: 273 TVVCPLIDVIDCDTYEYRA---QDEGGRGSFDWEFNYKRLPLTE---DNKKNPTRPFHNP 326

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRS- 316
             AGG FA+ R +F ELGGYD GL +WGGE +ELSFK+W C G++   PCSR+GH+YR  
Sbjct: 327 VMAGGYFAISRKWFWELGGYDEGLEIWGGEQYELSFKVWQCHGTMVDAPCSRVGHIYRCK 386

Query: 317 FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           ++P+      D   G  I+ NY+RV E W DE  K + Y R P  + +D GD+S+Q
Sbjct: 387 YVPF-----PDPGIGDFISKNYRRVAEVWMDEYAK-FLYKRRPPLLTVDFGDLSKQ 436


>gi|156407314|ref|XP_001641489.1| predicted protein [Nematostella vectensis]
 gi|156228628|gb|EDO49426.1| predicted protein [Nematostella vectensis]
          Length = 353

 Score =  276 bits (705), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 135/303 (44%), Positives = 191/303 (63%), Gaps = 12/303 (3%)

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
           +C    YP  LP  +V++ FHNE +S+L+RTVHS+I R+PA  L EI+L+DDFS+   L 
Sbjct: 25  KCSSKSYPSYLPSTTVVICFHNEAWSTLLRTVHSVIDRSPAHLLREILLIDDFSTHDYLK 84

Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
            KL  Y+ +    VR++R ++REGLIR R  GA+ ++G+VI FLDAHCE  ++WL PLL+
Sbjct: 85  SKLTAYVAKLR-NVRVLRTSKREGLIRARLIGARAAKGDVITFLDAHCEANVDWLQPLLS 143

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
            I+SDR I+ VPVID I    + +           G F W M +  + LP     +RK  
Sbjct: 144 RIHSDRTIVAVPVIDIISSTNFMYSGTPSA---VIGGFSWDMQFTWHSLPNNRQSERKDR 200

Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
           + P ++PT AGGLF++DR +F E G YD G+ VWGGEN E+SF+IW CGG +E +PCSR+
Sbjct: 201 TAPIRTPTMAGGLFSIDRKYFFESGSYDEGMDVWGGENLEMSFRIWQCGGKLEILPCSRV 260

Query: 311 GHVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           GHV+R+  PY+F G  ++      ++ N  RV+  W DE +  Y Y + P    L  GDI
Sbjct: 261 GHVFRTRFPYSFPGGYSE------VSVNLARVVHVWMDE-YNQYVYMKRPDLQSLKYGDI 313

Query: 370 SEQ 372
           + +
Sbjct: 314 TSR 316


>gi|302565702|ref|NP_001181690.1| polypeptide N-acetylgalactosaminyltransferase 4 [Macaca mulatta]
 gi|380817542|gb|AFE80645.1| polypeptide N-acetylgalactosaminyltransferase 4 [Macaca mulatta]
          Length = 578

 Score =  276 bits (705), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 159/376 (42%), Positives = 215/376 (57%), Gaps = 28/376 (7%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHI 58
           RP++K        +PP + +   PGE GKA    L E      +  +  Y +N+  S+ I
Sbjct: 61  RPLYK--------KPPADSH--APGEWGKASKLQLNEGELKQQEELIERYAINIYLSDRI 110

Query: 59  SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           S  R I D RM ECK   +    LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EI
Sbjct: 111 SLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEI 170

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
           ILVDD S +  L  +LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FLD H
Sbjct: 171 ILVDDLSDRVYLKTQLESYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCH 229

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKE 236
           CE    WL PLL  I  D   +  PVID ID+ T+EF     EP     G F+W + ++ 
Sbjct: 230 CECNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQW 286

Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
           + +P+ E  +R    +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W
Sbjct: 287 HSVPKHERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVW 346

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
            CGG +E  PCS +GHV+    PY           P    N  RV E W DE +K +FY 
Sbjct: 347 QCGGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARVAEVWMDE-YKEHFYN 396

Query: 357 REPLAMFLDMGDISEQ 372
           R P A     GDISE+
Sbjct: 397 RNPPARKEAYGDISER 412


>gi|402888363|ref|XP_003907534.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13,
           partial [Papio anubis]
          Length = 444

 Score =  276 bits (705), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 142/300 (47%), Positives = 193/300 (64%), Gaps = 9/300 (3%)

Query: 72  CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
           CK   YP +LP  SV++VFHNE +S+L+RTV+S+I R+P   L E+ILVDD S +  L  
Sbjct: 39  CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 98

Query: 132 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 191
            LE+Y++     V++IR  ER GLIR R RGA  S+G+VI FLDAHCE  L WL PLLA 
Sbjct: 99  TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 158

Query: 192 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN- 250
           I  DRK +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + 
Sbjct: 159 IKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 215

Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
           + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +
Sbjct: 216 TLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHV 275

Query: 311 GHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           GHV+R   PY F        G +I  N +R+ E W DE  K +FY   P  + +D GD+S
Sbjct: 276 GHVFRKATPYTFPGGT----GHVINKNNRRLAEVWMDE-FKDFFYIISPGVVKVDYGDVS 330


>gi|312065523|ref|XP_003135832.1| glycosyl transferase [Loa loa]
 gi|307769015|gb|EFO28249.1| glycosyl transferase [Loa loa]
          Length = 614

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 149/356 (41%), Positives = 214/356 (60%), Gaps = 16/356 (4%)

Query: 21  KEGPGEGGK-AYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL 79
           ++G GEGG+ A    E ++   D      G N   S+ I+ +R+I D+R   C+   Y  
Sbjct: 93  RQGLGEGGQPAVVAVEEFKKLRDGLYRSNGYNAYISDFIALNRSIKDIRHSGCRNMVYLE 152

Query: 80  DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
            LP   V+   HNE  S+L+R+++S+I R+P   ++E+ILVDD S+K  L Q LE+++++
Sbjct: 153 KLPTVGVVFPIHNEHNSTLLRSIYSVINRSPKDIMKEVILVDDGSTKPFLKQPLEEFLKK 212

Query: 140 --FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
              N  V+++R  +REGLIR R  GA+    +VIVFLDAH E   NWLPPL+ PI  D +
Sbjct: 213 AGLNHIVKVVRTQKREGLIRARQIGARHVTADVIVFLDAHSETNYNWLPPLVEPIALDYR 272

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
            +  P+ID ID  T+E+R+    D   RG F+W   YK   L E     +K  + P+ +P
Sbjct: 273 TVVCPLIDVIDCDTYEYRA---QDEGGRGSFDWEFNYKRLPLTE---DNKKNPTRPFHNP 326

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRS- 316
             AGG FA+ R +F ELGGYD GL +WGGE +ELSFK+W C G++   PCSR+GH+YR  
Sbjct: 327 VMAGGYFAISRKWFWELGGYDEGLEIWGGEQYELSFKVWQCHGTMVDAPCSRVGHIYRCK 386

Query: 317 FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           ++P+      D   G  I+ NY+RV E W DE  K + Y R P  + +D GD+S+Q
Sbjct: 387 YVPF-----PDPGIGDFISKNYRRVAEVWMDEYAK-FLYKRRPPLLTVDFGDLSKQ 436


>gi|260794623|ref|XP_002592308.1| hypothetical protein BRAFLDRAFT_206872 [Branchiostoma floridae]
 gi|229277524|gb|EEN48319.1| hypothetical protein BRAFLDRAFT_206872 [Branchiostoma floridae]
          Length = 374

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 150/353 (42%), Positives = 211/353 (59%), Gaps = 17/353 (4%)

Query: 24  PGEGGKAY---HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           PGE G+     +L    R   +    +Y  N   S  I   RTIPD R   CK  +Y + 
Sbjct: 1   PGELGQGVVLRNLSPQDRKQLEEGYKKYAFNEFASTKIPLTRTIPDGRHWLCKSKEYDVS 60

Query: 81  -LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
            LP  SVI+ FHNE +S+LMRTVHS+++  P++ L E+I+VDD S    L  +L DY+  
Sbjct: 61  RLPAVSVIICFHNEAWSTLMRTVHSVLRTAPSELLTEVIMVDDDSQYDHLKAQLTDYVAG 120

Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
              KV+LIR  +REGLIR R  GA  +R +V+VFLD+HCE  + WL PLL  I  +R  +
Sbjct: 121 LP-KVKLIRTHQREGLIRARLLGASHARADVLVFLDSHCECNIGWLEPLLDRIVQNRSHV 179

Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTH 259
             PVID ID++T+E+R +       RG F+W ++++  ++P    K+R  + +P  SPT 
Sbjct: 180 VTPVIDVIDFKTFEYRHL--AIIQVRG-FDWRLIFRWEKIPASYEKRRGLSVDPILSPTM 236

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLFA+D+ +F  LG YD G+ +WGGEN ELSF+IW CGG++E +PCSR+GHV+R   P
Sbjct: 237 AGGLFAIDKEYFHHLGLYDTGMEIWGGENLELSFRIWQCGGTLEIMPCSRVGHVFRQRFP 296

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           Y       +    + T N  RV E W D+ +K YFY    +      GD++E+
Sbjct: 297 Y-------QTSTEVTTRNLMRVAEVWMDQ-YKEYFYQIRHIKK-KSFGDVTER 340


>gi|350584684|ref|XP_003481802.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
           1 [Sus scrofa]
 gi|350596113|ref|XP_003360781.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Sus scrofa]
          Length = 582

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 153/362 (42%), Positives = 208/362 (57%), Gaps = 18/362 (4%)

Query: 14  EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
           +PP + +  G    G    L E      +  +  Y +N+  S+ IS  R I D RM ECK
Sbjct: 70  KPPADSHALGEWGKGSKLQLNEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECK 129

Query: 74  Y--WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
              +DY   LP  SV++ F+NE +S+L+RT+HS+++ +PA  L+EIILVDD S +  L  
Sbjct: 130 SKKFDYR-RLPTTSVVIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKT 188

Query: 132 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 191
           +LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE    WL PLL  
Sbjct: 189 QLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLER 247

Query: 192 IYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
           I  D   +  PVID ID+ T+EF     EP     G F+W + ++ + +P+ E  +RK  
Sbjct: 248 IAEDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKSR 304

Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
            +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG +E  PCS +
Sbjct: 305 IDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHV 364

Query: 311 GHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           GHV+    PY           P    N  R  E W DE +K +FY R P A     GDIS
Sbjct: 365 GHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDIS 414

Query: 371 EQ 372
           E+
Sbjct: 415 ER 416


>gi|291389706|ref|XP_002711427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Oryctolagus cuniculus]
          Length = 579

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 156/363 (42%), Positives = 211/363 (58%), Gaps = 20/363 (5%)

Query: 14  EPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
           +PP +   +  GE GKA    L E      +  +  Y +N+  S+ IS  R I D RM E
Sbjct: 67  KPPAD--SQALGEWGKASKLQLSEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYE 124

Query: 72  CKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
           CK   +    LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EIILVDD S +A L 
Sbjct: 125 CKSKTFNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRAYLK 184

Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
            +LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE    WL PLL 
Sbjct: 185 TQLETYISNLD-RVRLIRTKKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLE 243

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
            I  D   +  PVID ID+ T+EF     EP     G F+W + ++ + +P+ E  +RK 
Sbjct: 244 RIERDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKS 300

Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
             +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG +E  PCS 
Sbjct: 301 RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSH 360

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GHV+    PY           P    N  R  E W D+ +K +FY R P A   D GDI
Sbjct: 361 VGHVFPKRAPY---------ARPNFLQNTARAAEVWMDD-YKEHFYNRNPPARKEDYGDI 410

Query: 370 SEQ 372
           SE+
Sbjct: 411 SER 413


>gi|195148068|ref|XP_002014996.1| GL18655 [Drosophila persimilis]
 gi|194106949|gb|EDW28992.1| GL18655 [Drosophila persimilis]
          Length = 646

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 148/332 (44%), Positives = 207/332 (62%), Gaps = 24/332 (7%)

Query: 49  GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
           G N   S+ IS +R++PD+R+E+CK   Y   LP  SVI +F+NE FS+L+R+++S+I R
Sbjct: 149 GFNGLLSDMISVNRSVPDVRLEQCKTRKYLSKLPNISVIFIFYNEHFSALLRSIYSVINR 208

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESR 167
           TP + L++I+LVDD S    L Q+L+DY+   F   V ++RN ER+GLI  R  GAK + 
Sbjct: 209 TPVELLKQIVLVDDGSDWDTLKQQLDDYVSLHFPHVVTVVRNVERKGLIGARLEGAKVAT 268

Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
           GEV+VF D+H EV  NWLPPLL PI  + KI T P++D ID+  + +   Y+     RG 
Sbjct: 269 GEVLVFFDSHIEVNYNWLPPLLEPIAINPKISTCPIVDIIDHSNFAYNGGYQ--EGSRGG 326

Query: 228 FEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
           F+W   YK+   LPE    K    S+P+++P   GGLFA+   FF +LGGYD  L +WGG
Sbjct: 327 FDWRFFYKQLPVLPEDSVDK----SQPFRNPVMMGGLFAIRTDFFWDLGGYDDELDIWGG 382

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM-----PYNFGKLADRVKGPLITYNYKRV 341
           E +ELSFKIWMCGG +  VPCSR+ H++R  M     P N+           +  N+KRV
Sbjct: 383 EQYELSFKIWMCGGMLLDVPCSRVAHIFRGPMDPRPNPRNYN---------FVGRNHKRV 433

Query: 342 IETWFDEKHKAYFYTREPLAM-FLDMGDISEQ 372
            E W DE +K + Y+R+P     +D GD++ Q
Sbjct: 434 AEVWMDE-YKEHVYSRDPQTYNNIDAGDLTRQ 464


>gi|350584686|ref|XP_003481803.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
           2 [Sus scrofa]
          Length = 578

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 153/362 (42%), Positives = 208/362 (57%), Gaps = 18/362 (4%)

Query: 14  EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
           +PP + +  G    G    L E      +  +  Y +N+  S+ IS  R I D RM ECK
Sbjct: 66  KPPADSHALGEWGKGSKLQLNEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECK 125

Query: 74  Y--WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
              +DY   LP  SV++ F+NE +S+L+RT+HS+++ +PA  L+EIILVDD S +  L  
Sbjct: 126 SKKFDY-RRLPTTSVVIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKT 184

Query: 132 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 191
           +LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE    WL PLL  
Sbjct: 185 QLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLER 243

Query: 192 IYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
           I  D   +  PVID ID+ T+EF     EP     G F+W + ++ + +P+ E  +RK  
Sbjct: 244 IAEDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKSR 300

Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
            +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG +E  PCS +
Sbjct: 301 IDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHV 360

Query: 311 GHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           GHV+    PY           P    N  R  E W DE +K +FY R P A     GDIS
Sbjct: 361 GHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDIS 410

Query: 371 EQ 372
           E+
Sbjct: 411 ER 412


>gi|410953294|ref|XP_003983307.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5 [Felis
           catus]
          Length = 443

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 143/326 (43%), Positives = 202/326 (61%), Gaps = 11/326 (3%)

Query: 47  EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
           +YG N   S  +  +R +PD R ++C    YP +LP ASV++ FHNE FS+L RT+ S++
Sbjct: 99  KYGFNTVLSKSLGSEREVPDTRNKKCFQKHYPANLPTASVVVCFHNEEFSALFRTMFSVV 158

Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
             TP  +LEEIILVDD S   DL +KL+ +++ F GK++LIRN +REGLIR+R  GA  +
Sbjct: 159 NLTPRHFLEEIILVDDMSDSDDLKEKLDHHLEVFRGKIKLIRNKKREGLIRSRMIGASRA 218

Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
            G+V+VFLD+HCEV   WL PLL  I  D K++  P+ID ID  T E    Y P    RG
Sbjct: 219 SGDVLVFLDSHCEVNKVWLEPLLHAIAKDPKMVVCPLIDVIDSVTLE----YWPSPVVRG 274

Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
            F W + +K + +   E    +  + P +SP  AGG+FA++R +F E+G YD G+ +WG 
Sbjct: 275 AFNWHLQFKWDNVFSYEMDGPEGPTLPIRSPAMAGGIFAINRHYFREIGQYDKGMNLWGA 334

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
           EN ELS +IWMCGG +  +PCSR+GH+ +   P N  + A+      +TYN  R+   W 
Sbjct: 335 ENLELSLRIWMCGGQLFVLPCSRVGHISKQRFP-NQPEFAE-----AMTYNSLRLAHVWL 388

Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
           DE +K  F+ R P    +  G+ISE+
Sbjct: 389 DE-YKEQFFLRRPGLKSVAYGNISER 413


>gi|348585731|ref|XP_003478624.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Cavia porcellus]
          Length = 937

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 154/373 (41%), Positives = 218/373 (58%), Gaps = 18/373 (4%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           V K D  L   +P      + PG+ G+   +P            E   N+  S+ I  DR
Sbjct: 420 VLKIDVTLSPRDP------KAPGQFGRPVIVPPGKEKEAQKRWKEGNFNVYLSDLIPVDR 473

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
            I D R   C        LP  S+I+ F +E +S+L+R+VHS++ R+P   ++EI+LVDD
Sbjct: 474 AIEDTRPAGCAEQLVHNQLPTTSIIMCFVDEVWSTLLRSVHSVLNRSPQHLIKEILLVDD 533

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
           FS+K  L  KL+ Y+ +F  KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  +
Sbjct: 534 FSTKDYLKDKLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 592

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-E 241
            WL PLL  +Y  RK +  PVI+ I+ +   + +V   D+  RG+F W M +    +P E
Sbjct: 593 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGVFVWPMNFGWRTIPPE 649

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
             AK R   ++  + P  AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG 
Sbjct: 650 VVAKNRIKETDVIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGE 709

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EP 359
           IE VPCSR+GH++R+  PY+F K  DR+K   +  N  RV E W DE +K  FY      
Sbjct: 710 IEIVPCSRVGHIFRNDNPYSFPK--DRLKT--VERNLVRVAEVWLDE-YKELFYGHGDHL 764

Query: 360 LAMFLDMGDISEQ 372
           +   LD G++++Q
Sbjct: 765 IDQRLDAGNLTQQ 777


>gi|296210176|ref|XP_002751862.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5
           [Callithrix jacchus]
          Length = 443

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 140/326 (42%), Positives = 204/326 (62%), Gaps = 11/326 (3%)

Query: 47  EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
           +YG N+  S  +  +R +PD R + C    YP+ LP AS+++ F+NE F++L RT+ S+ 
Sbjct: 99  KYGFNIIISRSLGIEREVPDTRNKMCLQKRYPVRLPTASIVICFYNEEFNALFRTMSSVW 158

Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
             TP   LEEIILVDD S   DL +KL+ +++ F GK+++IRN +REGLIR R  GA  +
Sbjct: 159 NLTPHHLLEEIILVDDMSEVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGASHA 218

Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
            G+V+VFLD+HCEV   WL PLL  I  D K++  PVID IDY+T E    Y+P    RG
Sbjct: 219 SGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPVIDVIDYRTLE----YKPSPVVRG 274

Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
            F+W + +K + +   E    +  ++P +SP  AGG+FA+ R +F E+G YD  +  WGG
Sbjct: 275 AFDWNLQFKWDNVFSYEMDGPEGPTKPIRSPAMAGGIFAIRRHYFNEIGQYDKDMDFWGG 334

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
           EN ELS +IWMCGG +  +PCSR+GH+ +       GK +  +    +T+NY R+   W 
Sbjct: 335 ENLELSLRIWMCGGQLFIIPCSRVGHISKK----QSGKPSTLINA--VTHNYLRLAHVWL 388

Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
           DE +K  F+ R+P   ++  G+ISE+
Sbjct: 389 DE-YKEQFFLRKPGLKYMTYGNISER 413


>gi|32698686|ref|NP_055383.1| polypeptide N-acetylgalactosaminyltransferase 5 [Homo sapiens]
 gi|51315940|sp|Q7Z7M9.1|GALT5_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
           AltName: Full=Polypeptide GalNAc transferase 5;
           Short=GalNAc-T5; Short=pp-GaNTase 5; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 5;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 5
 gi|30841528|gb|AAP34404.1| GalNAc-T5 [Homo sapiens]
 gi|119631854|gb|EAX11449.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Homo
           sapiens]
 gi|148745655|gb|AAI42677.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Homo
           sapiens]
 gi|158257740|dbj|BAF84843.1| unnamed protein product [Homo sapiens]
          Length = 940

 Score =  275 bits (702), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 151/360 (41%), Positives = 218/360 (60%), Gaps = 14/360 (3%)

Query: 16  PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
           P +P  + PG+ G+   +P       +    E   N+  S+ I  DR I D R   C   
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQ 489

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
               +LP  SVI+ F +E +S+L+R+VHS+I R+P   ++EI+LVDDFS+K  L   L+ 
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVINRSPPHLIKEILLVDDFSTKDYLKDNLDK 549

Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           Y+ +F  KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y  
Sbjct: 550 YMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
           RK +  PVI+ I+ +   + +V   D+  RGIF W M +    +P +  AK R   ++  
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDTI 665

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           + P  AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 RCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
           R+  PY+F K  DR+K   +  N  RV E W DE +K  FY      +   LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 780


>gi|195492881|ref|XP_002094181.1| GE20340 [Drosophila yakuba]
 gi|194180282|gb|EDW93893.1| GE20340 [Drosophila yakuba]
          Length = 666

 Score =  275 bits (702), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 161/352 (45%), Positives = 214/352 (60%), Gaps = 13/352 (3%)

Query: 23  GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           G GE GKA  L  E+ R        E G N   S+ IS +R++PD+R   C+  +Y   L
Sbjct: 142 GLGEKGKAATLDDESQRDLEKQKSLENGFNALLSDSISVNRSLPDIRHPLCRKKEYVAKL 201

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI++F+NE  S LMR+VHS+I R+P + ++EIILVDD S +  L ++LE YI    
Sbjct: 202 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVDDHSDREYLGKELETYIAEHF 261

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
             VR++R   R GLI  R+ GA+ +  EV++FLD+H E   NWLPPLL PI  +++    
Sbjct: 262 KWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 321

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
           P ID ID+  + +R+    D   RG F+W   YK   L E +    K+ ++P+KSP  AG
Sbjct: 322 PFIDVIDHTNFNYRA---QDEGARGAFDWEFFYKRLPLLEEDL---KHPADPFKSPVMAG 375

Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
           GLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG +   PCSRIGH+YR   P N
Sbjct: 376 GLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRIGHIYRG--PRN 433

Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
                   KG  +  NYKRV E W DE +K Y Y+  + L   +D GD++EQ
Sbjct: 434 HQ--PSPRKGDYLHKNYKRVAEVWMDE-YKNYLYSHGDGLYESVDPGDLTEQ 482


>gi|16769916|gb|AAL29177.1| SD10722p [Drosophila melanogaster]
          Length = 666

 Score =  275 bits (702), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 163/353 (46%), Positives = 215/353 (60%), Gaps = 15/353 (4%)

Query: 23  GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           G GEGGKA  L  E+ R        E G N   S+ IS +R++PD+R   C+  +Y   L
Sbjct: 142 GLGEGGKASTLDDESQRDLEKRMSLENGFNALLSDSISVNRSVPDIRHPLCRKKEYVAKL 201

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI++F+NE  S LMR+VHS+I R+P + ++EIILVDD S +  L ++LE YI    
Sbjct: 202 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVDDHSDREYLGKELETYIAEHF 261

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
             VR++R   R GLI  R+ GA+ +  EV++FLD+H E   NWLPPLL PI  +++    
Sbjct: 262 KWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 321

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTHA 260
           P ID ID+  + +R+    D   RG F+W   YK    LPE      K+ ++P+KSP  A
Sbjct: 322 PFIDVIDHTNFHYRA---QDEGARGAFDWEFFYKRLPLLPE----DLKHPADPFKSPIMA 374

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG +   PCSRIGH+YR   P 
Sbjct: 375 GGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRIGHIYRG--PR 432

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
           N        KG  +  NYKRV E W DE +K Y Y+  + L   +D GD++EQ
Sbjct: 433 NHQ--PSPRKGDYLHKNYKRVAEVWMDE-YKNYLYSHGDGLYESVDPGDLTEQ 482


>gi|335775065|gb|AEH58447.1| polypeptide N-acetylgalactosaminyltransferase 1-like protein [Equus
           caballus]
          Length = 453

 Score =  275 bits (702), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 139/297 (46%), Positives = 192/297 (64%), Gaps = 9/297 (3%)

Query: 77  YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
           YP +LP  SV++VFHNE +S+L+RTVHS+I R+P   LEEI+LVDD S +  L + LE Y
Sbjct: 5   YPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESY 64

Query: 137 IQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 196
           +++    V +IR  +R GLIR R +GA  S+G+VI FLDAHCE  + WL PLLA I  DR
Sbjct: 65  VKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARIKHDR 124

Query: 197 KIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYK 255
           K +  P+ID I   T+E+ +    D  Y G F W + ++   +P+RE  +RK + + P +
Sbjct: 125 KTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVR 181

Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
           +PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +GHV+R
Sbjct: 182 TPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFR 241

Query: 316 SFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
              PY F        G +I  N +R+ E W DE  K +FY   P    +D GDIS +
Sbjct: 242 KATPYTFPGGT----GQIINKNNRRLAEVWMDE-FKNFFYIISPGVTKVDYGDISSR 293


>gi|157117587|ref|XP_001658839.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108875983|gb|EAT40208.1| AAEL008037-PA [Aedes aegypti]
          Length = 662

 Score =  275 bits (702), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 147/332 (44%), Positives = 203/332 (61%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  N+  SN I   R +PD R + C    Y   LP AS+I+ F+NE   +L+R+
Sbjct: 157 DVGYRKHAFNVLVSNKIGPFRGVPDTRHKLCHEQSYDKVLPSASIIMCFYNEHLETLVRS 216

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 160
           V SII+RTP+  L EIILVDD S   DL   LE  +    N KVRLIRN EREGL+R+R 
Sbjct: 217 VTSIIRRTPSYLLHEIILVDDCSDLDDLRDNLEHELNALKNSKVRLIRNAEREGLMRSRV 276

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA+ + G+V++FLD+H EV ++W+ PLL  I +++ I+ +PVID I+  T+    +Y  
Sbjct: 277 YGARNATGDVLIFLDSHIEVNVDWVEPLLQRIKTNKTILAMPVIDIINSDTF----IYSS 332

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + LP+    K      P++SPT AGGLFA+DR +F +LG YD G
Sbjct: 333 SPLVRGGFNWGLHFKWDNLPKGTLAKESDFVGPFQSPTMAGGLFAVDRQYFKDLGEYDMG 392

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + VWGGEN E+SF+ W CGGSIE VPCSRIGHV+R   PY     +D      +  N  R
Sbjct: 393 MDVWGGENLEISFRTWQCGGSIELVPCSRIGHVFRKRRPYGSPDGSD-----TMIRNSLR 447

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +   W D+  K YF   +P A  +D GD++++
Sbjct: 448 LSRVWMDDYIK-YFLENQPQAKKVDPGDLTDR 478


>gi|198474477|ref|XP_001356707.2| GA16586 [Drosophila pseudoobscura pseudoobscura]
 gi|198138408|gb|EAL33772.2| GA16586 [Drosophila pseudoobscura pseudoobscura]
          Length = 646

 Score =  274 bits (701), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 147/332 (44%), Positives = 207/332 (62%), Gaps = 24/332 (7%)

Query: 49  GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
           G N   S+ IS +R++PD+R+E+CK   Y   LP  SVI +F+NE FS+L+R+++S+I R
Sbjct: 149 GFNGLLSDMISVNRSVPDVRLEQCKTRKYLSKLPNISVIFIFYNEHFSALLRSIYSVINR 208

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESR 167
           TP + L++I+LVDD S    L Q+L+DY+   F   V ++RN ER+GLI  R  GAK + 
Sbjct: 209 TPVELLKQIVLVDDGSDWDTLKQQLDDYVSLHFPHVVTVVRNVERKGLIGARLEGAKVAT 268

Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
           GEV+VF D+H EV  NWLPPLL PI  + KI T P++D ID+  + +   Y+     RG 
Sbjct: 269 GEVLVFFDSHIEVNYNWLPPLLEPIAINPKISTCPIVDIIDHSNFAYNGGYQ--EGSRGG 326

Query: 228 FEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
           F+W   YK+   LPE    K    S+P+++P   GGLFA+   FF +LGGYD  L +WGG
Sbjct: 327 FDWRFFYKQLPVLPEDSVDK----SQPFRNPVMMGGLFAIRTDFFWDLGGYDDELDIWGG 382

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM-----PYNFGKLADRVKGPLITYNYKRV 341
           E +ELSFKIWMCGG +  +PCSR+ H++R  M     P N+           +  N+KRV
Sbjct: 383 EQYELSFKIWMCGGMLLDIPCSRVAHIFRGPMDPRPNPRNYN---------FVGRNHKRV 433

Query: 342 IETWFDEKHKAYFYTREPLAM-FLDMGDISEQ 372
            E W DE +K + Y+R+P     +D GD++ Q
Sbjct: 434 AEVWMDE-YKEHVYSRDPQTYNNIDAGDLTRQ 464


>gi|417411769|gb|JAA52311.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Desmodus rotundus]
          Length = 582

 Score =  274 bits (701), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 153/352 (43%), Positives = 206/352 (58%), Gaps = 18/352 (5%)

Query: 25  GEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL-DL 81
           GE GKA    L EA     +  +  Y +N+  S+ IS  R I D RM ECK   +    L
Sbjct: 79  GEWGKASRLQLNEAELKQQEELIERYAINIYLSDKISLHRHIEDKRMYECKSKTFNYRQL 138

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EIILVDD S +  L  +LE Y+   +
Sbjct: 139 PTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYVSNLD 198

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
            +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE    WL PLL  I  D  ++  
Sbjct: 199 -RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERISEDETVIIC 257

Query: 202 PVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
           PVID ID+ T+EF     EP     G F+W + ++ + +P+ E  +RK   +P +SPT A
Sbjct: 258 PVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKSRIDPIRSPTMA 314

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG +E  PCS +GHV+    PY
Sbjct: 315 GGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPKRAPY 374

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                      P    N  R  E W DE +K +FY R P A     GDISE+
Sbjct: 375 ---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDISER 416


>gi|426337441|ref|XP_004032714.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Gorilla
           gorilla gorilla]
          Length = 940

 Score =  274 bits (701), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 150/360 (41%), Positives = 219/360 (60%), Gaps = 14/360 (3%)

Query: 16  PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
           P +P  + PG+ G+   +P+      +    E   N+  S+ I  DR I D R   C   
Sbjct: 432 PRDP--KAPGQFGRPVVVPQGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQ 489

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
               +LP  SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LVDDFS+K  L   L+ 
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDK 549

Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           Y+ +F  KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y  
Sbjct: 550 YMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
           RK +  PVI+ I+ +   + +V   D+  RGIF W M +    +P +  AK R   ++  
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDTI 665

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           + P  AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 RCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
           R+  PY+F K  DR+K   +  N  RV E W DE +K  FY      +   LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 780


>gi|34042986|gb|AAQ56703.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           [Drosophila melanogaster]
          Length = 666

 Score =  274 bits (701), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 163/353 (46%), Positives = 215/353 (60%), Gaps = 15/353 (4%)

Query: 23  GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           G GEGGKA  L  E+ R        E G N   S+ IS +R++PD+R   C+  +Y   L
Sbjct: 142 GLGEGGKASTLDDESQRDLEKRMSLENGFNALLSDSISVNRSVPDIRHPLCRKKEYVAKL 201

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI++F+NE  S LMR+VHS+I R+P + ++EIILVDD S +  L ++LE YI    
Sbjct: 202 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVDDHSDREYLGKELETYIAEHF 261

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
             VR++R   R GLI  R+ GA+ +  EV++FLD+H E   NWLPPLL PI  +++    
Sbjct: 262 KWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 321

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTHA 260
           P ID ID+  + +R+    D   RG F+W   YK    LPE      K+ ++P+KSP  A
Sbjct: 322 PFIDVIDHTNFHYRAQ---DEGARGAFDWEFFYKRLPLLPE----DLKHPADPFKSPIMA 374

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG +   PCSRIGH+YR   P 
Sbjct: 375 GGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRIGHIYRG--PR 432

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
           N        KG  +  NYKRV E W DE +K Y Y+  + L   +D GD++EQ
Sbjct: 433 NHQ--PSPRKGDYLHKNYKRVAEVWMDE-YKNYLYSHGDGLYESVDPGDLTEQ 482


>gi|334348070|ref|XP_001368069.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Monodelphis domestica]
          Length = 708

 Score =  274 bits (701), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 156/364 (42%), Positives = 213/364 (58%), Gaps = 21/364 (5%)

Query: 14  EPPLEPYKEGPGEGGKAYHLP---EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
           +PP +P     GE G+A HL    +A +   +    +Y +N+  S+ IS  R I D RM 
Sbjct: 195 KPPPDP--GALGEWGEASHLQLQGDAEKQQAEELTEKYAINIYLSDRISLHRHIRDDRMY 252

Query: 71  EC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
           EC  K +DY   LP  SVI+ F+NE +S+L+RTVHS+++  PA  L+EIILVDD S K  
Sbjct: 253 ECRLKSFDY-RRLPTTSVIIAFYNEAWSTLLRTVHSVLETAPAVLLKEIILVDDLSDKVY 311

Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
           L  +LE YI     +VRLIR  +REGL+R R  GA  + GEV+ FLD HCE    WL PL
Sbjct: 312 LKAQLETYISSLQ-RVRLIRTKKREGLVRARLIGATFATGEVLTFLDCHCECNQGWLEPL 370

Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
           L  I  D  ++  PVID ID+ T++F    +      G F+W + ++   +PE E ++ +
Sbjct: 371 LERIGQDESVIICPVIDTIDWNTFDF--YMQEGEPVIGGFDWHLTFQWQPVPEHERRRWQ 428

Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
             ++P KSP  AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG++E  PCS
Sbjct: 429 SRTDPIKSPVMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGGALEIHPCS 488

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
            +GHV+    PY           P    N  R  E W D+ +K +FY R PLA     GD
Sbjct: 489 HVGHVFPKRAPY---------ARPNFRQNTVRAAEVWMDD-YKEHFYNRNPLARKESYGD 538

Query: 369 ISEQ 372
           +SE+
Sbjct: 539 VSER 542


>gi|443704264|gb|ELU01402.1| hypothetical protein CAPTEDRAFT_127533 [Capitella teleta]
          Length = 390

 Score =  274 bits (701), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 155/354 (43%), Positives = 206/354 (58%), Gaps = 13/354 (3%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC--KYWDYPL 79
            GPGE G++        AA          N   S+ +SF+RTIPD R   C  K +DY  
Sbjct: 6   NGPGEHGRSVPTSPKDEAAVKEGFRLASFNQHASDLVSFERTIPDSRPPRCRDKSYDYS- 64

Query: 80  DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
            LPK SVI+ F  E +S+L+R+VHS++ RTP + LEEIILVDDFS +  L  KL++Y+ R
Sbjct: 65  SLPKMSVIICFTEESWSTLLRSVHSVLNRTPPELLEEIILVDDFSQRGHLHAKLDNYLTR 124

Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
              KV LIR   R+GLIR R R  + +RG V+ FLD+H E  + W  PLL  I  +R+++
Sbjct: 125 L-PKVTLIRFPSRQGLIRARLRAIEIARGPVLTFLDSHVECNVGWAEPLLQRISHNRRVI 183

Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPT 258
             PVID I  + + +  +     + RG F W ML+K   +P  E  +   + + P ++PT
Sbjct: 184 VAPVIDAISSRDFSYIPI---SANQRGGFNWAMLFKWMPVPNYEKSRTGGDPTAPVRTPT 240

Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
            AGGLFA+ + FF  LG YDPGL +WG EN ELSFK WMCGGS+E +PCSR+GHVYRS  
Sbjct: 241 IAGGLFAIHQRFFRSLGFYDPGLDIWGSENLELSFKAWMCGGSMEMIPCSRVGHVYRSTQ 300

Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           PY+F      VK  +   N  RV   W D  +   FY  +P       GDIS +
Sbjct: 301 PYSFP--GGNVK--VFMRNNLRVANVWMD-GYVNLFYLMKPELRNEPFGDISSR 349


>gi|24656262|ref|NP_647749.2| polypeptide GalNAc transferase 6, isoform A [Drosophila
           melanogaster]
 gi|24656265|ref|NP_728779.1| polypeptide GalNAc transferase 6, isoform B [Drosophila
           melanogaster]
 gi|442629817|ref|NP_001261342.1| polypeptide GalNAc transferase 6, isoform C [Drosophila
           melanogaster]
 gi|51315873|sp|Q6WV16.2|GALT6_DROME RecName: Full=N-acetylgalactosaminyltransferase 6; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 6;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 6; Short=pp-GaNTase 6
 gi|7292281|gb|AAF47689.1| polypeptide GalNAc transferase 6, isoform A [Drosophila
           melanogaster]
 gi|7292282|gb|AAF47690.1| polypeptide GalNAc transferase 6, isoform B [Drosophila
           melanogaster]
 gi|440215219|gb|AGB94037.1| polypeptide GalNAc transferase 6, isoform C [Drosophila
           melanogaster]
          Length = 666

 Score =  274 bits (701), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 163/353 (46%), Positives = 215/353 (60%), Gaps = 15/353 (4%)

Query: 23  GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           G GEGGKA  L  E+ R        E G N   S+ IS +R++PD+R   C+  +Y   L
Sbjct: 142 GLGEGGKASTLDDESQRDLEKRMSLENGFNALLSDSISVNRSVPDIRHPLCRKKEYVAKL 201

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI++F+NE  S LMR+VHS+I R+P + ++EIILVDD S +  L ++LE YI    
Sbjct: 202 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVDDHSDREYLGKELETYIAEHF 261

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
             VR++R   R GLI  R+ GA+ +  EV++FLD+H E   NWLPPLL PI  +++    
Sbjct: 262 KWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 321

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTHA 260
           P ID ID+  + +R+    D   RG F+W   YK    LPE      K+ ++P+KSP  A
Sbjct: 322 PFIDVIDHTNFHYRA---QDEGARGAFDWEFFYKRLPLLPE----DLKHPADPFKSPIMA 374

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG +   PCSRIGH+YR   P 
Sbjct: 375 GGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRIGHIYRG--PR 432

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
           N        KG  +  NYKRV E W DE +K Y Y+  + L   +D GD++EQ
Sbjct: 433 NHQ--PSPRKGDYLHKNYKRVAEVWMDE-YKNYLYSHGDGLYESVDPGDLTEQ 482


>gi|6525067|gb|AAF15313.1|AF154107_1 UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 5 [Homo
           sapiens]
          Length = 610

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 153/373 (41%), Positives = 221/373 (59%), Gaps = 18/373 (4%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           V + D  L   +P      + PG+ G+   +P       +    E   N+  S+ I  DR
Sbjct: 93  VLRIDVTLSPRDP------KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDR 146

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
            I D R   C       +LP  SVI+ F +E +S+L+R+VHS+I R+P   ++EI+LVDD
Sbjct: 147 AIEDTRPAGCAEQLVXNNLPTTSVIMCFVDEVWSTLLRSVHSVINRSPPHLIKEILLVDD 206

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
           FS+K  L   L+ Y+ +F  KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  +
Sbjct: 207 FSTKDYLKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 265

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-E 241
            WL PLL  +Y  RK +  PVI+ I+ +   + +V   D+  RGIF W M +    +P +
Sbjct: 266 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPD 322

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
             AK R   ++  + P  AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG 
Sbjct: 323 VIAKNRIKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGE 382

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EP 359
           IE +PCSR+GH++R+  PY+F K  DR+K   +  N  RV E W DE +K  FY      
Sbjct: 383 IEIIPCSRVGHIFRNDNPYSFPK--DRMKT--VERNLVRVAEVWLDE-YKELFYGHGDHL 437

Query: 360 LAMFLDMGDISEQ 372
           +   LD+G++++Q
Sbjct: 438 IDQGLDVGNLTQQ 450


>gi|301780762|ref|XP_002925798.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Ailuropoda melanoleuca]
          Length = 578

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 156/375 (41%), Positives = 212/375 (56%), Gaps = 26/375 (6%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
           RP++K        +PP + +  G         L E      +  +  Y +N+  S+ IS 
Sbjct: 61  RPLYK--------KPPADSHALGEWGKASKLQLSEGELKQQEELIERYAINIYLSDRISL 112

Query: 61  DRTIPDLRMEECKY--WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
            R I D RM ECK   +DY   LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EII
Sbjct: 113 HRHIEDKRMYECKSRKFDY-RRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEII 171

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 178
           LVDD S +  L  +LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FLD HC
Sbjct: 172 LVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHC 230

Query: 179 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKEN 237
           E    WL PLL  I  D   +  PVID ID+ T+EF     EP     G F+W + ++ +
Sbjct: 231 ECNSGWLEPLLERISKDETTVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWH 287

Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
            +P+ E  +RK   +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W 
Sbjct: 288 SVPKHERDRRKSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQ 347

Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           CGG +E  PCS +GHV+    PY           P    N  R  E W DE +K +FY R
Sbjct: 348 CGGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNR 397

Query: 358 EPLAMFLDMGDISEQ 372
            P A     GDISE+
Sbjct: 398 NPPARKEAYGDISER 412


>gi|194759472|ref|XP_001961971.1| GF15238 [Drosophila ananassae]
 gi|190615668|gb|EDV31192.1| GF15238 [Drosophila ananassae]
          Length = 663

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 144/326 (44%), Positives = 203/326 (62%), Gaps = 11/326 (3%)

Query: 49  GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
           G N   S+ IS +R++PD+R+EECK   Y   LP  SVI +F NE  S+L+R++HS++ R
Sbjct: 164 GFNGLLSDRISVNRSVPDVRLEECKTRKYLAKLPNVSVIFIFFNEYLSTLLRSIHSVVNR 223

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESR 167
           TP + L++I+LVDD S    L  +L+DY+   F G V ++RN ER GLI  R  GAK + 
Sbjct: 224 TPPELLKQIVLVDDGSDWESLKHQLDDYVSIHFPGLVDIVRNPERRGLIGARIAGAKVAV 283

Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
           G+V+VF D+H E   NWLPPLL PI  + KI T P+ID ID+ T+ +   ++     RG 
Sbjct: 284 GDVMVFFDSHIEANYNWLPPLLEPIAINNKICTCPMIDSIDHATFSYHGGHQ--EGARGG 341

Query: 228 FEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
           F+W M YK+  +   ++  +   S P++SP   GGLFA++  FF +LGGYD  L +WGGE
Sbjct: 342 FDWKMYYKQLPVLAEDSIDK---SLPFRSPVMMGGLFAINTDFFWDLGGYDDELDIWGGE 398

Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
            +ELSFKIWMCGG +  VPCS + H++R  M     + + R     +  N+KRV E W D
Sbjct: 399 QYELSFKIWMCGGMLLDVPCSHVAHIFRGPMD---PRPSPRENTNFVARNHKRVAEVWMD 455

Query: 348 EKHKAYFYTREPLAM-FLDMGDISEQ 372
           E +K Y Y R+P     +D GD++ Q
Sbjct: 456 E-YKKYLYERDPETYEKIDAGDLTRQ 480


>gi|402887191|ref|XP_003906986.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Papio
           anubis]
          Length = 578

 Score =  274 bits (700), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 158/376 (42%), Positives = 214/376 (56%), Gaps = 28/376 (7%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHI 58
           RP++K        +PP + +   PGE GKA    L E      +  +  Y +N+  S+ I
Sbjct: 61  RPLYK--------KPPADSH--APGEWGKASKLQLNEGELKQQEELIERYAINIYLSDRI 110

Query: 59  SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           S  R I D RM ECK   +    LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EI
Sbjct: 111 SLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEI 170

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
           ILVDD S +  L  +LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FLD H
Sbjct: 171 ILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCH 229

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKE 236
           CE    WL PLL  I  D   +  PVID ID+ T+EF     EP     G F+W + ++ 
Sbjct: 230 CECNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQW 286

Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
           + +P+ E  +R    +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W
Sbjct: 287 HSVPKHERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVW 346

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
            CGG +E  PCS +GHV+    PY           P    N  R  E W DE +K +FY 
Sbjct: 347 QCGGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYN 396

Query: 357 REPLAMFLDMGDISEQ 372
           R P A     GDISE+
Sbjct: 397 RNPPARKEAYGDISER 412


>gi|350402581|ref|XP_003486533.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 3 [Bombus impatiens]
          Length = 607

 Score =  274 bits (700), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 144/350 (41%), Positives = 205/350 (58%), Gaps = 8/350 (2%)

Query: 24  PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           PGE G A H+     A           N+  S+ IS +R++ D+R+E CK   Y   LP 
Sbjct: 104 PGEVGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYNKYLPD 163

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
            S+++VFHNE +S+L+RTV S+I R+P   L+EIILVDD S +  L Q LEDY++     
Sbjct: 164 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVP 223

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
             + R  +R GLIR R  GAK   G+VI FLDAHCE    WL PLL+ I  DR  +  P+
Sbjct: 224 TYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSRIAEDRTTVVCPI 283

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGG 262
           ID I   T+E+  +   D  + G F W + ++   + +RE  +R  + + P ++PT AGG
Sbjct: 284 IDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDRTAPLRTPTMAGG 340

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           LF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E  PCS +GHV+R   PY F
Sbjct: 341 LFSIDKDYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHVFRDKSPYTF 400

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                +V    + +N  RV E W DE    Y+      A  + +GD+SE+
Sbjct: 401 PGGVSKV----VLHNAARVAEVWMDEWRDFYYAMNPEGARNVAVGDVSER 446


>gi|443715013|gb|ELU07165.1| hypothetical protein CAPTEDRAFT_143879 [Capitella teleta]
          Length = 390

 Score =  274 bits (700), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 155/354 (43%), Positives = 206/354 (58%), Gaps = 13/354 (3%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC--KYWDYPL 79
            GPGE G++        AA          N   S+ +SF+RTIPD R   C  K +DY  
Sbjct: 6   NGPGEHGRSVPTSPKDEAAVKEGFRLASFNQHASDLVSFERTIPDSRPPRCRDKSFDYS- 64

Query: 80  DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
            LPK SVI+ F  E +S+L+R+VHS++ RTP + LEEIILVDDFS +  L  KL++Y+ R
Sbjct: 65  SLPKMSVIICFTEESWSTLLRSVHSVLNRTPPELLEEIILVDDFSQRGHLHAKLDNYLTR 124

Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
              KV LIR   R+GLIR R R  + +RG V+ FLD+H E  + W  PLL  I  +R+++
Sbjct: 125 L-PKVTLIRLPSRQGLIRARLRAIEIARGPVLTFLDSHVECNVGWAEPLLQRISHNRRVI 183

Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPT 258
             PVID I  + + +  +     + RG F W ML+K   +P  E  +   + + P ++PT
Sbjct: 184 VAPVIDAISSRDFSYIPI---SANQRGGFNWAMLFKWMPVPNYEKSRTGGDPTAPVRTPT 240

Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
            AGGLFA+ + FF  LG YDPGL +WG EN ELSFK WMCGGS+E +PCSR+GHVYRS  
Sbjct: 241 IAGGLFAIHQRFFRSLGFYDPGLDIWGSENLELSFKAWMCGGSMEMIPCSRVGHVYRSTQ 300

Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           PY+F      VK  +   N  RV   W D  +   FY  +P       GDIS +
Sbjct: 301 PYSFP--GGNVK--VFMRNNLRVANVWMD-GYVNLFYLMKPELRNEPFGDISSR 349


>gi|380805795|gb|AFE74773.1| polypeptide N-acetylgalactosaminyltransferase-like 6, partial
           [Macaca mulatta]
          Length = 336

 Score =  273 bits (699), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 141/276 (51%), Positives = 181/276 (65%), Gaps = 12/276 (4%)

Query: 97  SLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLI 156
           SL+RT+HSII RTP   + EIILVDDFS +  L  KLE+Y+ RF+ KVR++R  +REGLI
Sbjct: 1   SLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYMARFS-KVRIVRTKKREGLI 59

Query: 157 RTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRS 216
           RTR  GA  +RGEV+ FLD+HCEV +NWLPPLL  I  + K +  P+ID ID+  + + +
Sbjct: 60  RTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIVCPMIDVIDHNHFGYEA 119

Query: 217 VYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGG 276
             +     RG F+W M YK   +P     +R   S+P++SP  AGGLFA+DR +F ELGG
Sbjct: 120 --QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFESPVMAGGLFAVDRKWFWELGG 175

Query: 277 YDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY 336
           YDPGL +WGGE +E+SFK+WMCGG +  VPCSR+GH+YR ++PY          G  +  
Sbjct: 176 YDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYRKYVPYKVP------SGTSLAR 229

Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           N KRV ETW DE    Y Y R P    L  GDIS Q
Sbjct: 230 NLKRVAETWMDE-FAEYIYQRRPEYRHLSTGDISAQ 264


>gi|195472767|ref|XP_002088670.1| GE18697 [Drosophila yakuba]
 gi|194174771|gb|EDW88382.1| GE18697 [Drosophila yakuba]
          Length = 675

 Score =  273 bits (699), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 138/336 (41%), Positives = 204/336 (60%), Gaps = 10/336 (2%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           L P ++  K  PGE GK   +P   +        E   N+  S+ IS +R++ D+R E C
Sbjct: 118 LAPSVQEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 177

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           +   Y   LP  S+++VFHNE +++L+RTV S+I R+P   L+EIILVDD S +  L ++
Sbjct: 178 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE+Y+ +   K  ++R  +R GLIR R  GA+   GEVI FLDAHCE    WL PLLA I
Sbjct: 238 LEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 297

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 251
             +R+ +  P+ID I  +T+E+  +   D  + G F W + ++   +P RE  +R  + +
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDRT 354

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
            P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +G
Sbjct: 355 APLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVG 414

Query: 312 HVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWF 346
           HV+R   PY F G +A      ++ +N  RV E W 
Sbjct: 415 HVFRDKSPYTFPGGVA-----KIVLHNAARVAEVWM 445


>gi|6688167|emb|CAB65104.1| GalNAc-T5 [Homo sapiens]
          Length = 668

 Score =  273 bits (699), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 149/354 (42%), Positives = 215/354 (60%), Gaps = 12/354 (3%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           + PG+ G+   +P       +    E   N+  S+ I  DR I D R   C       +L
Sbjct: 164 KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNNL 223

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI+ F +E +S+L+R+VHS+I R+P   ++EI+LVDDFS+K  L   L+ Y+ +F 
Sbjct: 224 PTTSVIMCFVDEVWSTLLRSVHSVINRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQF- 282

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
            KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y  RK +  
Sbjct: 283 PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVAC 342

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPYKSPTHA 260
           PVI+ I+ +   + +V   D+  RGIF W M +    +P +  AK R   ++  + P  A
Sbjct: 343 PVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDTIRCPVMA 399

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++R+  PY
Sbjct: 400 GGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIFRNDNPY 459

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
           +F K  DR+K   +  N  RV E W DE +K  FY      +   LD+G++++Q
Sbjct: 460 SFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 508


>gi|195057673|ref|XP_001995302.1| GH22705 [Drosophila grimshawi]
 gi|193899508|gb|EDV98374.1| GH22705 [Drosophila grimshawi]
          Length = 693

 Score =  273 bits (699), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 155/363 (42%), Positives = 208/363 (57%), Gaps = 28/363 (7%)

Query: 22  EGPGEGGKAYHLPEAY----RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK-YWD 76
           +  GE GK   LP+      + A D        N   S+ IS  R++PD R   CK    
Sbjct: 153 DNAGEMGKPVVLPKEMAPDMKKAVDEGWTNNAFNQYVSDLISVHRSLPDPRDAWCKDSAR 212

Query: 77  YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
           Y  +LPK  VI+ FHNE +S L+RTVHS++ R+P++ + EIILVDD+S    L +KLEDY
Sbjct: 213 YLSNLPKTDVIICFHNEAWSVLLRTVHSVLDRSPSELIGEIILVDDYSDMTHLKKKLEDY 272

Query: 137 IQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 196
              +   V+++R  +REGLIR R  GAK ++  VI +LD+HCE    WL PLL  I  + 
Sbjct: 273 FADY-PMVKIVRGPQREGLIRARLLGAKYAKSPVITYLDSHCECAEGWLEPLLDRIARNS 331

Query: 197 KIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPEREAKKRKY 249
             +  PVID ID  T EF        HYR       G F+W + +  + +PERE K+   
Sbjct: 332 TTVVCPVIDVIDDATLEF--------HYRDSSGVNVGGFDWNLQFSWHSVPEREKKRHNS 383

Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
            SEP  SPT AGGLF++DR FF  LG YD G  +WGGEN ELSFK WMCGG++E VPCS 
Sbjct: 384 TSEPVYSPTMAGGLFSIDREFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSH 443

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +GH++R   PY +     R    ++  N  R+ E W D+  K Y+Y R  +    D GD+
Sbjct: 444 VGHIFRKRSPYKW-----RTGVNVLKKNSVRLAEVWMDDYSK-YYYQRIGMDKG-DFGDV 496

Query: 370 SEQ 372
           S++
Sbjct: 497 SDR 499


>gi|114581297|ref|XP_525944.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
           2 [Pan troglodytes]
 gi|410296312|gb|JAA26756.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Pan
           troglodytes]
 gi|410333399|gb|JAA35646.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Pan
           troglodytes]
          Length = 940

 Score =  273 bits (699), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 150/360 (41%), Positives = 218/360 (60%), Gaps = 14/360 (3%)

Query: 16  PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
           P +P  + PG+ G+   +P       +    E   N+  S+ I  DR I D R   C   
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQ 489

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
               +LP  SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LVDDFS+K  L   L+ 
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDK 549

Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           Y+ +F  KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y  
Sbjct: 550 YMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
           RK +  PVI+ I+ +   + +V   D+  RGIF W M +    +P +  AK R   ++  
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDTI 665

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           + P  AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 RCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
           R+  PY+F K  DR+K   +  N  RV E W DE +K  FY      +   LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 780


>gi|51316006|sp|Q8IA42.2|GALT4_DROME RecName: Full=N-acetylgalactosaminyltransferase 4; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 4;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 4; Short=pp-GaNTase 4
 gi|34042946|gb|AAQ56701.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           [Drosophila melanogaster]
          Length = 659

 Score =  273 bits (698), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 152/328 (46%), Positives = 207/328 (63%), Gaps = 16/328 (4%)

Query: 49  GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
           G N   S+ IS +R++PDLR+E CK   Y   LP  SVI +F NE F++L+R+++S+I R
Sbjct: 160 GFNGLISDRISVNRSVPDLRLEACKTRKYLAKLPNISVIFIFFNEHFNTLLRSIYSVINR 219

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESR 167
           TP + L++I+LVDD S    L Q L+DY+Q+ F   V ++RN ER+GLI  R  GAK + 
Sbjct: 220 TPPELLKQIVLVDDGSEWDVLKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAV 279

Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
           G+V+VF D+H EV  NWLPPL+ PI  + KI T P++D I ++ + + S  +     RG 
Sbjct: 280 GQVMVFFDSHIEVNYNWLPPLIEPIAINPKISTCPMVDTISHEDFSYFSGNK--DGARGG 337

Query: 228 FEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
           F+W MLYK+   LPE    K    S PY+SP   GGLFA++  FF +LGGYD  L +WGG
Sbjct: 338 FDWKMLYKQLPVLPEDALDK----SMPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGG 393

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG-PLITYNYKRVIETW 345
           E +ELSFKIWMCGG +  VPCSR+ H++R  M     K     +G   +  N+KRV E W
Sbjct: 394 EQYELSFKIWMCGGMLLDVPCSRVAHIFRGPM-----KPRGNPRGHNFVAKNHKRVAEVW 448

Query: 346 FDEKHKAYFYTREPLAM-FLDMGDISEQ 372
            DE +K Y Y R+P     LD GD++ Q
Sbjct: 449 MDE-YKQYVYKRDPKTYDNLDAGDLTRQ 475


>gi|221330664|ref|NP_001137779.1| polypeptide GalNAc transferase 4, isoform B [Drosophila
           melanogaster]
 gi|442625712|ref|NP_722910.2| polypeptide GalNAc transferase 4, isoform C [Drosophila
           melanogaster]
 gi|25987157|gb|AAN75751.1|AF324752_1 N-acetylgalactosaminyltransferase [Drosophila melanogaster]
 gi|220901927|gb|ACL82986.1| polypeptide GalNAc transferase 4, isoform B [Drosophila
           melanogaster]
 gi|440213268|gb|AAN10370.2| polypeptide GalNAc transferase 4, isoform C [Drosophila
           melanogaster]
          Length = 644

 Score =  273 bits (698), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 152/328 (46%), Positives = 207/328 (63%), Gaps = 16/328 (4%)

Query: 49  GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
           G N   S+ IS +R++PDLR+E CK   Y   LP  SVI +F NE F++L+R+++S+I R
Sbjct: 145 GFNGLISDRISVNRSVPDLRLEACKTRKYLAKLPNISVIFIFFNEHFNTLLRSIYSVINR 204

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESR 167
           TP + L++I+LVDD S    L Q L+DY+Q+ F   V ++RN ER+GLI  R  GAK + 
Sbjct: 205 TPPELLKQIVLVDDGSEWDVLKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAV 264

Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
           G+V+VF D+H EV  NWLPPL+ PI  + KI T P++D I ++ + + S  +     RG 
Sbjct: 265 GQVMVFFDSHIEVNYNWLPPLIEPIAINPKISTCPMVDTISHEDFSYFSGNK--DGARGG 322

Query: 228 FEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
           F+W MLYK+   LPE    K    S PY+SP   GGLFA++  FF +LGGYD  L +WGG
Sbjct: 323 FDWKMLYKQLPVLPEDALDK----SMPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGG 378

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG-PLITYNYKRVIETW 345
           E +ELSFKIWMCGG +  VPCSR+ H++R  M     K     +G   +  N+KRV E W
Sbjct: 379 EQYELSFKIWMCGGMLLDVPCSRVAHIFRGPM-----KPRGNPRGHNFVAKNHKRVAEVW 433

Query: 346 FDEKHKAYFYTREPLAM-FLDMGDISEQ 372
            DE +K Y Y R+P     LD GD++ Q
Sbjct: 434 MDE-YKQYVYKRDPKTYDNLDAGDLTRQ 460


>gi|281341921|gb|EFB17505.1| hypothetical protein PANDA_013078 [Ailuropoda melanoleuca]
          Length = 936

 Score =  273 bits (698), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 154/373 (41%), Positives = 220/373 (58%), Gaps = 18/373 (4%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           V K D  L   +P      + PG+ G+   +P       +    E   N+  S+ I  DR
Sbjct: 420 VLKIDVTLSPRDP------KAPGQFGRPVVVPRGKEKEAERRWKEGNFNVYLSDLIPVDR 473

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
            I D R   C       +LP  SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LVDD
Sbjct: 474 AIEDTRPAGCAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDD 533

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
           FS+K  L   L+ Y+ +F  KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  +
Sbjct: 534 FSTKDYLKGNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 592

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-E 241
            WL PLL  +Y  RK +  PVI+ I+ +   + +V   D+  RGIF W M +    +P +
Sbjct: 593 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPD 649

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
             AK R   ++  + P  AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG 
Sbjct: 650 VVAKNRIKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGE 709

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EP 359
           IE +PCSR+GH++R+  PY+F K  DR+K   +  N  RV E W DE +K  FY      
Sbjct: 710 IEIIPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHL 764

Query: 360 LAMFLDMGDISEQ 372
           +   LD+G+++EQ
Sbjct: 765 IDQGLDVGNLTEQ 777


>gi|345797223|ref|XP_545481.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Canis
           lupus familiaris]
          Length = 602

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 154/375 (41%), Positives = 222/375 (59%), Gaps = 13/375 (3%)

Query: 2   PVFKADGKLGNLEPPLEPYK-EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
           P   A  ++  ++  L P   E PG+ G+   +P       +    E   N+  S+ I  
Sbjct: 77  PAQPAVRRVSGIDATLSPRDPEAPGQFGRPVVVPRGKEKEAERRWKEGNFNVYLSDLIPV 136

Query: 61  DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 120
           DR I D R   C       +LP  SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LV
Sbjct: 137 DRAIEDTRPAGCAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLV 196

Query: 121 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           DDFS+K  L   L+ Y+ +F  KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E 
Sbjct: 197 DDFSTKDYLKDDLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVEC 255

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
            + WL PLL  +Y  RK +  PVI+ I+ +   + +V   D+  RGIF W M +    +P
Sbjct: 256 NVGWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 312

Query: 241 -EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCG 299
            +  AK R   ++  + P  AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCG
Sbjct: 313 PDVVAKNRIKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCG 372

Query: 300 GSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-- 357
           G IE +PCSR+GH++R+  PY+F K  DR+K   +  N  RV E W DE +K  FY    
Sbjct: 373 GEIEIIPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGD 427

Query: 358 EPLAMFLDMGDISEQ 372
             +   LD+G+++EQ
Sbjct: 428 HLIDQGLDVGNLTEQ 442


>gi|307198758|gb|EFN79561.1| Polypeptide N-acetylgalactosaminyltransferase 35A [Harpegnathos
           saltator]
          Length = 606

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 151/365 (41%), Positives = 219/365 (60%), Gaps = 20/365 (5%)

Query: 13  LEP-PLEP---YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           L+P P++P     +G  E G   +L +  +   D     Y  N+  S+++   RT+PD R
Sbjct: 66  LQPVPVKPAVTLDQGLDELGMVKNLDDQRKR--DEGYKNYSFNVLISDNLGVLRTLPDTR 123

Query: 69  MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
            + C+   YP +LP AS+I+ F+NE + +L+R++HSII +TP   L EIILV+D+S    
Sbjct: 124 HKLCRARKYPTNLPNASIIICFYNEHYMTLLRSLHSIIDKTPTSLLHEIILVNDYSDSNI 183

Query: 129 LDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
           L +K++ YI   F+ KV+  +  +REGLIR R  GA+++ G+V++FLD+H EV   W+ P
Sbjct: 184 LHEKIKVYITNNFDAKVQFFKTDKREGLIRARVFGARKATGDVLIFLDSHIEVNEVWIEP 243

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 247
           LL+ I   + I+ +PVID I+  T++    Y      RG F WG+ +K + LP    K+ 
Sbjct: 244 LLSRIAHSKTIVAMPVIDIINADTFQ----YTGSPLVRGGFNWGLHFKWDNLPIGTLKQE 299

Query: 248 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 307
               +P KSPT AGGLFA+DR +F ++G YD G+ VWGGEN E+SF+IWMCGG+IE +PC
Sbjct: 300 DDFVKPIKSPTMAGGLFAIDREYFTKIGEYDTGMDVWGGENLEISFRIWMCGGNIELIPC 359

Query: 308 SRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG 367
           SR+GHV+R   PY      D      +  N  RV   W DE +K YF         +D G
Sbjct: 360 SRVGHVFRRRRPYGSDDPQD-----TMLKNSLRVAHVWLDE-YKDYFLRN---VRKIDFG 410

Query: 368 DISEQ 372
           DISE+
Sbjct: 411 DISER 415


>gi|354548807|gb|AER27632.1| AT25481p1 [Drosophila melanogaster]
          Length = 666

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 152/328 (46%), Positives = 207/328 (63%), Gaps = 16/328 (4%)

Query: 49  GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
           G N   S+ IS +R++PDLR+E CK   Y   LP  SVI +F NE F++L+R+++S+I R
Sbjct: 167 GFNGLISDRISVNRSVPDLRLEACKTRKYLAKLPNISVIFIFFNEHFNTLLRSIYSVINR 226

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESR 167
           TP + L++I+LVDD S    L Q L+DY+Q+ F   V ++RN ER+GLI  R  GAK + 
Sbjct: 227 TPPELLKQIVLVDDGSEWDVLKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAV 286

Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
           G+V+VF D+H EV  NWLPPL+ PI  + KI T P++D I ++ + + S  +     RG 
Sbjct: 287 GQVMVFFDSHIEVNYNWLPPLIEPIAINPKISTCPMVDTISHEDFSYFSGNK--DGARGG 344

Query: 228 FEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
           F+W MLYK+   LPE    K    S PY+SP   GGLFA++  FF +LGGYD  L +WGG
Sbjct: 345 FDWKMLYKQLPVLPEDALDK----SMPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGG 400

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG-PLITYNYKRVIETW 345
           E +ELSFKIWMCGG +  VPCSR+ H++R  M     K     +G   +  N+KRV E W
Sbjct: 401 EQYELSFKIWMCGGMLLDVPCSRVAHIFRGPM-----KPRGNPRGHNFVAKNHKRVAEVW 455

Query: 346 FDEKHKAYFYTREPLAM-FLDMGDISEQ 372
            DE +K Y Y R+P     LD GD++ Q
Sbjct: 456 MDE-YKQYVYKRDPKTYDNLDAGDLTRQ 482


>gi|301776863|ref|XP_002923851.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Ailuropoda melanoleuca]
          Length = 937

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 154/373 (41%), Positives = 220/373 (58%), Gaps = 18/373 (4%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           V K D  L   +P      + PG+ G+   +P       +    E   N+  S+ I  DR
Sbjct: 420 VLKIDVTLSPRDP------KAPGQFGRPVVVPRGKEKEAERRWKEGNFNVYLSDLIPVDR 473

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
            I D R   C       +LP  SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LVDD
Sbjct: 474 AIEDTRPAGCAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDD 533

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
           FS+K  L   L+ Y+ +F  KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  +
Sbjct: 534 FSTKDYLKGNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 592

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-E 241
            WL PLL  +Y  RK +  PVI+ I+ +   + +V   D+  RGIF W M +    +P +
Sbjct: 593 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPD 649

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
             AK R   ++  + P  AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG 
Sbjct: 650 VVAKNRIKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGE 709

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EP 359
           IE +PCSR+GH++R+  PY+F K  DR+K   +  N  RV E W DE +K  FY      
Sbjct: 710 IEIIPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHL 764

Query: 360 LAMFLDMGDISEQ 372
           +   LD+G+++EQ
Sbjct: 765 IDQGLDVGNLTEQ 777


>gi|195359229|ref|XP_002045319.1| GM11142 [Drosophila sechellia]
 gi|194122575|gb|EDW44618.1| GM11142 [Drosophila sechellia]
          Length = 658

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 151/328 (46%), Positives = 207/328 (63%), Gaps = 16/328 (4%)

Query: 49  GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
           G N   S+ IS +R++PD+R+E CK   Y   LP  SVI +F NE F++L+R+++S+I R
Sbjct: 160 GFNGLISDRISVNRSVPDVRLEACKTRKYLAKLPNISVIFIFFNEHFNTLLRSIYSVINR 219

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESR 167
           TP + L++I+LVDD S    L Q L+DY+Q+ F   V ++RN ER+GLI  R  GAK + 
Sbjct: 220 TPPELLKQIVLVDDGSEWDVLKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAV 279

Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
           G+V+VF D+H EV  NWLPPL+ PI  + KI T P++D I ++ + + S  +     RG 
Sbjct: 280 GQVMVFFDSHIEVNYNWLPPLIEPIAINPKISTCPIVDTISHEDFSYFSGNK--DGARGG 337

Query: 228 FEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
           F+W MLYK+   LPE    K    S PY+SP   GGLFA++  FF +LGGYD  L +WGG
Sbjct: 338 FDWKMLYKQLPVLPEDALDK----SMPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGG 393

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG-PLITYNYKRVIETW 345
           E +ELSFKIWMCGG +  VPCSR+ H++R  M     K     +G   +  N+KRV E W
Sbjct: 394 EQYELSFKIWMCGGMLLDVPCSRVAHIFRGPM-----KPRGNPRGHNFVAKNHKRVAEVW 448

Query: 346 FDEKHKAYFYTREPLAM-FLDMGDISEQ 372
            DE +K Y Y R+P     LD GD++ Q
Sbjct: 449 MDE-YKQYVYKRDPKTYDSLDAGDLTRQ 475


>gi|157114750|ref|XP_001652403.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108883556|gb|EAT47781.1| AAEL001121-PA [Aedes aegypti]
          Length = 647

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 156/374 (41%), Positives = 213/374 (56%), Gaps = 28/374 (7%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLPE----AYRAAGDASLGEYGMNMETSNHISFDRTIPD 66
           G + PP E   + PG  GK   LP+      + A D    +   N   ++ IS  R++PD
Sbjct: 130 GVIAPPHEDSPDSPGAMGKPVVLPKDMSPEMKKAVDDGWSKNAFNQYAADLISIRRSLPD 189

Query: 67  LRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
            R   CK    Y  DLP  SVI+ FHNE +S L+RTVHS++ R+P   ++E+ILVDDFS 
Sbjct: 190 PRDPWCKEPGRYGTDLPATSVIICFHNEAWSVLLRTVHSVLDRSPEHLVKEVILVDDFSD 249

Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
                ++LEDY + +  +V++IR  +REGLIR R  GA+ +   V+ +LD+HCE    WL
Sbjct: 250 MPHTQKQLEDYFEAYP-RVKIIRAPKREGLIRARLLGARYATAPVLTYLDSHCECTTGWL 308

Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENE 238
            PLL  I  +   +  PVID ID  T E+        HYR       G F+W + +  + 
Sbjct: 309 EPLLDRIARNSTTVVCPVIDVIDDNTMEY--------HYRDSGGVNVGGFDWNLQFNWHA 360

Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
           +P+RE K+ K  +EP  SPT AGGLF++D+ FF  LG YD G  +WGGEN ELSFK WMC
Sbjct: 361 VPDREKKRHKSTAEPVFSPTMAGGLFSIDKEFFERLGTYDSGFDIWGGENLELSFKTWMC 420

Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
           GG++E VPCS +GH++R   PY +     R    +I  N  R+ E W DE  K Y+Y R 
Sbjct: 421 GGTLEIVPCSHVGHIFRKRSPYKW-----RTGVNVIKRNSVRLAEVWLDEYAK-YYYQRI 474

Query: 359 PLAMFLDMGDISEQ 372
                 D GD+SE+
Sbjct: 475 GNDKG-DYGDVSER 487


>gi|195576344|ref|XP_002078036.1| GD23236 [Drosophila simulans]
 gi|194190045|gb|EDX03621.1| GD23236 [Drosophila simulans]
          Length = 674

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 151/328 (46%), Positives = 207/328 (63%), Gaps = 16/328 (4%)

Query: 49  GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
           G N   S+ IS +R++PD+R+E CK   Y   LP  SVI +F NE F++L+R+++S+I R
Sbjct: 175 GFNGLISDRISVNRSVPDVRLEACKTRKYLAKLPNISVIFIFFNEHFNTLLRSIYSVINR 234

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESR 167
           TP + L++I+LVDD S    L Q L+DY+Q+ F   V ++RN ER+GLI  R  GAK + 
Sbjct: 235 TPPELLKQIVLVDDGSEWDVLKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAV 294

Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
           G+V+VF D+H EV  NWLPPL+ PI  + KI T P++D I ++ + + S  +     RG 
Sbjct: 295 GQVMVFFDSHIEVNYNWLPPLIEPIAINPKISTCPIVDTISHEDFSYFSGNK--DGARGG 352

Query: 228 FEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
           F+W MLYK+   LPE    K    S PY+SP   GGLFA++  FF +LGGYD  L +WGG
Sbjct: 353 FDWKMLYKQLPVLPEDALDK----SMPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGG 408

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG-PLITYNYKRVIETW 345
           E +ELSFKIWMCGG +  VPCSR+ H++R  M     K     +G   +  N+KRV E W
Sbjct: 409 EQYELSFKIWMCGGMLLDVPCSRVAHIFRGPM-----KPRGNPRGHNFVAKNHKRVAEVW 463

Query: 346 FDEKHKAYFYTREPLAM-FLDMGDISEQ 372
            DE +K Y Y R+P     LD GD++ Q
Sbjct: 464 MDE-YKQYVYKRDPKTYDNLDAGDLTRQ 490


>gi|148356242|ref|NP_001038243.2| polypeptide N-acetylgalactosaminyltransferase 4 precursor [Danio
           rerio]
 gi|60416047|gb|AAH90692.1| WD repeat domain 51B, like [Danio rerio]
 gi|182890540|gb|AAI64662.1| Wdr51bl protein [Danio rerio]
          Length = 582

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 152/359 (42%), Positives = 207/359 (57%), Gaps = 16/359 (4%)

Query: 17  LEPYKEGPGEGGKAYHLP--EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
           L P    PGE G+A  L      +   +AS+    +N+  S+ IS  R I D RM ECK 
Sbjct: 72  LPPDSNAPGEYGRATRLTLTSEEKKEEEASVERCAINIFISDKISLHRHIQDNRMHECKA 131

Query: 75  WDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKL 133
             Y +  LP  SV++ F+NE +S+L+RT+HS+++ TPA  L++IILVDDFS +  L  +L
Sbjct: 132 KKYNIRRLPTTSVVIAFYNEAWSTLLRTIHSVLETTPAVLLKDIILVDDFSDRGYLKSQL 191

Query: 134 EDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIY 193
             YI     +VRLIR  +REGL+R R  GA  + G V+ FLD HCE    W+ PLL  I 
Sbjct: 192 AQYISNLE-RVRLIRTKKREGLVRARLIGATYATGSVLTFLDCHCECVPGWIEPLLERIA 250

Query: 194 SDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEP 253
            +   +  PVID ID+ T+EF    + +    G F+W + ++ + +PE + K RK   +P
Sbjct: 251 ENETTIICPVIDTIDWNTFEF--YMQTEEPMVGGFDWRLTFQWHAVPEIDRKIRKSRIDP 308

Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
            +SPT AGGLFA+ +A+F  LG YD G+ VWGGEN ELSF++W CGGS+E  PCS +GHV
Sbjct: 309 IRSPTMAGGLFAVSKAYFEYLGTYDMGMEVWGGENLELSFRVWQCGGSLEIHPCSHVGHV 368

Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +    PY                N  R  E W D  +K +FY R P A     GDISE+
Sbjct: 369 FPKKAPYARSNFLQ---------NTVRAAEVWMD-TYKQHFYNRNPPARKESYGDISER 417


>gi|195433228|ref|XP_002064617.1| GK23729 [Drosophila willistoni]
 gi|194160702|gb|EDW75603.1| GK23729 [Drosophila willistoni]
          Length = 677

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 137/334 (41%), Positives = 201/334 (60%), Gaps = 10/334 (2%)

Query: 15  PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 74
           P +      PGE GK   +P   +        E   N+  S+ IS +R++ D+R E C+ 
Sbjct: 122 PTVREQHGQPGEMGKPVKIPADMKEVMKEKFKENQFNLLASDMISLNRSLTDVRHENCRR 181

Query: 75  WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
             Y   LP  S+++VFHNE +++L+RTV S+I R+P   L+EIILVDD S +  L +KLE
Sbjct: 182 KHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASERDFLGKKLE 241

Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           DY+ +   +  ++R  +R GLIR R  GA+   GEVI FLDAHCE    WL PLLA I  
Sbjct: 242 DYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVTGEVITFLDAHCECTEGWLEPLLARIVQ 301

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEP 253
           +R+ +  P+ID I  +T+E+  +   D  + G F W + ++   +P RE  +R  + + P
Sbjct: 302 NRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDRTAP 358

Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
            ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +GHV
Sbjct: 359 LRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHV 418

Query: 314 YRSFMPYNF-GKLADRVKGPLITYNYKRVIETWF 346
           +R   PY F G +A      ++ +N  RV E W 
Sbjct: 419 FRDKSPYTFPGGVA-----KIVLHNAARVAEVWM 447


>gi|312379012|gb|EFR25425.1| hypothetical protein AND_09241 [Anopheles darlingi]
          Length = 671

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 155/371 (41%), Positives = 210/371 (56%), Gaps = 29/371 (7%)

Query: 1   RPVFKADGKLGNLEPPL--EPYKEGPGEGGKAYHLPE----AYRAAGDASLGEYGMNMET 54
           RP  + D + G   P +   P + GPGE GK   LP+      +   D    +   N   
Sbjct: 142 RPARQPDDQGGLALPGVIAPPSEGGPGELGKPVVLPKDLSPEVKKLVDEGWAKNAFNQYV 201

Query: 55  SNHISFDRTIPDLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 113
           ++ IS  RT+PD R   CK    Y  DLP  SVI+ FHNE +S L+RTVHS++ R+P   
Sbjct: 202 ADMISIRRTLPDPRDAWCKEPGRYREDLPPTSVIICFHNEAWSVLLRTVHSVLDRSPEHL 261

Query: 114 LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 173
           ++E+ILVDDFS      ++LE+Y   +  +V+++R  +REGLIR R  GA+ +   V+ +
Sbjct: 262 VKEVILVDDFSDMPHTQKQLEEYFLAY-PRVKIVRAAKREGLIRARLLGARHATAPVLTY 320

Query: 174 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------G 226
           LD+HCE    WL PLL  I  +   +  PVID ID  T E+        HYR       G
Sbjct: 321 LDSHCECTTGWLEPLLDRIARNSTTVVCPVIDVIDDNTMEY--------HYRDSGGVNVG 372

Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
            F+W + +  + +PERE +K K  +EP  SPT AGGLFA+DR FF  LG YD G  +WGG
Sbjct: 373 GFDWNLQFNWHAVPEREKRKHKSAAEPVWSPTMAGGLFAIDRVFFERLGTYDSGFDIWGG 432

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
           EN ELSFK WMCGGS+E +PCS +GH++R   PY +     R    +I  N  R+ E W 
Sbjct: 433 ENLELSFKTWMCGGSLEIIPCSHVGHIFRKRSPYKW-----RTGVNVIKRNSVRLAEVWM 487

Query: 347 DEKHKAYFYTR 357
           DE +  Y+Y R
Sbjct: 488 DE-YAQYYYQR 497


>gi|410968689|ref|XP_003990834.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 5 [Felis catus]
          Length = 939

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 151/360 (41%), Positives = 216/360 (60%), Gaps = 14/360 (3%)

Query: 16  PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
           P +P  + PG+ G+   +P       +    E   N+  S+ I  DR I D R   C   
Sbjct: 431 PRDP--KAPGQFGRPVVVPRGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQ 488

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
               +LP  SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LVDDFS+K  L   L+ 
Sbjct: 489 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDK 548

Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           Y+ +F  KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y  
Sbjct: 549 YMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 607

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
           RK +  PVI+ I+ +   + +V   D+  RGIF W M +    +P +  AK R   ++  
Sbjct: 608 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVVAKNRIKETDII 664

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           + P  AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 665 RCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 724

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
           R+  PY F K  DR+K   +  N  RV E W DE +K  FY      +   LD+G+++EQ
Sbjct: 725 RNDNPYTFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTEQ 779


>gi|281346614|gb|EFB22198.1| hypothetical protein PANDA_015357 [Ailuropoda melanoleuca]
          Length = 491

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 153/362 (42%), Positives = 207/362 (57%), Gaps = 18/362 (4%)

Query: 14  EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
           +PP + +  G         L E      +  +  Y +N+  S+ IS  R I D RM ECK
Sbjct: 5   KPPADSHALGEWGKASKLQLSEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECK 64

Query: 74  Y--WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
              +DY   LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EIILVDD S +  L  
Sbjct: 65  SRKFDY-RRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKT 123

Query: 132 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 191
           +LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE    WL PLL  
Sbjct: 124 QLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLER 182

Query: 192 IYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
           I  D   +  PVID ID+ T+EF     EP     G F+W + ++ + +P+ E  +RK  
Sbjct: 183 ISKDETTVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKSR 239

Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
            +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG +E  PCS +
Sbjct: 240 IDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHV 299

Query: 311 GHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           GHV+    PY           P    N  R  E W DE +K +FY R P A     GDIS
Sbjct: 300 GHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDIS 349

Query: 371 EQ 372
           E+
Sbjct: 350 ER 351


>gi|332233960|ref|XP_003266176.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 5 [Nomascus
           leucogenys]
          Length = 940

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 150/360 (41%), Positives = 218/360 (60%), Gaps = 14/360 (3%)

Query: 16  PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
           P +P  + PG+ G+   +P       +    E   N+  S+ I  DR I D R   C   
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQ 489

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
               +LP  SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LVDDFS+K  L   L+ 
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDK 549

Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           Y+ +F  KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y  
Sbjct: 550 YMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
           RK +  PVI+ I+ +   + +V   D+  RGIF W M +    +P +  AK R   ++  
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWKTIPPDVIAKNRIKETDII 665

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           + P  AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 RCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
           R+  PY+F K  DR+K   +  N  RV E W DE +K  FY      +   LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 780


>gi|149730635|ref|XP_001491185.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Equus
           caballus]
          Length = 940

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 152/373 (40%), Positives = 221/373 (59%), Gaps = 18/373 (4%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           V K D  L   +P      + PG+ G+   +P       +    E   N+  S+ I  DR
Sbjct: 423 VLKIDVTLSPRDP------KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDR 476

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
            I D R   C       +LP  SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LVDD
Sbjct: 477 AIEDTRPAGCAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDD 536

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
           FS+K  L   L+ Y+ +F  KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  +
Sbjct: 537 FSTKDYLKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 595

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-E 241
            WL PLL  +Y  RK +  PVI+ I+ +   + +V   D+  RG+F W M +    +P +
Sbjct: 596 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGVFVWPMNFGWRTIPPD 652

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
             AK R  +++  + P  AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG 
Sbjct: 653 IVAKNRIKDTDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGE 712

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EP 359
           IE +PCSR+GH++R+  PY+F K  DR+K   +  N  RV E W DE +K  FY      
Sbjct: 713 IEIIPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHL 767

Query: 360 LAMFLDMGDISEQ 372
           +   LD+G++++Q
Sbjct: 768 IDQGLDVGNLTQQ 780


>gi|109099754|ref|XP_001087663.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
           2 [Macaca mulatta]
          Length = 940

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 151/360 (41%), Positives = 217/360 (60%), Gaps = 14/360 (3%)

Query: 16  PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
           P +P  + PG+ G+   +P       +    E   N+  S+ I  DR I D R   C   
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQ 489

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
               +LP  SVI+ F +E +S+L+R+VHS++ R+P   +EEI+LVDDFS+K  L   L+ 
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIEEILLVDDFSTKDYLKDNLDK 549

Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           Y+ +F  KVR++   ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y  
Sbjct: 550 YMSQF-PKVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
           RK +  PVI+ I+ +   + +V   D+  RGIF W M +    +P +  AK R   ++  
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDAI 665

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           K P  AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 KCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
           R+  PY+F K  DR+K   +  N  RV E W DE +K  FY      +   LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 780


>gi|403258969|ref|XP_003922012.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
           1 [Saimiri boliviensis boliviensis]
          Length = 940

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 148/353 (41%), Positives = 214/353 (60%), Gaps = 12/353 (3%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
            PG+ G+   +P       +    E   N+  S+ I  DR I D R   C       +LP
Sbjct: 437 APGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQLVHNNLP 496

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LVDDFS+K  L   L+ Y+ +F  
Sbjct: 497 TTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQF-P 555

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
           KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y  RK +  P
Sbjct: 556 KVRILRLRERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVACP 615

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPYKSPTHAG 261
           VI+ I+ +   + +V   D+  RGIF W M +    +P +  AK R   ++  + P  AG
Sbjct: 616 VIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDVIRCPVMAG 672

Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
           GLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++R+  PY+
Sbjct: 673 GLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIFRNDNPYS 732

Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
           F K  DR+K   +  N  RV E W DE +K  FY      +   LD+G++++Q
Sbjct: 733 FPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLINQGLDVGNLTQQ 780


>gi|296204771|ref|XP_002749473.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
           [Callithrix jacchus]
          Length = 940

 Score =  272 bits (696), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 148/354 (41%), Positives = 215/354 (60%), Gaps = 12/354 (3%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           + PG+ G+   +P       +    E   N+  S+ I  DR I D R   C       +L
Sbjct: 436 KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQLVHNNL 495

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LVDDFS+K  L   L+ Y+ +F 
Sbjct: 496 PTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDDLDKYMSQF- 554

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
            KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y  RK +  
Sbjct: 555 PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVAC 614

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPYKSPTHA 260
           PVI+ I+ +   + +V   D+  RGIF W M +    +P +  AK R   ++  + P  A
Sbjct: 615 PVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDVIRCPVMA 671

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++R+  PY
Sbjct: 672 GGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIFRNDNPY 731

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
           +F K  DR+K   +  N  RV E W DE +K  FY      +   LD+G++++Q
Sbjct: 732 SFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 780


>gi|348522865|ref|XP_003448944.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           protein 2-like [Oreochromis niloticus]
          Length = 590

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 140/342 (40%), Positives = 206/342 (60%), Gaps = 13/342 (3%)

Query: 25  GEGGKAY--HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           GE GKA   HL    R     +L +YG N   S  IS  R +P+ R  +C   ++   LP
Sbjct: 126 GEMGKAVRLHLEGLERDMELRALQQYGFNEVVSERISLHRRLPEARHPKCLGVEHIESLP 185

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
            ASV++ F++E +S+L+RTVHS++   P QYL+E++LVDD S +  L   L +Y+   +G
Sbjct: 186 SASVVICFNDEAWSTLLRTVHSVLDTAPKQYLQEVLLVDDLSQQGHLKTGLSEYVSHLDG 245

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            VRLIR+T+R G+   R+ GA  + GEV+VF+D+HCE    WL PLL  I  DR  +  P
Sbjct: 246 -VRLIRSTKRLGVGGCRTLGAARAVGEVVVFMDSHCECQKGWLEPLLERIALDRTRVVSP 304

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 262
           ++D ID+QT+ + +   P    RG+F+W + +    +PE + K+ +   +P +SP   GG
Sbjct: 305 IMDVIDWQTFRYNATQWP---VRGVFDWRLDFFWESIPELQDKEPEMAVQPLQSPALGGG 361

Query: 263 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF 322
           + A+DR FF  +G YDPG+++WG E  ELS ++W CGGS+E VPCSR+GH+ R  +PY F
Sbjct: 362 VVAIDRHFFQSVGTYDPGMVLWGAEQIELSIRVWSCGGSMEVVPCSRVGHLIRHHLPYRF 421

Query: 323 GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
                     L+  N  R+ ETW D  +K  +Y R+ LA F+
Sbjct: 422 P------DQDLLQRNKIRIAETWMD-TYKKIYYRRDTLAHFI 456


>gi|195584006|ref|XP_002081807.1| GD25523 [Drosophila simulans]
 gi|194193816|gb|EDX07392.1| GD25523 [Drosophila simulans]
          Length = 650

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 152/357 (42%), Positives = 206/357 (57%), Gaps = 28/357 (7%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           ++PP   ++E PGE GK   LP    E  + A D    +   N   S+ IS  RT+PD R
Sbjct: 136 IDPPAN-FEENPGELGKPVRLPKEMSEEMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 194

Query: 69  MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
              CK    Y  DLPK  VI+ FHNE ++ L+RTVHS++ R+P   + +IILVDD+S   
Sbjct: 195 DAWCKDEARYLTDLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 254

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L ++LEDY   +  KV++IR  +REGLIR R  GA  ++  V+ +LD+HCE    WL P
Sbjct: 255 HLKRQLEDYFAAY-PKVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 313

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
           LL  I  +   +  PVID I  +T E+        HYR       G F+W + +  + +P
Sbjct: 314 LLDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 365

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
           ERE K+    +EP  SPT AGGLF++DR FF  LG YD G  +WGGEN ELSFK WMCGG
Sbjct: 366 ERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGG 425

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           ++E VPCS +GH++R   PY +     R    ++  N  R+ E W DE +  Y+Y R
Sbjct: 426 TLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKKNSVRLAEVWMDE-YSQYYYHR 476


>gi|344276550|ref|XP_003410071.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5-like
           [Loxodonta africana]
          Length = 448

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 142/328 (43%), Positives = 201/328 (61%), Gaps = 11/328 (3%)

Query: 45  LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
           L +YG N+  S  +  +R +PD R + C    YP  LP ASVI+ FHNE F++L RTV S
Sbjct: 102 LLQYGFNIIISRSLGKEREVPDTRNKMCLEKHYPKYLPTASVIICFHNEEFNALFRTVSS 161

Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           ++  TP   LEEIILVDD S   DL +KL+ +++ F GK++LIRN +REGLIR R  GA 
Sbjct: 162 VMNLTPHYILEEIILVDDMSEFDDLKEKLDYHLEVFRGKIKLIRNKKREGLIRARLIGAS 221

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            + G+V+VFLD+HCEV   WL PLL  I  D K++  P+ID I+  T E    Y P    
Sbjct: 222 RASGDVLVFLDSHCEVNRVWLEPLLFAISKDPKVVVCPLIDVINDTTLE----YTPSPVV 277

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RG F W + +K + +   E +  +  + P +SP  AGG+FA+ R +F E+G YD G+ +W
Sbjct: 278 RGAFNWKLQFKWDNVLSYEMEGPEGPTGPIRSPAMAGGIFAIQRKYFNEIGQYDKGMYLW 337

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
           GGEN ELS +IWMCGG +  +PCSR+GH+ +  +  NF  +        + YN  R++  
Sbjct: 338 GGENLELSLRIWMCGGQLFIIPCSRVGHISKQHIQNNFRFMQS------LRYNNLRLVHV 391

Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           W DE +K  F+ + P    ++ G+ISE+
Sbjct: 392 WLDE-YKEQFFLQGPGLKSMNYGNISER 418


>gi|443726011|gb|ELU13353.1| hypothetical protein CAPTEDRAFT_91056 [Capitella teleta]
          Length = 426

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 152/354 (42%), Positives = 205/354 (57%), Gaps = 13/354 (3%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC--KYWDYPL 79
             PGE G++        A           N   S+ +SF+RTIPD R   C  K +DY  
Sbjct: 42  NSPGEHGRSVRTSPDDEAVVKEGFRLASFNQHASDLVSFERTIPDSRPPRCRDKSYDYS- 100

Query: 80  DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
            LPK SVI+ F  E +S+L+R+VHS++ RTP   LEEI+LVDDFS +  L  KL+DY+ R
Sbjct: 101 SLPKMSVIICFTEESWSTLLRSVHSVLNRTPPDLLEEILLVDDFSQREHLHAKLDDYLTR 160

Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
              KV LIR   R+GLIR R R  + +RG V+ FLD+H E  + W  PLL  I  +R+++
Sbjct: 161 L-PKVTLIRLPSRQGLIRARLRAIEIARGPVLTFLDSHVECNVGWAEPLLQRISHNRRVI 219

Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPT 258
             PVID I  + + +  +     + RG F W ML+K   +P+ E  +   + + P ++PT
Sbjct: 220 VAPVIDAISSRDFSYIPI---SANQRGGFNWAMLFKWMPVPDYEKSRTGGDPTAPVRTPT 276

Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
            AGGLFA+ + FF  LG YDPGL +WG EN ELSFK WMCGGS+E +PC+R+GHVYRS  
Sbjct: 277 IAGGLFAIHQGFFRSLGFYDPGLHIWGSENLELSFKAWMCGGSMEMIPCARVGHVYRSTQ 336

Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           PY+F      VK  +   N  RV   W D+ +   FY  +P       GDIS +
Sbjct: 337 PYSFP--GGNVK--VFMRNNLRVANVWMDD-YVDLFYLMKPELRNEPFGDISSR 385


>gi|431894831|gb|ELK04624.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Pteropus alecto]
          Length = 939

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 148/354 (41%), Positives = 214/354 (60%), Gaps = 12/354 (3%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           + PG+ G+   +P       +    E   N+  S+ I  DR I D R   C       +L
Sbjct: 435 KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAKQLVHNNL 494

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LVDDFS+K  L   L+ Y+ +F 
Sbjct: 495 PTTSVIMCFVDEVWSTLVRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQF- 553

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
            KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y  RK +  
Sbjct: 554 PKVRILRLRERHGLIRARLAGAQNATGDVLTFLDSHVECNIGWLEPLLERVYLSRKKVAC 613

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPYKSPTHA 260
           PVI+ I+ +   + +V   D+  RGIF W M +    +P +  AK R   ++  + P  A
Sbjct: 614 PVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVVAKNRIKETDIIRCPVMA 670

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++R+  PY
Sbjct: 671 GGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIFRNDNPY 730

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
           +F K  DR+K   +  N  RV E W DE +K  FY      +   LD+G++++Q
Sbjct: 731 SFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 779


>gi|338721407|ref|XP_001494570.3| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 4 [Equus caballus]
          Length = 703

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 150/361 (41%), Positives = 204/361 (56%), Gaps = 16/361 (4%)

Query: 14  EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
           +PP + +  G         L E      +  +  Y +N+  S+ IS  R I D RM ECK
Sbjct: 191 KPPADSHALGEWGKASKLQLNEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECK 250

Query: 74  YWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
              +    LP  SV++ F+NE +S+L+RT+HS+++ +PA  L+EIILVDD S +  L  +
Sbjct: 251 SQKFNYRKLPTTSVVIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQ 310

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE    WL PLL  I
Sbjct: 311 LETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERI 369

Query: 193 YSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 251
             D   +  PVID ID+ T+EF     EP     G F+W + ++ + +P+ E  +RK   
Sbjct: 370 SKDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKSRI 426

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
           +P  SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG +E  PCS +G
Sbjct: 427 DPISSPTMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVG 486

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+    PY           P    N  R  E W DE +K +FY R P A     GDISE
Sbjct: 487 HVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDISE 536

Query: 372 Q 372
           +
Sbjct: 537 R 537


>gi|195335001|ref|XP_002034165.1| GM20039 [Drosophila sechellia]
 gi|194126135|gb|EDW48178.1| GM20039 [Drosophila sechellia]
          Length = 650

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 152/357 (42%), Positives = 206/357 (57%), Gaps = 28/357 (7%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           ++PP   ++E PGE GK   LP    E  + A D    +   N   S+ IS  RT+PD R
Sbjct: 136 IDPPAN-FEENPGELGKPVRLPKEMSEEMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 194

Query: 69  MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
              CK    Y  DLPK  VI+ FHNE ++ L+RTVHS++ R+P   + +IILVDD+S   
Sbjct: 195 DAWCKDEARYLTDLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 254

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L ++LEDY   +  KV++IR  +REGLIR R  GA  ++  V+ +LD+HCE    WL P
Sbjct: 255 HLKRQLEDYFAAY-PKVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 313

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
           LL  I  +   +  PVID I  +T E+        HYR       G F+W + +  + +P
Sbjct: 314 LLDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 365

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
           ERE K+    +EP  SPT AGGLF++DR FF  LG YD G  +WGGEN ELSFK WMCGG
Sbjct: 366 ERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGG 425

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           ++E VPCS +GH++R   PY +     R    ++  N  R+ E W DE +  Y+Y R
Sbjct: 426 TLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKKNSVRLAEVWMDE-YSQYYYHR 476


>gi|327282475|ref|XP_003225968.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Anolis carolinensis]
          Length = 583

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 156/376 (41%), Positives = 221/376 (58%), Gaps = 28/376 (7%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEA--YRAAGDASLGEYGMNMETSNHI 58
           RPV++        +PP +P+  G GE GKA  L  +   +   +  +  Y +N+  S+ I
Sbjct: 66  RPVYQ--------KPPPDPH--GLGEWGKAARLTLSPEEKKLEEELVERYAINIYLSDKI 115

Query: 59  SFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 116
           S  R I D RM EC  K +DY   LP  SVI+ F+NE +S+L+RT+HS+++ +P+  L+E
Sbjct: 116 SLHRHIDDGRMPECRSKTYDY-RRLPTTSVIIAFYNEAWSTLLRTIHSVLESSPSVLLKE 174

Query: 117 IILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 176
           IILVDD S K  L  +LE YI     +VRLIR  +REGL+R R  GA  + G+V+ FLD 
Sbjct: 175 IILVDDLSDKVYLKGELEKYISNLQ-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDC 233

Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 236
           HCE    WL PLL  +  +  ++  PVID ID+ T+EF    +P     G F+W + ++ 
Sbjct: 234 HCECVPGWLEPLLQRVAENESVIICPVIDTIDWNTFEF--YMQPGEPMIGGFDWRLTFQW 291

Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
           + +P+ E ++RK   +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W
Sbjct: 292 HSVPDYERQRRKSKVDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMDVWGGENLELSFRVW 351

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
            CGG +E  PCS +GHV+    PY           P    N  R  E W D+ +K +FY 
Sbjct: 352 QCGGILEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDD-YKEHFYN 401

Query: 357 REPLAMFLDMGDISEQ 372
           R P A   + GD+SE+
Sbjct: 402 RNPPARKENFGDLSER 417


>gi|380786811|gb|AFE65281.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Macaca mulatta]
          Length = 558

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 146/346 (42%), Positives = 203/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L +  +A G+    ++  N   S+ +S DR I D R   C    Y  DLP  SVI+
Sbjct: 71  KAYLLAKQLKA-GEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA  +   V+ FLD+HCEV   WLPP+L  +  D   +  P+ID I 
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF      
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400


>gi|449667968|ref|XP_002168066.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Hydra magnipapillata]
          Length = 548

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 144/379 (37%), Positives = 212/379 (55%), Gaps = 13/379 (3%)

Query: 2   PVFKADGKLGNLEPPLEPYKEGPGEGGKAYH--LPEAYRAAGDASLGEYGMNMETSNHIS 59
           P+   D  LG L   L P    P  G + Y   LP+  ++        +  +   S+ IS
Sbjct: 57  PIVDVD-VLGQLGIELYPELIDPLLGARGYPAILPDNLKSQSKNLFKNHSFDSLLSDRIS 115

Query: 60  FDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
            +R + +++ + C    YP +LP  SVI+ FHNE  S+L+RTVHS+I  TP   L  I+L
Sbjct: 116 LNRRLGNVKGDLCSSKQYPAELPNTSVIICFHNEATSALLRTVHSVINETPPNILSNIVL 175

Query: 120 VDDFSSKADLDQKLEDYIQRFNGK-----VRLIRNTEREGLIRTRSRGAKESRGEVIVFL 174
           VDD S  A L + L +YI   N K     V L RN +R+GL+R+R +GA+ + G V+ FL
Sbjct: 176 VDDASVGAALKKPLRNYINELNRKLGEEMVILYRNAKRQGLVRSRLKGAELASGTVLTFL 235

Query: 175 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 234
           D+HCE    W+ PLL  I  D++ +  PVI+ ID     ++          G F W + +
Sbjct: 236 DSHCEATEGWVEPLLFRIKEDKRNVVCPVIEVIDAVDLSYKKTELDRITQVGGFTWDLFF 295

Query: 235 KENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
              E+ E E + R   ++P KSPT AGGLFA+D+++F E+G YD  + +WGGEN E+SF+
Sbjct: 296 NWKEITEDEKRLRADGTQPLKSPTMAGGLFAIDKSYFYEIGSYDNQMEIWGGENLEMSFR 355

Query: 295 IWMCGGSIEWVPCSRIGHVYRS-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAY 353
           IWMCGG +E +PCSR+GH++R    PY+F     +     +  N+ R+ E W DE  + Y
Sbjct: 356 IWMCGGKLEIIPCSRVGHIFRKENSPYSFPNGVSKT----LAKNFNRLAEVWMDEYKELY 411

Query: 354 FYTREPLAMFLDMGDISEQ 372
           +  + P    +  GDISE+
Sbjct: 412 YRRKPPEDKLVKYGDISER 430


>gi|402888383|ref|XP_003907542.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Papio
           anubis]
          Length = 940

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 151/360 (41%), Positives = 217/360 (60%), Gaps = 14/360 (3%)

Query: 16  PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
           P +P  + PG+ G+   +P       +    E   N+  S+ I  DR I D R   C   
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQ 489

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
               +LP  SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LVDDFS+K  L   L+ 
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIKEILLVDDFSTKDYLKDNLDK 549

Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           Y+ +F  KVR++   ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y  
Sbjct: 550 YMSQF-PKVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
           RK +  PVI+ I+ +   + +V   D+  RGIF W M +    +P +  AK R   ++  
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDAI 665

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           K P  AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 KCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM--FLDMGDISEQ 372
           R+  PY+F K  DR+K   +  N  RV E W DE +K  FY      M   LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLMDQGLDVGNLTQQ 780


>gi|344268422|ref|XP_003406059.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
           [Loxodonta africana]
          Length = 939

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 149/360 (41%), Positives = 215/360 (59%), Gaps = 14/360 (3%)

Query: 16  PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
           P +P  + PG+ G+   +P            E   N+  S+ I  DR I D R   C   
Sbjct: 431 PRDP--KAPGQFGRPVIVPHGKEKEAKRRWKEGNFNVYLSDLIPVDRAIEDTRPTGCAEQ 488

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
               +LP  SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LVDDFS+K  L   L+ 
Sbjct: 489 LVHSNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDK 548

Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           Y+ +F  KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y  
Sbjct: 549 YMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNIGWLEPLLERVYLS 607

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
           RK +  PVI+ I+ +   + +V   D+  RG+F W M +    +P +  AK R   ++  
Sbjct: 608 RKKVACPVIEVINDKDMSYMTV---DNFQRGVFVWPMNFGWRTIPPDVVAKNRIKETDVI 664

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           + P  AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 665 RCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 724

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
           R+  PY F K  DR+K   +  N  RV E W DE +K  FY      +   LD+G++++Q
Sbjct: 725 RNDNPYTFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 779


>gi|297298138|ref|XP_001104403.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Macaca
           mulatta]
          Length = 558

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 146/346 (42%), Positives = 203/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L +  +A G+    ++  N   S+ +S DR I D R   C    Y  DLP  SVI+
Sbjct: 71  KAYLLAKQLKA-GEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA  +   V+ FLD+HCEV   WLPP+L  +  D   +  P+ID I 
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF      
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400


>gi|355564907|gb|EHH21396.1| hypothetical protein EGK_04452 [Macaca mulatta]
          Length = 940

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 150/360 (41%), Positives = 217/360 (60%), Gaps = 14/360 (3%)

Query: 16  PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
           P +P  + PG+ G+   +P       +    E   N+  S+ I  DR I D R   C   
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQ 489

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
               +LP  SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LVDDFS+K  L   L+ 
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIKEILLVDDFSTKDYLKDNLDK 549

Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           Y+ +F  KVR++   ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y  
Sbjct: 550 YMSQF-PKVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
           RK +  PVI+ I+ +   + +V   D+  RGIF W M +    +P +  AK R   ++  
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDAI 665

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           K P  AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 KCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
           R+  PY+F K  DR+K   +  N  RV E W DE +K  FY      +   LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 780


>gi|410214072|gb|JAA04255.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Pan
           troglodytes]
 gi|410214074|gb|JAA04256.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Pan
           troglodytes]
 gi|410295440|gb|JAA26320.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Pan
           troglodytes]
 gi|410295442|gb|JAA26321.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Pan
           troglodytes]
 gi|410336845|gb|JAA37369.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Pan
           troglodytes]
          Length = 558

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 147/346 (42%), Positives = 203/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y  DLP  SVI+
Sbjct: 71  KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  DL+  L   + R   KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSS--DLEDCL--LLTRI-PKVKCLR 184

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA  +   V+ FLD+HCEV   WLPP+L  +  D   +  P+ID I 
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF      
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400


>gi|449281639|gb|EMC88675.1| Polypeptide N-acetylgalactosaminyltransferase-like protein 2
           [Columba livia]
          Length = 640

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 147/370 (39%), Positives = 209/370 (56%), Gaps = 25/370 (6%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLG--EYGMNMETSNHI 58
           RP  +A+G   + + P  P +  P EG           AAG   LG   +G N   S  I
Sbjct: 123 RPEARAEGDAESPQLPARPLQ--PAEGA----------AAGQRPLGLETHGFNEALSERI 170

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S  R +P++R   C   +Y   LP ASVI+ FH+E +S+L+RTVHSI+   P   L++II
Sbjct: 171 SLRRDLPEVRHPLCLQQEYDSSLPTASVIICFHDEAWSTLLRTVHSIMDTAPKASLKDII 230

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 178
           LVDD S +  L   L +YI + +G V+LIR+ +R G+IR R  GA  + G+V+VF+D+HC
Sbjct: 231 LVDDLSQQGPLKSALSEYISKLDG-VKLIRSNKRLGVIRGRMLGAARATGDVLVFMDSHC 289

Query: 179 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE 238
           E    WL PLLA + S+R  +  PVID ID++T+++   Y     +RG+F+W + +    
Sbjct: 290 ECQKGWLEPLLARLSSNRNSVVSPVIDVIDWKTFQY---YHSVGLHRGVFDWKLDFHWEP 346

Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
           +PERE K R+    P +SP  AG + AMDR +F   G YD  + +WG EN ELS + W+C
Sbjct: 347 VPEREEKVRQSPISPIRSPVVAGAVVAMDRHYFQNTGAYDSDMTMWGAENLELSIRTWLC 406

Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
           GGS+E +PCSR+GHVYR+  P  F           I  N  R+ ETW     K  FY  +
Sbjct: 407 GGSVEIIPCSRVGHVYRNHFPRAFS------YEEAIVRNKIRIAETWLG-SFKDNFYKHD 459

Query: 359 PLAMFLDMGD 368
            +A  +   +
Sbjct: 460 TVAFLISKAE 469


>gi|355693388|gb|EHH27991.1| hypothetical protein EGK_18322, partial [Macaca mulatta]
          Length = 499

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 146/346 (42%), Positives = 203/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L +  +A G+    ++  N   S+ +S DR I D R   C    Y  DLP  SVI+
Sbjct: 12  KAYLLAKQLKA-GEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 70

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +R
Sbjct: 71  TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 125

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA  +   V+ FLD+HCEV   WLPP+L  +  D   +  P+ID I 
Sbjct: 126 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 185

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D+
Sbjct: 186 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 242

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF      
Sbjct: 243 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 297

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 298 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 341


>gi|13929126|ref|NP_113984.1| polypeptide N-acetylgalactosaminyltransferase 5 [Rattus norvegicus]
 gi|51315691|sp|O88422.1|GALT5_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
           AltName: Full=Polypeptide GalNAc transferase 5;
           Short=GalNAc-T5; Short=pp-GaNTase 5; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 5;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 5
 gi|3510639|gb|AAC69708.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase T5 [Rattus
           norvegicus]
 gi|149047792|gb|EDM00408.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 5, isoform CRA_a
           [Rattus norvegicus]
 gi|149047793|gb|EDM00409.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 5, isoform CRA_a
           [Rattus norvegicus]
          Length = 930

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 146/353 (41%), Positives = 215/353 (60%), Gaps = 12/353 (3%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
            PG+ G+   +P   +   +    E   N+  S+ I  DR I D R   C       DLP
Sbjct: 427 APGQFGRPVVVPPGKKKEAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNDLP 486

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ F +E +S+L+R+VHS++ R+P   ++EI+LVDDFS+K  L   L+ Y+ +F  
Sbjct: 487 TTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKANLDKYMSQF-P 545

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
           KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y +RK +  P
Sbjct: 546 KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLNRKKVACP 605

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPYKSPTHAG 261
           VI+ I+ +   + +V   D+  RG+F W M +    +P +  AK     ++  + P  AG
Sbjct: 606 VIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWRTIPPDVIAKNGIKETDIIRCPVMAG 662

Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
           GLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++R+  PY+
Sbjct: 663 GLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIFRNDNPYS 722

Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
           F K  DR+K   +  N  RV E W DE +K  FY      +   LD+G++++Q
Sbjct: 723 FPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 770


>gi|195380503|ref|XP_002049010.1| GJ21354 [Drosophila virilis]
 gi|194143807|gb|EDW60203.1| GJ21354 [Drosophila virilis]
          Length = 693

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 158/385 (41%), Positives = 217/385 (56%), Gaps = 31/385 (8%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAY----RAAGDASLGEYGMNMETSN 56
           +P  K D K   L+ P+    + PGE GK   LP+      + A D    +   N   S+
Sbjct: 134 KPPPKEDDK-SVLDAPVANLNDNPGELGKPVILPKDMPIDMKKAVDDGWTKNAFNQYVSD 192

Query: 57  HISFDRTIPDLRMEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
            IS  R++PD R   CK    Y  +LPK  VI+ FHNE +S L+RTVHS++ R+P + + 
Sbjct: 193 LISVHRSLPDPRDAWCKDSARYLSNLPKTDVIICFHNEAWSVLLRTVHSVLDRSPPELIG 252

Query: 116 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
           +IILVDD+S    L ++LEDY   +   V+++R  +REGLIR R  GAK ++  VI +LD
Sbjct: 253 QIILVDDYSDMPHLKKQLEDYFASY-PMVQIVRGPQREGLIRARLLGAKYAKSPVITYLD 311

Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIF 228
           +HCE    WL PLL  I  +   +  PVID ID  T EF        HYR       G F
Sbjct: 312 SHCECAEGWLEPLLDRIARNSTTVVCPVIDVIDDTTLEF--------HYRDSSGVNVGGF 363

Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
           +W + +  + +PERE ++    +EP  SPT AGGLF++DR FF  LG YD G  +WGGEN
Sbjct: 364 DWNLQFSWHAVPEREKRRHNNTAEPVYSPTMAGGLFSIDREFFERLGTYDSGFDIWGGEN 423

Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
            ELSFK WMCGG++E VPCS +GH++R   PY +     R    ++  N  R+ E W D+
Sbjct: 424 LELSFKTWMCGGTLEIVPCSHVGHIFRKRSPYKW-----RTGVNVLKKNSVRLAEVWMDD 478

Query: 349 KHKAYFYTREPLAMFL-DMGDISEQ 372
             K Y    + + M   D GD+SE+
Sbjct: 479 YSKYYL---QRIGMDKGDYGDVSER 500


>gi|311275140|ref|XP_003134592.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5-like
           [Sus scrofa]
          Length = 446

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 138/326 (42%), Positives = 200/326 (61%), Gaps = 11/326 (3%)

Query: 47  EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
           +YG N   S  +   R +PD R + C    YP +LP AS+I+ FHNE F++L+RTV SI+
Sbjct: 103 KYGFNHIVSKSLGNYRNVPDSRNKMCHQKHYPANLPTASIIICFHNEEFNALLRTVSSIM 162

Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
             TP   +EEIILVDD S   DL +KL+ +++ F GK+++IRN +REGLIR R  GA  +
Sbjct: 163 TLTPHHIIEEIILVDDMSEYDDLKEKLDYHLEIFRGKIKVIRNKKREGLIRARLVGASRA 222

Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
            G+++VFLD+HCEV   WL PLL  I  D K++  P++D IDY T E    Y+P    RG
Sbjct: 223 SGDILVFLDSHCEVNKIWLEPLLDAIVKDPKMVVCPIMDVIDYVTLE----YKPSPVVRG 278

Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
           +F W + ++ + +   E       + P +SP   GGLFA+ R +F E+G YD G+ +WGG
Sbjct: 279 VFNWHLQFEWDRVFSYEMDGPDGPTRPIRSPAMVGGLFAIHRHYFNEIGQYDKGMNLWGG 338

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
           EN ELS +IWMCGG +  +PCSR+GH+ + +   N G++        + YN  R++  W 
Sbjct: 339 ENLELSLRIWMCGGQLFLLPCSRVGHINKPYFT-NQGEIKKA-----MAYNNLRIVHVWL 392

Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
           DE +K  F+ + P    L  G++SE+
Sbjct: 393 DE-YKEQFFLQNPRLKSLAYGNVSER 417


>gi|219804492|ref|NP_001137331.1| polypeptide N-acetylgalactosaminyltransferase 5 [Bos taurus]
 gi|296490560|tpg|DAA32673.1| TPA: polypeptide N-acetylgalactosaminyltransferase 5 [Bos taurus]
          Length = 940

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 150/372 (40%), Positives = 220/372 (59%), Gaps = 16/372 (4%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           V + D  L   +P      + PG+ G+   +P       +    E   N+  S+ I  DR
Sbjct: 423 VLRIDATLSPRDP------KAPGQFGRPVVVPHGKEKEVERRWKEGNFNVYLSDLIPVDR 476

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
            I D R   C       +LP  S+I+ F +E +S+L+R+VHS++ R+P   ++EI+LVDD
Sbjct: 477 AIEDTRPAGCAEQLVHNNLPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDD 536

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
           FS+K  L   L+ Y+ +F  KVR++R  ER GLIR R  GA+++ G+V+ FLD+H E  +
Sbjct: 537 FSTKDYLKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQKATGDVLTFLDSHVECNI 595

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-E 241
            WL PLL  +Y  RK +  PVI+ I+ +   + +V   D+  RGIF W M +    +P +
Sbjct: 596 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPD 652

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
             AK +   ++  + P  AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG 
Sbjct: 653 VVAKNKIKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGE 712

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE-KHKAYFYTREPL 360
           IE VPCSR+GH++R+  PY+F K  DR+K   +  N  RV E W DE K   Y +    +
Sbjct: 713 IEIVPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLGRVAEVWLDEYKELFYGHGNHLI 768

Query: 361 AMFLDMGDISEQ 372
              LD+G++++Q
Sbjct: 769 DQGLDVGNLTQQ 780


>gi|355750550|gb|EHH54877.1| hypothetical protein EGM_03977 [Macaca fascicularis]
          Length = 940

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 150/360 (41%), Positives = 217/360 (60%), Gaps = 14/360 (3%)

Query: 16  PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
           P +P  + PG+ G+   +P       +    E   N+  S+ I  DR I D R   C   
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQ 489

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
               +LP  SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LVDDFS+K  L   L+ 
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIKEILLVDDFSTKDYLKDNLDK 549

Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           Y+ +F  KVR++   ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y  
Sbjct: 550 YMSQF-PKVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
           RK +  PVI+ I+ +   + +V   D+  RGIF W M +    +P +  AK R   ++  
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDAI 665

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           K P  AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 KCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
           R+  PY+F K  DR+K   +  N  RV E W DE +K  FY      +   LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 780


>gi|395838452|ref|XP_003792129.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5
           [Otolemur garnettii]
          Length = 869

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 142/328 (43%), Positives = 200/328 (60%), Gaps = 11/328 (3%)

Query: 45  LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
           L +YG N  TS ++ F R +PD R + C    Y   LP ASVI+ FHNE F++L RT+ S
Sbjct: 283 LSKYGFNTITSTNVGFKREVPDTRHKMCLQNHYSTHLPTASVIICFHNEEFNALFRTMFS 342

Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           ++  TP   LEEIILVDD S   DL +KL+  ++ F GK++LIRN +REGLIR R  GA 
Sbjct: 343 VVNLTPNSLLEEIILVDDMSEFDDLKEKLDYVLEVFRGKIKLIRNQKREGLIRGRMIGAA 402

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            + G+V+VFLD+HCEV   WL PLL  I  D K++  P+ID ID  T E+R+        
Sbjct: 403 RASGDVLVFLDSHCEVNKGWLEPLLYSIAKDHKMVVCPLIDVIDETTLEYRA----SPVV 458

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RG F+W + +K + +   E        +P +SP  AGG+FA+ R +F E+G YD G+ +W
Sbjct: 459 RGAFDWELKFKWDNVFSYEMDGPDRPIKPIRSPAMAGGIFAIYRHYFNEIGQYDKGMDLW 518

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
           GGEN ELS +IWMCGG +  +PCSR+GH+ +      F +++   +    T N  R++  
Sbjct: 519 GGENLELSLRIWMCGGQLFIIPCSRVGHITKK----QFKEVSAITRA--FTRNSLRMVHV 572

Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           W DE +K  F+ R+P    +  G+ISE+
Sbjct: 573 WLDE-YKEQFFLRKPGLRSIAYGNISER 599


>gi|327270185|ref|XP_003219870.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
           [Anolis carolinensis]
          Length = 592

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 160/378 (42%), Positives = 228/378 (60%), Gaps = 31/378 (8%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHI 58
           RPV++        +PPL    E  GE G+A    L E+     + S+  + +N+  S+ I
Sbjct: 70  RPVYE--------KPPLGRETE-LGELGRAARLELSESELRRQEESVALHQINVYLSDRI 120

Query: 59  SFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 116
           S  R +P+ R  +C  K +DY  +LPK SVI+ F+NE +S+L+RTVHS+++ +P   LEE
Sbjct: 121 SLHRRLPERRHPQCTEKRYDY-YNLPKTSVIIAFYNEAWSTLLRTVHSVLETSPDILLEE 179

Query: 117 IILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 176
           IILVDD+S K  L +KLE+Y+     KVRLIR  +REGL+R R  GA  ++G+V+ FLD 
Sbjct: 180 IILVDDYSDKEHLKEKLENYVANLR-KVRLIRANKREGLVRARLLGASIAKGDVLTFLDC 238

Query: 177 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYK 235
           HCE    WL PLL  I  +   +  PVID ID+ T+E+  +  EP     G F+W +++ 
Sbjct: 239 HCECHEEWLEPLLERIKEEPSAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDWRLVFT 295

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 295
            + +PERE K+R+  ++  +SPT AGGLFA+++ +F  LG YD G+ VWGGEN E SF+I
Sbjct: 296 WHVVPEREQKQRRSKTDVIRSPTMAGGLFAVNKNYFSYLGSYDTGMEVWGGENLEFSFRI 355

Query: 296 WMCGGSIEWVPCSRIGHVYRSFMPYNFGK-LADRVKGPLITYNYKRVIETWFDEKHKAYF 354
           W CGGS+E  PCS +GHV+    PY+  K LA+ V          R  E W D  +K  +
Sbjct: 356 WQCGGSLEIHPCSHVGHVFPKQAPYSRAKALANSV----------RAAEVWMD-SYKELY 404

Query: 355 YTREPLAMFLDMGDISEQ 372
           Y R P A     GD++E+
Sbjct: 405 YHRNPHARMEPYGDVTER 422


>gi|195488108|ref|XP_002092174.1| GE14045 [Drosophila yakuba]
 gi|194178275|gb|EDW91886.1| GE14045 [Drosophila yakuba]
          Length = 684

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 151/357 (42%), Positives = 207/357 (57%), Gaps = 28/357 (7%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           ++PP   ++E PGE GK   LP    E  + A D    +   N   S+ IS  RT+PD R
Sbjct: 136 IDPPAN-FEENPGELGKPVRLPKEMSEEMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 194

Query: 69  MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
              CK    Y  +LPK  VI+ FHNE ++ L+RTVHS++ R+P   + +IILVDD+S   
Sbjct: 195 DAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 254

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L ++LEDY   +  KV++IR  +REGLIR R  GA  ++  V+ +LD+HCE    WL P
Sbjct: 255 HLKRQLEDYFAAY-PKVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 313

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
           LL  I  +   +  PVID I+ +T E+        HYR       G F+W + +  + +P
Sbjct: 314 LLDRIARNSSTVVCPVIDVINDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 365

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
           ERE K+    +EP  SPT AGGLF++DR FF  LG YD G  +WGGEN ELSFK WMCGG
Sbjct: 366 ERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGG 425

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           ++E VPCS +GH++R   PY +     R    ++  N  R+ E W DE +  Y+Y R
Sbjct: 426 TLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKKNSVRLAEVWMDE-YSQYYYHR 476


>gi|194855550|ref|XP_001968569.1| GG24947 [Drosophila erecta]
 gi|190660436|gb|EDV57628.1| GG24947 [Drosophila erecta]
          Length = 659

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 149/328 (45%), Positives = 205/328 (62%), Gaps = 16/328 (4%)

Query: 49  GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
           G N   S+ IS +R++PD+R+E CK   Y   LP  SV+ VF NE F++L+R+++S+I R
Sbjct: 160 GFNGLISDRISVNRSVPDVRLEACKTRKYLAKLPNISVVFVFFNEHFNTLLRSMYSVINR 219

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESR 167
           TP + L++I+LVDD S    L Q L+DY+Q+ F   V ++ + ER+GLI  R  GAK + 
Sbjct: 220 TPPELLKQIVLVDDGSEWDSLKQPLDDYVQQHFPHLVTVVHSPERQGLIGARIAGAKVAV 279

Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
           GEV+VF D+H EV  NWLPPL+ PI  + KI T P++D I ++  +F          RG 
Sbjct: 280 GEVMVFFDSHIEVNYNWLPPLIEPIAINPKICTCPIVDSISHE--DFSYFGGNKDGTRGG 337

Query: 228 FEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
           F+W MLYK+   LPE    K    S+PY+SP   GGLFA++  FF +LGGYD  L +WGG
Sbjct: 338 FDWKMLYKQLPVLPEDALDK----SQPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGG 393

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG-PLITYNYKRVIETW 345
           E +ELSFKIWMCGG +  VPCSR+ H++R  M     K     +G   +  N+KRV E W
Sbjct: 394 EQYELSFKIWMCGGMLLDVPCSRVAHIFRGPM-----KPRGNPRGHNFVAKNHKRVAEVW 448

Query: 346 FDEKHKAYFYTREPLAM-FLDMGDISEQ 372
            DE +K Y Y R+P     +D GD++ Q
Sbjct: 449 MDE-YKQYVYNRDPTTYDNVDAGDLTRQ 475


>gi|397525624|ref|XP_003832760.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Pan
           paniscus]
          Length = 940

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 149/360 (41%), Positives = 217/360 (60%), Gaps = 14/360 (3%)

Query: 16  PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
           P +P  + PG+ G+   +P       +    E   N+  S+ I  DR I D R   C   
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQ 489

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
               +LP  SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LVDDFS+K  L   L+ 
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDK 549

Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           Y+ +F  KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y  
Sbjct: 550 YMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
           RK +  PVI+ I+ +   + +V   D+  RGIF W M +    +P +  AK R   ++  
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDTI 665

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           + P  AGGLF++ +++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 RCPVMAGGLFSIHKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
           R+  PY+F K  DR+K   +  N  RV E W DE +K  FY      +   LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 780


>gi|326911650|ref|XP_003202170.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Meleagris gallopavo]
          Length = 579

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 153/359 (42%), Positives = 208/359 (57%), Gaps = 20/359 (5%)

Query: 19  PYKEGPGEGGKAYHL---PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
           P    PGE GK   L   PE  +   +  + +Y +N+  S+ IS  R I D RM  CK  
Sbjct: 70  PDSYAPGEWGKPTRLQLSPEEKKQEAEL-IDKYAINIYLSDKISLHRHIEDNRMSGCKTK 128

Query: 76  DYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 134
            Y    LP  SV++ F+NE +S+L+RTVHS+++ +P+  L+EIILVDD S K  L   LE
Sbjct: 129 SYNYRKLPTTSVVIAFYNEAWSTLLRTVHSVLETSPSVLLKEIILVDDLSDKVYLKTDLE 188

Query: 135 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
            YI     +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE    WL PLL  I  
Sbjct: 189 KYISSLK-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECVSGWLEPLLERIAE 247

Query: 195 DRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEP 253
           +  ++  PVID ID+ T+E+     EP     G F+W + ++ + +P+ E  +RK  ++P
Sbjct: 248 NETVVICPVIDTIDWNTFEYYMQSAEP---MIGGFDWRLTFQWHSVPKHERLRRKSETDP 304

Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
            +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG +E  PCS +GHV
Sbjct: 305 IRSPTMAGGLFAVSKKYFEYLGTYDTGMDVWGGENLELSFRVWQCGGMLEIHPCSHVGHV 364

Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +    PY           P    N  R  E W DE +K +FY R P A   + GDISE+
Sbjct: 365 FPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKENYGDISER 413


>gi|440896773|gb|ELR48609.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Bos grunniens
           mutus]
          Length = 940

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 151/373 (40%), Positives = 221/373 (59%), Gaps = 18/373 (4%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           V + D  L   +P      + PG+ G+   +P       +    E   N+  S+ I  DR
Sbjct: 423 VLRIDATLSPRDP------KAPGQFGRPVVVPHGKEKEVERRWKEGNFNVYLSDLIPVDR 476

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
            I D R   C       +LP  S+I+ F +E +S+L+R+VHS++ R+P   ++EI+LVDD
Sbjct: 477 AIEDTRPAGCAEQLVHNNLPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDD 536

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
           FS+K  L   L+ Y+ +F  KVR++R  ER GLIR R  GA+++ G+V+ FLD+H E  +
Sbjct: 537 FSTKDYLKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQKATGDVLTFLDSHVECNI 595

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-E 241
            WL PLL  +Y  RK +  PVI+ I+ +   + +V   D+  RGIF W M +    +P +
Sbjct: 596 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPD 652

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
             AK +   ++  + P  AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG 
Sbjct: 653 VVAKNKIKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGE 712

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EP 359
           IE VPCSR+GH++R+  PY+F K  DR+K   +  N  RV E W DE +K  FY      
Sbjct: 713 IEIVPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLGRVAEVWLDE-YKELFYGHGDHL 767

Query: 360 LAMFLDMGDISEQ 372
           +   LD+G++++Q
Sbjct: 768 IDQGLDVGNLTQQ 780


>gi|291225677|ref|XP_002732827.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Saccoglossus kowalevskii]
          Length = 633

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 144/332 (43%), Positives = 199/332 (59%), Gaps = 11/332 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D    ++  N   S+ I F R +PD R   C Y  Y  +LP  SV++ F NE +S+L+RT
Sbjct: 137 DEGYQQHAFNQLISDRIGFHRGLPDTRNGLCAYQVYSNNLPSTSVVICFFNEAWSTLLRT 196

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRS 160
           V+S+I R+PA  L EIILVDD+SS   L   L+D+I+      V++I N +REGLIR R 
Sbjct: 197 VYSVIDRSPANLLHEIILVDDYSSSTYLKDYLDDFIKTNLFQIVKIIHNKKREGLIRARM 256

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA  + G+V++FLD+HCEV   WL PLL  I  D   +  P+ID I+  T+E    Y+ 
Sbjct: 257 IGAAAATGDVVMFLDSHCEVSTQWLEPLLERIKFDPHTVVCPIIDIINADTFE----YQQ 312

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P  + K ++   +P +SPT AGGLFAMDR +F ELG YD G
Sbjct: 313 SPLVRGGFNWGLHFKWDTIPSSQFKGKEDYIKPVRSPTMAGGLFAMDRKYFHELGEYDDG 372

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IW CGG++E +PCSR+GHV+R   PY      D      ++ N  R
Sbjct: 373 MDIWGGENLEISFRIWQCGGTLEIIPCSRVGHVFRKRRPYGSPNGED-----TMSKNSLR 427

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           V   W DE  + YF  ++      D GDIS +
Sbjct: 428 VAHVWMDEYKEHYFELKKD-NRNKDYGDISSR 458


>gi|338724473|ref|XP_001495495.2| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5-like
           [Equus caballus]
          Length = 448

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 141/328 (42%), Positives = 198/328 (60%), Gaps = 11/328 (3%)

Query: 45  LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
              YG N   S  +  +R +PD R + C    YP  LP AS+++ FHNE F++L+RTV S
Sbjct: 102 FSRYGFNAMISQRLGNEREVPDTRNKMCLQKHYPTRLPSASIVICFHNEEFNALLRTVSS 161

Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           ++K TP + LEEIILVDD S   DL +KL+ +++ F GK++LIRN ++EGLIR R  GA 
Sbjct: 162 VMKLTPYRVLEEIILVDDMSEFDDLKEKLDHHLEFFRGKIKLIRNKKKEGLIRARLIGAS 221

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            + G+V+VFLD+HCEV   WL PLL  I  D K++  P+ID IDY T +    Y+P    
Sbjct: 222 LASGDVLVFLDSHCEVNKVWLEPLLLAIAKDPKMVVCPLIDVIDYMTLK----YKPSPVV 277

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RG F W + +K + +   E    +    P +SP  AGG+FA+DR +F E+G YD  + +W
Sbjct: 278 RGAFNWHLQFKWDNVFSYEMDGPEGPIAPIRSPAMAGGIFAIDRQYFNEIGRYDKDMNLW 337

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
           GGEN ELS +IWMCGG +  +PCSR+GH+ +  +         R     +TYN  R++  
Sbjct: 338 GGENLELSLRIWMCGGQLFVLPCSRVGHIDKQRIE------NKREYLKAMTYNNLRMVHV 391

Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           W DE HK   + R P    +  G+ISE+
Sbjct: 392 WLDE-HKEQVFLRRPGLKSVAYGNISER 418


>gi|403272081|ref|XP_003927917.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Saimiri
           boliviensis boliviensis]
          Length = 578

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 151/361 (41%), Positives = 205/361 (56%), Gaps = 16/361 (4%)

Query: 14  EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
           +PP + +  G        HL E      +  +  Y +N+  S+ IS  R I D RM ECK
Sbjct: 66  KPPADSHALGEWGKASKLHLNEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECK 125

Query: 74  YWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
              +    LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EIILVDD S +  L  +
Sbjct: 126 SKKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQ 185

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE    WL PLL  I
Sbjct: 186 LETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERI 244

Query: 193 YSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 251
             D   +  PVID ID+ T+EF     EP     G F+W + ++ + +P+ E  +R    
Sbjct: 245 GRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKYERDRRISRI 301

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
           +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG +E  PCS +G
Sbjct: 302 DPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVG 361

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+    PY           P    N  R  E W DE +K +FY R P A     GDISE
Sbjct: 362 HVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDISE 411

Query: 372 Q 372
           +
Sbjct: 412 R 412


>gi|391342054|ref|XP_003745339.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like [Metaseiulus occidentalis]
          Length = 641

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 151/358 (42%), Positives = 205/358 (57%), Gaps = 18/358 (5%)

Query: 22  EGPGEGGKAYHLPEAYRAAG----DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
             PGE GK   +P           D        N   S+ IS  R++PD+R   CK   +
Sbjct: 136 NAPGENGKGVIVPTNLTGDAKRRLDIGWQNNAFNQYASDMISLHRSLPDMRDPGCKTQKF 195

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             DLP+ SVI+ FHNE +S LMRTVHS+I R+P   L+EIILVDDFS    L ++LEDY 
Sbjct: 196 RRDLPQTSVIICFHNEAWSVLMRTVHSVIDRSPKNLLKEIILVDDFSDMKHLKEQLEDYT 255

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
           ++  G V+++R ++REGLIR R  GAK +   V+ +LD+HCE    WL PLL  I     
Sbjct: 256 RKL-GIVKIVRASKREGLIRARLLGAKFATAPVLTYLDSHCECSTGWLEPLLDRIAEADT 314

Query: 198 IMTVPVIDGIDYQTWEF---RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
            +  PVID I   T+E+   R+ Y  +    G F+W + +  + LP+R+   RK +    
Sbjct: 315 NVVCPVIDVISDSTFEYPHRRAGYTVN---VGGFDWNLQFSWHSLPQRDKDARKQSWSAV 371

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
            SPT AGGLF++ +A+F +LG YD G  +WG EN ELSFK+WMCGG +E VPCS +GHV+
Sbjct: 372 PSPTMAGGLFSISKAYFEKLGLYDSGFDIWGAENLELSFKVWMCGGRLEIVPCSHVGHVF 431

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           R   PY + K  + +K      N  R+ + W DE  + YF    P     D GDISE+
Sbjct: 432 RKRSPYKWLKGVNVLKK-----NSVRLAKVWMDEYAQYYFDRIGP--DLGDYGDISER 482


>gi|194865210|ref|XP_001971316.1| GG14889 [Drosophila erecta]
 gi|190653099|gb|EDV50342.1| GG14889 [Drosophila erecta]
          Length = 666

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 159/352 (45%), Positives = 214/352 (60%), Gaps = 13/352 (3%)

Query: 23  GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           G GE GKA  L  E+ R        E G N   S+ IS +R++PD+R   C   +Y   L
Sbjct: 142 GLGEKGKAASLDDESQRDLEKRMSLENGFNALLSDSISVNRSLPDIRHPLCHKKEYVTKL 201

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI++F+NE  S LMR+VHS+I R+P + ++EIILVDD S +  L ++LE YI    
Sbjct: 202 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVDDHSDREYLGKELETYIAEHF 261

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
             VR++R  +R GLI  R+ GA+ +  EV++FLD+H E   NWLPPLL PI  +++    
Sbjct: 262 KWVRVVRLPKRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 321

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
           P ID ID+  + +R+    D   RG F+W   YK   L + +    K+ ++P+KSP  AG
Sbjct: 322 PFIDVIDHSNFNYRA---QDEGARGAFDWEFFYKRLPLLKDDL---KHPADPFKSPIMAG 375

Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
           GLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG +   PCSRIGH+YR   P N
Sbjct: 376 GLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRIGHIYRG--PRN 433

Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR-EPLAMFLDMGDISEQ 372
                 R  G  +  NYKRV E W DE +K Y Y+  + +   +D GD++EQ
Sbjct: 434 HQPSPRR--GDYLHRNYKRVAEVWMDE-YKNYLYSHGDGVYESVDPGDLTEQ 482


>gi|344235750|gb|EGV91853.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Cricetulus griseus]
          Length = 797

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 145/343 (42%), Positives = 202/343 (58%), Gaps = 22/343 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY +A     GE     +  N   S+ +S DR I D R   C    Y LDLP  SVI+ 
Sbjct: 55  KAYLSAKQLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSLSYSLDLPATSVIIT 114

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 115 FHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 169

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
            +REGLIR+R RGA  +   V+ FLD+HCEV + WL P+L  +  D   +  P+ID I  
Sbjct: 170 DKREGLIRSRVRGADVAGATVLTFLDSHCEVNIEWLQPMLQRVMEDHTRVVSPIIDVISL 229

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D++
Sbjct: 230 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKS 286

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 287 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 340

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++
Sbjct: 341 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVA 382


>gi|357619954|gb|EHJ72323.1| putative UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           [Danaus plexippus]
          Length = 533

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 148/333 (44%), Positives = 197/333 (59%), Gaps = 23/333 (6%)

Query: 33  LPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY-WDYPLDLPKASVILVFH 91
           + E  + A      +   N   S+ IS  RT+PD R E CK    Y  DLP+ SV++ FH
Sbjct: 1   MSEDAKLAVSEGWKKNAFNQYASDLISIRRTLPDPRDEWCKQPGRYLEDLPQTSVVICFH 60

Query: 92  NEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTE 151
           NE +S L+RTVHS+I R+PA  ++EIILVDDFS    L Q+L+DY+     KVR++R T+
Sbjct: 61  NEAWSVLLRTVHSVIDRSPAHLIKEIILVDDFSDMPHLMQQLDDYMSSL-PKVRIVRATQ 119

Query: 152 REGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQT 211
           REGLIR R  GAK     V+ +LD+HCE    WL PLL  I  ++  +  PVID ID  T
Sbjct: 120 REGLIRARLLGAKYVTAPVLTYLDSHCECTEGWLEPLLDRIARNKTNVVCPVIDVIDDNT 179

Query: 212 WEFRSVYEPDHHYR-------GIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLF 264
            E+        HYR       G F+W + +  + +P RE  + K+ +EP  SPT AGGLF
Sbjct: 180 LEY--------HYRDSTSVNVGGFDWNLQFNWHPVPARERARHKHTAEPVWSPTMAGGLF 231

Query: 265 AMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGK 324
           A+D+ FF  LG YD G  +WGGEN ELSFK WMCGG++E VPCS +GH++R   PY +  
Sbjct: 232 AIDKEFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKRSPYKW-- 289

Query: 325 LADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
              R    ++  N  R+ E W D+  K Y+Y R
Sbjct: 290 ---RTGVNVLKKNSVRLAEVWLDDYSK-YYYQR 318


>gi|34452725|ref|NP_003765.2| polypeptide N-acetylgalactosaminyltransferase 4 [Homo sapiens]
 gi|338817878|sp|Q8N4A0.2|GALT4_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
           AltName: Full=Polypeptide GalNAc transferase 4;
           Short=GalNAc-T4; Short=pp-GaNTase 4; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 4;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 4
 gi|119617834|gb|EAW97428.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Homo
           sapiens]
          Length = 578

 Score =  270 bits (691), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 157/376 (41%), Positives = 214/376 (56%), Gaps = 28/376 (7%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHI 58
           RP++K        +PP +      GE GKA    L E      +  +  Y +N+  S+ I
Sbjct: 61  RPLYK--------KPPAD--SRALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRI 110

Query: 59  SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           S  R I D RM ECK   +    LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EI
Sbjct: 111 SLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEI 170

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
           ILVDD S +  L  +LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FLD H
Sbjct: 171 ILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCH 229

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKE 236
           CE    WL PLL  I  D   +  PVID ID+ T+EF   + EP     G F+W + ++ 
Sbjct: 230 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIGEP---MIGGFDWRLTFQW 286

Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
           + +P++E  +R    +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W
Sbjct: 287 HSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVW 346

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
            CGG +E  PCS +GHV+    PY           P    N  R  E W DE +K +FY 
Sbjct: 347 QCGGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYN 396

Query: 357 REPLAMFLDMGDISEQ 372
           R P A     GDISE+
Sbjct: 397 RNPPARKEAYGDISER 412


>gi|332839987|ref|XP_003313889.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pan
           troglodytes]
 gi|397505857|ref|XP_003823459.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pan
           paniscus]
 gi|410207422|gb|JAA00930.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410252142|gb|JAA14038.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410252144|gb|JAA14039.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410252146|gb|JAA14040.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410252148|gb|JAA14041.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410252150|gb|JAA14042.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410289758|gb|JAA23479.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410355493|gb|JAA44350.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410355495|gb|JAA44351.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
          Length = 578

 Score =  270 bits (690), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 153/374 (40%), Positives = 210/374 (56%), Gaps = 24/374 (6%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
           RP++K        +PP + +  G         L E      +  +  Y +N+  S+ IS 
Sbjct: 61  RPLYK--------KPPADSHALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRISL 112

Query: 61  DRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
            R I D RM ECK   +    LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EIIL
Sbjct: 113 HRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIIL 172

Query: 120 VDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
           VDD S +  L  +LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE
Sbjct: 173 VDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231

Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENE 238
               WL PLL  I  D   +  PVID ID+ T+EF     EP     G F+W + ++ + 
Sbjct: 232 CNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHS 288

Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
           +P++E  +R    +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W C
Sbjct: 289 VPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQC 348

Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
           GG +E  PCS +GHV+    PY           P    N  R  E W DE +K +FY R 
Sbjct: 349 GGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRN 398

Query: 359 PLAMFLDMGDISEQ 372
           P A     GDISE+
Sbjct: 399 PPARKEAYGDISER 412


>gi|240120031|ref|NP_766039.2| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
 gi|240120034|ref|NP_001155239.1| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
 gi|240120036|ref|NP_001155240.1| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
 gi|51315988|sp|Q8C7U7.1|GALT6_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
           AltName: Full=Polypeptide GalNAc transferase 6;
           Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 6;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 6
 gi|26339910|dbj|BAC33618.1| unnamed protein product [Mus musculus]
 gi|74196150|dbj|BAE32989.1| unnamed protein product [Mus musculus]
 gi|74198297|dbj|BAE35316.1| unnamed protein product [Mus musculus]
 gi|111601267|gb|AAI19325.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 [Mus musculus]
 gi|111601271|gb|AAI19327.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 [Mus musculus]
          Length = 622

 Score =  270 bits (690), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 154/370 (41%), Positives = 216/370 (58%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+   E         +    ++  N   S+ IS  R++ PD R  
Sbjct: 106 PPQDP--NSPGADGKAFQKKEWTNLETKEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  LP  SVI+VFHNE +S+L+RTV+S++  +PA  L+EIILVDD S+  
Sbjct: 164 ECVDQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASTDE 222

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L ++LE Y+Q+    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 223 HLKERLEQYVQQLQ-IVRVVRQRERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D+  +  P I  ID  T++F R V     H RG F+W + +    LPE E ++
Sbjct: 282 LLARIAEDKTAVVSPDIVTIDLNTFQFSRPVQRGKAHSRGNFDWSLTFGWEMLPEHEKQR 341

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +A+F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL-- 364
           CS +GHV+R+  P+ F K        +I  N  R+ E W D+ +K  FY R   A  +  
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDD-YKKIFYRRNLQAAKMVQ 455

Query: 365 --DMGDISEQ 372
             + GDISE+
Sbjct: 456 ENNFGDISER 465


>gi|22137798|gb|AAH36390.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Homo
           sapiens]
 gi|123981562|gb|ABM82610.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
           [synthetic construct]
 gi|123996387|gb|ABM85795.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
           [synthetic construct]
 gi|124000643|gb|ABM87830.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
           [synthetic construct]
 gi|157928222|gb|ABW03407.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
           [synthetic construct]
          Length = 578

 Score =  270 bits (690), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 157/376 (41%), Positives = 214/376 (56%), Gaps = 28/376 (7%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHI 58
           RP++K        +PP +      GE GKA    L E      +  +  Y +N+  S+ I
Sbjct: 61  RPLYK--------KPPAD--SRALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRI 110

Query: 59  SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           S  R I D RM ECK   +    LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EI
Sbjct: 111 SLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEI 170

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
           ILVDD S +  L  +LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FLD H
Sbjct: 171 ILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCH 229

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKE 236
           CE    WL PLL  I  D   +  PVID ID+ T+EF   + EP     G F+W + ++ 
Sbjct: 230 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIGEP---MIGGFDWRLTFQW 286

Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
           + +P++E  +R    +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W
Sbjct: 287 HSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVW 346

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
            CGG +E  PCS +GHV+    PY           P    N  R  E W DE +K +FY 
Sbjct: 347 QCGGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYN 396

Query: 357 REPLAMFLDMGDISEQ 372
           R P A     GDISE+
Sbjct: 397 RNPPARKEAYGDISER 412


>gi|332221068|ref|XP_003259680.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
           1 [Nomascus leucogenys]
          Length = 578

 Score =  270 bits (690), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 153/374 (40%), Positives = 210/374 (56%), Gaps = 24/374 (6%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
           RP++K        +PP + +  G         L E      +  +  Y +N+  S+ IS 
Sbjct: 61  RPLYK--------KPPADSHALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRISL 112

Query: 61  DRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
            R I D RM ECK   +    LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EIIL
Sbjct: 113 HRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIIL 172

Query: 120 VDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
           VDD S +  L  +LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE
Sbjct: 173 VDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231

Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENE 238
               WL PLL  I  D   +  PVID ID+ T+EF     EP     G F+W + ++ + 
Sbjct: 232 CNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHS 288

Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
           +P++E  +R    +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W C
Sbjct: 289 VPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQC 348

Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
           GG +E  PCS +GHV+    PY           P    N  R  E W DE +K +FY R 
Sbjct: 349 GGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRN 398

Query: 359 PLAMFLDMGDISEQ 372
           P A     GDISE+
Sbjct: 399 PPARKEAYGDISER 412


>gi|315221121|ref|NP_001186710.1| POC1B-GALNT4 protein isoform 1 [Homo sapiens]
          Length = 575

 Score =  270 bits (690), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 157/376 (41%), Positives = 214/376 (56%), Gaps = 28/376 (7%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHI 58
           RP++K        +PP +      GE GKA    L E      +  +  Y +N+  S+ I
Sbjct: 58  RPLYK--------KPPAD--SRALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRI 107

Query: 59  SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           S  R I D RM ECK   +    LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EI
Sbjct: 108 SLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEI 167

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
           ILVDD S +  L  +LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FLD H
Sbjct: 168 ILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCH 226

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKE 236
           CE    WL PLL  I  D   +  PVID ID+ T+EF   + EP     G F+W + ++ 
Sbjct: 227 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIGEP---MIGGFDWRLTFQW 283

Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
           + +P++E  +R    +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W
Sbjct: 284 HSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVW 343

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
            CGG +E  PCS +GHV+    PY           P    N  R  E W DE +K +FY 
Sbjct: 344 QCGGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYN 393

Query: 357 REPLAMFLDMGDISEQ 372
           R P A     GDISE+
Sbjct: 394 RNPPARKEAYGDISER 409


>gi|395820104|ref|XP_003783415.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
           [Otolemur garnettii]
          Length = 582

 Score =  270 bits (690), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 153/352 (43%), Positives = 203/352 (57%), Gaps = 18/352 (5%)

Query: 25  GEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD-L 81
           GE GKA    L E      +  +  Y +N+  S+ IS  R I D RM ECK   +    L
Sbjct: 79  GEWGKASKLQLNEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSKKFNYRRL 138

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EIILVDD S +  L  +LE YI    
Sbjct: 139 PTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLE 198

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
            +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE    WL PLL  I  D   +  
Sbjct: 199 -RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGRDETAVVC 257

Query: 202 PVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
           PVID ID+ T+EF     EP     G F+W + ++ + +P+ E  +RK   +P +SPT A
Sbjct: 258 PVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKSRIDPIRSPTMA 314

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG +E  PCS +GHV+    PY
Sbjct: 315 GGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPKRAPY 374

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                      P    N  R  E W DE +K +FY R P A     GDISE+
Sbjct: 375 ---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKETYGDISER 416


>gi|91089275|ref|XP_970398.1| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
           castaneum]
          Length = 586

 Score =  270 bits (690), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 152/349 (43%), Positives = 202/349 (57%), Gaps = 13/349 (3%)

Query: 14  EPPLEPYKEGPGEGGKAYHLPEAYRA----AGDASLGEYGMNMETSNHISFDRTIPDLRM 69
           +P L P     GE GK   LP    A      DA   +   N   S+ IS  R++PD R 
Sbjct: 72  KPVLLPPASNAGEMGKPVVLPSNLSADVKKLVDAGWQKNAFNQYVSDMISVHRSLPDPRD 131

Query: 70  EECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
           E CK    +   LP+ SVI+ FHNE +S L+RTVHS++ R+P+  ++E+ILVDDFS    
Sbjct: 132 EWCKAPGRFQEALPQTSVIICFHNEAWSVLLRTVHSVLDRSPSHLIKEVILVDDFSDMDH 191

Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
           L Q+L DY      KV++IR  +REGLIR R  GA  + GEV+ +LD+HCE    WL PL
Sbjct: 192 LKQQLVDYFAS-EPKVKIIRAKKREGLIRARLLGAAHAEGEVLTYLDSHCECTTGWLEPL 250

Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
           L  I  D   +  PVID ID  T E+   ++      G F+W + +  + +PE E K+ K
Sbjct: 251 LDRIARDPTTVVCPVIDVIDDTTLEYH-FHDSGGVNVGGFDWNLQFNWHAVPEHEKKRHK 309

Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
             +EP  SPT AGGLF++D+ FF  LG YD G  +WGGEN ELSFK WMCGG++E VPCS
Sbjct: 310 NPAEPVYSPTMAGGLFSIDKKFFERLGTYDNGFDIWGGENLELSFKTWMCGGTLEIVPCS 369

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
            +GH++R   PY +     R    ++  N  R+ E W DE  K Y+Y R
Sbjct: 370 HVGHIFRKRSPYKW-----RSGVNVLRRNSVRLAEVWLDEYAK-YYYQR 412


>gi|402865469|ref|XP_003896945.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5 [Papio
           anubis]
          Length = 475

 Score =  270 bits (690), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 139/331 (41%), Positives = 202/331 (61%), Gaps = 17/331 (5%)

Query: 45  LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
           L +YG N+  S  +  +R +PD R + C    YP  LP AS+++ FHNE F +L RTV S
Sbjct: 129 LLKYGFNVIISRSLGIEREVPDTRNKMCLQKHYPARLPTASIVICFHNEEFHALFRTVSS 188

Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           ++  TP  +LEEIILVDD S   DL +KL+ +++ F GK+++IRN +REGLIR R  GA 
Sbjct: 189 VMNLTPHYFLEEIILVDDMSEVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGAS 248

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            + G+V+VFLD+HCEV   WL PLL  I  D K++  P+ID ID +T E    Y+P    
Sbjct: 249 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPVV 304

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RG F+W + +K + +   E    +  ++P +SP  +GG+FA+ R +F E+G YD  +  W
Sbjct: 305 RGAFDWNLQFKWDNVFSYEMDGPEGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFW 364

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLIT---YNYKRV 341
           GGEN ELS +IWMCGG +  +PCSR+GH+          K   R    +I+   +NY R+
Sbjct: 365 GGENLELSLRIWMCGGQLFIIPCSRVGHI---------SKKQTRKTSAIISATIHNYLRL 415

Query: 342 IETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +  W DE +K  F+ R+P   ++  G+I E+
Sbjct: 416 VHVWLDE-YKEQFFLRKPGLKYVTYGNIHER 445


>gi|426221067|ref|XP_004004733.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Ovis
           aries]
          Length = 938

 Score =  270 bits (690), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 149/360 (41%), Positives = 218/360 (60%), Gaps = 14/360 (3%)

Query: 16  PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
           P +P  + PG+ G+   +P       +    E   N+  S+ I  DR I D R   C   
Sbjct: 430 PRDP--KAPGQFGRPVVVPHGKEKEVERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQ 487

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
               +LP  SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LVDDFS+K  L   L+ 
Sbjct: 488 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDK 547

Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           Y+ +F  KVR++R  ER GLIR R  GA+++ G+V+ FLD+H E  + WL PLL  +Y  
Sbjct: 548 YMSQF-PKVRILRLKERHGLIRARLAGAQKATGDVLTFLDSHVECNIGWLEPLLERVYLS 606

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
           RK +  PVI+ I+ +   + +V   D+  RGIF W M +    +P +  AK +   ++  
Sbjct: 607 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVVAKNKIKETDII 663

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           + P  AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 664 RCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 723

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
           R+  PY+F K  DR+K   +  N  RV E W DE +K  FY      +   LD+G++++Q
Sbjct: 724 RNDNPYSFPK--DRMK--TVERNLGRVAEVWLDE-YKELFYGHGDHLIDQGLDVGNLTQQ 778


>gi|6329812|dbj|BAA86444.1| KIAA1130 protein [Homo sapiens]
          Length = 575

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 146/346 (42%), Positives = 201/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y  DLP  SVI+
Sbjct: 104 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 162

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +R
Sbjct: 163 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 217

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA  +   V+ FLD+HCEV   WLPP+L  +  D   +  P+ID I 
Sbjct: 218 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 277

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D+
Sbjct: 278 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 334

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF      
Sbjct: 335 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 389

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 390 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 433


>gi|426373643|ref|XP_004053705.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Gorilla
           gorilla gorilla]
          Length = 578

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 153/374 (40%), Positives = 210/374 (56%), Gaps = 24/374 (6%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 60
           RP++K        +PP + +  G         L E      +  +  Y +N+  S+ IS 
Sbjct: 61  RPLYK--------KPPADSHALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRISL 112

Query: 61  DRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
            R I D RM ECK   +    LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EIIL
Sbjct: 113 HRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIIL 172

Query: 120 VDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
           VDD S +  L  +LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE
Sbjct: 173 VDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231

Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENE 238
               WL PLL  I  D   +  PVID ID+ T+EF     EP     G F+W + ++ + 
Sbjct: 232 CNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHS 288

Query: 239 LPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
           +P++E  +R    +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W C
Sbjct: 289 VPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQC 348

Query: 299 GGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE 358
           GG +E  PCS +GHV+    PY           P    N  R  E W DE +K +FY R 
Sbjct: 349 GGKLEIHPCSHVGHVFPKRAPY---------ARPNFLRNTARAAEVWMDE-YKEHFYNRN 398

Query: 359 PLAMFLDMGDISEQ 372
           P A     GDISE+
Sbjct: 399 PPARKEAYGDISER 412


>gi|62122367|dbj|BAD93178.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 16 [Homo sapiens]
 gi|119601393|gb|EAW80987.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1, isoform CRA_b
           [Homo sapiens]
 gi|168269696|dbj|BAG09975.1| polypeptide N-acetylgalactosaminyltransferase-like protein 1
           [synthetic construct]
          Length = 542

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 146/346 (42%), Positives = 201/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y  DLP  SVI+
Sbjct: 71  KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA  +   V+ FLD+HCEV   WLPP+L  +  D   +  P+ID I 
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF      
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400


>gi|397513815|ref|XP_003827203.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           2 [Pan paniscus]
          Length = 532

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 146/366 (39%), Positives = 206/366 (56%), Gaps = 17/366 (4%)

Query: 6   ADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
           AD  L + +P    + +   +  +  +L       GD     Y  N   S  IS +R +P
Sbjct: 15  ADSGLSSSQPSDADWDDLWDQFDERRYLNAKKWRVGDDPYKLYAFNQRESERISSNRAVP 74

Query: 66  DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
           D R   C    Y  DLP  S+I+ FHNE  S+L+RT+ S+I RTP   + EIILVDDFS+
Sbjct: 75  DTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVINRTPTHLIREIILVDDFSN 134

Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
             D  ++L         KV+ +RN ER+GL+R+R RGA  ++G  + FLD+HCEV  +WL
Sbjct: 135 DPDDCKQLIKL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWL 189

Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
            PLL  +  D   +  PVID I+  T+ +    E     RG F+W + ++  +L   +  
Sbjct: 190 QPLLHRVKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKA 246

Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
           +R   +EP ++P  AGGLF +D+A+F  LG YD  + +WGGENFE+SF++WMCGGS+E V
Sbjct: 247 RRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIV 306

Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMF 363
           PCSR+GHV+R   PY F        G   TY  N KR  E W DE +K Y+Y   P A+ 
Sbjct: 307 PCSRVGHVFRKKHPYVFP------DGNANTYIKNTKRTAEVWMDE-YKQYYYAARPFALE 359

Query: 364 LDMGDI 369
              G++
Sbjct: 360 RPFGNV 365


>gi|397507535|ref|XP_003824250.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1 [Pan
           paniscus]
          Length = 529

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 146/346 (42%), Positives = 201/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y  DLP  SVI+
Sbjct: 42  KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 100

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +R
Sbjct: 101 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 155

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA  +   V+ FLD+HCEV   WLPP+L  +  D   +  P+ID I 
Sbjct: 156 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 215

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D+
Sbjct: 216 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 272

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF      
Sbjct: 273 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 327

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 328 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 371


>gi|403276501|ref|XP_003929936.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5
           [Saimiri boliviensis boliviensis]
          Length = 455

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 139/326 (42%), Positives = 202/326 (61%), Gaps = 11/326 (3%)

Query: 47  EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
           +YG N+  S  +   R +PD R + C    YP+ LP AS+++ F+NE F++L RTV SI 
Sbjct: 111 KYGFNIIISRSLGIKREVPDTRSKMCLQKRYPVRLPTASIVICFYNEEFNALFRTVSSIW 170

Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
             TP   LEEIILVDD S   DL +KL+ +++ F GK+++IRN +REGLIR R  GA  +
Sbjct: 171 NLTPHHCLEEIILVDDMSKVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGASHA 230

Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
            G+V+VFLD+HCEV   WL PLL  I  D K++  PVID ID +T +    Y+P    RG
Sbjct: 231 SGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPVIDVIDDRTLK----YKPSPVVRG 286

Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
            F+W + +K + +   E    +  ++P +SP  AGG+FA+ R +F E+G YD  +  WGG
Sbjct: 287 AFDWNLQFKWDNVFSYEMDGPEGPTKPIRSPAMAGGIFAIRRHYFNEIGQYDKDMDFWGG 346

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
           EN ELS +IWMCGG +  +PCSR+GH+ +       GK ++ +    +  NY R++  W 
Sbjct: 347 ENLELSLRIWMCGGQLFIIPCSRVGHISKK----QPGKGSELINA--VARNYLRLVHVWL 400

Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
           DE +K  F+ R+P   ++  G+ISE+
Sbjct: 401 DE-YKEQFFLRKPGLKYMTYGNISER 425


>gi|270011456|gb|EFA07904.1| hypothetical protein TcasGA2_TC005479 [Tribolium castaneum]
          Length = 621

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 152/349 (43%), Positives = 202/349 (57%), Gaps = 13/349 (3%)

Query: 14  EPPLEPYKEGPGEGGKAYHLPEAYRA----AGDASLGEYGMNMETSNHISFDRTIPDLRM 69
           +P L P     GE GK   LP    A      DA   +   N   S+ IS  R++PD R 
Sbjct: 72  KPVLLPPASNAGEMGKPVVLPSNLSADVKKLVDAGWQKNAFNQYVSDMISVHRSLPDPRD 131

Query: 70  EECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 128
           E CK    +   LP+ SVI+ FHNE +S L+RTVHS++ R+P+  ++E+ILVDDFS    
Sbjct: 132 EWCKAPGRFQEALPQTSVIICFHNEAWSVLLRTVHSVLDRSPSHLIKEVILVDDFSDMDH 191

Query: 129 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
           L Q+L DY      KV++IR  +REGLIR R  GA  + GEV+ +LD+HCE    WL PL
Sbjct: 192 LKQQLVDYFAS-EPKVKIIRAKKREGLIRARLLGAAHAEGEVLTYLDSHCECTTGWLEPL 250

Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
           L  I  D   +  PVID ID  T E+   ++      G F+W + +  + +PE E K+ K
Sbjct: 251 LDRIARDPTTVVCPVIDVIDDTTLEYH-FHDSGGVNVGGFDWNLQFNWHAVPEHEKKRHK 309

Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
             +EP  SPT AGGLF++D+ FF  LG YD G  +WGGEN ELSFK WMCGG++E VPCS
Sbjct: 310 NPAEPVYSPTMAGGLFSIDKKFFERLGTYDNGFDIWGGENLELSFKTWMCGGTLEIVPCS 369

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
            +GH++R   PY +     R    ++  N  R+ E W DE  K Y+Y R
Sbjct: 370 HVGHIFRKRSPYKW-----RSGVNVLRRNSVRLAEVWLDEYAK-YYYQR 412


>gi|242008519|ref|XP_002425051.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
 gi|212508700|gb|EEB12313.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
          Length = 657

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 147/338 (43%), Positives = 200/338 (59%), Gaps = 13/338 (3%)

Query: 25  GEGGKAYHLPEAY----RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY-WDYPL 79
           GE G+  HLP       +   D    +   N   S+ IS  R +PD R + CK    +  
Sbjct: 119 GEMGRPVHLPANLTGEIKKLVDEGWSKNAFNQYVSDLISVHRKLPDPRDKWCKEPGRFLQ 178

Query: 80  DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
           DLP+ SV++ FHNE +S L+RTVHS++ R+P   L+EIILVDDFS    L ++LEDY+  
Sbjct: 179 DLPQTSVVICFHNEAWSVLLRTVHSVLDRSPPNLLKEIILVDDFSDMIHLKKQLEDYMSH 238

Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
           +  KV++IR ++REGLIR R  GA  +   V  FLD+HCE  + WL PLL  I  D   +
Sbjct: 239 YP-KVKIIRASKREGLIRARLLGATRATAPVTTFLDSHCECTVGWLEPLLDRIAKDPTTV 297

Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTH 259
             PVID ID  T E+ +  +      G F+W + +  + +PERE K+ K  +EP  SPT 
Sbjct: 298 VCPVIDVIDDTTLEY-NFRDSGGVNVGGFDWNLQFNWHAVPEREKKRHKNTAEPVWSPTM 356

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLFA+D+ FF  +G YD G  +WGGEN ELSFK WMCGG++E VPCS +GH++R   P
Sbjct: 357 AGGLFAIDKNFFERIGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRRRSP 416

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           Y +     R    ++  N  R+ E W D+  K Y+Y R
Sbjct: 417 YKW-----RSGVNVLKRNSVRLAEVWLDDYAK-YYYQR 448


>gi|68534728|gb|AAH98578.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
 gi|158260513|dbj|BAF82434.1| unnamed protein product [Homo sapiens]
          Length = 558

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 146/346 (42%), Positives = 201/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y  DLP  SVI+
Sbjct: 71  KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA  +   V+ FLD+HCEV   WLPP+L  +  D   +  P+ID I 
Sbjct: 185 NDRREGLIRSRVRGADMAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF      
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400


>gi|402876549|ref|XP_003902024.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1 [Papio
           anubis]
          Length = 558

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 146/346 (42%), Positives = 201/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y  DLP  SVI+
Sbjct: 71  KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA  +   V+ FLD+HCEV   WLPP+L  +  D   +  P+ID I 
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF      
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400


>gi|332228990|ref|XP_003263671.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
           [Nomascus leucogenys]
          Length = 558

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 146/346 (42%), Positives = 201/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y  DLP  SVI+
Sbjct: 71  KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA  +   V+ FLD+HCEV   WLPP+L  +  D   +  P+ID I 
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF      
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400


>gi|297695402|ref|XP_002824932.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Pongo abelii]
          Length = 558

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 146/346 (42%), Positives = 201/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y  DLP  SVI+
Sbjct: 71  KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA  +   V+ FLD+HCEV   WLPP+L  +  D   +  P+ID I 
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF      
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400


>gi|270265820|ref|NP_065743.2| putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Homo sapiens]
 gi|270265827|ref|NP_001161840.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Homo sapiens]
 gi|332842578|ref|XP_522885.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Pan
           troglodytes]
 gi|51316024|sp|Q8N428.2|GLTL1_HUMAN RecName: Full=Putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1;
           AltName: Full=Polypeptide GalNAc transferase-like
           protein 1; Short=GalNAc-T-like protein 1;
           Short=pp-GaNTase-like protein 1; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase-like
           protein 1; AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
 gi|51490858|emb|CAD44534.1| polypeptide N-acetylgalactosaminyltransferase 16 [Homo sapiens]
 gi|112180422|gb|AAH36812.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
 gi|112818460|gb|AAI22546.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
 gi|119601392|gb|EAW80986.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1, isoform CRA_a
           [Homo sapiens]
 gi|119601394|gb|EAW80988.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1, isoform CRA_a
           [Homo sapiens]
 gi|164691113|dbj|BAF98739.1| unnamed protein product [Homo sapiens]
 gi|410265456|gb|JAA20694.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Pan
           troglodytes]
          Length = 558

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 146/346 (42%), Positives = 201/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y  DLP  SVI+
Sbjct: 71  KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA  +   V+ FLD+HCEV   WLPP+L  +  D   +  P+ID I 
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF      
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400


>gi|15207811|dbj|BAB62930.1| hypothetical protein [Macaca fascicularis]
          Length = 373

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 139/331 (41%), Positives = 202/331 (61%), Gaps = 17/331 (5%)

Query: 45  LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
           L +YG N+  S  +  +R +PD R + C    YP  LP AS+++ FHNE F +L RTV S
Sbjct: 27  LLKYGFNVIISRSLGIEREVPDTRNKMCLQKHYPARLPTASIVICFHNEEFHALFRTVSS 86

Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           ++  TP  +LEEIILVDD S   DL +KL+ +++ F GK+++IRN +REGLIR R  GA 
Sbjct: 87  VMNLTPHYFLEEIILVDDMSEVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGAS 146

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            + G+V+VFLD+HCEV   WL PLL  I  D K++  P+ID ID +T E    Y+P    
Sbjct: 147 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPVV 202

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RG F+W + +K + +   E    +  ++P +SP  +GG+FA+ R +F E+G YD  +  W
Sbjct: 203 RGAFDWNLQFKWDNVFSYEMDGPEGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFW 262

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLIT---YNYKRV 341
           GGEN ELS +IWMCGG +  +PCSR+GH+          K   R    +I+   +NY R+
Sbjct: 263 GGENLELSLRIWMCGGQLFIIPCSRVGHI---------SKKQTRKTSAIISATIHNYLRL 313

Query: 342 IETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +  W DE +K  F+ R+P   ++  G+I E+
Sbjct: 314 VHVWLDE-YKEQFFLRKPGLKYVTYGNIHER 343


>gi|354482531|ref|XP_003503451.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
           [Cricetulus griseus]
          Length = 929

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 150/373 (40%), Positives = 220/373 (58%), Gaps = 18/373 (4%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           V + D  L   +P        PG+ G+   +P   +   +    E   N+  S+ I  DR
Sbjct: 412 VLRIDESLSPRDP------NAPGQFGRPVVVPPGKKEEAERRWKEGNFNVYLSDLIPVDR 465

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
            I D R  EC       DLP  S+I+ F +E +S+L+R+VHSI+ R+P   ++EI+LVDD
Sbjct: 466 AIEDTRPAECAEQLVHNDLPTTSIIMCFVDEVWSALLRSVHSILNRSPPHLIKEILLVDD 525

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
           FS+K  L   L+ Y+ +F  KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  +
Sbjct: 526 FSTKDYLKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 584

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-E 241
            WL PLL  +Y +RK +  PVI+ I+ +   + +V   D+  RG+F W M +    +P +
Sbjct: 585 GWLEPLLERVYLNRKKVACPVIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWRTIPPD 641

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
             AK     ++  + P    GLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG 
Sbjct: 642 VVAKSGIKETDIIRCPVMGCGLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGE 701

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EP 359
           IE +PCSR+GH++R+  PY+F K  DR+K   +  N  RV E W DE +K  FY      
Sbjct: 702 IEIIPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHL 756

Query: 360 LAMFLDMGDISEQ 372
           +   LD+G++++Q
Sbjct: 757 IDQGLDVGNLTQQ 769


>gi|443683118|gb|ELT87486.1| hypothetical protein CAPTEDRAFT_155466 [Capitella teleta]
          Length = 644

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 133/320 (41%), Positives = 198/320 (61%), Gaps = 10/320 (3%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           ++ +  +  N+  S  +  DR IPD R  +C   +    L   +VI+ FHNE +S+L+RT
Sbjct: 165 ESGMQRHSFNVRASELLPLDRPIPDYRPTQCPSINQST-LSPTTVIICFHNEAWSTLLRT 223

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSR 161
           +HS+I R+P+  + EIILVDD S+   L + LE+++ +    V L+R   REGLIR R  
Sbjct: 224 LHSVINRSPSHLIMEIILVDDASTFDYLGEPLENHLSQLEN-VYLLRTKIREGLIRARLL 282

Query: 162 GAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPD 221
           G   ++G+V+VFLD+HCE    WLPPLL  I +DR  +  P++D I++QT+E+R+  E  
Sbjct: 283 GVSYAKGDVLVFLDSHCECAEGWLPPLLLAIEADRTKIVCPLVDVIEFQTFEYRAAKEEL 342

Query: 222 HHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGL 281
           H   G F+W + +   +LPE E K+R   ++  ++PT  GGLFA+DR +F  +G YD G+
Sbjct: 343 H---GAFDWNLQFIWKDLPEHEMKRRTSPADNIRAPTIIGGLFAVDRLYFKRIGSYDSGM 399

Query: 282 LVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRV 341
            +WG EN ELSF++WMCGGS+E  PCSR+GHV+R+ +PY F     R     I  N  R 
Sbjct: 400 DIWGSENLELSFRVWMCGGSLEISPCSRVGHVFRTRIPYGFPNGGKRT----IRNNAMRA 455

Query: 342 IETWFDEKHKAYFYTREPLA 361
            E W D+ +K +FY  + + 
Sbjct: 456 AEVWLDD-YKKFFYASQNIT 474


>gi|344273523|ref|XP_003408571.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1-like
           [Loxodonta africana]
          Length = 555

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 145/343 (42%), Positives = 201/343 (58%), Gaps = 22/343 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY AA     GE     +  N   S+ +S DR I D R   C    Y LDLP  SVI+ 
Sbjct: 71  KAYLAAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSLDLPATSVIIT 130

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
            +REGLIR+R RGA  +   ++ FLD+HCEV   WL P+L  +  D   +  P+ID I  
Sbjct: 186 DQREGLIRSRVRGADVAVAAILTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 245

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKISRTDPTKPIRTPVIAGGIFVIDKS 302

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVA 398


>gi|15207947|dbj|BAB62998.1| hypothetical protein [Macaca fascicularis]
          Length = 443

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 139/331 (41%), Positives = 202/331 (61%), Gaps = 17/331 (5%)

Query: 45  LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
           L +YG N+  S  +  +R +PD R + C    YP  LP AS+++ FHNE F +L RTV S
Sbjct: 97  LLKYGFNVIISRSLGIEREVPDTRNKMCLQKHYPARLPTASIVICFHNEEFHALFRTVSS 156

Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           ++  TP  +LEEIILVDD S   DL +KL+ +++ F GK+++IRN +REGLIR R  GA 
Sbjct: 157 VMNLTPHYFLEEIILVDDMSEVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGAS 216

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            + G+V+VFLD+HCEV   WL PLL  I  D K++  P+ID ID +T E    Y+P    
Sbjct: 217 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPVV 272

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RG F+W + +K + +   E    +  ++P +SP  +GG+FA+ R +F E+G YD  +  W
Sbjct: 273 RGAFDWNLQFKWDNVFSYEMDGPEGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFW 332

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLIT---YNYKRV 341
           GGEN ELS +IWMCGG +  +PCSR+GH+          K   R    +I+   +NY R+
Sbjct: 333 GGENLELSLRIWMCGGQLFIIPCSRVGHI---------SKKQTRKTSAIISATIHNYLRL 383

Query: 342 IETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +  W DE +K  F+ R+P   ++  G+I E+
Sbjct: 384 VHVWLDE-YKEQFFLRKPGLKYVTYGNIHER 413


>gi|403264517|ref|XP_003924524.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
           [Saimiri boliviensis boliviensis]
          Length = 558

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 146/346 (42%), Positives = 201/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y LDLP  SVI+
Sbjct: 71  KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSMSYSLDLPATSVII 129

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA  +   V+ FLD+HCEV   WL P+L  +  D   +  P+ID I 
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVIS 244

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF      
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVASR 400


>gi|189053556|dbj|BAG35722.1| unnamed protein product [Homo sapiens]
          Length = 578

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 157/376 (41%), Positives = 213/376 (56%), Gaps = 28/376 (7%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHI 58
           RP++K        +PP +      GE GKA    L E      +  +  Y +N+  S+ I
Sbjct: 61  RPLYK--------KPPAD--SRALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRI 110

Query: 59  SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           S  R I D RM ECK   +    LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EI
Sbjct: 111 SLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEI 170

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
           ILVDD S +  L  +LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FLD H
Sbjct: 171 ILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCH 229

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKE 236
           CE    WL PLL  I  D   +  PVID ID+ T+EF     EP     G F+W + ++ 
Sbjct: 230 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQW 286

Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
           + +P++E  +R    +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W
Sbjct: 287 HSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVW 346

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
            CGG +E  PCS +GHV+    PY           P    N  R  E W DE +K +FY 
Sbjct: 347 QCGGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYN 396

Query: 357 REPLAMFLDMGDISEQ 372
           R P A     GDISE+
Sbjct: 397 RNPPARKEAYGDISER 412


>gi|77736615|ref|NP_001020224.2| polypeptide N-acetylgalactosaminyltransferase 4 [Rattus norvegicus]
 gi|76780269|gb|AAI05819.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Rattus
           norvegicus]
 gi|149067086|gb|EDM16819.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Rattus
           norvegicus]
          Length = 578

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 149/361 (41%), Positives = 204/361 (56%), Gaps = 16/361 (4%)

Query: 14  EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
           +PP + +  G         L E      +  +  Y +N+  S+ IS  R I D RM ECK
Sbjct: 66  KPPADSHALGEWGRASKLQLDEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECK 125

Query: 74  YWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
              +    LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EIILVDD S +  L  +
Sbjct: 126 AKKFHYRSLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRIYLKAQ 185

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE    WL PLL  I
Sbjct: 186 LEAYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLERI 244

Query: 193 YSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 251
             D   +  PVID ID+ T+EF     EP     G F+W + ++ + +P+ E  +R    
Sbjct: 245 SRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRTSRI 301

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
           +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG +E  PCS +G
Sbjct: 302 DPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVG 361

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+    PY           P    N  R  E W D+ +K +FY R P A     GDISE
Sbjct: 362 HVFPKRAPY---------ARPNFLQNTARAAEVWMDD-YKEHFYNRNPPARKETYGDISE 411

Query: 372 Q 372
           +
Sbjct: 412 R 412


>gi|26325284|dbj|BAC26396.1| unnamed protein product [Mus musculus]
          Length = 930

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 142/352 (40%), Positives = 215/352 (61%), Gaps = 10/352 (2%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
            PG+ G+   +P   +   +    E   N+  S+ I  DR I D R   C       DLP
Sbjct: 427 APGQFGRPVVVPPEKKKEAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNDLP 486

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ F +E +S+L+R+VHS++ R+P   ++EI+LVDDFS+K  L   L+ Y+ +F  
Sbjct: 487 TTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFSTKEYLKADLDKYMSQF-P 545

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
           KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y +RK +  P
Sbjct: 546 KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLNRKKVACP 605

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPYKSPTHAG 261
           VI+ I+ +   + +V   D+  RG+F W M +    +P +  AK     ++  + P  AG
Sbjct: 606 VIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWKTIPPDVVAKNGIKETDIIRCPVMAG 662

Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
           GLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++R+  PY+
Sbjct: 663 GLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIELIPCSRVGHIFRNDNPYS 722

Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYF-YTREPLAMFLDMGDISEQ 372
           F K  DR+K   +  N  RV E W D+  + ++ +    +   LD+G++++Q
Sbjct: 723 FPK--DRMKT--VERNLVRVAEVWLDDYRELFYGHGDHLIDQGLDVGNLTQQ 770


>gi|345782166|ref|XP_540140.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Canis
           lupus familiaris]
          Length = 552

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 142/336 (42%), Positives = 194/336 (57%), Gaps = 19/336 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS  R +PD R   C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERISSSRAVPDTRHLRCTMLVYCADLPPTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
           RT+ S++ RTP   ++EIILVDDFS+  D      D +Q     KV+ IRN+ER+GL+R+
Sbjct: 129 RTIRSVLNRTPMNLIQEIILVDDFSNDPD------DCLQLIKLPKVKCIRNSERQGLVRS 182

Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
           R RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I    + +    
Sbjct: 183 RIRGANVAKGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIISLDNFNY---I 239

Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
           E     RG F+W + ++  +L   +  +R   +EP ++P  AGGLF MD+++F  LG YD
Sbjct: 240 ESAAELRGGFDWSLHFQWEQLSPEQKARRLDPAEPIRTPIIAGGLFVMDKSWFNYLGKYD 299

Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
             + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  
Sbjct: 300 TDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIK 353

Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           N KR  E W DE +K Y+Y   P A+    G+I  +
Sbjct: 354 NTKRTAEVWMDE-YKQYYYAARPFALERPFGNIESR 388


>gi|410965222|ref|XP_003989149.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Felis
           catus]
          Length = 582

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 151/352 (42%), Positives = 204/352 (57%), Gaps = 18/352 (5%)

Query: 25  GEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD-L 81
           GE GKA    L +      +  +  Y +N+  S+ IS  R I D RM ECK   +    L
Sbjct: 79  GEWGKASKLQLSQDELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSQKFNYRRL 138

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EIILVDD S +  L  +LE YI   +
Sbjct: 139 PTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLD 198

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
            +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE    WL PLL  I  D   +  
Sbjct: 199 -RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGKDETAIVC 257

Query: 202 PVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
           PVID ID+ T+EF     EP     G F+W + ++ + +P+ E  +RK   +P +SPT A
Sbjct: 258 PVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRKSRIDPIRSPTMA 314

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG +E  PCS +GHV+    PY
Sbjct: 315 GGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPKRAPY 374

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                      P    N  R  E W D+ +K +FY R P A     GDISE+
Sbjct: 375 ---------ARPNFLQNTARAAEVWMDQ-YKEHFYNRNPPARKEAYGDISER 416


>gi|426335179|ref|XP_004029110.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           2 [Gorilla gorilla gorilla]
          Length = 532

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 145/366 (39%), Positives = 206/366 (56%), Gaps = 17/366 (4%)

Query: 6   ADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
           AD  L + +P    + +   +  +  +L       GD     Y  N   S  IS +R +P
Sbjct: 15  ADSGLSSSQPSDADWDDVWDQFDERRYLNAKKWRVGDDPYKLYAFNQRESERISSNRAVP 74

Query: 66  DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
           D R   C    Y  DLP  S+I+ FHNE  S+L+RT+ S++ RTP   + EIILVDDFS+
Sbjct: 75  DTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSN 134

Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
             D  ++L         KV+ +RN ER+GL+R+R RGA  ++G  + FLD+HCEV  +WL
Sbjct: 135 DPDDCKQLIKL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWL 189

Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
            PLL  +  D   +  PVID I+  T+ +    E     RG F+W + ++  +L   +  
Sbjct: 190 QPLLHRVKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKA 246

Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
           +R   +EP ++P  AGGLF +D+A+F  LG YD  + +WGGENFE+SF++WMCGGS+E V
Sbjct: 247 RRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIV 306

Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMF 363
           PCSR+GHV+R   PY F        G   TY  N KR  E W DE +K Y+Y   P A+ 
Sbjct: 307 PCSRVGHVFRKKHPYVFP------DGNANTYIKNTKRTAEVWMDE-YKQYYYAARPFALE 359

Query: 364 LDMGDI 369
              G++
Sbjct: 360 RPFGNV 365


>gi|410955524|ref|XP_003984401.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Felis
           catus]
          Length = 552

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 141/334 (42%), Positives = 194/334 (58%), Gaps = 17/334 (5%)

Query: 41  GDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMR 100
           GD     Y  N   S  IS +R +PD R   C    Y  DLP  S+I+ FHNE  S+L+R
Sbjct: 70  GDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCADLPPTSIIITFHNEARSTLLR 129

Query: 101 TVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRS 160
           T+ S++ RTP   ++EIILVDDFS+  D   +L         KV+ IRNTER+GL+R+R 
Sbjct: 130 TIRSVLNRTPMNLIQEIILVDDFSNDPDDCSQLIKL-----PKVKCIRNTERQGLVRSRI 184

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
           RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I    + +    E 
Sbjct: 185 RGASVAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIISLDNFNY---IES 241

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F+W + ++  +L   +  +R   +EP ++P  AGGLF MD+++F  LG YD  
Sbjct: 242 AAELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFEYLGKYDTD 301

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NY 338
           + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N 
Sbjct: 302 MDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKNT 355

Query: 339 KRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           KR  E W DE +K Y+Y   P A+    G++  +
Sbjct: 356 KRTAEVWMDE-YKQYYYAARPFALERPFGNVESR 388


>gi|195471079|ref|XP_002087833.1| GE18238 [Drosophila yakuba]
 gi|194173934|gb|EDW87545.1| GE18238 [Drosophila yakuba]
          Length = 659

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 149/328 (45%), Positives = 205/328 (62%), Gaps = 16/328 (4%)

Query: 49  GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
           G N   S+ IS +R++PD+R+E CK   Y   LP  SV+ +F NE F++L+R+++S+I R
Sbjct: 160 GFNGLISDRISLNRSVPDIRLEACKTRKYLAKLPNISVVFIFFNEHFNTLLRSIYSVINR 219

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESR 167
           TP + L +I+LVDD S    L Q L+DY+ Q F   V ++ + ER+GLI  R  GAK + 
Sbjct: 220 TPPELLRQIVLVDDGSEWDSLKQPLDDYVAQHFPHLVTVVHSPERQGLIGARLAGAKVAV 279

Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
           GEV+VF D+H EV  NWLPPL+ PI  + KI T P++D I ++ + + S  +     RG 
Sbjct: 280 GEVMVFFDSHIEVNYNWLPPLIEPIAINPKIATCPMVDTIAHEDFSYFSGNKDG--ARGG 337

Query: 228 FEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
           F+W MLYK+   LPE    K    S PY+SP   GGLFA++  FF +LGGYD  L +WGG
Sbjct: 338 FDWKMLYKQLPVLPEDALDK----SMPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGG 393

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG-PLITYNYKRVIETW 345
           E +ELSFKIWMCGG +  VPCSR+GH++R  M     K     +G   +  N+KRV E W
Sbjct: 394 EQYELSFKIWMCGGLLLDVPCSRVGHIFRGPM-----KPRGNPRGHNFVAKNHKRVAEVW 448

Query: 346 FDEKHKAYFYTREPLAM-FLDMGDISEQ 372
            DE +K Y Y R+P     +D GD++ Q
Sbjct: 449 MDE-YKEYVYKRDPATYDNVDAGDLTRQ 475


>gi|354472196|ref|XP_003498326.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
           [Cricetulus griseus]
          Length = 513

 Score =  269 bits (688), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 145/345 (42%), Positives = 203/345 (58%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY +A     GE     +  N   S+ +S DR I D R   C    Y LDLP  SVI+ 
Sbjct: 26  KAYLSAKQLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSLSYSLDLPATSVIIT 85

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 86  FHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 140

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
            +REGLIR+R RGA  +   V+ FLD+HCEV + WL P+L  +  D   +  P+ID I  
Sbjct: 141 DKREGLIRSRVRGADVAGATVLTFLDSHCEVNIEWLQPMLQRVMEDHTRVVSPIIDVISL 200

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D++
Sbjct: 201 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKS 257

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 258 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 311

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 312 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 355


>gi|161077154|ref|NP_725603.2| CG30463, isoform B [Drosophila melanogaster]
 gi|161077156|ref|NP_001097341.1| CG30463, isoform C [Drosophila melanogaster]
 gi|157400365|gb|AAF57964.3| CG30463, isoform B [Drosophila melanogaster]
 gi|157400366|gb|ABV53822.1| CG30463, isoform C [Drosophila melanogaster]
          Length = 647

 Score =  269 bits (688), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 150/357 (42%), Positives = 206/357 (57%), Gaps = 28/357 (7%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           ++PP   ++E PGE GK   LP    +  + A D    +   N   S+ IS  RT+PD R
Sbjct: 136 IDPPAN-FEENPGELGKPVRLPKEMSDEMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 194

Query: 69  MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
              CK    Y  +LPK  VI+ FHNE ++ L+RTVHS++ R+P   + +IILVDD+S   
Sbjct: 195 DAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 254

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L ++LEDY   +  KV++IR  +REGLIR R  GA  ++  V+ +LD+HCE    WL P
Sbjct: 255 HLKRQLEDYFAAY-PKVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 313

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
           LL  I  +   +  PVID I  +T E+        HYR       G F+W + +  + +P
Sbjct: 314 LLDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 365

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
           ERE K+    +EP  SPT AGGLF++DR FF  LG YD G  +WGGEN ELSFK WMCGG
Sbjct: 366 ERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGG 425

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           ++E VPCS +GH++R   PY +     R    ++  N  R+ E W DE +  Y+Y R
Sbjct: 426 TLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKKNSVRLAEVWMDE-YSQYYYHR 476


>gi|359465585|ref|NP_001240756.1| polypeptide N-acetylgalactosaminyltransferase 14 isoform 3 [Homo
           sapiens]
 gi|119620894|gb|EAX00489.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
           isoform CRA_d [Homo sapiens]
 gi|193783719|dbj|BAG53701.1| unnamed protein product [Homo sapiens]
          Length = 532

 Score =  269 bits (688), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 146/366 (39%), Positives = 206/366 (56%), Gaps = 17/366 (4%)

Query: 6   ADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
           AD  L + +P    + +   +  +  +L       GD     Y  N   S  IS +R IP
Sbjct: 15  ADSGLSSSQPSDADWDDLWDQFDERRYLNAKKWRVGDDPYKLYAFNQRESERISSNRAIP 74

Query: 66  DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
           D R   C    Y  DLP  S+I+ FHNE  S+L+RT+ S++ RTP   + EIILVDDFS+
Sbjct: 75  DTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSN 134

Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
             D  ++L         KV+ +RN ER+GL+R+R RGA  ++G  + FLD+HCEV  +WL
Sbjct: 135 DPDDCKQLIKL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWL 189

Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
            PLL  +  D   +  PVID I+  T+ +    E     RG F+W + ++  +L   +  
Sbjct: 190 QPLLHRVKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKA 246

Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
           +R   +EP ++P  AGGLF +D+A+F  LG YD  + +WGGENFE+SF++WMCGGS+E V
Sbjct: 247 RRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIV 306

Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMF 363
           PCSR+GHV+R   PY F        G   TY  N KR  E W DE +K Y+Y   P A+ 
Sbjct: 307 PCSRVGHVFRKKHPYVFP------DGNANTYIKNTKRTAEVWMDE-YKQYYYAARPFALE 359

Query: 364 LDMGDI 369
              G++
Sbjct: 360 RPFGNV 365


>gi|109732606|gb|AAI16333.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 5 [Mus musculus]
          Length = 930

 Score =  269 bits (688), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 142/352 (40%), Positives = 215/352 (61%), Gaps = 10/352 (2%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
            PG+ G+   +P   +   +    E   N+  S+ I  DR I D R   C       DLP
Sbjct: 427 APGQFGRPVVVPPEKKKEAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNDLP 486

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ F +E +S+L+R+VHS++ R+P   ++EI+LVDDFS+K  L   L+ Y+ +F  
Sbjct: 487 TTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFSTKEYLKADLDKYMSQF-P 545

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
           KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y +RK +  P
Sbjct: 546 KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLNRKKVACP 605

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPYKSPTHAG 261
           VI+ I+ +   + +V   D+  RG+F W M +    +P +  AK     ++  + P  AG
Sbjct: 606 VIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWKTIPPDVVAKNGIKETDIIRCPVMAG 662

Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
           GLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++R+  PY+
Sbjct: 663 GLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIFRNDNPYS 722

Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYF-YTREPLAMFLDMGDISEQ 372
           F K  DR+K   +  N  RV E W D+  + ++ +    +   LD+G++++Q
Sbjct: 723 FPK--DRMKT--VERNLVRVAEVWLDDYRELFYGHGDHLIDQGLDVGNLTQQ 770


>gi|426377334|ref|XP_004055422.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
           [Gorilla gorilla gorilla]
          Length = 598

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 146/344 (42%), Positives = 200/344 (58%), Gaps = 18/344 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y  DLP  SVI+
Sbjct: 111 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 169

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +R
Sbjct: 170 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 224

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA  +   V+ FLD+HCEV   WLPP+L  +  D   +  P+ID I 
Sbjct: 225 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 284

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D+
Sbjct: 285 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 341

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF      
Sbjct: 342 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 396

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++
Sbjct: 397 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVA 438


>gi|158749624|ref|NP_766443.2| polypeptide N-acetylgalactosaminyltransferase 5 [Mus musculus]
 gi|341940730|sp|Q8C102.2|GALT5_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
           AltName: Full=Polypeptide GalNAc transferase 5;
           Short=GalNAc-T5; Short=pp-GaNTase 5; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 5;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 5
 gi|148694985|gb|EDL26932.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 5 [Mus musculus]
          Length = 930

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 142/352 (40%), Positives = 215/352 (61%), Gaps = 10/352 (2%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
            PG+ G+   +P   +   +    E   N+  S+ I  DR I D R   C       DLP
Sbjct: 427 APGQFGRPVVVPPEKKKEAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNDLP 486

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 142
             S+I+ F +E +S+L+R+VHS++ R+P   ++EI+LVDDFS+K  L   L+ Y+ +F  
Sbjct: 487 TTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFSTKEYLKADLDKYMSQF-P 545

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
           KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y +RK +  P
Sbjct: 546 KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLNRKKVACP 605

Query: 203 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPYKSPTHAG 261
           VI+ I+ +   + +V   D+  RG+F W M +    +P +  AK     ++  + P  AG
Sbjct: 606 VIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWKTIPPDVVAKNGIKETDIIRCPVMAG 662

Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
           GLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++R+  PY+
Sbjct: 663 GLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIFRNDNPYS 722

Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYF-YTREPLAMFLDMGDISEQ 372
           F K  DR+K   +  N  RV E W D+  + ++ +    +   LD+G++++Q
Sbjct: 723 FPK--DRMKT--VERNLVRVAEVWLDDYRELFYGHGDHLIDQGLDVGNLTQQ 770


>gi|332227141|ref|XP_003262749.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           2 [Nomascus leucogenys]
          Length = 532

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 145/366 (39%), Positives = 206/366 (56%), Gaps = 17/366 (4%)

Query: 6   ADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
           AD  L + +P    + +   +  +  +L       GD     Y  N   S  IS +R +P
Sbjct: 15  ADSGLSSSQPSDADWDDLWDQFDERRYLNAKKWRVGDDPYKLYAFNQRESERISSNRAVP 74

Query: 66  DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
           D R   C    Y  DLP  S+I+ FHNE  S+L+RT+ S++ RTP   + EIILVDDFS+
Sbjct: 75  DTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSN 134

Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
             D  ++L         KV+ +RN ER+GL+R+R RGA  ++G  + FLD+HCEV  +WL
Sbjct: 135 DPDDCKQLVKL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWL 189

Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
            PLL  +  D   +  PVID I+  T+ +    E     RG F+W + ++  +L   +  
Sbjct: 190 QPLLHRVKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKA 246

Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
           +R   +EP ++P  AGGLF +D+A+F  LG YD  + +WGGENFE+SF++WMCGGS+E V
Sbjct: 247 RRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIV 306

Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMF 363
           PCSR+GHV+R   PY F        G   TY  N KR  E W DE +K Y+Y   P A+ 
Sbjct: 307 PCSRVGHVFRKKHPYVFP------DGNANTYIKNTKRTAEVWMDE-YKQYYYAARPFALE 359

Query: 364 LDMGDI 369
              G++
Sbjct: 360 RPFGNV 365


>gi|324506451|gb|ADY42754.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Ascaris suum]
          Length = 618

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 150/363 (41%), Positives = 215/363 (59%), Gaps = 33/363 (9%)

Query: 23  GPGEGGKAYHLP----------EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           GPGEGGK   +P          E YR  G  +          S+ I  +R++ D+R ++C
Sbjct: 96  GPGEGGKPVAIPTDPEIKKKQEELYRVNGYDAF--------VSDLIPLNRSVKDIRHKDC 147

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           +   Y   LP  SVI  FH+E  S+L+R+ +S+I RTP + L+EIILVDD S+K  L + 
Sbjct: 148 QNLRYLEALPSVSVIFPFHDEHNSTLLRSAYSVIARTPKEILKEIILVDDASTKPFLKKP 207

Query: 133 LEDYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
           L++Y++  + +  V+++R  +REGLIR R  GA+ +  +++VFLDAH E   NWLPPL+ 
Sbjct: 208 LDEYLKSAKLDHIVKVVRTKKREGLIRARQIGAQHATADIMVFLDAHSEPNYNWLPPLIE 267

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
           PI  D + +  P +D ID  T+E+R+    D   RG F+W   YK   L E + K   + 
Sbjct: 268 PITLDYRTVVCPFVDVIDCDTFEYRA---QDEGARGSFDWEFNYKRLPLTEDDLK---HP 321

Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
           + P+KSP  AGG FA+ R +F ELGGYD GL +WGGE +ELSFK+W C G++   PCSR+
Sbjct: 322 TRPFKSPVMAGGYFAISRKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGNMVDAPCSRV 381

Query: 311 GHVYRS-FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           GH+YR   +P+    + D      I+ NYKRV E W D+ +K Y Y R       D GD+
Sbjct: 382 GHIYRCKHVPFPNPGVGD-----FISRNYKRVAEVWMDD-YKKYLYQRRHGMENADEGDL 435

Query: 370 SEQ 372
           ++Q
Sbjct: 436 TKQ 438


>gi|194882445|ref|XP_001975321.1| GG22251 [Drosophila erecta]
 gi|190658508|gb|EDV55721.1| GG22251 [Drosophila erecta]
          Length = 721

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 150/357 (42%), Positives = 206/357 (57%), Gaps = 28/357 (7%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           ++PP   ++E PGE GK   LP    +  + A D    +   N   S+ IS  RT+PD R
Sbjct: 135 IDPPAN-FEENPGELGKPVRLPKEMSDDMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 193

Query: 69  MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
              CK    Y  +LPK  VI+ FHNE ++ L+RTVHS++ R+P   + +IILVDD+S   
Sbjct: 194 DAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 253

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L ++LEDY   +  KV++IR  +REGLIR R  GA  ++  V+ +LD+HCE    WL P
Sbjct: 254 HLKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 312

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
           LL  I  +   +  PVID I  +T E+        HYR       G F+W + +  + +P
Sbjct: 313 LLDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 364

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
           ERE K+    +EP  SPT AGGLF++DR FF  LG YD G  +WGGEN ELSFK WMCGG
Sbjct: 365 ERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGG 424

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           ++E VPCS +GH++R   PY +     R    ++  N  R+ E W DE +  Y+Y R
Sbjct: 425 TLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKKNSVRLAEVWMDE-YSQYYYHR 475


>gi|24654219|ref|NP_725602.1| CG30463, isoform A [Drosophila melanogaster]
 gi|161077158|ref|NP_001097342.1| CG30463, isoform D [Drosophila melanogaster]
 gi|51316018|sp|Q8MRC9.2|GALT9_DROME RecName: Full=Putative polypeptide
           N-acetylgalactosaminyltransferase 9; Short=pp-GaNTase 9;
           AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 9; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 9
 gi|21627105|gb|AAF57966.2| CG30463, isoform A [Drosophila melanogaster]
 gi|157400367|gb|ABV53823.1| CG30463, isoform D [Drosophila melanogaster]
          Length = 650

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 150/357 (42%), Positives = 206/357 (57%), Gaps = 28/357 (7%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           ++PP   ++E PGE GK   LP    +  + A D    +   N   S+ IS  RT+PD R
Sbjct: 136 IDPPAN-FEENPGELGKPVRLPKEMSDEMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 194

Query: 69  MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
              CK    Y  +LPK  VI+ FHNE ++ L+RTVHS++ R+P   + +IILVDD+S   
Sbjct: 195 DAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 254

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L ++LEDY   +  KV++IR  +REGLIR R  GA  ++  V+ +LD+HCE    WL P
Sbjct: 255 HLKRQLEDYFAAY-PKVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 313

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
           LL  I  +   +  PVID I  +T E+        HYR       G F+W + +  + +P
Sbjct: 314 LLDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 365

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
           ERE K+    +EP  SPT AGGLF++DR FF  LG YD G  +WGGEN ELSFK WMCGG
Sbjct: 366 ERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGG 425

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           ++E VPCS +GH++R   PY +     R    ++  N  R+ E W DE +  Y+Y R
Sbjct: 426 TLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKKNSVRLAEVWMDE-YSQYYYHR 476


>gi|350582569|ref|XP_003481303.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
           [Sus scrofa]
          Length = 552

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 142/336 (42%), Positives = 196/336 (58%), Gaps = 19/336 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  ++ +R +PD R+  C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERVASNRVVPDTRLFRCTLLVYCADLPPTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
           RTV SI+ RTP   ++EIILVDDFS+        ED  Q     KV+ +RN ER+GL+R+
Sbjct: 129 RTVRSILNRTPMNLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNNERQGLVRS 182

Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
           R RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I   T+++    
Sbjct: 183 RIRGADAAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFDY---I 239

Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
           E     RG F+W + ++  +L   +  +R   +EP ++P  AGGLF MD+++F  LG YD
Sbjct: 240 ESATELRGGFDWSLHFQWEQLTPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFDYLGKYD 299

Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
             + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  
Sbjct: 300 TDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIK 353

Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           N KR  E W DE +K Y+Y   P A+    G+I  +
Sbjct: 354 NTKRTAEVWMDE-YKQYYYASRPFALERPFGNIESR 388


>gi|195385643|ref|XP_002051514.1| GJ11806 [Drosophila virilis]
 gi|194147971|gb|EDW63669.1| GJ11806 [Drosophila virilis]
          Length = 653

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 154/365 (42%), Positives = 212/365 (58%), Gaps = 33/365 (9%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEY-----GMNMETSNHISFDRTIPDLRMEECKYW 75
           + G GE G    LP       + +L E      G N   S+ IS +R++PD+R E+CK  
Sbjct: 128 RSGLGEHG----LPATIEDPAEKTLEEQEYRRNGFNGYLSDRISVNRSLPDVRHEKCKTR 183

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
            Y   LP  SV+++F+NE F +L+RTV+SI+ RTP + L +I+LVDD S    L  +L+ 
Sbjct: 184 KYLAKLPNVSVVIIFYNEHFQTLLRTVYSIVNRTPKELLHQIVLVDDGSEWETLKDQLDQ 243

Query: 136 YIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           Y+  ++   V ++ N ER GLI  R  GA+ + GEV+VF D+H EV  NWLPPLL PI  
Sbjct: 244 YVALQWPHLVDVVHNPERRGLIGARLAGARVATGEVMVFFDSHIEVNYNWLPPLLEPIVI 303

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKKRKYNSEP 253
           + KI T P++D ID+  + +   Y+     RG F+W   YK+   LPE    K    S P
Sbjct: 304 NNKISTCPIVDIIDHNNFAYNGGYQ--EGTRGGFDWRFFYKQLPVLPEDSVDK----SLP 357

Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
           Y+SP   GGLFA++  FF +LGGYD  L +WGGE +ELSFKIWMCGG +  VPCSR+ H+
Sbjct: 358 YRSPVMMGGLFAINSEFFWDLGGYDDELDIWGGEQYELSFKIWMCGGMLLDVPCSRVAHI 417

Query: 314 YRSFM-----PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMG 367
           +R  M     P N+           +  N+KRV E W DE +K + Y R+P     +D G
Sbjct: 418 FRGQMDPRPNPRNYN---------FVARNHKRVAEVWMDE-YKEHVYRRDPATYDNIDAG 467

Query: 368 DISEQ 372
           D+S Q
Sbjct: 468 DLSRQ 472


>gi|157107408|ref|XP_001649763.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108884049|gb|EAT48274.1| AAEL000646-PA [Aedes aegypti]
          Length = 582

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 152/357 (42%), Positives = 218/357 (61%), Gaps = 14/357 (3%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASL-GEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
           E  +EGPGE GK   L +      +  L  E G +   S+ I+ +R++PD R  +C+   
Sbjct: 56  ESKREGPGEHGKPLKLEKLEDIKLNEKLFKENGYSAVVSDMIALNRSVPDARHVQCRKKR 115

Query: 77  YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
           Y  +LP  SVI++F+NE +S+L+RTVHSI+ R+P++ L+EI+LV+D S+K  L + L+DY
Sbjct: 116 YLQELPTVSVIVIFYNEHWSTLLRTVHSILNRSPSKLLKEIVLVNDHSTKEFLWEPLQDY 175

Query: 137 IQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           ++ +   KV+L     R GLI  R  GAK + G+V++ LD+H EV +NWLPPL+ PI  +
Sbjct: 176 VRSKLPSKVKLFNLPVRSGLIAARLAGAKAATGDVLIVLDSHTEVNVNWLPPLIEPIAEN 235

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYK 255
            +    P IDGI + T+E++   E     RG F+W  LYK   LP R  + +   +EP+ 
Sbjct: 236 YRTCVCPYIDGIAHDTFEYKPQSE---GRRGAFDWKFLYKR--LPLR-PQDQTDPTEPFD 289

Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
           SP  AGGLFA+   FF ELGGYD  L +WGGE +ELSFKIW CGG +   PCS +GHVYR
Sbjct: 290 SPIMAGGLFAISAKFFWELGGYDEELDIWGGEQYELSFKIWQCGGRMVDAPCSHVGHVYR 349

Query: 316 SFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
              P+   +  +      +T N+KRV E W DE +K + + R P     D GD+++Q
Sbjct: 350 GLAPFPNPRGTN-----FVTRNFKRVAEVWMDE-YKQFLFERNPEYDKTDAGDLTKQ 400


>gi|397513817|ref|XP_003827204.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           3 [Pan paniscus]
          Length = 517

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 141/332 (42%), Positives = 194/332 (58%), Gaps = 17/332 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R   C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 34  VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 93

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S+I RTP   + EIILVDDFS+  D  ++L         KV+ +RN ER+GL+R+R
Sbjct: 94  RTIRSVINRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 148

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 149 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 205

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +  +R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 206 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 265

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N
Sbjct: 266 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 319

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
            KR  E W DE +K Y+Y   P A+    G++
Sbjct: 320 TKRTAEVWMDE-YKQYYYAARPFALERPFGNV 350


>gi|348574564|ref|XP_003473060.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
           [Cavia porcellus]
          Length = 552

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 141/335 (42%), Positives = 197/335 (58%), Gaps = 17/335 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R   C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERISSNRAVPDTRHPRCTLLGYHTDLPPTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   ++EIILVDDFS+  D  ++L         KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSNDPDDCKQLVRL-----PKVKCLRNGERQGLVRSR 183

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA+ ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 184 MRGAEIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +  +R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 241 SASELRGGFDWSLHFRWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            KR  E W DE +K Y+Y   P A+    G+I  +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNIESR 388


>gi|296212534|ref|XP_002752871.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
           [Callithrix jacchus]
          Length = 578

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 157/372 (42%), Positives = 209/372 (56%), Gaps = 25/372 (6%)

Query: 12  NLEPPLEPYKEGP-------GEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           N E   +P  E P       GE GKA    L E      +  +  Y +N+  S+ IS  R
Sbjct: 55  NTEDLSQPLYEKPPADSHALGEWGKASKLRLNEGELKQQEELIERYAINIYLSDRISLHR 114

Query: 63  TIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 121
            I D RM ECK   +    LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EIILVD
Sbjct: 115 HIEDKRMYECKSKKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVD 174

Query: 122 DFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 181
           D S +  L  +LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE  
Sbjct: 175 DLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECN 233

Query: 182 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELP 240
             WL PLL  I  D   +  PVID ID+ T+EF     EP     G F+W + ++ + +P
Sbjct: 234 SGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVP 290

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
           + E  +R    +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG
Sbjct: 291 KHERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 350

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
            +E  PCS +GHV+    PY           P    N  R  E W DE +K +FY R P 
Sbjct: 351 KLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPP 400

Query: 361 AMFLDMGDISEQ 372
           A     GDISE+
Sbjct: 401 ARKEAYGDISER 412


>gi|296224175|ref|XP_002757934.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
           [Callithrix jacchus]
          Length = 552

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 141/335 (42%), Positives = 195/335 (58%), Gaps = 17/335 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R   C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   + EIILVDDFS+  D  Q+L         KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIREIILVDDFSNDPDDCQQLIKL-----PKVKCLRNNERQGLVRSR 183

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +  +R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            KR  E W DE +K Y+Y   P A+    G++  +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNVESR 388


>gi|444509912|gb|ELV09433.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Tupaia chinensis]
          Length = 566

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 145/346 (41%), Positives = 201/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y +DLP  SVI+
Sbjct: 79  KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSMSYSVDLPATSVII 137

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +R
Sbjct: 138 TFHNEARSTLLRTVRSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 192

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA  +   V+ FLD+HCEV   WL P+L  +  D   +  P+ID I 
Sbjct: 193 NDRREGLIRSRVRGADVAAAAVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVIS 252

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D+
Sbjct: 253 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLDQKMTRTDPTRPIRTPVIAGGIFVIDK 309

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF      
Sbjct: 310 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 364

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 365 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 408


>gi|397513813|ref|XP_003827202.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           1 [Pan paniscus]
          Length = 552

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 141/335 (42%), Positives = 195/335 (58%), Gaps = 17/335 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R   C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S+I RTP   + EIILVDDFS+  D  ++L         KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVINRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 183

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +  +R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            KR  E W DE +K Y+Y   P A+    G++  +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNVESR 388


>gi|363730187|ref|XP_418741.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 2 [Gallus gallus]
          Length = 638

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 142/366 (38%), Positives = 206/366 (56%), Gaps = 23/366 (6%)

Query: 5   KADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLG--EYGMNMETSNHISFDR 62
           + + K G+ E  L     G G G           A G+  LG   +G N   S  I   R
Sbjct: 123 RPEAKEGDPESQLLSLPLGDGNGA----------ATGERPLGLETHGFNEALSERIPLRR 172

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
            +P++R   C   +Y   LP ASVI+ FH+E +S+L+RTVHSI+   P   L++IILVDD
Sbjct: 173 ELPEVRHPLCLQQEYDSSLPTASVIICFHDEAWSTLLRTVHSILNTAPKASLKDIILVDD 232

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L   L +YI + +G V+LIR+  R G+IR R  GA  + G+V+VF+D+HCE   
Sbjct: 233 LSQQGPLKSALSEYISKLDG-VKLIRSNRRLGVIRGRMLGAARATGDVLVFMDSHCECQK 291

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLLA + S+R  +  P+ID ID++T+++   Y     +RG+F+W + +    +PE 
Sbjct: 292 GWLEPLLARLSSNRNSVVSPIIDVIDWKTFQY---YHSVSLHRGVFDWKLDFHWEPVPEH 348

Query: 243 EAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSI 302
           E K R+  + P +SP  AG + AMDR +F  +G YD  + +WG EN ELS + W+CGGS+
Sbjct: 349 EEKVRQSPTSPIRSPAVAGAVVAMDRHYFQNIGAYDSDMTMWGAENLELSIRTWLCGGSV 408

Query: 303 EWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM 362
           E +PCSR+GHVYR  +P+ F           I  N  R+ ETW D   K  FY  + +A 
Sbjct: 409 EIIPCSRVGHVYRHHIPHAFS------YEEAIVRNKIRIAETWLD-SFKENFYKNDTVAF 461

Query: 363 FLDMGD 368
            +   +
Sbjct: 462 LISKAE 467


>gi|269115411|gb|ACZ26277.1| N-acetyl galactosaminyl transferase-like protein [Mayetiola
           destructor]
          Length = 638

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 157/379 (41%), Positives = 219/379 (57%), Gaps = 37/379 (9%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  + G GE G+   + +   A         G N   S++IS +R++ D+R ++C    Y
Sbjct: 88  EKQRTGIGEHGEPAFVADNEEAERKRLFDLNGFNALLSDYISINRSVKDIRHKDCAKIKY 147

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             +LP  SV++ F NE FS+L+RTV+S++ R+PA+ + EIILVDD S++ ++ + L++YI
Sbjct: 148 LSELPSVSVVVPFFNEHFSTLLRTVYSVLNRSPAELIMEIILVDDASNRDNVKKPLDNYI 207

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA------- 190
            +   KV+LIR  ER GLI  R  GA+ ++G+V++FLD+H E   NWLPPLL        
Sbjct: 208 AKHLPKVKLIRLPERSGLILARLAGARAAKGDVLIFLDSHTEPNTNWLPPLLGKNEQNEI 267

Query: 191 --------------PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 236
                         PI  + K+   P ID I Y T+E+R+    D   RG F+W   YK 
Sbjct: 268 ILFSENKNKKTQTEPIAENYKVCMCPFIDVISYDTFEYRA---QDEGARGAFDWQFYYKR 324

Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
             L E +    K+ + P+KSP  AGGLFA+   FF ELGGYD GL +WGGE +ELSFKIW
Sbjct: 325 LPLLEDDL---KHPTRPFKSPVMAGGLFAISAKFFWELGGYDDGLDIWGGEQYELSFKIW 381

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV--KGPLITYNYKRVIETWFDEKHKAYF 354
            CGG +   PCSR+GH+YR       G +A     KG  +  NYKRV E W DE +K Y 
Sbjct: 382 QCGGEMYDAPCSRVGHIYRG------GGIAQPTGRKGDFLHKNYKRVAEVWMDE-YKEYL 434

Query: 355 YTREPLAM-FLDMGDISEQ 372
           Y REP     +D GD+++Q
Sbjct: 435 YKREPERYEAIDAGDLTKQ 453


>gi|441661684|ref|XP_004091530.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
           [Nomascus leucogenys]
          Length = 535

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 140/332 (42%), Positives = 194/332 (58%), Gaps = 17/332 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R   C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 52  VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 111

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   + EIILVDDFS+  D  ++L         KV+ +RN ER+GL+R+R
Sbjct: 112 RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLVKL-----PKVKCLRNNERQGLVRSR 166

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 167 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 223

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +  +R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 224 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 283

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N
Sbjct: 284 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 337

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
            KR  E W DE +K Y+Y   P A+    G++
Sbjct: 338 TKRTAEVWMDE-YKQYYYAARPFALERPFGNV 368


>gi|195115611|ref|XP_002002350.1| GI13183 [Drosophila mojavensis]
 gi|193912925|gb|EDW11792.1| GI13183 [Drosophila mojavensis]
          Length = 655

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 149/358 (41%), Positives = 209/358 (58%), Gaps = 24/358 (6%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+   +  + +          G N   S+ IS +R++PD+R E CK   Y   LP
Sbjct: 130 GLGEHGQPASVDPSEKELEQQEYRRNGFNGYLSDRISVNRSVPDVRKEACKTRKYLAKLP 189

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFN 141
             SVI +F+NE F +L+R+++SI+ RTP + L++I+LVDD S    L + L+DY+  ++ 
Sbjct: 190 NVSVIFIFYNEHFQTLLRSIYSIVNRTPPELLKQIVLVDDGSEWDTLKKHLDDYVALQWP 249

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
             V ++ N ER GLI  R  GAK + GEV+VF D+H EV  NWLPPLL PI  + KI T 
Sbjct: 250 KLVDVVHNPERRGLIGARLAGAKVATGEVMVFFDSHIEVNYNWLPPLLEPIVINNKIATC 309

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKKRKYNSEPYKSPTHA 260
           P++D ID+  + +   Y+     RG F+W   YK+   LPE    K    S PY+SP   
Sbjct: 310 PIVDIIDHNNFAYNGGYQ--EGSRGGFDWRFFYKQLPVLPEDSVDK----SLPYRSPVMM 363

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM-- 318
           GGLFA++  +F +LGGYD  L +WGGE +ELSFKIWMCGG +  VPCSR+ H++R  M  
Sbjct: 364 GGLFAINSKWFWDLGGYDDELEIWGGEQYELSFKIWMCGGMLLDVPCSRVAHIFRGQMDP 423

Query: 319 ---PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM-FLDMGDISEQ 372
              P N+           +  N+KRV E W DE +K + Y R+P     +D GD++ Q
Sbjct: 424 RPNPRNYN---------FVARNHKRVAEVWMDE-YKEFVYKRDPATYNNIDAGDLTRQ 471


>gi|7657112|ref|NP_056552.1| polypeptide N-acetylgalactosaminyltransferase 4 [Mus musculus]
 gi|51315802|sp|O08832.1|GALT4_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
           AltName: Full=Polypeptide GalNAc transferase 4;
           Short=GalNAc-T4; Short=pp-GaNTase 4; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 4;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 4
 gi|2121220|gb|AAB58301.1| polypeptide GalNAc transferase-T4 [Mus musculus]
 gi|26329157|dbj|BAC28317.1| unnamed protein product [Mus musculus]
 gi|34786032|gb|AAH57882.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 [Mus musculus]
 gi|74140684|dbj|BAE31844.1| unnamed protein product [Mus musculus]
 gi|74195122|dbj|BAE28303.1| unnamed protein product [Mus musculus]
 gi|148689697|gb|EDL21644.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 [Mus musculus]
          Length = 578

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 149/362 (41%), Positives = 204/362 (56%), Gaps = 16/362 (4%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           ++PP + +  G         L E      +  +  Y +N+  S+ IS  R I D RM EC
Sbjct: 65  IKPPADSHALGEWGRASKLQLNEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYEC 124

Query: 73  KYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
           K   +    LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EIILVDD S +  L  
Sbjct: 125 KAKKFHYRSLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRIYLKA 184

Query: 132 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 191
           +LE YI     +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE    WL PLL  
Sbjct: 185 QLETYISNLE-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLER 243

Query: 192 IYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
           I  D   +  PVID ID+ T+EF     EP     G F+W + ++ + +P+ E  +R   
Sbjct: 244 ISRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRTSR 300

Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
            +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG +E  PCS +
Sbjct: 301 IDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHV 360

Query: 311 GHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           GHV+    PY           P    N  R  E W DE +K +FY R P A     GD+S
Sbjct: 361 GHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPARKEAYGDLS 410

Query: 371 EQ 372
           E+
Sbjct: 411 ER 412


>gi|345781283|ref|XP_853759.2| PREDICTED: LOW QUALITY PROTEIN:
           UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 5 [Canis lupus
           familiaris]
          Length = 559

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 207/351 (58%), Gaps = 14/351 (3%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           E  G+ GK ++        G   L +YG N   S  +  D  +PD R + C    YP  L
Sbjct: 91  ETAGKLGKDFNYSNPEFIDG---LLKYGFNTILSKSLGSDSKVPDTRNKMCLQKRYPAKL 147

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P ASVI+ FHNE F++L RT+ S+   TP   LEEIILVDD S   DL +KL+ +++ F 
Sbjct: 148 PTASVIICFHNEEFNALFRTLSSVGNLTPHYILEEIILVDDMSDFDDLKEKLDHHLEIFR 207

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
           GK+++IRN +REGL+R+R  GA  + G+V+VFLD+HCEV   WL PLL  I  D K++  
Sbjct: 208 GKIKVIRNKKREGLVRSRLIGASRASGDVLVFLDSHCEVNTAWLQPLLHAIAKDSKMVVC 267

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
           P+ID ID  T E    Y+     RG F W + +K + +   E    +  + P +SP  AG
Sbjct: 268 PLIDVIDSMTLE----YQSSPVVRGAFNWHLDFKWDSVYSYEMDGPEGPTRPIRSPAMAG 323

Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
           G+FA++R +F E+G YD G+ +WG EN ELS +IWMCGG +  +PCSR+GH+ +      
Sbjct: 324 GIFAINRHYFNEIGQYDKGMDLWGAENLELSLRIWMCGGQLFIIPCSRVGHISKQ----R 379

Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           F    + VK   +TYN  R++  W DE +K  F+ ++P    +  G+ISE+
Sbjct: 380 FSNQPELVKA--MTYNNLRLVHVWLDE-YKEQFFLQQPGLKSVAYGNISER 427


>gi|395849607|ref|XP_003797413.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
           [Otolemur garnettii]
          Length = 558

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 145/346 (41%), Positives = 201/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y LDLP  SVI+
Sbjct: 71  KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSLDLPATSVII 129

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA  +   ++ FLD+HCEV   WL P+L  +  D   +  P+ID I 
Sbjct: 185 NDRREGLIRSRVRGADVATAAILTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVIS 244

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF      
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400


>gi|221042368|dbj|BAH12861.1| unnamed protein product [Homo sapiens]
          Length = 517

 Score =  268 bits (686), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 141/332 (42%), Positives = 194/332 (58%), Gaps = 17/332 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R IPD R   C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 34  VGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 93

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   + EIILVDDFS+  D  ++L         KV+ +RN ER+GL+R+R
Sbjct: 94  RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 148

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 149 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 205

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +  +R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 206 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 265

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N
Sbjct: 266 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 319

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
            KR  E W DE +K Y+Y   P A+    G++
Sbjct: 320 TKRTAEVWMDE-YKQYYYAARPFALERPFGNV 350


>gi|297265738|ref|XP_001104879.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           2 [Macaca mulatta]
          Length = 532

 Score =  268 bits (686), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 146/366 (39%), Positives = 206/366 (56%), Gaps = 17/366 (4%)

Query: 6   ADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
           AD  L + +P    + +   +  +  +L       GD     Y  N   S  IS +R IP
Sbjct: 15  ADSGLSSSQPSDADWDDLWDQFDERRYLNAKKWRVGDDPYKLYAFNQRESERISSNRAIP 74

Query: 66  DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
           D R   C    Y  DLP  S+I+ FHNE  S+L+RT+ S++ RTP   + EIILVDDFS+
Sbjct: 75  DTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIREIILVDDFSN 134

Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
             D  ++L         KV+ +RN ER+GL+R+R RGA  ++G  + FLD+HCEV  +WL
Sbjct: 135 DPDDCKQLIRL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWL 189

Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
            PLL  +  D   +  PVID I+  T+ +    E     RG F+W + ++  +L   +  
Sbjct: 190 QPLLHRVKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKA 246

Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
           +R   +EP ++P  AGGLF +D+A+F  LG YD  + +WGGENFE+SF++WMCGGS+E V
Sbjct: 247 RRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIV 306

Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMF 363
           PCSR+GHV+R   PY F        G   TY  N KR  E W DE +K Y+Y   P A+ 
Sbjct: 307 PCSRVGHVFRKKHPYVFP------DGNANTYIKNTKRTAEVWMDE-YKQYYYAARPFALE 359

Query: 364 LDMGDI 369
              G++
Sbjct: 360 RPFGNV 365


>gi|432096894|gb|ELK27469.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Myotis davidii]
          Length = 940

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 148/359 (41%), Positives = 216/359 (60%), Gaps = 12/359 (3%)

Query: 16  PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
           P +P  + PG+ G+   +P+      +    E   N+  S+ I  DR I D R   C   
Sbjct: 432 PRDP--KAPGQFGRPVLVPQGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPVGCAKQ 489

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
               +LP  SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LVDDFS+K  L   L+ 
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDK 549

Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           Y+ +F  KVR++   ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y  
Sbjct: 550 YMSQF-PKVRILHLKERHGLIRARLAGAQIATGDVLTFLDSHVECNIGWLEPLLERVYLS 608

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
           RK +  PVI+ I+ +   + +V   D+  RGIF W M +    +P +  AK R   ++  
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDVI 665

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           + P  AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 666 RCPVMAGGLFSIDKNYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 725

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE-KHKAYFYTREPLAMFLDMGDISEQ 372
           R+  PY+F K  DR+K   +  N  RV E W DE K   Y +    +   LD+G++++Q
Sbjct: 726 RNDNPYSFPK--DRMKT--VERNLVRVAEVWLDEYKELFYGHGNHLIDQGLDVGNLTQQ 780


>gi|440907821|gb|ELR57918.1| Polypeptide N-acetylgalactosaminyltransferase 14, partial [Bos
           grunniens mutus]
          Length = 509

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 142/333 (42%), Positives = 194/333 (58%), Gaps = 19/333 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  I+ +R +PD R+  C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 26  VGDDPYKLYAFNQRESERIASNRVVPDTRLFRCTLLVYCADLPPTSIIIAFHNEARSTLL 85

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
           RT+ SI+ RTP   ++EIILVDDFS+        ED  Q     KV+ +RN ER+GL+R+
Sbjct: 86  RTIRSILNRTPMNLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNNERQGLVRS 139

Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
           R RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I   T+ +    
Sbjct: 140 RIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFNY---I 196

Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
           E     RG F+W + ++  +L   +  +R   +EP ++P  AGGLF MD+++F  LG YD
Sbjct: 197 ESASELRGGFDWSLHFQWEQLTPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFYYLGKYD 256

Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
             + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  
Sbjct: 257 TDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYIFP------DGNANTYIK 310

Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           N KR  E W DE +K Y+Y   P A+    G+I
Sbjct: 311 NTKRTAEVWMDE-YKQYYYASRPFALERPFGNI 342


>gi|344288741|ref|XP_003416105.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
           [Loxodonta africana]
          Length = 552

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 142/332 (42%), Positives = 196/332 (59%), Gaps = 17/332 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R   C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCNLLVYCTDLPPTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   ++EIILVDDFSS  D D KL   +     KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSSDPD-DCKLLIKL----PKVKCVRNNERQGLVRSR 183

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            +GA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 184 IQGAGIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---IE 240

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +  +R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDS 300

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGGS+E +PCSR+GHV+R   PY F        G   TY  N
Sbjct: 301 EMDIWGGENFEMSFRVWMCGGSLEIIPCSRVGHVFRKKHPYIFP------DGNTNTYIKN 354

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
            KR  E W DE +K Y+Y   P A+    G+I
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNI 385


>gi|68342011|ref|NP_001020319.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 5 [Rattus
           norvegicus]
 gi|50926898|gb|AAH78995.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 5 [Rattus
           norvegicus]
          Length = 443

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 137/328 (41%), Positives = 203/328 (61%), Gaps = 11/328 (3%)

Query: 45  LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
           L  YG+N+ TS  +  +R +PD R + C+   YP +LP ASVI+ F+NE F++L+RTV S
Sbjct: 83  LSRYGLNVITSRRLGIERQVPDSRNKICQQKHYPFNLPTASVIICFYNEEFNTLLRTVSS 142

Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           ++  +P   LEEIILVDD S   DL  KL+ +++ F GK++L+RN +REGLIR+R  GA 
Sbjct: 143 VMNLSPKHLLEEIILVDDMSEFDDLKAKLDYHLEIFRGKIKLVRNKKREGLIRSRMIGAS 202

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            + G+++VFLD+HCEV   WL PLL  I  D K++  PVID ID  T ++  V  P    
Sbjct: 203 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPVIDVIDELTLDY--VGSP--IV 258

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RG F+W + ++ +++   E    +  S P +SP  +GG+FA++R +F ELG YD  + +W
Sbjct: 259 RGAFDWNLNFRWDDVFSYELDGPEGPSTPIRSPAMSGGIFAINRHYFNELGQYDKDMDLW 318

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
           GGEN ELS +IWMCGG +  +PCSR+GH  ++            V    ++ N  RV+  
Sbjct: 319 GGENVELSLRIWMCGGQLFILPCSRVGHNNKALSKNRL------VNQSALSKNLLRVVHV 372

Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           W DE +K  F+ + P    +  G+IS++
Sbjct: 373 WLDE-YKENFFLQRPSLTHVSCGNISDR 399


>gi|60498976|ref|NP_078848.2| polypeptide N-acetylgalactosaminyltransferase 14 isoform 1 [Homo
           sapiens]
 gi|51316071|sp|Q96FL9.1|GLT14_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 14;
           AltName: Full=Polypeptide GalNAc transferase 14;
           Short=GalNAc-T14; Short=pp-GaNTase 14; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 14;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 14
 gi|14714999|gb|AAH10659.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 (GalNAc-T14) [Homo
           sapiens]
 gi|21749654|dbj|BAC03634.1| unnamed protein product [Homo sapiens]
 gi|28268674|dbj|BAC56889.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 [Homo sapiens]
 gi|37182635|gb|AAQ89118.1| RRLT2434 [Homo sapiens]
 gi|119620891|gb|EAX00486.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
           isoform CRA_a [Homo sapiens]
 gi|325463357|gb|ADZ15449.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 (GalNAc-T14)
           [synthetic construct]
 gi|345500006|emb|CAA70505.4| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 14 [Homo
           sapiens]
          Length = 552

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 141/335 (42%), Positives = 195/335 (58%), Gaps = 17/335 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R IPD R   C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   + EIILVDDFS+  D  ++L         KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 183

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +  +R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            KR  E W DE +K Y+Y   P A+    G++  +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNVESR 388


>gi|307173963|gb|EFN64693.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Camponotus
           floridanus]
          Length = 597

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 149/373 (39%), Positives = 218/373 (58%), Gaps = 16/373 (4%)

Query: 5   KADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 64
           K + K+  LE  + P   G GE GK  +L    +  G+A+L +  +N+  SN IS  R +
Sbjct: 69  KYEDKILKLEYNVVP---GLGENGKPAYLYGKDKFQGEAALKKKALNVILSNKISLTRKL 125

Query: 65  PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
           PD+R   C    Y   LP ASV+++F+NE +S L+RTVHS++K +P   L+EIILVDD S
Sbjct: 126 PDIRNSLCMNITYDKLLPSASVVIIFYNEPWSVLLRTVHSVLKGSPPHLLKEIILVDDHS 185

Query: 125 SKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLN 183
            + +L  +L+ Y+  R   KV+L+R + R+GLIR R  GA+ ++G+V+VFLDAHCEV  +
Sbjct: 186 EEEELQGQLDYYLSTRLPAKVKLLRLSHRQGLIRARLHGARNAKGDVLVFLDAHCEVIKD 245

Query: 184 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPERE 243
           WL PLL  I  ++  + +P+ID I  +T E+    E      G F W   +    + + E
Sbjct: 246 WLQPLLQRIKDNKNAVLMPIIDNISEETLEYFHDNEASFFQVGGFTWSGHFTWINIQKHE 305

Query: 244 AKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIE 303
            + R     P +SPT AGGLFA++R +F E+G YD  +  WGGEN E+SF+IW CGG++E
Sbjct: 306 VESRPSPISPTRSPTMAGGLFAINRKYFWEIGSYDDKMDGWGGENLEMSFRIWQCGGTLE 365

Query: 304 WVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMF 363
            +PCSR+GH++R+F PY F    D         N  R+   W D   + +   R   + F
Sbjct: 366 IIPCSRVGHIFRNFHPYKFPNDKDTH-----GINTARLAFVWMDGYKRLFLLHR---SEF 417

Query: 364 LD----MGDISEQ 372
            D     GD+SE+
Sbjct: 418 KDNPKLFGDVSER 430


>gi|443720284|gb|ELU10082.1| hypothetical protein CAPTEDRAFT_93071, partial [Capitella teleta]
          Length = 518

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 143/353 (40%), Positives = 212/353 (60%), Gaps = 14/353 (3%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYG-----MNMETSNHISFDRTIPDLRMEECKYWDYPL 79
           GE GK  ++ ++  +  +    E G      N   S+ +S  RT+PD+R +EC+  +Y  
Sbjct: 2   GENGKGLNIDKSKLSPEELKKYEKGYQRNAFNQYASDQMSLHRTLPDVRDKECRDRNYAT 61

Query: 80  DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
           +LP  S+I++FHNE +S L+RTV S + R+P   ++EIILVDDFS    L   L+++   
Sbjct: 62  ELPDTSIIVIFHNEAWSVLLRTVFSCLDRSPGHLVKEIILVDDFSDFEHLQAPLQEFADS 121

Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
              KVRL+R  +REGLIR R  GA  ++G V+ FLD+HCE  + WL PLL  I  ++  +
Sbjct: 122 -QEKVRLVRAKKREGLIRARLLGASVAQGNVLTFLDSHCECTMGWLEPLLDRISQNKSNV 180

Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTH 259
             PVID I+  T +++          G F+W + +  + +P+ E K+RK + +P +SPT 
Sbjct: 181 VTPVIDVINDDTIQYQYSSAKSTSVGG-FDWNLQFNWHGIPDHEKKRRKSDVDPVRSPTM 239

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLF++ R +F  LG YDPG+ +WGGEN ELSF+IWMCGGS++  PCS +GH++R   P
Sbjct: 240 AGGLFSISREYFEYLGTYDPGMDIWGGENLELSFRIWMCGGSLDIAPCSHVGHIFRKRSP 299

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           Y++    + VK      N  R+ E W DE  K Y+Y R    +  D GD+S +
Sbjct: 300 YSWKTGVNVVKK-----NSIRLAEVWLDEFSK-YYYERFNYDLG-DYGDVSAR 345


>gi|148670721|gb|EDL02668.1| mCG7620, isoform CRA_b [Mus musculus]
          Length = 667

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 144/345 (41%), Positives = 202/345 (58%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY +A     GE     +  N   S+ +S DR I D R   C    Y  DLP  SVI+ 
Sbjct: 180 KAYLSAKQLKPGEDPYRQHAFNQLESDKLSSDRPIRDTRHYSCPSLSYSSDLPATSVIIT 239

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 240 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 294

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
            +REGLIR+R RGA  +   V+ FLD+HCEV + WL P+L  +  D   +  P+ID I  
Sbjct: 295 DKREGLIRSRVRGADVAGATVLTFLDSHCEVNVEWLQPMLQRVMEDHTRVVSPIIDVISL 354

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D++
Sbjct: 355 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKS 411

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 412 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 465

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 466 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 509


>gi|332227139|ref|XP_003262748.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           1 [Nomascus leucogenys]
          Length = 552

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 140/335 (41%), Positives = 195/335 (58%), Gaps = 17/335 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R   C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   + EIILVDDFS+  D  ++L         KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLVKL-----PKVKCLRNNERQGLVRSR 183

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +  +R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            KR  E W DE +K Y+Y   P A+    G++  +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNVESR 388


>gi|50510795|dbj|BAD32383.1| mKIAA1130 protein [Mus musculus]
          Length = 655

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 144/343 (41%), Positives = 201/343 (58%), Gaps = 22/343 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY +A     GE     +  N   S+ +S DR I D R   C    Y  DLP  SVI+ 
Sbjct: 168 KAYLSAKQLKPGEDPYRQHAFNQLESDKLSSDRPIRDTRHYSCPSLSYSSDLPATSVIIT 227

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 228 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 282

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
            +REGLIR+R RGA  +   V+ FLD+HCEV + WL P+L  +  D   +  P+ID I  
Sbjct: 283 DKREGLIRSRVRGADVAGATVLTFLDSHCEVNVEWLQPMLQRVMEDHTRVVSPIIDVISL 342

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D++
Sbjct: 343 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKS 399

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 400 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 453

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++
Sbjct: 454 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVA 495


>gi|432107114|gb|ELK32537.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Myotis davidii]
          Length = 518

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 145/345 (42%), Positives = 200/345 (57%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY AA     GE     +  N   S+ ++ DR I D R   C    Y  DLP  SVI+ 
Sbjct: 28  KAYLAAKQLKPGEDPYRQHAFNQLESDKLTSDRPIRDTRHYSCPSLSYSSDLPATSVIIT 87

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 88  FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 142

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
             REGLIR+R RGA  +   V+ FLD+HCEV   WL PLL  +  D   +  P+ID I  
Sbjct: 143 DRREGLIRSRVRGADVATAAVLTFLDSHCEVNTEWLQPLLQRVQEDHTRVVSPIIDVISL 202

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D++
Sbjct: 203 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 259

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 260 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 313

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 314 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVASR 357


>gi|285026454|ref|NP_001165534.1| polypeptide N-acetylgalactosaminyltransferase 6 [Rattus norvegicus]
          Length = 622

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 152/370 (41%), Positives = 216/370 (58%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+   E         D    ++  N   S+ IS  R++ PD R  
Sbjct: 106 PPQDP--NSPGADGKAFQKKEWTLLETQEKDEGYKKHCFNAFASDRISLQRSLGPDTRPP 163

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  LP  SV++VFHNE +S+L+RTV+S++  +PA  L+EIILVDD S+  
Sbjct: 164 ECVDQKFRRCP-PLPTTSVVIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASTDE 222

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L +KLE Y+Q+    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 223 HLKEKLERYVQQLQ-IVRVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D+  +  P I  ID  T++F + +     H RG F+W + +    LPE E ++
Sbjct: 282 LLARIAEDKTAVVSPDIVTIDLNTFQFSKPMRRGKAHSRGNFDWSLTFGWEMLPEHEKQR 341

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +A+F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL-- 364
           CS +GHV+R+  P+ F K        +I  N  R+ E W D+ +K  FY R   A  +  
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDD-YKKIFYRRNLQAAKMAK 455

Query: 365 --DMGDISEQ 372
             + GD+SE+
Sbjct: 456 ENNFGDVSER 465


>gi|340378190|ref|XP_003387611.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Amphimedon queenslandica]
          Length = 512

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 140/326 (42%), Positives = 190/326 (58%), Gaps = 17/326 (5%)

Query: 49  GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
             N E S+  S DR +PD R   C    Y   LP  SVI+ FHNE  S+L+RT+ S++ R
Sbjct: 54  AFNQEASDKTSIDRKVPDTRHSWCYNQVYHPTLPSTSVIITFHNEARSTLLRTIVSVLNR 113

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
           +P   +EEIILVDDFS   +    L         K++LIRN  REGL+R+R  GA  ++G
Sbjct: 114 SPPHLIEEIILVDDFSEDVNTGLLLTQM-----PKIKLIRNERREGLVRSRIFGADAAKG 168

Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
           E++ FLD+HCE  + WL PLL  +  DR I+  P+ID I   T+++          RG F
Sbjct: 169 EILTFLDSHCECNIGWLEPLLHRVSQDRTIVVSPIIDVISMDTFDYIGASS---ELRGGF 225

Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
           +W + +K +     +  KRK   EP K+P  AGGLF+++R  F+E G YD  + +WGGEN
Sbjct: 226 DWSLHFKWDGFTPAQRAKRKSPIEPIKTPMIAGGLFSINRQRFIETGKYDDQMDIWGGEN 285

Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIETWF 346
           FE+SF+ WMCGGS+E +PCSR+GHV+R   PY F        G  +TY  N KR  E W 
Sbjct: 286 FEISFRTWMCGGSLEIIPCSRVGHVFRKRHPYVFP------GGNAMTYMKNTKRAAEVWM 339

Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
           D  +K Y+Y+  P A   DMG I  +
Sbjct: 340 DN-YKDYYYSARPSAKGRDMGSIKSR 364


>gi|351709330|gb|EHB12249.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Heterocephalus
           glaber]
          Length = 582

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 149/361 (41%), Positives = 204/361 (56%), Gaps = 16/361 (4%)

Query: 14  EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
           +PP + +  G         L E      +  +  Y +N+  S+ IS  R I D RM ECK
Sbjct: 70  KPPADSHALGEWGRASKLELGEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMSECK 129

Query: 74  YWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
              Y    LP  SV++ F+NE +S+L+RT+HS+++ +PA  L+EIILVDD S +  L  +
Sbjct: 130 SKTYDYRRLPTTSVVIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKAQ 189

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE YI     +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE    WL PLL  I
Sbjct: 190 LETYISSLE-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERI 248

Query: 193 YSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 251
             D   +  PVID ID+ T+EF     EP     G F+W + ++ + +P++E  +R    
Sbjct: 249 GRDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKQERDRRTSRI 305

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
           +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG +E  PCS +G
Sbjct: 306 DPIRSPTMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVG 365

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+    PY           P    N  R  E W D+ +K +FY R P A     GDISE
Sbjct: 366 HVFPKRAPY---------ARPNFLQNTARAAEVWMDD-YKEHFYNRNPPARKEAYGDISE 415

Query: 372 Q 372
           +
Sbjct: 416 R 416


>gi|345803601|ref|XP_537492.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Canis lupus
           familiaris]
          Length = 557

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 145/345 (42%), Positives = 200/345 (57%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY AA     GE     +  N   S+ +S DR I D R   C    Y  DLP  SVI+ 
Sbjct: 71  KAYLAAKQLKAGEDPYRQHAFNQLESDKLSPDRAIRDTRHYSCPSVSYSADLPATSVIIT 130

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
             REGLIR+R RGA  +   V+ FLD+HCEV   WL P+L  +  D   +  P+ID I  
Sbjct: 186 DRREGLIRSRVRGADVATAAVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 245

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 302

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400


>gi|327279823|ref|XP_003224655.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Anolis carolinensis]
          Length = 941

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 152/373 (40%), Positives = 218/373 (58%), Gaps = 18/373 (4%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           VF  D   G  +P         G+ G+   +P   +        E   N+  S+ I  DR
Sbjct: 424 VFSIDKTFGPRDP------NAAGQFGRPAVVPNEKQEEAKRRWNEGNFNVYLSDMIPIDR 477

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
            I D R   C       DLP  S+I+ F +E +S+L+R+VHS++ R+P Q ++EIILVDD
Sbjct: 478 AIDDTRPIGCSDILVHNDLPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPQLIKEIILVDD 537

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
           FS+K  L  KL+ Y+ +F  KVR++   ER GLIR R  GA+ ++G+V+ FLD+H E  +
Sbjct: 538 FSTKEYLKDKLDKYMAQF-PKVRILHLKERYGLIRARLAGAEIAKGDVLTFLDSHVECNV 596

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
            WL PLL  I+ +RK +  PVI+ I  +   + +V   D+  RGIF W M +    +P  
Sbjct: 597 GWLEPLLERIHLNRKKVPCPVIEVISDKDMSYMTV---DNFQRGIFNWPMNFGWKPIPPD 653

Query: 243 EAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
             +K K   ++  + P  AGGLF++D+ +F ELG YDPGL VWGGEN E+SFK+WMCGG 
Sbjct: 654 VIEKNKIKETDVIRCPVMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKVWMCGGE 713

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EP 359
           IE +PCSR+GH++RS  PY+F K  DR+    +  N  RV E W D+ +K  FY      
Sbjct: 714 IEIIPCSRVGHIFRSDNPYSFPK--DRLT--TVERNLARVAEVWLDD-YKDLFYGHGYHL 768

Query: 360 LAMFLDMGDISEQ 372
           +   LD+GD+++Q
Sbjct: 769 VQKNLDVGDLTQQ 781


>gi|426223372|ref|XP_004005849.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Ovis
           aries]
          Length = 552

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 141/336 (41%), Positives = 195/336 (58%), Gaps = 19/336 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  I+ +R +PD R+  C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERIASNRVVPDTRLFRCTLLVYCADLPPTSIIIAFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
           RT+ SI+ RTP   ++EIILVDDFS+        ED  Q     KV+ +RN ER+GL+R+
Sbjct: 129 RTIRSILNRTPMNLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNNERQGLVRS 182

Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
           R RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I   T+ +    
Sbjct: 183 RIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFNY---I 239

Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
           E     RG F+W + ++  +L   +  +R   +EP ++P  AGGLF MD+++F  LG YD
Sbjct: 240 ESASELRGGFDWSLHFQWEQLTPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFYYLGKYD 299

Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
             + +WGGENFE+SF++WMCGGS+E +PCSR+GHV+R   PY F        G   TY  
Sbjct: 300 TDMDIWGGENFEISFRVWMCGGSLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIK 353

Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           N KR  E W DE +K Y+Y   P A+    G+I  +
Sbjct: 354 NTKRTAEVWMDE-YKQYYYASRPFALERPFGNIESR 388


>gi|427779849|gb|JAA55376.1| Putative polypeptide n-acetylgalactosaminyltransferase
           [Rhipicephalus pulchellus]
          Length = 683

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 163/415 (39%), Positives = 221/415 (53%), Gaps = 63/415 (15%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYR---AAGDASLGEYGMNMETSNHIS 59
           V  A   +G L PP  P  +GPGE G+   L +  +   A           N   S+ IS
Sbjct: 120 VDHAPAPVGVLAPPQNP--DGPGEMGRPVVLKDLTKEQEAKVKQGWDRNAFNQYISDMIS 177

Query: 60  FDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
             R++PD+R  ECK   Y  DLP  SVI+ FHNE +S L+RTVHSII R+P + L EIIL
Sbjct: 178 LHRSLPDVRDSECKDERYLKDLPSTSVIVCFHNEAWSVLLRTVHSIIDRSPPKLLHEIIL 237

Query: 120 VDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIR---------------------- 157
           VDD+S    L QKLEDY+  F  KV+++R  +REGLIR                      
Sbjct: 238 VDDYSDMPHLKQKLEDYVAHFP-KVKIVRAQKREGLIRARLLGAAAATAPVLTYLDSHCE 296

Query: 158 -------------TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 204
                         R+     +   V+ +LD+HCE    WL PLL  I  +   +  PVI
Sbjct: 297 CTEGWLEPLLDRIARNSTTVXATAPVLTYLDSHCECTEGWLEPLLDRIARNSTTVVCPVI 356

Query: 205 DGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
           D I   T+E+        HYR       G F+W + +  + +PERE ++RK++ +P  SP
Sbjct: 357 DVISDSTFEY--------HYRDSGGVNVGGFDWNLQFSWHAVPERERQRRKHSWDPVWSP 408

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
           T AGGLF++D+AFF +LG YD G  +WGGEN ELSFK WMCGG++E VPCS +GH++R  
Sbjct: 409 TMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKR 468

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            PY +     R    ++  N  R+ E W DE +K Y+Y R    +  D GD+S +
Sbjct: 469 SPYKW-----RSGVNVLRRNSVRLAEVWLDE-YKQYYYQRIGDDLG-DFGDVSAR 516


>gi|426335181|ref|XP_004029111.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           3 [Gorilla gorilla gorilla]
          Length = 517

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 140/332 (42%), Positives = 194/332 (58%), Gaps = 17/332 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R   C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 34  VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 93

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   + EIILVDDFS+  D  ++L         KV+ +RN ER+GL+R+R
Sbjct: 94  RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 148

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 149 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 205

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +  +R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 206 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 265

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N
Sbjct: 266 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 319

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
            KR  E W DE +K Y+Y   P A+    G++
Sbjct: 320 TKRTAEVWMDE-YKQYYYAARPFALERPFGNV 350


>gi|291410883|ref|XP_002721722.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 1,
           partial [Oryctolagus cuniculus]
          Length = 499

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 145/345 (42%), Positives = 200/345 (57%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY +A     GE     +  N   S+ +S DR I D R   C    Y LDLP  SVI+ 
Sbjct: 12  KAYLSAKQLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSMSYSLDLPATSVIIT 71

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 72  FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 126

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
             REGLIR+R RGA  +   ++ FLD+HCEV   WL P+L  +  D   +  P+ID I  
Sbjct: 127 DRREGLIRSRVRGADVAAAAILTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 186

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D+A
Sbjct: 187 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKITRTDPTRPIRTPVIAGGIFVIDKA 243

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 244 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 297

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 298 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 341


>gi|21464370|gb|AAM51988.1| RE10344p [Drosophila melanogaster]
          Length = 650

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 148/355 (41%), Positives = 204/355 (57%), Gaps = 27/355 (7%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           ++PP   ++E PGE GK   LP    +  + A D    +   N   S+ IS  RT+PD R
Sbjct: 136 IDPPAN-FEEDPGELGKPVRLPKEMSDEMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 194

Query: 69  MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
              CK    Y  +LPK  VI+ FHNE ++ L+RTVHS++ R+P   + +IILVDD+S   
Sbjct: 195 DAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 254

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L ++LEDY   +  KV++IR  +REGLIR R  GA  ++  V+ +LD+HCE    WL P
Sbjct: 255 HLKRQLEDYFAAY-PKVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 313

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
           LL  I  +   +  PVID I  +T E+        HYR       G F+W + +  + +P
Sbjct: 314 LLDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 365

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
           ERE K+    +EP  SPT AGGLF++DR FF  LG YD G  +WGGEN ELSFK WMCGG
Sbjct: 366 ERERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGG 425

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
           ++E VPCS +GH++R   PY +     R    +   N  R+ E W DE  + Y++
Sbjct: 426 TLEIVPCSHVGHIFRKRSPYKW-----RSGVNVPKKNSVRLAEVWMDEYSQCYYH 475


>gi|195425498|ref|XP_002061038.1| GK10725 [Drosophila willistoni]
 gi|194157123|gb|EDW72024.1| GK10725 [Drosophila willistoni]
          Length = 644

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 150/357 (42%), Positives = 207/357 (57%), Gaps = 28/357 (7%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           L PP E  +E PGE GK   LP    +  + A +    +   N   S+ IS  RT+PD R
Sbjct: 130 LLPPSE-LEETPGEMGKPVKLPKDMPDDMKKAVEDGWTKNAFNQYASDLISVHRTLPDPR 188

Query: 69  MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
              CK    Y  DLPK  VI+ FHNE +S L+RTVHS++ R+P   + ++ILVDD+S   
Sbjct: 189 DAWCKDTARYLTDLPKTDVIICFHNEAWSVLLRTVHSVLDRSPEHLIGKVILVDDYSDMP 248

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L ++LEDY   +  KV+++R  +REGLIR R  GA+ ++  V+ +LD+HCE    WL P
Sbjct: 249 HLKKQLEDYFTAYP-KVQIVRGAKREGLIRARILGAQYAKSPVLTYLDSHCECTEGWLEP 307

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
           LL  I  +   +  PVID I+  T E+        HYR       G F+W + +  + +P
Sbjct: 308 LLDRIARNSTTVVCPVIDVINDDTLEY--------HYRDSTGVNVGGFDWNLQFSWHAVP 359

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
           ERE K+   ++EP  SPT AGGLF++DR FF  LG YD G  +WGGEN ELSFK WMCGG
Sbjct: 360 EREKKRHNSSAEPVYSPTMAGGLFSIDRDFFERLGTYDSGFDIWGGENLELSFKTWMCGG 419

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           ++E VPCS +GH++R   PY +     R    ++  N  R+ E W D+ +  Y+Y R
Sbjct: 420 TLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLRKNSVRLAEVWMDD-YAQYYYHR 470


>gi|449276238|gb|EMC84873.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Columba livia]
          Length = 522

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 151/361 (41%), Positives = 210/361 (58%), Gaps = 20/361 (5%)

Query: 16  PLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
           P +PY   PGE GK     L    +   +  + +Y +N+  S+ IS  R I D R+  CK
Sbjct: 12  PPDPY--SPGEWGKPSRLQLSSEEKKQEEELIEKYAINIYLSDKISLHRHIEDNRLSGCK 69

Query: 74  YWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
              Y    LP  SVI+ F+NE +S+L+RT+HS+++ +P+  L+EIILVDD S K  L   
Sbjct: 70  AKSYNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPSVLLKEIILVDDLSDKVYLKTD 129

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE YI     +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE    WL PLL  I
Sbjct: 130 LEKYISSLK-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECVSGWLEPLLERI 188

Query: 193 YSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 251
             +  ++  PVID ID++T+E+     EP     G F+W + ++ + +P+ E  +RK  +
Sbjct: 189 AENETVIVCPVIDTIDWKTFEYYMQTAEP---MIGGFDWRLTFQWHSVPKHERLRRKSET 245

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
           +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG +E  PCS +G
Sbjct: 246 DPIRSPTMAGGLFAVSKKYFEYLGTYDTGMDVWGGENLELSFRVWQCGGMLEIHPCSHVG 305

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+    PY           P    N  R  E W DE +K +FY R P A   + GD+SE
Sbjct: 306 HVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPSARKENYGDLSE 355

Query: 372 Q 372
           +
Sbjct: 356 R 356


>gi|432096766|gb|ELK27344.1| Polypeptide N-acetylgalactosaminyltransferase 14, partial [Myotis
           davidii]
          Length = 507

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 144/332 (43%), Positives = 195/332 (58%), Gaps = 17/332 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     +  N   S  IS +R IPD R   C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 24  VGDDPYKLHAFNQRESERISSNRAIPDTRHLRCTLLMYCRDLPPTSIIITFHNEARSTLL 83

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   ++EIILVDDFS+        E+ I+    KV+ +RN +REGL+R+R
Sbjct: 84  RTIRSVLNRTPMNLIKEIILVDDFSNDPG---DCEELIKL--PKVKCLRNDQREGLVRSR 138

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 139 IRGADVAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFSY---IE 195

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +  +R   SEP ++P  AGGLF MD+++F  LG YD 
Sbjct: 196 SATELRGGFDWSLHFQWEQLSPEQKAQRLDPSEPIRTPIIAGGLFVMDKSWFNFLGKYDM 255

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N
Sbjct: 256 DMDIWGGENFEMSFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 309

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
            KR  E W DE +K YFY   P A+    GDI
Sbjct: 310 TKRTAEVWMDE-YKQYFYAARPFALERPFGDI 340


>gi|426335177|ref|XP_004029109.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           1 [Gorilla gorilla gorilla]
          Length = 552

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 140/335 (41%), Positives = 195/335 (58%), Gaps = 17/335 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R   C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   + EIILVDDFS+  D  ++L         KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 183

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +  +R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            KR  E W DE +K Y+Y   P A+    G++  +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNVESR 388


>gi|339242863|ref|XP_003377357.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
           spiralis]
 gi|316973849|gb|EFV57398.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
           spiralis]
          Length = 383

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 137/349 (39%), Positives = 206/349 (59%), Gaps = 7/349 (2%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
           GE G++ +L +               N+  S+ I  +RT+ D R   C+   Y   LP  
Sbjct: 2   GELGRSVNLNDNDSKLAKHLFQINQFNIVASDRIPLNRTLIDARRAACRNKTYSSALPTT 61

Query: 85  SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 144
           SVI+VFHNE +S+L+RTV S+I R+P + L+EIILVDD S +A L + L++++      V
Sbjct: 62  SVIIVFHNEAWSTLLRTVFSVINRSPKKLLKEIILVDDCSQRAFLKKALDNFVLNLPVPV 121

Query: 145 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 204
            ++R+ ER GLI+ R  GA+++ G+V+ FLD+HCE    WL PLL  I  DRKI   PVI
Sbjct: 122 LIVRSKERIGLIQARILGAEKASGDVLTFLDSHCECTEGWLEPLLDRIAFDRKIAVAPVI 181

Query: 205 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGL 263
           D I+ +T++++   +    YRG F W + ++    P  E K+R  + + P ++PT AGGL
Sbjct: 182 DVINDETFQYQKGIDV---YRGGFNWNLQFRWYSSPPSELKRRGNDVTHPVRTPTIAGGL 238

Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
           F++DR FF E+G YD  + +WGGEN E+SF+IW CGG +E +PCS +GHV+R   P++F 
Sbjct: 239 FSIDRQFFFEIGAYDKEMKIWGGENLEMSFRIWQCGGQLEIIPCSHVGHVFRKKSPHDFP 298

Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +         +T N  RV E W DE    ++          ++ D+SE+
Sbjct: 299 RGN---SARTLTTNLVRVAEVWMDEWKSLFYIISSAAKNISEIIDVSER 344


>gi|197099330|ref|NP_001124852.1| polypeptide N-acetylgalactosaminyltransferase 14 [Pongo abelii]
 gi|55726129|emb|CAH89838.1| hypothetical protein [Pongo abelii]
          Length = 552

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 140/335 (41%), Positives = 195/335 (58%), Gaps = 17/335 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R   C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   + EIILVDDFS+  D  ++L         KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 183

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +  +R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGHANTYIKN 354

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            KR  E W DE +K Y+Y   P A+    G++  +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNVESR 388


>gi|334310655|ref|XP_001378662.2| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1-like
           [Monodelphis domestica]
          Length = 563

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 143/346 (41%), Positives = 204/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y  DLP  S+++
Sbjct: 79  KAY-LASKLLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRCTSVHYASDLPTTSIVI 137

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +R
Sbjct: 138 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 192

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA+ +  +++ FLD+HCEV   WL P+L  +  D   +  P+ID I 
Sbjct: 193 NDRREGLIRSRVRGAEVATADILTFLDSHCEVNSEWLQPMLQRVKEDYTRVVSPIIDVIS 252

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D+
Sbjct: 253 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSRTDPTQPIRTPVIAGGIFVIDK 309

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           A+F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PY+F      
Sbjct: 310 AWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYDFP----- 364

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G I+++
Sbjct: 365 -EGNALTYIKNTKRTAEVWMDE-YKQYYYEARPSAIGKSFGSIADR 408


>gi|194225134|ref|XP_001495036.2| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1-like
           [Equus caballus]
          Length = 619

 Score =  268 bits (684), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 145/345 (42%), Positives = 201/345 (58%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY AA     GE     +  N   S+ +S DR I D R   C    Y +DLP  SVI+ 
Sbjct: 133 KAYLAAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSVDLPATSVIIT 192

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 193 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 247

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
             REGLIR+R RGA  +   V+ FLD+HCEV   WL P+L  +  D   +  P+ID I  
Sbjct: 248 DRREGLIRSRVRGADVATAAVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 307

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D++
Sbjct: 308 DNFAYLAA---SAILRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 364

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 365 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 418

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 419 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 462


>gi|405959954|gb|EKC25926.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
          Length = 569

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 144/357 (40%), Positives = 214/357 (59%), Gaps = 15/357 (4%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYG-----MNMETSNHISFDRTIPDLRMEECKYWD 76
           + PGE G  Y   ++   + +    E G      N   SN IS  R++ D R +EC    
Sbjct: 59  KAPGELGSPYIFNKSQLTSKEKLEYETGWKKNNFNEFASNRISLQRSLKDPRDKECHNLT 118

Query: 77  YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
           Y  +LP+ S+I+ FHNE +S L+R+V+SI+ RTP   L+E+ILVDDFSS   L + L+ +
Sbjct: 119 YSENLPEVSIIVTFHNEAWSVLIRSVYSILNRTPDSLLKEVILVDDFSSLEHLKEPLDQF 178

Query: 137 IQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 196
           +++F  KV+++R TER+GLIR R RG +E+ G+V+VFLD+H E    W  PL+ PI  + 
Sbjct: 179 MEQFQ-KVKIVRATERQGLIRARLRGYREAVGDVLVFLDSHIECAEGWFEPLIDPIARNW 237

Query: 197 KIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE-PYK 255
             +  PVID ID +T+++           G F+W +++  + +PE E K+R+     P +
Sbjct: 238 STVMTPVIDVIDKETFQY-GFQAASATNVGGFDWSLMFTWHFVPETEQKRRQNKHYLPVR 296

Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
           SPT AGGLFA+ R +F  +G YD G+ +WGGEN ELSF+IWMCGG++   PCS +GHV+R
Sbjct: 297 SPTMAGGLFAISRKYFEHIGTYDEGMDIWGGENLELSFRIWMCGGTLLTAPCSHVGHVFR 356

Query: 316 SFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
              PY+FG   + VK  L+     R+ E W D+    Y+Y +       + GD+S +
Sbjct: 357 HTPPYSFGPKKNVVKNNLV-----RMAEVWLDD--FKYYYYQHINYTLGNYGDVSAR 406


>gi|300794826|ref|NP_001179661.1| polypeptide N-acetylgalactosaminyltransferase 14 [Bos taurus]
 gi|296482443|tpg|DAA24558.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 (GalNAc-T14) [Bos
           taurus]
          Length = 552

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 140/335 (41%), Positives = 196/335 (58%), Gaps = 17/335 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  I+ +R +PD R+  C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERIASNRVVPDTRLFRCTLLVYCADLPPTSIIIAFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ SI+ RTP   ++EIILVDDFS+  +  ++L         KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSILNRTPMNLIQEIILVDDFSNDPEDCKQLIKL-----PKVKCLRNNERQGLVRSR 183

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I   T+ +    E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFNY---IE 240

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +  +R   +EP ++P  AGGLF MD+++F  LG YD 
Sbjct: 241 SASELRGGFDWSLHFQWEQLTPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFYYLGKYDM 300

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYIFP------DGNANTYIKN 354

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            KR  E W DE +K Y+Y   P A+    G+I  +
Sbjct: 355 TKRTAEVWMDE-YKQYYYASRPFALERPFGNIESR 388


>gi|297265736|ref|XP_002799240.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Macaca
           mulatta]
          Length = 517

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 141/332 (42%), Positives = 194/332 (58%), Gaps = 17/332 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R IPD R   C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 34  VGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 93

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   + EIILVDDFS+  D  ++L         KV+ +RN ER+GL+R+R
Sbjct: 94  RTIRSVLNRTPMHLIREIILVDDFSNDPDDCKQLIRL-----PKVKCLRNNERQGLVRSR 148

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 149 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 205

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +  +R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 206 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 265

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N
Sbjct: 266 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 319

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
            KR  E W DE +K Y+Y   P A+    G++
Sbjct: 320 TKRTAEVWMDE-YKQYYYAARPFALERPFGNV 350


>gi|241682071|ref|XP_002411622.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
 gi|215504373|gb|EEC13867.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
          Length = 473

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 144/351 (41%), Positives = 205/351 (58%), Gaps = 15/351 (4%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
           G  G+  +L  A +   DA   + G N+  S+ I  +R++ DLR   C+   +P DLP  
Sbjct: 106 GSRGQGVYLGGAEKKEADAQFSKAGFNVYVSDRIPLNRSLADLRPLPCQALRFPKDLPSV 165

Query: 85  SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF--NG 142
           SV++ F+NE  S+L+RTV+S++ R+P + L E+ILVDDFS   ++  +L  +++R    G
Sbjct: 166 SVVITFYNEILSALLRTVYSVVNRSPRRILREVILVDDFSDLPEVKGQLYRFLKRHFRPG 225

Query: 143 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 202
            V+L+R   REGLIR R  GAKE+ G V+VFLD+HCE    WL PL+  +  D   +  P
Sbjct: 226 FVKLLRLPRREGLIRARLVGAKEAAGHVLVFLDSHCEATRQWLEPLVTAVNDDPTTVASP 285

Query: 203 VIDGIDYQTWEFRSV-YEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
           +I  ID  T+    + + P     G FEW   +     P     +    + P +SPT AG
Sbjct: 286 IITIIDGNTFAHEDMGFLP----LGSFEWNGDFTWIHPPP--GWRSPDQTAPVRSPTIAG 339

Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
           GLFA+DR +F ++GGYDPG+  WGGEN ELSF+IWMCGG +  VPCS++GHV+R+  PY 
Sbjct: 340 GLFAVDRTYFFQMGGYDPGMNGWGGENLELSFRIWMCGGRLVVVPCSQVGHVFRTDRPYT 399

Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                D         N KR  E W DE +K  FY  +P+   +D GD+SE+
Sbjct: 400 IPNETDS-----HARNTKRAAEVWMDE-YKEIFYKEKPVMQTIDAGDVSER 444


>gi|354468358|ref|XP_003496633.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
           [Cricetulus griseus]
          Length = 541

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 141/333 (42%), Positives = 194/333 (58%), Gaps = 19/333 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R + C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 58  VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 117

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
           RT+ S++ RTP   ++EIILVDDFS+        ED  Q     KV+ +RN ER+GL+R+
Sbjct: 118 RTIRSVLNRTPTHLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNNERQGLVRS 171

Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
           R RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    
Sbjct: 172 RMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---I 228

Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
           E     RG F+W + ++  +L   +   R   +EP ++P  AGGLF +D+A+F  LG YD
Sbjct: 229 ESASELRGGFDWSLHFQWEQLSPEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYD 288

Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
             + +WGGENFE+SF++WMCGGS+E +PCSR+GHV+R   PY F        G   TY  
Sbjct: 289 VDMDIWGGENFEISFRVWMCGGSLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIK 342

Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           N KR  E W DE +K Y+Y   P A+    G+I
Sbjct: 343 NTKRTAEVWMDE-YKQYYYAARPFALERPFGNI 374


>gi|355689622|gb|AER98894.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 [Mustela putorius
           furo]
          Length = 351

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 141/270 (52%), Positives = 188/270 (69%), Gaps = 7/270 (2%)

Query: 2   PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 58
           PV +  G LGN EP   EP+    GPGE  K   L   ++ A  AS+ E+G NM  S+ I
Sbjct: 83  PVLRP-GILGNFEPKEPEPHGVVGGPGENAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 141

Query: 59  SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 118
           S DR++ DLR EECKYW Y  +L  +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201

Query: 119 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 177
           L+DDFS+K  L  KL+DYI+ +NG V++ RN  REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKGKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 235
           CEV LNW  PL+API  DR I TVP+ID I+  T+E   +   + D + RG ++W ML+K
Sbjct: 262 CEVALNWYAPLVAPISKDRTICTVPIIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 321

Query: 236 ENELPEREAKKRKYNSEPYKSPTHAGGLFA 265
              L  RE K RK  +EPY+SP  AGGLF+
Sbjct: 322 RVPLTPREKKMRKTKTEPYRSPAMAGGLFS 351


>gi|321477075|gb|EFX88034.1| hypothetical protein DAPPUDRAFT_305669 [Daphnia pulex]
          Length = 553

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 139/349 (39%), Positives = 210/349 (60%), Gaps = 19/349 (5%)

Query: 30  AYHLPEAYRAAGDASLGEYG-----MNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
           AY   +AY + G    GE        N E S+ +  +R IPD R ++C   ++  DLP  
Sbjct: 58  AYFNEKAYISKGKLKPGEDAYHNNKFNQEASDTLESNRAIPDYRHKKCLDLEFSKDLPST 117

Query: 85  SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 144
           SVI+ FHNE  S+L+RT+ S++ R+P+  ++EIILVDDFS+ A   ++L         KV
Sbjct: 118 SVIITFHNEARSTLLRTIVSVLNRSPSHLIKEIILVDDFSNDASDGRELVQI-----EKV 172

Query: 145 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 204
            L+RN++REGL+R+R +GA+ + GE + FLD+HCE    WL PLLA +  DR  +  PVI
Sbjct: 173 ILVRNSKREGLVRSRVKGAEIATGEFLTFLDSHCECNEGWLEPLLARVVEDRTRIVCPVI 232

Query: 205 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGL 263
           D I   ++++ +        RG F+W +++K   LP  E   RK + + P ++P  AGGL
Sbjct: 233 DVIAMDSFQYIAA---STELRGGFDWNLVFKWELLPAEEKANRKTDPTIPIRTPMIAGGL 289

Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
           F +DR +F +LG YD  + +WGGEN E+SF+ W CGG +E VPCSR+GHV+R   PY+F 
Sbjct: 290 FVIDRQYFQKLGSYDLQMDIWGGENLEISFRTWQCGGRLEIVPCSRVGHVFRKQHPYSFP 349

Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
             +    G +   N +R  E W D+ +K Y++   P+A  +  G+I+++
Sbjct: 350 GGS----GTIFARNTRRAAEVWMDD-YKKYYFAAVPMARTVTFGNITDR 393


>gi|109102562|ref|XP_001105195.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
           5 [Macaca mulatta]
          Length = 552

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 141/335 (42%), Positives = 195/335 (58%), Gaps = 17/335 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R IPD R   C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   + EIILVDDFS+  D  ++L         KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIREIILVDDFSNDPDDCKQLIRL-----PKVKCLRNNERQGLVRSR 183

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +  +R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            KR  E W DE +K Y+Y   P A+    G++  +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNVESR 388


>gi|124487253|ref|NP_001074890.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Mus musculus]
 gi|341940755|sp|Q9JJ61.2|GLTL1_MOUSE RecName: Full=Putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1;
           AltName: Full=Polypeptide GalNAc transferase-like
           protein 1; Short=GalNAc-T-like protein 1;
           Short=pp-GaNTase-like protein 1; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase-like
           protein 1; AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
 gi|52851357|dbj|BAD52071.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase [Mus musculus]
 gi|74218446|dbj|BAE23810.1| unnamed protein product [Mus musculus]
 gi|115527273|gb|AAI10635.1| Galntl1 protein [Mus musculus]
 gi|115528977|gb|AAI25016.1| Galntl1 protein [Mus musculus]
          Length = 558

 Score =  267 bits (683), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 144/345 (41%), Positives = 202/345 (58%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY +A     GE     +  N   S+ +S DR I D R   C    Y  DLP  SVI+ 
Sbjct: 71  KAYLSAKQLKPGEDPYRQHAFNQLESDKLSSDRPIRDTRHYSCPSLSYSSDLPATSVIIT 130

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
            +REGLIR+R RGA  +   V+ FLD+HCEV + WL P+L  +  D   +  P+ID I  
Sbjct: 186 DKREGLIRSRVRGADVAGATVLTFLDSHCEVNVEWLQPMLQRVMEDHTRVVSPIIDVISL 245

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKS 302

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400


>gi|195425502|ref|XP_002061040.1| GK10658 [Drosophila willistoni]
 gi|194157125|gb|EDW72026.1| GK10658 [Drosophila willistoni]
          Length = 489

 Score =  267 bits (683), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 145/358 (40%), Positives = 210/358 (58%), Gaps = 14/358 (3%)

Query: 5   KADGKLGNLEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISF 60
           K D     L PP E  +E PGE GK   LP    +A + A +    +   N   S+ IS 
Sbjct: 92  KEDAAQKVLLPPSE-LEETPGEMGKPVELPTNMSDAMKKAVEDGWTKNAFNQYASDLISV 150

Query: 61  DRTIPDLRMEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
           +R +PD R   CK    Y  DLPK  VI+ FHNE +S+L+RTVHS++ R+P   + ++IL
Sbjct: 151 NRKLPDPRSAWCKDTARYLTDLPKTDVIICFHNEAWSTLLRTVHSVLARSPEHLIGKVIL 210

Query: 120 VDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
           VDD+S    L  +L++Y   +  KV+L+R  +REGL+R R  G + +   V+ FLD+HCE
Sbjct: 211 VDDYSDMPHLKIQLKEYFSLY-PKVQLVRVAKREGLVRARLFGMEYADSPVVTFLDSHCE 269

Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 239
               WL PLL  I  +R  +  P ID ID +T+++   Y+  +   G+F+W + +    +
Sbjct: 270 CTEGWLEPLLDRIARNRNTVASPTIDMIDPKTFQYN--YDGANDVLGVFDWNLEFYWIPI 327

Query: 240 PEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCG 299
           P RE K+R + +EP ++PT AGGLFA+D  FF  +G YDPG  +WGG+N ELSFK WMCG
Sbjct: 328 PLRELKRRNHFAEPIQTPTIAGGLFAIDLEFFRSVGTYDPGFNIWGGDNLELSFKTWMCG 387

Query: 300 GSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           G +E +PCS +GH++R   PY +       +  ++  N  R+ E W D+  K Y+Y R
Sbjct: 388 GILEIIPCSHVGHIFRDDSPYEWPS----SRAMMVESNLARLAEVWLDDYAK-YYYER 440


>gi|242020557|ref|XP_002430719.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
 gi|212515909|gb|EEB17981.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
          Length = 511

 Score =  267 bits (683), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 139/307 (45%), Positives = 195/307 (63%), Gaps = 11/307 (3%)

Query: 51  NMETSNHISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
           N+  S+ I  +RT+PD+R + C  KY + P  LP  SV++VFHNE +S+L+RTV S+I R
Sbjct: 35  NLLASDRIPLNRTLPDVRKKRCLTKYQNLPELLP-TSVVIVFHNEAWSTLLRTVQSVIDR 93

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
           +P + L EIILVDD S++  L + L++Y+ R    V++IR  EREGLIR R  GAKE++G
Sbjct: 94  SPRELLTEIILVDDGSTRKFLKEDLDEYVARLPVPVKVIRTKEREGLIRARMIGAKEAKG 153

Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
           +V+ FLDAHCE    WL PLL  +  DRK +  PVID I+  T+ +   +E   H+ G F
Sbjct: 154 QVLTFLDAHCECTKGWLEPLLVRVSEDRKKVVCPVIDIINDDTFAYVRSFE--LHW-GAF 210

Query: 229 EWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
            W + ++   L   E KKRK + +EP+ +P  AGGLFA+ R +F E+G YD  + +WGGE
Sbjct: 211 NWNLHFRWYTLGTTEIKKRKNDVTEPFPTPAMAGGLFAIRRDYFYEIGAYDEQMKIWGGE 270

Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
           N E+SF+ W CGGS+E VPCS +GH++R   PY F        G ++  N  RV   W D
Sbjct: 271 NLEMSFRGWQCGGSVEIVPCSHVGHLFRKSSPYTFPGGV----GEILHANLARVALVWMD 326

Query: 348 EKHKAYF 354
           E  + +F
Sbjct: 327 EWQEFFF 333


>gi|195028169|ref|XP_001986949.1| GH20244 [Drosophila grimshawi]
 gi|193902949|gb|EDW01816.1| GH20244 [Drosophila grimshawi]
          Length = 599

 Score =  267 bits (682), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 151/357 (42%), Positives = 204/357 (57%), Gaps = 14/357 (3%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC--KYWDYPLDLP 82
           G  G A HL  A +A GD    +  +N E S  +S++RT+ D R   C  + +D P  LP
Sbjct: 87  GNKGVATHLKGAAKARGDKIYKKIALNEELSEQLSYNRTVGDHRNPLCLNQRYDNPATLP 146

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFN 141
            ASVI++F+NE +S L+RTVHS +     Q L+EIILVDD S  A+L  KL+ Y++ RF 
Sbjct: 147 TASVIVIFYNEPYSVLLRTVHSTLNTCNEQALKEIILVDDGSDNAELGGKLDHYVKTRFP 206

Query: 142 -GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
            GKV ++R   R GLIR R  GA+ + G+V++FLDAHCE    W  PLL  I   R  + 
Sbjct: 207 IGKVTVLRLNNRLGLIRARLAGARIATGDVLIFLDAHCEANEGWCEPLLQRIKDSRTSVL 266

Query: 201 VPVIDGID-----YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYK 255
           VP+ID ID     Y T  ++S       + G F+W  L +  +L +     +     P  
Sbjct: 267 VPIIDVIDSVDFQYSTNGYKSFQVGGFQWNGHFDWVNLPEREKLRQSRECNQPREICPAY 326

Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
           SPT AGGLFAMDR +F E+G YD  +  WGGEN E+SF+IW CGG+IE +PCSR+GH++R
Sbjct: 327 SPTMAGGLFAMDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETIPCSRVGHIFR 386

Query: 316 SFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            F PY F    D   G     N  R+   W DE    +F  R  L    D+GD++ +
Sbjct: 387 DFHPYKFPNDRD-THG----INTARMALVWMDEYINVFFLNRPDLKFHPDIGDVTHR 438


>gi|417402722|gb|JAA48197.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
           rotundus]
          Length = 557

 Score =  267 bits (682), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 144/345 (41%), Positives = 199/345 (57%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY AA     GE     +  N   S+ +S DR   D R   C    Y  DLP  SVI+ 
Sbjct: 71  KAYLAAKQLKAGEDPYRQHAFNQLESDKLSSDRPTRDTRHYSCPSLSYSADLPATSVIIT 130

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
             REGLIR+R RGA  +   V+ FLD+HCEV   WL P+L  +  D   +  P+ID I  
Sbjct: 186 DRREGLIRSRVRGADVASAAVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 245

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 302

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400


>gi|296215364|ref|XP_002754093.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
           [Callithrix jacchus]
          Length = 558

 Score =  267 bits (682), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 145/346 (41%), Positives = 200/346 (57%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y  DLP  SVI+
Sbjct: 71  KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA  +   V+ FLD+HCEV   WL P+L  +  D   +  P+ID I 
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVIS 244

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D+
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDK 301

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF      
Sbjct: 302 SWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP----- 356

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 -EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVASR 400


>gi|195384663|ref|XP_002051034.1| GJ22477 [Drosophila virilis]
 gi|194145831|gb|EDW62227.1| GJ22477 [Drosophila virilis]
          Length = 598

 Score =  267 bits (682), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 152/367 (41%), Positives = 211/367 (57%), Gaps = 16/367 (4%)

Query: 17  LEPYKEGP--GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC-- 72
           L+  K+ P  G  G A HL  A +A GD    +  +N E S  +S++RT+ D R   C  
Sbjct: 76  LDLKKQDPSLGNKGAAVHLHGAAKARGDKIYKKIALNEELSEQLSYNRTVGDHRNPLCLA 135

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
           + +D P  LP ASVI++F+NE +S L+RTVHS +     + L+E+ILVDD S  A+L  K
Sbjct: 136 QKYDDPGTLPTASVIIIFYNEPYSVLVRTVHSTLNTCNQKALKEVILVDDGSDNAELGGK 195

Query: 133 LEDYIQ-RF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
           L+ Y + RF +GKV ++R   R GLIR R  GA+ + G+V++FLDAHCE  + W  PLL 
Sbjct: 196 LDHYTRTRFPSGKVTILRLKNRLGLIRARLAGARIASGDVLIFLDAHCEANVGWCEPLLQ 255

Query: 191 PIYSDRKIMTVPVIDGID-----YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
            I   R  + VP+ID ID     Y T  ++S       + G F+W  L +  +L +    
Sbjct: 256 RIKDSRTSVLVPIIDVIDANDFQYSTNGYKSFQVGGFQWNGHFDWVNLSEREKLRQSREC 315

Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
            +     P  SPT AGGLFAMDR +F E+G YD  +  WGGEN E+SF+IW CGG+IE +
Sbjct: 316 SQPREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETI 375

Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLD 365
           PCSR+GH++R F PY F    DR    +   N  R+   W DE    +F  R  L    D
Sbjct: 376 PCSRVGHIFRDFHPYKFPN--DRDTHGI---NTARMALVWMDEYINVFFLNRPDLKFHAD 430

Query: 366 MGDISEQ 372
           +GD++ +
Sbjct: 431 IGDVTHR 437


>gi|402594510|gb|EJW88436.1| hypothetical protein WUBG_00649 [Wuchereria bancrofti]
          Length = 612

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 142/352 (40%), Positives = 208/352 (59%), Gaps = 20/352 (5%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLD 80
           G GE G+   L +      + +      N+  S+ I+ +R++PD+R  +C+   Y    +
Sbjct: 44  GAGEDGRPVRLSKEDERLSEDTFVINQFNLVVSDRIALNRSLPDIRKHQCRTKTYLPSSE 103

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  SVI+V+HNE FS+LMRTV S+I R+P + L+EIILVDDFS++  L  +LE  + + 
Sbjct: 104 LPTTSVIIVYHNEAFSTLMRTVMSVILRSPRENLKEIILVDDFSTRTFLKVELEKLVAQL 163

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             ++++IR  ER GLIR R  GA E+ G+V+ FLD+HCE    W+ PLLA I  +RK + 
Sbjct: 164 GTRIKIIRANERVGLIRARLMGANEAEGDVLTFLDSHCECTKGWMEPLLARIKENRKAVV 223

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
            PVID I+ +T+ ++   E    +RG F W + ++   LP    K R  + ++P  SPT 
Sbjct: 224 CPVIDIINERTFAYQKGIEL---FRGGFNWNLQFRWYALPPEMIKSRSDDPTKPIISPTM 280

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELS----------FKIWMCGGSIEWVPCSR 309
           AGGLF++DR +F E+G YD  + +WGGEN E+S          F +W CGG +E +PCS 
Sbjct: 281 AGGLFSIDRKYFEEIGTYDHEMDIWGGENIEISLRLKLLKKNCFLVWQCGGRVEILPCSH 340

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +GHV+R   P++F     R  G ++  N  RV E W DE  K +FY   P A
Sbjct: 341 VGHVFRRTSPHDF---PGRKSGTILNSNLLRVAEVWMDE-WKFHFYRTAPQA 388


>gi|432097046|gb|ELK27544.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
           5, partial [Myotis davidii]
          Length = 363

 Score =  266 bits (681), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 139/325 (42%), Positives = 199/325 (61%), Gaps = 11/325 (3%)

Query: 48  YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
           YG+N   S  +   R +PD R + C    YP  LP AS+++ FHNE F++L RTV S++ 
Sbjct: 15  YGLNTIISKSLGNQRPVPDTRDKMCLKKRYPTRLPSASIVICFHNEEFNTLFRTVSSVMN 74

Query: 108 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 167
            TP Q LEEIILVDD S   DL +KL+ +++ F GK+++IR T+REGLIR R  GA  + 
Sbjct: 75  LTPHQILEEIILVDDMSEFDDLKEKLDYHLEMFRGKIKVIRTTKREGLIRARLIGAAHAS 134

Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
           G+V+VFLD+HCEV   WL PLLA I  DRK++  P++D ID+ T      Y P    RG 
Sbjct: 135 GDVLVFLDSHCEVNRVWLEPLLAAIAKDRKMVVCPMVDSIDHLTLN----YYPAPIVRGA 190

Query: 228 FEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
           F+W + +  + +   E    +  + P +SP  +GG+FA++R +F ELG YD  + +WG E
Sbjct: 191 FDWHLRFVWDTVFSYEMDGPEGPTTPIRSPAMSGGIFAINRHYFNELGQYDKDMNLWGAE 250

Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
           N ELS +IWMCGG +  +PCSR+GHV R  +  N  ++   ++     YN  R++  W D
Sbjct: 251 NLELSLRIWMCGGQLFILPCSRVGHVDRHIVQ-NVTQVLRALR-----YNNLRLVHVWLD 304

Query: 348 EKHKAYFYTREPLAMFLDMGDISEQ 372
           E +K  F+ R P    +  G+ISE+
Sbjct: 305 E-YKEQFFLRRPDLKSIPYGNISER 328


>gi|281349386|gb|EFB24970.1| hypothetical protein PANDA_005243 [Ailuropoda melanoleuca]
          Length = 553

 Score =  266 bits (681), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 145/345 (42%), Positives = 200/345 (57%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY AA     GE     +  N   S+ +S DR I D R   C    Y  DLP  SVI+ 
Sbjct: 67  KAYLAAKQLKPGEDPYRQHAFNQLESDKLSPDRAIRDTRHYSCPSVSYSSDLPATSVIIT 126

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 127 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 181

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
             REGLIR+R RGA  +   V+ FLD+HCEV   WL P+L  +  D   +  P+ID I  
Sbjct: 182 DRREGLIRSRVRGADMATAAVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 241

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D++
Sbjct: 242 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 298

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 299 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 352

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 353 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 396


>gi|403296667|ref|XP_003939220.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           1 [Saimiri boliviensis boliviensis]
 gi|403296669|ref|XP_003939221.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           2 [Saimiri boliviensis boliviensis]
          Length = 622

 Score =  266 bits (681), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 152/370 (41%), Positives = 216/370 (58%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P   GPG  GKA+   +         +    ++  N   S+ IS  R++ PD R  
Sbjct: 106 PPQDP--NGPGADGKAFQKRKWTPLETQEKEEGFKKHCFNAFASDRISLQRSLGPDTRPP 163

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  L   SVI+VFHNE +S+L+RTV+S++  TPA  L+EIILVDD S++ 
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L +KLE Y+++    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D+ ++  P I  ID  T+EF + V     H RG F+W + +    LP  E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
           CS +GHV+R+  P+ F K  +     +I  N  R+ E W D  +K  FY R      +A 
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTN-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMAQ 455

Query: 363 FLDMGDISEQ 372
               GDISE+
Sbjct: 456 EKSFGDISER 465


>gi|301763305|ref|XP_002917071.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1-like
           [Ailuropoda melanoleuca]
          Length = 555

 Score =  266 bits (681), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 145/345 (42%), Positives = 200/345 (57%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY AA     GE     +  N   S+ +S DR I D R   C    Y  DLP  SVI+ 
Sbjct: 69  KAYLAAKQLKPGEDPYRQHAFNQLESDKLSPDRAIRDTRHYSCPSVSYSSDLPATSVIIT 128

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 129 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 183

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
             REGLIR+R RGA  +   V+ FLD+HCEV   WL P+L  +  D   +  P+ID I  
Sbjct: 184 DRREGLIRSRVRGADMATAAVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 243

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D++
Sbjct: 244 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 300

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 301 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 354

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 355 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 398


>gi|427794265|gb|JAA62584.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Rhipicephalus pulchellus]
          Length = 591

 Score =  266 bits (681), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 142/313 (45%), Positives = 190/313 (60%), Gaps = 13/313 (4%)

Query: 47  EYGMNMETSNHISFDRTIPDLRMEECKYWDYP-LDLPKASVILVFHNEGFSSLMRTVHSI 105
           ++  N+  SN +   R++PD R   C+  ++    LP ASV++ F+NE +S+L+RTVHSI
Sbjct: 90  QHAFNVLISNRLGKVRSLPDTRNPLCRQQEFQEQSLPTASVVVCFYNEAWSALVRTVHSI 149

Query: 106 IKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAK 164
           ++RTPA  L E+ILVDD S+  +L  +L  Y+       VRLIR   REGLIR R  GA 
Sbjct: 150 LERTPAALLHELILVDDNSTLPELGLQLSRYVASELPSHVRLIRTPAREGLIRARMYGAH 209

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            + G+V+VFLD+HCEV + WL P+LA I ++R  +T PVID I+  T+E    Y      
Sbjct: 210 NASGQVLVFLDSHCEVNVGWLEPMLARIGANRTTVTCPVIDIINADTFE----YSASPIV 265

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RG F WG+ +K    P     ++    +P  SPT AGGLFAMDR +F ELG YD G+ +W
Sbjct: 266 RGGFNWGLHFKWESPPRLRGPQQAI--DPIPSPTMAGGLFAMDRQYFHELGEYDDGMDIW 323

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
           GGEN E+SF+IWMCGG +E +PCSR+GHV+R   PY      D      +T N  RV   
Sbjct: 324 GGENLEISFRIWMCGGRLEILPCSRVGHVFRRRRPYGSPSGED-----TLTKNSLRVAHV 378

Query: 345 WFDEKHKAYFYTR 357
           W DE    Y  TR
Sbjct: 379 WMDEYKTYYLQTR 391


>gi|291391583|ref|XP_002712189.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
           [Oryctolagus cuniculus]
          Length = 941

 Score =  266 bits (681), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 148/360 (41%), Positives = 214/360 (59%), Gaps = 14/360 (3%)

Query: 16  PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
           P +P  + PG+ G    +P            E   N+  S+ I  DR I D R   C   
Sbjct: 433 PRDP--QAPGQFGLPVVVPHGKEKEAKRRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQ 490

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
               +LP  SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LVDD S+K  L   L+ 
Sbjct: 491 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDCSTKDYLKDNLDK 550

Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           Y+ +F  KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL  +Y  
Sbjct: 551 YMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 609

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPY 254
           RK +  PVI+ I+ +   + +V   D+  RGIF W M +    +P +  AK +   ++  
Sbjct: 610 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFLWPMNFGWKTIPPDVVAKNKIKETDII 666

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           + P  AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR+GH++
Sbjct: 667 RCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 726

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMGDISEQ 372
           R+  PY+F K  DR+K   +  N  RV E W DE +K  FY      +   LD+G++++Q
Sbjct: 727 RNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHLIEQGLDVGNLTQQ 781


>gi|348573294|ref|XP_003472426.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1 [Cavia
           porcellus]
          Length = 556

 Score =  266 bits (681), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 144/345 (41%), Positives = 200/345 (57%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY +A     GE     +  N   S+ +S DR I D R   C    Y  DLP  SVI+ 
Sbjct: 71  KAYLSAKQLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSLSYSSDLPATSVIIT 130

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
             REGLIR+R RGA  +   ++ FLD+HCEV + WL P+L  +  D   +  P+ID I  
Sbjct: 186 DRREGLIRSRVRGADVAAAAILTFLDSHCEVNVEWLQPMLQRVKEDHTRVVSPIIDVISL 245

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D+A
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGGIFVIDKA 302

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400


>gi|390347277|ref|XP_780324.3| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 1-like
           [Strongylocentrotus purpuratus]
          Length = 580

 Score =  266 bits (681), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 145/356 (40%), Positives = 214/356 (60%), Gaps = 13/356 (3%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYP--L 79
            GPGE GKA  +P+   +  +        N+  S+ IS +RT+PD+RM+ CK   YP   
Sbjct: 72  NGPGEMGKAVIIPQDKESLKNEMFRINQFNLLASDMISINRTLPDVRMDGCKRKSYPPVS 131

Query: 80  DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
           +LP  S+++VFHNE +S+L+R++HSII R+P + L EIILVDD S +  L Q+L+DY++R
Sbjct: 132 ELPSTSIVIVFHNEAWSTLLRSIHSIINRSPRELLTEIILVDDASERDFLGQQLDDYVKR 191

Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP--LLAPIYSDRK 197
               V + R   R GLIR R RGA   +G V+ FL +H +   + L P  L A    DR+
Sbjct: 192 LQVPVHVERMGTRSGLIRARLRGAGLVKGHVLGFLXSHDQCSASSLRPVYLEASRRHDRR 251

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKS 256
            +  P+ID I    + F +    D  Y G F W + ++   +P+REA +R  + + P +S
Sbjct: 252 NVVCPIIDVISDDNFAFHT--GSDMTYGG-FNWKLQFRWYPVPQREADRRGGDRTIPLRS 308

Query: 257 PTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRS 316
           PT AGGLF++D+ +F E+G YD G+ VWGGEN E+SF+IWMCGG++E V CS +GHV+R 
Sbjct: 309 PTMAGGLFSIDKTYFEEIGTYDAGMDVWGGENLEISFRIWMCGGTLEIVTCSHVGHVFRK 368

Query: 317 FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
             PY F     R+    I  N +R+ E W D+  + ++Y   P     + GD+S++
Sbjct: 369 STPYTFPGGTGRI----INRNNQRLAEVWMDD-FRHFYYRISPGVRKTEFGDVSQR 419


>gi|449274705|gb|EMC83783.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Columba livia]
          Length = 502

 Score =  266 bits (681), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 141/346 (40%), Positives = 202/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y  DLP  S+I+
Sbjct: 18  KAY-LSSKQLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRCTSVRYDTDLPATSLII 76

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTP   ++EIILVDDFSS  +  Q L         KV+ +R
Sbjct: 77  TFHNEARSTLLRTVKSVLNRTPPSLIQEIILVDDFSSDPEDCQLLTKI-----PKVKCLR 131

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           NT REGLIR+R RGA+ +  +++ FLD+HCEV   WL P+L  +  D   +  P+ID I 
Sbjct: 132 NTRREGLIRSRVRGAEVATADILTFLDSHCEVNSEWLQPMLQRVKEDYTRVVSPIIDVIS 191

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   ++  ++P  AGG+F +D+
Sbjct: 192 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVIDK 248

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PY+F      
Sbjct: 249 SWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYDFP----- 303

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++E+
Sbjct: 304 -EGNALTYIKNTKRTAEVWMDE-YKQYYYEARPSAIGKSFGSVAER 347


>gi|297692565|ref|XP_002823614.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pongo
           abelii]
          Length = 578

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 156/371 (42%), Positives = 212/371 (57%), Gaps = 26/371 (7%)

Query: 12  NLEPPLEPYKEGP------GEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRT 63
           +L  PL  YK+ P      GE GKA    L E      +  +  Y +N+  S+ IS  R 
Sbjct: 58  DLSQPL--YKKPPADSHALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRISLHRH 115

Query: 64  IPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
           I D RM ECK   +    LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EIILVDD
Sbjct: 116 IEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDD 175

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S +  L  +LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FLD HCE   
Sbjct: 176 LSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNS 234

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPE 241
            WL PLL  I  D   +  PVID ID+ T+EF     EP     G F+W + ++ + +P+
Sbjct: 235 GWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPK 291

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
           ++  ++    +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG 
Sbjct: 292 QKRDRQISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGK 351

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLA 361
           +E  PCS +GHV+    PY           P    N  R  E W DE +K +FY R P A
Sbjct: 352 LEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYNRNPPA 401

Query: 362 MFLDMGDISEQ 372
                GDISE+
Sbjct: 402 RKEAYGDISER 412


>gi|345484986|ref|XP_003425168.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 2 [Nasonia vitripennis]
          Length = 610

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 152/360 (42%), Positives = 207/360 (57%), Gaps = 29/360 (8%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
           G L  P +     PGE G+   LP     E  +   D  +     N   S+ IS  R++P
Sbjct: 95  GVLVAPRDQDTSAPGEMGRPVILPANLTTEIKKLVDDGWINN-AFNQYASDLISVHRSLP 153

Query: 66  DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
           D R   CK    Y  DLP  +VI+ FHNE +S L+RTVHS++ R+P   ++EIILVDD+S
Sbjct: 154 DPRDPWCKEPGRYQKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPDHLIQEIILVDDYS 213

Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
               L ++LEDY+  +  KV+++R ++REGLIR R  GA  ++  V+ +LD+HCE    W
Sbjct: 214 DMPHLKRQLEDYMMNYP-KVKILRASKREGLIRARLLGAAMAKAPVLTYLDSHCECTEGW 272

Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 237
           L PLL  I  ++  +  PVID ID  T E+        H+R       G F+W + +  +
Sbjct: 273 LEPLLDRIARNQTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 324

Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
            +PERE K+ K  +EP  SPT AGGLFA+DR FF  LG YD G  +WGGEN ELSFK WM
Sbjct: 325 AVPEREKKRHKNPAEPVWSPTMAGGLFAIDRLFFERLGTYDSGFDIWGGENLELSFKTWM 384

Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           CGG++E VPCS +GH++R   PY +     R    ++  N  R+ E W DE  K Y+Y R
Sbjct: 385 CGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKRNSIRLSEVWLDEYAK-YYYQR 438


>gi|391342179|ref|XP_003745400.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Metaseiulus occidentalis]
          Length = 610

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 143/336 (42%), Positives = 204/336 (60%), Gaps = 10/336 (2%)

Query: 22  EGPGEGGK-AYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL 79
           EG G  G+  Y LP E  R+    S+  +  N+  S+ IS DRT+ D R   C+   Y  
Sbjct: 97  EGAGNMGQPVYPLPSEVVRSKMLYSINRF--NLLVSDKISVDRTLADARKSVCRNISYAY 154

Query: 80  DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
           DLP  SVI+VFHNE +S+L+RTVHS+I R+P   ++EI+LVDD S +  L + L+ Y++ 
Sbjct: 155 DLPDTSVIIVFHNEAWSTLLRTVHSVINRSPRDLVKEIMLVDDASDREFLKRSLDAYVRS 214

Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
            N  +++IR+ +R GLIR R  GA+ + G+V+ FLDAHCE    WL PLL  I  DR  +
Sbjct: 215 LNFPIKVIRSPKRSGLIRARLMGARAAEGKVLTFLDAHCECTTGWLEPLLQRIKEDRTRV 274

Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPT 258
             P+ID I   T+ +   +E   H+ G   W M ++   +     K+R  + SEP+K+P 
Sbjct: 275 VCPIIDIIHDDTFAYVKSFE--LHW-GAINWEMHFRWYPVGPHVLKQRHGDPSEPFKTPV 331

Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
            AGGLF++D+ +F E+G YD  + +WGGEN E+SF+IW CGGS+E VPCS +GHV+R   
Sbjct: 332 MAGGLFSIDKEYFYEMGAYDEQMDIWGGENVEMSFRIWQCGGSLEIVPCSHVGHVFRRSS 391

Query: 319 PYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
           PY F     +  G ++  N  RV E W D+  + YF
Sbjct: 392 PYTFPH--PKGVGGILFSNLARVAEVWMDDWAEFYF 425


>gi|345484988|ref|XP_001605337.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 1 [Nasonia vitripennis]
          Length = 646

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 152/360 (42%), Positives = 207/360 (57%), Gaps = 29/360 (8%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
           G L  P +     PGE G+   LP     E  +   D  +     N   S+ IS  R++P
Sbjct: 94  GVLVAPRDQDTSAPGEMGRPVILPANLTTEIKKLVDDGWINN-AFNQYASDLISVHRSLP 152

Query: 66  DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
           D R   CK    Y  DLP  +VI+ FHNE +S L+RTVHS++ R+P   ++EIILVDD+S
Sbjct: 153 DPRDPWCKEPGRYQKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPDHLIQEIILVDDYS 212

Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
               L ++LEDY+  +  KV+++R ++REGLIR R  GA  ++  V+ +LD+HCE    W
Sbjct: 213 DMPHLKRQLEDYMMNYP-KVKILRASKREGLIRARLLGAAMAKAPVLTYLDSHCECTEGW 271

Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 237
           L PLL  I  ++  +  PVID ID  T E+        H+R       G F+W + +  +
Sbjct: 272 LEPLLDRIARNQTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 323

Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
            +PERE K+ K  +EP  SPT AGGLFA+DR FF  LG YD G  +WGGEN ELSFK WM
Sbjct: 324 AVPEREKKRHKNPAEPVWSPTMAGGLFAIDRLFFERLGTYDSGFDIWGGENLELSFKTWM 383

Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           CGG++E VPCS +GH++R   PY +     R    ++  N  R+ E W DE  K Y+Y R
Sbjct: 384 CGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKRNSIRLSEVWLDEYAK-YYYQR 437


>gi|51316066|sp|Q95JX4.2|GLTL5_MACFA RecName: Full=Putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5;
           AltName: Full=Polypeptide GalNAc transferase 15;
           Short=GalNAc-T15; Short=pp-GaNTase 15; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 15;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 15
 gi|15207881|dbj|BAB62965.1| hypothetical protein [Macaca fascicularis]
          Length = 443

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 138/331 (41%), Positives = 201/331 (60%), Gaps = 17/331 (5%)

Query: 45  LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
           L +YG N+  S  +  +R +PD R + C    YP  LP AS+++ FHNE F +L RTV S
Sbjct: 97  LLKYGFNVIISRSLGIEREVPDTRNKMCLQKHYPARLPTASIVICFHNEEFHALFRTVSS 156

Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           ++  TP  +LEEIILVDD S   DL +KL+ +++ F GK+++IRN +REGLIR R  GA 
Sbjct: 157 VMNLTPHYFLEEIILVDDMSEVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGAS 216

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            + G+V+V LD+HCEV   WL PLL  I  D K++  P+ID ID +T E    Y+P    
Sbjct: 217 HASGDVLVILDSHCEVNRVWLEPLLHAIAKDPKMVVRPLIDVIDDRTLE----YKPSPVV 272

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RG F+W + +K + +   E    +  ++P +SP  +GG+FA+ R +F E+G YD  +  W
Sbjct: 273 RGAFDWNLQFKWDNVFSYEMDGPEGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFW 332

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLIT---YNYKRV 341
           GGEN ELS +IWMCGG +  +PCSR+GH+          K   R    +I+   +NY R+
Sbjct: 333 GGENLELSLRIWMCGGQLFIIPCSRVGHI---------SKKQTRKTSAIISATIHNYLRL 383

Query: 342 IETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +  W DE +K  F+ R+P   ++  G+I E+
Sbjct: 384 VHVWLDE-YKEQFFLRKPGLKYVTYGNIHER 413


>gi|345328051|ref|XP_003431229.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
           2 [Ornithorhynchus anatinus]
          Length = 863

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 150/368 (40%), Positives = 217/368 (58%), Gaps = 13/368 (3%)

Query: 9   KLGNLEPPLEPYK-EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL 67
           K+  L+  L P   + PG+ G A  +P   +        E   N+  S+ I  DR I D 
Sbjct: 431 KVLTLDVTLSPRDPKAPGQFGHAAVVPAEKQERAKKRWKEGNFNVYLSDLIPVDRAIEDT 490

Query: 68  RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           R + C       DLP  ++I+ F +E +S+L+R++HS++ R+P   ++EIILVDDFS+K 
Sbjct: 491 RPDGCAEQLVHNDLPTTTIIMCFVDEVWSTLLRSIHSVLNRSPPHLIQEIILVDDFSTKE 550

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L   L+ Y+ +F  KVR++   ER GLIR R  GA+ + G+V+ FLD+H E  + WL P
Sbjct: 551 HLKDNLDKYMAQF-PKVRVLHLKERHGLIRARLAGAEIATGDVLTFLDSHVECNVGWLEP 609

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 247
           LL  +   RK +  PVI+ I  +   +++V   D+  RGIF W M +    +P    +K 
Sbjct: 610 LLERVRLHRKKVACPVIEVISDKDLSYQTV---DNFQRGIFTWPMNFGWKSIPPEVIEKN 666

Query: 248 KYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           K   ++  + P  AGGLF++D+ +F ELG YDPGL VWGGEN E+SFK+WMCGG IE VP
Sbjct: 667 KMKETDIIRCPVMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKVWMCGGEIEIVP 726

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFL 364
           CSR+GH++R+  PY+F K  DRVK   +  N  RV E W DE +K  FY      L    
Sbjct: 727 CSRVGHIFRNDNPYSFPK--DRVK--TVERNLVRVAEVWLDE-YKDLFYGHGLHLLERRS 781

Query: 365 DMGDISEQ 372
           D+G++++Q
Sbjct: 782 DIGNLTQQ 789


>gi|194756744|ref|XP_001960635.1| GF13455 [Drosophila ananassae]
 gi|190621933|gb|EDV37457.1| GF13455 [Drosophila ananassae]
          Length = 688

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 147/357 (41%), Positives = 205/357 (57%), Gaps = 28/357 (7%)

Query: 13  LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 68
           ++PP   ++E PGE GK   LP    +  + A D    +   N   S+ +S  R++PD R
Sbjct: 138 IDPPGN-FEENPGEMGKPVRLPKEMPDDMKKAVDDGWTKNAFNQYVSDLVSVHRSLPDPR 196

Query: 69  MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
              CK    Y  +LP   VI+ FHNE ++ L+RTVHS++ R+P   + +IILVDD+S   
Sbjct: 197 DAWCKDSTQYLTNLPTTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 256

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L ++LEDY   +  KV++IR  +REGLIR R  GA  ++  V+ +LD+HCE    WL P
Sbjct: 257 HLKKQLEDYFAAY-PKVQIIRGQKREGLIRARILGANHAKSAVLTYLDSHCECTEGWLEP 315

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 240
           LL  I  +   +  PVID I   T E+        HYR       G F+W + +  + +P
Sbjct: 316 LLDRIARNSTTVVCPVIDVISDDTLEY--------HYRDSSGVNVGGFDWNLQFSWHSVP 367

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
           ERE K+   ++EP  SPT AGGLFA+DR FF  LG YD G  +WGGEN ELSFK WMCGG
Sbjct: 368 ERERKRHNNSAEPVYSPTMAGGLFAIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGG 427

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           ++E VPCS +GH++R   PY +     R    ++  N  R+ E W D+ +  Y+Y R
Sbjct: 428 TLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLRKNSVRLAEVWMDD-YAQYYYHR 478


>gi|156375693|ref|XP_001630214.1| predicted protein [Nematostella vectensis]
 gi|156217230|gb|EDO38151.1| predicted protein [Nematostella vectensis]
          Length = 575

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 142/333 (42%), Positives = 202/333 (60%), Gaps = 14/333 (4%)

Query: 41  GDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMR 100
           GD +  +   N++ S+ +  DR +PD+R ++CK   +P DLP  ++I+ FHNEG S+L+R
Sbjct: 99  GDDAYAKNAYNIKKSDQLPVDREVPDVRDQQCKSQVWPHDLPTTTIIICFHNEGRSALLR 158

Query: 101 TVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRS 160
           TV S + R+P   L+EIILVDDFSS     ++L         KV+LIRNT+REGLIR+R 
Sbjct: 159 TVISALNRSPPHLLKEIILVDDFSSDPKDGRRLLKL-----PKVKLIRNTKREGLIRSRV 213

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
           +GA  +RGEV+ FLD+HCE   NWL PLL  I    K +  P+ID I+  T+++      
Sbjct: 214 KGANLARGEVLTFLDSHCECNKNWLEPLLLRIKESPKTIVSPIIDVINLDTFDYLG---S 270

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
               RG F W + +K + LP     +R+   + P KSP  AGGLF++ + +F  LG YD 
Sbjct: 271 SADLRGGFGWNLNFKWDFLPPHILAERQGKPTLPIKSPVIAGGLFSVAKKWFETLGKYDM 330

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYK 339
            + VWGGEN E+SF+ W CGG++E +PCSR+GHV+R+  PY F   +  V       N +
Sbjct: 331 QMDVWGGENLEISFRTWQCGGAMEIIPCSRVGHVFRNRHPYQFPGGSMNV----FQKNTR 386

Query: 340 RVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           R +E W D+ +K Y+Y   P A     GDI E+
Sbjct: 387 RAVEVWMDD-YKRYYYAAVPYAKNTPYGDIEER 418


>gi|149639580|ref|XP_001512277.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
           1 [Ornithorhynchus anatinus]
          Length = 949

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 150/368 (40%), Positives = 217/368 (58%), Gaps = 13/368 (3%)

Query: 9   KLGNLEPPLEPYK-EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL 67
           K+  L+  L P   + PG+ G A  +P   +        E   N+  S+ I  DR I D 
Sbjct: 431 KVLTLDVTLSPRDPKAPGQFGHAAVVPAEKQERAKKRWKEGNFNVYLSDLIPVDRAIEDT 490

Query: 68  RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           R + C       DLP  ++I+ F +E +S+L+R++HS++ R+P   ++EIILVDDFS+K 
Sbjct: 491 RPDGCAEQLVHNDLPTTTIIMCFVDEVWSTLLRSIHSVLNRSPPHLIQEIILVDDFSTKE 550

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L   L+ Y+ +F  KVR++   ER GLIR R  GA+ + G+V+ FLD+H E  + WL P
Sbjct: 551 HLKDNLDKYMAQF-PKVRVLHLKERHGLIRARLAGAEIATGDVLTFLDSHVECNVGWLEP 609

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 247
           LL  +   RK +  PVI+ I  +   +++V   D+  RGIF W M +    +P    +K 
Sbjct: 610 LLERVRLHRKKVACPVIEVISDKDLSYQTV---DNFQRGIFTWPMNFGWKSIPPEVIEKN 666

Query: 248 KYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           K   ++  + P  AGGLF++D+ +F ELG YDPGL VWGGEN E+SFK+WMCGG IE VP
Sbjct: 667 KMKETDIIRCPVMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKVWMCGGEIEIVP 726

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFL 364
           CSR+GH++R+  PY+F K  DRVK   +  N  RV E W DE +K  FY      L    
Sbjct: 727 CSRVGHIFRNDNPYSFPK--DRVK--TVERNLVRVAEVWLDE-YKDLFYGHGLHLLERRS 781

Query: 365 DMGDISEQ 372
           D+G++++Q
Sbjct: 782 DIGNLTQQ 789


>gi|410962531|ref|XP_003987822.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1,
           partial [Felis catus]
          Length = 553

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 145/345 (42%), Positives = 200/345 (57%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY AA     GE     +  N   S+ +S DR I D R   C    Y  DLP  SVI+ 
Sbjct: 67  KAYLAAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVAYSADLPATSVIIT 126

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 127 FHNEARSTLLRTVKSVLNRTPAGLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 181

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
             REGLIR+R RGA  +   V+ FLD+HCEV   WL P+L  +  D   +  P+ID I  
Sbjct: 182 DRREGLIRSRVRGADVATAAVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 241

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D++
Sbjct: 242 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 298

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 299 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 352

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 353 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 396


>gi|291167742|ref|NP_001094333.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Rattus norvegicus]
          Length = 558

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 144/345 (41%), Positives = 202/345 (58%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY +A     GE     +  N   S+ +S DR I D R   C    Y  DLP  SVI+ 
Sbjct: 71  KAYLSAKQLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSLSYSSDLPATSVIIT 130

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPAGLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
            +REGLIR+R RGA  +   V+ FLD+HCEV + WL P+L  +  D   +  P+ID I  
Sbjct: 186 DKREGLIRSRVRGADVAGASVLTFLDSHCEVNVEWLQPMLQRVMEDHTRVVSPIIDVISL 245

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTKPIRTPVIAGGIFVIDKS 302

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400


>gi|403307061|ref|XP_003944030.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
           [Saimiri boliviensis boliviensis]
          Length = 552

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 140/335 (41%), Positives = 194/335 (57%), Gaps = 17/335 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R   C    Y  +LP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTELPPTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   + EIILVDDFS+  D  Q+L         KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIREIILVDDFSNDPDDCQQLIKL-----PKVKCLRNNERQGLVRSR 183

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + +   +L   +  +R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 241 SASELRGGFDWSLHFHWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            KR  E W DE +K Y+Y   P A+    G++  +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERPFGNVESR 388


>gi|351702714|gb|EHB05633.1| Polypeptide N-acetylgalactosaminyltransferase 14 [Heterocephalus
           glaber]
          Length = 553

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 140/332 (42%), Positives = 194/332 (58%), Gaps = 17/332 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS  R +PD R   C    Y   LP  S+I+ FHNE  S+L+
Sbjct: 70  VGDDPYKLYAFNQRESERISSHRAVPDTRHPRCMLLVYHTALPPTSIIITFHNEARSTLL 129

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   ++EIILVDDFS+  D  ++L         KV+ +RN+ER+GL+R+R
Sbjct: 130 RTIRSVLNRTPMHLIQEIILVDDFSNDPDDCKQLVRL-----PKVKCLRNSERQGLVRSR 184

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 185 MRGADIAQGATLTFLDSHCEVNRDWLEPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 241

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +  +R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 242 SASELRGGFDWSLHFRWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 301

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N
Sbjct: 302 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 355

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
            KR  E W DE +K Y+Y   P A+    G+I
Sbjct: 356 TKRTAEVWMDE-YKQYYYAARPFALERPFGNI 386


>gi|291397402|ref|XP_002715124.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
           [Oryctolagus cuniculus]
          Length = 439

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 146/360 (40%), Positives = 206/360 (57%), Gaps = 17/360 (4%)

Query: 14  EPPLEPYKEGPGEGGKAYHL-PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 72
           EP  E  K      G   H  PE Y     +   +YG+N+  S  +   R +PD R + C
Sbjct: 66  EPAFEHLKSYSKPIGNFNHSNPEFY-----SGFFKYGLNILISRSVGIRRDVPDTRDKIC 120

Query: 73  KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
               YP  LP AS+I+ FHNE  ++L+RT+ S++  TP+  LEEIILVDD S   DL ++
Sbjct: 121 HQKRYPHRLPTASIIICFHNEEINALLRTLSSVVNLTPSHLLEEIILVDDMSEFDDLKEE 180

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           L+  ++ F G V+LIRN  REGLIR R  GA  + G+V+VFLD+HCEV   WL PLL+ I
Sbjct: 181 LDQKLEDFRGVVKLIRNKRREGLIRARLIGAAHASGDVLVFLDSHCEVNKVWLEPLLSVI 240

Query: 193 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 252
             D   +  P+ID ID  T E    Y+P    RG F W + +K + +   E +  +  ++
Sbjct: 241 AKDPHTVVCPIIDVIDEMTLE----YKPSPIVRGTFNWMLQFKWDNVFSYEMEGPEGPAK 296

Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
           P +SP+ AGG+FA+ R +F E+G YD  + +WGGEN E+S +IWMCGG +  +PCSR+GH
Sbjct: 297 PIRSPSMAGGIFAIHRHYFKEIGQYDKDMDLWGGENVEISLRIWMCGGQLFIIPCSRVGH 356

Query: 313 VYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           + R     N            +T NY R++ TW DE +K  F+   P    +  G+ISE+
Sbjct: 357 ITRKSPEPNLAVTK------AVTRNYLRLVHTWLDE-YKEQFFLHRPGLRSIPYGNISER 409


>gi|395504161|ref|XP_003756425.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
           [Sarcophilus harrisii]
          Length = 563

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 141/346 (40%), Positives = 204/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y  DLP  S+++
Sbjct: 79  KAY-LASKLLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRCTSVHYASDLPATSIVI 137

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     K++ +R
Sbjct: 138 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KIKCLR 192

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA+ +  +++ FLD+HCEV   WL P+L  +  D   +  P+ID I 
Sbjct: 193 NDRREGLIRSRVRGAEVATADILTFLDSHCEVNSEWLQPMLQRVKEDYTRVVSPIIDVIS 252

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D+
Sbjct: 253 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSRTDPTQPIRTPVIAGGIFVIDK 309

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PY+F      
Sbjct: 310 SWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYDFP----- 364

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G I+++
Sbjct: 365 -EGNALTYIKNTKRTAEVWMDE-YKQYYYEARPSAIGKSFGSIADR 408


>gi|327281948|ref|XP_003225707.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1-like
           [Anolis carolinensis]
          Length = 574

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 144/356 (40%), Positives = 205/356 (57%), Gaps = 21/356 (5%)

Query: 22  EGPGEGG---KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYP 78
           E PG  G   KAY +      AG+    ++  N   S+ +S DR I D R   C    Y 
Sbjct: 80  EKPGLRGFDEKAY-VSSKLLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRCASIHYG 138

Query: 79  LDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ 138
            DLP  S+I+ FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  +  Q L     
Sbjct: 139 ADLPSTSIIITFHNEARSTLLRTVTSVLNRTPANLIQEIILVDDFSSDPEDCQLLTKI-- 196

Query: 139 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 198
               KV+ +RN  REGLIR+R RGA  +  +++ FLD+HCEV   WL P+L  +  D   
Sbjct: 197 ---PKVKCLRNNRREGLIRSRVRGADMATADILTFLDSHCEVNSEWLQPMLQRVKEDYTR 253

Query: 199 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPT 258
           +  P+ID I    + + +        RG F+W + +K  ++P  +   R   ++  ++P 
Sbjct: 254 VVSPIIDVISLDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKLSRTDPTQSIRTPV 310

Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
            AGG+F +D+++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   
Sbjct: 311 IAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRH 370

Query: 319 PYNFGKLADRVKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           PY+F       +G  +TY  N KR  E W DE +K Y+Y   P A+    G I+++
Sbjct: 371 PYDFP------EGNALTYIKNTKRTAEVWMDE-YKQYYYEARPSAIGKSFGSIADR 419


>gi|8918932|dbj|BAA97985.1| unnamed protein product [Mus musculus]
          Length = 558

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 144/345 (41%), Positives = 201/345 (58%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           EAY +A     GE     +  N   S+ +S DR I D R   C    Y  DLP  SVI+ 
Sbjct: 71  EAYLSAKQLKPGEDPYRQHAFNQLESDKLSSDRPIRDTRHYSCPSLSYSSDLPATSVIIT 130

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
            +REGLIR+R R A  +   V+ FLD+HCEV + WL P+L  +  D   +  P+ID I  
Sbjct: 186 DKREGLIRSRVRRADVAGATVLTFLDSHCEVNVEWLQPMLQRVMEDHTRVVSPIIDVISL 245

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDLTKPIRTPVIAGGIFVIDKS 302

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400


>gi|291389167|ref|XP_002711235.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
           [Oryctolagus cuniculus]
          Length = 622

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 151/370 (40%), Positives = 213/370 (57%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P  + PG  G+A+   E         D    ++  N   S+ IS  R + PD R  
Sbjct: 106 PPQDP--KSPGADGRAFQKSEWTPQETQEKDEGYKKHCFNAFASDRISLQRALGPDTRPP 163

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  LP  SVI+VFHNE +S+L+RTV+S++   PA  L EIILVDD S++ 
Sbjct: 164 ECVDQKFRRCP-PLPSTSVIIVFHNEAWSTLLRTVYSVLHTAPAILLREIILVDDASTEE 222

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L +KLE Y+++    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 223 YLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFTGWLEP 281

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D  ++  P I  ID  T+EF + V     H RG F+W + +    +P  E ++
Sbjct: 282 LLARIAEDETVVVSPDIVTIDLNTFEFSKPVQRGRVHSRGNFDWSLTFGWEAVPAHENRR 341

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
           CS +GHV+R+  P+ F K  +     +I  N  R+ E W D  +K  FY R      +A 
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTN-----VIARNQVRLAEVWMD-NYKKIFYRRNLQAAKMAQ 455

Query: 363 FLDMGDISEQ 372
               GDISE+
Sbjct: 456 EKSFGDISER 465


>gi|307207692|gb|EFN85329.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Harpegnathos
           saltator]
          Length = 598

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 146/352 (41%), Positives = 203/352 (57%), Gaps = 7/352 (1%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 82
           G GE G+  +L    +  G+  L +  +N+  SN IS  R +PD+R   C    Y   LP
Sbjct: 84  GLGENGEPAYLHGKEKVEGETVLAKKALNVVLSNKISLTRKLPDVRNPLCANLTYDTLLP 143

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFN 141
             SVI++F+NE +S L+RTVHS++K +    L+EIILVDD S + +L  +L+ Y+  R  
Sbjct: 144 SVSVIIIFYNEPWSVLLRTVHSVLKGSLPHLLKEIILVDDHSEEEELQGQLDYYLSTRLP 203

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
            KV+L+R   R+GLIR R  GAK + G+V+VFLDAHCEV  +WL PLL  I   R  + +
Sbjct: 204 TKVKLLRLPYRQGLIRARLHGAKNATGDVLVFLDAHCEVIKDWLQPLLQRIKEKRNAVLM 263

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
           P+ID I  +T E+    E      G F W   +    + + E K R     P +SPT AG
Sbjct: 264 PIIDNISEETLEYFHDNEASFFQVGGFTWSGHFTWINIQKHELKSRLSLISPTRSPTMAG 323

Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
           GLFA+DR +F E+G YD  +  WGGEN E+SF+IW CGG++E +PCSR+GH++R+F PY 
Sbjct: 324 GLFAIDRKYFWEVGSYDDKMDGWGGENLEMSFRIWQCGGTLEIIPCSRVGHIFRNFHPYK 383

Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDM-GDISEQ 372
           F    D   G     N  R+   W DE  + +   R        + GDISE+
Sbjct: 384 FPNDKD-THG----INTARLAFVWMDEYKRLFLLHRSEFKNKSSLFGDISER 430


>gi|391348383|ref|XP_003748427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Metaseiulus occidentalis]
          Length = 648

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 150/349 (42%), Positives = 202/349 (57%), Gaps = 10/349 (2%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL-DLPK 83
           G+ G A  L    +   D    +   N+  S+ +  +R++ D R   CK   YP+ +LP 
Sbjct: 134 GKNGHAVILGPEEQLEADKEFSKAAFNVYVSDRLPLNRSLRDTRHRHCKAVTYPMAELPT 193

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNG 142
           ASV+++F +E FS+L+RT+ S I R+P   L EIILVDDFS   DL  +L+ YI   F  
Sbjct: 194 ASVVIIFTDEIFSTLLRTIVSTINRSPNHLLREIILVDDFSQSEDLKDRLQRYITHHFRA 253

Query: 143 KV-RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
            V RLIR  ER GLIR R  GA+ ++G+V++FLD+HCE    WL PLL PI  DR+ +  
Sbjct: 254 DVVRLIRLPERSGLIRARLAGARAAKGDVLIFLDSHCETTPGWLEPLLEPIRRDRRAVVC 313

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 261
           PVID ID +T ++ +  E D    G F W   +  + +P    K R   +EP +SPT AG
Sbjct: 314 PVIDIIDDKTLQYVAA-EGDRFQIGGFNWKGEFSWHNIPAAWRKNRTSIAEPMRSPTMAG 372

Query: 262 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYN 321
           GLFA++R +F E G YD  +  WGGEN E+SF+IW CGG I   PCS +GH++R + PY 
Sbjct: 373 GLFAINREYFWESGSYDEEMDGWGGENLEMSFRIWQCGGHIVIAPCSHVGHIFRDYHPYK 432

Query: 322 FGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           F K  D         N KR +E W DE  K YFY   P    + +GDIS
Sbjct: 433 FPKGKD-----TNAINTKRAVEVWMDE-FKKYFYQTRPELTKMKVGDIS 475


>gi|58865788|ref|NP_001012109.1| polypeptide N-acetylgalactosaminyltransferase 14 [Rattus
           norvegicus]
 gi|50926091|gb|AAH79128.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 (GalNAc-T14)
           [Rattus norvegicus]
 gi|149050682|gb|EDM02855.1| rCG61782, isoform CRA_b [Rattus norvegicus]
          Length = 552

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 140/336 (41%), Positives = 195/336 (58%), Gaps = 19/336 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R + C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
           RT+ S++ RTP   ++EIILVDDFS+        ED  Q     KV+ +RN+ER+GL+R+
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNSERQGLVRS 182

Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
           R RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    
Sbjct: 183 RMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---I 239

Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
           E     RG F+W + ++  +L   +   R   +EP ++P  AGGLF +D+A+F  LG YD
Sbjct: 240 ESASELRGGFDWSLHFQWEQLSVEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYD 299

Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
             + +WGGENFE+SF++WMCGG +E +PCSR+GHV+R   PY F        G   TY  
Sbjct: 300 VDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIK 353

Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           N KR  E W DE +K Y+Y   P A+    G+I  +
Sbjct: 354 NTKRTAEVWMDE-YKQYYYAARPFALERPFGNIENR 388


>gi|196006600|ref|XP_002113166.1| hypothetical protein TRIADDRAFT_27135 [Trichoplax adhaerens]
 gi|190583570|gb|EDV23640.1| hypothetical protein TRIADDRAFT_27135, partial [Trichoplax
           adhaerens]
          Length = 491

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 142/337 (42%), Positives = 202/337 (59%), Gaps = 17/337 (5%)

Query: 38  RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSS 97
           R + D    ++  N   S+ I   R +PD R   CK   Y L++P  SV+++FHNE  S+
Sbjct: 11  RGSKDEGYEKHQFNQFESDIIGAYRRVPDTRNPLCKNKIYRLNMPSVSVVIIFHNEARST 70

Query: 98  LMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIR 157
           L+RTV S++ RTP   L EI+LVDD S  A L Q+L         KV+LIRN +REGLIR
Sbjct: 71  LLRTVQSVLDRTPPHLLSEIVLVDDNSDDATLGQELLTL-----PKVKLIRNKKREGLIR 125

Query: 158 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 217
           +R  G K S+G+ I+FLD+HCEV   W  PLL  I  + K +  PV+D ID  T+E++  
Sbjct: 126 SRVFGVKSSQGKAIIFLDSHCEVNQQWAEPLLEQIVLNPKAIVSPVLDNIDMNTFEYQ-- 183

Query: 218 YEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGY 277
            E     RG F+W + ++ + + E    +R   + P K+PT AGG++A+ + +F +LG Y
Sbjct: 184 -EGTEDVRGGFDWSLTFRWDYMTEAMINQRIDPTSPIKTPTIAGGIYAVSKQWFNDLGEY 242

Query: 278 DPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY- 336
           D G  +WGGEN ELSF+ WMCGG ++ +PCSR+GHV+R   PY F + A R      TY 
Sbjct: 243 DMGQKIWGGENLELSFRAWMCGGFMKIIPCSRVGHVFRLQHPYIFPEGAGR------TYY 296

Query: 337 -NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            N +RV+E W DE +K YFY    +   +D G++  +
Sbjct: 297 RNLRRVVEVWLDE-YKVYFYQIRKIIKSIDYGNVKSR 332


>gi|26347119|dbj|BAC37208.1| unnamed protein product [Mus musculus]
          Length = 550

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 138/335 (41%), Positives = 195/335 (58%), Gaps = 17/335 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R + C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPHTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   ++EIILVDDFS+  +  ++L         KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSNDPEDCKQLIKL-----PKVKCLRNNERQGLVRSR 183

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 184 MRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---IE 240

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +   R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 241 SASELRGGFDWSLHFQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDV 300

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGG +E +PCSR+GHV+R   PY F        G   TY  N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            KR  E W DE +K Y+Y   P A+    G+I  +
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFALERHFGNIENR 388


>gi|332243646|ref|XP_003270989.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5
           [Nomascus leucogenys]
          Length = 443

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 136/325 (41%), Positives = 201/325 (61%), Gaps = 11/325 (3%)

Query: 45  LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
           L +YG N+  S  +  +R +PD R + C    YP  LP AS+++ F+NE F++L RTV S
Sbjct: 97  LLKYGFNVIISRSLGIEREVPDTRSKMCLQKHYPARLPTASIVICFYNEEFNALFRTVSS 156

Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           ++  TP  +LEEIILVDD S   DL +KL+ +++ F  K+++IRN +REGLIR R  GA 
Sbjct: 157 VMNLTPHYFLEEIILVDDMSEVDDLKEKLDYHLETFREKIKIIRNKKREGLIRARLIGAS 216

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            + G+V+VFLD+HCEV   WL PLL  I  D K++  P+ID ID +T E    Y+P    
Sbjct: 217 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKVVVCPLIDVIDDRTLE----YKPSPVV 272

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RG F+W + +K + +   E    +  ++P  SP  +GG+FA+ R +F E+G YD  +  W
Sbjct: 273 RGTFDWNLQFKWDNVFSYEMDGPEGPTKPIWSPAMSGGIFAIRRHYFNEIGQYDKDMDFW 332

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
           GGEN ELS +IWMCGG +  +PCSR+GH+ +       GK +  +    +T+NY R++  
Sbjct: 333 GGENLELSLRIWMCGGQLFIIPCSRVGHISKK----QTGKPSTIISA--MTHNYLRLVHV 386

Query: 345 WFDEKHKAYFYTREPLAMFLDMGDI 369
           W DE +K  F+ R+P   ++  G+I
Sbjct: 387 WLDE-YKEQFFLRKPGLKYVTYGNI 410


>gi|155371981|ref|NP_001094597.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Bos taurus]
 gi|151554939|gb|AAI47930.1| GALNTL1 protein [Bos taurus]
 gi|296482974|tpg|DAA25089.1| TPA: polypeptide N-acetylgalactosaminyltransferase-like 1 [Bos
           taurus]
          Length = 557

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 145/345 (42%), Positives = 199/345 (57%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY AA     GE     +  N   S+ +S DR I D R   C    Y  DLP  SVI+ 
Sbjct: 71  KAYLAAKQLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVIIT 130

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
             REGLIR+R RGA  +   V+ FLD+HCEV   WL P+L  +  D   +  P+ID I  
Sbjct: 186 DRREGLIRSRVRGADVAAAAVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 245

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 302

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N KR  E W DE  K Y+Y   P A+    G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-FKQYYYEARPSAIGKAFGSVATR 400


>gi|296211689|ref|XP_002752525.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
           [Callithrix jacchus]
          Length = 622

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYH---LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+    L        +    ++  N   S+ IS  R++ PD R  
Sbjct: 106 PPQDP--NAPGADGKAFQKRKLTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  L   SVI+VFHNE +S+L+RTV+S++  TPA  L+EIILVDD S++ 
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L +KLE Y+++    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASMAQAEVLTFLDAHCECFHGWLEP 281

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D+ ++  P I  ID  T+EF + +     H RG F+W + +    LP  E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPIQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
           CS +GHV+R+  P+ F K  +     +I  N  R+ E W D   K  FY R      +A 
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTN-----VIARNQVRLAEVWMD-SFKKIFYRRNLQAAKMAQ 455

Query: 363 FLDMGDISEQ 372
               GDISE+
Sbjct: 456 EKSFGDISER 465


>gi|440897357|gb|ELR49068.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Bos grunniens mutus]
          Length = 557

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 145/345 (42%), Positives = 199/345 (57%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY AA     GE     +  N   S+ +S DR I D R   C    Y  DLP  SVI+ 
Sbjct: 71  KAYLAAKQLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVIIT 130

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
             REGLIR+R RGA  +   V+ FLD+HCEV   WL P+L  +  D   +  P+ID I  
Sbjct: 186 DRREGLIRSRVRGADVAAAAVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 245

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 302

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N KR  E W DE  K Y+Y   P A+    G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-FKQYYYEARPSAIGKAFGSVATR 400


>gi|224051278|ref|XP_002200509.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1
           [Taeniopygia guttata]
          Length = 570

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 140/346 (40%), Positives = 202/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y  DLP  S+I+
Sbjct: 86  KAY-LSSKVLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRCTSVRYDTDLPATSLII 144

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTP   ++EIILVDDFSS  +  Q L         KV+ +R
Sbjct: 145 TFHNEARSTLLRTVKSVLNRTPPSLIQEIILVDDFSSDPEDCQLLTKI-----PKVKCLR 199

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           NT REGLIR+R RGA+ +  +++ FLD+HCEV   WL P+L  +  D   +  P+ID I 
Sbjct: 200 NTHREGLIRSRVRGAEVATADILTFLDSHCEVNSEWLQPMLQRVKEDYTRVVSPIIDVIS 259

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   ++  ++P  AGG+F +D+
Sbjct: 260 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVIDK 316

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PY+F      
Sbjct: 317 SWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYDFP----- 371

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++++
Sbjct: 372 -EGNALTYIKNTKRTAEVWMDE-YKQYYYEARPSAIGKSFGSVADR 415


>gi|395828928|ref|XP_003787614.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
           [Otolemur garnettii]
          Length = 678

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 138/324 (42%), Positives = 191/324 (58%), Gaps = 17/324 (5%)

Query: 48  YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
           Y  N   S     +R +PD R   C    Y  DLP  S+I+ FHNE  S+L+RT+ S++ 
Sbjct: 77  YAFNQRESERTPSNRAVPDTRHSRCTLLVYYTDLPPTSIIITFHNEARSTLLRTIRSVLN 136

Query: 108 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 167
           RTP   ++EIILVDDFS+  D  ++L         KV+ +RN ER+GL+R+R RGA  ++
Sbjct: 137 RTPMHLIQEIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSRIRGADVAQ 191

Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
           G  + FLD+HCEV  +WL PLL  I  D   +  PVID I+  T+ +    E     RG 
Sbjct: 192 GTTLTFLDSHCEVNRDWLQPLLHRIKEDYTRVVCPVIDIINLDTFTY---IESASELRGG 248

Query: 228 FEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
           F+W + ++  +L   +  +R   +EP ++P  AGGLF +D+A+F  LG YD  + +WGGE
Sbjct: 249 FDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGE 308

Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIETW 345
           NFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N KR  E W
Sbjct: 309 NFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKNTKRTAEVW 362

Query: 346 FDEKHKAYFYTREPLAMFLDMGDI 369
            DE +K Y+Y   P A+    G+I
Sbjct: 363 MDE-YKQYYYAARPFALERPFGNI 385


>gi|357622639|gb|EHJ74065.1| putative N-acetylgalactosaminyltransferase [Danaus plexippus]
          Length = 646

 Score =  265 bits (676), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 146/348 (41%), Positives = 205/348 (58%), Gaps = 29/348 (8%)

Query: 40  AGDASLGEYGMNMET-----SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEG 94
           A D  + E G NM       S  I   R +PD R + C+   YP  LPKAS+I+ F+NE 
Sbjct: 132 ADDVRIREKGYNMHAFNTLISQRIGNHRGLPDTRNKLCRSQKYPDKLPKASIIICFYNEH 191

Query: 95  FSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREG 154
           F +LMR+VHSI+ RT  +YL+EIILVDD+S   DL ++++  +   NGK+ +   + REG
Sbjct: 192 FETLMRSVHSILDRTDLKYLKEIILVDDYSDITDLHEEVQKAVNELNGKMLITLTSTREG 251

Query: 155 LIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI----------YSDRKIMTVPVI 204
           LIR R  GA  S G+V+VFLD+H EV ++WLPPLL  +          +S R +   P+I
Sbjct: 252 LIRARLYGADNSVGDVLVFLDSHIEVNVDWLPPLLTRLSEGVDGVNVRFSPRAV--TPII 309

Query: 205 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLF 264
           D I+  T+E+ S        RG F WG+ +K + LP+   K  +   +P +SPT AGGLF
Sbjct: 310 DVINADTFEYTS----SPLVRGGFNWGLHFKWDNLPKGTLKDDEDFIKPIRSPTMAGGLF 365

Query: 265 AMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGK 324
           A+ R +F ++G YD G+ +WGGEN E+SF+IWMCGG +E  PCSR+GHV+R   PY  G+
Sbjct: 366 AIYREYFNKIGKYDSGMNLWGGENLEISFRIWMCGGVLELCPCSRVGHVFRKRRPYGAGE 425

Query: 325 LADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                    +  N  R+   W DE +      + P A  + +GDISE+
Sbjct: 426 -------DYMLRNSMRMARVWMDE-YVNKVIEQNPSAAHVSIGDISER 465


>gi|332206188|ref|XP_003252173.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
           [Nomascus leucogenys]
          Length = 622

 Score =  265 bits (676), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 153/371 (41%), Positives = 215/371 (57%), Gaps = 24/371 (6%)

Query: 15  PPLEPYKEGPGEGGKAYH----LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRM 69
           PP +P    PG  GKA+      P   R   +    ++  N   S+ IS  R++ PD R 
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETREK-EEGYKKHCFNAFASDRISLQRSLGPDTRP 162

Query: 70  EEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
            EC   K+   P  L   SVI+VFHNE +S+L+RTV+S++  TPA  L+EIILVDD S++
Sbjct: 163 PECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTE 221

Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
             L +KLE Y+++    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL 
Sbjct: 222 EHLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLE 280

Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
           PLLA I  D+ ++  P I  ID  T+EF + V     H RG F+W + +    LP  E +
Sbjct: 281 PLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQ 340

Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
           +RK  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +
Sbjct: 341 RRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEII 400

Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLA 361
           PCS +GHV+R+  P+ F K        +I  N  R+ E W D  +K  FY R      +A
Sbjct: 401 PCSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMA 454

Query: 362 MFLDMGDISEQ 372
                GDISE+
Sbjct: 455 QEKSFGDISER 465


>gi|62148928|dbj|BAD93348.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase-4 [Rattus
           norvegicus]
          Length = 578

 Score =  265 bits (676), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 147/361 (40%), Positives = 202/361 (55%), Gaps = 16/361 (4%)

Query: 14  EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 73
           +PP + +  G         L E      +  +  Y +N+  S+ IS  R I D RM ECK
Sbjct: 66  KPPADSHALGEWGRASKLQLDEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECK 125

Query: 74  YWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 132
              +    LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EIILVDD S +  L  +
Sbjct: 126 AKKFHYRSLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRIYLKAQ 185

Query: 133 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 192
           LE YI   + +VRL R  +REGL+R R  GA  + G+V+ FLD HCE    WL PLL  I
Sbjct: 186 LEAYISNLD-RVRLTRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLERI 244

Query: 193 YSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 251
             D   +  PVID ID+ T+EF     EP     G F+W + ++ + +P+ E  +R    
Sbjct: 245 SRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRTSRI 301

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
           +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG +E  PCS +G
Sbjct: 302 DPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVG 361

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+    PY           P    N  R  E W D+ +K +FY R P A      DISE
Sbjct: 362 HVFSKRAPY---------ARPNFLQNTAREAEVWMDD-YKEHFYNRNPPARKETYDDISE 411

Query: 372 Q 372
           +
Sbjct: 412 R 412


>gi|432936506|ref|XP_004082149.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1-like
           [Oryzias latipes]
          Length = 533

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 135/346 (39%), Positives = 202/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AGD    E+  N++ S+ +  +R I D R   C    Y  DLP  +VI+
Sbjct: 52  KAY-LSAKQLKAGDDPYREHAFNLQESDRLGGERAIRDTRHYRCAALSYDADLPSTTVII 110

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ R+P   ++E++L+DDFSS  +  Q L         KVR +R
Sbjct: 111 TFHNEARSTLLRTVKSVLMRSPPSLIQEVLLIDDFSSDLEDCQLLAQI-----PKVRCLR 165

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N+ REGLIR+R +GA  +   ++ FLD+HCEV  +WL P++  +  D   +  P+ID I 
Sbjct: 166 NSRREGLIRSRVKGANSASAPILTFLDSHCEVNTDWLQPMIQRVKEDHTRVVSPIIDVIS 225

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F MD+
Sbjct: 226 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMARSDPTLPIRTPVIAGGIFVMDK 282

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E +PCSR+GHV+R   PY+F      
Sbjct: 283 SWFNHLGQYDTHMDIWGGENFELSFRVWMCGGSLEILPCSRVGHVFRKRHPYDFP----- 337

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N +R  E W DE +K ++Y+  P A     G I+E+
Sbjct: 338 -EGNALTYIKNTRRAAEVWMDE-YKQFYYSARPSAQGKAFGSITER 381


>gi|395846631|ref|XP_003796006.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
           [Otolemur garnettii]
          Length = 943

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 149/373 (39%), Positives = 220/373 (58%), Gaps = 18/373 (4%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           V K D  L   +P      + PG+ G+   +P       +    E   N+  S+ I  DR
Sbjct: 426 VLKIDVTLSPRDP------KAPGQFGRPVVVPLGKEKEAERRWKEGNFNVYLSDLIPVDR 479

Query: 63  TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 122
            I D R   C       +LP  SVI+ F +E +S+L+R+VHS++ R+P   ++EI+LVDD
Sbjct: 480 AIEDTRPVGCAEQLVHSNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDD 539

Query: 123 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 182
            S+K  L   L++Y+ +F  KVR++R  ER GLIR R  GA+ + G+V+ FLD+H E  +
Sbjct: 540 CSTKDYLKDNLDEYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 598

Query: 183 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-E 241
            WL PLL  +Y  R+ +  PVI+ I+ +   + +V   D+  RGIF W M +    +P +
Sbjct: 599 GWLEPLLERVYLSRQKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWKTIPPD 655

Query: 242 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 301
             AK +   ++  + P  AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG 
Sbjct: 656 VVAKNKIKETDIIRCPVMAGGLFSIDKNYFYELGTYDPGLDVWGGENMELSFKVWMCGGE 715

Query: 302 IEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EP 359
           IE +PCSR+GH++R+  PY+F K  DR+K   +  N  RV E W DE +K  FY      
Sbjct: 716 IEIIPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELFYGHGDHL 770

Query: 360 LAMFLDMGDISEQ 372
           +   L++G++++Q
Sbjct: 771 IDQGLEVGNLTQQ 783


>gi|148706466|gb|EDL38413.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14, isoform CRA_b [Mus
           musculus]
          Length = 551

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 140/336 (41%), Positives = 194/336 (57%), Gaps = 19/336 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R + C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 70  VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 129

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
           RT+ S++ RTP   ++EIILVDDFS+        ED  Q     KV+ +RN ER+GL+R+
Sbjct: 130 RTIRSVLNRTPMHLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNNERQGLVRS 183

Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
           R RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    
Sbjct: 184 RMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---I 240

Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
           E     RG F+W + ++  +L   +   R   +EP ++P  AGGLF +D+A+F  LG YD
Sbjct: 241 ESASELRGGFDWSLHFQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYD 300

Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
             + +WGGENFE+SF++WMCGG +E +PCSR+GHV+R   PY F        G   TY  
Sbjct: 301 VDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIK 354

Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           N KR  E W DE +K Y+Y   P A+    G+I  +
Sbjct: 355 NTKRTAEVWMDE-YKQYYYAARPFALERPFGNIENR 389


>gi|327290100|ref|XP_003229762.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Anolis carolinensis]
          Length = 634

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 156/371 (42%), Positives = 217/371 (58%), Gaps = 24/371 (6%)

Query: 15  PPLEPYKEGPGEGGKAY---HLPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +     PG  GKA+   +L    +   +    ++  N   S+ IS  R + PD R  
Sbjct: 115 PPQD--SNAPGASGKAFKTINLSPDEQKEKERGDEKHCFNAFASDRISLHRDLGPDTRPP 172

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  LP  SVI+VFHNE +S+L+RTVHS++  +PA  L+EIILVDD S   
Sbjct: 173 ECIEQKFKRCP-PLPTTSVIIVFHNEAWSTLLRTVHSVMYTSPAILLKEIILVDDASVDD 231

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L  KL+DY+++F+  V+++R  ER+GLI  R  GA  + GE + FLDAHCE    WL P
Sbjct: 232 YLQDKLDDYVKQFH-IVKVVRQKERKGLITARLLGASIATGETLTFLDAHCECFYGWLEP 290

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYKENELPEREAK 245
           LLA I  +   +  P I  ID  T+EF   S Y   H+ RG F+W + +    LPE E+K
Sbjct: 291 LLARIAENNTYVVSPDISSIDLNTFEFSKPSPYGQSHN-RGNFDWSLSFGWESLPEHESK 349

Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
           KRK  + P K+PT AGGLF++ + +F  +G YD  + +WGGEN E+SF++W CGG +E +
Sbjct: 350 KRKDETYPIKTPTFAGGLFSISKDYFYNIGSYDEEMEIWGGENIEMSFRVWQCGGQLEII 409

Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL- 364
           PCS +GHV+RS  P++F K        +IT N  R+ E W DE +K  FY R   A  + 
Sbjct: 410 PCSVVGHVFRSKSPHSFPKGTQ-----VITRNQVRLAEVWMDE-YKNIFYRRNTEAAKIV 463

Query: 365 ---DMGDISEQ 372
                GDIS++
Sbjct: 464 KQQTFGDISKR 474


>gi|426372562|ref|XP_004053192.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Gorilla
           gorilla gorilla]
          Length = 622

 Score =  264 bits (675), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+   +         +    ++  N   S+ IS  R++ PD R  
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  L   SVI+VFHNE +S+L+RTV+S++  TPA  L+EIILVDD S++ 
Sbjct: 164 ECVDQKFQRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L +KLE Y+++    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D+ ++  P I  ID  T+EF + V     H RG F+W + +    LP  E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
           CS +GHV+R+  P+ F K        +I  N  R+ E W D  +K  FY R      +A 
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMAQ 455

Query: 363 FLDMGDISEQ 372
               GDISE+
Sbjct: 456 EKSFGDISER 465


>gi|108935842|sp|Q8BVG5.2|GLT14_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 14;
           AltName: Full=Polypeptide GalNAc transferase 14;
           Short=GalNAc-T14; Short=pp-GaNTase 14; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 14;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 14
          Length = 550

 Score =  264 bits (675), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 140/336 (41%), Positives = 194/336 (57%), Gaps = 19/336 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R + C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
           RT+ S++ RTP   ++EIILVDDFS+        ED  Q     KV+ +RN ER+GL+R+
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNNERQGLVRS 182

Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
           R RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    
Sbjct: 183 RMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---I 239

Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
           E     RG F+W + ++  +L   +   R   +EP ++P  AGGLF +D+A+F  LG YD
Sbjct: 240 ESASELRGGFDWSLHFQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYD 299

Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
             + +WGGENFE+SF++WMCGG +E +PCSR+GHV+R   PY F        G   TY  
Sbjct: 300 VDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIK 353

Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           N KR  E W DE +K Y+Y   P A+    G+I  +
Sbjct: 354 NTKRTAEVWMDE-YKQYYYAARPFALERPFGNIENR 388


>gi|254910954|ref|NP_082140.2| polypeptide N-acetylgalactosaminyltransferase 14 [Mus musculus]
 gi|115527999|gb|AAI17801.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 [Mus musculus]
          Length = 550

 Score =  264 bits (675), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 140/336 (41%), Positives = 194/336 (57%), Gaps = 19/336 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R + C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
           RT+ S++ RTP   ++EIILVDDFS+        ED  Q     KV+ +RN ER+GL+R+
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNNERQGLVRS 182

Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
           R RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    
Sbjct: 183 RMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---I 239

Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
           E     RG F+W + ++  +L   +   R   +EP ++P  AGGLF +D+A+F  LG YD
Sbjct: 240 ESASELRGGFDWSLHFQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYD 299

Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
             + +WGGENFE+SF++WMCGG +E +PCSR+GHV+R   PY F        G   TY  
Sbjct: 300 VDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIK 353

Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           N KR  E W DE +K Y+Y   P A+    G+I  +
Sbjct: 354 NTKRTAEVWMDE-YKQYYYAARPFALERPFGNIENR 388


>gi|198426119|ref|XP_002128247.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
           6 [Ciona intestinalis]
          Length = 627

 Score =  264 bits (675), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 150/364 (41%), Positives = 213/364 (58%), Gaps = 20/364 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH-----ISFDRTIPDLRM 69
           P ++P    PGE GKAY + +   +A    L + G +    NH     IS  R++ D R 
Sbjct: 115 PKVDP--SAPGEYGKAYKVTD--NSAEVKKLVKEGWDKHAFNHYVCQKISLHRSVGDKRD 170

Query: 70  EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
           +ECK   +   LP  SVI++FHNE + +L+RTVHS+++ +P   L+EIILVDD S+ ++L
Sbjct: 171 QECKVRKWRKPLPDTSVIIIFHNEAWCALLRTVHSVLENSPKILLKEIILVDDASTLSNL 230

Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
            ++L DY+ +    V++IR   R GLIR R  GA+E++G V+ FLD+HCE   +WL P+L
Sbjct: 231 GKELTDYVAKLQ-IVKIIRLPSRAGLIRARLAGAQEAQGSVLTFLDSHCECAPHWLEPML 289

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER-EAKKRK 248
             I  D   +  PVI+ ID  T+   S+        GI  W + +  N  P +    +  
Sbjct: 290 ERIAEDNTRVVCPVIEVIDADTFAM-SLTTARSVQTGILSWSLGF--NWAPRKINPGQPI 346

Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
            N E   S T AGGLFAM R +F  LG YD  +LVWGGEN E+S +IWMCGGS+E  PCS
Sbjct: 347 KNDEALTSATMAGGLFAMSRKYFYHLGSYDNDMLVWGGENIEMSLRIWMCGGSLEIHPCS 406

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
            +GHV+R   PY+    +D     +IT+N KRV E W DE +K  +Y R P A  ++ GD
Sbjct: 407 HVGHVFRKRAPYSHPGGSD-----VITHNNKRVAEVWLDE-YKEQYYKRVPRARAVEAGD 460

Query: 369 ISEQ 372
           ++ +
Sbjct: 461 LTAR 464


>gi|326427851|gb|EGD73421.1| GALNT4 protein [Salpingoeca sp. ATCC 50818]
          Length = 537

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 140/311 (45%), Positives = 193/311 (62%), Gaps = 20/311 (6%)

Query: 51  NMETSNHISFDRTIPDLRMEEC---KYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSII 106
           N   S+ +S  R   D R  EC   KY  YPL +LP  SVIL+F+NE  S+L+RTV S++
Sbjct: 172 NQWISDRLSLHRRAYDTRPVECLHKKY--YPLSELPTVSVILIFYNEARSTLLRTVWSVL 229

Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
            R+P   ++EI+LVDD SS   L   L+  +     K R+IR  ER GLIR +  GA+++
Sbjct: 230 DRSPRSLIKEILLVDDHSSMPHLGYPLDQEVAGIP-KTRVIRLPERSGLIRAKVYGAQQA 288

Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRS-VYEPDHHYR 225
           RG+V+V+LD+HCEV   WL PLL  I  +RK + +P+ID IDY+TWE R+ + E     R
Sbjct: 289 RGDVLVYLDSHCEVNDGWLEPLLDRIRRNRKTVAMPIIDAIDYETWEHRTGLLE-----R 343

Query: 226 GIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWG 285
           GIF+W +++K  +L   + + R  +++P+ SP  AGGLFAMDR +F E+G YD G+  WG
Sbjct: 344 GIFDWSLVFKWKQLTADDKRGRPDDTDPFASPAMAGGLFAMDRKYFFEVGAYDMGMETWG 403

Query: 286 GENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGP--LITYNYKRVIE 343
           GEN E+S ++W CGG IE +PCS + HV+R   PY F     + K P   I  N  RV E
Sbjct: 404 GENIEMSMRVWACGGRIEALPCSHVAHVFRKKTPYEF-----KTKDPQETIARNLNRVAE 458

Query: 344 TWFDEKHKAYF 354
            W DE    Y+
Sbjct: 459 VWMDEYKDVYY 469


>gi|410210024|gb|JAA02231.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
           troglodytes]
 gi|410247040|gb|JAA11487.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
           troglodytes]
 gi|410351197|gb|JAA42202.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
           troglodytes]
          Length = 622

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+   +         +    ++  N   S+ IS  R++ PD R  
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  L   SVI+VFHNE +S+L+RTV+S++  TPA  L+EIILVDD S++ 
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L +KLE Y+++    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D+ ++  P I  ID  T+EF + V     H RG F+W + +    LP  E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
           CS +GHV+R+  P+ F K        +I  N  R+ E W D  +K  FY R      +A 
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMAQ 455

Query: 363 FLDMGDISEQ 372
               GDISE+
Sbjct: 456 EKSFGDISER 465


>gi|115298684|ref|NP_009141.2| polypeptide N-acetylgalactosaminyltransferase 6 [Homo sapiens]
 gi|51316028|sp|Q8NCL4.2|GALT6_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
           AltName: Full=Polypeptide GalNAc transferase 6;
           Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 6;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 6
 gi|37572269|gb|AAH35822.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
           sapiens]
 gi|119578594|gb|EAW58190.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
           sapiens]
 gi|123980642|gb|ABM82150.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6)
           [synthetic construct]
 gi|123995463|gb|ABM85333.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6)
           [synthetic construct]
          Length = 622

 Score =  264 bits (674), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+   +         +    ++  N   S+ IS  R++ PD R  
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  L   SVI+VFHNE +S+L+RTV+S++  TPA  L+EIILVDD S++ 
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L +KLE Y+++    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D+ ++  P I  ID  T+EF + V     H RG F+W + +    LP  E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
           CS +GHV+R+  P+ F K        +I  N  R+ E W D  +K  FY R      +A 
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMAQ 455

Query: 363 FLDMGDISEQ 372
               GDISE+
Sbjct: 456 EKSFGDISER 465


>gi|26324460|dbj|BAC25984.1| unnamed protein product [Mus musculus]
          Length = 622

 Score =  264 bits (674), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 152/370 (41%), Positives = 213/370 (57%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+   E         +    ++  N   S+ IS  R++ PD R  
Sbjct: 106 PPQDP--NSPGADGKAFQKKEWTNLETKEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  LP  SVI+VFHNE +S+L+RTV+S++  +PA  L EIIL+DD S+  
Sbjct: 164 ECVDQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLNEIILMDDASTDE 222

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L ++LE Y+Q+    VR++R  ER GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 223 HLKERLEQYVQQLQ-IVRVVRQRERGGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D+  +  P I  ID  T++F R V     H RG F+W + +    LPE E ++
Sbjct: 282 LLARIAEDKTAVVSPDIVTIDLNTFQFSRPVQRGKAHSRGNFDWSLTFGWEMLPEHEKQR 341

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +A+F  +G YD  + +WGGEN E+SF++W CGG +  +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLGIIP 401

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL-- 364
           CS +GHV+R+  P+ F K        +I  N  R+ E W D+ +K  FY R   A  +  
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDD-YKKIFYRRNLQAAKMVQ 455

Query: 365 --DMGDISEQ 372
             + GDISE+
Sbjct: 456 ENNFGDISER 465


>gi|397479051|ref|XP_003810846.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           1 [Pan paniscus]
 gi|397479053|ref|XP_003810847.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           2 [Pan paniscus]
          Length = 622

 Score =  264 bits (674), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+   +         +    ++  N   S+ IS  R++ PD R  
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  L   SVI+VFHNE +S+L+RTV+S++  TPA  L+EIILVDD S++ 
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L +KLE Y+++    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D+ ++  P I  ID  T+EF + V     H RG F+W + +    LP  E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
           CS +GHV+R+  P+ F K        +I  N  R+ E W D  +K  FY R      +A 
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMAQ 455

Query: 363 FLDMGDISEQ 372
               GDISE+
Sbjct: 456 EKSFGDISER 465


>gi|89365963|gb|AAI14506.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
           sapiens]
          Length = 622

 Score =  264 bits (674), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+   +         +    ++  N   S+ IS  R++ PD R  
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  L   SVI+VFHNE +S+L+RTV+S++  TPA  L+EIILVDD S++ 
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L +KLE Y+++    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D+ ++  P I  ID  T+EF + V     H RG F+W + +    LP  E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
           CS +GHV+R+  P+ F K        +I  N  R+ E W D  +K  FY R      +A 
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMAQ 455

Query: 363 FLDMGDISEQ 372
               GDISE+
Sbjct: 456 EKSFGDISER 465


>gi|281485547|ref|NP_660335.2| putative polypeptide N-acetylgalactosaminyltransferase-like protein
           5 [Homo sapiens]
 gi|322510123|sp|Q7Z4T8.3|GLTL5_HUMAN RecName: Full=Putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5;
           AltName: Full=Polypeptide GalNAc transferase 15;
           Short=GalNAc-T15; Short=pp-GaNTase 15; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 15;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 15
          Length = 443

 Score =  264 bits (674), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 135/328 (41%), Positives = 203/328 (61%), Gaps = 11/328 (3%)

Query: 45  LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
           L +YG N+  S  +  +R +PD R + C    YP  LP AS+++ F+NE  ++L +T+ S
Sbjct: 97  LLKYGFNVIISRSLGIEREVPDTRSKMCLQKHYPARLPTASIVICFYNEECNALFQTMSS 156

Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           +   TP  +LEEIILVDD S   DL +KL+ +++ F GKV++IRN +REGLIR R  GA 
Sbjct: 157 VTNLTPHYFLEEIILVDDMSKVDDLKEKLDYHLETFRGKVKIIRNKKREGLIRARLIGAS 216

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            + G+V+VFLD+HCEV   WL PLL  I  D K++  P+ID ID +T E    Y+P    
Sbjct: 217 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPLV 272

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RG F+W + +K + +   E    + +++P +SP  +GG+FA+ R +F E+G YD  +  W
Sbjct: 273 RGTFDWNLQFKWDNVFSYEMDGPEGSTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFW 332

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
           G EN ELS +IWMCGG +  +PCSR+GH+ +       GK +  +    +T+NY R++  
Sbjct: 333 GRENLELSLRIWMCGGQLFIIPCSRVGHISKK----QTGKPSTIISA--MTHNYLRLVHV 386

Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           W DE +K  F+ R+P   ++  G+I E+
Sbjct: 387 WLDE-YKEQFFLRKPGLKYVTYGNIRER 413


>gi|194220840|ref|XP_001500424.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Equus
           caballus]
          Length = 539

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 137/335 (40%), Positives = 194/335 (57%), Gaps = 17/335 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R   C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 56  VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTTLVYCTDLPPTSIIITFHNEARSTLL 115

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   ++EIILVDDFS+  D   +L         KV+ +RN  R+GL+R+R
Sbjct: 116 RTIRSVLNRTPMNLIKEIILVDDFSNDPDDCNQLIKL-----PKVKCLRNENRQGLVRSR 170

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  + G ++ F+D+HCEV  +WL PLL  +  D   +  PVID I+   + +    E
Sbjct: 171 IRGADFAEGAILTFMDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDNFNY---IE 227

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +  +R   +EP ++P  AGGLF M++++F  LG YD 
Sbjct: 228 SATELRGGFDWSLHFQWEQLSPEQKAQRLDPAEPIRTPVIAGGLFVMNKSWFDYLGKYDM 287

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGGS+E VPCSR+GHV+R   PY F        G   TY  N
Sbjct: 288 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFP------DGNANTYIKN 341

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            KR +E W DE +K Y+Y   P A+    G+I  +
Sbjct: 342 TKRTVEVWMDE-YKQYYYAARPFALERPFGNIDSR 375


>gi|297691860|ref|XP_002823292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           2 [Pongo abelii]
 gi|395744294|ref|XP_002823293.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           3 [Pongo abelii]
          Length = 622

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+   +         +    ++  N   S+ IS  R++ PD R  
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  L   SVI+VFHNE +S+L+RTV+S++  TPA  L+EIILVDD S++ 
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L +KLE Y+++    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D+ ++  P I  ID  T+EF + V     H RG F+W + +    LP  E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQMEIIP 401

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
           CS +GHV+R+  P+ F K        +I  N  R+ E W D  +K  FY R      +A 
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMAQ 455

Query: 363 FLDMGDISEQ 372
               GDISE+
Sbjct: 456 EKSFGDISER 465


>gi|241746527|ref|XP_002414286.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
 gi|215508140|gb|EEC17594.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
          Length = 493

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 136/329 (41%), Positives = 202/329 (61%), Gaps = 15/329 (4%)

Query: 48  YGMNMETSNHISFDRTIPDLRMEECK---YWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
           +  N E S+ ++ +R IPD R  +C          +LP  SV++ FHNE  S+L+RT+ S
Sbjct: 84  HKFNQEASDALASNRAIPDTRHPQCAKEGLLKPQEELPATSVVITFHNEARSALLRTIVS 143

Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           ++ R+PA+ +EEIILVDDFS      ++L   IQ    K+RL+RNT+REGL+R+R RGA+
Sbjct: 144 VLNRSPAELIEEIILVDDFSDDPSDGEELAK-IQ----KIRLLRNTQREGLVRSRVRGAR 198

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            ++  V+ FLD+HCE    WLPPLL  +  D + +  PVID I+ +++++   +      
Sbjct: 199 AAKAPVLTFLDSHCECNQGWLPPLLRRVKEDPRRVVCPVIDVINLESFKY---FGASSDL 255

Query: 225 RGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLV 283
           RG F W +++K   L  +E ++R  N + P ++P  AGGLF +DRA F  LG YD  + +
Sbjct: 256 RGGFNWNLVFKWEFLSNKEREERANNPTLPIRTPMIAGGLFVVDRAQFERLGAYDTAMDI 315

Query: 284 WGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIE 343
           WGGEN ELSF+ W CGGS+E +PCSR+GHV+R   PY+F   +  V       N +R  E
Sbjct: 316 WGGENLELSFRAWQCGGSLEILPCSRVGHVFRKQHPYSFPGGSGNVFAR--QANTRRAAE 373

Query: 344 TWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            W D+ +K Y+Y   P+A  + MG + E+
Sbjct: 374 VWMDD-YKKYYYATVPVARNVPMGSVEER 401


>gi|291230380|ref|XP_002735141.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Saccoglossus kowalevskii]
          Length = 510

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 140/349 (40%), Positives = 203/349 (58%), Gaps = 23/349 (6%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
           GE GK   + ++ +   +        N+  S+ I+ +R++PD+R   C+  +YP  L   
Sbjct: 6   GEMGKPVFIADSQKEKMNQLFPLNQFNVMASDMIALNRSLPDIRPRGCQNREYPGVLQTT 65

Query: 85  SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 144
           SV++VFHNE +++L+RTVHS+I R+P   L EIILVDD+S++                 V
Sbjct: 66  SVVIVFHNEAWTTLLRTVHSVINRSPRHLLTEIILVDDYSNRV---------------PV 110

Query: 145 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 204
            +    +REGL R R  GA  + GEV+ FLD+HCE    WL PLLA I  D+  +  PVI
Sbjct: 111 MVHHCQQREGLTRARLIGAAMATGEVVTFLDSHCECTRGWLEPLLARIAEDKTNVVCPVI 170

Query: 205 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGL 263
           + I   T+EF  +   D    G F+W +++  + +P RE ++ K++ + P +SPT AGGL
Sbjct: 171 NIISDTTFEF--INGSDATQVGGFDWRLIFNWHVVPHRELQRIKFDRTSPVRSPTMAGGL 228

Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
           F++ + FF  LG YDPG  VWG EN ELSFK WMCGG++E+VPCS +GHV+R   P+ F 
Sbjct: 229 FSIHKEFFTRLGTYDPGFDVWGAENLELSFKTWMCGGTLEFVPCSHVGHVFRKRSPHRFP 288

Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
                V    +  N +R+ E W DE +K  +Y   P  +  D GDISE+
Sbjct: 289 PTTHNV----MQRNNRRLAEVWLDE-YKYLYYNAHPEILKTDPGDISER 332


>gi|417412000|gb|JAA52417.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Desmodus rotundus]
          Length = 624

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 152/370 (41%), Positives = 214/370 (57%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+   +         +    ++  N   S+ IS  R + PD R  
Sbjct: 108 PPQDP--NSPGADGKAFQKDKWTPLETQEKEEGYKKHCFNAFASDQISLQRALGPDTRPP 165

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  LP  SVI+VFHNE +S+L+RTV+S++  TPA  L+EIILVDD S++ 
Sbjct: 166 ECVNQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 224

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L + LE Y+Q+    VR++R   R+GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 225 YLKEPLEQYVQQLR-IVRVVRQERRKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 283

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D   +  P I  ID  T+EF + V +   H RG F+W + +    LP  E ++
Sbjct: 284 LLARITEDETAVVSPDIVTIDLNTFEFSKPVQKGRVHSRGNFDWSLTFGWETLPAHERQR 343

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  ++P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 344 RKDETDPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 403

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL-- 364
           CS +GHV+R+  P+ F K  +     +I  N  R+ E W DE +K  FY R   A  +  
Sbjct: 404 CSVVGHVFRTKSPHTFPKGTN-----VIARNQVRLAEVWMDE-YKEIFYRRNIQAAKMAR 457

Query: 365 --DMGDISEQ 372
               GDISE+
Sbjct: 458 EKSFGDISER 467


>gi|417403183|gb|JAA48410.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
           rotundus]
          Length = 599

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 152/370 (41%), Positives = 214/370 (57%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+   +         +    ++  N   S+ IS  R + PD R  
Sbjct: 98  PPQDP--NSPGADGKAFQKDKWTPLETQEKEEGYKKHCFNAFASDQISLQRALGPDTRPP 155

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  LP  SVI+VFHNE +S+L+RTV+S++  TPA  L+EIILVDD S++ 
Sbjct: 156 ECVNQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 214

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L + LE Y+Q+    VR++R   R+GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 215 YLKEPLEQYVQQLR-IVRVVRQERRKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 273

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D   +  P I  ID  T+EF + V +   H RG F+W + +    LP  E ++
Sbjct: 274 LLARITEDETAVVSPDIVTIDLNTFEFSKPVQKGRVHSRGNFDWSLTFGWETLPAHERQR 333

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  ++P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 334 RKDETDPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 393

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL-- 364
           CS +GHV+R+  P+ F K  +     +I  N  R+ E W DE +K  FY R   A  +  
Sbjct: 394 CSVVGHVFRTKSPHTFPKGTN-----VIARNQVRLAEVWMDE-YKEIFYRRNIQAAKMAR 447

Query: 365 --DMGDISEQ 372
               GDISE+
Sbjct: 448 EKSFGDISER 457


>gi|328713087|ref|XP_001951943.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 1 [Acyrthosiphon pisum]
          Length = 674

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 154/360 (42%), Positives = 205/360 (56%), Gaps = 28/360 (7%)

Query: 25  GEGGKAYHLPEAYRA----AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           GE GK   LP    A      D        N   S+ IS  RT+PD R E CK     LD
Sbjct: 131 GEMGKPVVLPANLTADVKKLVDEGWKNNAFNQYASDLISLHRTLPDPRDEWCKKPGRYLD 190

Query: 81  -LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 139
            LP+ SVI+ FHNE +S L+RTVHSI+ R+P   + EIILVDDFS    L  +LE+Y + 
Sbjct: 191 NLPQTSVIVCFHNEAWSVLLRTVHSILDRSPEHLIREIILVDDFSDMPHLKTQLEEYSEN 250

Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
           +  K++++R  +REGLIR R  GA+ +   V+ +LD+HCE    WL PLL  I  +   +
Sbjct: 251 Y-PKIKIVRAKKREGLIRARLMGARYASAPVLTYLDSHCECTEGWLEPLLDRIAREASTV 309

Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPEREAKKRKYNSE 252
             PVID ID  T EF        HYR       G F+W + +  + +P++E K+ K  +E
Sbjct: 310 VCPVIDVIDDSTLEF--------HYRDAGGVNVGGFDWNLQFNWHVVPDKEKKRHKNAAE 361

Query: 253 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGH 312
           P  SPT AGGLFA+D+ FF  LG YD G  +WGGEN ELSFK WMCGG++E VPCS +GH
Sbjct: 362 PVWSPTMAGGLFAIDKKFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGH 421

Query: 313 VYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           ++R   PY +     R    ++  N  R+ E W D+  K Y+Y R    +  D GDI+ +
Sbjct: 422 IFRKRSPYKW-----RTGVNVLKKNSIRLAEVWMDDYAK-YYYERIGNDLG-DYGDITSR 474


>gi|328785249|ref|XP_393950.3| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like [Apis mellifera]
          Length = 635

 Score =  263 bits (673), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 158/360 (43%), Positives = 207/360 (57%), Gaps = 29/360 (8%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
           G L  P EP    PGE G+   LP     E  +   D  L     N   S+ IS  RT+P
Sbjct: 83  GVLVAPREPDASAPGEMGRPVILPTNLTAETKKLVDDGWLNN-AFNQYVSDLISVHRTLP 141

Query: 66  DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
           D R   CK    Y  DLP  +VI+ FHNE +S L+RTVHS++ R+P   ++EIILVDDFS
Sbjct: 142 DPRDPWCKEPGRYLTDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPDHLIQEIILVDDFS 201

Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
               L ++LEDY+  +  KV++IR  +REGLIR R  GA  ++  V+ +LD+HCE    W
Sbjct: 202 DMPHLQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGW 260

Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 237
           L PLL  I  +   +  PVID ID  T E+        H+R       G F+W + +  +
Sbjct: 261 LEPLLDRIARNPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 312

Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
            +PERE K+ K  +EP  SPT AGGLF++DRAFF  LG YD G  +WGGEN ELSFK WM
Sbjct: 313 AVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFERLGTYDSGFDIWGGENLELSFKTWM 372

Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           CGG++E VPCS +GH++R   PY +     R    ++  N  R+ E W DE  K Y+Y R
Sbjct: 373 CGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKRNSIRLSEVWLDEYAK-YYYQR 426


>gi|328792011|ref|XP_624873.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
           [Apis mellifera]
          Length = 637

 Score =  263 bits (673), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 145/353 (41%), Positives = 208/353 (58%), Gaps = 16/353 (4%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           ++G  E G   +L +  +   D     Y  N+  S++I   R +PD R + C+   Y   
Sbjct: 109 EQGLDELGMIKNLDDQRKR--DEGYKNYSFNILVSDNIGLHRELPDTRHKLCELQKYSSK 166

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QR 139
           L  AS+++ F+NE + +L+R++HSII RTP   L EIILV+D+S    L +K++ YI   
Sbjct: 167 LSNASIVICFYNEHYMTLLRSLHSIIDRTPTNLLHEIILVNDWSDSKILHEKIKIYIANN 226

Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
           FNGKV+  +  +REGLIR R  GA+++ GE+++FLD+H EV   W+ PLL+ I   + I 
Sbjct: 227 FNGKVKYFKTEKREGLIRARIFGARKATGEILIFLDSHIEVNRQWIEPLLSRIVYSKTIT 286

Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTH 259
            +PVID I+  T++    Y      RG F WG+ +K + +P       +   +P KSPT 
Sbjct: 287 AMPVIDIINPDTFQ----YTGSPLVRGGFNWGLHFKWDNVPIGTFVHDEDFVKPIKSPTM 342

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IWMCGGSIE +PCSR+GHV+R   P
Sbjct: 343 AGGLFAMNREYFTKLGEYDAGMDIWGGENLEISFRIWMCGGSIELIPCSRVGHVFRKRRP 402

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           Y      D      +  N  RV   W DE +K YF         +D GDI+E+
Sbjct: 403 YGAYDQHDT-----MLKNSLRVAHVWLDE-YKDYFLQN---IKKIDYGDITER 446


>gi|427789065|gb|JAA59984.1| Putative polypeptide n-acetylgalactosaminyltransferase
           [Rhipicephalus pulchellus]
          Length = 626

 Score =  263 bits (673), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 144/352 (40%), Positives = 205/352 (58%), Gaps = 11/352 (3%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL-DLPK 83
           G+GG    L  A +   +      G N    + +  +RT+ D R   C+  +Y + +LP 
Sbjct: 102 GKGGAGVTLTGAEKEKANKEFSRAGFNAYVCDRLPLNRTLGDRRHRSCRNAEYDVENLPT 161

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL-DQKLEDYIQRF-- 140
           ASV+++F +E FS+L+RTV+S+I RTP + L EIILVDD+S   ++ + +LE +I+R   
Sbjct: 162 ASVVIIFTDELFSALLRTVYSVINRTPHRLLREIILVDDYSQIDEMANGRLERFIRRHFR 221

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
            G V+LI   +REGLIR R  GA+ + G+V+VFLD+HCE   +WL P++  I  DR  + 
Sbjct: 222 PGFVKLITLPKREGLIRARLTGARAASGDVLVFLDSHCEATDHWLEPMVELIKKDRTTVV 281

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P+ID ID +T ++      D +  G F W   +     PE   K RK  ++P +SPT A
Sbjct: 282 CPIIDVIDDKTLQYMGT-SSDFYQIGGFNWKGEFIWINTPEAWRKARKSKADPMRSPTMA 340

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+DR +F E G YD  +  WGGEN E+SF+IWMCGGS+   PCS +GH++R + PY
Sbjct: 341 GGLFAIDRKYFWESGSYDSEMEGWGGENLEMSFRIWMCGGSLVIAPCSHVGHIFRDYHPY 400

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            F    D         N  R+ E W D  +K YFY   P    +  GDISE+
Sbjct: 401 KFPSNKD-----THGINTARLAEVWMD-NYKYYFYQNRPELRKISFGDISER 446


>gi|260841393|ref|XP_002613900.1| hypothetical protein BRAFLDRAFT_208719 [Branchiostoma floridae]
 gi|229299290|gb|EEN69909.1| hypothetical protein BRAFLDRAFT_208719 [Branchiostoma floridae]
          Length = 442

 Score =  263 bits (673), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 141/362 (38%), Positives = 216/362 (59%), Gaps = 14/362 (3%)

Query: 15  PPLEPYKEGPGEGGKAYHL----PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
           PP++P    PGEGG   +L    PE  R   +  L     N   S+ IS  R++PDLR  
Sbjct: 16  PPVDP--TAPGEGGHGVNLQPSTPEEKRLYKEG-LKNNSFNAWASSKISLHRSLPDLRHR 72

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
            CK   +   LP+ SVI++F+NE +S+L+RTVHS+++ +PA+ L E+ILVDD S+   L 
Sbjct: 73  LCKQKQFFRPLPQTSVIIIFYNEAWSTLLRTVHSVLEASPAELLREVILVDDCSTFDHLK 132

Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
             LE Y+     +VRL+R+ +R+GLIR R  GA  +RGEV+ FLD+HCE    WL P L 
Sbjct: 133 APLETYLSTLP-QVRLVRSPKRQGLIRARLLGALHARGEVLTFLDSHCECMHGWLEPQLE 191

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
            I  +   + + V+D I + T+++  +        GI    + +    +PE E +++K  
Sbjct: 192 TIARNYTTVPISVLDNILHDTFQYTFMDLQSTQMGGINFKELTFIWEPIPEHERRRQKSP 251

Query: 251 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 310
            +P +SPT AGG+F++++ +F  LG YD G+ VWGGEN E+SF+IW CGG+I  +PCS +
Sbjct: 252 VDPIRSPTMAGGIFSINKKYFEYLGAYDTGMEVWGGENIEMSFRIWQCGGTIVVLPCSHV 311

Query: 311 GHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
           GHV+R   PY+ G    +     + +N +R+ E W D+ +K  +Y + P     DMGD++
Sbjct: 312 GHVFRPTSPYSTGDAWKK-----LVHNNRRMAEVWMDD-YKEIYYRKHPEYRKYDMGDVT 365

Query: 371 EQ 372
           ++
Sbjct: 366 QR 367


>gi|395519600|ref|XP_003763931.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
           [Sarcophilus harrisii]
          Length = 945

 Score =  263 bits (673), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 147/365 (40%), Positives = 217/365 (59%), Gaps = 13/365 (3%)

Query: 12  NLEPPLEPYK-EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 70
           NL+  L P   + PG+ G    +P            E   N+  S+ I  DR I D R  
Sbjct: 430 NLDVTLSPRNPKAPGQFGNPVVVPFGKEKEVKRRWKEGNFNVYLSDLIPLDRAIDDTRPS 489

Query: 71  ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
            C       +LP  S+I+ F +E +S+L+R+VHS++ R+P   ++EI+LVDDFS+K  L 
Sbjct: 490 GCADQLVHNNLPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKGYLK 549

Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
            +L+ Y+ +F  KVR++   ER GLIR R  GA+ + G+V+ FLD+H E  + WL PLL 
Sbjct: 550 DQLDKYMSQF-PKVRVLHLKERHGLIRARLAGAEIATGDVLTFLDSHVECNVGWLEPLLE 608

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 250
            +Y ++K +  PVI+ I+ +   + +V   D+  RGIF W M +   ++P    K+ K  
Sbjct: 609 RVYLNKKKVACPVIEIINDKDLSYMTV---DNFQRGIFVWPMNFSWKKIPPEIIKQNKIK 665

Query: 251 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
            ++  + P  AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PCSR
Sbjct: 666 ETDVIRCPVMAGGLFSIDKKYFFELGTYDPGLEVWGGENMELSFKVWMCGGEIEIIPCSR 725

Query: 310 IGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR--EPLAMFLDMG 367
           +GH++R   PY+F +  +R+K   I  N  RV E W DE +K  FY      L   L++G
Sbjct: 726 VGHIFRKDNPYSFPE--NRIK--TIERNLIRVAEVWLDE-YKELFYGHGYHLLDQSLNVG 780

Query: 368 DISEQ 372
           ++++Q
Sbjct: 781 NLTQQ 785


>gi|431904511|gb|ELK09894.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
           1 [Pteropus alecto]
          Length = 557

 Score =  263 bits (672), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 141/345 (40%), Positives = 198/345 (57%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY AA     GE     +  N   S+ +S DR I D R   C    Y +DLP  S ++ 
Sbjct: 71  KAYLAAKQLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYSCPSVSYSVDLPATSFVIT 130

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTP   ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPPNLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
             REGLIR+R RGA  +   ++ FLD+HCEV   WL P+L  +  D   +  P+ID I  
Sbjct: 186 DRREGLIRSRVRGADVASAAILTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 245

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   + P ++P  AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKISRTDPTRPIRTPVIAGGIFVIDKS 302

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400


>gi|291386971|ref|XP_002709979.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
           [Oryctolagus cuniculus]
          Length = 551

 Score =  263 bits (672), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 139/332 (41%), Positives = 191/332 (57%), Gaps = 17/332 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     +  N   S  I  +R +PD R   C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 68  VGDDPYKLHAFNQRESERIPSNRVVPDTRHNRCALLVYCKDLPPTSIIITFHNEARSTLL 127

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RTV SI+ RTP   ++EIILVDDFSS  D   +L         KV+ +RN ER+GL+R+R
Sbjct: 128 RTVRSILNRTPMHLIQEIILVDDFSSDPDDCNQLIKL-----PKVKCLRNNERQGLVRSR 182

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 183 IRGADIAQGATLTFLDSHCEVNKDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---IE 239

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + +   +L   +  +R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 240 SASELRGGFDWSLHFHWEQLSPEQKARRLDPTEPIRTPVIAGGLFVIDKAWFDYLGKYDT 299

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMC GS+E +PCSR+GHV+R   PY F        G   TY  N
Sbjct: 300 DMDIWGGENFEISFRVWMCRGSLEIIPCSRVGHVFRKKHPYAFP------NGNTNTYIKN 353

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
            KR  E W D+ +K Y+Y   P A+    G+I
Sbjct: 354 TKRTAEVWMDD-YKQYYYAARPFALERPFGNI 384


>gi|426233584|ref|XP_004010796.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1 [Ovis
           aries]
          Length = 557

 Score =  263 bits (672), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 144/345 (41%), Positives = 198/345 (57%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY AA     GE     +  N   S+ +S DR I D R   C    Y  DLP  SVI+ 
Sbjct: 71  KAYLAAKQLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSMSYSSDLPATSVIIT 130

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
             REGLIR+R RGA  +      FLD+HCEV   WL P+L  +  D   +  P+ID I  
Sbjct: 186 DRREGLIRSRVRGADVAAAAFFTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 245

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIARTDPTKPIRTPVIAGGIFVIDKS 302

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400


>gi|363730612|ref|XP_419065.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Gallus
           gallus]
          Length = 590

 Score =  263 bits (672), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 155/365 (42%), Positives = 217/365 (59%), Gaps = 24/365 (6%)

Query: 14  EPPLEPYKEGPGEGGKAYHL--PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
           +P LEP     GE G+A  L    A +   + S+  + +N+  S+ IS  R +P+     
Sbjct: 74  KPDLEP--GALGELGRAVRLELSPAEKRLQEESIRRHQINIYLSDRISLHRRLPERWHPL 131

Query: 72  CK--YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
           CK   +DY   LPK SV++ F+NE +S+L+RTVHS+++ +P   LEE+ILVDD+S K  L
Sbjct: 132 CKGKKYDY-YSLPKTSVVIAFYNEAWSTLLRTVHSVLETSPDILLEEVILVDDYSDKDHL 190

Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
            + LE+Y+     KVRLIR  +REGL+R R  GA  +RG+++ FLD HCE    WL PLL
Sbjct: 191 KEPLENYVAGLR-KVRLIRANKREGLVRARLLGASIARGDILTFLDCHCECHEGWLEPLL 249

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
             I  +   +  PVID ID+ T+E+  +  EP     G F+W +++  +  PERE K+RK
Sbjct: 250 ERIAEEESAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDWRLVFTWHTTPEREQKRRK 306

Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
              +  +SPT AGGLF++ + +F  LG YD G+ VWGGEN E SF+IW CGGS+E  PCS
Sbjct: 307 SKIDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCS 366

Query: 309 RIGHVYRSFMPYNFGK-LADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG 367
            +GHV+    PY+  K LA+ V          R  E W DE +K  +Y R P A     G
Sbjct: 367 HVGHVFPKQAPYSRSKALANSV----------RAAEVWMDE-YKELYYHRNPHARLEPYG 415

Query: 368 DISEQ 372
           D+SE+
Sbjct: 416 DVSER 420


>gi|427789289|gb|JAA60096.1| Putative polypeptide n-acetylgalactosaminyltransferase
           [Rhipicephalus pulchellus]
          Length = 526

 Score =  263 bits (672), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 144/352 (40%), Positives = 205/352 (58%), Gaps = 11/352 (3%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL-DLPK 83
           G+GG    L  A +   +      G N    + +  +RT+ D R   C+  +Y + +LP 
Sbjct: 102 GKGGAGVTLTGAEKEKANKEFSRAGFNAYVCDRLPLNRTLGDRRHRSCRNAEYDVENLPT 161

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL-DQKLEDYIQRF-- 140
           ASV+++F +E FS+L+RTV+S+I RTP + L EIILVDD+S   ++ + +LE +I+R   
Sbjct: 162 ASVVIIFTDELFSALLRTVYSVINRTPHRLLREIILVDDYSQIDEMANGRLERFIRRHFR 221

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
            G V+LI   +REGLIR R  GA+ + G+V+VFLD+HCE   +WL P++  I  DR  + 
Sbjct: 222 PGFVKLITLPKREGLIRARLTGARAASGDVLVFLDSHCEATDHWLEPMVELIKKDRTTVV 281

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P+ID ID +T ++      D +  G F W   +     PE   K RK  ++P +SPT A
Sbjct: 282 CPIIDVIDDKTLQYMGT-SSDFYQIGGFNWKGEFIWINTPEAWRKARKSKADPMRSPTMA 340

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+DR +F E G YD  +  WGGEN E+SF+IWMCGGS+   PCS +GH++R + PY
Sbjct: 341 GGLFAIDRKYFWESGSYDSEMEGWGGENLEMSFRIWMCGGSLVIAPCSHVGHIFRDYHPY 400

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            F    D         N  R+ E W D  +K YFY   P    +  GDISE+
Sbjct: 401 KFPSNKD-----THGINTARLAEVWMD-NYKYYFYQNRPELRKISFGDISER 446


>gi|195402751|ref|XP_002059968.1| GJ14949 [Drosophila virilis]
 gi|194140834|gb|EDW57305.1| GJ14949 [Drosophila virilis]
          Length = 666

 Score =  263 bits (672), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 139/345 (40%), Positives = 199/345 (57%), Gaps = 16/345 (4%)

Query: 7   DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPD 66
           D  +   EP     K+G G  G+   +P   R            N+  S+ I  +RT+ D
Sbjct: 80  DYNINQFEP-----KQGEGADGRPVVIPPRDRFRMQRFFKLNSFNILASDRIPLNRTLKD 134

Query: 67  LRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
            R  EC+   Y   LP  SVI+VFHNE +S L+RT+ S+I R+P Q L+EIILVDD S +
Sbjct: 135 YRTNECRDKRYAHGLPSTSVIIVFHNEAWSVLLRTITSVINRSPRQLLKEIILVDDASDR 194

Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
           + L ++LE YI+  N   RL R  ER GL+  R  GA+ +RG+V+ FLDAHCE    WL 
Sbjct: 195 SFLKRQLEAYIKVLNVPTRLYRMKERSGLVPARLMGAQHARGDVLTFLDAHCECSRGWLE 254

Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK--ENELPEREA 244
           PLLA I   R+++  PVID I    + +   +E  +H+ G F W + ++   ++   + +
Sbjct: 255 PLLARIKESREVVICPVIDIISDDNFSYTKTFE--NHW-GAFNWQLSFRWFSSDRKRQTS 311

Query: 245 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
            K K ++ P  +P  AGGLFA+DR +F E+G YD  + +WGGEN E+SF+IW CGG IE 
Sbjct: 312 VKPKDSTAPIATPGMAGGLFAIDRKYFYEMGAYDSEMRIWGGENVEMSFRIWQCGGRIEI 371

Query: 305 VPCSRIGHVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDE 348
            PCS +GH++RS  PY F G +++     ++T N  R    W D+
Sbjct: 372 SPCSHVGHIFRSSTPYTFPGGMSE-----VLTANLARAATVWMDD 411


>gi|22760242|dbj|BAC11118.1| unnamed protein product [Homo sapiens]
          Length = 622

 Score =  263 bits (672), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+   +         +    ++  N   S+ IS  R++ PD R  
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  L   SVI+VFHNE +S+L+RTV+S++  TPA  L+EIILVDD S++ 
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L +KLE Y+++    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D+ ++  P I  ID  T+EF + V     H RG F+W + +    LP  E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL-- 364
           CS +GHV+R+  P+ F K        +I  N  R+ E W D  +K  FY R   A  +  
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMTQ 455

Query: 365 --DMGDISEQ 372
               GDISE+
Sbjct: 456 EKSFGDISER 465


>gi|345497732|ref|XP_001601595.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Nasonia vitripennis]
          Length = 610

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 136/307 (44%), Positives = 194/307 (63%), Gaps = 11/307 (3%)

Query: 51  NMETSNHISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
           N+  S+ I  +RT+PD+R ++C  +Y +   DLP  SVI+VFHNE +S+L+RTVHS+I R
Sbjct: 126 NLLASDRIPLNRTLPDVRKKKCITRYANLG-DLPSTSVIIVFHNEAWSTLLRTVHSVINR 184

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
           +P + LEEIILVDD S +  L + L++Y+ + N   R++R+ +R GL+  R  GA E++G
Sbjct: 185 SPRKLLEEIILVDDNSDRDFLRKPLDEYVAQLNVPTRVLRSDKRVGLVNARLMGANEAKG 244

Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
           EV+ FLDAHCE    WL PLL  I  +R  +  PVID I+  T+ +   +E   H+ G F
Sbjct: 245 EVLTFLDAHCECTAGWLEPLLEAISKNRTRVVSPVIDIINDDTFSYTRSFE--LHW-GAF 301

Query: 229 EWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
            W + ++   L     ++R+ N  +P+K+P  AGGLF+MDR +F ELG YD  + +WGGE
Sbjct: 302 NWDLHFRWLMLNGALLRERRENIVDPFKTPAMAGGLFSMDREYFFELGSYDEHMRIWGGE 361

Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
           N ELSF++W CGGS+E  PCS +GH++R   PY F    D +    +  N  RV   W D
Sbjct: 362 NLELSFRVWQCGGSVEIAPCSHVGHIFRKSSPYTFPGGVDEI----LYGNLARVALVWMD 417

Query: 348 EKHKAYF 354
           E  K YF
Sbjct: 418 EWGKFYF 424


>gi|449493914|ref|XP_004175359.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 12 [Taeniopygia
           guttata]
          Length = 594

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 153/365 (41%), Positives = 219/365 (60%), Gaps = 24/365 (6%)

Query: 14  EPPLEPYKEGPGEGGKAYHL--PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
           +P L+P     GE G+A  L    A +   + S+  + +N+  S+ IS  R +P+     
Sbjct: 78  KPALDP--GALGELGRAVRLELSPAEKRRQEESIRRHQINIYLSDRISLHRRLPERWHPL 135

Query: 72  C--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
           C  K +DY  +LPK SV++ F+NE +S+L+RTVHS+++ +P   LEEIILVDD+S K  L
Sbjct: 136 CREKKYDY-YNLPKTSVVIAFYNEAWSTLLRTVHSVLETSPDILLEEIILVDDYSDKEHL 194

Query: 130 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 189
            + LE+Y+     KVRLIR  +REGL+R R  GA  ++G+++ FLD HCE    WL PLL
Sbjct: 195 KETLENYVAGLR-KVRLIRANKREGLVRARLLGASVAKGDILTFLDCHCECHEGWLEPLL 253

Query: 190 APIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
           A I  +   +  PVID ID+ T+E+  +  EP     G F+  +++  +  PERE K+RK
Sbjct: 254 ARIAEEETAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDXRLVFTWHSTPEREQKRRK 310

Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
             ++  +SPT AGGLF++ + +F  LG YD G+ VWGGEN E SF+IW CGGS+E  PCS
Sbjct: 311 SKTDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCS 370

Query: 309 RIGHVYRSFMPYNFGK-LADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG 367
            +GHV+    PY+  K LA+ V          R  E W DE +K  +Y R P A     G
Sbjct: 371 HVGHVFPKQAPYSRAKALANSV----------RAAEVWMDE-YKQLYYHRNPHARLEPYG 419

Query: 368 DISEQ 372
           D++E+
Sbjct: 420 DVTER 424


>gi|158289989|ref|XP_311577.4| AGAP010367-PA [Anopheles gambiae str. PEST]
 gi|157018424|gb|EAA07231.4| AGAP010367-PA [Anopheles gambiae str. PEST]
          Length = 587

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 152/352 (43%), Positives = 207/352 (58%), Gaps = 29/352 (8%)

Query: 23  GPGEGGKAYHLP--EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           GPGE GK   L   EA          + G N   S+ IS +R+I DLR            
Sbjct: 75  GPGEQGKPATLSPEEATSELRKELYYKNGFNALLSDKISINRSIADLR------------ 122

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
               SV++ F+ E +S+L+RT++S++ R+P   L+EII+VDD S+K  L  KLEDY+++ 
Sbjct: 123 --HPSVVVPFYEEHWSTLLRTIYSVLNRSPPHLLKEIIIVDDGSTKEFLHNKLEDYVKQN 180

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             KV+L+R  ER GLI+ R  GAK + G+V++FLD+H E G NWLPPLL PI  + K   
Sbjct: 181 LPKVKLVRQPERTGLIKARLAGAKIASGDVLIFLDSHTEAGYNWLPPLLEPIAENPKTCV 240

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P+ID ID QT++   V+  D   RG+F+W   YK   +   +   R   +EP+ SP  A
Sbjct: 241 CPLIDVIDDQTFD---VHPQDEGGRGLFDWTFHYKRVVIKNED---RISPTEPFPSPVMA 294

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+   FF ELGGYD  L +WG E +E+SFKIW CGG +   PCSR GH+YR++ P+
Sbjct: 295 GGLFAIGADFFWELGGYDEELDIWGAEQYEISFKIWQCGGRMLDAPCSRFGHIYRTYSPF 354

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMF-LDMGDISE 371
              +  D      IT N+KRV E W DE +K Y Y R+P      D GD+S+
Sbjct: 355 PNSRKYD-----FITRNHKRVAEIWMDE-YKQYIYDRDPERYAKTDAGDMSK 400


>gi|1934912|emb|CAA69875.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
           sapiens]
          Length = 578

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 155/376 (41%), Positives = 212/376 (56%), Gaps = 28/376 (7%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHI 58
           RP++K        +PP +      GE GKA    L E      +  +  Y +N+  S+ I
Sbjct: 61  RPLYK--------KPPAD--SRALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRI 110

Query: 59  SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           S  R I D RM ECK   +    LP  SVI+ F+NE +S+L+RT+HS+++ +PA  L+EI
Sbjct: 111 SLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEI 170

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
           ILVDD S +  L  +LE YI   + +VRLIR  +REGL+R R  GA  + G+V+ FL  H
Sbjct: 171 ILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLYCH 229

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKE 236
           CE    WL PLL  I      +  PVID ID+ T+EF   + EP     G F+W + ++ 
Sbjct: 230 CECNSGWLEPLLERIGRYETAVVCPVIDTIDWNTFEFYMQIGEP---MIGGFDWRLTFQW 286

Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
           + +P++E  +R    +P +SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W
Sbjct: 287 HSVPKQERDRRISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVW 346

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYT 356
            CGG +E  PCS +GHV+    PY           P    N  R  E W DE +K +FY 
Sbjct: 347 QCGGKLEIHPCSHVGHVFPKRAPY---------ARPNFLQNTARAAEVWMDE-YKEHFYN 396

Query: 357 REPLAMFLDMGDISEQ 372
           R P A     GDISE+
Sbjct: 397 RNPPARKEAYGDISER 412


>gi|5834600|emb|CAA69876.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
           sapiens]
 gi|300470331|dbj|BAJ10977.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
           N-acetylgalactosaminyltransferase 6 [Homo sapiens]
          Length = 622

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+   +         +    ++  N   S+ IS  R++ PD R  
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  L   SVI+VFHNE +S+L+RTV+S++  TPA  L+EIILVDD S++ 
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L +KLE Y+++    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D+ ++  P I  ID  T+EF + V     H RG F+W + +    LP  E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSIPKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
           CS +GHV+R+  P+ F K        +I  N  R+ E W D  +K  FY R      +A 
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMAQ 455

Query: 363 FLDMGDISEQ 372
               GDISE+
Sbjct: 456 EKSFGDISER 465


>gi|189240187|ref|XP_975207.2| PREDICTED: similar to AGAP008229-PA [Tribolium castaneum]
          Length = 575

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 133/312 (42%), Positives = 195/312 (62%), Gaps = 11/312 (3%)

Query: 51  NMETSNHISFDRTIPDLRMEECK--YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
           N+  S+ I  +R++PD R ++C   + DYP   PK S+I+VFHNE +S+L+RTV S+I R
Sbjct: 91  NLLASDRIPLNRSLPDFRRKKCATLFGDYPT-YPKTSIIIVFHNEAWSTLLRTVWSVINR 149

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
           +P + LEEIILVDD S +  L + L+DY+       +++R+  R GLI+ R +GA  ++G
Sbjct: 150 SPPELLEEIILVDDSSERKFLKKPLDDYVANLPVPTKVLRSQARIGLIKARLKGALVAKG 209

Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
            V+ FLDAHCE    WL  LL+ I  DR  +  PVID I+  T+ +   +E   H+ G F
Sbjct: 210 PVLTFLDAHCECTTGWLEALLSVIKQDRTAVVCPVIDIINDDTFAYVKSFE--LHW-GAF 266

Query: 229 EWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
            W + ++   L  RE K RK + ++P+ +PT AGGLFA+DR +F E+G YD G+ +WGGE
Sbjct: 267 NWNLQFRWFTLGGRELKLRKNDATQPFNTPTMAGGLFAIDREYFFEMGAYDDGMNIWGGE 326

Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
           N E+SF+IW CGG ++  PCSR+GH++R   PY+F    ++     +  N  RV   W D
Sbjct: 327 NLEMSFRIWQCGGKVQIAPCSRVGHLFRKSSPYSFPGGINKT----LFSNLARVARVWMD 382

Query: 348 EKHKAYFYTREP 359
           +  + YF   EP
Sbjct: 383 DWARFYFKFNEP 394


>gi|344266859|ref|XP_003405496.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
           [Loxodonta africana]
          Length = 622

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 150/371 (40%), Positives = 218/371 (58%), Gaps = 24/371 (6%)

Query: 15  PPLEPYKEGPGEGGKAYH----LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRM 69
           PP +P   GPG  GKA+      P+  +   +    ++  N   S+ IS  R + PD R 
Sbjct: 106 PPQDP--NGPGADGKAFQKDKWTPQETQEK-EEGYKKHCFNAFASDRISLQRALGPDTRP 162

Query: 70  EEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
            EC   K+   P  LP  SVI+VFHNE +S+L+RTV+S++   PA +L+EIILVDD S++
Sbjct: 163 PECLDQKFRRCP-QLPTTSVIIVFHNEAWSTLLRTVYSVLHTAPAIFLKEIILVDDASTE 221

Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
             L ++L+ Y+++    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL 
Sbjct: 222 EYLKEQLDQYVKQLQ-IVRVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLE 280

Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
           PLLA I  D  ++  P I  ID  T+EF + V     H RG F+W + +    +P  E +
Sbjct: 281 PLLARIAEDETVVVSPDIITIDLNTFEFSKPVQRGRVHSRGNFDWSLTFGWETVPLHEKQ 340

Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
           +RK  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +
Sbjct: 341 RRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEII 400

Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL- 364
           PCS +GHV+R+  P+ F K  +     +I  N  R+ E W D+ +K  FY R   A  + 
Sbjct: 401 PCSVVGHVFRTKSPHTFPKGIN-----VIARNQVRLAEVWMDD-YKEIFYRRNLQAAKMA 454

Query: 365 ---DMGDISEQ 372
                GDISE+
Sbjct: 455 EEKSFGDISER 465


>gi|158300689|ref|XP_320549.4| AGAP011984-PA [Anopheles gambiae str. PEST]
 gi|157013282|gb|EAA00339.4| AGAP011984-PA [Anopheles gambiae str. PEST]
          Length = 585

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 149/357 (41%), Positives = 207/357 (57%), Gaps = 26/357 (7%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASL-GEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
           E  + GPGE G+ Y L      A +A L  E G +   S+ I+ +R+            +
Sbjct: 70  ESKRTGPGEHGRPYKLSSEQDIALNAKLFKENGYSAVVSDMIALNRS------------E 117

Query: 77  YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 136
           Y  +LP  SVI++F+NE +S+L+RTV+S++ R+P   L+EIILV+D S+K  L   L ++
Sbjct: 118 YLKELPTVSVIIIFYNEHWSALLRTVYSVLNRSPPALLKEIILVNDHSTKPFLWTPLREF 177

Query: 137 IQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           ++     KVRL+   ER GLI  R  GA+E+RG+V++ LD+H EV  NWLPPLL PI  D
Sbjct: 178 VESELAPKVRLVDLPERSGLIVARMAGAREARGDVLIVLDSHTEVNTNWLPPLLEPIAED 237

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYK 255
            +    P ID I + T+++RS    D   RG F+W   YK   L   +       ++P+ 
Sbjct: 238 YRTCVCPFIDVIAHDTFQYRS---QDEGKRGAFDWKFYYKRLPLLPGDLDD---PTKPFN 291

Query: 256 SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYR 315
           SP  AGGLFA+   FF ELGGYD GL +WGGE +ELSFKIW CGG +   PCSR+GHVYR
Sbjct: 292 SPVMAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRLVDAPCSRVGHVYR 351

Query: 316 SFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            + P+   +  +      +  N+KRV E W DE +  + Y R P     D GD+S Q
Sbjct: 352 GYAPFGNPRGVN-----FVVRNFKRVAEVWMDE-YSQFLYERNPQFAKTDPGDLSAQ 402


>gi|195467145|ref|XP_002076010.1| GK16099 [Drosophila willistoni]
 gi|194172095|gb|EDW86996.1| GK16099 [Drosophila willistoni]
          Length = 348

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 141/313 (45%), Positives = 195/313 (62%), Gaps = 13/313 (4%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + G GE G+  H+    +   D      G N   S+ IS +R++PD+R EECK   Y   
Sbjct: 36  RTGMGEHGEPSHIDAQEKELEDKIYRMNGFNGLLSDRISINRSVPDVRREECKTRKYLAK 95

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-R 139
           LP+ASVI +F+NE F++L+R+++S+I RTP + L++I+LVDD S    L Q+L+DY+   
Sbjct: 96  LPQASVIFIFYNEHFNTLLRSIYSVINRTPPELLKQIVLVDDGSDWEVLKQQLDDYVSLH 155

Query: 140 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
           F   V ++RN ER GLI  R  GAK + GEV+VF D+H EV  NWLPPLL PI  D KI 
Sbjct: 156 FPQLVHVVRNPERRGLIGARIAGAKVATGEVLVFFDSHIEVNYNWLPPLLEPIAIDSKIS 215

Query: 200 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKKRKYNSEPYKSPT 258
           T P++D I++ T+ +   ++     RG F+W   YK+   LPE    K    S PY++P 
Sbjct: 216 TCPIVDSIEHSTFAYSGGHQ--EGSRGGFDWRFYYKQLPVLPEDSLDK----SLPYRNPV 269

Query: 259 HAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFM 318
             GGLFA++  FF +LGGYD  L +WGGE +ELSFKIWMCGG +  VPCSR+ H++R  M
Sbjct: 270 MMGGLFAINTKFFWDLGGYDDELDIWGGEQYELSFKIWMCGGMLLDVPCSRVAHIFRGPM 329

Query: 319 -----PYNFGKLA 326
                P N+  +A
Sbjct: 330 DARPNPRNYNFVA 342


>gi|268580247|ref|XP_002645106.1| Hypothetical protein CBG16794 [Caenorhabditis briggsae]
          Length = 568

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 140/358 (39%), Positives = 215/358 (60%), Gaps = 13/358 (3%)

Query: 20  YKEGP-GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYP 78
           +K  P G  G    +P+  +   ++   E   N+  S  IS +RT+PD R + C+     
Sbjct: 64  FKYSPHGSNGDGVKIPDHLKNLEESRFSENNFNVVASEMISVNRTLPDYRSDACRISGGK 123

Query: 79  LD---LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
           ++   LP+AS+I+ FHNE +++++RT+HSI  R+P   +EEI+LVDD+S K  L   L+ 
Sbjct: 124 INTTELPRASIIITFHNEAWTTIIRTLHSISNRSPRHLIEEIVLVDDYSDKYWLKGPLDI 183

Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           Y+++F   V +    ER GLIR R  GAK ++G +++FLD+H EV   WL PL++ +  D
Sbjct: 184 YVRQFEIPVHVTHLPERSGLIRARLTGAKIAKGPILLFLDSHIEVSEGWLEPLISRVADD 243

Query: 196 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR-KYNSEPY 254
           R  +  P+ID I  + + F S    D    G F W + +K  ++   + ++     +EP 
Sbjct: 244 RTRIIAPIIDNISDEDFGF-STGRTD--LWGGFSWILSFKWFDMNGNDTQRLIAKKAEPI 300

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           ++PT AGGLFA++R +F E+G YD G+ VWGGEN E+SF+IWMCGGS+E  PCS +GHV+
Sbjct: 301 RTPTIAGGLFAINREYFYEMGAYDEGMEVWGGENVEISFRIWMCGGSMEIHPCSHVGHVF 360

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           R+  PY+F K  + V    I  N  R  E W DE +K +F+   P A  +++GD+ E+
Sbjct: 361 RTKTPYSFTKEVNFV----IRRNQARTAEVWMDE-YKEFFFKMVPSAQKMEIGDLQER 413


>gi|270011650|gb|EFA08098.1| hypothetical protein TcasGA2_TC005702 [Tribolium castaneum]
          Length = 607

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 133/312 (42%), Positives = 195/312 (62%), Gaps = 11/312 (3%)

Query: 51  NMETSNHISFDRTIPDLRMEECK--YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
           N+  S+ I  +R++PD R ++C   + DYP   PK S+I+VFHNE +S+L+RTV S+I R
Sbjct: 123 NLLASDRIPLNRSLPDFRRKKCATLFGDYP-TYPKTSIIIVFHNEAWSTLLRTVWSVINR 181

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
           +P + LEEIILVDD S +  L + L+DY+       +++R+  R GLI+ R +GA  ++G
Sbjct: 182 SPPELLEEIILVDDSSERKFLKKPLDDYVANLPVPTKVLRSQARIGLIKARLKGALVAKG 241

Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
            V+ FLDAHCE    WL  LL+ I  DR  +  PVID I+  T+ +   +E   H+ G F
Sbjct: 242 PVLTFLDAHCECTTGWLEALLSVIKQDRTAVVCPVIDIINDDTFAYVKSFE--LHW-GAF 298

Query: 229 EWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
            W + ++   L  RE K RK + ++P+ +PT AGGLFA+DR +F E+G YD G+ +WGGE
Sbjct: 299 NWNLQFRWFTLGGRELKLRKNDATQPFNTPTMAGGLFAIDREYFFEMGAYDDGMNIWGGE 358

Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
           N E+SF+IW CGG ++  PCSR+GH++R   PY+F    ++     +  N  RV   W D
Sbjct: 359 NLEMSFRIWQCGGKVQIAPCSRVGHLFRKSSPYSFPGGINKT----LFSNLARVARVWMD 414

Query: 348 EKHKAYFYTREP 359
           +  + YF   EP
Sbjct: 415 DWARFYFKFNEP 426


>gi|332030446|gb|EGI70134.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Acromyrmex
           echinatior]
          Length = 595

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 134/308 (43%), Positives = 197/308 (63%), Gaps = 13/308 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
           N+  S+ I  +R++PD+R ++C  +Y +   +LPK S+I+VFHNE +S+L+RTVHS+I R
Sbjct: 132 NLMASDKIPLNRSLPDVRKKKCISRYTNLG-NLPKTSIIIVFHNEAWSTLLRTVHSVINR 190

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
           +P + LEEIILVDD S +  L   L+DY++  +   R++R+ ER GLI+ R  GA +++G
Sbjct: 191 SPKELLEEIILVDDNSEREFLKNSLDDYVKNLSVSTRVLRSNERIGLIKARLLGANDAKG 250

Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
           EV+ FLDAHCE  + WL PLL  +  +   +  PVID I+  T+ +   +E   H+ G F
Sbjct: 251 EVLTFLDAHCECTIGWLEPLLEAVGKNATRIVAPVIDIINDNTFSYTRSFE--LHW-GAF 307

Query: 229 EWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
            W + ++   L  R  K+R+ N  EP+++P  AGGLF+M+R +F +LG YD  + +WGGE
Sbjct: 308 NWDLHFRWLTLNGRLLKERRDNIVEPFRTPAMAGGLFSMNRDYFFKLGSYDDQMRIWGGE 367

Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWF 346
           N ELSF+ W CGGSIE  PCS +GH++R   PY F G + D + G     N  RV   W 
Sbjct: 368 NLELSFRAWQCGGSIEIAPCSHVGHLFRKSSPYTFPGGVGDILYG-----NLARVALVWM 422

Query: 347 DEKHKAYF 354
           D+  + YF
Sbjct: 423 DQWAEFYF 430


>gi|345304811|ref|XP_001505904.2| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1-like
           [Ornithorhynchus anatinus]
          Length = 555

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 134/328 (40%), Positives = 197/328 (60%), Gaps = 17/328 (5%)

Query: 47  EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
           ++  N   S+ +S DR I D R   C    YP DLP  S+++ FHNE  S+L+RTV S++
Sbjct: 88  QHAFNQLESDKLSSDRAIRDTRHYRCTSAHYPSDLPVTSIVITFHNEARSTLLRTVKSVL 147

Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
            RTPA  + EIILVDDFS+  + D +L   I     KV+ + N +REGLIR+R RGA+ +
Sbjct: 148 NRTPANLVREIILVDDFSADPE-DCQLLTRIP----KVKCLHNNQREGLIRSRVRGAEVA 202

Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
             +++ FLD+HCEV   WL PLL  +  D   +  P+ID I    + + +        RG
Sbjct: 203 TADILTFLDSHCEVNSEWLQPLLQRVKEDYTRVVSPIIDVISLDNFAYLAA---SADLRG 259

Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
            F+W + +K  ++P  +   R   ++  ++P  AGG+F +D+++F  LG YD  + +WGG
Sbjct: 260 GFDWSLHFKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGG 319

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIET 344
           ENFELSF++WMCGGS+E VPCSR+GHV+R   PY+F       +G  +TY  N KR  E 
Sbjct: 320 ENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYDFP------EGNALTYIKNTKRAAEV 373

Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           W D+ +K Y+Y   P A+    G ++E+
Sbjct: 374 WMDD-YKQYYYEARPSAIGKAFGSVAER 400


>gi|291231066|ref|XP_002735481.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
            [Saccoglossus kowalevskii]
          Length = 2434

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 135/344 (39%), Positives = 207/344 (60%), Gaps = 19/344 (5%)

Query: 29   KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
            KAY + +     G  +      N   S+ +S+DR IPD R   CK  D+   LP+ SVI+
Sbjct: 1943 KAY-ISKTVVQTGQDAYARNKFNQVESDKLSYDRDIPDTRNPLCKKLDWKTALPQTSVII 2001

Query: 89   VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
             FHNE  S+L+RTV S++ R+P   ++EIILVDD+S  A+  ++LE        KV+++R
Sbjct: 2002 TFHNEARSTLLRTVVSVLNRSPTSIIKEIILVDDYSDNAEDGKELEKI-----PKVKVLR 2056

Query: 149  NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
            N +REGL+R+R RGA  + G ++ FLD+HCE   NW+ PL+  I  + K +  P+ID I+
Sbjct: 2057 NEKREGLMRSRVRGADYATGTILTFLDSHCECNQNWIEPLITKIQENNKAVVSPIIDVIN 2116

Query: 209  YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY---KSPTHAGGLFA 265
               +++ +        +G F+W +++K + +   E  KRK  S+P    ++P  AGGLFA
Sbjct: 2117 MDNFQYVAASA---DLKGGFDWNLVFKWDYMTPAERNKRK--SDPIAAIRTPMIAGGLFA 2171

Query: 266  MDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKL 325
            + +++F ELG YD  + VWGGEN E+SF++W CGG++E +PCSR+GHV+R   PY F   
Sbjct: 2172 ISKSWFEELGKYDMMMDVWGGENLEISFRVWQCGGTLEIIPCSRVGHVFRKQHPYTFPGG 2231

Query: 326  ADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
            +    G +   N +R  E W DE +K Y+Y+  P +  +  G+I
Sbjct: 2232 S----GNVFAKNTRRAAEVWMDE-YKKYYYSAVPSSKNIAFGNI 2270


>gi|345491789|ref|XP_001607575.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Nasonia vitripennis]
          Length = 566

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 137/333 (41%), Positives = 199/333 (59%), Gaps = 10/333 (3%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
           G+ G+A +L ++ +  G     +  +N+  SN I   R I D+R   CK   Y   LP  
Sbjct: 63  GDFGEAAYLSDSEKQNGSLVYSKRAVNVVLSNKIPLQRRIRDMRDPLCKSVTYDTKLPTT 122

Query: 85  SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGK 143
           SV+++FHNE +S L+RTV+S+++ +P ++L+EIILVDD S++ +L+  L  YI+ R   K
Sbjct: 123 SVVIIFHNEAWSVLLRTVYSVLQESPPKFLKEIILVDDNSNEEELEDILAYYIETRLPKK 182

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
           V+L+R  +R+GLIR R  GA+++ G+V+VFLDAHCEV   WL PLL  I +    + +PV
Sbjct: 183 VKLLRLPKRQGLIRARLAGAQQATGDVLVFLDAHCEVTKGWLSPLLHRIKARPNAVLIPV 242

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 263
           ID ID +T E++      H   G F+W   +    + +   +      +P  +PT AGGL
Sbjct: 243 IDVIDAKTLEYKLAARGSHMPIGGFKWTGDFTWINMEDSPKRTTASPIDPINTPTMAGGL 302

Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
           FA+DR +F  +G YD  +  WGGEN E+SF+IW CGGSIE VPCSR+GH++R F PY F 
Sbjct: 303 FAIDRKYFWVIGSYDELMDGWGGENLEMSFRIWQCGGSIEIVPCSRVGHIFRDFFPYEFP 362

Query: 324 KLADRVKGPLITY--NYKRVIETWFDEKHKAYF 354
              D       TY  N  R    W D+  + +F
Sbjct: 363 SSRD-------TYLINTARAAHVWMDDYKRLFF 388


>gi|410899503|ref|XP_003963236.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
           [Takifugu rubripes]
          Length = 618

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 149/370 (40%), Positives = 216/370 (58%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYH----LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRM 69
           PP +P    PG  GKA+      PE      D  +  +  N   S+ IS  R++  D R 
Sbjct: 100 PPQDP--GSPGADGKAFKKDQMSPEEETEKKDG-MTRHCFNQFASDRISLSRSLGEDTRP 156

Query: 70  EECKYWDYPL--DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
            EC    +P    LP  SVI+VFHNE +S+L+RTV+S++  +PA  L+EIILVDD S   
Sbjct: 157 RECVERKFPRCPALPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAVLLKEIILVDDASVAG 216

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L ++LE+++ +F   VR++R  ER+GLI  R  GA E++GEV+ FLDAHCE    WL P
Sbjct: 217 HLKEQLEEFVLQFK-IVRVLRQPERKGLITARLLGASEAQGEVLTFLDAHCECFHGWLEP 275

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY-RGIFEWGMLYKENELPEREAKK 246
           LLA I  +   +  P I  ID ++++F       H + RG F+W + +   ++PE   K 
Sbjct: 276 LLARIVEEPTAVVSPEITTIDLESFQFNKPAPSSHAFNRGNFDWSLTFGWEQIPEAARKL 335

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P K+PT AGGLF++ + +F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 336 RKDETCPVKTPTFAGGLFSILKTYFEHIGTYDDKMEIWGGENIEMSFRVWQCGGQLEIIP 395

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL-- 364
           CS +GHV+R+  P+ F K  +     +IT N  R+ E W D+ +K  FY R   A  +  
Sbjct: 396 CSVVGHVFRTKSPHTFPKGTE-----VITRNQVRLAEVWMDD-YKKIFYRRNKNAAKMAK 449

Query: 365 --DMGDISEQ 372
             + GDISE+
Sbjct: 450 ENNYGDISER 459


>gi|312082212|ref|XP_003143351.1| glycosyl transferase [Loa loa]
          Length = 580

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 146/318 (45%), Positives = 194/318 (61%), Gaps = 25/318 (7%)

Query: 10  LGNLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
           L N + P+  YK G    PGEGGKA  +     A  +  + + G N    N         
Sbjct: 97  LFNRDSPI--YKSGDEHQPGEGGKAVIIDRNKLAFSEKRIYDDGFNKNAFN--------- 145

Query: 66  DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
                +CK   Y  DLP  SVI+ FHNE +S L+RTVHS+++RTP   L EIILVDDFS 
Sbjct: 146 -----QCKTEKYANDLPNTSVIICFHNEAWSVLLRTVHSVLERTPENLLAEIILVDDFSD 200

Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
            A L   LE Y+++F  KVR++R  +REGLIR R +GA  S+G VI +LD+HCE    W+
Sbjct: 201 MAHLKASLEIYMRQF-PKVRILRLEKREGLIRARIKGAAISKGSVITYLDSHCECLEGWM 259

Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-GIFEWGMLYKENELPEREA 244
            PLL  I  + K +  PVID ID  T+E+   Y   +    G F+W + +  + +PE++ 
Sbjct: 260 EPLLDRIKKNPKTVVCPVIDVIDDNTFEYH--YSKAYFTNVGGFDWSLQFNWHAIPEKDR 317

Query: 245 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
           K R+ + +P KSPT AGGLF++DR FF +LG YDPGL +WGGEN ELSFK WMCGG +E 
Sbjct: 318 KGRR-DIDPVKSPTMAGGLFSIDRTFFEKLGSYDPGLDIWGGENLELSFKTWMCGGILEI 376

Query: 305 VPCSRIGHVYRSFMPYNF 322
           VPCS +GH++R   PY +
Sbjct: 377 VPCSHVGHIFRKRSPYKW 394


>gi|153792142|ref|NP_001093363.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 16 [Xenopus laevis]
 gi|148744516|gb|AAI42582.1| LOC100101309 protein [Xenopus laevis]
          Length = 563

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 136/343 (39%), Positives = 198/343 (57%), Gaps = 17/343 (4%)

Query: 32  HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFH 91
           +L   +  AG+    ++  N   S+ +S +R I D R   C    Y  DLP  SVI+ FH
Sbjct: 81  YLSSKFIKAGEDPYRQHAFNQLESDKLSSERPIRDTRHYRCTSVHYDNDLPSTSVIITFH 140

Query: 92  NEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTE 151
           NE  S+L+RT+ S++ R+P   ++EIILVDDFS+  D  Q L         KV+ +RN  
Sbjct: 141 NEARSTLLRTIKSVLIRSPGNLIQEIILVDDFSTDPDDCQLLTKI-----PKVKCLRNNR 195

Query: 152 REGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQT 211
           REGLIR+R RGA+ +   V+ FLD+HCEV   WL PLL  +  D   +  P+ID I    
Sbjct: 196 REGLIRSRVRGAELAAAPVLTFLDSHCEVNNEWLQPLLQRVKDDHTRVVSPIIDVISLDN 255

Query: 212 WEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFF 271
           + + +        RG F+W + +K  ++P  +   R   +   ++P  AGG+F +D+++F
Sbjct: 256 FAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSRTDPTSSIRTPVIAGGIFVIDKSWF 312

Query: 272 LELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG 331
            +LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PY F        G
Sbjct: 313 NQLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYEFP------DG 366

Query: 332 PLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
             +TY  N KR +E W DE +K Y+Y   P A+    G ++++
Sbjct: 367 NALTYIKNTKRTVEVWMDE-YKQYYYQARPSAIGKSYGSVADR 408


>gi|380016857|ref|XP_003692388.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like,
           partial [Apis florea]
          Length = 556

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 141/332 (42%), Positives = 199/332 (59%), Gaps = 14/332 (4%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           D     Y  N+  S++I   R +PD R + C+   Y   L  AS+++ F+NE + +L+R+
Sbjct: 47  DEGYKNYSFNILVSDNIGLHRELPDTRHKLCEIQKYSSKLSNASIVICFYNEHYMTLLRS 106

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRS 160
           +HSII RTP   L EIILV+D+S    L +K++ YI   FNGKV+  +  +REGLIR R 
Sbjct: 107 LHSIIDRTPTYLLHEIILVNDWSDSKILHEKIKIYIANNFNGKVKYFKTEKREGLIRARI 166

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
            GA+++ GE+++FLD+H EV   W+ PLL+ I   + I  +PVID I+  T++    Y  
Sbjct: 167 FGARKATGEILIFLDSHIEVNKQWIEPLLSRIVYSKTITAMPVIDIINPDTFQ----YTG 222

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F WG+ +K + +P       +   +P KSPT AGGLFAM+R +F +LG YD G
Sbjct: 223 SPLVRGGFNWGLHFKWDNVPIGTFVHDEDFVKPIKSPTMAGGLFAMNREYFTKLGEYDAG 282

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKR 340
           + +WGGEN E+SF+IWMCGGSIE +PCSR+GHV+R   PY      D      +  N  R
Sbjct: 283 MDIWGGENLEISFRIWMCGGSIELIPCSRVGHVFRKRRPYGAYDQHDT-----MLKNSLR 337

Query: 341 VIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           V   W DE +K YF         +D GDI+E+
Sbjct: 338 VAHVWLDE-YKDYFLQN---IKKIDYGDITER 365


>gi|52851353|dbj|BAD52069.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase [Mus musculus]
          Length = 550

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 139/336 (41%), Positives = 194/336 (57%), Gaps = 19/336 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R + C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
           RT+ S++ RTP   ++EIILVDDFS+        ED  Q     KV+ +R+ ER+GL+R+
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRHNERQGLVRS 182

Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 218
           R RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    
Sbjct: 183 RMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---I 239

Query: 219 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYD 278
           E     RG F+W + ++  +L   +   R   +EP ++P  AGGLF +D+A+F  LG YD
Sbjct: 240 ESASELRGGFDWSLHFQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYD 299

Query: 279 PGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY-- 336
             + +WGGENFE+SF++WMCGG +E +PCSR+GHV+R   PY F        G   TY  
Sbjct: 300 VDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIK 353

Query: 337 NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           N KR  E W DE +K Y+Y   P A+    G+I  +
Sbjct: 354 NTKRTAEVWMDE-YKQYYYAARPFALERPFGNIENR 388


>gi|195488539|ref|XP_002092358.1| GE11714 [Drosophila yakuba]
 gi|194178459|gb|EDW92070.1| GE11714 [Drosophila yakuba]
          Length = 601

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 146/365 (40%), Positives = 207/365 (56%), Gaps = 15/365 (4%)

Query: 17  LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
           L+  K G GE G A HL  A +  GDA   +  +N E S  +S++R++ D R   C    
Sbjct: 83  LQKQKAGLGEQGVAVHLSGAAKERGDAIYKKIALNEELSEQLSYNRSVGDHRNPLCAKQR 142

Query: 77  Y-PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
           +    LP ASV+++F NE +S L+RTVHS +     + L+EIILVDD S   +L  KL+ 
Sbjct: 143 FDAASLPTASVVIIFFNEPYSVLLRTVHSTLSTCNEKALKEIILVDDGSDNVELGAKLDY 202

Query: 136 YIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIY 193
           Y++     GKV ++R   R GLIR R  GA+ + G+V++FLDAHCE  + W  PLL  I 
Sbjct: 203 YVRTRIPAGKVTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQRIK 262

Query: 194 SDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE- 252
             R  + VP+ID ID   +++ +         G F+W   +    LPERE ++++   + 
Sbjct: 263 ESRTSVLVPIIDVIDANDFQYSTNGYKSFQVGG-FQWNGHFDWINLPEREKQRQRRECKH 321

Query: 253 -----PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 307
                P  SPT AGGLFA+DR +F E+G YD  +  WGGEN E+SF+IW CGG+IE +PC
Sbjct: 322 DREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETIPC 381

Query: 308 SRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMG 367
           SR+GH++R F PY F    DR    +   N  R+   W DE    +F  R  L    D+G
Sbjct: 382 SRVGHIFRDFHPYKFPN--DRDTHGI---NTARMALVWMDEYINIFFLNRPDLKFHADIG 436

Query: 368 DISEQ 372
           D++ +
Sbjct: 437 DVTHR 441


>gi|149050681|gb|EDM02854.1| rCG61782, isoform CRA_a [Rattus norvegicus]
          Length = 397

 Score =  262 bits (669), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 136/325 (41%), Positives = 192/325 (59%), Gaps = 17/325 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R + C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   ++EIILVDDFS+  +  ++L         KV+ +RN+ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSNDPEDCKQLIKL-----PKVKCLRNSERQGLVRSR 183

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 184 MRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---IE 240

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +   R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 241 SASELRGGFDWSLHFQWEQLSVEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDV 300

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGG +E +PCSR+GHV+R   PY F        G   TY  N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354

Query: 338 YKRVIETWFDEKHKAYFYTREPLAM 362
            KR  E W DE +K Y+Y   P A+
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFAL 378


>gi|260788889|ref|XP_002589481.1| hypothetical protein BRAFLDRAFT_125191 [Branchiostoma floridae]
 gi|229274659|gb|EEN45492.1| hypothetical protein BRAFLDRAFT_125191 [Branchiostoma floridae]
          Length = 488

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 142/344 (41%), Positives = 200/344 (58%), Gaps = 23/344 (6%)

Query: 28  GKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVI 87
           GKA  +P+      +        N+     I+ +RT+PD+RME CK   YP +LP+ SV+
Sbjct: 2   GKAVVIPKEKEKEKNEKFKINQFNLMACEMIALNRTLPDVRMEGCKSKTYPKELPRMSVV 61

Query: 88  LVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLI 147
           +VFHNE + +L+R+V+SII RTP  YLEEIILVDD S +                 V+L 
Sbjct: 62  IVFHNEAWCTLLRSVNSIINRTPRPYLEEIILVDDASERGV--------------PVKLE 107

Query: 148 RNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGI 207
           R  +R GLIR R RG+  ++G VI FLDAH E    W  PLL  I  DR  +  P+ID I
Sbjct: 108 RMGKRSGLIRARLRGSGAAKGPVITFLDAHIECTEGWAEPLLTRIAEDRTTVVCPIIDVI 167

Query: 208 DYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAM 266
              T+E+ +    D  Y G F W + ++   +P+RE  +R  + + P ++PT AGGLFA+
Sbjct: 168 SDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRGGDRTMPLRTPTMAGGLFAI 224

Query: 267 DRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLA 326
           D+++F E+G YD G+ +WGGEN E+SF+IW CGG++E V CS +GHV+R   PY F    
Sbjct: 225 DKSYFEEIGTYDSGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKATPYTFPGGT 284

Query: 327 DRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDIS 370
               G +I  N +R+ E W D   K +FY   P    +D GD++
Sbjct: 285 ----GQIINKNNRRLAEVWMD-NFKDFFYIISPGVTKVDYGDVT 323


>gi|73996388|ref|XP_850161.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           2 [Canis lupus familiaris]
          Length = 622

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 149/370 (40%), Positives = 215/370 (58%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE-AYRAAGDASLG--EYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+   +  ++   +   G  ++  N   S+ IS  R + PD R  
Sbjct: 106 PPQDP--NSPGADGKAFQKDKWTHQETQEKEEGYKKHCFNAFASDRISLQRALGPDTRPP 163

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  LP  SV++VFHNE +S+L+RTV+S++  TPA  L+EIILVDD S+  
Sbjct: 164 ECVDQKFRRCP-PLPTTSVVIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTDE 222

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L ++LE Y+++    VR++R  ER+GLI  R  GA  ++ +V+ FLDAHCE    WL P
Sbjct: 223 YLKEQLEQYVKKLQ-VVRVVRQEERKGLITARLLGASVAQAQVLTFLDAHCECFHGWLEP 281

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D  ++  P I  ID  T+EF + V     H RG F+W + +    +P  E ++
Sbjct: 282 LLARIAEDETVVVSPDIVTIDLNTFEFSKPVQRGRVHSRGNFDWSLTFGWEAIPAHEKQR 341

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
           CS +GHV+R+  P+ F K        +I  N  R+ E W D  +K  FY R      +A 
Sbjct: 402 CSVVGHVFRTKSPHTFPKGVS-----VIARNQVRLAEVWMD-NYKEIFYRRNMQAAKMAQ 455

Query: 363 FLDMGDISEQ 372
               GDISE+
Sbjct: 456 EKSFGDISER 465


>gi|350426664|ref|XP_003494506.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 2 [Bombus impatiens]
          Length = 637

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 158/360 (43%), Positives = 206/360 (57%), Gaps = 29/360 (8%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
           G L  P E     PGE G+   LP     E  +   D  L     N   S+ IS  RT+P
Sbjct: 85  GVLVAPREQDPSAPGEMGRPVILPTNLTAETKKLVDDGWLNN-AFNQYVSDLISVHRTLP 143

Query: 66  DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
           D R   CK    Y  DLP  +VI+ FHNE +S L+RTVHS++ R+P   ++EIILVDDFS
Sbjct: 144 DPRDPWCKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFS 203

Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
               L ++LEDY+  +  KV++IR  +REGLIR R  GA  ++  V+ +LD+HCE    W
Sbjct: 204 DMPHLQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGW 262

Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 237
           L PLL  I  D   +  PVID ID  T E+        H+R       G F+W + +  +
Sbjct: 263 LEPLLDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 314

Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
            +PERE K+ K  +EP  SPT AGGLF++DRAFF  LG YD G  +WGGEN ELSFK WM
Sbjct: 315 AVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWM 374

Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           CGG++E VPCS +GH++R   PY +     R    ++  N  R+ E W DE  K Y+Y R
Sbjct: 375 CGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKRNSIRLSEVWLDEYAK-YYYQR 428


>gi|340723544|ref|XP_003400149.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 3 [Bombus terrestris]
          Length = 637

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 158/360 (43%), Positives = 206/360 (57%), Gaps = 29/360 (8%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
           G L  P E     PGE G+   LP     E  +   D  L     N   S+ IS  RT+P
Sbjct: 85  GVLVAPREQDPSAPGEMGRPVILPTNLTAETKKLVDDGWLNN-AFNQYVSDLISVHRTLP 143

Query: 66  DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
           D R   CK    Y  DLP  +VI+ FHNE +S L+RTVHS++ R+P   ++EIILVDDFS
Sbjct: 144 DPRDPWCKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFS 203

Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
               L ++LEDY+  +  KV++IR  +REGLIR R  GA  ++  V+ +LD+HCE    W
Sbjct: 204 DMPHLQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGW 262

Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 237
           L PLL  I  D   +  PVID ID  T E+        H+R       G F+W + +  +
Sbjct: 263 LEPLLDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 314

Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
            +PERE K+ K  +EP  SPT AGGLF++DRAFF  LG YD G  +WGGEN ELSFK WM
Sbjct: 315 AVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWM 374

Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           CGG++E VPCS +GH++R   PY +     R    ++  N  R+ E W DE  K Y+Y R
Sbjct: 375 CGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKRNSIRLSEVWLDEYAK-YYYQR 428


>gi|444515344|gb|ELV10843.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Tupaia chinensis]
          Length = 614

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 151/370 (40%), Positives = 214/370 (57%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAY---HLPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P  + PG  GKA+   +         +    ++  N   S+ IS  R + PD R  
Sbjct: 98  PPQDP--KSPGADGKAFQKNNWTPLETQEKEEGYKKHCFNAFASDRISLQRALGPDTRPP 155

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  LP  SVI+VFHNE +S+L+RTV+S++  TPA  L+EIILVDD S++ 
Sbjct: 156 ECVDQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTED 214

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L  KLE Y++     V+++R  ER+GLI  R  GAK ++ EV+ FLDAHCE    WL P
Sbjct: 215 YLKDKLEQYVKELQ-VVKVVRQVERKGLITARLLGAKVAQAEVLTFLDAHCECFHGWLEP 273

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D+ ++  P I  ID  T+EF + V     H RG F+W + +    LP  E ++
Sbjct: 274 LLARIAEDKTVVVSPDIVTIDLNTFEFSKPVQSGRVHSRGNFDWSLTFGWETLPPHEKQR 333

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
            K  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 334 HKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 393

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
           CS +GHV+R+  P+ F K  +     +I  N  R+ E W D  +K  FY R      +A 
Sbjct: 394 CSVVGHVFRTKSPHTFPKGIN-----VIARNQVRLAEVWMDS-YKQIFYRRNLQAAKMAQ 447

Query: 363 FLDMGDISEQ 372
               GDISE+
Sbjct: 448 EKSFGDISER 457


>gi|195172682|ref|XP_002027125.1| GL20074 [Drosophila persimilis]
 gi|194112938|gb|EDW34981.1| GL20074 [Drosophila persimilis]
          Length = 597

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 151/370 (40%), Positives = 212/370 (57%), Gaps = 15/370 (4%)

Query: 12  NLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
           +++  LE  K G GE G + HL    +  GDA   +  +N E S  +S++R++ D R   
Sbjct: 74  SIQLDLEKQKIGLGEQGASVHLSGKAKERGDAIYKKIALNEELSEQLSYNRSVGDHRNPL 133

Query: 72  C--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
           C  +++D    LP ASVI++F+NE +S L+RTVHS +     Q L+EIILVDD S   +L
Sbjct: 134 CLAQHFDSST-LPTASVIVIFYNEPYSVLLRTVHSTLITCNQQALKEIILVDDGSDNPEL 192

Query: 130 DQKLEDYIQRFN--GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
             KL+ YI+     GKV ++R   R GLIR R  GA+ + G+V++FLDAHCE  + W  P
Sbjct: 193 GGKLDYYIRTRTPPGKVTVLRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNVGWCEP 252

Query: 188 LLAPIYSDRKIMTVPVIDGID-----YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
           LL  I   R  + VP+ID ID     Y T  ++S       + G F+W  L +  +  +R
Sbjct: 253 LLHRIKESRTSVLVPIIDVIDANDFQYSTNGYKSFQVGGFQWNGHFDWINLPEREKQRQR 312

Query: 243 EAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSI 302
              K++    P  SPT AGGLFAMDR +F E+G YD  +  WGGEN E+SF+IW CGG+I
Sbjct: 313 RECKQEREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTI 372

Query: 303 EWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM 362
           E +PCSR+GH++R F PY F    DR    +   N  R+   W DE    +F  R  L  
Sbjct: 373 ETIPCSRVGHIFRDFHPYKFPN--DRDTHGI---NTARMALVWMDEFINIFFLNRPDLKF 427

Query: 363 FLDMGDISEQ 372
             D+GD++ +
Sbjct: 428 HADIGDVTHR 437


>gi|195120520|ref|XP_002004772.1| GI19414 [Drosophila mojavensis]
 gi|193909840|gb|EDW08707.1| GI19414 [Drosophila mojavensis]
          Length = 604

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 149/358 (41%), Positives = 210/358 (58%), Gaps = 16/358 (4%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC--KYWDYPLDLP 82
           G  G A HL  A +A G+    +  +N E S  +S++RT+ D R   C  + +D P  LP
Sbjct: 92  GNKGVAVHLTGAAKARGERIYKKIALNEELSEQLSYNRTVGDHRNPLCLNQKYDDPSTLP 151

Query: 83  KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFN 141
            ASV+++F+NE +S L+RTVHS +     + L+EIILVDD S  A+L  KL+ Y++ RF 
Sbjct: 152 TASVVIIFYNEPYSVLVRTVHSTLNTCNEKSLKEIILVDDGSDNAELGGKLDYYVRTRFP 211

Query: 142 -GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
            GKV ++R   R GLIR R  GA+ + G+V++FLDAHCE    W  PLL  I   R  + 
Sbjct: 212 PGKVTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEANEGWCEPLLQRIKESRTSVL 271

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR-KYNSEPYK---- 255
           VP+ID ID + +++ +         G F+W   +    LPERE +++ +  S+P +    
Sbjct: 272 VPIIDVIDAKDFQYSTNGYKSFQVGG-FQWSGHFDWVNLPEREKQRQLRECSQPREICPA 330

Query: 256 -SPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
            SPT AGGLFAMDR +F E+G YD  +  WGGEN E+SF+IW CGG+IE +PCSR+GH++
Sbjct: 331 YSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETIPCSRVGHIF 390

Query: 315 RSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           R F PY F    D   G     N  R+   W DE    +F  R  L    D+GD++ +
Sbjct: 391 RDFHPYKFPNDRD-THG----INTARMALVWMDEYINVFFLNRPDLKFHPDIGDVTHR 443


>gi|349732170|ref|NP_001231847.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1-like [Sus
           scrofa]
          Length = 557

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 144/345 (41%), Positives = 199/345 (57%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY AA     GE     +  N   S+ +S DR I D R   C    Y  DLP  SVI+ 
Sbjct: 71  KAYLAAKQLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVIIT 130

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RTV S++ RTPA  ++EIILVDDFSS  + D  L   I     KV+ +RN
Sbjct: 131 FHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 185

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
             REGLIR+R RGA  +   V+ FLD+HCEV   WL P+L  +  D   +  P+ID I  
Sbjct: 186 DRREGLIRSRVRGADVAAAGVLTFLDSHCEVNTEWLQPMLQRVKEDHTRVVSPIIDVISL 245

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +       ++P ++P  AGG+F +D++
Sbjct: 246 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKIAWTDPTKPIRTPVIAGGIFVIDKS 302

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PYNF       
Sbjct: 303 WFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFP------ 356

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N KR  E W DE +K Y+Y   P A+    G ++ +
Sbjct: 357 EGNALTYIRNTKRTAEVWMDE-YKQYYYEARPSAIGKAFGSVATR 400


>gi|148706467|gb|EDL38414.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14, isoform CRA_c [Mus
           musculus]
          Length = 429

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 136/325 (41%), Positives = 191/325 (58%), Gaps = 17/325 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R + C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   ++EIILVDDFS+  +  ++L         KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSNDPEDCKQLIKL-----PKVKCLRNNERQGLVRSR 183

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 184 MRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---IE 240

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  +L   +   R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 241 SASELRGGFDWSLHFQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDV 300

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGG +E +PCSR+GHV+R   PY F        G   TY  N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354

Query: 338 YKRVIETWFDEKHKAYFYTREPLAM 362
            KR  E W DE +K Y+Y   P A+
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFAL 378


>gi|332027983|gb|EGI68034.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Acromyrmex
           echinatior]
          Length = 597

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 145/352 (41%), Positives = 208/352 (59%), Gaps = 13/352 (3%)

Query: 25  GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 84
           G  G+  +L    +  G+A+L +  +N+  SN IS  R +PD+R   C    Y   LP A
Sbjct: 85  GNNGEPAYLYGREKILGEAALAKKALNVILSNKISLTRKLPDVRNPLCANVTYDKLLPSA 144

Query: 85  SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGK 143
           S+I++F+NE +S L+RTVHS++K +P   L+EIILVDD S + +L  +L+ Y+  R   K
Sbjct: 145 SIIIIFYNEPWSVLLRTVHSVLKGSPPNLLKEIILVDDHSEEEELQGQLDYYLSTRLPAK 204

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
           V+L+R   R+GLIR R  GAK + G+V+VFLDAHCEV  +WL PLL  I  ++  + +P+
Sbjct: 205 VKLLRLPYRQGLIRARLHGAKNAVGDVLVFLDAHCEVIKDWLQPLLQRIKDNKNAVLMPI 264

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 263
           ID I  +T E+    E      G F W   +    + + E + R     P +SPT AGGL
Sbjct: 265 IDNISEETLEYFHDNEAFFFQVGGFTWSGHFTWITIQKHEVESRFSPISPTRSPTMAGGL 324

Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
           FA++R +F E+G YD  +  WGGEN E+SF+IW CGG++E +PCSR+GH++R+F PY F 
Sbjct: 325 FAINRKYFWEIGSYDDKMDGWGGENLEISFRIWQCGGTLEIIPCSRVGHIFRNFHPYKFP 384

Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLD----MGDISE 371
              D   G     N  R+   W DE  + +   R   + F D    +GDISE
Sbjct: 385 NDKD-THG----INTARLAFVWMDEYKRLFLLHR---SEFKDNPELIGDISE 428


>gi|125810093|ref|XP_001361353.1| GA20875 [Drosophila pseudoobscura pseudoobscura]
 gi|54636528|gb|EAL25931.1| GA20875 [Drosophila pseudoobscura pseudoobscura]
          Length = 597

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 151/370 (40%), Positives = 212/370 (57%), Gaps = 15/370 (4%)

Query: 12  NLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
           +++  LE  K G GE G + HL    +  GDA   +  +N E S  +S++R++ D R   
Sbjct: 74  SIQLDLEKQKIGLGEQGASVHLSGKAKERGDAIYKKIALNEELSEQLSYNRSVGDHRNPL 133

Query: 72  C--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 129
           C  +++D    LP ASVI++F+NE +S L+RTVHS +     Q L+EIILVDD S   +L
Sbjct: 134 CLAQHFDSST-LPTASVIVIFYNEPYSVLLRTVHSTLITCNQQALKEIILVDDGSDNPEL 192

Query: 130 DQKLEDYIQRFN--GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
             KL+ YI+     GKV ++R   R GLIR R  GA+ + G+V++FLDAHCE  + W  P
Sbjct: 193 GGKLDYYIRTRTPPGKVTVLRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNVGWCEP 252

Query: 188 LLAPIYSDRKIMTVPVIDGID-----YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 242
           LL  I   R  + VP+ID ID     Y T  ++S       + G F+W  L +  +  +R
Sbjct: 253 LLHRIKESRTSVLVPIIDVIDANDFQYSTNGYKSFQVGGFQWNGHFDWINLPEREKQRQR 312

Query: 243 EAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSI 302
              K++    P  SPT AGGLFAMDR +F E+G YD  +  WGGEN E+SF+IW CGG+I
Sbjct: 313 RECKQEREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTI 372

Query: 303 EWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM 362
           E +PCSR+GH++R F PY F    DR    +   N  R+   W DE    +F  R  L  
Sbjct: 373 ETIPCSRVGHIFRDFHPYKFPN--DRDTHGI---NTARMALVWMDEFINIFFLNRPDLKF 427

Query: 363 FLDMGDISEQ 372
             D+GD++ +
Sbjct: 428 HADIGDVTHR 437


>gi|350426661|ref|XP_003494505.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 1 [Bombus impatiens]
          Length = 602

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 158/360 (43%), Positives = 206/360 (57%), Gaps = 29/360 (8%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
           G L  P E     PGE G+   LP     E  +   D  L     N   S+ IS  RT+P
Sbjct: 85  GVLVAPREQDPSAPGEMGRPVILPTNLTAETKKLVDDGWLNN-AFNQYVSDLISVHRTLP 143

Query: 66  DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
           D R   CK    Y  DLP  +VI+ FHNE +S L+RTVHS++ R+P   ++EIILVDDFS
Sbjct: 144 DPRDPWCKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFS 203

Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
               L ++LEDY+  +  KV++IR  +REGLIR R  GA  ++  V+ +LD+HCE    W
Sbjct: 204 DMPHLQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGW 262

Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 237
           L PLL  I  D   +  PVID ID  T E+        H+R       G F+W + +  +
Sbjct: 263 LEPLLDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 314

Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
            +PERE K+ K  +EP  SPT AGGLF++DRAFF  LG YD G  +WGGEN ELSFK WM
Sbjct: 315 AVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWM 374

Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           CGG++E VPCS +GH++R   PY +     R    ++  N  R+ E W DE  K Y+Y R
Sbjct: 375 CGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKRNSIRLSEVWLDEYAK-YYYQR 428


>gi|410910894|ref|XP_003968925.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
           [Takifugu rubripes]
          Length = 577

 Score =  261 bits (667), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 154/377 (40%), Positives = 223/377 (59%), Gaps = 30/377 (7%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHL--PEAYRAAGDASLGEYGMNMETSNHI 58
           RPV++        +PPL+     PGE G+A  L   E  +   + SL ++ +N+  S+ +
Sbjct: 56  RPVYE--------KPPLD--WNAPGEMGRAVRLTLSEEEKRKEEESLQKHQINIYISDKV 105

Query: 59  SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 117
           S  R +P+     C+   Y    LP  SVI+ F+NEG+S+L+RTVHS+++ +P   L+E+
Sbjct: 106 SLHRRLPERWNPLCRQLKYDYRSLPTTSVIIAFYNEGWSTLLRTVHSVLETSPDILLKEV 165

Query: 118 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 177
           +LVDD+S +A L + LE+YI     KVRLIR T+REGL+R R  GA  + G+V+ FLD H
Sbjct: 166 VLVDDYSDRAHLKEPLENYISGLK-KVRLIRATKREGLVRARLLGASITTGDVLTFLDCH 224

Query: 178 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKE 236
           CE    WL PLL  I  +   +  PVID ID+  +++  +  EP     G F+W +++  
Sbjct: 225 CECHEGWLEPLLHRIKEEPSAVVCPVIDVIDWNNFQYLGNAGEPQ---IGGFDWRLVFTW 281

Query: 237 NELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 296
           + +PE E K+RK  ++  +SPT AGGLFA+ + +F  LG YD G+ VWGGEN E SF+IW
Sbjct: 282 HSIPEYEQKRRKSPTDVIRSPTMAGGLFAVSKNYFHYLGTYDTGMEVWGGENLEFSFRIW 341

Query: 297 MCGGSIEWVPCSRIGHVYRSFMPYNFGK-LADRVKGPLITYNYKRVIETWFDEKHKAYFY 355
            CGGS+E  PCS +GHV+    PY+  K LA+ V          R  E W DE +K  +Y
Sbjct: 342 QCGGSLEVHPCSHVGHVFPKKAPYSRNKALANSV----------RAAEVWMDE-YKEIYY 390

Query: 356 TREPLAMFLDMGDISEQ 372
            R P A     GD++E+
Sbjct: 391 HRNPHARLEAYGDVTER 407


>gi|340723540|ref|XP_003400147.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 1 [Bombus terrestris]
 gi|340723542|ref|XP_003400148.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 2 [Bombus terrestris]
          Length = 602

 Score =  261 bits (667), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 158/360 (43%), Positives = 206/360 (57%), Gaps = 29/360 (8%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
           G L  P E     PGE G+   LP     E  +   D  L     N   S+ IS  RT+P
Sbjct: 85  GVLVAPREQDPSAPGEMGRPVILPTNLTAETKKLVDDGWLNN-AFNQYVSDLISVHRTLP 143

Query: 66  DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
           D R   CK    Y  DLP  +VI+ FHNE +S L+RTVHS++ R+P   ++EIILVDDFS
Sbjct: 144 DPRDPWCKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFS 203

Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
               L ++LEDY+  +  KV++IR  +REGLIR R  GA  ++  V+ +LD+HCE    W
Sbjct: 204 DMPHLQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGW 262

Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 237
           L PLL  I  D   +  PVID ID  T E+        H+R       G F+W + +  +
Sbjct: 263 LEPLLDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 314

Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
            +PERE K+ K  +EP  SPT AGGLF++DRAFF  LG YD G  +WGGEN ELSFK WM
Sbjct: 315 AVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWM 374

Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           CGG++E VPCS +GH++R   PY +     R    ++  N  R+ E W DE  K Y+Y R
Sbjct: 375 CGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKRNSIRLSEVWLDEYAK-YYYQR 428


>gi|348510947|ref|XP_003443006.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1-like
           [Oreochromis niloticus]
          Length = 567

 Score =  261 bits (667), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 132/345 (38%), Positives = 201/345 (58%), Gaps = 22/345 (6%)

Query: 35  EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 89
           +AY AA    LG+     +  N++ S+ +  +R I D R   C    Y  DLP  S+++ 
Sbjct: 86  KAYLAAKQLKLGDDPYKDHAFNLQESDRLGGERAIRDTRHYRCAALTYDTDLPSTSIVIT 145

Query: 90  FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 149
           FHNE  S+L+RT+ S++ R+P   ++EIIL+DDFSS  +  Q L         KVR +RN
Sbjct: 146 FHNEARSTLLRTIKSVLMRSPPSLIQEIILIDDFSSDPEDCQLLAQI-----PKVRCLRN 200

Query: 150 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 209
             REGLIR+R RGA  +   ++ FLD+HCEV  +WL P++  +  D   +  P+ID I  
Sbjct: 201 GRREGLIRSRVRGANMASASILTFLDSHCEVNTDWLQPMIQRVKEDHTRVVSPIIDVISL 260

Query: 210 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
             + + +        RG F+W + +K  ++P  +   R   ++  ++P  AGG+F MDR+
Sbjct: 261 DNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMARSDPTQAIRTPVIAGGIFVMDRS 317

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F  LG YD  + +WGGENFELSF++W+CGGS+E +PCSR+GHV+R   PY+F       
Sbjct: 318 WFNHLGQYDTHMDIWGGENFELSFRVWLCGGSLEILPCSRVGHVFRKRHPYDFP------ 371

Query: 330 KGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           +G  +TY  N +R  E W DE +K Y+Y+  P A     G ++++
Sbjct: 372 EGNALTYIKNTRRAAEVWMDE-YKQYYYSARPSAQGKAFGSVTDR 415


>gi|363734723|ref|XP_003641443.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 isoform 2
           [Gallus gallus]
          Length = 557

 Score =  261 bits (667), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 140/346 (40%), Positives = 203/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y  DLP  S+I+
Sbjct: 73  KAY-LSSKLLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRCTSVRYDTDLPATSLII 131

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTP   ++EIILVDDFSS  + D +L   I     KV+ +R
Sbjct: 132 TFHNEARSALLRTVKSVLNRTPPNLIQEIILVDDFSSDPE-DCQLLTRIP----KVKCLR 186

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA+ +  +++ FLD+HCEV   WL P+L  +  D   +  P+ID I 
Sbjct: 187 NIRREGLIRSRVRGAEAATADILTFLDSHCEVNSEWLQPMLQRVKEDYTRVVSPIIDVIS 246

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   ++  ++P  AGG+F +++
Sbjct: 247 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVINK 303

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PY+F      
Sbjct: 304 SWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYDFP----- 358

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G I+++
Sbjct: 359 -EGNALTYIKNTKRTAEVWMDE-YKQYYYEARPSAIGKSYGSIADR 402


>gi|326920610|ref|XP_003206562.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1-like
           [Meleagris gallopavo]
          Length = 509

 Score =  261 bits (667), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 139/346 (40%), Positives = 201/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y  DLP  S+I+
Sbjct: 25  KAY-LSSKLLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRCTSVRYDADLPATSLII 83

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTP   ++EIILVDDFSS  +  Q L         KV+ +R
Sbjct: 84  TFHNEARSALLRTVKSVLNRTPPNLIQEIILVDDFSSDPEDCQLLTKI-----PKVKCLR 138

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA+ +  +++ FLD+HCEV   WL P+L  +  D   +  P+ID I 
Sbjct: 139 NIRREGLIRSRVRGAEVATADILTFLDSHCEVNSEWLQPMLQRVKEDYTRVVSPIIDVIS 198

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   ++  ++P  AGG+F +++
Sbjct: 199 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVINK 255

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PY+F      
Sbjct: 256 SWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYDFP----- 310

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G I+++
Sbjct: 311 -EGNALTYIKNTKRTAEVWMDE-YKQYYYEARPSAIGKSYGSIADR 354


>gi|328699727|ref|XP_001944936.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Acyrthosiphon pisum]
          Length = 581

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 137/343 (39%), Positives = 208/343 (60%), Gaps = 19/343 (5%)

Query: 36  AYRAAG-----DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVF 90
           AY AAG     D +      N   S+ +  +R +PD R  +C    Y +DLP+ SVI+ F
Sbjct: 93  AYVAAGGLRHGDDAYSRNKFNQLASDSLRSNRPVPDTRNAKCLTKKYRIDLPQTSVIITF 152

Query: 91  HNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNT 150
           HNE  S+L+RTV S++ R+P   ++EIILVDDFS  +   Q+L   IQ    KV+LIRN 
Sbjct: 153 HNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDDSTDGQELSK-IQ----KVKLIRNE 207

Query: 151 EREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQ 210
           +REGL+R+R RG++ +   V+ FLD+H E  +NWL PLL  +  D   +  P+ID I+  
Sbjct: 208 KREGLMRSRVRGSEIATAPVLTFLDSHVECNVNWLEPLLDRVAEDPTRVVCPIIDVINMD 267

Query: 211 TWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKRKYNSEPYKSPTHAGGLFAMDRA 269
            +++          RG F+W +++K   L  E  A+++K  + P ++P  AGGLF MD+ 
Sbjct: 268 NFQY---IGASSELRGGFDWNLVFKWEYLSKEVRAQRQKDPTLPIRTPMIAGGLFVMDKD 324

Query: 270 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRV 329
           +F++LG YD  + +WGGEN E+SF++W CGGS+E +PCSR+GHV+R   PY F   +   
Sbjct: 325 YFVKLGTYDKEMNIWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYTFPGGS--- 381

Query: 330 KGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            G +  +N +R  E W D+ +K Y+Y   PL+  +  G+I+++
Sbjct: 382 -GNVFAHNTRRAAEVWMDQ-YKRYYYNAVPLSRIVPFGNIADR 422


>gi|328723398|ref|XP_001946977.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           isoform 1 [Acyrthosiphon pisum]
 gi|328723400|ref|XP_003247833.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           isoform 2 [Acyrthosiphon pisum]
          Length = 624

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 131/307 (42%), Positives = 196/307 (63%), Gaps = 11/307 (3%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRT 109
           N+  S+ I  +R++PD+R + C+     +D LP ++VI+VFHNE +S+LMRTV S+I R+
Sbjct: 130 NLMASDRIPLNRSLPDVRKKSCRLKKIDIDKLPSSTVIIVFHNEAWSTLMRTVQSVIDRS 189

Query: 110 PAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGE 169
           P   L EIILVDD S++  L+++L+DY+ +     R+IR+ +R GLI+ R  GA++++G+
Sbjct: 190 PKYLLNEIILVDDASTRKFLEKELDDYVAKLPVLTRIIRSPKRIGLIKARLMGARQAKGK 249

Query: 170 VIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFE 229
           ++VFLDAHCE  L WL  L++ +  DRK +  PVID I  +T+ +   +E   H+ G F 
Sbjct: 250 ILVFLDAHCECTLGWLEALVSRVAEDRKRVVCPVIDIISDETFAYVRSFE--LHW-GAFN 306

Query: 230 WGMLYK--ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
           W + ++      P+    +R   ++ +++P  AGGLFAMD+++F ELGGYD  + +WGGE
Sbjct: 307 WDLHFRWYTRTTPDIMKGQRDI-TQAFRTPAMAGGLFAMDKSYFFELGGYDERMEIWGGE 365

Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
           N ELSF++W CGGSIE  PCS +GHV+R   PY F      V    +  N  RV   W D
Sbjct: 366 NLELSFRVWQCGGSIEIAPCSHVGHVFRKSSPYTFPGGVSHV----LYTNLARVALVWMD 421

Query: 348 EKHKAYF 354
           E  + YF
Sbjct: 422 EWQEFYF 428


>gi|363734725|ref|XP_001231965.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 isoform 1
           [Gallus gallus]
          Length = 563

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 140/346 (40%), Positives = 203/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L      AG+    ++  N   S+ +S DR I D R   C    Y  DLP  S+I+
Sbjct: 79  KAY-LSSKLLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRCTSVRYDTDLPATSLII 137

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ RTP   ++EIILVDDFSS  + D +L   I     KV+ +R
Sbjct: 138 TFHNEARSALLRTVKSVLNRTPPNLIQEIILVDDFSSDPE-DCQLLTRIP----KVKCLR 192

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA+ +  +++ FLD+HCEV   WL P+L  +  D   +  P+ID I 
Sbjct: 193 NIRREGLIRSRVRGAEAATADILTFLDSHCEVNSEWLQPMLQRVKEDYTRVVSPIIDVIS 252

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   ++  ++P  AGG+F +++
Sbjct: 253 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSRTDPTQSIRTPVIAGGIFVINK 309

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PY+F      
Sbjct: 310 SWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYDFP----- 364

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N KR  E W DE +K Y+Y   P A+    G I+++
Sbjct: 365 -EGNALTYIKNTKRTAEVWMDE-YKQYYYEARPSAIGKSYGSIADR 408


>gi|449497211|ref|XP_002190803.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
           [Taeniopygia guttata]
          Length = 669

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 130/323 (40%), Positives = 201/323 (62%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR+IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++K++P
Sbjct: 203 NQVESDKLRMDRSIPDTRHDQCQRKQWRIDLPATSVVITFHNEARSALLRTVVSVLKKSP 262

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
           +  ++EIILVDD+S+  D D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 263 SHLIKEIILVDDYSNDPD-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 317

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE   +WL PLL  +  D+  +  P+ID I+   +++          +G F+W
Sbjct: 318 LTFLDSHCECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGASA---DLKGGFDW 374

Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+++F ELG YD  + VWGGEN 
Sbjct: 375 NLVFKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENL 434

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 435 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 489

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 490 YKNFYYAAVPSARNVPYGNIQSR 512


>gi|118403595|ref|NP_001072369.1| polypeptide N-acetylgalactosaminyltransferase 14 [Xenopus
           (Silurana) tropicalis]
 gi|111305707|gb|AAI21473.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14 [Xenopus (Silurana)
           tropicalis]
          Length = 555

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 143/328 (43%), Positives = 192/328 (58%), Gaps = 18/328 (5%)

Query: 48  YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
           Y  N   S  I  DR I D R   C    Y  DLP  SVI+ FHNE  S+L+RT+ S++ 
Sbjct: 77  YAFNQRESERIPSDRAIKDTRHYRCTELHYQSDLPPTSVIITFHNEARSTLLRTIRSVLN 136

Query: 108 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTERE-GLIRTRSRGAKES 166
           RTP   + EI+LVDDFS   D D +L   +     KVR +RN +RE GLIR+R RGA  +
Sbjct: 137 RTPMHLIHEILLVDDFSDNLD-DCRLLSKLP----KVRCLRNEQREAGLIRSRVRGAGVA 191

Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
           +  V+ FLD+HCEV  +WLPPLL  I  D   +  PVID I+  T+ + +        RG
Sbjct: 192 QAAVLTFLDSHCEVNKDWLPPLLHRIKEDPTRVVSPVIDIINLDTFAYIAA---SSDLRG 248

Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
            F+W + +K  +L   +  KR   +EP K+P  AGGLF +++++F  LG YD  + +WGG
Sbjct: 249 GFDWSLHFKWEQLSAEQKAKRLDPTEPIKTPVIAGGLFVIEKSWFNHLGKYDTAMDIWGG 308

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIET 344
           ENFE+SF++WMCGGS+E +PCSR+GHV+R   PY F       +G   TY  N KR  E 
Sbjct: 309 ENFEISFRVWMCGGSLEIIPCSRVGHVFRKKHPYVFP------EGNANTYIKNTKRTAEV 362

Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           W DE  K ++Y   P A     GDI ++
Sbjct: 363 WMDE-FKNHYYAARPAAQGRPYGDIQKR 389


>gi|195550891|ref|XP_002076130.1| GD11982 [Drosophila simulans]
 gi|194201779|gb|EDX15355.1| GD11982 [Drosophila simulans]
          Length = 541

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 140/384 (36%), Positives = 213/384 (55%), Gaps = 43/384 (11%)

Query: 28  GKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVI 87
           GK   +P   +        E   N+  S+ IS +R++ D+R E C+   Y   LP  S++
Sbjct: 2   GKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGCRRKHYASKLPTTSIV 61

Query: 88  LVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLI 147
           +VFHNE +++L+RTV S+I R+P   L+EIILVDD S +  L ++LE+Y+ +   K  ++
Sbjct: 62  IVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQLEEYVAKLPVKTFVL 121

Query: 148 RNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGI 207
           R  +R GLIR R  GA+   GEVI FLDAHCE    WL PLLA I  +R+ +  P+ID I
Sbjct: 122 RTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARIVQNRRTVVCPIIDVI 181

Query: 208 DYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAM 266
             +T+E+  +   D  + G F W + ++   +P RE  +R  + + P ++PT AGGLF++
Sbjct: 182 SDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSI 238

Query: 267 DRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF-GKL 325
           D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PCS +GHV+R   PY F G +
Sbjct: 239 DKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGV 298

Query: 326 ADRV-------------------------------------KGPLITYNYKRVIETWFDE 348
           A  V                                        ++ +N  R++E W D+
Sbjct: 299 AKIVLHNAARVWMCGGVLEIAPCSRVGHVFRKSTPYTFPGGTTEIVNHNNARLVEVWLDD 358

Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
             K ++Y+  P A     GD+S++
Sbjct: 359 -WKEFYYSFYPGARKASAGDVSDR 381


>gi|148706465|gb|EDL38412.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 14, isoform CRA_a [Mus
           musculus]
          Length = 515

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 140/336 (41%), Positives = 193/336 (57%), Gaps = 22/336 (6%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R + C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 31  VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 90

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 158
           RT+ S++ RTP   ++EIILVDDFS+        ED  Q     KV+ +RN ER+GL+R+
Sbjct: 91  RTIRSVLNRTPMHLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNNERQGLVRS 144

Query: 159 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLL---APIYSDRKIMTVPVIDGIDYQTWEFR 215
           R RGA  ++G  + FLD+HCEV  +WL PLL     +  D   +  PVID I+  T+ + 
Sbjct: 145 RMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEVLQDYTRVVCPVIDIINLDTFNY- 203

Query: 216 SVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELG 275
              E     RG F+W + ++  +L   +   R   +EP ++P  AGGLF +D+A+F  LG
Sbjct: 204 --IESASELRGGFDWSLHFQWEQLSLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLG 261

Query: 276 GYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLIT 335
            YD  + +WGGENFE+SF++WMCGG +E +PCSR+GHV+R   PY F        G   T
Sbjct: 262 KYDVDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANT 315

Query: 336 Y--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           Y  N KR  E W DE +K Y+Y   P A+    G+I
Sbjct: 316 YIKNTKRTAEVWMDE-YKQYYYAARPFALERPFGNI 350


>gi|113677422|ref|NP_001038460.1| polypeptide N-acetylgalactosaminyltransferase 14 [Danio rerio]
          Length = 554

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 138/327 (42%), Positives = 189/327 (57%), Gaps = 23/327 (7%)

Query: 48  YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
           Y  N   S  I  +R + D R   C    Y  DLP  ++++ FHNE  S+L+RTV S++ 
Sbjct: 79  YAFNQRESERIPSNRALRDTRHYRCTTLHYDPDLPSTTIVITFHNEARSTLLRTVRSVLN 138

Query: 108 RTPAQYLEEIILVDDFSSKAD---LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           RTP   + EIILVDDFS   +   L  KL         KV+ +RN  REGLIR+R RGA 
Sbjct: 139 RTPVHLIHEIILVDDFSEDPNDCLLLTKLP--------KVKCLRNKHREGLIRSRVRGAD 190

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            +  +++ FLD+HCEV  +WLPPLL  +  D   +  PVID I+  T+ + +        
Sbjct: 191 AAGAQILTFLDSHCEVNKDWLPPLLQRVKEDPTSVASPVIDIINMDTFAYVAA---SSDL 247

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RG F+W + +K  +L   +  KR   +EP K+P  AGGLF +DR++F  LG YD  + +W
Sbjct: 248 RGGFDWSLHFKWEQLSAEKRAKRADPTEPIKTPIIAGGLFVIDRSWFNRLGKYDTAMDIW 307

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVI 342
           GGENFE+SF++WMCGGS+E +PCSR+GHV+R   PY F       +G   TY  N +R  
Sbjct: 308 GGENFEISFRVWMCGGSLEIIPCSRVGHVFRKKHPYIFP------EGNANTYIKNTRRTA 361

Query: 343 ETWFDEKHKAYFYTREPLAMFLDMGDI 369
           E W DE  K ++Y+  P A     GDI
Sbjct: 362 EVWMDE-FKLFYYSARPAARGKSYGDI 387


>gi|71682529|gb|AAI00448.1| Galntl5 protein, partial [Mus musculus]
          Length = 447

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 132/328 (40%), Positives = 198/328 (60%), Gaps = 11/328 (3%)

Query: 45  LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
           L  YG+N   S  +  +R +PD R + C+   YP +LP AS+I+ F+NE F++L+R V S
Sbjct: 94  LRRYGLNAIMSRRLGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSS 153

Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           ++  +P   LEEIILVDD S   DL  KL+ Y++ F GKV+LIRN +REGLIR++  GA 
Sbjct: 154 VVNLSPQHLLEEIILVDDMSEFDDLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGAS 213

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            + G+++VFLD+HCEV   WL PLL  I  D K++  P+ID I+  T ++ +        
Sbjct: 214 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPIIDVINELTLDYMAA----PIV 269

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RG F+W +  + + +   E    +  S P +SP   GG+FA++R +F ELG YD G+ + 
Sbjct: 270 RGAFDWNLNLRWDNVFAYELDGPEGPSTPIRSPAMTGGIFAINRHYFNELGQYDNGMDIC 329

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
           GGEN ELS +IWMCGG +  +PCSR+G+  ++   +       R     ++ N  RV+  
Sbjct: 330 GGENVELSLRIWMCGGQLFILPCSRVGYNSKALSQHR------RANQSALSRNLLRVVHV 383

Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           W DE +K  F+ + P   ++  G+ISE+
Sbjct: 384 WLDE-YKGNFFLQRPSLTYVSCGNISER 410


>gi|148671133|gb|EDL03080.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 5, isoform CRA_a
           [Mus musculus]
          Length = 490

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 133/328 (40%), Positives = 197/328 (60%), Gaps = 11/328 (3%)

Query: 45  LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
           L  YG+N   S  +  +R +PD R + C+   YP +LP AS+I+ F+NE F++L+R V S
Sbjct: 137 LRRYGLNAIMSRRLGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSS 196

Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           ++  +P   LEEIILVDD S   DL  KL+ Y++ F GKV+LIRN +REGLIR++  GA 
Sbjct: 197 VVNLSPQHLLEEIILVDDMSEFDDLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGAS 256

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            + G+++VFLD+HCEV   WL PLL  I  D K++  P+ID I+  T +    Y      
Sbjct: 257 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIV 312

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RG F+W +  + + +   E    +  S P +SP   GG+FA++R +F ELG YD G+ + 
Sbjct: 313 RGAFDWNLNLRWDNVFAYELDGPEGPSTPIRSPAMTGGIFAINRHYFNELGQYDNGMDIC 372

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
           GGEN ELS +IWMCGG +  +PCSR+G+  ++   +       R     ++ N  RV+  
Sbjct: 373 GGENVELSLRIWMCGGQLFILPCSRVGYNSKALSQHR------RANQSALSRNLLRVVHV 426

Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           W DE +K  F+ + P   ++  G+ISE+
Sbjct: 427 WLDE-YKGNFFLQRPSLTYVSCGNISER 453


>gi|12832954|dbj|BAB22325.1| unnamed protein product [Mus musculus]
          Length = 429

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 135/325 (41%), Positives = 191/325 (58%), Gaps = 17/325 (5%)

Query: 40  AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 99
            GD     Y  N   S  IS +R +PD R + C    Y  DLP  S+I+ FHNE  S+L+
Sbjct: 69  VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 128

Query: 100 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 159
           RT+ S++ RTP   ++EIILVDDFS+  +  ++L         KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSNDPEDCKQLIKL-----PKVKCLRNNERQGLVRSR 183

Query: 160 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 219
            RGA  ++G  + FLD+HCEV  +WL PLL  +  D   +  PVID I+  T+ +    E
Sbjct: 184 MRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---IE 240

Query: 220 PDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
                RG F+W + ++  ++   +   R   +EP ++P  AGGLF +D+A+F  LG YD 
Sbjct: 241 SASELRGGFDWSLHFQWEQISLEQKALRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDV 300

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF++WMCGG +E +PCSR+GHV+R   PY F        G   TY  N
Sbjct: 301 DMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFP------DGNANTYIKN 354

Query: 338 YKRVIETWFDEKHKAYFYTREPLAM 362
            KR  E W DE +K Y+Y   P A+
Sbjct: 355 TKRTAEVWMDE-YKQYYYAARPFAL 378


>gi|326923175|ref|XP_003207815.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Meleagris gallopavo]
          Length = 709

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 142/353 (40%), Positives = 209/353 (59%), Gaps = 10/353 (2%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           + PG+ G    +P+  +    +   E   N+  S+ I  DR I D R   C       DL
Sbjct: 208 QAPGQFGHPVAVPDDKQEEAKSRWKEGNFNVFLSDLIPVDRAIADTRPAGCLEQQVHDDL 267

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  ++I+ F +E +S+L+R+VHS++ R+P   L+E+ILVDDFS+K  L +KL+ Y+ +F 
Sbjct: 268 PTTTIIMCFVDEVWSTLLRSVHSVLSRSPPHLLQELILVDDFSTKDYLKEKLDAYMSQFP 327

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
            KV+++   ER GLIR R  GA+ + G V+ FLD+H E  + WL PLL  +   R  +  
Sbjct: 328 -KVKVLHLRERHGLIRARLAGAQMATGTVLTFLDSHVECNVGWLEPLLERVRLHRARVAC 386

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
           PVI+ I  +   + +V   D+  RGIF W M +   ++P+   +K K   ++  + P  A
Sbjct: 387 PVIEVISDKDMSYMTV---DNFQRGIFTWPMNFGWKQIPQEVIEKNKLKETDIIRCPVMA 443

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLF++++ +F ELG YD GL VWGGEN ELSFK+WMCGG IE VPCSR+GH++R+  PY
Sbjct: 444 GGLFSVEKKYFFELGTYDSGLDVWGGENMELSFKVWMCGGEIEIVPCSRVGHIFRNDNPY 503

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDE-KHKAYFYTREPLAMFLDMGDISEQ 372
           +F K  DRV+   +  N  RV E W D  K   Y +    L    ++GD+S+Q
Sbjct: 504 SFPK--DRVR--TVERNLARVAEVWLDGYKELFYGHAYHLLQRRAELGDLSQQ 552


>gi|19922324|ref|NP_611043.1| GalNAc-T1, isoform A [Drosophila melanogaster]
 gi|24653878|ref|NP_725472.1| GalNAc-T1, isoform B [Drosophila melanogaster]
 gi|51315876|sp|Q6WV20.2|GALT1_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
           Short=pp-GaNTase 1; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 1; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 1
 gi|10121393|gb|AAG13184.1|AF218236_1 polypeptide N-acetylgalactosaminyltransferase [Drosophila
           melanogaster]
 gi|7303062|gb|AAF58130.1| GalNAc-T1, isoform B [Drosophila melanogaster]
 gi|21064373|gb|AAM29416.1| RE14585p [Drosophila melanogaster]
 gi|21645385|gb|AAM70974.1| GalNAc-T1, isoform A [Drosophila melanogaster]
 gi|220947986|gb|ACL86536.1| GalNAc-T1-PA [synthetic construct]
          Length = 601

 Score =  260 bits (665), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 146/364 (40%), Positives = 207/364 (56%), Gaps = 13/364 (3%)

Query: 17  LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
           L+  K G GE G A HL  A +  GD    +  +N E S  ++++R++ D R   C    
Sbjct: 83  LQKQKVGLGEQGVAVHLSGAAKERGDEIYKKIALNEELSEQLTYNRSVGDHRNPLCAKQR 142

Query: 77  YPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
           +  D LP ASV+++F NE +S L+RTVHS +     + L+EIILVDD S   +L  KL+ 
Sbjct: 143 FDSDSLPTASVVIIFFNEPYSVLLRTVHSTLSTCNEKALKEIILVDDGSDNVELGAKLDY 202

Query: 136 YIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIY 193
           Y++    +GKV ++R   R GLIR R  GA+ + G+V++FLDAHCE  + W  PLL  I 
Sbjct: 203 YVRTRIPSGKVTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQRIK 262

Query: 194 SDRKIMTVPVIDGID-----YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
             R  + VP+ID ID     Y T  ++S       + G F+W  L +  +  +R   K++
Sbjct: 263 ESRTSVLVPIIDVIDANDFQYSTNGYKSFQVGGFQWNGHFDWINLPEREKQRQRRECKQE 322

Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
               P  SPT AGGLFA+DR +F E+G YD  +  WGGEN E+SF+IW CGG+IE +PCS
Sbjct: 323 REICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETIPCS 382

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
           R+GH++R F PY F    DR    +   N  R+   W DE    +F  R  L    D+GD
Sbjct: 383 RVGHIFRDFHPYKFPN--DRDTHGI---NTARMALVWMDEYINIFFLNRPDLKFHADIGD 437

Query: 369 ISEQ 372
           ++ +
Sbjct: 438 VTHR 441


>gi|113931290|ref|NP_001039091.1| polypeptide N-acetylgalactosaminyltransferase-like 1 [Xenopus
           (Silurana) tropicalis]
 gi|89268082|emb|CAJ83416.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Xenopus
           (Silurana) tropicalis]
 gi|111305589|gb|AAI21348.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Xenopus
           (Silurana) tropicalis]
 gi|134026192|gb|AAI35810.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 1 [Xenopus
           (Silurana) tropicalis]
          Length = 562

 Score =  260 bits (665), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 135/343 (39%), Positives = 198/343 (57%), Gaps = 17/343 (4%)

Query: 32  HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFH 91
           +L   +  AG+    ++  N   S+ +S +R I D R   C    +  DLP  SVI+ FH
Sbjct: 80  YLSSKFIKAGEDPYRQHAFNQLESDKLSSERPIRDTRHYRCTSVHHDNDLPSTSVIITFH 139

Query: 92  NEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTE 151
           NE  S+L+RT+ S++ R+P   ++EIILVDDFS+  D  Q L         KV+ +RN  
Sbjct: 140 NEARSTLLRTIKSVLIRSPGNLIQEIILVDDFSTDPDDCQLLTKI-----PKVKCLRNNR 194

Query: 152 REGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQT 211
           REGLIR+R RGA+ +   V+ FLD+HCEV   WL PLL  +  D   +  P+ID I    
Sbjct: 195 REGLIRSRVRGAELAAAPVLTFLDSHCEVNNEWLQPLLQRVKDDHTRVVSPIIDVISLDN 254

Query: 212 WEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFF 271
           + + +        RG F+W + +K  ++P  +   R   +   ++P  AGG+F +D+++F
Sbjct: 255 FAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSRTDPTSSIRTPVIAGGIFVIDKSWF 311

Query: 272 LELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKG 331
            +LG YD  + +WGGENFELSF++WMCGGS+E VPCSR+GHV+R   PY F        G
Sbjct: 312 NQLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYEFP------DG 365

Query: 332 PLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
             +TY  N KR +E W DE +K Y+Y   P A+    G ++++
Sbjct: 366 NALTYIKNTKRTVEVWMDE-YKQYYYQARPSAIGKSYGSVADR 407


>gi|29437281|gb|AAH49554.1| Galntl5 protein, partial [Mus musculus]
          Length = 434

 Score =  260 bits (665), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 133/328 (40%), Positives = 197/328 (60%), Gaps = 11/328 (3%)

Query: 45  LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
           L  YG+N   S  +  +R +PD R + C+   YP +LP AS+I+ F+NE F++L+R V S
Sbjct: 78  LRRYGLNAIMSRRLGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSS 137

Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           ++  +P   LEEIILVDD S   DL  KL+ Y++ F GKV+LIRN +REGLIR++  GA 
Sbjct: 138 VVNLSPQHLLEEIILVDDMSEFDDLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGAS 197

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            + G+++VFLD+HCEV   WL PLL  I  D K++  P+ID I+  T +    Y      
Sbjct: 198 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIV 253

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RG F+W +  + + +   E    +  S P +SP   GG+FA++R +F ELG YD G+ + 
Sbjct: 254 RGAFDWNLNLRWDNVFAYELDGPEGPSTPIRSPAMTGGIFAINRHYFNELGQYDNGMDIC 313

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
           GGEN ELS +IWMCGG +  +PCSR+G+  ++   +       R     ++ N  RV+  
Sbjct: 314 GGENVELSLRIWMCGGQLFILPCSRVGYNSKALSQHR------RANQSALSRNLLRVVHV 367

Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           W DE +K  F+ + P   ++  G+ISE+
Sbjct: 368 WLDE-YKGNFFLQRPSLTYVSCGNISER 394


>gi|12838270|dbj|BAB24147.1| unnamed protein product [Mus musculus]
          Length = 424

 Score =  260 bits (665), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 133/328 (40%), Positives = 197/328 (60%), Gaps = 11/328 (3%)

Query: 45  LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
           L  YG+N   S  +  +R +PD R + C+   YP +LP AS+I+ F+NE F++L+R V S
Sbjct: 78  LRRYGLNAIMSRRLGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSS 137

Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           ++  +P   LEEIILVDD S   DL  KL+ Y++ F GKV+LIRN +REGLIR++  GA 
Sbjct: 138 VVNLSPQHLLEEIILVDDMSEFDDLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGAS 197

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            + G+++VFLD+HCEV   WL PLL  I  D K++  P+ID I+  T +    Y      
Sbjct: 198 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIV 253

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RG F+W +  + + +   E    +  S P +SP   GG+FA++R +F ELG YD G+ + 
Sbjct: 254 RGAFDWNLNLRWDNVFAYELDGPEGPSTPIRSPAMTGGIFAINRHYFNELGQYDNGMDIC 313

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
           GGEN ELS +IWMCGG +  +PCSR+G+  ++   +       R     ++ N  RV+  
Sbjct: 314 GGENVELSLRIWMCGGQLFILPCSRVGYNSKALSQHR------RANQSALSRNLLRVVHV 367

Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           W DE +K  F+ + P   ++  G+ISE+
Sbjct: 368 WLDE-YKGNFFLQRPSLTYVSCGNISER 394


>gi|254553456|ref|NP_080725.2| putative polypeptide N-acetylgalactosaminyltransferase-like protein
           5 [Mus musculus]
 gi|51316084|sp|Q9D4M9.2|GLTL5_MOUSE RecName: Full=Putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5;
           AltName: Full=Polypeptide GalNAc transferase 15;
           Short=GalNAc-T15; Short=pp-GaNTase 15; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 15;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 15
 gi|148671134|gb|EDL03081.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 5, isoform CRA_b
           [Mus musculus]
 gi|148877565|gb|AAI45758.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 5 [Mus musculus]
          Length = 431

 Score =  260 bits (664), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 133/328 (40%), Positives = 197/328 (60%), Gaps = 11/328 (3%)

Query: 45  LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
           L  YG+N   S  +  +R +PD R + C+   YP +LP AS+I+ F+NE F++L+R V S
Sbjct: 78  LRRYGLNAIMSRRLGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSS 137

Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           ++  +P   LEEIILVDD S   DL  KL+ Y++ F GKV+LIRN +REGLIR++  GA 
Sbjct: 138 VVNLSPQHLLEEIILVDDMSEFDDLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGAS 197

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            + G+++VFLD+HCEV   WL PLL  I  D K++  P+ID I+  T +    Y      
Sbjct: 198 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIV 253

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RG F+W +  + + +   E    +  S P +SP   GG+FA++R +F ELG YD G+ + 
Sbjct: 254 RGAFDWNLNLRWDNVFAYELDGPEGPSTPIRSPAMTGGIFAINRHYFNELGQYDNGMDIC 313

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
           GGEN ELS +IWMCGG +  +PCSR+G+  ++   +       R     ++ N  RV+  
Sbjct: 314 GGENVELSLRIWMCGGQLFILPCSRVGYNSKALSQHR------RANQSALSRNLLRVVHV 367

Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           W DE +K  F+ + P   ++  G+ISE+
Sbjct: 368 WLDE-YKGNFFLQRPSLTYVSCGNISER 394


>gi|195120313|ref|XP_002004673.1| GI20058 [Drosophila mojavensis]
 gi|193909741|gb|EDW08608.1| GI20058 [Drosophila mojavensis]
          Length = 668

 Score =  260 bits (664), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 137/347 (39%), Positives = 197/347 (56%), Gaps = 18/347 (5%)

Query: 7   DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPD 66
           D  +   EP     K+G G  G+   +P   R            N+  S+ I  +RT+ D
Sbjct: 80  DYNINQFEP-----KQGEGADGRPVIIPLRDRFRMQRFFKLNSFNLLASDRIPLNRTLKD 134

Query: 67  LRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
            R  EC+   Y  ++P  SVI+VFHNE +S L+RT+ S+I R+P   L EIILVDD S +
Sbjct: 135 YRTNECREKRYTQNMPTTSVIIVFHNEAWSVLLRTITSVINRSPRHLLREIILVDDASDR 194

Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
           + L ++LE YI+      RL R  ER GL+  R  GA+ +RG+V+ FLDAHCE    WL 
Sbjct: 195 SFLKRQLEAYIEVLKVPTRLYRMKERSGLVPARLMGAQHARGDVLTFLDAHCECSRGWLE 254

Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK----ENELPER 242
           PLLA I   R ++  PVID I    + +   +E  +H+ G F W + ++    + +  + 
Sbjct: 255 PLLARIKESRNVVICPVIDIISDDNFSYTKTFE--NHW-GAFNWQLSFRWFSSDRKTRQA 311

Query: 243 EAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSI 302
            AK+ K ++ P  +P  AGGLFA+DR +F E+G YD  + +WGGEN E+SF+IW CGG I
Sbjct: 312 IAKENKDSTAPIATPGMAGGLFAIDRKYFYEMGAYDRDMRIWGGENVEMSFRIWQCGGRI 371

Query: 303 EWVPCSRIGHVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDE 348
           E  PCS +GH++RS  PY F G +++     ++T N  R    W D+
Sbjct: 372 EISPCSHVGHIFRSSTPYTFPGGMSE-----VLTSNLARAATVWMDD 413


>gi|403258971|ref|XP_003922013.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
           2 [Saimiri boliviensis boliviensis]
          Length = 967

 Score =  260 bits (664), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 148/380 (38%), Positives = 215/380 (56%), Gaps = 39/380 (10%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR-------------- 68
            PG+ G+   +P       +    E   N+  S+ I  DR I D R              
Sbjct: 437 APGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGEQLLLPLFPCS 496

Query: 69  -------------MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
                        +  C       +LP  SVI+ F +E +S+L+R+VHS++ R+P   ++
Sbjct: 497 HMTLAEIKTSLFLIHGCTEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIK 556

Query: 116 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
           EI+LVDDFS+K  L   L+ Y+ +F  KVR++R  ER GLIR R  GA+ + G+V+ FLD
Sbjct: 557 EILLVDDFSTKDYLKDNLDKYMSQF-PKVRILRLRERHGLIRARLAGAQNATGDVLTFLD 615

Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 235
           +H E  + WL PLL  +Y  RK +  PVI+ I+ +   + +V   D+  RGIF W M + 
Sbjct: 616 SHVECNVGWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFG 672

Query: 236 ENELP-EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
              +P +  AK R   ++  + P  AGGLF++D+++F ELG YDPGL VWGGEN ELSFK
Sbjct: 673 WRTIPPDVIAKNRIKETDVIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFK 732

Query: 295 IWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYF 354
           +WMCGG IE +PCSR+GH++R+  PY+F K  DR+K   +  N  RV E W DE +K  F
Sbjct: 733 VWMCGGEIEIIPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLVRVAEVWLDE-YKELF 787

Query: 355 YTR--EPLAMFLDMGDISEQ 372
           Y      +   LD+G++++Q
Sbjct: 788 YGHGDHLINQGLDVGNLTQQ 807


>gi|358336356|dbj|GAA28182.2| polypeptide N-acetylgalactosaminyltransferase [Clonorchis sinensis]
          Length = 592

 Score =  260 bits (664), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 154/382 (40%), Positives = 211/382 (55%), Gaps = 20/382 (5%)

Query: 2   PVFKADGKLGNLEPPLEPYKE-----GPGEGGKAYHLPEAY-----RAAGDASLGEYGMN 51
           PV     +L  L P   P K      GPGEG   Y +  +      +A  D    +   N
Sbjct: 62  PVLARPKELSGLSPSYPPPKSDQNSVGPGEGAVPYLVNRSALSVEEQAKYDKGFQDNAFN 121

Query: 52  METSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPA 111
              S+ IS  R IPD R   CK   +  DLPK +VI+ FHNE +S+L+R+VHS++  +P 
Sbjct: 122 QYASDRISVRRYIPDFRNGACKTQSFSSDLPKTAVIICFHNEAWSALLRSVHSVLDYSPK 181

Query: 112 QYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVI 171
           + L+EIILVDDFSS+  L + LE Y+Q+F   V++IR   REGLIR R  G   S  EV+
Sbjct: 182 ELLQEIILVDDFSSRDYLKEPLEIYMQQF-PVVKIIRTKRREGLIRARMVGTNVSTAEVL 240

Query: 172 VFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWG 231
            +LD+H E    WL PLL  I +    + VPVI+ I+ Q    ++  E      G F+W 
Sbjct: 241 TYLDSHIECTPGWLEPLLERIKASTSNVVVPVIEIINDQDLSMKATQEASVQVGG-FDWS 299

Query: 232 MLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFEL 291
           + +  +  P+R+  +      P +SPT AGGLFA+ R FF  LG YD  + VWGGEN EL
Sbjct: 300 LTFTWHLPPKRDQIRLGAPYSPIRSPTMAGGLFAIHRDFFAYLGYYDEEMEVWGGENLEL 359

Query: 292 SFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHK 351
           SFK WMCGG +E V CS +GH++RS  PY++    +  +   I +N  R+ ETW D+   
Sbjct: 360 SFKTWMCGGQLETVVCSHVGHIFRSRSPYSW----ESKRTSPIKFNLVRLAETWLDDYKF 415

Query: 352 AYFYTREPLAMFL-DMGDISEQ 372
            Y+   + L   L D GDIS +
Sbjct: 416 LYY---DSLNFDLGDYGDISSR 434


>gi|395732382|ref|XP_002812541.2| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 5 [Pongo abelii]
          Length = 967

 Score =  260 bits (664), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 150/387 (38%), Positives = 218/387 (56%), Gaps = 41/387 (10%)

Query: 16  PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR------- 68
           P +P  + PG+ G+   +P       +    E   N+  S+ I  DR I D R       
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGGQLF 489

Query: 69  --------------------MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
                               +  C       +LP  SVI+ F +E +S+L+R+VHS++ R
Sbjct: 490 LPLFPYSHMTLAEIKTPLFLIHGCAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNR 549

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
           +P   ++EI+LVDDFS+K  L   L+ Y+ +F  KVR++R  ER GLIR R  GA+ + G
Sbjct: 550 SPPHLIKEILLVDDFSTKDYLKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATG 608

Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
           +V+ FLD+H E  + WL PLL  +Y  RK +  PVI+ I+ +   + +V   D+  RGIF
Sbjct: 609 DVLTFLDSHVECNVGWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIF 665

Query: 229 EWGMLYKENELP-EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
            W M +    +P +  AK R   ++  + P  AGGLF++D+++F ELG YDPGL VWGGE
Sbjct: 666 VWPMNFGWRTIPPDVIAKNRIKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGE 725

Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
           N ELSFK+WMCGG IE +PCSR+GH++R+  PY+F K  DR+K   +  N  RV E W D
Sbjct: 726 NMELSFKVWMCGGEIEIIPCSRVGHIFRNDNPYSFPK--DRMK--TVERNLVRVAEVWLD 781

Query: 348 EKHKAYFYTR--EPLAMFLDMGDISEQ 372
           E +K  FY      +   LD G++++Q
Sbjct: 782 E-YKELFYGHGDHLIDQGLDAGNLTQQ 807


>gi|196007338|ref|XP_002113535.1| hypothetical protein TRIADDRAFT_27318 [Trichoplax adhaerens]
 gi|190583939|gb|EDV24009.1| hypothetical protein TRIADDRAFT_27318 [Trichoplax adhaerens]
          Length = 455

 Score =  260 bits (664), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 140/345 (40%), Positives = 204/345 (59%), Gaps = 15/345 (4%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY    A +   DA +     N    + +  DR +PD R   C+  +Y   LP  SVI+
Sbjct: 14  KAYIGATALKQGEDAYI-RNAFNQAECDKLPTDRGVPDTRDYSCRSLEYKHKLPTTSVII 72

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RT+ S++ R+P++ L+EIILVDDFS  A+ D +L   +     KV+ +R
Sbjct: 73  TFHNEARSALLRTIRSVLNRSPSELLKEIILVDDFSDNAN-DGRLLKILP----KVKTLR 127

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N +REGLIR+R RGA  ++G+V+ FLD+HCEV   WL PLL+ +  +  I+  P+ID I 
Sbjct: 128 NNKREGLIRSRVRGADLAKGDVLTFLDSHCEVNERWLEPLLSRVAQNETIVVSPIIDVIH 187

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK-YNSEPYKSPTHAGGLFAMD 267
             T+ +          +G F W + +K + +   E  +R  + + P K+P  AGGLF++ 
Sbjct: 188 MDTFNY---IGSSADLKGGFGWNLNFKWDSMTSEEQSQRAAHPTRPIKTPMIAGGLFSIS 244

Query: 268 RAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLAD 327
           + +F++ G YD G+ VWGGEN E+S +IWMCGGS+E VPCSR+GHV+R   PY F     
Sbjct: 245 KNWFIKSGKYDMGMDVWGGENLEISLRIWMCGGSLEIVPCSRVGHVFRKRHPYTFPGGG- 303

Query: 328 RVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
              G +   N +R  E W D   K ++Y REP A  +  GDIS++
Sbjct: 304 ---GFVFAKNTRRAAEAWMDGYAK-FYYKREPGARGVPYGDISDR 344


>gi|363731636|ref|XP_419581.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Gallus
           gallus]
          Length = 566

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 130/323 (40%), Positives = 200/323 (61%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++K++P
Sbjct: 100 NQVESDKLRMDRNIPDTRHDQCQRKQWRIDLPATSVVITFHNEARSALLRTVVSVLKKSP 159

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
           +  ++EIILVDD+S+  D D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 160 SHLIKEIILVDDYSNDPD-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 214

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE   +WL PLL  +  D+  +  P+ID I+   +++          +G F+W
Sbjct: 215 LTFLDSHCECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 271

Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+++F ELG YD  + VWGGEN 
Sbjct: 272 NLVFKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENL 331

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 332 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 386

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 387 YKNFYYAAVPSARNVPYGNIQSR 409


>gi|332839183|ref|XP_001147578.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           5 [Pan troglodytes]
          Length = 638

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 145/351 (41%), Positives = 206/351 (58%), Gaps = 18/351 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+   +         +    ++  N   S+ IS  R++ PD R  
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  L   SVI+VFHNE +S+L+RTV+S++  TPA  L+EIILVDD S++ 
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L +KLE Y+++    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D+ ++  P I  ID  T+EF + V     H RG F+W + +    LP  E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           CS +GHV+R+  P+ F K        +I  N  R+ E W D  +K  FY R
Sbjct: 402 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRR 446


>gi|348518337|ref|XP_003446688.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
           [Oreochromis niloticus]
          Length = 598

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 138/331 (41%), Positives = 188/331 (56%), Gaps = 17/331 (5%)

Query: 41  GDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMR 100
           GD     Y  N   S  I  DR + D R   C    Y  +LP  S+I+ FHNE  S+L+R
Sbjct: 116 GDDPYTLYAFNQRESERIPSDRALRDTRHYRCTTLHYDSELPSTSIIITFHNEARSTLLR 175

Query: 101 TVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRS 160
           T+ S++ RTP   + EIILVDDFS      Q L         KV+  RN +REGLIR+R 
Sbjct: 176 TIKSVLNRTPVHLIYEIILVDDFSDDESDCQLLTKL-----PKVKCFRNNKREGLIRSRV 230

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
           RG   +R +V+ FLD+HCEV  +WLPPLL  I  D   +  PVID I+  T+ + +    
Sbjct: 231 RGTDAARAKVLTFLDSHCEVNKDWLPPLLQRIKEDPSRVVSPVIDIINMDTFAYVAA--- 287

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F+W + +K  +L   +  +R   ++P K+P  AGGLF +DRA+F  LG YD  
Sbjct: 288 SADLRGGFDWSLHFKWEQLSPEQRARRTDPTQPIKTPIIAGGLFVIDRAWFNHLGKYDTA 347

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NY 338
           + +WGGENFE+SF++W CGGS+E +PCSR+GHV+R   PY F       +G   TY  N 
Sbjct: 348 MDIWGGENFEISFRVWQCGGSLEILPCSRVGHVFRKKHPYVFP------EGNANTYIKNT 401

Query: 339 KRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +R  E W D+  + ++Y+  P A     GDI
Sbjct: 402 RRTAEVWMDD-FRLFYYSARPAARGKSYGDI 431


>gi|410916145|ref|XP_003971547.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
           [Takifugu rubripes]
          Length = 579

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 139/331 (41%), Positives = 188/331 (56%), Gaps = 17/331 (5%)

Query: 41  GDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMR 100
           GD     Y  N   S  I  +R + D R   C    Y  DLP  S+I+ FHNE  S+L+R
Sbjct: 97  GDDPYTLYAFNQRESERIPSNRALRDTRHFRCATIRYDSDLPPTSIIITFHNEARSTLLR 156

Query: 101 TVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRS 160
           TV S++ RTP   + EIILVDDFS      Q L         KVR +RN +REGLIR+R 
Sbjct: 157 TVRSVLNRTPVHLIHEIILVDDFSDDESDCQLLIKL-----PKVRCVRNPQREGLIRSRV 211

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
           RGA  ++  V+ FLD+HCEV  +WLPPLL  I  D   +  PVID I+  T+ + +    
Sbjct: 212 RGADSAKAAVLTFLDSHCEVNKDWLPPLLQRIKQDPTRVVSPVIDIINMDTFAYVAA--- 268

Query: 221 DHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPG 280
               RG F+W + +K  +L   +  +R   ++P K+P  AGGLF +DR++F  LG YD  
Sbjct: 269 SADLRGGFDWSLHFKWEQLSPEQRARRTDPAQPIKTPIIAGGLFVIDRSWFNHLGKYDTA 328

Query: 281 LLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NY 338
           + +WGGENFE+SF++W CGGS+E +PCSR+GHV+R   PY F       +G   TY  N 
Sbjct: 329 MDIWGGENFEISFRVWQCGGSLEILPCSRVGHVFRKKHPYVFP------EGNANTYIKNT 382

Query: 339 KRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
           +R  E W D+    ++Y+  P A     GDI
Sbjct: 383 RRTAEVWMDD-FSLFYYSARPAARGKSYGDI 412


>gi|47228720|emb|CAG07452.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 611

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 147/365 (40%), Positives = 213/365 (58%), Gaps = 20/365 (5%)

Query: 15  PPLEPYKEGPGEGGKAYH----LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRM 69
           PP +P    PG  GKA+      PE      +  +  +  N   S+ IS  R++  D R 
Sbjct: 100 PPQDP--GSPGADGKAFQKDQMTPEEENEKKEG-MTRHCFNQFASDRISLSRSLGDDTRP 156

Query: 70  EEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
            EC   K+   P  LP  SVI+VFHNE +S+L+RTV+S++  +PA  L+EIILVDD S+ 
Sbjct: 157 PECVERKFLRCPA-LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASAA 215

Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
             L ++LE ++ +    VR++R  ER+GLI  R  GA  ++GEV+ FLDAHCE    WL 
Sbjct: 216 DHLKEQLEVFVHQLK-IVRVVRQPERKGLITARLLGASVAQGEVLTFLDAHCECFHGWLE 274

Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY-RGIFEWGMLYKENELPEREAK 245
           PLLA I  +   +  P I  ID +T++F       H Y RG F+WG+ +   ++PE   K
Sbjct: 275 PLLARIVEEPTAVVSPEITTIDLETFQFNKPVASSHAYNRGNFDWGLTFGWEQIPEAARK 334

Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
            RK  + P K+PT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +
Sbjct: 335 LRKDETYPVKTPTFAGGLFSILKSYFEHIGTYDDKMEIWGGENIEMSFRVWQCGGQLEII 394

Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLD 365
           PCS +GHV+R+  P+ F K  D     +IT N  R+ E W D+ +K  FY R   A  + 
Sbjct: 395 PCSVVGHVFRTKSPHTFPKGTD-----VITRNQVRLAEVWMDD-YKKIFYRRNRNAENMA 448

Query: 366 MGDIS 370
             D++
Sbjct: 449 KEDLT 453


>gi|260836667|ref|XP_002613327.1| hypothetical protein BRAFLDRAFT_118726 [Branchiostoma floridae]
 gi|229298712|gb|EEN69336.1| hypothetical protein BRAFLDRAFT_118726 [Branchiostoma floridae]
          Length = 545

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 140/311 (45%), Positives = 195/311 (62%), Gaps = 12/311 (3%)

Query: 67  LRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
            R   CK   YP  LP  SVI+ F +E FS++MR+VHSII RTP   L E+ILVDD S++
Sbjct: 83  CRQVRCKTKKYPEYLPPTSVIMCFTDEAFSAVMRSVHSIINRTPPHLLAEVILVDDNSTR 142

Query: 127 ADLDQKLEDYIQRFNG--KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
           A+L   L+DY++R  G  KV+++   +REGLIR R RGA+++ G V+ FLDAH E  + W
Sbjct: 143 AELKGHLDDYVRRQVGWDKVKVVHLEKREGLIRCRLRGAEKAVGPVLTFLDAHIECNVGW 202

Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRS-VYEPDHHYRGIFEWGMLYKENELPERE 243
           + PLL  I+ +R  + +P+I+ ID +T+E+   V    +  RG F W + +    +PE E
Sbjct: 203 VEPLLHRIWENRSNVVMPIIEAIDDKTFEYHGGVQSSRYAQRGGFSWELHFDWRVIPEYE 262

Query: 244 AKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSI 302
            K+ K + + P +SPT AGGLF++D+++F ELG YD  +  WGGEN ELSFKIWMCGG++
Sbjct: 263 IKRWKGDETTPIRSPTMAGGLFSIDKSYFYELGTYDDKMDTWGGENLELSFKIWMCGGTL 322

Query: 303 EWVPCSRIGHVYRSFMPYNFGKLADRVKGP-LITYNYKRVIETWFDEKHKAYFYTREPLA 361
           E  PCS++GHV+RS  PY+         GP     N  RV+E W D  +K  FY   P  
Sbjct: 323 EQPPCSKVGHVFRSSAPYS------NPSGPKTFIRNTLRVVEVWLDS-YKDLFYALNPHM 375

Query: 362 MFLDMGDISEQ 372
                GD+SE+
Sbjct: 376 QGEPYGDVSER 386


>gi|47216191|emb|CAG01225.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 586

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 136/346 (39%), Positives = 202/346 (58%), Gaps = 18/346 (5%)

Query: 29  KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 88
           KAY L E     G     ++  N+  S+ +  +R I D R   C    Y  +LP  S+I+
Sbjct: 110 KAY-LTEKLLKPGVDPYQDHAFNVLESDRVGSERAIRDTRHYRCASISYDPELPSTSIII 168

Query: 89  VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 148
            FHNE  S+L+RTV S++ R+P   ++EIIL+DDFSS  + D +L  +I     KVR +R
Sbjct: 169 TFHNEARSTLLRTVKSVLMRSPPSLIQEIILIDDFSSDPE-DCQLLVHIP----KVRCLR 223

Query: 149 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 208
           N  REGLIR+R RGA  +   ++ FLD+HCEV  +WL P++  +  D   +  P+ID I 
Sbjct: 224 NVRREGLIRSRVRGANAASAPILTFLDSHCEVNTDWLQPMIQRVKEDHTRVVSPIIDVIS 283

Query: 209 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDR 268
              + + +        RG F+W + +K  ++P  +   R   ++P ++P  AGG+F MD+
Sbjct: 284 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMARSDPTQPIRTPVIAGGIFVMDK 340

Query: 269 AFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADR 328
           ++F  LG YD  + +WGGENFELSF++WMCGGS+E +PCSR+GHV+R   PY F      
Sbjct: 341 SWFNRLGQYDTHMDIWGGENFELSFRVWMCGGSLEILPCSRVGHVFRKRHPYEFP----- 395

Query: 329 VKGPLITY--NYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +G  +TY  N +R  E W DE +K Y+Y+  P A     G I+++
Sbjct: 396 -EGNALTYIRNTRRAAEVWMDE-YKQYYYSARPSAQGKAFGSITDR 439


>gi|327274929|ref|XP_003222227.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           protein 2-like [Anolis carolinensis]
          Length = 605

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 142/345 (41%), Positives = 196/345 (56%), Gaps = 19/345 (5%)

Query: 32  HLPEA--YRAAGDAS------LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 83
           HL E   ++   DAS      L  YG N   S  I   R +P++R   C   +   +LP 
Sbjct: 139 HLAEEDEFQNQTDASEQTIDGLEIYGFNEALSKQIPLHRELPEVRHPLCLQQEPSPNLPT 198

Query: 84  ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 143
           ASV++ FH+E +S+L+RTVHS++   P  +L+EIILVDD S++  L   L +YI +  G 
Sbjct: 199 ASVVICFHDEAWSTLLRTVHSVLDTAPRDFLKEIILVDDLSTQEYLKSSLSEYISKLPG- 257

Query: 144 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 203
           V+LIR+  R G+I+ R  GA  + GEV+VF+D+HCE    WL PLL  + SDR  +  PV
Sbjct: 258 VKLIRSNRRLGVIQGRMLGAARATGEVVVFMDSHCECHNGWLEPLLERLASDRSRIVSPV 317

Query: 204 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 263
           ID ID++T+++    E     RG+F+W + +    L E E K R     P +SP   GG+
Sbjct: 318 IDVIDWKTFQYHHTMELQ---RGVFDWKLDFHWKPLTEHEKKVRPSPVSPIRSPAVPGGV 374

Query: 264 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFG 323
            A+ R  F   GGYD  + + GGEN ELS K W+CGGS+E +PCSR+GHVYR+ MPYNF 
Sbjct: 375 IAVHRHHFQNTGGYDSDMTLLGGENIELSIKAWLCGGSVEILPCSRVGHVYRTGMPYNFS 434

Query: 324 KLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
                     I  N  R+ ETW D   K  FY  + LA  +   +
Sbjct: 435 ------DEKAIERNKIRIAETWLD-SFKHLFYQHDRLACLISKAE 472


>gi|312374382|gb|EFR21947.1| hypothetical protein AND_15990 [Anopheles darlingi]
          Length = 669

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 137/328 (41%), Positives = 197/328 (60%), Gaps = 18/328 (5%)

Query: 51  NMETSNHISFDRTIPDLRMEECK--YWDYPLD---LPKASVILVFHNEGFSSLMRTVHSI 105
           N + S+ +  +R +PD R   C+   W        LP  SVI+ FHNE  S+L+RTV S+
Sbjct: 196 NQQASDGLKSNRELPDTRNAMCRRTSWSSATSIESLPATSVIITFHNEARSTLLRTVVSV 255

Query: 106 IKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKE 165
           + R+P + + EIILVDDFS   +  Q+L   IQ    KVRLIRN +REGL+R+R  GA  
Sbjct: 256 LNRSPERLIHEIILVDDFSDFPEDGQELAK-IQ----KVRLIRNAKREGLVRSRVTGAAA 310

Query: 166 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 225
           +  +V+ FLD+HCE  ++WL PLLA +  D   +  PVID I   T+++          R
Sbjct: 311 ATAKVLTFLDSHCECNVHWLEPLLARVAEDPTRVVCPVIDVISMDTFQY---IGASADLR 367

Query: 226 GIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           G F+W +++K   L   E K+R+ + + P ++P  AGGLF +DR++F +LG YD  + +W
Sbjct: 368 GGFDWNLVFKWEYLSGAERKERQRDPTAPIRTPMIAGGLFVIDRSYFEKLGTYDTQMDIW 427

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
           GGEN E+SF++W CGGS+E +PCSR+GHV+R   PY F        G +   N +R  E 
Sbjct: 428 GGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYTFPGGG---SGNIFAKNTRRAAEV 484

Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           W DE +K Y+Y   PLA  +  GDI ++
Sbjct: 485 WMDE-YKRYYYAAVPLATNIPFGDIEDR 511


>gi|307215388|gb|EFN90069.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Harpegnathos
           saltator]
          Length = 493

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 132/308 (42%), Positives = 199/308 (64%), Gaps = 13/308 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
           N+  S+ I  +R++PD+R ++C  +Y +    LPK S+I+VFHNE +S+L+RTVHS+I R
Sbjct: 11  NLMASDRIPLNRSLPDVRKKKCISRYANLG-KLPKTSIIIVFHNEAWSTLLRTVHSVIDR 69

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
           +P + LEEIILVDD S +  L   L++Y+++ +   +++R+TER GLI+ R  GA +++G
Sbjct: 70  SPRELLEEIILVDDNSEREFLKNPLDEYVKKLSVPTKVLRSTERVGLIKARLLGASDAKG 129

Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
           EV+ FLDAHCE  + WL PLL  +  +   +  PVID I+  T+ +   +E   H+ G F
Sbjct: 130 EVLTFLDAHCECTVGWLEPLLEAVGKNATRIISPVIDIINDNTFSYTRSFE--LHW-GAF 186

Query: 229 EWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
            W + ++   L  R  K+R+ +  EP+++P  AGGLF+M+R +F +LG YD  + +WGGE
Sbjct: 187 NWDLHFRWLTLNGRLLKERRESIVEPFRTPAMAGGLFSMNRNYFFQLGSYDDQMRIWGGE 246

Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWF 346
           N ELSF+ W CGGSIE  PCS +GH++R   PY F G + D + G L+     RV   W 
Sbjct: 247 NLELSFRAWQCGGSIEIAPCSHVGHLFRKSSPYTFPGGVGDILYGNLV-----RVASVWM 301

Query: 347 DEKHKAYF 354
           D+  + YF
Sbjct: 302 DQWAEFYF 309


>gi|194755004|ref|XP_001959782.1| GF13042 [Drosophila ananassae]
 gi|190621080|gb|EDV36604.1| GF13042 [Drosophila ananassae]
          Length = 599

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 147/369 (39%), Positives = 210/369 (56%), Gaps = 13/369 (3%)

Query: 12  NLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
           +++  L+  + G GE G A HL  A +  G+A   +  +N E S  + ++R++ D R   
Sbjct: 76  SIQLDLQKQRVGLGEQGVAVHLTGAAKERGEAIYKKIALNEELSEQLLYNRSVGDHRNPL 135

Query: 72  CKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
           C    + +D LP ASV+++F NE +S L+RTVHS +     + L+EIILVDD S   +L 
Sbjct: 136 CAAERFDVDTLPTASVVIIFFNEPYSVLLRTVHSTLTTCNEKALKEIILVDDGSDNPELG 195

Query: 131 QKLEDYIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
            KL+ YI+     GKV ++R   R GLIR R  GA+ + G+V++FLDAHCE  + W  PL
Sbjct: 196 GKLDYYIRTRIPAGKVTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNIGWCEPL 255

Query: 189 LAPIYSDRKIMTVPVIDGID-----YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPERE 243
           L  I   R  + VP+ID ID     Y T  ++S       + G F+W  L +  +  +R 
Sbjct: 256 LQRIKESRTSVLVPIIDVIDANDFQYSTNGYKSFQVGGFQWNGHFDWINLPEREKQRQRR 315

Query: 244 AKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIE 303
             K++    P  SPT AGGLFAMDR +F E+G YD  +  WGGEN E+SF+IW CGG+IE
Sbjct: 316 ECKQQREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIE 375

Query: 304 WVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMF 363
            +PCSR+GH++R F PY F    DR    +   N  R+   W DE    +F  R  L   
Sbjct: 376 TIPCSRVGHIFRDFHPYKFPN--DRDTHGI---NTARMALVWMDEYINIFFLNRPDLKFH 430

Query: 364 LDMGDISEQ 372
            D+GD++ +
Sbjct: 431 ADIGDVTHR 439


>gi|194384516|dbj|BAG59418.1| unnamed protein product [Homo sapiens]
          Length = 603

 Score =  259 bits (662), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 150/370 (40%), Positives = 213/370 (57%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+   +         +    ++  N   S+ IS  R++ PD R  
Sbjct: 87  PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 144

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  L   SVI+VFHNE +S+L+RTV+S++  TPA  L+EIILVDD S++ 
Sbjct: 145 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 203

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L +KLE Y+++    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE     L P
Sbjct: 204 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGRLEP 262

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D+ ++  P I  ID  T+EF + V     H RG F+W + +    LP  E ++
Sbjct: 263 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 322

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 323 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 382

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
           CS +GHV+R+  P+ F K        +I  N  R+ E W D  +K  FY R      +A 
Sbjct: 383 CSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDS-YKKIFYRRNLQAAKMAQ 436

Query: 363 FLDMGDISEQ 372
               GDISE+
Sbjct: 437 EKSFGDISER 446


>gi|313241234|emb|CBY33515.1| unnamed protein product [Oikopleura dioica]
          Length = 603

 Score =  259 bits (662), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 145/372 (38%), Positives = 214/372 (57%), Gaps = 26/372 (6%)

Query: 15  PPLEPYKEG----PGEGGKAYHLPEAYRAAGDAS--LGEYGMNMETSNHISFDRTIPDLR 68
           PP+ P   G     G+GGK+  L E  + + +    +  + +N   S  IS  RT+ + R
Sbjct: 73  PPVLPRPLGDAITEGQGGKSVKLTEEQKKSDEYKKIVDRFMVNHLASERISLHRTVGEHR 132

Query: 69  MEEC-----KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 123
            ++C     K + Y   LP  SVI+ F+NEG+++L+RT++SI+  +P   L+EIIL+DD 
Sbjct: 133 HKQCVALANKGYRYD-QLPTTSVIVTFYNEGWTTLLRTIYSILHTSPEVLLKEIILIDDD 191

Query: 124 SSKAD---LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           S K +   L ++LED +     +VRLIR  +REGL+R R  GA+ + GEV+ FLD H E 
Sbjct: 192 SDKVEFPRLGKELEDIVATM-PRVRLIRTKQREGLVRARLLGAELASGEVLTFLDCHIEC 250

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
              WL PLL  I  D  ++ VP+I  I +Q + F           G F+W + ++ + +P
Sbjct: 251 NNGWLEPLLQRIAEDDSVVAVPIISTIAWQDFAFHHSSNSIEPQIGGFDWRLTFQWHSIP 310

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
           +    KRK +++P  +PT AGGLFA+ R +F  +G YD G+ VWGGEN E+SF++WMCGG
Sbjct: 311 DEIKAKRKADTDPVPTPTMAGGLFAVSRQYFRSIGSYDTGMEVWGGENLEMSFRVWMCGG 370

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
           S+E +PCS +GHV+    PY              T N  R +E W D+ +K +FY R PL
Sbjct: 371 SLEIIPCSIVGHVFPKTAPYERKSF---------TPNTVRAVEVWLDD-YKRHFYARNPL 420

Query: 361 AMFLDMGDISEQ 372
           +     GDISE+
Sbjct: 421 SKDEKYGDISER 432


>gi|198474479|ref|XP_002132699.1| GA25744 [Drosophila pseudoobscura pseudoobscura]
 gi|198138409|gb|EDY70101.1| GA25744 [Drosophila pseudoobscura pseudoobscura]
          Length = 635

 Score =  259 bits (662), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 151/367 (41%), Positives = 213/367 (58%), Gaps = 21/367 (5%)

Query: 18  EPYKEGPGEGGKAYHLPEA--YRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 75
           +  K G GE G    + +   Y+     S+ + G N   S+ IS +RTI D R   CK  
Sbjct: 85  DSLKTGLGEQGLRIAIEDTKEYQEMIAMSIKK-GFNSLLSDKISVNRTIADTRPLRCKSR 143

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
            Y + LP  SVI+VFHN   S L+R +HSII RTP + L E+ILVDD S+  +L ++L+ 
Sbjct: 144 KYLVKLPNVSVIMVFHNTHLSVLLRAIHSIINRTPHELLHEVILVDDGSTAQELQEQLDK 203

Query: 136 YI-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           Y+ + F  KV +IR  +R G+   R  G   + G V+VF DA  EV  NWLPPLL P+  
Sbjct: 204 YVNEHFGSKVSIIRQKKRTGMPAARVAGVNSANGTVMVFCDASIEVIYNWLPPLLEPMTL 263

Query: 195 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPY 254
             KI+T P++D ID   + F+  +     +RG F+W   +  N+LP  + +  K  S+PY
Sbjct: 264 HYKIVTSPILDEIDNTDFSFK--WSDPLLWRGGFDWH--FNFNKLPVLQ-EDIKGESQPY 318

Query: 255 KSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVY 314
           ++P   G +FA+DR +FLELGGYD GL   GGE +E+SFKIWMCGG +  VPCSR+GH+ 
Sbjct: 319 RNPVMEGTVFAIDRKYFLELGGYDEGLDASGGEQYEMSFKIWMCGGMLLQVPCSRVGHI- 377

Query: 315 RSFMP-------YNFGKLADRVKGP--LITYNYKRVIETWFDEKHKAYFYTREPLAMFLD 365
            +  P       +  G+L +  KG    +T NYKRV E W D  +K Y Y R+P    ++
Sbjct: 378 -AIDPKDAQDPTWQKGELLESTKGEYDTLTRNYKRVAEVWMD-GYKHYLYLRDPYKYHIN 435

Query: 366 MGDISEQ 372
            G+++ Q
Sbjct: 436 AGNVTRQ 442


>gi|348575518|ref|XP_003473535.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Cavia porcellus]
          Length = 531

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 132/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R E+C+   + +DLP  SV++ FHNE  S+L+RTV S++KR+P
Sbjct: 65  NQVESDKLRMDRAIPDTRHEQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 124

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 125 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 179

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE   +WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 180 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 236

Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 237 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 296

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 297 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 351

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 352 YKNFYYAAVPSARNVPYGNIQSR 374


>gi|195148070|ref|XP_002014997.1| GL18654 [Drosophila persimilis]
 gi|194106950|gb|EDW28993.1| GL18654 [Drosophila persimilis]
          Length = 635

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 144/334 (43%), Positives = 200/334 (59%), Gaps = 18/334 (5%)

Query: 49  GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
           G N   S+ IS +RTI D R   CK   Y + LP  SVI+VFHN   S L+R +HSII R
Sbjct: 117 GFNSLLSDKISVNRTIADTRPLRCKSRKYLVKLPNVSVIMVFHNTHLSVLLRAIHSIINR 176

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESR 167
           TP + L E+ILVDD S+  +L ++L+ Y+ + F  KV +IR  +R G+   R  G   + 
Sbjct: 177 TPHELLHEVILVDDGSTAQELQEQLDKYVNEHFGSKVSIIRQKKRTGMPAARVAGVNSAN 236

Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
           G V+VF DA  EV  NWLPPLL P+    KI+T P++D ID   + F+  +     +RG 
Sbjct: 237 GTVMVFCDASIEVIYNWLPPLLEPMTLHYKIVTSPILDEIDNTDFSFK--WSDPLLWRGG 294

Query: 228 FEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
           F+W   +  N+LP  + +  K  S+PY++P   G +FA+DR +FLELGGYD GL   GGE
Sbjct: 295 FDWH--FNFNKLPVLQ-EDIKGESQPYRNPVMEGTVFAIDRKYFLELGGYDEGLDASGGE 351

Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP-------YNFGKLADRVKGP--LITYNY 338
            +E+SFKIWMCGG +  VPCSR+GH+  +  P       +  G+L +  KG    +T NY
Sbjct: 352 QYEMSFKIWMCGGMLLQVPCSRVGHI--AIDPKDAQDPTWQKGELLESTKGEYDTLTRNY 409

Query: 339 KRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           KRV E W D  +K Y Y R+P    ++ G+++ Q
Sbjct: 410 KRVAEVWMD-GYKHYLYLRDPYKYHINAGNVTRQ 442


>gi|256052108|ref|XP_002569620.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
          Length = 573

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 145/375 (38%), Positives = 216/375 (57%), Gaps = 20/375 (5%)

Query: 3   VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 62
           +  + GK G     L    E  G+ G+   L E  +A    +      N+  SN I   R
Sbjct: 51  IADSSGKFG-----LHDQSEKFGDMGRPVVLSEFLKAESKLTFHLNEFNLVVSNLIGTRR 105

Query: 63  TIPDLRMEECKYWDYPLD--LP-KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 119
            + D R   C++   PLD  LP K SVI+VFHNE +S+L+RTVHS++ RTP Q L EIIL
Sbjct: 106 NLDDFRHPSCRH-QIPLDKLLPFKTSVIIVFHNEAWSALLRTVHSVLDRTPVQLLHEIIL 164

Query: 120 VDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 179
           VDD S+++ L  +L++Y++  N  VR+ R + R GLIR R  GAK S G+ + FLDAHCE
Sbjct: 165 VDDASTQSHLGDQLKNYVKSLNKPVRIERMSSRSGLIRARLHGAKISTGKTLTFLDAHCE 224

Query: 180 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 239
           V + WL  LL  I  ++K +  P+ID I + T+E+  +   D  + G F+W   +    +
Sbjct: 225 VTIGWLETLLKHISENQKRIVCPIIDVISHDTFEY--LLGSDRTW-GTFDWQFNFHWETV 281

Query: 240 PEREAKK-RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMC 298
            +RE  +    ++ P ++PT AGGLF + R +F E+G YD  + +WGGEN ELSF++W C
Sbjct: 282 VDREIDRINDEHNVPLRTPTMAGGLFTITREYFYEIGAYDEDMEIWGGENIELSFRVWQC 341

Query: 299 GGSIEWVPCSRIGHVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           GG +   PCSR+GHV+R   PY + G ++      ++  N+ R    W D+  + YF   
Sbjct: 342 GGELLIDPCSRVGHVFRKSSPYTWPGGVSH-----ILHKNFVRTALVWLDQYSRFYFML- 395

Query: 358 EPLAMFLDMGDISEQ 372
            P A+ +D GD++++
Sbjct: 396 NPSALSVDYGDVTKR 410


>gi|383857913|ref|XP_003704448.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like [Megachile rotundata]
          Length = 638

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 156/360 (43%), Positives = 206/360 (57%), Gaps = 29/360 (8%)

Query: 11  GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 65
           G L  P E     PGE G+   LP     E  +   D  L     N   S+ IS  RT+P
Sbjct: 86  GVLVAPREQDSSAPGEMGRPVILPTNLTAETKKLVDDGWLNN-AFNQYVSDLISVHRTLP 144

Query: 66  DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 124
           D R   CK    Y  +LP  +VI+ FHNE +S L+RTVHS++ R+P   ++EIILVDD+S
Sbjct: 145 DPRDPWCKEPGRYLKELPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDYS 204

Query: 125 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 184
               L ++LEDY+  +  KV++IR  +REGLIR R  GA  ++  V+ +LD+HCE    W
Sbjct: 205 DMPHLQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGW 263

Query: 185 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 237
           L PLL  I  D   +  PVID ID  T E+        H+R       G F+W + +  +
Sbjct: 264 LEPLLDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 315

Query: 238 ELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 297
            +PERE K+ K  +EP  SPT AGGLF++DRAFF  LG YD G  +WGGEN ELSFK WM
Sbjct: 316 AVPEREKKRHKNPAEPVWSPTMAGGLFSIDRAFFERLGTYDSGFDIWGGENLELSFKTWM 375

Query: 298 CGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTR 357
           CGG++E VPCS +GH++R   PY +     R    ++  N  R+ E W DE  K Y+Y R
Sbjct: 376 CGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLKRNSIRLSEVWLDEYAK-YYYQR 429


>gi|281348732|gb|EFB24316.1| hypothetical protein PANDA_010523 [Ailuropoda melanoleuca]
          Length = 621

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 151/370 (40%), Positives = 210/370 (56%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+H  +         +    ++  N   S+ IS  R + PD R  
Sbjct: 106 PPQDP--NSPGADGKAFHKDKWTPMETQEKEEGYKKHCFNAFASDRISLQRALGPDTRPP 163

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  LP  SVI+VFHNE +S+L+RTV+S++  +PA  L EIILVDD S+  
Sbjct: 164 ECVDQKFRRCP-PLPATSVIIVFHNEAWSTLLRTVYSVLHTSPAILLREIILVDDASTDD 222

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L  +LE Y+++    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 223 YLKDQLEQYVKKLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  +   +  P I  ID  T+EF + V     H RG F+W + +    LP  E ++
Sbjct: 282 LLARIAEEETAVVSPDIVTIDLNTFEFSKPVPSGRIHSRGNFDWSLTFGWEALPAHEKQR 341

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +A+F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
           CS +GHV+R+  P+ F K        +I  N  R+ E W D  +K  FY R      +A 
Sbjct: 402 CSVVGHVFRTKSPHTFPKGIS-----VIARNQVRLAEVWMD-SYKEIFYRRNMQAAKMAQ 455

Query: 363 FLDMGDISEQ 372
               GDISE+
Sbjct: 456 EKSFGDISER 465


>gi|189236651|ref|XP_969621.2| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
           castaneum]
 gi|270005204|gb|EFA01652.1| hypothetical protein TcasGA2_TC007223 [Tribolium castaneum]
          Length = 564

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 137/324 (42%), Positives = 201/324 (62%), Gaps = 16/324 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N E S+++  +R IPD R   C+   +  DLP  SVI+ FHNE  S+L+RTV S++ R+P
Sbjct: 100 NQEASDNLPSNREIPDTRNAMCRRKLWRTDLPPTSVIITFHNEARSTLLRTVVSVLNRSP 159

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDDFS   +  ++L   IQ    KVR++RN +REGL+R+R RGA  +   V
Sbjct: 160 EHLIKEIILVDDFSDNPEDGEELAK-IQ----KVRVLRNDKREGLMRSRVRGADAATASV 214

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE  +NWL PLL  +  D   +  PVID I   T+++          RG F+W
Sbjct: 215 LTFLDSHCECNVNWLEPLLERVAEDPTRVVCPVIDVISMDTFQYIGA---SADLRGGFDW 271

Query: 231 GMLYKENEL--PEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
            +++K   L   ERE+++R   ++  ++P  AGGLF +++A+F +LG YD  + VWGGEN
Sbjct: 272 NLVFKWEYLGYAERESRQRD-PTQAIRTPMIAGGLFVINKAYFEKLGKYDMKMDVWGGEN 330

Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
            E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W D+
Sbjct: 331 LEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYTFPGGS----GNVFARNTRRAAEVWMDD 386

Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
            +K ++Y   PLA  +  GDISE+
Sbjct: 387 -YKHFYYAAVPLAKNIPFGDISER 409


>gi|301772392|ref|XP_002921627.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
           [Ailuropoda melanoleuca]
          Length = 622

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 151/370 (40%), Positives = 210/370 (56%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+H  +         +    ++  N   S+ IS  R + PD R  
Sbjct: 106 PPQDP--NSPGADGKAFHKDKWTPMETQEKEEGYKKHCFNAFASDRISLQRALGPDTRPP 163

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  LP  SVI+VFHNE +S+L+RTV+S++  +PA  L EIILVDD S+  
Sbjct: 164 ECVDQKFRRCP-PLPATSVIIVFHNEAWSTLLRTVYSVLHTSPAILLREIILVDDASTDD 222

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L  +LE Y+++    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 223 YLKDQLEQYVKKLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  +   +  P I  ID  T+EF + V     H RG F+W + +    LP  E ++
Sbjct: 282 LLARIAEEETAVVSPDIVTIDLNTFEFSKPVPSGRIHSRGNFDWSLTFGWEALPAHEKQR 341

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +A+F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIP 401

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
           CS +GHV+R+  P+ F K        +I  N  R+ E W D  +K  FY R      +A 
Sbjct: 402 CSVVGHVFRTKSPHTFPKGIS-----VIARNQVRLAEVWMD-SYKEIFYRRNMQAAKMAQ 455

Query: 363 FLDMGDISEQ 372
               GDISE+
Sbjct: 456 EKSFGDISER 465


>gi|149714568|ref|XP_001504374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Equus
           caballus]
          Length = 622

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 152/371 (40%), Positives = 211/371 (56%), Gaps = 24/371 (6%)

Query: 15  PPLEPYKEGPGEGGKAYH----LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRM 69
           PP +P    PG  GKA+      P+  +   +    ++  N   S+ IS  R + PD R 
Sbjct: 106 PPQDP--SSPGADGKAFQKDKWTPQETQEK-EEGYKKHCFNAFASDRISLQRALGPDTRP 162

Query: 70  EEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
            EC   K+   P  LP  SVI+VFHNE +S+L+RTV+S++  TPA  L EIILVDD S+ 
Sbjct: 163 PECVDQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLREIILVDDASTD 221

Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
             L ++LE Y+++    VR++R  ER GLI  R  GA  ++ EV+ FLDAHCE    WL 
Sbjct: 222 EYLKEQLEQYVKQLQ-VVRVVRQKERTGLITARLLGASVAQAEVLTFLDAHCECFHGWLE 280

Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
           PLLA I  D   +  P I  ID  T+EF + V     H RG F+W + +    LP  E +
Sbjct: 281 PLLARIAEDETAVVSPDIVTIDLNTFEFSKPVQRGRVHSRGNFDWSLSFGWEALPPHEKQ 340

Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
           +RK  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +
Sbjct: 341 RRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEII 400

Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLA 361
           PCS +GHV+R+  P+ F K        +I  N  R+ E W D  +K  FY R      +A
Sbjct: 401 PCSVVGHVFRTKSPHTFPKGIS-----VIARNQVRLAEVWMD-GYKEIFYRRNMQAAKMA 454

Query: 362 MFLDMGDISEQ 372
                GDISE+
Sbjct: 455 QEKSFGDISER 465


>gi|313231736|emb|CBY08849.1| unnamed protein product [Oikopleura dioica]
          Length = 603

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 145/372 (38%), Positives = 214/372 (57%), Gaps = 26/372 (6%)

Query: 15  PPLEPYKEG----PGEGGKAYHLPEAYRAAGDAS--LGEYGMNMETSNHISFDRTIPDLR 68
           PP+ P   G     G+GGK+  L E  + + +    +  + +N   S  IS  RT+ + R
Sbjct: 73  PPVLPRPLGDAITEGQGGKSVKLTEEQKKSDEYKKIVDRFMVNHLASERISLHRTVGEHR 132

Query: 69  MEEC-----KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 123
            ++C     K + Y   LP  SVI+ F+NEG+++L+RT++SI+  +P   L+EIIL+DD 
Sbjct: 133 HKQCVALANKGYRYD-QLPTTSVIVTFYNEGWTTLLRTIYSILHTSPEVLLKEIILIDDD 191

Query: 124 SSKAD---LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 180
           S K +   L ++LED +     +VRLIR  +REGL+R R  GA+ + GEV+ FLD H E 
Sbjct: 192 SDKVEFPRLGKELEDIVATM-PRVRLIRTKQREGLVRARLLGAELASGEVLTFLDCHIEC 250

Query: 181 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 240
              WL PLL  I  D  ++ VP+I  I +Q + F           G F+W + ++ + +P
Sbjct: 251 NDGWLEPLLQRIAEDDSVVAVPIISTIAWQDFGFHHSSNSIEPQIGGFDWQLTFQWHSIP 310

Query: 241 EREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGG 300
           +    KRK +++P  +PT AGGLFA+ R +F  +G YD G+ VWGGEN E+SF++WMCGG
Sbjct: 311 DEIKAKRKADTDPVPTPTMAGGLFAVSRQYFRSIGSYDTGMEVWGGENLEMSFRVWMCGG 370

Query: 301 SIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPL 360
           S+E +PCS +GHV+    PY              T N  R +E W D+ +K +FY R PL
Sbjct: 371 SLEIIPCSIVGHVFPKTAPYERKSF---------TPNTVRAVEVWLDD-YKRHFYARNPL 420

Query: 361 AMFLDMGDISEQ 372
           +     GDISE+
Sbjct: 421 SKDEKYGDISER 432


>gi|348513276|ref|XP_003444168.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
           [Oreochromis niloticus]
          Length = 575

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 152/364 (41%), Positives = 215/364 (59%), Gaps = 22/364 (6%)

Query: 14  EPPLEPYKEGPGEGGKAY--HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
           +PPL+   E  GE G+A   +L E  +   + S+  + +N   S+ IS  R +P+     
Sbjct: 59  KPPLD--LEAVGEMGRAVKLNLNEEEKRKEEESIKAHQINTYVSDKISLHRRLPERWNPL 116

Query: 72  CKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
           CK   Y    LP  SV++ F+NE +S+L+RTVHS+++ +P   L+E++LVDD+S KA L 
Sbjct: 117 CKELKYDYRSLPTTSVVIAFYNEAWSTLLRTVHSVLETSPDILLKEVVLVDDYSDKAHLK 176

Query: 131 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 190
           + L+ YI   N KVRLIR T+REGL+R R  GA  + GEV+ FLD HCE    WL P+L 
Sbjct: 177 EPLDKYISGLN-KVRLIRATKREGLVRARLLGASITTGEVLTFLDCHCECHEGWLEPVLH 235

Query: 191 PIYSDRKIMTVPVIDGIDYQTWEFRS-VYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 249
            I  + K +  PVID ID+ T+++     EP     G F+W +++  + +P+ E K+R+ 
Sbjct: 236 RIKEEPKAVVCPVIDVIDWNTFQYLGHAGEPQ---IGGFDWRLVFTWHSIPDYEQKRRRS 292

Query: 250 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 309
             +  +SPT AGGLFA+ + FF  LG YD G+ VWGGEN E SF+IW CGGS+E  PCS 
Sbjct: 293 PVDVIRSPTMAGGLFAVRKDFFHYLGTYDTGMEVWGGENLEFSFRIWQCGGSLEVHPCSH 352

Query: 310 IGHVYRSFMPYNFGK-LADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
           +GHV+    PY+  K LA+ V          R  E W DE  K  +Y R P A     GD
Sbjct: 353 VGHVFPKKAPYSRSKALANSV----------RAAEVWLDE-FKEIYYHRNPHARLEAFGD 401

Query: 369 ISEQ 372
           ++E+
Sbjct: 402 VTER 405


>gi|18314429|gb|AAH22021.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 5 [Homo sapiens]
 gi|51105933|gb|EAL24517.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 15 [Homo sapiens]
 gi|119574364|gb|EAW53979.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 5, isoform CRA_c
           [Homo sapiens]
 gi|123979772|gb|ABM81715.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 5 [synthetic
           construct]
 gi|123994539|gb|ABM84871.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 5 [synthetic
           construct]
          Length = 443

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 134/328 (40%), Positives = 202/328 (61%), Gaps = 11/328 (3%)

Query: 45  LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
           L +YG N+  S  +  +R +PD R +      YP  LP AS+++ F+NE  ++L +T+ S
Sbjct: 97  LLKYGFNVIISRSLGIEREVPDTRSKMRLQKHYPARLPTASIVICFYNEECNALFQTMSS 156

Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           +   TP  +LEEIILVDD S   DL +KL+ +++ F GKV++IRN +REGLIR R  GA 
Sbjct: 157 VTNLTPHYFLEEIILVDDMSKVDDLKEKLDYHLETFRGKVKIIRNKKREGLIRARLIGAS 216

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            + G+V+VFLD+HCEV   WL PLL  I  D K++  P+ID ID +T E    Y+P    
Sbjct: 217 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPLV 272

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RG F+W + +K + +   E    + +++P +SP  +GG+FA+ R +F E+G YD  +  W
Sbjct: 273 RGTFDWNLQFKWDNVFSYEMDGPEGSTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFW 332

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
           G EN ELS +IWMCGG +  +PCSR+GH+ +       GK +  +    +T+NY R++  
Sbjct: 333 GRENLELSLRIWMCGGQLFIIPCSRVGHISKK----QTGKPSTIISA--MTHNYLRLVHV 386

Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           W DE +K  F+ R+P   ++  G+I E+
Sbjct: 387 WLDE-YKEQFFLRKPGLKYVTYGNIRER 413


>gi|195171653|ref|XP_002026618.1| GL11821 [Drosophila persimilis]
 gi|194111544|gb|EDW33587.1| GL11821 [Drosophila persimilis]
          Length = 658

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 149/369 (40%), Positives = 205/369 (55%), Gaps = 29/369 (7%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSN 56
           +P  + D K   ++PP   + E  GE GK   LP    +  + A +        N   S+
Sbjct: 133 KPKLQDDTK-KVIDPPGN-FDENLGEMGKPVTLPKEMTDEMKKAVETGWTNNAFNQYVSD 190

Query: 57  HISFDRTIPDLRMEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
            IS  RT+PD R   CK    Y  +LP   VI+ FHNE ++ L+RTVHS++ R+P   + 
Sbjct: 191 LISVHRTLPDPRDAWCKDSAHYLSNLPATDVIICFHNEAWTVLLRTVHSVLDRSPEHLIG 250

Query: 116 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
            IILVDD+S    L  +LEDY   +  KV++IR  +REGLIR R  GA+ ++  V+ +LD
Sbjct: 251 RIILVDDYSDMPHLKTQLEDYFAAYP-KVQIIRGKKREGLIRARLLGAQHAKAPVLTYLD 309

Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIF 228
           +HCE    WL PLL  I  +   +  PVID I   T E+        HYR       G F
Sbjct: 310 SHCECTEGWLEPLLDRIARNSTTVVCPVIDVISDDTLEY--------HYRDSSGVNVGGF 361

Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
           +W + +  + +PERE K+    +EP  SPT AGGLF++DR +F  LG YD G  +WGGEN
Sbjct: 362 DWNLQFSWHAVPEREKKRHNSTAEPVYSPTMAGGLFSIDREYFNRLGTYDSGFDIWGGEN 421

Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
            ELSFK WMCGG++E VPCS +GH++R   PY +     R    ++  N  R+ E W DE
Sbjct: 422 LELSFKTWMCGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLRKNSVRLAEVWMDE 476

Query: 349 KHKAYFYTR 357
            +  Y+Y R
Sbjct: 477 -YSQYYYHR 484


>gi|167519663|ref|XP_001744171.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777257|gb|EDQ90874.1| predicted protein [Monosiga brevicollis MX1]
          Length = 607

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 139/329 (42%), Positives = 194/329 (58%), Gaps = 23/329 (6%)

Query: 47  EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
           ++  N++ S+ +  DR +PD R + CK  +YP +LP  SVI VF+NE  S L R++H ++
Sbjct: 125 QHCFNLKRSDSLPLDRPVPDHRDKRCKEIEYPHNLPTTSVIFVFYNEPLSPLYRSIHGVL 184

Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
            RTP   L EIILVDD S    L +  EDYI+    K +L+R +ER GL+  RS GA+ +
Sbjct: 185 DRTPEHLLHEIILVDDGSDADYLKKDFEDYIKLLP-KTKLVRKSERSGLMDARSYGAEVA 243

Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
            G+ I FLDAH EV   WL P++A I  DRK + +P+ID ID  ++ +          RG
Sbjct: 244 TGDTITFLDAHIEVSKGWLEPMMARINEDRKHVVMPIIDSIDPDSFNY---------MRG 294

Query: 227 I-----FEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGL 281
                 F WGM  K          +R+   EP  SP  AGGLF+MDR +F +LGGYDPG+
Sbjct: 295 GLDILGFSWGMGQKSI------GSRRRTRVEPMPSPIMAGGLFSMDRKYFFDLGGYDPGM 348

Query: 282 LVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRV 341
            ++GGE  E+SF+IW CGG++E +PCSR+GHV+R+   Y  G++   V G +I  N  R 
Sbjct: 349 KLYGGEELEISFRIWQCGGTLECIPCSRVGHVFRTGA-YWKGQVYT-VPGHVIVKNKLRA 406

Query: 342 IETWFDEKHKAYFYTREPLAMFLDMGDIS 370
            E W DE  +       PL   +D+GD+S
Sbjct: 407 AEVWMDEYKEVVQRVMPPLPRGMDLGDLS 435


>gi|195587296|ref|XP_002083401.1| GD13712 [Drosophila simulans]
 gi|194195410|gb|EDX08986.1| GD13712 [Drosophila simulans]
          Length = 631

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 156/344 (45%), Positives = 206/344 (59%), Gaps = 17/344 (4%)

Query: 23  GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           G GEGGKA  L  E+ R        E G N   S+ IS +R++PD+R   C+  +Y   L
Sbjct: 142 GLGEGGKASSLDDESQRDLEKRMSLENGFNALLSDSISVNRSLPDIRHPLCRKKEYVTKL 201

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  SVI++F+NE  S LMR+VHS+I R+P + ++EIILVDD S +  L ++LE YI    
Sbjct: 202 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVDDHSDREYLGKELETYIAEHF 261

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
             VR++R   R GLI  R+ GA+ +  EV++FLD+H E   NWLPPLL PI  +++    
Sbjct: 262 KWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 321

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRKYNSEPYKSPTHA 260
           P ID ID+  + +R+    D   RG F+W   YK    LPE      K+ ++P+KSP  A
Sbjct: 322 PFIDVIDHSNFHYRAQ---DEGARGAFDWEFFYKRLPLLPE----DLKHPADPFKSPIMA 374

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG +   PCSRIGH+YR   P 
Sbjct: 375 GGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRIGHIYRG--PR 432

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
           N        KG  +  NYKRV E     K K++ +  E +A  L
Sbjct: 433 NHQ--PSPRKGDYLHKNYKRVAEL----KCKSFKWFMEEVAFDL 470


>gi|198461537|ref|XP_002139017.1| GA25136 [Drosophila pseudoobscura pseudoobscura]
 gi|198137372|gb|EDY69575.1| GA25136 [Drosophila pseudoobscura pseudoobscura]
          Length = 658

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 149/369 (40%), Positives = 205/369 (55%), Gaps = 29/369 (7%)

Query: 1   RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSN 56
           +P  + D K   ++PP   + E  GE GK   LP    +  + A +        N   S+
Sbjct: 133 KPKLQDDTK-KVIDPPGN-FDENLGEMGKPVTLPKEMTDEMKKAVETGWTNNAFNQYVSD 190

Query: 57  HISFDRTIPDLRMEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 115
            IS  RT+PD R   CK    Y  +LP   VI+ FHNE ++ L+RTVHS++ R+P   + 
Sbjct: 191 LISVHRTLPDPRDAWCKDSAHYLSNLPATDVIICFHNEAWTVLLRTVHSVLDRSPEHLIG 250

Query: 116 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 175
            IILVDD+S    L  +LEDY   +  KV++IR  +REGLIR R  GA+ ++  V+ +LD
Sbjct: 251 RIILVDDYSDMPHLKTQLEDYFAAYP-KVQIIRGKKREGLIRARLLGAQHAKAPVLTYLD 309

Query: 176 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIF 228
           +HCE    WL PLL  I  +   +  PVID I   T E+        HYR       G F
Sbjct: 310 SHCECTEGWLEPLLDRIARNSTTVVCPVIDVISDDTLEY--------HYRDSSGVNVGGF 361

Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
           +W + +  + +PERE K+    +EP  SPT AGGLF++DR +F  LG YD G  +WGGEN
Sbjct: 362 DWNLQFSWHAVPEREKKRHNSTAEPVYSPTMAGGLFSIDREYFNRLGTYDSGFDIWGGEN 421

Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
            ELSFK WMCGG++E VPCS +GH++R   PY +     R    ++  N  R+ E W DE
Sbjct: 422 LELSFKTWMCGGTLEIVPCSHVGHIFRKRSPYKW-----RSGVNVLRKNSVRLAEVWMDE 476

Query: 349 KHKAYFYTR 357
            +  Y+Y R
Sbjct: 477 -YSQYYYHR 484


>gi|34042906|gb|AAQ56699.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           [Drosophila melanogaster]
          Length = 601

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 145/364 (39%), Positives = 207/364 (56%), Gaps = 13/364 (3%)

Query: 17  LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
           L+  K G GE G A HL  A +  GD    +  +N E S  ++++R++ D R   C    
Sbjct: 83  LQKQKVGLGEQGVAVHLSGAAKERGDEIYKKIALNEELSEQLTYNRSVGDHRNPLCAKQR 142

Query: 77  YPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
           +  + LP ASV+++F NE +S L+RTVHS +     + L+EIILVDD S   +L  KL+ 
Sbjct: 143 FDSESLPTASVVIIFFNEPYSVLLRTVHSTLSTCNEKALKEIILVDDGSDNVELGAKLDY 202

Query: 136 YIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIY 193
           Y++    +GKV ++R   R GLIR R  GA+ + G+V++FLDAHCE  + W  PLL  I 
Sbjct: 203 YVRTRIPSGKVTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQRIK 262

Query: 194 SDRKIMTVPVIDGID-----YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
             R  + VP+ID ID     Y T  ++S       + G F+W  L +  +  +R   K++
Sbjct: 263 ESRTSVLVPIIDVIDANDFQYSTNGYKSFQVGGFQWNGHFDWINLPEREKQRQRRECKQE 322

Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
               P  SPT AGGLFA+DR +F E+G YD  +  WGGEN E+SF+IW CGG+IE +PCS
Sbjct: 323 REICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETIPCS 382

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
           R+GH++R F PY F    DR    +   N  R+   W DE    +F  R  L    D+GD
Sbjct: 383 RVGHIFRDFHPYKFPN--DRDTHGI---NTARMALVWMDEYINIFFLNRPDLKFHADIGD 437

Query: 369 ISEQ 372
           ++ +
Sbjct: 438 VTHR 441


>gi|12855129|dbj|BAB30220.1| unnamed protein product [Mus musculus]
          Length = 431

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 132/328 (40%), Positives = 197/328 (60%), Gaps = 11/328 (3%)

Query: 45  LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 104
           L  YG+N   S  +  +R +PD R + C+   YP +LP AS+I+ F+NE F++L+R V S
Sbjct: 78  LRRYGLNAIMSRRLGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSS 137

Query: 105 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 164
           ++  +P   LEEIILVDD S   DL  KL+ Y++ F G+V+LIRN +REGLIR++  GA 
Sbjct: 138 VVNLSPQHLLEEIILVDDMSEFDDLKDKLDYYLEIFRGEVKLIRNKKREGLIRSKMIGAS 197

Query: 165 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 224
            + G+++VFLD+HCEV   WL PLL  I  D K++  P+ID I+  T +    Y      
Sbjct: 198 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIV 253

Query: 225 RGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           RG F+W +  + + +   E    +  S P +SP   GG+FA++R +F ELG YD G+ + 
Sbjct: 254 RGAFDWNLNLRWDNVFAYELDGPEGPSTPIRSPAMTGGIFAINRHYFNELGQYDNGMDIC 313

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
           GGEN ELS +IWMCGG +  +PCSR+G+  ++   +       R     ++ N  RV+  
Sbjct: 314 GGENVELSLRIWMCGGQLFILPCSRVGYNSKALSQHR------RANQSALSRNLLRVVHV 367

Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           W DE +K  F+ + P   ++  G+ISE+
Sbjct: 368 WLDE-YKGNFFLQRPSLTYVSCGNISER 394


>gi|156351115|ref|XP_001622369.1| hypothetical protein NEMVEDRAFT_v1g141560 [Nematostella vectensis]
 gi|156208888|gb|EDO30269.1| predicted protein [Nematostella vectensis]
          Length = 494

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 131/335 (39%), Positives = 197/335 (58%), Gaps = 18/335 (5%)

Query: 41  GDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMR 100
           G+ + G+   N   S+ I  DR +PD R   C+Y  YP  LP  S+I+ FHNE  S+L+R
Sbjct: 17  GEDAYGKNQFNQAISDKIGGDRDVPDTRHSHCRYEAYPSTLPATSIIITFHNEARSTLLR 76

Query: 101 TVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRS 160
           TV SI+ +TP   + EIILVDDFS  A+     +  +     KV+++RN +R+GLIR+R 
Sbjct: 77  TVKSILNKTPPNLVNEIILVDDFSDDAE-----DGLLLMGLPKVKVLRNNKRQGLIRSRV 131

Query: 161 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 220
           +G+  ++ +V+ FLD+HCE   +WL PLL  +  ++K +  P+ID I+   + +      
Sbjct: 132 KGSDTAKSDVLTFLDSHCECNTDWLQPLLKRVVQNKKAVVSPIIDVINMDDFSYIGA--- 188

Query: 221 DHHYRGIFEWGMLYK-ENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDP 279
               +G F+W + +K +N  PE++  +R     P K+P  AGGLF + +++F E+G YD 
Sbjct: 189 SADIKGGFDWSLHFKWDNLTPEQKQSRRSTPIAPIKTPMIAGGLFVVTKSWFEEMGKYDT 248

Query: 280 GLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--N 337
            + +WGGENFE+SF+ W CGGS+E +PCSR+GHV+R   PY F        G   TY  N
Sbjct: 249 MMDIWGGENFEISFRTWQCGGSMEIIPCSRVGHVFRKRHPYTF------PDGNANTYMKN 302

Query: 338 YKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +R  E W DE +K ++Y   P+A     G I  +
Sbjct: 303 TRRTAEVWMDE-YKRFYYAARPMARSALYGSIKSR 336


>gi|158299131|ref|XP_319236.4| AGAP010078-PA [Anopheles gambiae str. PEST]
 gi|157014221|gb|EAA14535.4| AGAP010078-PA [Anopheles gambiae str. PEST]
          Length = 504

 Score =  258 bits (660), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 136/328 (41%), Positives = 196/328 (59%), Gaps = 18/328 (5%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYP-----LDLPKASVILVFHNEGFSSLMRTVHSI 105
           N + S+ +  +R +PD R   C+   +        LP  SVI+ FHNE  S+L+RTV S+
Sbjct: 33  NQQASDGLKSNRELPDTRNAMCRRSSWSDLSTIAHLPATSVIITFHNEARSTLLRTVVSV 92

Query: 106 IKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKE 165
           + R+P + + EIILVDD+S   +  Q+L   IQ    KVRLIRN++REGL+R+R  GA  
Sbjct: 93  LNRSPERLIHEIILVDDYSDFPEDGQELAK-IQ----KVRLIRNSKREGLVRSRVTGAAA 147

Query: 166 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 225
           +  +V+ FLD+HCE  +NWL PLLA +  D   +  PVID I   T+++          R
Sbjct: 148 ATAKVLTFLDSHCECNVNWLEPLLARVAEDPTRVVCPVIDVISMDTFQY---IGASADLR 204

Query: 226 GIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 284
           G F+W +++K   L   E K R+ + + P ++P  AGGLF +D+A+F  LG YD  + +W
Sbjct: 205 GGFDWNLVFKWEYLSNAERKARQRDPTAPIRTPMIAGGLFVIDKAYFERLGTYDTQMDIW 264

Query: 285 GGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIET 344
           GGEN E+SF++W CGGS+E +PCSR+GHV+R   PY F        G +   N +R  E 
Sbjct: 265 GGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYTFPGGG---SGNIFAKNTRRAAEV 321

Query: 345 WFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           W DE +K Y+Y   PLA  +  GDI ++
Sbjct: 322 WMDE-YKKYYYAAVPLATNIPFGDIDDR 348


>gi|75832150|ref|NP_001015032.2| polypeptide N-acetylgalactosaminyltransferase 3 [Rattus norvegicus]
 gi|74353669|gb|AAI01887.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Rattus
           norvegicus]
 gi|149022135|gb|EDL79029.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 [Rattus norvegicus]
          Length = 633

 Score =  258 bits (660), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 156/388 (40%), Positives = 222/388 (57%), Gaps = 25/388 (6%)

Query: 1   RPVFKADGKLGNLEPPLE-PYKE--GPGEGGKAY---HLPEAYRAAGDASLGEYGMNMET 54
           RP  +       L+P L+ P ++   PG  GK +   HL    +   +    ++  N   
Sbjct: 95  RPCLQGYYTAAELKPVLDRPPQDSNAPGASGKPFKITHLSPEEQKEKERGETKHCFNAFA 154

Query: 55  SNHISFDRTI-PDLRMEEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           S+ IS  R + PD R  EC   K+   P  LP  SVI+VFHNE +S+L+RTVHS++  +P
Sbjct: 155 SDRISLHRDLGPDTRPPECIEQKFKRCP-PLPTTSVIIVFHNEAWSTLLRTVHSVLYSSP 213

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
           A  L+EIILVDD S    L +KLE+YI++F+  V+++R  ER+GLI  R  GA  +  E 
Sbjct: 214 AILLKEIILVDDASVDDYLHEKLEEYIKQFS-IVKIVRQQERKGLITARLLGAAVATAET 272

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIF 228
           + FLDAHCE    WL PLLA I  +   +  P I  ID  T+EF   S Y  +H+ RG F
Sbjct: 273 LTFLDAHCECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHN-RGNF 331

Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
           +W + +    LP+ E ++RK  + P K+PT AGGLF++ R +F  +G YD  + +WGGEN
Sbjct: 332 DWSLSFGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISRDYFEHIGSYDEEMEIWGGEN 391

Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
            E+SF++W CGG +E +PCS +GHV+RS  P+ F K        +I  N  R+ E W DE
Sbjct: 392 IEMSFRVWQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQ-----VIARNQVRLAEVWMDE 446

Query: 349 KHKAYFYTREPLAMFL----DMGDISEQ 372
            +K  FY R   A  +      GD+S++
Sbjct: 447 -YKEIFYRRNTDAAKIVKQKSFGDLSKR 473


>gi|345326650|ref|XP_003431069.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 4-like
           [Ornithorhynchus anatinus]
          Length = 580

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 151/361 (41%), Positives = 206/361 (57%), Gaps = 24/361 (6%)

Query: 19  PYKEGPGEGGKAYHLPEAYRAAGDASLGE------YGMNMETSNHISFDRTIPDLRMEEC 72
           P    PGE G+A  L    +  GDA   E      Y +N+  S+ IS  R I D RM EC
Sbjct: 71  PEPRAPGEWGEATRL----QLRGDAKKREEELVEKYAINIHLSDRISLHRRIRDRRMPEC 126

Query: 73  KYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 131
           +   Y    LP  SV++ F+NE +S+L+RTVHS+++ +PA  L+E+ILVDD S +  L  
Sbjct: 127 RAVTYDYRRLPTTSVVIAFYNEAWSTLLRTVHSVLETSPAVLLKEVILVDDLSDRPYLKA 186

Query: 132 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 191
           +LE Y+     +VRL+R   REGL+R R  GA  + GEV+ FLD HCE G  WL PLL  
Sbjct: 187 ELEKYVSALQ-RVRLVRTNRREGLVRARLIGATFATGEVLTFLDCHCECGPGWLEPLLER 245

Query: 192 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 251
           I  +   +  PVID ID+ T+EF    +      G F+W + ++   +PERE ++R+   
Sbjct: 246 IGRNETAVVCPVIDTIDWNTFEF--YMQTGEPMIGGFDWRLTFQWQTVPERERRRRRSRI 303

Query: 252 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIG 311
           +P  SPT AGGLFA+ + +F  LG YD G+ VWGGEN ELSF++W CGG++E +PCS +G
Sbjct: 304 DPIPSPTMAGGLFAVGKKYFEYLGTYDMGMEVWGGENLELSFRVWQCGGTLEILPCSHVG 363

Query: 312 HVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISE 371
           HV+    PY           P    N  R  E W D  +K +FY R P A      D+SE
Sbjct: 364 HVFPKRAPY---------ARPSFLRNTARAAEVWMD-GYKEHFYNRNPPARKESYWDLSE 413

Query: 372 Q 372
           +
Sbjct: 414 R 414


>gi|443685595|gb|ELT89149.1| hypothetical protein CAPTEDRAFT_34275, partial [Capitella teleta]
          Length = 358

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 133/316 (42%), Positives = 193/316 (61%), Gaps = 9/316 (2%)

Query: 42  DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 101
           + +  +Y MN++ SN +  DR+I D R  EC    +   L K S+I+ F++E +S L+R 
Sbjct: 1   ETNFDQYSMNVQLSNTVPLDRSILDTRNPECHVVQFSQQL-KVSIIVPFYDESWSMLLRM 59

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSR 161
           +HS+I RTP   LEEIIL+DD SS+  L   L++Y +  + K+R+IR+  REGL+R R  
Sbjct: 60  LHSVIDRTPDALLEEIILIDDKSSRDYLKAPLDEYCKVLSPKIRIIRSEHREGLMRGRMV 119

Query: 162 GAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPD 221
           GAKE++ + +VFLDAH E    WL PLL  I    + + VP +D ID QT ++ S    +
Sbjct: 120 GAKEAKADTLVFLDAHVECNEGWLDPLLQIIMDHPRAIAVPTMDNIDPQTIKYESW---N 176

Query: 222 HHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGL 281
           H   G F W M Y+   LP+    K    ++P+ SPT  G   AM+R +F E+GG+D G+
Sbjct: 177 HVAYGGFTWNMEYQWKVLPDTLVNKLISKTQPFPSPTTIGCAMAMNRDYFFEIGGFDEGM 236

Query: 282 LVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLI-TYNYKR 340
            +WGGEN E+SFK WMCG  +   PCSR+GH++R+ +PY F    ++  G ++   NY+R
Sbjct: 237 FIWGGENLEISFKTWMCGEGLYISPCSRVGHLFRTILPYVF---PNQYGGGMVRQKNYQR 293

Query: 341 VIETWFDEKHKAYFYT 356
           V E W DE +K  FY 
Sbjct: 294 VAEVWMDE-YKELFYA 308


>gi|345319818|ref|XP_001521442.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Ornithorhynchus anatinus]
          Length = 628

 Score =  258 bits (659), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 129/323 (39%), Positives = 198/323 (61%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR +PD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++K++P
Sbjct: 162 NQVESDKLRMDRAVPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVASVLKKSP 221

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++  V
Sbjct: 222 PHLVKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQARV 276

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE   +WL PLL  +  D+  +  P+ID I+   +++          +G F+W
Sbjct: 277 LTFLDSHCECNEHWLEPLLERVAEDKTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 333

Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+++F ELG YD  + VWGGEN 
Sbjct: 334 NLVFKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENL 393

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E VPCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 394 EISFRVWQCGGSLEIVPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 448

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 449 YKNFYYAAVPSARNVPYGNIQSR 471


>gi|410964449|ref|XP_003988767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Felis
           catus]
          Length = 622

 Score =  258 bits (659), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 149/370 (40%), Positives = 211/370 (57%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYH---LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP +P    PG  GKA+             +    ++  N   S+ IS  R + PD R  
Sbjct: 106 PPQDP--NSPGADGKAFQKDKWTSLETQEKEEGYKKHCFNAFASDRISLQRALGPDTRPP 163

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  LP  SVI+VFHNE +S+L+RTV+S++  +PA  L+EIILVDD S+  
Sbjct: 164 ECVDQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASTDE 222

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L ++L+ Y+++    VR++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL P
Sbjct: 223 YLKEQLDQYVKKLQ-IVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  D  ++  P I  ID  T+EF + V     H RG F+W + +    LP  E ++
Sbjct: 282 LLARIAEDETVVVSPDIVTIDLNTFEFSKPVPRGRVHSRGNFDWSLTFGWEALPAHEKQR 341

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 342 RKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQMEIIP 401

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLAM 362
           CS +GHV+R+  P+ F K        +I  N  R+ E W D  +K  FY R      +A 
Sbjct: 402 CSVVGHVFRTKSPHTFPKGIS-----VIARNQVRLAEVWMD-SYKEIFYRRNLQAAKMAQ 455

Query: 363 FLDMGDISEQ 372
               GDISE+
Sbjct: 456 EKSFGDISER 465


>gi|195455372|ref|XP_002074693.1| GK23025 [Drosophila willistoni]
 gi|194170778|gb|EDW85679.1| GK23025 [Drosophila willistoni]
          Length = 599

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 146/370 (39%), Positives = 210/370 (56%), Gaps = 15/370 (4%)

Query: 12  NLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 71
           +++  L   + G G  G A HL  + +  G+    +  +N E S  +S++RT+ D R   
Sbjct: 75  SIQLDLAKQRPGLGNNGVAVHLTGSAKERGEKIYKKIALNEELSEQLSYNRTVGDHRNPL 134

Query: 72  CKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 130
           C    +  + LP ASVI++F NE +S L+RTVHS +     + L+EIILVDD S   +L 
Sbjct: 135 CASQRFDTNSLPSASVIIIFFNEPYSVLLRTVHSTLSTCNEKSLKEIILVDDGSDNVELG 194

Query: 131 QKLEDYIQ-RF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 188
            KL+ YI+ RF  GKV ++R   R GLIR R  GA+ + G+V++FLDAHCE  + W  PL
Sbjct: 195 GKLDHYIRTRFPAGKVTVLRLKNRLGLIRARLAGARMATGDVLIFLDAHCEGNVGWCEPL 254

Query: 189 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 248
           L  I   R  + VP+ID ID   +++ S         G F+W   +    LPERE ++++
Sbjct: 255 LQRIKESRTSVLVPIIDVIDANDFQY-STNGYKAFQVGGFQWNGHFDWVNLPEREKQRQR 313

Query: 249 YNSE------PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSI 302
              +      P  SPT AGGLFA+DR +F E+G YD  +  WGGEN E+SF+IW CGG+I
Sbjct: 314 RECDQAREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTI 373

Query: 303 EWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAM 362
           E +PCSR+GH++R F PY F    DR    +   N  R+   W D+    +F  R  L  
Sbjct: 374 ETIPCSRVGHIFRDFHPYKFPN--DRDTHGI---NTARMALVWMDDYINIFFLNRPDLKF 428

Query: 363 FLDMGDISEQ 372
             D+GD++ +
Sbjct: 429 HADIGDVTHR 438


>gi|71896101|ref|NP_001026749.1| polypeptide N-acetylgalactosaminyltransferase 6 [Gallus gallus]
 gi|60098353|emb|CAH65007.1| hypothetical protein RCJMB04_1b1 [Gallus gallus]
          Length = 621

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 152/370 (41%), Positives = 213/370 (57%), Gaps = 26/370 (7%)

Query: 15  PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYG-----MNMETSNHISFDRTI-PDLR 68
           PP +P   GPG  GKA+   +    A ++   E G      N   S+ IS  R + PD R
Sbjct: 105 PPQDP--SGPGADGKAFK--KEQWTAEESKEKERGYEKHCFNAFASDRISLQRALGPDSR 160

Query: 69  MEEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
             EC   K+   P  LP  SV++VFHNE +S+L+RTV+S++  +PA  L EIILVDD S+
Sbjct: 161 PPECIDQKFKRCP-PLPTTSVVIVFHNEAWSTLLRTVYSVLHASPAALLREIILVDDAST 219

Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
              L  +L+ Y+++    VR++R  ER+GLI  R  GA  + GEV+ FLDAHCE    WL
Sbjct: 220 DEYLKDELDRYVKQLQ-IVRVVRQAERKGLITARLLGASVASGEVLTFLDAHCECFHGWL 278

Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREA 244
            PLL+ I  +   +  P I  ID  T+EF + V     H RG F+W + +    +P RE 
Sbjct: 279 EPLLSRIAEEPTAVVSPDITTIDLNTFEFSKPVQYGKQHSRGNFDWSLTFGWEVVPPRER 338

Query: 245 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
           ++RK  + P KSPT AGGLFA+ R++F  +G YD  + +WGGEN E+SF++W CGG +E 
Sbjct: 339 QRRKDETVPIKSPTFAGGLFAISRSYFEHIGSYDDQMEIWGGENVEMSFRVWQCGGQLEI 398

Query: 305 VPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
           +PCS +GHV+RS  P+ F K        +I+ N  R+ E W D+ +K  FY R   A  +
Sbjct: 399 IPCSVVGHVFRSKSPHTFPKGTQ-----VISRNQVRLAEVWMDD-YKEIFYRRNQQAAQM 452

Query: 365 ----DMGDIS 370
                 GDI+
Sbjct: 453 AREKTYGDIT 462


>gi|410975135|ref|XP_003993990.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Felis
           catus]
          Length = 653

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 131/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++K++P
Sbjct: 187 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 246

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 247 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 301

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE   +WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 302 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 358

Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 359 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 418

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E VPCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 419 EISFRVWQCGGSLEIVPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 473

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 474 YKNFYYAAVPSARNVPYGNIQSR 496


>gi|351708624|gb|EHB11543.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Heterocephalus
           glaber]
          Length = 567

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 131/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++KR+P
Sbjct: 101 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 160

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 161 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 215

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE   +WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 216 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 272

Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 273 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 332

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 333 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 387

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 388 YKNFYYAAVPSARNVPYGNIQSR 410


>gi|198415534|ref|XP_002121475.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
           2, partial [Ciona intestinalis]
          Length = 582

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 135/322 (41%), Positives = 192/322 (59%), Gaps = 17/322 (5%)

Query: 51  NMETSNHISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
           N + S+ +  DR +PD R   C    WD    LP  SVI+ FHNE  S+L+RTV S++ R
Sbjct: 114 NQQASDKLKCDRPVPDTRNGLCSSNSWDLS-KLPATSVIVTFHNEARSTLLRTVVSVLNR 172

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
           +P   + EIILVDDFS  A+ D +L   I+    KVR++RN +REGL+R+R RGA  +  
Sbjct: 173 SPPSLVREIILVDDFSDNAE-DGQLLAQIE----KVRVLRNNQREGLMRSRIRGADAAAA 227

Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
            V+ FLD+H E   NWL PLL  I  DR  +  P+ID I+   +E+          RG F
Sbjct: 228 PVLTFLDSHVECNKNWLEPLLQRIADDRTAVVCPIIDVINMDNFEYIGASAD---LRGGF 284

Query: 229 EWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
           +W +++K + +   E + R  N + P  +P  AGGLF+MD+++F +LG YD  + VWGGE
Sbjct: 285 DWNLVFKWDYMSSEERRSRAGNPTAPISTPMIAGGLFSMDKSYFNQLGKYDTAMDVWGGE 344

Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFD 347
           N E+SF++W CGG +E +PCSR+GHV+R   PY F   +    G + T N +R  E W D
Sbjct: 345 NLEISFRVWQCGGRLEIIPCSRVGHVFRKQHPYTFPGGS----GNVFTRNTRRAAEVWMD 400

Query: 348 EKHKAYFYTREPLAMFLDMGDI 369
           + +K Y+Y   P A  +  G+I
Sbjct: 401 D-YKEYYYAAVPSAKLIPFGNI 421


>gi|195027660|ref|XP_001986700.1| GH20386 [Drosophila grimshawi]
 gi|193902700|gb|EDW01567.1| GH20386 [Drosophila grimshawi]
          Length = 666

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 137/345 (39%), Positives = 195/345 (56%), Gaps = 16/345 (4%)

Query: 7   DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPD 66
           D  +   EP     K+G G  G+   +P   R            N+  S+ I  +RT+ D
Sbjct: 80  DYNINQFEP-----KQGEGADGRPVIVPPRDRFRMQRFFKLNSFNILASDRIPLNRTLKD 134

Query: 67  LRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
            R  EC+   Y   LP  SVI+VFHNE +S L+RT+ S+I R+P   L EIILVDD S++
Sbjct: 135 YRTGECRDKRYANSLPNTSVIIVFHNEAWSVLLRTITSVINRSPRHLLREIILVDDASNR 194

Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
           + L ++LE YIQ      RL R  ER GL+  R  GA+ +RG+V+ FLDAHCE    WL 
Sbjct: 195 SFLKRQLEAYIQVLAVPTRLYRMKERSGLVPARLLGAQHARGDVLTFLDAHCECSRGWLE 254

Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK--ENELPEREA 244
           PLLA I   R+++  PVID I    + +   +E  +H+ G F W + ++   ++   +  
Sbjct: 255 PLLARIGESREVVICPVIDIISDDNFSYTKTFE--NHW-GAFNWQLSFRWFSSDRKRQTT 311

Query: 245 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
              K ++ P  +P  AGGLFA+DR +F E+G YD  + +WGGEN E+SF+IW CGG IE 
Sbjct: 312 ANTKDSTAPIATPGMAGGLFAIDRKYFYEMGAYDSDMRIWGGENVEMSFRIWQCGGRIEI 371

Query: 305 VPCSRIGHVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWFDE 348
            PCS +GH++RS  PY F G +++     ++T N  R    W D+
Sbjct: 372 SPCSHVGHIFRSSTPYTFPGGMSE-----VLTANLARAATVWMDD 411


>gi|157107410|ref|XP_001649764.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108884050|gb|EAT48275.1| AAEL000639-PA [Aedes aegypti]
          Length = 613

 Score =  258 bits (658), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 149/356 (41%), Positives = 209/356 (58%), Gaps = 28/356 (7%)

Query: 18  EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           E  +EGPGE GK    P       D  L E    +   N  S    +   R        Y
Sbjct: 103 EAEREGPGEHGK----PLKLEKLEDIKLNE---KLFKENGYSALSGVGKKR--------Y 147

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             +LP  SVI++F+NE +S+L+RTV+S++ R+P+  L+EI+LV+D S+K  L + L+D++
Sbjct: 148 LQELPTVSVIVIFYNEHWSTLLRTVYSVLNRSPSHLLKEIVLVNDHSTKEFLWEPLQDFV 207

Query: 138 Q-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 196
           +     KV+LI    R GLI  R  GAK + G+V++ LD+H EV +NWLPPL+ PI  D 
Sbjct: 208 RTELAPKVKLISLPVRSGLITARLTGAKAATGDVLIVLDSHTEVNVNWLPPLIEPIAEDY 267

Query: 197 KIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKS 256
           +    P ID I + T+++R+    D   RG F+W  LYK   LP R A+     +EP++S
Sbjct: 268 RTCVCPFIDVIAHDTFQYRA---QDEGKRGAFDWKFLYKR--LPLR-AQDMVDPTEPFES 321

Query: 257 PTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRS 316
           P  AGGLFA+   FF ELGGYD GL +WGGE +ELSFK+W CGG +   PCSR+GHVYR 
Sbjct: 322 PIMAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFKVWQCGGRMVDAPCSRVGHVYRG 381

Query: 317 FMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
           + P+   +  +      +T N+KRV E W DE +K + Y R P     D GD+++Q
Sbjct: 382 YAPFPNPRGTN-----FVTRNFKRVAEVWMDE-YKQFLYERNPQFDQTDAGDLTKQ 431


>gi|167526997|ref|XP_001747831.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773580|gb|EDQ87218.1| predicted protein [Monosiga brevicollis MX1]
          Length = 658

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 137/322 (42%), Positives = 189/322 (58%), Gaps = 14/322 (4%)

Query: 55  SNHISFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 113
           S+ +  DR +PD+R   CK   +P  +L KAS+I+ F NE +S+L+RTVHS++ R+PA  
Sbjct: 188 SSLLPLDRPVPDVRPPACKAKQWPTANLLKASIIICFVNEAWSTLLRTVHSVLNRSPADL 247

Query: 114 LEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIV 172
           + EIIL+DD S  A L  KL +YI+     KV+ +R   R GLIR R  GA+ + G+V++
Sbjct: 248 VHEIILLDDSSDAAWLGDKLTNYIRDNLPDKVKYVRTQHRSGLIRARLVGAEHATGDVLL 307

Query: 173 FLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGM 232
           FLD+HCE  LNWL P++A I  DR+ +  PVID ID+ T E+    + D    G F+W M
Sbjct: 308 FLDSHCEANLNWLEPIMALITEDRRTVVTPVIDSIDHHTMEYSKATQ-DVPAVGTFDWTM 366

Query: 233 LYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELS 292
            +       R        ++P  SPT AGGLFAM++ +F ELG YD  +  WGGEN E+S
Sbjct: 367 DFNWKAGVRRAGADA---TDPVDSPTMAGGLFAMEKNYFYELGSYDEKMDGWGGENLEMS 423

Query: 293 FKIWMCGGSIEWVPCSRIGHVYRSFMPYNF--GKLADRVKGPLITYNYKRVIETWFDEKH 350
           F+IW CGG +   PCS +GH++R   PY    G + D         N  RV E W D  +
Sbjct: 424 FRIWQCGGRLVTAPCSHVGHIFRDSHPYTVPGGSIHD-----TFLRNSMRVAEVWMDH-Y 477

Query: 351 KAYFYTREPLAMFLDMGDISEQ 372
           K YF    P    +D GD+SE+
Sbjct: 478 KQYFLDTRPGQNIIDAGDVSER 499


>gi|56756104|gb|AAW26230.1| SJCHGC09400 protein [Schistosoma japonicum]
          Length = 737

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 153/355 (43%), Positives = 213/355 (60%), Gaps = 13/355 (3%)

Query: 23  GPGEGGKAY-----HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 77
           GPGEGG  Y      +  A +   D    +   N   S+ IS  R +PD R   CK   Y
Sbjct: 181 GPGEGGIPYTVNREDISPAEQVIFDKGWKDNAFNQLASDRISVRRYLPDYREGTCKDNKY 240

Query: 78  PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 137
             +LP AS+I+ FHNE +S L+R+VHS+I R+P+  L EIILVDDFS +  L + LE+Y+
Sbjct: 241 SRNLPSASIIICFHNEAWSVLLRSVHSVIDRSPSYLLHEIILVDDFSDRPHLKEALEEYM 300

Query: 138 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 197
           +  N  V+++R   REGLIR R  GA +S G+V+VFLD+H E    WL PLL  I  +  
Sbjct: 301 KMLN-VVKIVRTKRREGLIRARMLGAAQSSGKVLVFLDSHIECTTGWLEPLLDRIAYNSS 359

Query: 198 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 257
           I+ VPVI  I+ +T ++  +  P     G F+W + +  +E  ER   +      P +SP
Sbjct: 360 IVVVPVITVINDKTLKY-DLPSPSRVQIGGFDWSLSFIWHEQTERHKNRPGAPYSPVQSP 418

Query: 258 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSF 317
           T AGGLFA+ R +F  LG YDPG+ VWGGEN ELSFKIWMCGGS+E V CS++GH++R  
Sbjct: 419 TMAGGLFAISREYFNHLGMYDPGMEVWGGENLELSFKIWMCGGSLEIVICSQVGHIFRDR 478

Query: 318 MPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            PY +      VK PL   N  R+ + W D+ +K +++ R    M +D+G++SE+
Sbjct: 479 SPYIWDV---DVKDPL-KRNLLRLADVWLDD-YKRFYHARIGFEM-VDIGNVSER 527


>gi|348580113|ref|XP_003475823.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
           [Cavia porcellus]
          Length = 622

 Score =  257 bits (657), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 149/371 (40%), Positives = 215/371 (57%), Gaps = 24/371 (6%)

Query: 15  PPLEPYKEGPGEGGKAYH----LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRM 69
           PP +P    PG  GKA+      P+  +   +    ++  N   S+ IS  R + PD R 
Sbjct: 106 PPQDP--NSPGADGKAFQKSDWTPQETQEK-EEGYKKHCFNAFASDRISLQRALGPDTRP 162

Query: 70  EEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 126
            EC   K+   P  LP  SVI+VFHNE +S+L+RTV+S++  +PA  L+EIILVDD S+ 
Sbjct: 163 SECIHQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTSPATLLKEIILVDDASTD 221

Query: 127 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 186
             L  +LE Y+Q+    V+++R  ER+GLI  R  GA  ++ EV+ FLDAHCE    WL 
Sbjct: 222 EYLKDELERYVQQLQ-IVKVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLE 280

Query: 187 PLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAK 245
           PLLA I  ++  +  P I  I+  T+EF + + E   H RG F+W + +    LP  E +
Sbjct: 281 PLLARIAENKMAVVSPDIVTINLNTFEFSKPIPEGRIHSRGNFDWILTFGWEALPAHEKQ 340

Query: 246 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 305
           +RK  + P KSPT AGGLF++ +++F  +G YD  + +WGGEN E+SF++W CGG +E +
Sbjct: 341 RRKDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEII 400

Query: 306 PCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTRE----PLA 361
           PCS +GHV+R+  P+ F K        +I  N  R+ E W D+ +K  FY R      +A
Sbjct: 401 PCSVVGHVFRTKSPHTFPKGTS-----VIARNQVRLAEVWMDD-YKKIFYRRNLQAAKIA 454

Query: 362 MFLDMGDISEQ 372
                GDISE+
Sbjct: 455 QEKSFGDISER 465


>gi|345798845|ref|XP_003434499.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Canis
           lupus familiaris]
          Length = 588

 Score =  257 bits (657), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 131/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++K++P
Sbjct: 122 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 181

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 182 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 236

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE   +WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 237 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 293

Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 294 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 353

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E VPCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 354 EISFRVWQCGGSLEIVPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 408

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 409 YKNFYYAAVPSARNVPYGNIQSR 431


>gi|443687046|gb|ELT90152.1| hypothetical protein CAPTEDRAFT_141956, partial [Capitella teleta]
          Length = 351

 Score =  257 bits (657), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 128/318 (40%), Positives = 199/318 (62%), Gaps = 11/318 (3%)

Query: 44  SLGEYGMNMETSNHI-SFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRT 101
           + G +  N  +S+ + +F   +PD RME C    Y L  L K S+I++FHNE  S+L+RT
Sbjct: 3   TTGYHSFNHSSSDLVGNFRHELPDFRMEGCHKKTYDLTTLGKTSIIIIFHNEARSTLLRT 62

Query: 102 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSR 161
           +H++++RTP   L EI++VDD S+ A L + L+ Y+Q    ++R+IR  +R+GLIR R+R
Sbjct: 63  IHALLERTPILLLVEILIVDDASTHAWLKEPLDKYLQHL-PRIRIIRLKQRQGLIRARTR 121

Query: 162 GAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPD 221
           GA+E++G+++ F DAH EVG  WLPPLL  I  +RK++  P +D I +Q++E+   +   
Sbjct: 122 GAEEAKGDILYFADAHTEVGEGWLPPLLQRIKENRKVLVFPEMDPIQHQSFEY---WRAG 178

Query: 222 HHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGL 281
             Y G F W M +K    P+    +R   ++P  SP   G   A++R +F E G YD  +
Sbjct: 179 DEYHGAFYWHMEFKYKFAPKEILNRRSDPTQPVPSPVMVGCAHAIEREYFFETGAYDTDM 238

Query: 282 LVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRV 341
            +WGGEN E +F++WMCGG +E +PCSR+GHV++  +PY+F   +      +I  N  R+
Sbjct: 239 EIWGGENIEHAFRLWMCGGRVEVIPCSRVGHVFKPRLPYSFTGDS----ASIIQRNLIRI 294

Query: 342 IETWFDEKHKAYFYTREP 359
            ETW D+ +K +FY  +P
Sbjct: 295 AETWMDD-YKKFFYATQP 311


>gi|291236246|ref|XP_002738051.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
           partial [Saccoglossus kowalevskii]
          Length = 321

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 127/274 (46%), Positives = 175/274 (63%), Gaps = 3/274 (1%)

Query: 21  KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
           + GPGE G+ Y L    +        ++G N   S+ +S +R +PD+R   CK  +Y + 
Sbjct: 51  RTGPGEQGRPYILSPEEKKNEHQDFSKHGFNKHISDVLSVERALPDIRDPRCKTMEYLVK 110

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  S+++ FHNE  S L RTVHSII R+P + L EIILVDDFS   +  + L DY+   
Sbjct: 111 LPNTSIVIPFHNEALSVLKRTVHSIINRSPPELLHEIILVDDFSDHDECKEPLNDYMVTV 170

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             KVR+IR T+REGLIRTR  GA  + G+V+VFLD+HCE  +NWLPPLL  I  +RK + 
Sbjct: 171 -PKVRIIRATKREGLIRTRLLGASRATGQVLVFLDSHCEANVNWLPPLLESIALNRKCIA 229

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 260
            P+ID I    + + +  +     RG F+W + YK   L E E K+RK+ +EP+++P  A
Sbjct: 230 CPMIDVIGNNDYHYET--QAGDAMRGAFDWELFYKRIPLTEEELKRRKHAAEPFRTPIMA 287

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 294
           GGLFA+DR +F E+GGYD GL +WGGE ++LSFK
Sbjct: 288 GGLFAVDRLYFNEIGGYDAGLEIWGGEQYDLSFK 321


>gi|339234661|ref|XP_003378885.1| putative RecF/RecN/SMC N domain protein [Trichinella spiralis]
 gi|316978493|gb|EFV61475.1| putative RecF/RecN/SMC N domain protein [Trichinella spiralis]
          Length = 1819

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 156/363 (42%), Positives = 205/363 (56%), Gaps = 30/363 (8%)

Query: 23   GPGEGGKAYHLPE--AYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 80
            G GE G    LP   A +A  D      G +  TS+ IS  R I DLR  +CK   Y   
Sbjct: 1299 GVGEHGNPVELPSSVAEKAEFDRLYKANGYSGWTSDKISLYRAIKDLRHVDCKRKSYLRL 1358

Query: 81   LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR- 139
            LP  SVIL FHNE  S L+RTV++I+ RTP + L E+ILV+D S+K +L+  LE ++QR 
Sbjct: 1359 LPSTSVILPFHNEHLSVLLRTVYTIVYRTPPELLLEVILVNDASTKPELNDILERHVQRK 1418

Query: 140  FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 199
            F   V +IR    +G    R  GA ++ G+V++F+DAH EVG NWLPPLL PI    + +
Sbjct: 1419 FPNLVHVIR-AGSDG----RREGAAKASGQVLMFMDAHSEVGYNWLPPLLEPIKLHYRTV 1473

Query: 200  TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTH 259
            T P ID ID  T+ FR+    D   RG F+W   YK   L        K  +EP++SP  
Sbjct: 1474 TCPFIDVIDCDTFAFRA---QDEGARGSFDWKFHYKRLPLLN------KTGAEPFESPVM 1524

Query: 260  AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
            AGG FA+ + +F ELG YD  L++WG E +ELSFK+W C G +  +PCSRI H+YR    
Sbjct: 1525 AGGYFAISKRWFDELGRYDDQLMIWGAEQYELSFKLWQCHGRMIDIPCSRIAHIYRC--K 1582

Query: 320  YNFGKLADRVK----------GPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDI 369
            + F  L   V           G  +  NYKRV ETW DE +K Y Y R P    +D GD+
Sbjct: 1583 FGFAALFSTVHRYAPFEDPGIGNFLERNYKRVAETWMDE-YKEYLYLRMPRLRNVDPGDL 1641

Query: 370  SEQ 372
            ++Q
Sbjct: 1642 TKQ 1644


>gi|328783898|ref|XP_003250361.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Apis
           mellifera]
          Length = 603

 Score =  257 bits (656), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 136/326 (41%), Positives = 200/326 (61%), Gaps = 13/326 (3%)

Query: 51  NMETSNHISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 108
           N+  S+ I  +RT+PD+R + C  +Y +   +LPK S+I+VFHNE +S+L+RTV+S+I R
Sbjct: 122 NLMASDRIPLNRTLPDVRRKGCITRYMNLG-NLPKTSIIIVFHNEAWSTLLRTVYSVIDR 180

Query: 109 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 168
           +P Q LEEIILVDD S +  L   L+++I+      +++R+ +R GL+  R  GA +++G
Sbjct: 181 SPIQLLEEIILVDDNSDRDFLKDALDEHIKNLQVSTKVLRSKKRIGLVNARLLGANKAKG 240

Query: 169 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 228
           EV+ FLDAHCE  + WL PLL  +  +R  +  PVID I+  T+ +   +E   H+ G F
Sbjct: 241 EVLTFLDAHCECTVGWLEPLLEAVAKNRTRVVSPVIDIINDDTFSYTRSFE--LHW-GAF 297

Query: 229 EWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
            W + ++   L  R  K+R+ N  EP+++P  AGGLF+M+R +F ELG YD  + +WGGE
Sbjct: 298 NWDLHFRWLTLNGRLLKERRENIVEPFRTPAMAGGLFSMNRDYFFELGSYDNQMKIWGGE 357

Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNF-GKLADRVKGPLITYNYKRVIETWF 346
           N ELSF++W CGGSIE  PCS +GH++R   PY F G + + + G     N  RV   W 
Sbjct: 358 NLELSFRVWQCGGSIEIAPCSHVGHLFRKSSPYTFPGGVGEILYG-----NLARVALVWM 412

Query: 347 DEKHKAYFYTREPLAMFLDMGDISEQ 372
           DE  + YF      A   D   I  +
Sbjct: 413 DEWAEFYFKFNAEAARLRDKQTIRSR 438


>gi|327262637|ref|XP_003216130.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
           [Anolis carolinensis]
          Length = 500

 Score =  257 bits (656), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 144/327 (44%), Positives = 191/327 (58%), Gaps = 17/327 (5%)

Query: 48  YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 107
           Y  N   S  I  DR I D R   C    Y  DLP  S+I+ FHNE  S+L+RT+ S++ 
Sbjct: 53  YAFNQRESERIPSDRAIRDTRHHRCTTLHYRTDLPPTSIIITFHNEARSTLLRTIRSVLN 112

Query: 108 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 167
           RTP   + EIILVDDFS   D  + L         KV+ +RN  REGLIR+R RGA+ + 
Sbjct: 113 RTPVHLVHEIILVDDFSDDPDDCRLLIKL-----PKVKCLRNRRREGLIRSRIRGAEMAE 167

Query: 168 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 227
            EV+ FLD+HCEV  +WL PLL  I  D   +  PVID I+  T+ + +        RG 
Sbjct: 168 AEVLTFLDSHCEVNKDWLLPLLQRIKEDPSHVVSPVIDIINLDTFAYVAA---SSDLRGG 224

Query: 228 FEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGE 287
           F+W + +K  +L  ++  KR   +EP K+P  AGGLF +D+A+F  LG YD  + +WGGE
Sbjct: 225 FDWSLHFKWEQLSPKQKAKRTDPTEPIKTPIIAGGLFVIDKAWFNHLGKYDAAMDIWGGE 284

Query: 288 NFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITY--NYKRVIETW 345
           NFE+SF++WMCGGS+E +PCSR+GHV+R   PY F       +G   TY  N KR  E W
Sbjct: 285 NFEISFRVWMCGGSLEIIPCSRVGHVFRKKHPYVFP------EGNANTYIKNTKRTAEVW 338

Query: 346 FDEKHKAYFYTREPLAMFLDMGDISEQ 372
            DE +K Y+Y   P A     G+I E+
Sbjct: 339 MDE-YKQYYYAARPAAQGRPYGEIPEE 364


>gi|170591418|ref|XP_001900467.1| Polypeptide N-acetylgalactosaminyltransferase [Brugia malayi]
 gi|158592079|gb|EDP30681.1| Polypeptide N-acetylgalactosaminyltransferase, putative [Brugia
           malayi]
          Length = 575

 Score =  257 bits (656), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 136/340 (40%), Positives = 205/340 (60%), Gaps = 16/340 (4%)

Query: 23  GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLD 80
           G GE G+   L E      + +      ++  S+ I+ +R++PD+R  +C+   Y    +
Sbjct: 28  GAGEDGRPVRLSEEDERLSEDTFVINQFSLVVSDRIALNRSLPDIRKHQCRTKTYLPSSE 87

Query: 81  LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 140
           LP  SVI+V+HNE FS+LMRTV S+I+R+P + L+EIILVDDFS++  L  +LE ++ + 
Sbjct: 88  LPTTSVIIVYHNEAFSTLMRTVMSVIQRSPRENLKEIILVDDFSTRTFLKVELEKFVAQL 147

Query: 141 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 200
             ++++IR  ER GLIR R  GA E+ G+V+ FLD+HCE    W+ PLLA I  +RK + 
Sbjct: 148 GTRIKIIRANERVGLIRARLMGANEAEGDVLTFLDSHCECTKGWMEPLLARIKENRKAVV 207

Query: 201 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTH 259
            PVID I+ +T+ ++   E    +RG F W + ++   LP    K R  + ++P  SPT 
Sbjct: 208 CPVIDIINDRTFAYQKSIEL---FRGGFNWNLQFRWYALPSEMIKSRSDDPTKPIISPTM 264

Query: 260 AGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMP 319
           AGGLF++DR +F E+G YD  + +WGGEN E+S +++      E +PCS +GHV+R   P
Sbjct: 265 AGGLFSIDRKYFEEIGTYDHEMDIWGGENIEISLRVF------EILPCSHVGHVFRRTSP 318

Query: 320 YNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREP 359
           ++F     R  G ++  N  RV E W DE  K +FY   P
Sbjct: 319 HDF---PGRKSGTILNSNLLRVAEVWMDE-WKFHFYRTAP 354


>gi|170051778|ref|XP_001861920.1| polypeptide N-acetylgalactosaminyltransferase 12 [Culex
           quinquefasciatus]
 gi|167872876|gb|EDS36259.1| polypeptide N-acetylgalactosaminyltransferase 12 [Culex
           quinquefasciatus]
          Length = 601

 Score =  256 bits (655), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 143/364 (39%), Positives = 207/364 (56%), Gaps = 13/364 (3%)

Query: 17  LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 76
           L   + G G+ GK   L    R  G+  L    +N E S H+S++RT PD R   CK   
Sbjct: 81  LAKQERGLGDNGKGVELTGEAREIGEKQLATIALNEELSEHLSYNRTPPDERHPSCKRKS 140

Query: 77  YPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
           Y + +LP  SVI++F+NE +S L+RTVHS++     + L+EI+LVDD S+  +L  KL+ 
Sbjct: 141 YDIENLPSTSVIIIFYNEPYSVLVRTVHSVLNTADERLLKEIVLVDDGSTNEELKGKLDY 200

Query: 136 YIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 194
           Y++ R   KV+++R   R GLIR R  GA+ ++ +V+VFLDAHCE    WL PLL  I  
Sbjct: 201 YVRTRLPSKVKVLRQRNRVGLIRARLAGARFAKADVLVFLDAHCECMPQWLEPLLERIRE 260

Query: 195 DRKIMTVPVIDGID-----YQTWEFRSVYEPDHHYRGIFEW-GMLYKENELPEREAKKRK 248
            R  + VP+ID I+     Y T  F         + G F+W  +  +E E  +RE  ++ 
Sbjct: 261 SRTSVLVPIIDVIEAKNFFYSTNGFTDFQIGGFTWDGHFDWHDVTQREKERQKRECSEKD 320

Query: 249 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 308
               P  SPT AGGLFA+ R +F E+G YD  +  WGGEN E+SF++W CGG++E +PCS
Sbjct: 321 VAICPTYSPTMAGGLFAISRDYFWEIGSYDEQMDGWGGENLEMSFRVWQCGGTLETIPCS 380

Query: 309 RIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGD 368
           RIGH++R F PY+F    DR    +   N  R+   W D+     +  R  L    ++GD
Sbjct: 381 RIGHIFRDFHPYSFPN--DRDTHGI---NTVRMATVWMDDYIDLLYLNRPDLRDHPEVGD 435

Query: 369 ISEQ 372
           ++ +
Sbjct: 436 VTHR 439


>gi|380030377|ref|XP_003698825.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Apis florea]
          Length = 595

 Score =  256 bits (655), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 135/324 (41%), Positives = 195/324 (60%), Gaps = 9/324 (2%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRT 109
           N+  S+ I  +RT+PD+R + C      LD LPK S+I+VFHNE +S+L+RTV+S+I R+
Sbjct: 114 NLMASDRIPLNRTLPDVRRKGCISRYMNLDNLPKTSIIIVFHNEAWSTLLRTVYSVIDRS 173

Query: 110 PAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGE 169
           P Q LEEIILVDD S +  L   L+++++      +++R+ +R GL+  R  GA  ++GE
Sbjct: 174 PRQLLEEIILVDDNSDRDFLKDTLDEHVKNLQVSTKVLRSRKRIGLVNARLLGANNAKGE 233

Query: 170 VIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFE 229
           V+ FLDAHCE  + WL PLL  +  +R  +  PVID I+  T+ +   +E   H+ G F 
Sbjct: 234 VLTFLDAHCECTVGWLEPLLEAVAKNRTRVVSPVIDIINDDTFSYTRSFEL--HW-GAFN 290

Query: 230 WGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
           W + ++   L  R  K+R+ N  EP+++P  AGGLF+M+R +F ELG YD  + +WGGEN
Sbjct: 291 WDLHFRWLTLNGRLLKERRENIVEPFRTPAMAGGLFSMNRDYFFELGSYDNQMKIWGGEN 350

Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
            ELSF++W CGGSIE  PCS +GH++R   PY F        G ++  N  RV   W DE
Sbjct: 351 LELSFRVWQCGGSIEIAPCSHVGHLFRKSSPYTFPGGV----GEILYGNLARVALVWMDE 406

Query: 349 KHKAYFYTREPLAMFLDMGDISEQ 372
             + YF      A   D   I  +
Sbjct: 407 WAEFYFKFNAEAARLRDKQTIRSR 430


>gi|47085989|ref|NP_998361.1| polypeptide N-acetylgalactosaminyltransferase 6 [Danio rerio]
 gi|45501175|gb|AAH67340.1| Zgc:77836 [Danio rerio]
          Length = 619

 Score =  256 bits (655), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 145/370 (39%), Positives = 214/370 (57%), Gaps = 22/370 (5%)

Query: 15  PPLEPYKEGPGEGGKAYH---LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 70
           PP  P  + PG  G  +    + +         +  +  N   S+ IS  RT+  D R  
Sbjct: 101 PPENP--QAPGADGVPFQYDRMTKEEEKEKQEGMTRHCFNQFASDRISLHRTLGDDTRPP 158

Query: 71  EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 127
           EC   K+   P  LP  SVI+VFHNE +S+L+RTV+S++  +PA +L+EII+VDD S+  
Sbjct: 159 ECVDRKFRRCPA-LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAAFLKEIIMVDDASTAE 217

Query: 128 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 187
            L  KLE+Y++     V+++R  ER+GLI  R  GA ++ GE++ FLDAHCE    WL P
Sbjct: 218 HLHGKLEEYVKALK-IVKVVRQPERKGLITARLLGASKAEGEILTFLDAHCECFHGWLEP 276

Query: 188 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 246
           LLA I  +   +  P I  ID  T++F + V     H RG F+W + +    +P+ E  K
Sbjct: 277 LLARIVEEPTAVVSPEITTIDLNTFQFHKPVATARAHNRGNFDWSLTFGWEGIPDYENAK 336

Query: 247 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 306
           RK  + P K+PT AGGLF++ +A+F ++G YD  + +WGGEN E+SF++W CGG +E +P
Sbjct: 337 RKDETYPVKTPTFAGGLFSISKAYFEKIGTYDDKMEIWGGENVEMSFRVWQCGGQLEIIP 396

Query: 307 CSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL-- 364
           CS +GHV+R+  P+ F K  +     +IT N  R+ E W D+ +K  FY R   A  +  
Sbjct: 397 CSVVGHVFRTKSPHTFPKGTE-----VITRNQVRLAEVWMDD-YKLIFYRRSQSAAKMAK 450

Query: 365 --DMGDISEQ 372
               GDIS++
Sbjct: 451 EKGFGDISDR 460


>gi|13938114|gb|AAH07172.1| Galnt2 protein, partial [Mus musculus]
          Length = 526

 Score =  256 bits (655), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 131/323 (40%), Positives = 197/323 (60%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++KR+P
Sbjct: 60  NQVESDKLHMDRGIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 119

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 120 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 174

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE    WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 175 LTFLDSHCECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 231

Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 232 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 291

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 292 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 346

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 347 YKHFYYAAVPSARNVPYGNIQSR 369


>gi|1575723|gb|AAB09579.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase-T3 [Mus
           musculus]
          Length = 633

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 149/363 (41%), Positives = 211/363 (58%), Gaps = 22/363 (6%)

Query: 23  GPGEGGKAY---HLPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRMEEC---KYW 75
            PG  GK +   HL    +   +    ++  N   S+ IS  R + PD R  EC   K+ 
Sbjct: 120 APGASGKPFKITHLSPEEQKEKERGETKHCFNAFASDRISLHRDLGPDTRPPECIEQKFK 179

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
             P  LP  SVI+VFHNE +S+L+RTVHS++  +PA  L+EIILVDD S    L +KLE+
Sbjct: 180 RCP-PLPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDDYLHEKLEE 238

Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           YI++F+  V+++R  ER+GLI  R  GA  +  E + FLDAHCE    WL PLLA I  +
Sbjct: 239 YIKQFS-IVKIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLLARIAEN 297

Query: 196 RKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEP 253
              +  P I  ID  T+EF   S Y  +H+ RG F+W + +    LP+ E ++RK  + P
Sbjct: 298 YTAVVSPDIASIDLNTFEFNKPSPYGSNHN-RGNFDWSLSFGWESLPDHEKQRRKDETYP 356

Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
            K+PT AGGLF++ + +F  +G YD  + +WGGEN E+SF++W CGG +E +PCS +GHV
Sbjct: 357 IKTPTFAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHV 416

Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL----DMGDI 369
           +RS  P+ F K        +I  N  R+ E W DE +K  FY R   A  +      GD+
Sbjct: 417 FRSKSPHTFPKGTQ-----VIARNQVRLAEVWMDE-YKEIFYRRNTDAAKIVKQKSFGDL 470

Query: 370 SEQ 372
           S++
Sbjct: 471 SKR 473


>gi|74195843|dbj|BAE30483.1| unnamed protein product [Mus musculus]
          Length = 544

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 131/323 (40%), Positives = 197/323 (60%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++KR+P
Sbjct: 78  NQVESDKLHMDRGIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 137

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 138 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 192

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE    WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 193 LTFLDSHCECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASA---DLKGGFDW 249

Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 250 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 309

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 310 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 364

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 365 YKHFYYAAVPSARNVPYGNIQSR 387


>gi|417402857|gb|JAA48260.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
           rotundus]
          Length = 571

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++K++P
Sbjct: 105 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 164

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++  V
Sbjct: 165 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQARV 219

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE   +WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 220 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 276

Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+++F ELG YD  + VWGGEN 
Sbjct: 277 NLVFKWDYMTPEQRRARQGNPVAPIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENL 336

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 337 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 391

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 392 YKNFYYAAVPSARNVPYGNIQSR 414


>gi|224054950|ref|XP_002197786.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Taeniopygia guttata]
          Length = 631

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 152/372 (40%), Positives = 214/372 (57%), Gaps = 21/372 (5%)

Query: 13  LEPPLEPYKEGPGEGGKAY---HLPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLR 68
           L+ PL+     PG  GKA+   +L    +    A   ++  N   S+ IS  R + PD R
Sbjct: 109 LDRPLQD-PNAPGASGKAFKTINLNSEEQKEKQAGEEKHCFNAFASDRISLHRDLGPDTR 167

Query: 69  MEEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 125
             EC   K+   P  LP  S+I+VFHNE +S+L+RTVHS++  +PA  L+EIILVDD S 
Sbjct: 168 PPECIEQKFKRCP-PLPTTSIIIVFHNEAWSTLLRTVHSVMYTSPAILLKEIILVDDASV 226

Query: 126 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 185
              L  KL++Y+++F   V+++R  ER+GLI  R  GA  + GE + FLDAHCE    WL
Sbjct: 227 DEYLHDKLDEYVKQFQ-IVKVVRQKERKGLITARLLGASVATGETLTFLDAHCECFYGWL 285

Query: 186 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDH-HYRGIFEWGMLYKENELPEREA 244
            PLLA I  +   +  P I  ID  T+EF       H H RG F+W + +    LP+ E 
Sbjct: 286 EPLLARIAENPVAVVSPDIASIDLNTFEFSKPSPYGHSHNRGNFDWSLSFGWESLPKHEN 345

Query: 245 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 304
           K+RK  + P ++PT AGGLF++ + +F  +G YD  + +WGGEN E+SF++W CGG +E 
Sbjct: 346 KRRKDETYPIRTPTFAGGLFSISKDYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEI 405

Query: 305 VPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL 364
           +PCS +GHV+RS  P+ F K        +IT N  R+ E W DE +K  FY R   A  +
Sbjct: 406 MPCSVVGHVFRSKSPHTFPKGTQ-----VITRNQVRLAEVWMDE-YKEIFYRRNTEAAKI 459

Query: 365 ----DMGDISEQ 372
                 GDIS++
Sbjct: 460 VKQKTFGDISKR 471


>gi|162951828|ref|NP_056551.2| polypeptide N-acetylgalactosaminyltransferase 3 [Mus musculus]
 gi|341941092|sp|P70419.3|GALT3_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
           AltName: Full=Polypeptide GalNAc transferase 3;
           Short=GalNAc-T3; Short=pp-GaNTase 3; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 3;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 3
 gi|74183238|dbj|BAE22551.1| unnamed protein product [Mus musculus]
 gi|148695061|gb|EDL27008.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 [Mus musculus]
          Length = 633

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 149/363 (41%), Positives = 211/363 (58%), Gaps = 22/363 (6%)

Query: 23  GPGEGGKAY---HLPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRMEEC---KYW 75
            PG  GK +   HL    +   +    ++  N   S+ IS  R + PD R  EC   K+ 
Sbjct: 120 APGASGKPFKITHLSPEEQKEKERGETKHCFNAFASDRISLHRDLGPDTRPPECIEQKFK 179

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
             P  LP  SVI+VFHNE +S+L+RTVHS++  +PA  L+EIILVDD S    L +KLE+
Sbjct: 180 RCP-PLPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDDYLHEKLEE 238

Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           YI++F+  V+++R  ER+GLI  R  GA  +  E + FLDAHCE    WL PLLA I  +
Sbjct: 239 YIKQFS-IVKIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLLARIAEN 297

Query: 196 RKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEP 253
              +  P I  ID  T+EF   S Y  +H+ RG F+W + +    LP+ E ++RK  + P
Sbjct: 298 YTAVVSPDIASIDLNTFEFNKPSPYGSNHN-RGNFDWSLSFGWESLPDHEKQRRKDETYP 356

Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
            K+PT AGGLF++ + +F  +G YD  + +WGGEN E+SF++W CGG +E +PCS +GHV
Sbjct: 357 IKTPTFAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHV 416

Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL----DMGDI 369
           +RS  P+ F K        +I  N  R+ E W DE +K  FY R   A  +      GD+
Sbjct: 417 FRSKSPHTFPKGTQ-----VIARNQVRLAEVWMDE-YKEIFYRRNTDAAKIVKQKSFGDL 470

Query: 370 SEQ 372
           S++
Sbjct: 471 SKR 473


>gi|426220977|ref|XP_004004688.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Ovis
           aries]
          Length = 633

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 155/388 (39%), Positives = 222/388 (57%), Gaps = 25/388 (6%)

Query: 1   RPVFKADGKLGNLEPPLE-PYKE--GPGEGGKAY---HLPEAYRAAGDASLGEYGMNMET 54
           RP  +       L+P L+ P ++   PG  GKA+   +L    +   +    ++  N   
Sbjct: 95  RPCLQGYYTAAELKPVLDRPPQDSNAPGASGKAFKTTNLSAEEQKEKERGEAKHCFNAFA 154

Query: 55  SNHISFDRTI-PDLRMEEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           S+ IS  R + PD R  EC   K+   P  LP  SVI+VFHNE +S+L+RTVHS++  +P
Sbjct: 155 SDRISLHRDLGPDTRPPECIEQKFKRCP-PLPTTSVIIVFHNEAWSTLLRTVHSVLYSSP 213

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
           A  L+EIILVDD S    L  KLE+YI++F+  V+++R  ER+GLI  R  GA  +  E 
Sbjct: 214 AILLKEIILVDDASVDEYLHDKLEEYIKQFS-IVKIVRQKERKGLITARLLGATVATAET 272

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIF 228
           + FLDAHCE    WL PLLA I  +   +  P I  ID  T+EF   S Y  +H+ RG F
Sbjct: 273 LTFLDAHCECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHN-RGNF 331

Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
           +W + +    LP+ E ++RK  + P K+PT AGGLF++ + +F  +G YD  + +WGGEN
Sbjct: 332 DWSLSFGWETLPDHEKQRRKDETYPIKTPTFAGGLFSISKDYFEYIGTYDEEMEIWGGEN 391

Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
            E+SF++W CGG +E +PCS +GHV+RS  P+ F K        +I  N  R+ E W DE
Sbjct: 392 IEMSFRVWQCGGQLEIMPCSVVGHVFRSKSPHTFPKGTQ-----VIARNQVRLAEVWMDE 446

Query: 349 KHKAYFYTREPLAMFL----DMGDISEQ 372
            +K  FY R   A  +      GD+S++
Sbjct: 447 -YKEIFYRRNTDAAKIVKQKSFGDLSKR 473


>gi|149758073|ref|XP_001496259.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Equus
           caballus]
          Length = 539

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++K++P
Sbjct: 73  NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 132

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 133 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 187

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE   +WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 188 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 244

Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 245 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 304

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 305 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 359

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 360 YKNFYYAAVPSARNVPYGNIQSR 382


>gi|363736053|ref|XP_422169.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Gallus
           gallus]
          Length = 811

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 143/353 (40%), Positives = 211/353 (59%), Gaps = 10/353 (2%)

Query: 22  EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 81
           + PG+ G    +P+  +    +   E   N+  S+ I  DR I D R   C       DL
Sbjct: 308 QAPGQFGHPVAVPDDKQEEAKSRWKEGNFNVFLSDMIPVDRAIADTRPAGCLEQQVHNDL 367

Query: 82  PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 141
           P  ++I+ F +E +S+L+R+VHS++ R+P   L+E+ILVDDFS+K  L +KL+ Y+ +F 
Sbjct: 368 PTTTIIMCFVDEVWSTLLRSVHSVLSRSPPHLLQELILVDDFSTKDYLKEKLDAYMSQFP 427

Query: 142 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 201
            KV+++   ER GLIR R  GA+ +RG V+ FLD+H E  + WL PLL  +   R  +  
Sbjct: 428 -KVKVLHLRERHGLIRARLAGAQVARGTVLTFLDSHVECNVGWLEPLLERVRLRRARVAC 486

Query: 202 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHA 260
           PVI+ I  +   + +V   D+  RGIF W M +   ++P+   +K K   ++  + P  A
Sbjct: 487 PVIEVISDKDMSYMTV---DNFQRGIFTWPMNFGWKQIPQEVIEKNKLKETDIIRCPVMA 543

Query: 261 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPY 320
           GGLF++++ +F ELG YD GL VWGGEN ELSFK+WMCGG IE VPCSR+GH++R+  PY
Sbjct: 544 GGLFSIEKKYFFELGTYDSGLDVWGGENMELSFKVWMCGGEIEIVPCSRVGHIFRNDNPY 603

Query: 321 NFGKLADRVKGPLITYNYKRVIETWFDE-KHKAYFYTREPLAMFLDMGDISEQ 372
           +F K  DRV+   +  N  RV E W D+ K   Y +    L    ++GD+S+Q
Sbjct: 604 SFPK--DRVR--TVERNLARVAEVWLDDYKELFYGHAYHLLQRRAELGDLSQQ 652


>gi|148679819|gb|EDL11766.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 [Mus musculus]
          Length = 548

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 131/323 (40%), Positives = 197/323 (60%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++KR+P
Sbjct: 82  NQVESDKLHMDRGIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 141

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 142 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 196

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE    WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 197 LTFLDSHCECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 253

Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 254 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 313

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 314 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 368

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 369 YKHFYYAAVPSARNVPYGNIQSR 391


>gi|149043194|gb|EDL96726.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (predicted), isoform
           CRA_a [Rattus norvegicus]
          Length = 504

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 131/323 (40%), Positives = 197/323 (60%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR+IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++KR+P
Sbjct: 38  NQVESDKLRMDRSIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 97

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 98  PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 152

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE    WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 153 LTFLDSHCECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 209

Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 210 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 269

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 270 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 324

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
            K ++Y   P A  +  G+I  +
Sbjct: 325 FKHFYYAAVPSARNVPYGNIQSR 347


>gi|31418564|gb|AAH53063.1| Galnt2 protein [Mus musculus]
          Length = 536

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 131/323 (40%), Positives = 197/323 (60%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++KR+P
Sbjct: 70  NQVESDKLHMDRGIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 129

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 130 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 184

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE    WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 185 LTFLDSHCECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGASA---DLKGGFDW 241

Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 242 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 301

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 302 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 356

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 357 YKHFYYAAVPSARNVPYGNIQSR 379


>gi|326437922|gb|EGD83492.1| hypothetical protein PTSG_04099 [Salpingoeca sp. ATCC 50818]
          Length = 699

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 140/347 (40%), Positives = 200/347 (57%), Gaps = 25/347 (7%)

Query: 34  PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNE 93
           PE  R   + S+ +   N   S+ +S  R IPD R   C+  ++P DLP+A+VI+ F NE
Sbjct: 222 PEQVRKLEEESMKKNAFNEYRSSKLSLHRDIPDSRNPLCRQQEHPRDLPQATVIICFVNE 281

Query: 94  GFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTER 152
            +S+L+RTV S++ RTP   L+EI+LVDD S +  L  KLE  ++     KV+L+R+ +R
Sbjct: 282 AWSTLLRTVWSVLDRTPPHLLKEILLVDDASDQEHLLDKLEVEVRDNLPDKVKLVRSPKR 341

Query: 153 EGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW 212
            GLIR R  GA+ +  + +VFLD+HCE  L WL PLLA +  D+  +  P ID I  QT 
Sbjct: 342 LGLIRARVLGAEHATADYMVFLDSHCEANLGWLEPLLAWMAKDKTRVVCPTIDRISAQTM 401

Query: 213 EFRSVYEPDHHYRGIFEWGM-------LYKENELPEREAKKRKYNSEPYKSPTHAGGLFA 265
           ++          RG F W +       + +  E P          ++P KSPT AGGLF 
Sbjct: 402 DY---VGGGASSRGTFHWTLDFTWEYAVRQHGETP----------ADPIKSPTMAGGLFG 448

Query: 266 MDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKL 325
           ++R +F ELG YD G+  WGGEN E+SF+IW CGGS+  +PCSR+GH++R + PY    +
Sbjct: 449 INRDYFYELGTYDMGMDGWGGENLEMSFRIWQCGGSLHIIPCSRVGHIFRDWHPY---AI 505

Query: 326 ADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFLDMGDISEQ 372
            +         N  R+ E W DE +K  FY  +P A  +D GD+SE+
Sbjct: 506 PNSTVNETFLKNSIRLAEVWMDE-YKDIFYDIKPSARSVDFGDVSER 551


>gi|74203117|dbj|BAE26246.1| unnamed protein product [Mus musculus]
          Length = 618

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 131/323 (40%), Positives = 197/323 (60%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++KR+P
Sbjct: 107 NQVESDKLHMDRGIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 166

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 167 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 221

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE    WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 222 LTFLDSHCECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 278

Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 279 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 338

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 339 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 393

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 394 YKHFYYAAVPSARNVPYGNIQSR 416


>gi|221043222|dbj|BAH13288.1| unnamed protein product [Homo sapiens]
          Length = 533

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++K++P
Sbjct: 67  NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 126

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 127 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 181

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE   +WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 182 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 238

Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 239 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 298

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 299 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 353

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 354 YKNFYYAAVPSARNVPYGNIQSR 376


>gi|197246167|gb|AAI68926.1| Galnt2 protein [Rattus norvegicus]
          Length = 569

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 131/323 (40%), Positives = 197/323 (60%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR+IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++KR+P
Sbjct: 103 NQVESDKLRMDRSIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 162

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 163 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 217

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE    WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 218 LTFLDSHCECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 274

Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 275 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 334

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 335 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 389

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
            K ++Y   P A  +  G+I  +
Sbjct: 390 FKHFYYAAVPSARNVPYGNIQSR 412


>gi|27696612|gb|AAH43331.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 [Mus musculus]
          Length = 633

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 149/363 (41%), Positives = 211/363 (58%), Gaps = 22/363 (6%)

Query: 23  GPGEGGKAY---HLPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRMEEC---KYW 75
            PG  GK +   HL    +   +    ++  N   S+ IS  R + PD R  EC   K+ 
Sbjct: 120 APGASGKPFKITHLSPEEQKEKERGETKHCFNAFASDRISLHRDLGPDTRPPECIEQKFK 179

Query: 76  DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 135
             P  LP  SVI+VFHNE +S+L+RTVHS++  +PA  L+EIILVDD S    L +KLE+
Sbjct: 180 RCP-PLPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDDYLHEKLEE 238

Query: 136 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 195
           YI++F+  V+++R  ER+GLI  R  GA  +  E + FLDAHCE    WL PLLA I  +
Sbjct: 239 YIKQFS-IVKIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLLARIAEN 297

Query: 196 RKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEP 253
              +  P I  ID  T+EF   S Y  ++H RG F+W + +    LP+ E ++RK  + P
Sbjct: 298 YTAVVSPDIASIDLNTFEFNKPSPY-GNNHNRGNFDWSLSFGWESLPDHEKQRRKDETYP 356

Query: 254 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIGHV 313
            K+PT AGGLF++ + +F  +G YD  + +WGGEN E+SF++W CGG +E +PCS +GHV
Sbjct: 357 IKTPTFAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHV 416

Query: 314 YRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEKHKAYFYTREPLAMFL----DMGDI 369
           +RS  P+ F K        +I  N  R+ E W DE +K  FY R   A  +      GD+
Sbjct: 417 FRSKSPHTFPKGTQ-----VIARNQVRLAEVWMDE-YKEIFYRRNTDAAKIVKQKSFGDL 470

Query: 370 SEQ 372
           S++
Sbjct: 471 SKR 473


>gi|119590315|gb|EAW69909.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
           CRA_b [Homo sapiens]
 gi|119590316|gb|EAW69910.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
           CRA_b [Homo sapiens]
          Length = 533

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++K++P
Sbjct: 67  NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 126

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 127 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 181

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE   +WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 182 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 238

Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 239 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 298

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 299 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 353

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 354 YKNFYYAAVPSARNVPYGNIQSR 376


>gi|426334121|ref|XP_004028610.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Gorilla
           gorilla gorilla]
          Length = 533

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++K++P
Sbjct: 67  NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 126

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 127 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 181

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE   +WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 182 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 238

Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 239 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 298

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 299 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 353

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 354 YKNFYYAAVPSARNVPYGNIQSR 376


>gi|391346483|ref|XP_003747502.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like [Metaseiulus occidentalis]
          Length = 514

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 142/327 (43%), Positives = 191/327 (58%), Gaps = 11/327 (3%)

Query: 47  EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 106
           +   N   S+ IS +R++PD+R  EC+   Y   LP  S+I+ FHNE +S L+RTVHSI+
Sbjct: 37  QNAFNSYVSDLISVNRSLPDMRHIECRDQVYSSKLPSTSIIVCFHNEAWSVLIRTVHSIL 96

Query: 107 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 166
            R+PA  + +IILVDDFS    L   LE Y+  F  KVR++R  +REGLIR R  GA  S
Sbjct: 97  NRSPAHLIHDIILVDDFSDLQLLKDPLERYLSAFP-KVRIVRAEKREGLIRARLLGASHS 155

Query: 167 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 226
              V+ FLD+H E    WL PLL  I  +   +  PVID I   T E+ +    D +  G
Sbjct: 156 TAPVLTFLDSHVECTQGWLEPLLDRIAVNSTNVVSPVIDIIADDTLEYNAKESADVNVGG 215

Query: 227 IFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGG 286
            F+W + +  + +PER  K      +P ++PT AGGLF++DR FF  LG YDPG  +WGG
Sbjct: 216 -FDWSLQFSWHSIPERILKSGYKRWQPVETPTMAGGLFSIDRKFFERLGMYDPGFDIWGG 274

Query: 287 ENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWF 346
           EN ELSFK WMCGG +E +PCS +GH++R   PY +     R    ++  N  R+ + W 
Sbjct: 275 ENLELSFKTWMCGGRLEIIPCSHVGHIFRKRSPYKW-----RSGVNVLRRNSIRLAKVWM 329

Query: 347 DEKHKAYFYTREPLAMFL-DMGDISEQ 372
           DE    YF   E L   L D GDIS++
Sbjct: 330 DEYANYYF---ERLGNDLGDYGDISDR 353


>gi|332812183|ref|XP_001147638.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
           4 [Pan troglodytes]
          Length = 533

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++K++P
Sbjct: 67  NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 126

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 127 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 181

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE   +WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 182 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 238

Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 239 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 298

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 299 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 353

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 354 YKNFYYAAVPSARNVPYGNIQSR 376


>gi|46877109|ref|NP_644678.2| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Mus
           musculus]
 gi|51315867|sp|Q6PB93.1|GALT2_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
           AltName: Full=Polypeptide GalNAc transferase 2;
           Short=GalNAc-T2; Short=pp-GaNTase 2; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 2;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 2; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 2
           soluble form
 gi|37590571|gb|AAH59818.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 [Mus musculus]
          Length = 570

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 131/323 (40%), Positives = 197/323 (60%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++KR+P
Sbjct: 104 NQVESDKLHMDRGIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 163

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 164 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 218

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE    WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 219 LTFLDSHCECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 275

Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 276 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 335

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 336 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 390

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 391 YKHFYYAAVPSARNVPYGNIQSR 413


>gi|13650039|gb|AAK37548.1| polypeptide GalNAc transferase-T2 [Mus musculus]
          Length = 570

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 131/323 (40%), Positives = 197/323 (60%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++KR+P
Sbjct: 104 NQVESDKLHMDRGIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSP 163

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 164 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 218

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE    WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 219 LTFLDSHCECNERWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 275

Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 276 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENL 335

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 336 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 390

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 391 YKHFYYAAVPSARNVPYGNIQSR 413


>gi|88192992|pdb|2FFU|A Chain A, Crystal Structure Of Human Ppgalnact-2 Complexed With Udp
           And Ea2
 gi|88192994|pdb|2FFV|A Chain A, Human Ppgalnact-2 Complexed With Manganese And Udp
 gi|88192995|pdb|2FFV|B Chain B, Human Ppgalnact-2 Complexed With Manganese And Udp
          Length = 501

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++K++P
Sbjct: 35  NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 94

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 95  PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 149

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE   +WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 150 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 206

Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 207 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 266

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 267 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 321

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 322 YKNFYYAAVPSARNVPYGNIQSR 344


>gi|119590314|gb|EAW69908.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
           CRA_a [Homo sapiens]
          Length = 508

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++K++P
Sbjct: 67  NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 126

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 127 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 181

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE   +WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 182 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 238

Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 239 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 298

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 299 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 353

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 354 YKNFYYAAVPSARNVPYGNIQSR 376


>gi|380798879|gb|AFE71315.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor, partial
           [Macaca mulatta]
          Length = 554

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++K++P
Sbjct: 88  NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 147

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 148 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 202

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE   +WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 203 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 259

Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 260 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 319

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 320 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 374

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 375 YKNFYYAAVPSARNVPYGNIQSR 397


>gi|355559183|gb|EHH15963.1| hypothetical protein EGK_02147, partial [Macaca mulatta]
          Length = 530

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++K++P
Sbjct: 64  NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 123

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 124 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 178

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE   +WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 179 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 235

Query: 231 GMLYKENELPEREAKKRKYNS-EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 236 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 295

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 296 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 350

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 351 YKNFYYAAVPSARNVPYGNIQSR 373


>gi|402858708|ref|XP_003893834.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 2 [Papio anubis]
          Length = 571

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++K++P
Sbjct: 105 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 164

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 165 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 219

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE   +WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 220 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 276

Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 277 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 336

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 337 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 391

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 392 YKNFYYAAVPSARNVPYGNIQSR 414


>gi|390477336|ref|XP_003735278.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 2 [Callithrix jacchus]
          Length = 571

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 130/323 (40%), Positives = 198/323 (61%), Gaps = 14/323 (4%)

Query: 51  NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           N   S+ +  DR IPD R ++C+   + +DLP  SV++ FHNE  S+L+RTV S++K++P
Sbjct: 105 NQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSP 164

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
              ++EIILVDD+S+  + D  L   I+    KVR++RN  REGL+R+R RGA  ++ +V
Sbjct: 165 PHLIKEIILVDDYSNDPE-DGALLGKIE----KVRVLRNDRREGLMRSRVRGADAAQAKV 219

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 230
           + FLD+HCE   +WL PLL  +  DR  +  P+ID I+   +++          +G F+W
Sbjct: 220 LTFLDSHCECNEHWLEPLLERVAEDRTRVVSPIIDVINMDNFQYVGA---SADLKGGFDW 276

Query: 231 GMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 289
            +++K + +   + + R+ N   P K+P  AGGLF MD+ +F ELG YD  + VWGGEN 
Sbjct: 277 NLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENL 336

Query: 290 ELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDEK 349
           E+SF++W CGGS+E +PCSR+GHV+R   PY F   +    G +   N +R  E W DE 
Sbjct: 337 EISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS----GTVFARNTRRAAEVWMDE- 391

Query: 350 HKAYFYTREPLAMFLDMGDISEQ 372
           +K ++Y   P A  +  G+I  +
Sbjct: 392 YKNFYYAAVPSARNVPYGNIQSR 414


>gi|167536139|ref|XP_001749742.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771890|gb|EDQ85551.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1275

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 132/327 (40%), Positives = 188/327 (57%), Gaps = 10/327 (3%)

Query: 46  GEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSI 105
             +  N   S+ +S  R +PD R  +CK   YP DLP A+VI+ F NE +S+L RTV S+
Sbjct: 214 ARFAFNEYRSSQLSLHRDVPDARPMQCKDVAYPPDLPAATVIICFVNEAWSALFRTVWSV 273

Query: 106 IKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKE 165
           + RTP   L EIIL+DD S  + L Q LE+ +QR   KV+L+R+  R GLIR R  GAK 
Sbjct: 274 LDRTPENLLHEIILLDDASDASWLQQPLEEELQRLPAKVKLVRSPRRLGLIRARLLGAKH 333

Query: 166 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 225
           +  + ++FLD+HCE  + W+ PLLA +  D   +  PVID I+     +          R
Sbjct: 334 ATADYMIFLDSHCEANVGWIQPLLAWMAGDPSRVVTPVIDSINNNDMSYHGAGGAS--SR 391

Query: 226 GIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWG 285
           G F W + +     PE  A+     ++P KSPT AGGLF ++R +F ++G YD G+  WG
Sbjct: 392 GTFHWTLDFSWEANPEPVAQV----TDPVKSPTMAGGLFGINRQYFYDVGSYDQGMDGWG 447

Query: 286 GENFELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETW 345
           GEN E+SF++W CGGS+  +PCS +GH++R   PY    + +         N  R+ ETW
Sbjct: 448 GENLEMSFRVWQCGGSLHILPCSHVGHIFRDSHPYT---IPNSTINDTFLRNSIRLAETW 504

Query: 346 FDEKHKAYFYTREPLAMFLDMGDISEQ 372
            D+ +K  FY   P A  +D GD+ E+
Sbjct: 505 MDD-YKEIFYQIRPSARKVDHGDVGER 530


>gi|296204662|ref|XP_002749425.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Callithrix jacchus]
          Length = 633

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 153/388 (39%), Positives = 222/388 (57%), Gaps = 25/388 (6%)

Query: 1   RPVFKADGKLGNLEPPLE-PYKE--GPGEGGKAY---HLPEAYRAAGDASLGEYGMNMET 54
           RP  +       L+P L+ P ++   PG  GKA+   +L    +   +    ++  N   
Sbjct: 95  RPCLQGYYTAAELKPVLDRPPQDSNAPGASGKAFKTTNLSIEEQKEKERGEAKHCFNAFA 154

Query: 55  SNHISFDRTI-PDLRMEEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 110
           S+ +S  R + PD R  EC   K+   P  LP  SVI+VFHNE +S+L+RTVHS++  +P
Sbjct: 155 SDRVSLHRDLGPDTRPPECIEQKFKRCP-PLPTTSVIIVFHNEAWSTLLRTVHSVLYSSP 213

Query: 111 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 170
           A  L+EIILVDD S    L  KL++Y+++F+  V+++R  ER+GLI  R  GA  +  E 
Sbjct: 214 AVLLKEIILVDDASVDEYLHDKLDEYVKQFS-IVKIVRQRERKGLITARLLGASVATAET 272

Query: 171 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIF 228
           + FLDAHCE    WL PLLA I  +   +  P I  ID  T+EF   S Y   HH RG F
Sbjct: 273 LTFLDAHCECFYGWLEPLLARIAENYTAVVSPDIASIDMNTFEFNKPSPY-GSHHNRGNF 331

Query: 229 EWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGEN 288
           +W + +    LP+ E ++RK  + P K+PT AGGLF++ + +F  +G YD  + +WGGEN
Sbjct: 332 DWSLSFGWESLPDHEKQRRKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGEN 391

Query: 289 FELSFKIWMCGGSIEWVPCSRIGHVYRSFMPYNFGKLADRVKGPLITYNYKRVIETWFDE 348
            E+SF++W CGG +E +PCS +GHV+RS  P++F K        +I  N  R+ E W DE
Sbjct: 392 IEMSFRVWQCGGQLEIMPCSVVGHVFRSKSPHSFPKGTQ-----VIARNQVRLAEVWMDE 446

Query: 349 KHKAYFYTREPLAMFL----DMGDISEQ 372
            +K  FY R   A  +      GD+S++
Sbjct: 447 -YKEIFYRRNTDAAKIVKQKTFGDLSKR 473


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.141    0.443 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,661,254,994
Number of Sequences: 23463169
Number of extensions: 301466500
Number of successful extensions: 601944
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2183
Number of HSP's successfully gapped in prelim test: 2738
Number of HSP's that attempted gapping in prelim test: 593067
Number of HSP's gapped (non-prelim): 5446
length of query: 372
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 228
effective length of database: 8,980,499,031
effective search space: 2047553779068
effective search space used: 2047553779068
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)